Batch IS NOT Heavy: Learning Word Representations From All Samples

Please use this identifier to cite or link to this item: https://doi.org/10.18653/v1/P18-1172

DC Field	Value
dc.title	Batch IS NOT Heavy: Learning Word Representations From All Samples
dc.contributor.author	Xin Xin
dc.contributor.author	Fajie Yuan
dc.contributor.author	Xiangnan He
dc.contributor.author	Joemon M.Jose
dc.date.accessioned	2020-04-28T02:06:57Z
dc.date.available	2020-04-28T02:06:57Z
dc.date.issued	2018-07-20
dc.identifier.citation	Xin Xin, Fajie Yuan, Xiangnan He, Joemon M.Jose (2018-07-20). Batch IS NOT Heavy: Learning Word Representations From All Samples. ACL 2018 : 1853-1862. ScholarBank@NUS Repository. https://doi.org/10.18653/v1/P18-1172
dc.identifier.isbn	9781948087322
dc.identifier.uri	https://scholarbank.nus.edu.sg/handle/10635/167277
dc.description.abstract	Stochastic Gradient Descent (SGD) with negative sampling is the most prevalent approach to learn word representations. However, it is known that sampling methods are biased especially when the sampling distribution deviates from the true data distribution. Besides, SGD suffers from dramatic fluctuation due to the one-sample learning scheme. In this work, we propose AllVec that uses batch gradient learning to generate word representations from all training samples. Remarkably, the time complexity of AllVec remains at the same level as SGD, being determined by the number of positive samples rather than all samples. We evaluate AllVec on several benchmark tasks. Experiments show that AllVec outperforms sampling-based SGD methods with comparable efficiency, especially for small training corpora. © 2018 Association for Computational Linguistics
dc.publisher	Association for Computational Linguistics (ACL)
dc.type	Conference Paper
dc.contributor.department	DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi	10.18653/v1/P18-1172
dc.description.sourcetitle	ACL 2018
dc.description.page	1853-1862
dc.grant.id	R-252-300-002-490
dc.grant.fundingagency	Infocomm Media Development Authority
dc.grant.fundingagency	National Research Foundation
Appears in Collections:	Staff Publications Elements

Show simple item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
Batch IS NOT Heavy Learning Word Representations From All Samples.pdf		365.38 kB	Adobe PDF	OPEN	None	View/Download

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM