Please use this identifier to cite or link to this item: https://doi.org/10.18653/v1/P18-1172
DC FieldValue
dc.titleBatch IS NOT Heavy: Learning Word Representations From All Samples
dc.contributor.authorXin Xin
dc.contributor.authorFajie Yuan
dc.contributor.authorXiangnan He
dc.contributor.authorJoemon M.Jose
dc.date.accessioned2020-04-28T02:06:57Z
dc.date.available2020-04-28T02:06:57Z
dc.date.issued2018-07-20
dc.identifier.citationXin Xin, Fajie Yuan, Xiangnan He, Joemon M.Jose (2018-07-20). Batch IS NOT Heavy: Learning Word Representations From All Samples. ACL 2018 : 1853-1862. ScholarBank@NUS Repository. https://doi.org/10.18653/v1/P18-1172
dc.identifier.isbn9781948087322
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/167277
dc.description.abstractStochastic Gradient Descent (SGD) with negative sampling is the most prevalent approach to learn word representations. However, it is known that sampling methods are biased especially when the sampling distribution deviates from the true data distribution. Besides, SGD suffers from dramatic fluctuation due to the one-sample learning scheme. In this work, we propose AllVec that uses batch gradient learning to generate word representations from all training samples. Remarkably, the time complexity of AllVec remains at the same level as SGD, being determined by the number of positive samples rather than all samples. We evaluate AllVec on several benchmark tasks. Experiments show that AllVec outperforms sampling-based SGD methods with comparable efficiency, especially for small training corpora. © 2018 Association for Computational Linguistics
dc.publisherAssociation for Computational Linguistics (ACL)
dc.typeConference Paper
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.description.doi10.18653/v1/P18-1172
dc.description.sourcetitleACL 2018
dc.description.page1853-1862
dc.grant.idR-252-300-002-490
dc.grant.fundingagencyInfocomm Media Development Authority
dc.grant.fundingagencyNational Research Foundation
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Batch IS NOT Heavy Learning Word Representations From All Samples.pdf365.38 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.