Please use this identifier to cite or link to this item: https://doi.org/10.1109/CIS.2013.93
Title: Inducing word senses for cross-lingual document clustering
Authors: Tang, G.
Xia, Y.
Cambria, E. 
Jin, P.
Keywords: Cross-lingual document clustering
Cross-lingual document representation
Word sense
Issue Date: 2013
Citation: Tang, G., Xia, Y., Cambria, E., Jin, P. (2013). Inducing word senses for cross-lingual document clustering. Proceedings - 9th International Conference on Computational Intelligence and Security, CIS 2013 : 409-414. ScholarBank@NUS Repository. https://doi.org/10.1109/CIS.2013.93
Abstract: Cross-lingual document clustering is the task of automatically organizing a large collection of cross-lingual documents into a few groups according to their content or topic. It is well known that language barrier and translation ambiguity are two challenging issues for cross-lingual document representation. To address such issues, we propose to represent cross-lingual documents through statistical word senses, which are learned from a parallel corpus by means of a novel cross-lingual word sense induction model. Furthermore, a sense clustering method is adopted to discover semantic relation of word senses, which are used to represent cross-lingual documents through a sense-based vector space model. Evaluation on a benchmarking dataset shows that the proposed model outperforms two state-of-the-art models in cross-lingual document clustering. © 2013 IEEE.
Source Title: Proceedings - 9th International Conference on Computational Intelligence and Security, CIS 2013
URI: http://scholarbank.nus.edu.sg/handle/10635/128923
ISBN: 9781479925483
DOI: 10.1109/CIS.2013.93
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.