Please use this identifier to cite or link to this item: https://doi.org/10.1109/TKDE.2003.1262193
DC Field: Value
dc.title: Evaluating keyword selection methods for WEBSOM text archives
dc.contributor.author: Azcarraga, A.P.
dc.contributor.author: Yap Jr., T.N.
dc.contributor.author: Tan, J.
dc.contributor.author: Chua, T.S.
dc.date.accessioned: 2013-07-04T07:33:46Z
dc.date.available: 2013-07-04T07:33:46Z
dc.date.issued: 2004
dc.identifier.citation: Azcarraga, A.P., Yap Jr., T.N., Tan, J., Chua, T.S. (2004). Evaluating keyword selection methods for WEBSOM text archives. IEEE Transactions on Knowledge and Data Engineering 16 (3): 380-383. ScholarBank@NUS Repository. https://doi.org/10.1109/TKDE.2003.1262193
dc.identifier.issn: 10414347
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/39092
dc.description.abstract: The WEBSOM methodology, proven effective for building very large text archives, includes a method that extracts labels for each document cluster assigned to nodes in the map. However, the WEBSOM method needs to retrieve all the words of all the documents associated with each node. Since maps may have more than 100,000 nodes and since the archive may contain up to seven million documents, the WEBSOM methodology needs a faster alternative method for keyword selection. Presented here is such an alternative method that is able to quickly deduce meaningful labels per node in the map. It does this just by analyzing the relative weight distribution of the SOM weight vectors and by taking advantage of some characteristics of the random projection method used in dimensionality reduction. The effectiveness of this technique is demonstrated on news document collections.
dc.description.uri: http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/TKDE.2003.1262193
dc.source: Scopus
dc.subject: Keyword extraction
dc.subject: Random projection
dc.subject: Text archives
dc.subject: WEBSOM
dc.type: Article
dc.contributor.department: COMPUTER SCIENCE
dc.description.doi: 10.1109/TKDE.2003.1262193
dc.description.sourcetitle: IEEE Transactions on Knowledge and Data Engineering
dc.description.volume: 16
dc.description.issue: 3
dc.description.page: 380-383
dc.description.coden: ITKEE
dc.identifier.isiut: NOT_IN_WOS
Appears in Collections: Staff Publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.