Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/40401
DC Field: Value
dc.title: Comparing keyword extraction techniques for WEBSOM text archives
dc.contributor.author: Azcarraga, A.P.
dc.contributor.author: Yap Jr., T.N.
dc.date.accessioned: 2013-07-04T08:03:28Z
dc.date.available: 2013-07-04T08:03:28Z
dc.date.issued: 2001
dc.identifier.citation: Azcarraga, A.P., Yap Jr., T.N. (2001). Comparing keyword extraction techniques for WEBSOM text archives. Proceedings of the International Conference on Tools with Artificial Intelligence: 187-194. ScholarBank@NUS Repository.
dc.identifier.issn: 10636730
dc.identifier.uri: http://scholarbank.nus.edu.sg/handle/10635/40401
dc.description.abstract: The WEBSOM methodology for building very large text archives has a very slow method for extracting meaningful unit labels. This is because the method computes for the relative frequencies of all the words of all the documents associated to each unit and then compares these to the relative frequencies of all the words of all the other units of the map. Since maps may have more than 100,000 units and the archive may contain up to 7 million documents, the existing WEBSOM method is not practical. A fast alternative method is based on the distribution of weights in the weight vectors of the trained map, plus a simple manipulation of the random projection matrix used for input data compression. Comparisons made using a WEBSOM archive of the Reuters text collection reveal that a high percentage of keywords extracted using this method match the keywords extracted for the same map units using the original WEBSOM method.
dc.source: Scopus
dc.type: Conference Paper
dc.contributor.department: COMPUTER SCIENCE
dc.description.sourcetitle: Proceedings of the International Conference on Tools with Artificial Intelligence
dc.description.page: 187-194
dc.description.coden: PCTIF
dc.identifier.isiut: NOT_IN_WOS
Appears in Collections: Staff Publications
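
The frequency comparison that the abstract attributes to the original WEBSOM labelling method can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy documents, the `top_k` and `eps` parameters, and the use of a frequency ratio as the comparison score are all assumptions made for illustration. The abstract states only that per-unit relative frequencies are compared with those of all other units. The faster method based on weight vectors and the random projection matrix is not sketched, because the record does not give enough detail.

```python
from collections import Counter


def relative_frequencies(docs):
    """Relative frequency of each word over a list of tokenized documents."""
    counts = Counter(word for doc in docs for word in doc)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def label_units(unit_docs, top_k=3, eps=1e-9):
    """Pick label words per map unit (hypothetical helper).

    unit_docs maps a unit id to the tokenized documents assigned to it.
    The score (an assumption) is the word's relative frequency inside the
    unit divided by its relative frequency in all other units.
    """
    labels = {}
    for unit, docs in unit_docs.items():
        inside = relative_frequencies(docs)
        # Pooling the documents of every other unit is repeated for each
        # unit, which is why this approach scales poorly to large maps.
        others = [d for u, ds in unit_docs.items() if u != unit for d in ds]
        outside = relative_frequencies(others)
        scores = {w: f / (outside.get(w, 0.0) + eps) for w, f in inside.items()}
        # Highest score first; ties are broken alphabetically.
        labels[unit] = sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
    return labels


# Toy data, invented for this sketch.
unit_docs = {
    0: [["oil", "price", "opec"], ["oil", "barrel", "price"]],
    1: [["bank", "rate", "price"], ["bank", "loan", "rate"]],
}
labels = label_units(unit_docs, top_k=2)
print(labels)
```

In this toy example the word "price" occurs in both units, so it scores low and is not chosen as a label for either one, while words that occur in only one unit rank highest.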

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.