Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.websem.2004.10.001
DC FieldValue
dc.titleLearning to integrate web taxonomies
dc.contributor.authorZhang, D.
dc.contributor.authorLee, W.S.
dc.date.accessioned2013-07-23T09:31:08Z
dc.date.available2013-07-23T09:31:08Z
dc.date.issued2004
dc.identifier.citationZhang, D.,Lee, W.S. (2004). Learning to integrate web taxonomies. Web Semantics 2 (2) : 131-151. ScholarBank@NUS Repository. <a href="https://doi.org/10.1016/j.websem.2004.10.001" target="_blank">https://doi.org/10.1016/j.websem.2004.10.001</a>
dc.identifier.issn15708268
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/43329
dc.description.abstractWe investigate machine learning methods for automatically integrating objects from different taxonomies into a master taxonomy. This problem is not only currently pervasive on the Web, but is also important to the emerging Semantic Web. A straightforward approach to automating this process would be to build classifiers through machine learning and then use these classifiers to classify objects from the source taxonomies into categories of the master taxonomy. However, conventional machine learning algorithms totally ignore the availability of the source taxonomies. In fact, source and master taxonomies often have common categories under different names or other more complex semantic overlaps. We introduce two techniques that exploit the semantic overlap between the source and master taxonomies to build better classifiers for the master taxonomy. The first technique, Cluster Shrinkage, biases the learning algorithm against splitting source categories by making objects in the same category appear more similar to each other. The second technique, Co-Bootstrapping, tries to facilitate the exploitation of inter-taxonomy relationships by providing category indicator functions as additional features for the objects. Our experiments with real-world Web data show that these proposed add-on techniques can enhance various machine learning algorithms to achieve substantial improvements in performance for taxonomy integration. © 2004 Elsevier B.V. All rights reserved.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.websem.2004.10.001
dc.sourceScopus
dc.subjectClassification
dc.subjectMachine learning
dc.subjectOntology mapping
dc.subjectSemantic Web
dc.subjectTaxonomy integration
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.departmentSINGAPORE-MIT ALLIANCE
dc.description.doi10.1016/j.websem.2004.10.001
dc.description.sourcetitleWeb Semantics
dc.description.volume2
dc.description.issue2
dc.description.page131-151
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.