Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/40561
DC FieldValue
dc.titleXClust: Clustering XML schemas for effective integration
dc.contributor.authorLee, M.L.
dc.contributor.authorYang, L.H.
dc.contributor.authorHsu, W.
dc.contributor.authorYang, X.
dc.date.accessioned2013-07-04T08:07:10Z
dc.date.available2013-07-04T08:07:10Z
dc.date.issued2002
dc.identifier.citationLee, M.L.,Yang, L.H.,Hsu, W.,Yang, X. (2002). XClust: Clustering XML schemas for effective integration. International Conference on Information and Knowledge Management, Proceedings : 292-299. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/40561
dc.description.abstractIt is increasingly important to develop scalable integration techniques for the growing number of XML data sources. A practical starting point for the integration of large numbers of Document Type Definitions (DTDs) of XML sources would be to first find clusters of DTDs that are similar in structure and semantics. Reconciling similar DTDs within such a cluster will be an easier task than reconciling DTDs that are different in structure and semantics as the latter would involve more restructuring. We introduce XClust, a novel integration strategy that involves the clustering of DTDs. A matching algorithm based on the semantics, immediate descendents and leaf-context similarity of DTD elements is developed. Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach.
dc.sourceScopus
dc.subjectClustering
dc.subjectData integration
dc.subjectSchema matching
dc.subjectXML schema
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.sourcetitleInternational Conference on Information and Knowledge Management, Proceedings
dc.description.page292-299
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.