Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/40561
Title: XClust: Clustering XML schemas for effective integration
Authors: Lee, M.L. 
Yang, L.H. 
Hsu, W. 
Yang, X.
Keywords: Clustering
Data integration
Schema matching
XML schema
Issue Date: 2002
Citation: Lee, M.L.,Yang, L.H.,Hsu, W.,Yang, X. (2002). XClust: Clustering XML schemas for effective integration. International Conference on Information and Knowledge Management, Proceedings : 292-299. ScholarBank@NUS Repository.
Abstract: It is increasingly important to develop scalable integration techniques for the growing number of XML data sources. A practical starting point for the integration of large numbers of Document Type Definitions (DTDs) of XML sources would be to first find clusters of DTDs that are similar in structure and semantics. Reconciling similar DTDs within such a cluster will be an easier task than reconciling DTDs that are different in structure and semantics as the latter would involve more restructuring. We introduce XClust, a novel integration strategy that involves the clustering of DTDs. A matching algorithm based on the semantics, immediate descendents and leaf-context similarity of DTD elements is developed. Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach.
Source Title: International Conference on Information and Knowledge Management, Proceedings
URI: http://scholarbank.nus.edu.sg/handle/10635/40561
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.