Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/40668
DC FieldValue
dc.titleEfficient processing of XML twig patterns with parent child edges: A look-ahead approach
dc.contributor.authorLu, J.
dc.contributor.authorChen, T.
dc.contributor.authorLing, T.W.
dc.date.accessioned2013-07-04T08:09:37Z
dc.date.available2013-07-04T08:09:37Z
dc.date.issued2004
dc.identifier.citationLu, J., Chen, T., Ling, T.W. (2004). Efficient processing of XML twig patterns with parent child edges: A look-ahead approach. International Conference on Information and Knowledge Management, Proceedings : 533-542. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/40668
dc.description.abstractWith the growing importance of semi-structure data in information exchange, much research has been done to provide an effective mechanism to match a twig query in an XML database. A number of algorithms have been proposed recently to process a twig query holistically. Those algorithms are quite efficient for quires with only ancestor-descendant edges. But for queries with mixed ancestor-descendant and parent-child edges, the previous approaches still may produce large intermediate results, even when the input and output size are more manageable. To overcome this limitation, in this paper, we propose a novel holistic twig join algorithm, namely TwigStackList. Our main technique is to look-ahead read some elements in input data steams and cache limited number of them to lists in the main memory. The number of elements in any list is bounded by the length of the longest path in the XML document. We show that TwigStackList is I/O optimal for queries with only ancestor-descendant relationships below branching nodes. Further, even when queries contain parent-child relationship below branching nodes, the set of intermediate results in TwigStackList is guaranteed to be a subset of that in previous algorithms. We complement our experimental results on a range of real and synthetic data to show the significant superiority of TwigStackList over previous algorithms for queries with parent-child relationships. Copyright 2004 ACM.
dc.sourceScopus
dc.subjectHolistic twig pattern matching
dc.subjectXML
dc.typeConference Paper
dc.contributor.departmentCOMPUTATIONAL SCIENCE
dc.description.sourcetitleInternational Conference on Information and Knowledge Management, Proceedings
dc.description.page533-542
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.