Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.infsof.2007.02.019
DC FieldValue
dc.titleEfficient mining of frequent XML query patterns with repeating-siblings
dc.contributor.authorYang, L.H.
dc.contributor.authorLee, M.L.
dc.contributor.authorHsu, W.
dc.contributor.authorHuang, D.
dc.contributor.authorWong, L.
dc.date.accessioned2013-07-04T07:39:34Z
dc.date.available2013-07-04T07:39:34Z
dc.date.issued2008
dc.identifier.citationYang, L.H., Lee, M.L., Hsu, W., Huang, D., Wong, L. (2008). Efficient mining of frequent XML query patterns with repeating-siblings. Information and Software Technology 50 (5) : 375-389. ScholarBank@NUS Repository. https://doi.org/10.1016/j.infsof.2007.02.019
dc.identifier.issn09505849
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/39346
dc.description.abstractA recent approach to improve the performance of XML query evaluation is to cache the query results of frequent query patterns. Unfortunately, discovering these frequent query patterns is an expensive operation. In this paper, we develop a two-pass mining algorithm 2PXMiner that guarantees the discovery of frequent query patterns by scanning the database at most twice. By exploiting a transaction summary data structure, and an enumeration tree, we are able to determine the upper bounds of the frequencies of the candidate patterns, and to quickly prune away the infrequent patterns. We also design an index to trace the repeating candidate subtrees generated by sibling repetition, thus avoiding redundant computations. Experiments results indicate that 2PXMiner is both efficient and scalable. © 2007 Elsevier B.V. All rights reserved.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.infsof.2007.02.019
dc.sourceScopus
dc.subjectFrequent pattern mining
dc.subjectStructured pattern
dc.subjectTree pattern mining
dc.subjectXML query pattern
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1016/j.infsof.2007.02.019
dc.description.sourcetitleInformation and Software Technology
dc.description.volume50
dc.description.issue5
dc.description.page375-389
dc.description.codenISOTE
dc.identifier.isiut000255153200002
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.