Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/161064
Title: Discovering frequent weak XML patterns
Authors: ZHOU FENG
Keywords: Data Mining, Frequent Pattern Mining, XML Mining, Tree Mining, Unordered, Embedded
Issue Date: 5-Dec-2004
Citation: ZHOU FENG (2004-12-05). Discovering frequent weak XML patterns. ScholarBank@NUS Repository.
Abstract: 

PROBLEMS TO DISCOVER FREQUENT PATTERNS FROM A LARGE XML REPOSITORY HAVE ATTRACTED INCREASING RESEARCH ATTENTIONS IN RECENT YEARS. SUCH PROBLEMS BELONG TO THE CATEGORY OF UNORDERED AND INDUCED SUBTREE MINING PROBLEM FOR WHICH NO EFFICIENT AND COMPLETE ALGORITHM HAS BEEN PROPOSED SO FAR. IN THIS THESIS, WE INTRODUCE THE NOTION OF MAPPING EQUIVALENT CLASS FOR REPRESENTING TREES THAT HAVE THE SAME NODE LABEL SET. BASED ON THESE MAPPING EQUIVALENT CLASSES A NOVEL DIVIDE-AND-CONQUER ALGORITHM, WTIMINER, IS PROPOSED FOR DISCOVERY OF FREQUENT UNORDERED AND EMBEDDED SUBTREES WITHOUT LOSS. THE ALGORITHM SUCCESSFULLY DOWNGRADES THE COMPLEXITY OF PATTERN MATCHING AND COUNTING PROBLEM THAT A REGULAR TREE MINING ALGORITHM FACES. EXPERIMENTAL RESULTS DEMONSTRATE THE HIGH EFFICIENCY AND SCALABILITY OF WTIMINER IN TERMS OF BOTH TIME AND SPACE.

URI: https://scholarbank.nus.edu.sg/handle/10635/161064
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
DiscoveringFrequentWeakXMLPatterns.pdf1.08 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.