Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/43201
DC FieldValue
dc.titleVERT: A semantic approach for content search and content extraction in XML query processing
dc.contributor.authorWu, H.
dc.contributor.authorLing, T.W.
dc.contributor.authorChen, B.
dc.date.accessioned2013-07-23T09:27:42Z
dc.date.available2013-07-23T09:27:42Z
dc.date.issued2007
dc.identifier.citationWu, H.,Ling, T.W.,Chen, B. (2007). VERT: A semantic approach for content search and content extraction in XML query processing. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 4801 LNCS : 534-549. ScholarBank@NUS Repository.
dc.identifier.isbn9783540755623
dc.identifier.issn03029743
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/43201
dc.description.abstractProcessing a twig pattern query in XML document includes structural search and content search. Most existing algorithms only focus on structural search. They treat content nodes the same as element nodes during query processing with structural joins. Due to the high variety of contents, to mix content search and structural search suffers from management problem of contents and low performance. Another disadvantage is to find the actual values asked by a query, they have to rely on the original document. In this paper, we propose a novel algorithm Value Extraction with Relational Table (VERT) to overcome these limitations. The main technique of VERT is introducing relational tables to store document contents instead of treating them as nodes and labeling them. Tables in our algorithm are created based on semantic information of documents. As more semantics is captured, we can further optimize tables and queries to significantly enhance efficiency. Last, we show by experiments that besides solving different content problems, VERT also has superiority in performance of twig pattern query processing compared with existing algorithms. © Springer-Verlag Berlin Heidelberg 2007.
dc.sourceScopus
dc.typeConference Paper
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.departmentELECTRICAL & COMPUTER ENGINEERING
dc.description.sourcetitleLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.description.volume4801 LNCS
dc.description.page534-549
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.