Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/99245
Title: Discovering typical structures of documents: A road map approach
Authors: Wang, Ke 
Liu, Huiqing
Issue Date: 1998
Citation: Wang, Ke,Liu, Huiqing (1998). Discovering typical structures of documents: A road map approach. SIGIR Forum (ACM Special Interest Group on Information Retrieval) : 146-154. ScholarBank@NUS Repository.
Abstract: The structure of a document refers to the role and hierarchy of subdocument references. Many online documents are similarly structured, though not identically structured. We study the problem of discovering `typical' structures of a collection of such documents, where the user specifies the minimum frequency of a typical structure. We will consider structural features of sub-document references such as labeling, nesting, ordering, cyclicity, and wild-card references, like those found on the Web and digital libraries. Typical structures can be used to serve the following purposes. (a) The `table-of-content' for gaining the general information of a source. (b) A road map for browsing and querying a source. (c) A basis for clustering documents. (d) Partial schemas for building structured layers to provide standard database access methods. (e) User/customer's interests and browsing patterns. We present a solution to the discovery problem.
Source Title: SIGIR Forum (ACM Special Interest Group on Information Retrieval)
URI: http://scholarbank.nus.edu.sg/handle/10635/99245
ISSN: 01635840
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.