Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/41314
Title: Web page categorization without the web page
Authors: Kan, M.-Y. 
Keywords: Abbreviation expansion
Text categorization
Uniform resource locator
Word segmentation
Issue Date: 2004
Source: Kan, M.-Y. (2004). Web page categorization without the web page. Thirteenth International World Wide Web Conference Proceedings, WWW2004: 994-995. ScholarBank@NUS Repository.
Abstract: Uniform resource locators (URLs), which mark the address of a resource on the World Wide Web, are often human-readable and can hint at the category of the resource. This paper explores the use of URLs for web page categorization via a two-phase pipeline of word segmentation/expansion and classification. We quantify its performance against document-based methods, which require the retrieval of the source document.
Source Title: Thirteenth International World Wide Web Conference Proceedings, WWW2004
URI: http://scholarbank.nus.edu.sg/handle/10635/41314
ISBN: 158113844X
Appears in Collections: Staff Publications
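
The abstract describes a two-phase pipeline in which URL text is first segmented into words and then passed to a classifier. The sketch below illustrates only the first phase and is not the paper's implementation. It assumes a toy vocabulary (`VOCAB`) and a fewest-pieces dictionary segmentation, both chosen here for illustration. The function names `split_url`, `segment`, and `url_features` are hypothetical. Abbreviation expansion and the classifier itself are omitted.

```python
import re
from urllib.parse import urlparse

# Hypothetical toy vocabulary (an assumption for this sketch);
# a real system would rely on a much larger word list.
VOCAB = {"computer", "science", "news", "sports", "home", "page", "research"}


def split_url(url):
    """Split a URL's host and path into lowercase alphabetic chunks."""
    parsed = urlparse(url)
    text = (parsed.netloc + parsed.path).lower()
    # Any run of non-letters (dots, slashes, digits, hyphens) is a delimiter.
    return [chunk for chunk in re.split(r"[^a-z]+", text) if chunk]


def segment(chunk, vocab=VOCAB):
    """Segment a chunk into vocabulary words using the fewest pieces.

    Returns [chunk] unchanged when no complete segmentation exists.
    """
    n = len(chunk)
    # best[i] holds the shortest word list covering chunk[:i], or None.
    best = [None] * (n + 1)
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            word = chunk[start:end]
            if best[start] is not None and word in vocab:
                candidate = best[start] + [word]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate
    return best[n] if best[n] is not None else [chunk]


def url_features(url):
    """Return the word tokens that a downstream classifier could consume."""
    tokens = []
    for chunk in split_url(url):
        tokens.extend(segment(chunk))
    return tokens
```

For a URL such as `http://www.example.edu/computerscience/research.html`, the fused chunk `computerscience` is split into `computer` and `science`, while chunks that cannot be fully segmented (such as `www`) pass through unchanged. The resulting tokens could serve as bag-of-words features for any standard text classifier.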