Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/41314
Title: Web page categorization without the web page
Authors: Kan, M.-Y. 
Keywords: Abbreviation expansion; Text categorization; Uniform resource locator; Word segmentation
Issue Date: 2004
Citation: Kan, M.-Y. (2004). Web page categorization without the web page. Thirteenth International World Wide Web Conference Proceedings, WWW2004: 994-995. ScholarBank@NUS Repository.
Abstract: Uniform resource locators (URLs), which mark the address of a resource on the World Wide Web, are often human-readable and can hint at the category of the resource. This paper explores the use of URLs for web page categorization via a two-phase pipeline of word segmentation/expansion and classification. We quantify its performance against document-based methods, which require the retrieval of the source document.
Source Title: Thirteenth International World Wide Web Conference Proceedings, WWW2004
URI: http://scholarbank.nus.edu.sg/handle/10635/41314
ISBN: 158113844X
Appears in Collections: Staff Publications
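
Illustrative sketch (not part of the repository record): the abstract describes a two-phase pipeline of word segmentation/expansion followed by classification. The Python below is a minimal sketch of that idea under stated assumptions, not the paper's implementation. The toy LEXICON and EXPANSIONS tables, the example.edu-style URLs, and all function names are hypothetical, and the unigram naive Bayes scorer is a stand-in chosen for brevity, since this record does not say which classifier the paper uses.

```python
import math
import re
from collections import Counter, defaultdict
from urllib.parse import urlparse

# Hypothetical toy resources; a real system would need far larger ones.
LEXICON = {"web", "page", "home", "news", "course", "faculty",
           "student", "research", "cs"}
EXPANSIONS = {"cs": "computer science"}  # abbreviation -> expansion


def baseline_tokens(url):
    """Split the host and path of a URL on any non-letter character."""
    parsed = urlparse(url)
    raw = (parsed.netloc + parsed.path).lower()
    return [t for t in re.split(r"[^a-z]+", raw) if t]


def segment(token, lexicon=LEXICON):
    """Phase 1a: split a token into the fewest lexicon words that cover it.

    Returns the token unchanged when no full cover exists.
    """
    n = len(token)
    best = [None] * (n + 1)  # best[i] = fewest-word split of token[:i]
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            piece = token[start:end]
            if best[start] is not None and piece in lexicon:
                cand = best[start] + [piece]
                if best[end] is None or len(cand) < len(best[end]):
                    best[end] = cand
    return best[n] if best[n] is not None else [token]


def url_features(url):
    """Phase 1: segment each token, then expand known abbreviations."""
    feats = []
    for tok in baseline_tokens(url):
        for piece in segment(tok):
            feats.extend(EXPANSIONS.get(piece, piece).split())
    return feats


def train(examples):
    """Phase 2 (stand-in): count URL features per category label."""
    counts = defaultdict(Counter)
    for url, label in examples:
        counts[label].update(url_features(url))
    return counts


def classify(url, counts):
    """Pick the label with the highest add-one-smoothed log-likelihood."""
    vocab = {w for c in counts.values() for w in c}
    feats = url_features(url)

    def score(label):
        total = sum(counts[label].values())
        return sum(math.log((counts[label][w] + 1) / (total + len(vocab)))
                   for w in feats)

    return max(counts, key=score)


# Usage with made-up training URLs and labels.
model = train([
    ("http://www.example.edu/cs/coursepage.html", "course"),
    ("http://www.example.edu/facultyhome/", "faculty"),
])
print(url_features("http://www.cs.example.edu/studentresearch/homepage.html"))
print(classify("http://other.example.org/coursenews.html", model))
```

Note that the classifier sees only the URL string; no page is fetched, which is the contrast with document-based methods that the abstract draws.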

