Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/129714
Title: A Case Study on Context-Centered Lexicon Construction
Authors: Guo, J. 
Issue Date: 1996
Citation: Guo, J. (1996). A Case Study on Context-Centered Lexicon Construction. Communications of COLIPS 6 (2) : 61-71. ScholarBank@NUS Repository.
Abstract: An approach is proposed for extracting Chinese words or phrases to allow construction of a lexical database for processing the language. The lack of explicit word boundaries has been an obstacle to reliable collection of unknown Chinese lexical items. A context-centered template pattern matching strategy involves left & right strings of predefined elements surrounding a token string of fixed or variable length. An algorithm was tried using a template where four-gram tokens between a fixed string & a comma were extracted; the 205 matches from a 4,000,000-character corpus are analyzed. The algorithm is judged productive, & application of syntactic constraints would make it even more effective.
Source Title: Communications of COLIPS
URI: http://scholarbank.nus.edu.sg/handle/10635/129714
ISSN: 02187019
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.