Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/129714
Title: A Case Study on Context-Centered Lexicon Construction
Authors: Guo, J.
Issue Date: 1996
Citation: Guo, J. (1996). A Case Study on Context-Centered Lexicon Construction. Communications of COLIPS 6 (2): 61-71. ScholarBank@NUS Repository.
Abstract: An approach is proposed for extracting Chinese words or phrases to allow construction of a lexical database for processing the language. The lack of explicit word boundaries has been an obstacle to reliable collection of unknown Chinese lexical items. A context-centered template pattern matching strategy involves left and right strings of predefined elements surrounding a token string of fixed or variable length. An algorithm was tried using a template where four-gram tokens between a fixed string and a comma were extracted; the 205 matches from a 4,000,000-character corpus are analyzed. The algorithm is judged productive, and application of syntactic constraints would make it even more effective.
Source Title: Communications of COLIPS
URI: http://scholarbank.nus.edu.sg/handle/10635/129714
ISSN: 02187019
Appears in Collections: Staff Publications
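The template described in the abstract (a fixed left string, a four-character token, then a comma) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function name `extract_candidates`, the left context "名叫", the full-width comma used as the right context, and the sample sentence are all hypothetical choices made for illustration, since the record does not state which fixed string or corpus format was used.

```python
import re
from collections import Counter


def extract_candidates(corpus, left_context, right_context=",", token_len=4):
    """Count fixed-length tokens found between a left and a right context.

    Hypothetical helper: the contexts and token length are parameters here
    because the record does not say which fixed string the paper used.
    """
    # Lookbehind and lookahead keep the contexts out of the captured token.
    # re.escape makes both contexts literal, so the lookbehind is fixed-width.
    pattern = re.compile(
        "(?<=" + re.escape(left_context) + ")"
        + "(.{%d})" % token_len
        + "(?=" + re.escape(right_context) + ")"
    )
    return Counter(m.group(1) for m in pattern.finditer(corpus))


# Toy input (invented for this sketch): two four-character names match,
# while the two-character name does not fit the four-gram template.
sample = "他名叫欧阳小明,又名叫司马大海,还名叫张三,完。"
candidates = extract_candidates(sample, left_context="名叫")
for token, count in candidates.items():
    print(token, count)
```

A sketch like this only collects candidate strings. As the abstract notes, further syntactic constraints would be needed to filter out matches that are not genuine lexical items.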
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.