Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/16330
Title: | 基于词典的语料库词义标注 = Dictionary Based Corpus Sense Tagging | Authors: | 肖航 XIAO HANG |
Keywords: | sense tagging, word sense disambiguation, sense distinctions, sense tagged corpus, sense distribution, disambiguation cues | Issue Date: | 13-Aug-2009 | Citation: | 肖航, XIAO HANG (2009-08-13). 基于词典的语料库词义标注 = Dictionary Based Corpus Sense Tagging. ScholarBank@NUS Repository. | Abstract: | This study is aiming to build a word sense tagged Chinese corpus. The thesis firstly probes into the effects of different linguistic disambiguation cues extracted from dictionary to practice word sense tagging. The study investigates how to use Pinyin to distinguish the homographs; how to employ part-of-speech to discriminate the ambiguities, and further examines the effect and limitation of using collocations as disambiguation cue. The study also investigates how to use word sense frequency distribution to improve baseline of automatic sense tagging. The thesis secondly analyzes the difficulties summed up in manual and automatic sense tagging. On the basis of the practice of sense tagged corpus construction, the study supports that a high precision word sense tagging procedure is realizable by using hybrid linguistic disambiguation cues; however, from the analysis of difficult parts in sense tagging, more studies in dictionary sense distinctions should be done to improve tagging consistency. | URI: | http://scholarbank.nus.edu.sg/handle/10635/16330 |
Appears in Collections: | Master's Theses (Open) |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
XIAOHANG_MA_ChineseStudies_DICTIONARY BASED CORPUS SENSE TAGGING_2009.pdf | 2.79 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.