Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/16330
Title: 基于词典的语料库词义标注 = Dictionary Based Corpus Sense Tagging
Authors: 肖航
XIAO HANG
Keywords: sense tagging, word sense disambiguation, sense distinctions, sense tagged corpus, sense distribution, disambiguation cues
Issue Date: 13-Aug-2009
Citation: 肖航, XIAO HANG (2009-08-13). 基于词典的语料库词义标注 = Dictionary Based Corpus Sense Tagging. ScholarBank@NUS Repository.
Abstract: This study aims to build a word-sense-tagged Chinese corpus. The thesis first examines how different linguistic disambiguation cues extracted from a dictionary affect word sense tagging in practice. It investigates how to use Pinyin to distinguish homographs and how to use part of speech to resolve ambiguities, and it examines the effectiveness and limitations of collocations as a disambiguation cue. It also investigates how to use word sense frequency distributions to improve the baseline of automatic sense tagging. Second, the thesis analyzes the difficulties encountered in manual and automatic sense tagging. Based on the experience of constructing the sense-tagged corpus, the study argues that a high-precision word sense tagging procedure is achievable using hybrid linguistic disambiguation cues; however, the analysis of difficult cases in sense tagging shows that further study of dictionary sense distinctions is needed to improve tagging consistency.
URI: http://scholarbank.nus.edu.sg/handle/10635/16330
Appears in Collections: Master's Theses (Open)

Files in This Item:
File: XIAOHANG_MA_ChineseStudies_DICTIONARY BASED CORPUS SENSE TAGGING_2009.pdf
Size: 2.79 MB
Format: Adobe PDF
Access Settings: OPEN
Version: None

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.