Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/33266
Title: 现代汉语词义分类体系的建立和自动标注 = Building Word Sense Taxonomy and Automatic Annotation For Mandarin Chinese
Authors: 柏晓鹏
BAI XIAOPENG
Keywords: word sense taxonomy; corpus annotation; syntagmatic theory; word sense disambiguation; Chinese lexical semantics
Issue Date: 28-Sep-2011
Citation: 柏晓鹏, BAI XIAOPENG (2011-09-28). 现代汉语词义分类体系的建立和自动标注 = Building Word Sense Taxonomy and Automatic Annotation For Mandarin Chinese. ScholarBank@NUS Repository.
Abstract: In this dissertation, we created word sense taxonomy for Chinese noun, verb and adjective, in terms of natural language processing. Then conduct automatic word sense class annotation, generating Chinese word sense class corpus. We have operatable definition for each class in the taxonomy, and make it suitable for corpus annotation, which are distinguishing characteristics of our word sense taxonomy. We study the issue of word sense taxonomy in the frame of distributional theory, semantic selectional restrictions theory and syntagmatic theory. Each class in the word sense taxonomy is defined with three types of features: syntactic performance, semantic role (for noun)/ argument structure (for verb), and semantic selectional restrictions. The result shows that the description of word sense class definition makes the taxonomy operateble in the process of sense class annotation. The methodology applied for building the taxonomy is one of the contributions we made through this dissertation. Automatic classification experiments are performed for multi-class words. The result of the experiments is quite encouraging: 84.1% words get the precision of over 90%; 96% words get the precision of over 85%. The disambiguation results show that the word sense taxonomy has enough distinction among sense classes, and verify that the word sense taxonomy is applicable in automatic annotation.
URI: http://scholarbank.nus.edu.sg/handle/10635/33266
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Building Word Sense Taxonomy and Automatic Annotation for Mandarin Chinese.pdf4.4 MBAdobe PDF

OPEN

NoneView/Download
BaiXP.pdf4.4 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.