Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/33266
Title: | 现代汉语词义分类体系的建立和自动标注 = Building Word Sense Taxonomy and Automatic Annotation For Mandarin Chinese | Authors: | 柏晓鹏 BAI XIAOPENG |
Keywords: | word sense taxonomy; corpus annotation; syntagmatic theory; word sense disambiguation; Chinese lexical semantics | Issue Date: | 28-Sep-2011 | Citation: | 柏晓鹏, BAI XIAOPENG (2011-09-28). 现代汉语词义分类体系的建立和自动标注 = Building Word Sense Taxonomy and Automatic Annotation For Mandarin Chinese. ScholarBank@NUS Repository. | Abstract: | In this dissertation, we created word sense taxonomy for Chinese noun, verb and adjective, in terms of natural language processing. Then conduct automatic word sense class annotation, generating Chinese word sense class corpus. We have operatable definition for each class in the taxonomy, and make it suitable for corpus annotation, which are distinguishing characteristics of our word sense taxonomy. We study the issue of word sense taxonomy in the frame of distributional theory, semantic selectional restrictions theory and syntagmatic theory. Each class in the word sense taxonomy is defined with three types of features: syntactic performance, semantic role (for noun)/ argument structure (for verb), and semantic selectional restrictions. The result shows that the description of word sense class definition makes the taxonomy operateble in the process of sense class annotation. The methodology applied for building the taxonomy is one of the contributions we made through this dissertation. Automatic classification experiments are performed for multi-class words. The result of the experiments is quite encouraging: 84.1% words get the precision of over 90%; 96% words get the precision of over 85%. The disambiguation results show that the word sense taxonomy has enough distinction among sense classes, and verify that the word sense taxonomy is applicable in automatic annotation. | URI: | http://scholarbank.nus.edu.sg/handle/10635/33266 |
Appears in Collections: | Ph.D Theses (Open) |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
Building Word Sense Taxonomy and Automatic Annotation for Mandarin Chinese.pdf | 4.4 MB | Adobe PDF | OPEN | None | View/Download | |
BaiXP.pdf | 4.4 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.