Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/121108
Title: | ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT | Authors: | WANG AOBO | Keywords: | Chinese microtext, social media, word segmentation, normalization, named entity recognition, conditional random field (CRF) | Issue Date: | 8-Jan-2015 | Citation: | WANG AOBO (2015-01-08). ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT. ScholarBank@NUS Repository. | Abstract: | In this thesis, I tackle the problem of processing Chinese microtext, with the goal of building the natural language processing (NLP) tools for the microtext domain. I discover that informal words and named entities that are formed in a free-style manner are key reasons why microtext is diffi- cult to understand and process by conventional nature language processing tools. As such in this thesis, I study three key areas to address informality in processing Chinese microtext: 1. informal word recognition and word segmentation, 2. informal word normalization, and 3. named entity recognition. | URI: | http://scholarbank.nus.edu.sg/handle/10635/121108 |
Appears in Collections: | Ph.D Theses (Open) |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
Aobo_WANG_phd_thesis.pdf | 1.52 MB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.