Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/121108
Title: ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT
Authors: WANG AOBO
Keywords: Chinese microtext, social media, word segmentation, normalization, named entity recognition, conditional random field (CRF)
Issue Date: 8-Jan-2015
Citation: WANG AOBO (2015-01-08). ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT. ScholarBank@NUS Repository.
Abstract: In this thesis, I tackle the problem of processing Chinese microtext, with the goal of building the natural language processing (NLP) tools for the microtext domain. I discover that informal words and named entities that are formed in a free-style manner are key reasons why microtext is diffi- cult to understand and process by conventional nature language processing tools. As such in this thesis, I study three key areas to address informality in processing Chinese microtext: 1. informal word recognition and word segmentation, 2. informal word normalization, and 3. named entity recognition.
URI: http://scholarbank.nus.edu.sg/handle/10635/121108
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
Aobo_WANG_phd_thesis.pdf1.52 MBAdobe PDF

OPEN

NoneView/Download

Page view(s)

61
checked on Sep 21, 2018

Download(s)

80
checked on Sep 21, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.