Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/121108
Title: ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT
Authors: WANG AOBO
Keywords: Chinese microtext, social media, word segmentation, normalization, named entity recognition, conditional random field (CRF)
Issue Date: 8-Jan-2015
Citation: WANG AOBO (2015-01-08). ADDRESSING INFORMALITY IN PROCESSING CHINESE MICROTEXT. ScholarBank@NUS Repository.
Abstract: In this thesis, I tackle the problem of processing Chinese microtext, with the goal of building natural language processing (NLP) tools for the microtext domain. I find that informal words and named entities formed in a free-style manner are key reasons why microtext is difficult for conventional natural language processing tools to understand and process. Accordingly, I study three key areas to address informality in processing Chinese microtext: (1) informal word recognition and word segmentation, (2) informal word normalization, and (3) named entity recognition.
URI: http://scholarbank.nus.edu.sg/handle/10635/121108
Appears in Collections: Ph.D Theses (Open)

Files in This Item:
File: Aobo_WANG_phd_thesis.pdf
Size: 1.52 MB
Format: Adobe PDF
Access Settings: OPEN
Version: None

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.