Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/14600
Title: Global rule induction for information extraction
Authors: XIAO JING
Keywords: information extraction; rule induction; rule generalization
Issue Date: 14-Apr-2005
Source: XIAO JING (2005-04-14). Global rule induction for information extraction. ScholarBank@NUS Repository.
Abstract: Information Extraction (IE) is designed to extract specific data from high volumes of text, using robust means. Pattern rule induction is one kind of techniques which have been widely used in IE. This thesis focuses on pattern rule induction for IE on both semi-structured and free texts. First, we introduce GRID, a Global Rule Induction for text Documents, which emphasizes on utilizing the global feature distribution of all of the training examples to start the rule induction process. Then, we show GRID can be applied successfully in definitional question answering and video story segmentation tasks. Lastly, we introduce two weakly supervised learning paradigms by using GRID as the base learner. One weakly supervised learning scheme is realized by combing co-training GRID with two views and active learning. The other weakly supervised learning paradigm is implemented by cascading use of a soft pattern learner and GRID.
URI: http://scholarbank.nus.edu.sg/handle/10635/14600
Appears in Collections:Ph.D Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
XiaoJ.pdf653.52 kBAdobe PDF

OPEN

NoneView/Download

Page view(s)

188
checked on Dec 11, 2017

Download(s)

233
checked on Dec 11, 2017

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.