Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/14600
DC FieldValue
dc.titleGlobal rule induction for information extraction
dc.contributor.authorXIAO JING
dc.date.accessioned2010-04-08T10:44:50Z
dc.date.available2010-04-08T10:44:50Z
dc.date.issued2005-04-14
dc.identifier.citationXIAO JING (2005-04-14). Global rule induction for information extraction. ScholarBank@NUS Repository.
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/14600
dc.description.abstractInformation Extraction (IE) is designed to extract specific data from high volumes of text, using robust means. Pattern rule induction is one kind of techniques which have been widely used in IE. This thesis focuses on pattern rule induction for IE on both semi-structured and free texts. First, we introduce GRID, a Global Rule Induction for text Documents, which emphasizes on utilizing the global feature distribution of all of the training examples to start the rule induction process. Then, we show GRID can be applied successfully in definitional question answering and video story segmentation tasks. Lastly, we introduce two weakly supervised learning paradigms by using GRID as the base learner. One weakly supervised learning scheme is realized by combing co-training GRID with two views and active learning. The other weakly supervised learning paradigm is implemented by cascading use of a soft pattern learner and GRID.
dc.language.isoen
dc.subjectinformation extraction; rule induction; rule generalization
dc.typeThesis
dc.contributor.departmentCOMPUTER SCIENCE
dc.contributor.supervisorCHUA TAT SENG
dc.description.degreePh.D
dc.description.degreeconferredDOCTOR OF PHILOSOPHY
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Ph.D Theses (Open)

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
XiaoJ.pdf653.52 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.