Combining relations for information extraction from free text | ScholarBank@NUS

Please use this identifier to cite or link to this item: https://doi.org/10.1145/1777432.1777437

Title:	Combining relations for information extraction from free text
Authors:	Maslennikov, M.; Chua, T.-S.
Keywords:	Bootstrapping; Dependency relations; Discourse relations; Information extraction; Semantic relations
Issue Date:	2010
Citation:	Maslennikov, M., Chua, T.-S. (2010). Combining relations for information extraction from free text. ACM Transactions on Information Systems 28 (3). ScholarBank@NUS Repository. https://doi.org/10.1145/1777432.1777437
Abstract:	Relations between entities of the same semantic type tend to be sparse in free texts. Therefore, combining relations is the key to effective information extraction (IE) on free text datasets with a small set of training samples. Previous approaches to bootstrapping for IE used different types of relations, such as dependency or co-occurrence, and faced the problems of paraphrasing and misalignment of instances. To cope with these problems, we propose a framework that integrates several types of relations. After extracting candidate entities, our framework evaluates relations between them at the phrasal, dependency, semantic frame, and discourse levels. For each of these levels, we build a classifier that outputs a score for relation instances. To integrate these scores, we propose three strategies: (1) integrate evaluation scores from each relation classifier; (2) incorporate the elimination of negatively labeled instances into strategy (1); and (3) add cascading of extracted relations into strategy (2). Our framework improves the state-of-the-art results for supervised systems by 8%, 15%, 3%, and 5% on the MUC4 (terrorism); MUC6 (management succession); ACE RDC 2003 (news, general types); and ACE RDC 2003 (news, specific types) domains, respectively. © 2010 ACM.
Source Title:	ACM Transactions on Information Systems
URI:	http://scholarbank.nus.edu.sg/handle/10635/39890
ISSN:	1046-8188
DOI:	10.1145/1777432.1777437
Appears in Collections:	Staff Publications
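
The first two integration strategies named in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the authors' method: the abstract does not say how scores are combined or how negative labels are decided, so the weighted mean, the `reject_below` threshold, and all names (`RelationInstance`, `integrate`, `integrate_with_elimination`) are hypothetical. Strategy (3), cascading, is left out because the abstract does not describe its mechanics.

```python
from __future__ import annotations

from dataclasses import dataclass

# The four relation levels listed in the abstract.
LEVELS = ("phrasal", "dependency", "semantic_frame", "discourse")


@dataclass
class RelationInstance:
    """A candidate relation between two entities (hypothetical structure)."""
    pair: tuple[str, str]
    scores: dict[str, float]  # per-level classifier score, assumed to lie in [0, 1]


def integrate(inst: RelationInstance, weights: dict[str, float] | None = None) -> float:
    """Strategy (1): combine the per-level scores.

    A weighted mean is an assumption; the abstract does not give the formula.
    A missing level score is treated as 0.0.
    """
    weights = weights or {level: 1.0 for level in LEVELS}
    total = sum(weights[level] for level in LEVELS)
    return sum(weights[level] * inst.scores.get(level, 0.0) for level in LEVELS) / total


def integrate_with_elimination(
    inst: RelationInstance, reject_below: float = 0.2
) -> float | None:
    """Strategy (2): strategy (1) plus elimination of negatively labeled instances.

    Assumption: an instance counts as negatively labeled when any level's
    score falls below `reject_below`; such instances return None.
    """
    if any(inst.scores.get(level, 0.0) < reject_below for level in LEVELS):
        return None
    return integrate(inst)
```

With equal weights, an instance scoring 0.8, 0.6, 0.4, and 0.2 across the four levels integrates to roughly 0.5 under strategy (1), and is eliminated under strategy (2) whenever `reject_below` exceeds 0.2.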

