Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/27702
DC Field | Value | |
---|---|---|
dc.title | Multi-criteria-based active learning for named entity recognition | |
dc.contributor.author | SHEN DAN | |
dc.date.accessioned | 2011-10-12T18:01:30Z | |
dc.date.available | 2011-10-12T18:01:30Z | |
dc.date.issued | 2004-05-05 | |
dc.identifier.citation | SHEN DAN (2004-05-05). Multi-criteria-based active learning for named entity recognition. ScholarBank@NUS Repository. | |
dc.identifier.uri | https://scholarbank.nus.edu.sg/handle/10635/27702 | |
dc.description.abstract | In this thesis, we propose a multi-criteria-based active learning approach and effectively apply it to the named entity recognition task. Active learning targets to minimize the human annotation efforts to learn a model with the same performance level as supervised learning by selecting the most useful examples for labeling. To maximize the contribution of the selected examples, we consider the multiple criteria including informativeness, representativeness and diversity and propose some measurements to quantify them respectively in the SVM-based named entity recognition. More comprehensively, we effectively incorporate all the criteria using two active learning strategies, both of which result in less labeling cost than the single-criterion-based method. The best results show that the labeling cost can be reduced by 95% in the newswire domain and 86% in the biomedical domain without degrading the performance of the named entity recognizer. To our best knowledge, this is not only the first work to incorporate the multiple criteria in active learning but also the first work to study active learning for named entity recognition. Furthermore, since the above measurements and active learning strategies are quite general, they can also be easily adapted to other natural language processing tasks. | |
dc.language.iso | en | |
dc.subject | active learning, named entity recognition, multiple criteria, informativeness, representativeness, diversity | |
dc.type | Thesis | |
dc.contributor.department | COMPUTER SCIENCE | |
dc.description.degree | Master's | |
dc.description.degreeconferred | MASTER OF SCIENCE | |
dc.identifier.isiut | NOT_IN_WOS | |
Appears in Collections: | Master's Theses (Open) |
Show simple item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
SHEND.pdf | 872.85 kB | Adobe PDF | OPEN | None | View/Download |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.