Please use this identifier to cite or link to this item:
Title: Enhancing HMM-based biomedical named entity recognition by studying special phenomena
Authors: Zhang, J.
Shen, D.
Zhou, G.
Su, J.
Tan, C.-L. 
Keywords: Abbreviation recognition
Biomedical named entity recognition
Cascaded named entity recognition
Issue Date: 2004
Citation: Zhang, J., Shen, D., Zhou, G., Su, J., Tan, C.-L. (2004). Enhancing HMM-based biomedical named entity recognition by studying special phenomena. Journal of Biomedical Informatics 37 (6) : 411-422. ScholarBank@NUS Repository.
Abstract: The purpose of this research is to enhance an HMM-based named entity recognizer in the biomedical domain. First, we analyze the characteristics of biomedical named entities. Then, we propose a rich set of features, including orthographic, morphological, part-of-speech, and semantic trigger features. All these features are integrated via a Hidden Markov Model with back-off modeling. Furthermore, we propose a method for biomedical abbreviation recognition and two methods for cascaded named entity recognition. Evaluation on the GENIA V3.02 and V1.1 shows that our system achieves 66.5 and 62.5 F-measure, respectively, and outperforms the previous best published system by 8.1 F-measure on the same experimental setting. The major contribution of this paper lies in its rich feature set specially designed for biomedical domain and the effective methods for abbreviation and cascaded named entity recognition. To our best knowledge, our system is the first one that copes with the cascaded phenomena. © 2004 Elsevier Inc. All rights reserved.
Source Title: Journal of Biomedical Informatics
ISSN: 15320464
DOI: 10.1016/j.jbi.2004.08.005
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.


checked on Mar 16, 2019


checked on Feb 27, 2019

Page view(s)

checked on Mar 3, 2019

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.