A machine learning approach for the duration of biomedical literature

Shi, M.; Edwin, D.S.; Menon, R.; Shen, L.; Lim, J.Y.K.; Loh, H.T.; Keerthi, S.S.; Ong, C.J.

Publication

A machine learning approach for the duration of biomedical literature

Shi, M.;
Edwin, D.S.
; Menon, R.; Shen, L.;
Lim, J.Y.K.
; Loh, H.T.; Keerthi, S.S.; Ong, C.J.

Abstract

In the field of the biomedical sciences there exists a vast repository of information located within large quantities of research papers. Very often, researchers need to spend considerable amounts of time reading through entire papers before being able to determine whether or not they should be curated (archived). In this paper, we present an automated text classification system for the classification of biomedical papers. This classification is based on whether there is experimental evidence for the expression of molecular gene products for specified genes within a given paper. The system performs pre-processing and data cleaning, followed by feature extraction from the raw text. It subsequently classifies the paper using the extracted features with a Naïve Bayes Classifier. Our approach has made it possible to classify (and curate) biomedical papers automatically, thus potentially saving considerable time and resources. The system proved to be highly accurate, and won honourable mention in the KDD Cup 2002 task 1. © Springer-Verlag Berlin Heidelberg 2003.

Source Title

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Collections

Staff Publications

Show all metadata

Organizational Units

Organizational Unit

MECHANICAL ENGINEERING

dept

Date

2003

URI

https://scholarbank.nus.edu.sg/handle/10635/54326

Type

Article

A machine learning approach for the duration of biomedical literature

Shi, M.;
Edwin, D.S.
; Menon, R.; Shen, L.;
Lim, J.Y.K.
; Loh, H.T.; Keerthi, S.S.; Ong, C.J.

Citations

Alternative Title

Abstract

Keywords

Source Title

Publisher

Series/Report No.

Collections

Organizational Units

Rights

Date

URI

DOI

Type

Additional Links

Related Datasets

Related Publications

A machine learning approach for the duration of biomedical literature

Shi, M.; Edwin, D.S. ; Menon, R.; Shen, L.; Lim, J.Y.K. ; Loh, H.T.; Keerthi, S.S.; Ong, C.J.

Citations

Alternative Title

Abstract

Keywords

Source Title

Publisher

Series/Report No.

Collections

Organizational Units

Rights

Date

URI

DOI

Type

Additional Links

Related Datasets

Related Publications

Shi, M.;
Edwin, D.S.
; Menon, R.; Shen, L.;
Lim, J.Y.K.
; Loh, H.T.; Keerthi, S.S.; Ong, C.J.