Please use this identifier to cite or link to this item: https://doi.org/10.1261/rna.5890304
DC FieldValue
dc.titlePrediction of RNA-binding proteins from primary sequence by a support vector machine approach
dc.contributor.authorHan, L.Y.
dc.contributor.authorCai, C.Z.
dc.contributor.authorLo, S.L.
dc.contributor.authorChung, M.C.M.
dc.contributor.authorChen, Y.Z.
dc.date.accessioned2014-05-19T02:54:13Z
dc.date.available2014-05-19T02:54:13Z
dc.date.issued2004-03
dc.identifier.citationHan, L.Y., Cai, C.Z., Lo, S.L., Chung, M.C.M., Chen, Y.Z. (2004-03). Prediction of RNA-binding proteins from primary sequence by a support vector machine approach. RNA 10 (3) : 355-368. ScholarBank@NUS Repository. https://doi.org/10.1261/rna.5890304
dc.identifier.issn13558382
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/53100
dc.description.abstractElucidation of the interaction of proteins with different molecules is of significance in the understanding of cellular processes. Computational methods have been developed for the prediction of protein-protein interactions. But insufficient attention has been paid to the prediction of protein-RNA interactions, which play central roles in regulating gene expression and certain RNA-mediated enzymatic processes. This work explored the use of a machine learning method, support vector machines (SVM), for the prediction of RNA-binding proteins directly from their primary sequence. Based on the knowledge of known RNA-binding and non-RNA-binding proteins, an SVM system was trained to recognize RNA-binding proteins. A total of 4011 RNA-binding and 9781 non-RNA-binding proteins was used to train and test the SVM classification system, and an independent set of 447 RNA-binding and 4881 non-RNA-binding proteins was used to evaluate the classification accuracy. Testing results using this independent evaluation set show a prediction accuracy of 94.1%, 79.3%, and 94.1% for rRNA-, mRNA-, and tRNA-binding proteins, and 98.7%, 96.5%, and 99.9% for non-rRNA-, non-mRNA-, and non-tRNA-binding proteins, respectively. The SVM classification system was further tested on a small class of snRNA-binding proteins with only 60 available sequences. The prediction accuracy is 40.0% and 99.9% for snRNA-binding and non-snRNA-binding proteins, indicating a need for a sufficient number of proteins to train SVM. The SVM classification systems trained in this work were added to our Web-based protein functional classification software SVMProt, at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi. Our study suggests the potential of SVM as a useful tool for facilitating the prediction of protein-RNA interactions.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1261/rna.5890304
dc.sourceScopus
dc.subjectmRNA
dc.subjectRNA-binding proteins
dc.subjectRNA-protein interactions
dc.subjectrRNA
dc.subjectsnRNA
dc.subjectSupport vector machine
dc.subjecttRNA
dc.typeArticle
dc.contributor.departmentBIOLOGICAL SCIENCES
dc.contributor.departmentCOMPUTATIONAL SCIENCE
dc.contributor.departmentBIOPROCESSING TECHNOLOGY CENTRE
dc.description.doi10.1261/rna.5890304
dc.description.sourcetitleRNA
dc.description.volume10
dc.description.issue3
dc.description.page355-368
dc.description.codenRNARF
dc.identifier.isiut000189115400003
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.