Please use this identifier to cite or link to this item:
https://doi.org/10.1261/rna.5890304
DC Field | Value | |
---|---|---|
dc.title | Prediction of RNA-binding proteins from primary sequence by a support vector machine approach | |
dc.contributor.author | Han, L.Y. | |
dc.contributor.author | Cai, C.Z. | |
dc.contributor.author | Lo, S.L. | |
dc.contributor.author | Chung, M.C.M. | |
dc.contributor.author | Chen, Y.Z. | |
dc.date.accessioned | 2014-05-19T02:54:13Z | |
dc.date.available | 2014-05-19T02:54:13Z | |
dc.date.issued | 2004-03 | |
dc.identifier.citation | Han, L.Y., Cai, C.Z., Lo, S.L., Chung, M.C.M., Chen, Y.Z. (2004-03). Prediction of RNA-binding proteins from primary sequence by a support vector machine approach. RNA 10 (3) : 355-368. ScholarBank@NUS Repository. https://doi.org/10.1261/rna.5890304 | |
dc.identifier.issn | 13558382 | |
dc.identifier.uri | http://scholarbank.nus.edu.sg/handle/10635/53100 | |
dc.description.abstract | Elucidation of the interaction of proteins with different molecules is of significance in the understanding of cellular processes. Computational methods have been developed for the prediction of protein-protein interactions. But insufficient attention has been paid to the prediction of protein-RNA interactions, which play central roles in regulating gene expression and certain RNA-mediated enzymatic processes. This work explored the use of a machine learning method, support vector machines (SVM), for the prediction of RNA-binding proteins directly from their primary sequence. Based on the knowledge of known RNA-binding and non-RNA-binding proteins, an SVM system was trained to recognize RNA-binding proteins. A total of 4011 RNA-binding and 9781 non-RNA-binding proteins was used to train and test the SVM classification system, and an independent set of 447 RNA-binding and 4881 non-RNA-binding proteins was used to evaluate the classification accuracy. Testing results using this independent evaluation set show a prediction accuracy of 94.1%, 79.3%, and 94.1% for rRNA-, mRNA-, and tRNA-binding proteins, and 98.7%, 96.5%, and 99.9% for non-rRNA-, non-mRNA-, and non-tRNA-binding proteins, respectively. The SVM classification system was further tested on a small class of snRNA-binding proteins with only 60 available sequences. The prediction accuracy is 40.0% and 99.9% for snRNA-binding and non-snRNA-binding proteins, indicating a need for a sufficient number of proteins to train SVM. The SVM classification systems trained in this work were added to our Web-based protein functional classification software SVMProt, at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi. Our study suggests the potential of SVM as a useful tool for facilitating the prediction of protein-RNA interactions. | |
dc.description.uri | http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1261/rna.5890304 | |
dc.source | Scopus | |
dc.subject | mRNA | |
dc.subject | RNA-binding proteins | |
dc.subject | RNA-protein interactions | |
dc.subject | rRNA | |
dc.subject | snRNA | |
dc.subject | Support vector machine | |
dc.subject | tRNA | |
dc.type | Article | |
dc.contributor.department | BIOLOGICAL SCIENCES | |
dc.contributor.department | COMPUTATIONAL SCIENCE | |
dc.contributor.department | BIOPROCESSING TECHNOLOGY CENTRE | |
dc.description.doi | 10.1261/rna.5890304 | |
dc.description.sourcetitle | RNA | |
dc.description.volume | 10 | |
dc.description.issue | 3 | |
dc.description.page | 355-368 | |
dc.description.coden | RNARF | |
dc.identifier.isiut | 000189115400003 | |
Appears in Collections: | Staff Publications |
Show simple item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.