Please use this identifier to cite or link to this item:
https://doi.org/10.1261/rna.5890304
Title: | Prediction of RNA-binding proteins from primary sequence by a support vector machine approach | Authors: | Han, L.Y. Cai, C.Z. Lo, S.L. Chung, M.C.M. Chen, Y.Z. |
Keywords: | mRNA RNA-binding proteins RNA-protein interactions rRNA snRNA Support vector machine tRNA |
Issue Date: | Mar-2004 | Citation: | Han, L.Y., Cai, C.Z., Lo, S.L., Chung, M.C.M., Chen, Y.Z. (2004-03). Prediction of RNA-binding proteins from primary sequence by a support vector machine approach. RNA 10 (3) : 355-368. ScholarBank@NUS Repository. https://doi.org/10.1261/rna.5890304 | Abstract: | Elucidation of the interaction of proteins with different molecules is of significance in the understanding of cellular processes. Computational methods have been developed for the prediction of protein-protein interactions. But insufficient attention has been paid to the prediction of protein-RNA interactions, which play central roles in regulating gene expression and certain RNA-mediated enzymatic processes. This work explored the use of a machine learning method, support vector machines (SVM), for the prediction of RNA-binding proteins directly from their primary sequence. Based on the knowledge of known RNA-binding and non-RNA-binding proteins, an SVM system was trained to recognize RNA-binding proteins. A total of 4011 RNA-binding and 9781 non-RNA-binding proteins was used to train and test the SVM classification system, and an independent set of 447 RNA-binding and 4881 non-RNA-binding proteins was used to evaluate the classification accuracy. Testing results using this independent evaluation set show a prediction accuracy of 94.1%, 79.3%, and 94.1% for rRNA-, mRNA-, and tRNA-binding proteins, and 98.7%, 96.5%, and 99.9% for non-rRNA-, non-mRNA-, and non-tRNA-binding proteins, respectively. The SVM classification system was further tested on a small class of snRNA-binding proteins with only 60 available sequences. The prediction accuracy is 40.0% and 99.9% for snRNA-binding and non-snRNA-binding proteins, indicating a need for a sufficient number of proteins to train SVM. The SVM classification systems trained in this work were added to our Web-based protein functional classification software SVMProt, at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi. Our study suggests the potential of SVM as a useful tool for facilitating the prediction of protein-RNA interactions. | Source Title: | RNA | URI: | http://scholarbank.nus.edu.sg/handle/10635/53100 | ISSN: | 13558382 | DOI: | 10.1261/rna.5890304 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
SCOPUSTM
Citations
111
checked on Jun 1, 2023
WEB OF SCIENCETM
Citations
98
checked on Jun 1, 2023
Page view(s)
229
checked on May 25, 2023
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.