Please use this identifier to cite or link to this item: https://doi.org/10.1159/000088839
Title: Prediction of functional class of novel bacterial proteins without the use of sequence similarity by a statistical learning method
Authors: Cui, J.
Han, L.Y. 
Cai, C.Z. 
Zheng, C.J. 
Ji, Z.L.
Chen, Y.Z. 
Keywords: Open reading frames, protein
Proteins, non-homologous
SVMProt
Issue Date: Nov-2005
Citation: Cui, J., Han, L.Y., Cai, C.Z., Zheng, C.J., Ji, Z.L., Chen, Y.Z. (2005-11). Prediction of functional class of novel bacterial proteins without the use of sequence similarity by a statistical learning method. Journal of Molecular Microbiology and Biotechnology 9 (2) : 86-100. ScholarBank@NUS Repository. https://doi.org/10.1159/000088839
Abstract: A substantial percentage of the putative protein-encoding open reading frames (ORFs) in bacterial genomes have no homolog of known function, and their function cannot be confidently assigned on the basis of sequence similarity. Methods not based on sequence similarity are needed and being developed. One method, SVMProt (http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi), predicts protein functional family irrespective of sequence similarity (Nucleic Acids Res. 2003;31:3692-3697). While it has been tested on a large number of proteins, its capability for non-homologous proteins has so far been evaluated for a relatively small number of proteins, and additional tests are needed to more fully assess SVMProt. In this work, 90 novel bacterial proteins (non-homologous to known proteins) are used to evaluate the capability of SVMProt. These proteins are such that none of their homologs are in the Swiss-Prot database, their functions not clearly described in the literature, and they themselves and their homologs are not included in the training sets of SVMProt. They represent proteins whose function cannot be confidently predicted by sequence similarity methods at present. The predicted functional class of 76.7% of each of these proteins shows various levels of consistency with the literature-described function, compared to the overall accuracy of 87% for the SVMProt functional class assignment of 34,582 proteins that have at least one homolog of known function. Our study suggests that SVMProt is capable of assigning functional class for novel bacterial proteins at a level not too much lower than that of sequence alignment methods for homologous proteins. Copyright © 2005 S. Karger AG.
Source Title: Journal of Molecular Microbiology and Biotechnology
URI: http://scholarbank.nus.edu.sg/handle/10635/104843
ISSN: 14641801
DOI: 10.1159/000088839
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.