Please use this identifier to cite or link to this item: https://doi.org/10.1002/prot.24278
DC FieldValue
dc.titleAccurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences
dc.contributor.authorChen, P.
dc.contributor.authorLi, J.
dc.contributor.authorWong, L.
dc.contributor.authorKuwahara, H.
dc.contributor.authorHuang, J.Z.
dc.contributor.authorGao, X.
dc.date.accessioned2014-07-04T03:08:59Z
dc.date.available2014-07-04T03:08:59Z
dc.date.issued2013-08
dc.identifier.citationChen, P., Li, J., Wong, L., Kuwahara, H., Huang, J.Z., Gao, X. (2013-08). Accurate prediction of hot spot residues through physicochemical characteristics of amino acid sequences. Proteins: Structure, Function and Bioinformatics 81 (8) : 1351-1362. ScholarBank@NUS Repository. https://doi.org/10.1002/prot.24278
dc.identifier.issn08873585
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/77809
dc.description.abstractHot spot residues of proteins are fundamental interface residues that help proteins perform their functions. Detecting hot spots by experimental methods is costly and time-consuming. Sequential and structural information has been widely used in the computational prediction of hot spots. However, structural information is not always available. In this article, we investigated the problem of identifying hot spots using only physicochemical characteristics extracted from amino acid sequences. We first extracted 132 relatively independent physicochemical features from a set of the 544 properties in AAindex1, an amino acid index database. Each feature was utilized to train a classification model with a novel encoding schema for hot spot prediction by the IBk algorithm, an extension of the K-nearest neighbor algorithm. The combinations of the individual classifiers were explored and the classifiers that appeared frequently in the top performing combinations were selected. The hot spot predictor was built based on an ensemble of these classifiers and to work in a voting manner. Experimental results demonstrated that our method effectively exploited the feature space and allowed flexible weights of features for different queries. On the commonly used hot spot benchmark sets, our method significantly outperformed other machine learning algorithms and state-of-the-art hot spot predictors. The program is available at http://sfb.kaust.edu.sa/pages/software.aspx. © 2013 Wiley Periodicals, Inc.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1002/prot.24278
dc.sourceScopus
dc.subjectClassification
dc.subjectFeature selection
dc.subjectHot spot residue
dc.subjectPhysicochemical characteristic
dc.subjectProtein-protein interaction
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1002/prot.24278
dc.description.sourcetitleProteins: Structure, Function and Bioinformatics
dc.description.volume81
dc.description.issue8
dc.description.page1351-1362
dc.identifier.isiut000329220400006
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.