Please use this identifier to cite or link to this item: https://doi.org/10.1186/1471-2105-15-166
DC FieldValue
dc.titleOn the necessity of dissecting sequence similarity scores into segment-specific contributions for inferring protein homology, function prediction and annotation
dc.contributor.authorWong, W.-C
dc.contributor.authorMaurer-Stroh, S
dc.contributor.authorEisenhaber, B
dc.contributor.authorEisenhaber, F
dc.date.accessioned2020-09-04T02:13:44Z
dc.date.available2020-09-04T02:13:44Z
dc.date.issued2014
dc.identifier.citationWong, W.-C, Maurer-Stroh, S, Eisenhaber, B, Eisenhaber, F (2014). On the necessity of dissecting sequence similarity scores into segment-specific contributions for inferring protein homology, function prediction and annotation. BMC Bioinformatics 15 (1) : 166. ScholarBank@NUS Repository. https://doi.org/10.1186/1471-2105-15-166
dc.identifier.issn14712105
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/174303
dc.description.abstractBackground: Protein sequence similarities to any types of non-globular segments (coiled coils, low complexity regions, transmembrane regions, long loops, etc. where either positional sequence conservation is the result of a very simple, physically induced pattern or rather integral sequence properties are critical) are pertinent sources for mistaken homologies. Regretfully, these considerations regularly escape attention in large-scale annotation studies since, often, there is no substitute to manual handling of these cases. Quantitative criteria are required to suppress events of function annotation transfer as a result of false homology assignments.Results: The sequence homology concept is based on the similarity comparison between the structural elements, the basic building blocks for conferring the overall fold of a protein. We propose to dissect the total similarity score into fold-critical and other, remaining contributions and suggest that, for a valid homology statement, the fold-relevant score contribution should at least be significant on its own. As part of the article, we provide the DissectHMMER software program for dissecting HMMER2/3 scores into segment-specific contributions. We show that DissectHMMER reproduces HMMER2/3 scores with sufficient accuracy and that it is useful in automated decisions about homology for instructive sequence examples. To generalize the dissection concept for cases without 3D structural information, we find that a dissection based on alignment quality is an appropriate surrogate. The approach was applied to a large-scale study of SMART and PFAM domains in the space of seed sequences and in the space of UniProt/SwissProt.Conclusions: Sequence similarity core dissection with regard to fold-critical and other contributions systematically suppresses false hits and, additionally, recovers previously obscured homology relationships such as the one between aquaporins and formate/nitrite transporters that, so far, was only supported by structure comparison. © 2014 Wong et al.; licensee BioMed Central Ltd.
dc.publisherBioMed Central Ltd.
dc.sourceUnpaywall 20200831
dc.subjectDissection
dc.subjectHidden Markov models
dc.subjectProtein domains
dc.subjectProtein function annotation
dc.subjectProtein sequences
dc.subjectSequence homology
dc.subjectSequence similarity
dc.subjectSimilarity scores
dc.subjectProteins
dc.subjectprotein
dc.subjectamino acid sequence
dc.subjectanimal
dc.subjectarticle
dc.subjectchemistry
dc.subjectcomputer program
dc.subjectgenetics
dc.subjecthuman
dc.subjectmolecular genetics
dc.subjectnucleotide sequence
dc.subjectsequence alignment
dc.subjectsequence homology
dc.subjectAmino Acid Sequence
dc.subjectAnimals
dc.subjectConserved Sequence
dc.subjectHumans
dc.subjectMolecular Sequence Data
dc.subjectProteins
dc.subjectSequence Alignment
dc.subjectSequence Homology, Amino Acid
dc.subjectSoftware
dc.typeArticle
dc.contributor.departmentBIOLOGICAL SCIENCES
dc.description.doi10.1186/1471-2105-15-166
dc.description.sourcetitleBMC Bioinformatics
dc.description.volume15
dc.description.issue1
dc.description.page166
dc.published.statePublished
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_1471-2105-15-166.pdf3.59 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.