Please use this identifier to cite or link to this item: https://doi.org/10.1186/1471-2105-7-164
DC FieldValue
dc.titleApplication of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases
dc.contributor.authorSchneider, G
dc.contributor.authorNeuberger, G
dc.contributor.authorWildpaner, M
dc.contributor.authorTian, S
dc.contributor.authorBerezovsky, I
dc.contributor.authorEisenhaber, F
dc.date.accessioned2020-10-20T04:51:20Z
dc.date.available2020-10-20T04:51:20Z
dc.date.issued2006
dc.identifier.citationSchneider, G, Neuberger, G, Wildpaner, M, Tian, S, Berezovsky, I, Eisenhaber, F (2006). Application of a sensitive collection heuristic for very large protein families: Evolutionary relationship between adipose triglyceride lipase (ATGL) and classic mammalian lipases. BMC Bioinformatics 7 : 164. ScholarBank@NUS Repository. https://doi.org/10.1186/1471-2105-7-164
dc.identifier.issn14712105
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/178021
dc.description.abstractBackground: Manually finding subtle yet statistically significant links to distantly related homologues becomes practically impossible for very populated protein families due to the sheer number of similarity searches to be invoked and analyzed. The unclear evolutionary relationship between classical mammalian lipases and the recently discovered human adipose triglycericle lipase (ATGL; a patatin family member) is an exemplary case for such a problem. Results: We describe an unsupervised, sensitive sequence segment collection heuristic suitable for assembling very large protein families. It is based on fan-like expanding, iterative database searches. To prevent inclusion of unrelated hits, additional criteria are introduced: minimal alignment length and overlap with starting sequence segments, finding starting sequences in reciprocal searches, automated filtering for compositional bias and repetitive patterns. This heuristic was implemented as FAMILYSEARCHER in the ANNIE sequence analysis environment and applied to search for protein links between the classical lipase family and the patatin-like group. Conclusion: The FAMILYSEARCHER is an efficient tool for tracing distant evolutionary relationships involving large protein families. Although classical lipases and ATGIL have no obvious sequence similarity and differ with regard to fold and catalytic mechanism, homology links detected with FAMILYSEARCHER show that they are evolutionarily related. The conserved sequence parts can be narrowed down to an ancestral core module consisting of three ?-strands, one ?-helix and a turn containing the typical nucleophilic serine. Moreover, this ancestral module also appears in numerous enzymes with various substrate specificities, but that critically rely on nucleophilic attack mechanisms. © 2006 Schneider et al; licensee BioMed Central Ltd.
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourceUnpaywall 20201031
dc.subjectCatalytic mechanisms
dc.subjectDatabase searches
dc.subjectEvolutionary relationships
dc.subjectNucleophilic attack
dc.subjectRepetitive pattern
dc.subjectSequence analysis
dc.subjectSequence similarity
dc.subjectVarious substrates
dc.subjectAmino acids
dc.subjectMammals
dc.subjectProteins
dc.subjectSearch engines
dc.subjectLipases
dc.subjectserine
dc.subjecttriacylglycerol lipase
dc.subjecttriacylglycerol lipase
dc.subjectalpha helix
dc.subjectamino acid sequence
dc.subjectarticle
dc.subjectcatalysis
dc.subjectcomputer program
dc.subjectdata base
dc.subjectenzyme specificity
dc.subjecthuman
dc.subjectmammal
dc.subjectmolecular evolution
dc.subjectnonhuman
dc.subjectnucleotide sequence
dc.subjectprotein family
dc.subjectprotein folding
dc.subjectsequence alignment
dc.subjectunindexed sequence
dc.subjectadipose tissue
dc.subjectalgorithm
dc.subjectanimal
dc.subjectchromosome map
dc.subjectDNA sequence
dc.subjectgene linkage disequilibrium
dc.subjectgenetics
dc.subjectmetabolism
dc.subjectprocedures
dc.subjectsequence homology
dc.subjectMammalia
dc.subjectAdipose Tissue
dc.subjectAlgorithms
dc.subjectAnimals
dc.subjectChromosome Mapping
dc.subjectConserved Sequence
dc.subjectEvolution, Molecular
dc.subjectHumans
dc.subjectLinkage Disequilibrium
dc.subjectLipase
dc.subjectMammals
dc.subjectSequence Alignment
dc.subjectSequence Analysis, DNA
dc.subjectSequence Homology, Nucleic Acid
dc.typeArticle
dc.contributor.departmentBIOLOGICAL SCIENCES
dc.description.doi10.1186/1471-2105-7-164
dc.description.sourcetitleBMC Bioinformatics
dc.description.volume7
dc.description.page164
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_1471-2105-7-164.pdf3.3 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons