Please use this identifier to cite or link to this item: https://doi.org/10.1142/S0219720009004436
DC FieldValue
dc.titleSirius PSB: A generic system for analysis of biological sequences
dc.contributor.authorKoh, C.H.
dc.contributor.authorLin, S.
dc.contributor.authorJedd, G.
dc.contributor.authorWong, L.
dc.date.accessioned2013-07-04T07:51:41Z
dc.date.available2013-07-04T07:51:41Z
dc.date.issued2009
dc.identifier.citationKoh, C.H.,Lin, S.,Jedd, G.,Wong, L. (2009). Sirius PSB: A generic system for analysis of biological sequences. Journal of Bioinformatics and Computational Biology 7 (6) : 973-990. ScholarBank@NUS Repository. <a href="https://doi.org/10.1142/S0219720009004436" target="_blank">https://doi.org/10.1142/S0219720009004436</a>
dc.identifier.issn02197200
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/39879
dc.description.abstractComputational tools are essential components of modern biological research. For example, BLAST searches can be used to identify related proteins based on sequence homology, or when a new genome is sequenced, prediction models can be used to annotate functional sites such as transcription start sites, translation initiation sites and polyadenylation sites and to predict protein localization. Here we present Sirius Prediction Systems Builder (PSB), a new computational tool for sequence analysis, classification and searching. Sirius PSB has four main operations: (1) Building a classifier, (2) Deploying a classifier, (3) Search for proteins similar to query proteins, (4) Preliminary and post-prediction analysis. Sirius PSB supports all these operations via a simple and interactive graphical user interface. Besides being a convenient tool, Sirius PSB has also introduced two novelties in sequence analysis. Firstly, genetic algorithm is used to identify interesting features in the feature space. Secondly, instead of the conventional method of searching for similar proteins via sequence similarity, we introduced searching via features' similarity. To demonstrate the capabilities of Sirius PSB, we have built two prediction models - one for the recognition of Arabidopsis polyadenylation sites and another for the subcellular localization of proteins. Both systems are competitive against current state-of-the-art models based on evaluation of public datasets. More notably, the time and effort required to build each model is greatly reduced with the assistance of Sirius PSB. Furthermore, we show that under certain conditions when BLAST is unable to find related proteins, Sirius PSB can identify functionally related proteins based on their biophysical similarities. Sirius PSB and its related supplements are available at: http://compbio.ddns. comp.nus.edu.sg/∼sirius. © 2009 Imperial College Press.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1142/S0219720009004436
dc.sourceScopus
dc.subjectPolyadenylation site recognition
dc.subjectReticulon search
dc.subjectSequence analysis
dc.subjectSubcellular localization prediction
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1142/S0219720009004436
dc.description.sourcetitleJournal of Bioinformatics and Computational Biology
dc.description.volume7
dc.description.issue6
dc.description.page973-990
dc.description.codenJBCBB
dc.identifier.isiutNOT_IN_WOS
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.