Please use this identifier to cite or link to this item: https://doi.org/10.1142/S0219720009004436
Title: Sirius PSB: A generic system for analysis of biological sequences
Authors: Koh, C.H.
Lin, S.
Jedd, G.
Wong, L. 
Keywords: Polyadenylation site recognition
Reticulon search
Sequence analysis
Subcellular localization prediction
Issue Date: 2009
Citation: Koh, C.H.,Lin, S.,Jedd, G.,Wong, L. (2009). Sirius PSB: A generic system for analysis of biological sequences. Journal of Bioinformatics and Computational Biology 7 (6) : 973-990. ScholarBank@NUS Repository. https://doi.org/10.1142/S0219720009004436
Abstract: Computational tools are essential components of modern biological research. For example, BLAST searches can be used to identify related proteins based on sequence homology, or when a new genome is sequenced, prediction models can be used to annotate functional sites such as transcription start sites, translation initiation sites and polyadenylation sites and to predict protein localization. Here we present Sirius Prediction Systems Builder (PSB), a new computational tool for sequence analysis, classification and searching. Sirius PSB has four main operations: (1) Building a classifier, (2) Deploying a classifier, (3) Search for proteins similar to query proteins, (4) Preliminary and post-prediction analysis. Sirius PSB supports all these operations via a simple and interactive graphical user interface. Besides being a convenient tool, Sirius PSB has also introduced two novelties in sequence analysis. Firstly, genetic algorithm is used to identify interesting features in the feature space. Secondly, instead of the conventional method of searching for similar proteins via sequence similarity, we introduced searching via features' similarity. To demonstrate the capabilities of Sirius PSB, we have built two prediction models - one for the recognition of Arabidopsis polyadenylation sites and another for the subcellular localization of proteins. Both systems are competitive against current state-of-the-art models based on evaluation of public datasets. More notably, the time and effort required to build each model is greatly reduced with the assistance of Sirius PSB. Furthermore, we show that under certain conditions when BLAST is unable to find related proteins, Sirius PSB can identify functionally related proteins based on their biophysical similarities. Sirius PSB and its related supplements are available at: http://compbio.ddns. comp.nus.edu.sg/∼sirius. © 2009 Imperial College Press.
Source Title: Journal of Bioinformatics and Computational Biology
URI: http://scholarbank.nus.edu.sg/handle/10635/39879
ISSN: 02197200
DOI: 10.1142/S0219720009004436
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.