On the design of high-performance algorithms for aligning multiple protein sequences on mesh-based multiprocessor architectures

Please use this identifier to cite or link to this item: https://doi.org/10.1016/j.jpdc.2007.03.007

DC Field	Value
dc.title	On the design of high-performance algorithms for aligning multiple protein sequences on mesh-based multiprocessor architectures
dc.contributor.author	Low, D.H.P.
dc.contributor.author	Veeravalli, B.
dc.contributor.author	Bader, D.A.
dc.date.accessioned	2014-06-17T02:59:51Z
dc.date.available	2014-06-17T02:59:51Z
dc.date.issued	2007-09
dc.identifier.citation	Low, D.H.P., Veeravalli, B., Bader, D.A. (2007-09). On the design of high-performance algorithms for aligning multiple protein sequences on mesh-based multiprocessor architectures. Journal of Parallel and Distributed Computing 67 (9) : 1007-1017. ScholarBank@NUS Repository. https://doi.org/10.1016/j.jpdc.2007.03.007
dc.identifier.issn	07437315
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/56891
dc.description.abstract	In this paper, we address the problem of multiple sequence alignment (MSA) for a very large number of protein sequences on mesh-based multiprocessor architectures. As the problem has been conclusively shown to be computationally complex, we employ the divisible load paradigm (also referred to as divisible load theory, DLT) to handle such a large number of sequences. We design an efficient computational engine capable of conducting MSAs by exploiting the parallelism embedded in the computational steps of multiple sequence alignment algorithms. Specifically, we consider the standard Smith-Waterman (SW) algorithm in our implementation; however, our approach is by no means restricted to the SW class of algorithms. The treatment used in this paper is generic to a class of similar dynamic programming problems. Our approach is recursive in the sense that the quality of solutions can be refined continuously until an acceptable level of quality is achieved. After the first phase of computation, we design a heuristic scheme that renders the final solution for MSA. We conduct rigorous simulation experiments using several hundred homologous protein sequences derived from the Rattus norvegicus and Mus musculus databases of olfactory receptors. We quantify the performance using the speed-up metric. We compare our algorithms with serial (single-machine) processing approaches. We validate our findings by comparing with the conventional equal load partitioning (ELP) strategy that is commonly used in the parallel processing literature. Based on our extensive simulation study, we observe that the DLT paradigm offers excellent speed-up characteristics and provides avenues for its use in several other problems related to biological sequence processing. This study is a first attempt at using the DLT paradigm to devise efficient strategies to handle the large-scale multiple protein sequence alignment problem on mesh-based multiprocessor systems.
© 2007 Elsevier Inc. All rights reserved.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1016/j.jpdc.2007.03.007
dc.source	Scopus
dc.subject	Divisible loads
dc.subject	Mesh topology
dc.subject	Multiple sequence alignment
dc.subject	Protein sequences
dc.subject	Smith-Waterman algorithm
dc.type	Article
dc.contributor.department	ELECTRICAL & COMPUTER ENGINEERING
dc.description.doi	10.1016/j.jpdc.2007.03.007
dc.description.sourcetitle	Journal of Parallel and Distributed Computing
dc.description.volume	67
dc.description.issue	9
dc.description.page	1007-1017
dc.description.coden	JPDCE
dc.identifier.isiut	000249172000004
Appears in Collections:	Staff Publications
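
For readers unfamiliar with the Smith-Waterman (SW) algorithm named in the abstract, the following is a minimal sketch of its local alignment scoring recurrence. It is not the authors' implementation: the function name `smith_waterman_score`, the match/mismatch values, and the linear gap penalty are illustrative assumptions (protein alignment in practice typically uses a substitution matrix), and the parallel DLT scheduling described in the paper is not shown.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Scoring values are illustrative assumptions, not taken from the paper.
    Only two rows of the dynamic programming table are kept in memory.
    """
    prev = [0] * (len(b) + 1)  # row i-1 of the score table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)  # row i of the score table
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1]
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # Hypothetical short peptide fragments, for illustration only
    print(smith_waterman_score("MKTAYIAK", "MKTAYLAK"))
```

Each cell depends only on its left, upper, and upper-left neighbours, which is the kind of structure that allows the pairwise computations to be distributed across processors.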

Files in This Item:

There are no files associated with this item.
