Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/104637
Title: Superiority and complexity of the spaced seeds
Authors: Li, M.
Ma, B.
Zhang, L. 
Issue Date: 2006
Citation: Li, M.,Ma, B.,Zhang, L. (2006). Superiority and complexity of the spaced seeds. Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms : 444-453. ScholarBank@NUS Repository.
Abstract: Optimal spaced seeds were introduced by the theoretical computer science community to bioinformatics to effectively increase homology search sensitivity. They are now serving thousands of homology search queries daily. While dozens of papers have been published on optimal spaced seeds since their invention, many fundamental questions still remain unanswered. In this paper, we settle several open questions in this area. Specifically, we prove that when the length of a non-uniformly spaced seed is bounded by an exponential function of the seed weight, the seed outperforms strictly the traditional consecutive seed in both (i) the average number of non-overlapping hits and (ii) the asymptotic hit probability. Then, we study the computation of the hit probability of a spaced seed, solving three more open questions: (iii) hit probability computation in a uniform homologous region is NP-hard and (iv) it admits a PTAS; (v) the asymptotic hit probability is computable in exponential time in seed length, independent of the homologous region length.
Source Title: Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
URI: http://scholarbank.nus.edu.sg/handle/10635/104637
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.