Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/40854
Title: Piers: An efficient model for similarity search in DNA sequence databases
Authors: Cao, X.; Li, S.C.; Ooi, B.C.; Tung, A.K.H.
Issue Date: 2004
Source: Cao, X., Li, S.C., Ooi, B.C., Tung, A.K.H. (2004). Piers: An efficient model for similarity search in DNA sequence databases. SIGMOD Record 33 (2): 39-44. ScholarBank@NUS Repository.
Abstract: Growing interest in genomic research has resulted in the creation of huge biological sequence databases. In this paper, we present a hash-based pier model for efficient homology search in large DNA sequence databases. In our model, only certain segments in the databases, called 'piers', need to be accessed during searches, as opposed to other approaches, which require a full scan of the biological sequence database. To further improve search efficiency, the piers are stored in a specially designed hash table, which helps to avoid expensive alignment operations. The hash table is small enough to reside in main memory, hence avoiding I/O in the search steps. We show theoretically and empirically that the proposed approach can efficiently detect biological sequences that are similar to a query sequence with very high sensitivity.
Source Title: SIGMOD Record
URI: http://scholarbank.nus.edu.sg/handle/10635/40854
ISSN: 0163-5808
Appears in Collections: Staff Publications

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.