Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/42128
Title: | Space efficient indexes for string matching with don't cares | Authors: | Lam, T.-W. Sung, W.-K. Tam, S.-L. Yiu, S.-M. |
Issue Date: | 2007 | Citation: | Lam, T.-W.,Sung, W.-K.,Tam, S.-L.,Yiu, S.-M. (2007). Space efficient indexes for string matching with don't cares. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 4835 LNCS : 846-857. ScholarBank@NUS Repository. | Abstract: | Given a text T of length n, the classical indexing problem for pattern matching is to build an index for T so that for any query pattern P, we can report efficiently all occurrences of P in T. Cole et al (2004) extended this problem to allow don't care characters (wildcards) in the text and pattern, and they gave the first index that supports efficient pattern matching. The space complexity of this index is linear in n (text length) but exponential in terms of the number of wildcards. Motivated by bioinformatics applications, we investigate indexes whose size depends on n only. In the literature, space efficient indexes for wildcard matching are known only for the special case when wildcards appear only in the pattern (Iliopoulos and Rahman, 2007); the space required is O(n). Not much has been heard for the case when wildcards appear in the text only, or in both the text and pattern. In this paper we give an O(n) space index to support efficient wildcard matching in all three cases. Our solution to the pattern-only case improves the matching time of the previous work tremendously in practice. In addition, our solution can be extended to handle optional wildcards, each of which can match zero or one character. © Springer-Verlag Berlin Heidelberg 2007. | Source Title: | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | URI: | http://scholarbank.nus.edu.sg/handle/10635/42128 | ISBN: | 9783540771180 | ISSN: | 03029743 |
Appears in Collections: | Staff Publications |
Show full item record
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.