Please use this identifier to cite or link to this item: https://doi.org/10.4230/LIPIcs.FSTTCS.2019.10
DC FieldValue
dc.titleApproximate Online Pattern Matching in Sublinear Time.
dc.contributor.authorChakraborty, Diptarka
dc.contributor.authorDas, Debarati
dc.contributor.authorKoucký, Michal
dc.date.accessioned2020-06-01T08:15:55Z
dc.date.available2020-06-01T08:15:55Z
dc.date.issued2019-12
dc.identifier.citationChakraborty, Diptarka, Das, Debarati, Koucký, Michal (2019-12). Approximate Online Pattern Matching in Sublinear Time.. Leibniz International Proceedings in Informatics, LIPIcs 150 : 10:1-10:1. ScholarBank@NUS Repository. https://doi.org/10.4230/LIPIcs.FSTTCS.2019.10
dc.identifier.isbn9783959771313
dc.identifier.issn18688969
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/168864
dc.description.abstractWe consider the approximate pattern matching problem under edit distance. In this problem we are given a pattern P of length m and a text T of length n over some alphabet Σ, and a positive integer k. The goal is to find all the positions j in T such that there is a substring of T ending at j which has edit distance at most k from the pattern P. Recall, the edit distance between two strings is the minimum number of character insertions, deletions, and substitutions required to transform one string into the other. For a position t in {1,...,n}, let kt be the smallest edit distance between P and any substring of T ending at t. In this paper we give a constant factor approximation to the sequence k1,k2,...,kn. We consider both offline and online settings. In the offline setting, where both P and T are available, we present an algorithm that for all t in {1,...,n}, computes the value of kt approximately within a constant factor. The worst case running time of our algorithm is Õ(nm3/4). In the online setting, we are given P and then T arrives one symbol at a time. We design an algorithm that upon arrival of the t-th symbol of T computes kt approximately within O(1)multiplicative factor and m8/9-additive error. Our algorithm takes Õ(m1−(7/54)) amortized time per symbol arrival and takes Õ(m1−(1/54)) additional space apart from storing the pattern P. Both of our algorithms are randomized and produce correct answer with high probability. To the best of our knowledge this is the first algorithm that takes worst-case sublinear (in the length of the pattern) time and sublinear extra space for the online approximate pattern matching problem. To get our result we build on the technique of Chakraborty, Das, Goldenberg, Koucký and Saks [FOCS'18] for computing a constant factor approximation of edit distance in sub-quadratic time.
dc.publisherSchloss Dagstuhl - Leibniz-Zentrum für Informatik
dc.sourceElements
dc.typeConference Paper
dc.date.updated2020-05-28T07:27:13Z
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.description.doi10.4230/LIPIcs.FSTTCS.2019.10
dc.description.sourcetitleLeibniz International Proceedings in Informatics, LIPIcs
dc.description.volume150
dc.description.page10:1-10:1
dc.published.statePublished
dc.description.redepositcompleted
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
LIPIcs-FSTTCS-2019-10.pdfPublished version492.36 kBAdobe PDF

OPEN

PublishedView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.