Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/171716
DC FieldValue
dc.titleExponential Improvements for Testing Independence in Data Streams
dc.contributor.authorArnab Bhattacharyya
dc.contributor.authorDesai, Rathin
dc.contributor.authorLi, Yi
dc.contributor.authorWoodruff, David P
dc.date.accessioned2020-07-27T07:15:47Z
dc.date.available2020-07-27T07:15:47Z
dc.date.issued2020-07-16
dc.identifier.citationArnab Bhattacharyya, Desai, Rathin, Li, Yi, Woodruff, David P (2020-07-16). Exponential Improvements for Testing Independence in Data Streams. ScholarBank@NUS Repository.
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/171716
dc.description.abstractWe study the problem of testing whether a joint distribution given by a stream of m tuples in [k] n is a product distribution. Independence testing has been studied previously in the streaming context by Indyk and McGregor (SODA ‘08) and Braverman et al. (STOC ’10, STACS ’10). They designed algorithms to obtain a (1 ± ε)-approximation of the distance between the input joint distribution on tuples in [k] n and the product distribution of its marginals. We give new algorithms with improved space complexity bounds. In particular, for approximating the `2 distance between the joint and product distributions, our space usage is O(nε−2log(mk)) instead of the previous best O˜(3nε−2log(mk)). For approximating the `1 distance, when m =poly(kn), our space usage is exp(O(n2 + n log(n/ε) + log log k)) instead of the previous best (ε−1log(k))(30+n)n. Additionally, we investigate streaming algorithms for testing goodness-offit with respect to Markov chains and Markov random fields.
dc.sourceElements
dc.typeArticle
dc.date.updated2020-07-24T12:34:39Z
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.published.statePublished
dc.description.redepositCompleted
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
MAIN.pdfSubmitted version480.91 kBAdobe PDF

CLOSED

Pre-print

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.