Please use this identifier to cite or link to this item:
https://scholarbank.nus.edu.sg/handle/10635/171716
Title: | Exponential Improvements for Testing Independence in Data Streams | Authors: | Arnab Bhattacharyya Desai, Rathin Li, Yi Woodruff, David P |
Issue Date: | 16-Jul-2020 | Citation: | Arnab Bhattacharyya, Desai, Rathin, Li, Yi, Woodruff, David P (2020-07-16). Exponential Improvements for Testing Independence in Data Streams. ScholarBank@NUS Repository. | Abstract: | We study the problem of testing whether a joint distribution given by a stream of m tuples in [k] n is a product distribution. Independence testing has been studied previously in the streaming context by Indyk and McGregor (SODA ‘08) and Braverman et al. (STOC ’10, STACS ’10). They designed algorithms to obtain a (1 ± ε)-approximation of the distance between the input joint distribution on tuples in [k] n and the product distribution of its marginals. We give new algorithms with improved space complexity bounds. In particular, for approximating the `2 distance between the joint and product distributions, our space usage is O(nε−2log(mk)) instead of the previous best O˜(3nε−2log(mk)). For approximating the `1 distance, when m =poly(kn), our space usage is exp(O(n2 + n log(n/ε) + log log k)) instead of the previous best (ε−1log(k))(30+n)n. Additionally, we investigate streaming algorithms for testing goodness-offit with respect to Markov chains and Markov random fields. | URI: | https://scholarbank.nus.edu.sg/handle/10635/171716 |
Appears in Collections: | Staff Publications Elements |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
MAIN.pdf | Submitted version | 480.91 kB | Adobe PDF | CLOSED | Pre-print |
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.