Please use this identifier to cite or link to this item: https://scholarbank.nus.edu.sg/handle/10635/171716
Title: Exponential Improvements for Testing Independence in Data Streams
Authors: Arnab Bhattacharyya 
Desai, Rathin
Li, Yi 
Woodruff, David P
Issue Date: 16-Jul-2020
Citation: Arnab Bhattacharyya, Desai, Rathin, Li, Yi, Woodruff, David P (2020-07-16). Exponential Improvements for Testing Independence in Data Streams. ScholarBank@NUS Repository.
Abstract: We study the problem of testing whether a joint distribution given by a stream of m tuples in [k] n is a product distribution. Independence testing has been studied previously in the streaming context by Indyk and McGregor (SODA ‘08) and Braverman et al. (STOC ’10, STACS ’10). They designed algorithms to obtain a (1 ± ε)-approximation of the distance between the input joint distribution on tuples in [k] n and the product distribution of its marginals. We give new algorithms with improved space complexity bounds. In particular, for approximating the `2 distance between the joint and product distributions, our space usage is O(nε−2log(mk)) instead of the previous best O˜(3nε−2log(mk)). For approximating the `1 distance, when m =poly(kn), our space usage is exp(O(n2 + n log(n/ε) + log log k)) instead of the previous best (ε−1log(k))(30+n)n. Additionally, we investigate streaming algorithms for testing goodness-offit with respect to Markov chains and Markov random fields.
URI: https://scholarbank.nus.edu.sg/handle/10635/171716
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
MAIN.pdfSubmitted version480.91 kBAdobe PDF

CLOSED

Pre-print

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.