Collecting and analyzing multidimensional data with local differential privacy

Please use this identifier to cite or link to this item: https://doi.org/10.1109/ICDE.2019.00063

DC Field	Value
dc.title	Collecting and analyzing multidimensional data with local differential privacy
dc.contributor.author	Wang, N
dc.contributor.author	Xiao, X
dc.contributor.author	Yang, Y
dc.contributor.author	Zhao, J
dc.contributor.author	Hui, SC
dc.contributor.author	Shin, H
dc.contributor.author	Shin, J
dc.contributor.author	Yu, G
dc.date.accessioned	2019-07-23T04:04:05Z
dc.date.available	2019-07-23T04:04:05Z
dc.date.issued	2019-04-01
dc.identifier.citation	Wang, N, Xiao, X, Yang, Y, Zhao, J, Hui, SC, Shin, H, Shin, J, Yu, G (2019-04-01). Collecting and analyzing multidimensional data with local differential privacy. Proceedings - International Conference on Data Engineering 2019-April : 638-649. ScholarBank@NUS Repository. https://doi.org/10.1109/ICDE.2019.00063
dc.identifier.issn	1084-4627
dc.identifier.uri	https://scholarbank.nus.edu.sg/handle/10635/156903
dc.description.abstract	© 2019 IEEE. Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP has attracted much research attention in recent years, the majority of existing work focuses on applying LDP to complex data and/or analysis tasks. In this paper, we point out that the fundamental problem of collecting multidimensional data under LDP has not been addressed sufficiently, and there remains much room for improvement even for basic tasks such as computing the mean value over a single numeric attribute under LDP. Motivated by this, we first propose novel LDP mechanisms for collecting a numeric attribute, whose accuracy is at least no worse (and usually better) than existing solutions in terms of worst-case noise variance. Then, we extend these mechanisms to multidimensional data that can contain both numeric and categorical attributes, where our mechanisms always outperform existing solutions regarding worst-case noise variance. As a case study, we apply our solutions to build an LDP-compliant stochastic gradient descent algorithm (SGD), which powers many important machine learning tasks. Experiments using real datasets confirm the effectiveness of our methods, and their advantages over existing solutions.
dc.publisher	IEEE
dc.source	Elements
dc.subject	cs.CR
dc.subject	cs.CR
dc.subject	cs.CY
dc.subject	cs.DB
dc.subject	cs.LG
dc.subject	Local differential privacy, multidimensional data, stochastic gradient descent
dc.type	Article
dc.date.updated	2019-07-22T10:47:34Z
dc.contributor.department	DEPARTMENT OF COMPUTER SCIENCE
dc.description.doi	10.1109/ICDE.2019.00063
dc.description.sourcetitle	Proceedings - International Conference on Data Engineering
dc.description.volume	2019-April
dc.description.page	638-649
dc.published.state	Published
Appears in Collections:	Staff Publications Elements

Show simple item record

Files in This Item:

File	Description	Size	Format	Access Settings	Version
1907.00782v1.pdf		871.62 kB	Adobe PDF	OPEN	Post-print	View/Download

Google Scholar^TM

Check

Files in This Item:

Google ScholarTM

Altmetric

Google Scholar^TM