DocumentCode
1701540
Title
A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis
Author
Hardt, Moritz ; Rothblum, Guy N.
Author_Institution
Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA
fYear
2010
Firstpage
61
Lastpage
70
Abstract
We consider statistical data analysis in the interactive setting. In this setting a trusted curator maintains a database of sensitive information about individual participants, and releases privacy-preserving answers to queries as they arrive. Our primary contribution is a new differentially private multiplicative weights mechanism for answering a large number of interactive counting (or linear) queries that arrive online and may be adaptively chosen. This is the first mechanism with worst-case accuracy guarantees that can answer large numbers of interactive queries and is efficient (in terms of the runtime´s dependence on the data universe size). The error is asymptotically optimal in its dependence on the number of participants, and depends only logarithmically on the number of queries being answered. The running time is nearly linear in the size of the data universe. As a further contribution, when we relax the utility requirement and require accuracy only for databases drawn from a rich class of databases, we obtain exponential improvements in running time. Even in this relaxed setting we continue to guarantee privacy for any input database. Only the utility requirement is relaxed. Specifically, we show that when the input database is drawn from a smooth distribution - a distribution that does not place too much weight on any single data item - accuracy remains as above, and the running time becomes poly-logarithmic in the data universe size. The main technical contributions are the application of multiplicative weights techniques to the differential privacy setting, a new privacy analysis for the interactive setting, and a technique for reducing data dimensionality for databases drawn from smooth distributions.
Keywords
data analysis; data privacy; query processing; question answering (information retrieval); statistical analysis; data dimensionality reduction; data universe; differentially private multiplicative weights mechanism; multiplicative weights mechanism; privacy-preserving data analysis; statistical data analysis; Accuracy; Data privacy; Databases; Histograms; Noise; Noise measurement; Privacy;
fLanguage
English
Publisher
ieee
Conference_Titel
Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on
Conference_Location
Las Vegas, NV
ISSN
0272-5428
Print_ISBN
978-1-4244-8525-3
Type
conf
DOI
10.1109/FOCS.2010.85
Filename
5670948
Link To Document