• DocumentCode
    1701540
  • Title

    A Multiplicative Weights Mechanism for Privacy-Preserving Data Analysis

  • Author

    Hardt, Moritz ; Rothblum, Guy N.

  • Author_Institution
    Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA
  • fYear
    2010
  • Firstpage
    61
  • Lastpage
    70
  • Abstract
    We consider statistical data analysis in the interactive setting. In this setting a trusted curator maintains a database of sensitive information about individual participants, and releases privacy-preserving answers to queries as they arrive. Our primary contribution is a new differentially private multiplicative weights mechanism for answering a large number of interactive counting (or linear) queries that arrive online and may be adaptively chosen. This is the first mechanism with worst-case accuracy guarantees that can answer large numbers of interactive queries and is efficient (in terms of the runtime´s dependence on the data universe size). The error is asymptotically optimal in its dependence on the number of participants, and depends only logarithmically on the number of queries being answered. The running time is nearly linear in the size of the data universe. As a further contribution, when we relax the utility requirement and require accuracy only for databases drawn from a rich class of databases, we obtain exponential improvements in running time. Even in this relaxed setting we continue to guarantee privacy for any input database. Only the utility requirement is relaxed. Specifically, we show that when the input database is drawn from a smooth distribution - a distribution that does not place too much weight on any single data item - accuracy remains as above, and the running time becomes poly-logarithmic in the data universe size. The main technical contributions are the application of multiplicative weights techniques to the differential privacy setting, a new privacy analysis for the interactive setting, and a technique for reducing data dimensionality for databases drawn from smooth distributions.
  • Keywords
    data analysis; data privacy; query processing; question answering (information retrieval); statistical analysis; data dimensionality reduction; data universe; differentially private multiplicative weights mechanism; multiplicative weights mechanism; privacy-preserving data analysis; statistical data analysis; Accuracy; Data privacy; Databases; Histograms; Noise; Noise measurement; Privacy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science (FOCS), 2010 51st Annual IEEE Symposium on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    0272-5428
  • Print_ISBN
    978-1-4244-8525-3
  • Type

    conf

  • DOI
    10.1109/FOCS.2010.85
  • Filename
    5670948