• DocumentCode
    3121345
  • Title

    Deriving Private Information from Association Rule Mining Results

  • Author

    Zhu, Zutao ; Wang, Guan ; Du, Wenliang

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Syracuse Univ., Syracuse, NY
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    18
  • Lastpage
    29
  • Abstract
    Data publishing can provide enormous benefits to the society. However, due to privacy concerns, data cannot be published in their original forms. Two types of data publishing can address the privacy issue: one is to publish the sanitized version of the original data, and the other is to publish the aggregate information from the original data, such as data mining results. There have been extensive studies to understand the privacy consequence in the first approach, but there is not much investigation on the privacy consequence of publishing data mining results, although, it is well believed that publishing data mining results can lead to the disclosure of private information. We propose a systematic method to study the privacy consequence of data mining results. Based on a well-established theory, the principle of maximum entropy, we have developed a method to precisely quantify the privacy risk when data mining results are published. We take the association rule mining as an example in this paper, and demonstrate how we quantify the privacy risk based on the published association rules. We have conducted experiments to evaluate the effectiveness and performance of our method. We have drawn several interesting observations from our experiments.
  • Keywords
    data mining; data privacy; maximum entropy methods; association rule mining; data mining; data publishing; maximum entropy; privacy consequence; privacy issue; private information; Aggregates; Association rules; Data analysis; Data engineering; Data mining; Data privacy; Itemsets; Joining processes; Protection; Publishing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type

    conf

  • DOI
    10.1109/ICDE.2009.97
  • Filename
    4812388