• DocumentCode
    3323640
  • Title

    Butterfly: Protecting Output Privacy in Stream Mining

  • Author

    Wang, Ting ; Liu, Ling

  • Author_Institution
    Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1170
  • Lastpage
    1179
  • Abstract
    Privacy preservation in data mining demands protecting both input and output privacy. The former refers to sanitizing the raw data itself before performing mining. The latter refers to preventing the mining output (model/pattern) from malicious pattern-based inference attacks. The preservation of input privacy does not necessarily lead to that of output privacy. This work studies the problem of protecting output privacy in the context of frequent pattern mining over data streams. After exposing the privacy breaches existing in current stream mining systems, we propose Butterfly, a light-weighted countermeasure that can effectively eliminate these breaches without explicitly detecting them, meanwhile minimizing the loss of the output accuracy. We further optimize the basic scheme by taking account of two types of semantic constraints, aiming at maximally preserving utility-related semantics while maintaining the hard privacy and accuracy guarantee. We conduct extensive experiments over real- life datasets to show the effectiveness and efficiency of our approach.
  • Keywords
    data mining; data privacy; Butterfly light-weighted countermeasure; data stream mining; frequent pattern mining; malicious pattern-based inference attacks; output privacy protection; privacy preservation; raw data sanitization; Costs; Data mining; Data privacy; Databases; Diseases; Drives; Educational institutions; History; Hospitals; Protection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497526
  • Filename
    4497526