• DocumentCode
    2080772
  • Title

    Correlation hiding by independence masking

  • Author

    Tao, Yufei ; Pei, Jian ; Li, Jiexing ; Xiao, Xiaokui ; Yi, Ke ; Xing, Zhengzheng

  • Author_Institution
    Chinese Univ. of Hong Kong, Hong Kong, China
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    964
  • Lastpage
    967
  • Abstract
    Extracting useful correlation from a dataset has been extensively studied. In this paper, we deal with the opposite, namely, a problem we call correlation hiding (CH), which is fundamental in numerous applications that need to disseminate data containing sensitive information. In this problem, we are given a relational table T whose attributes can be classified into three disjoint sets A, B, and C. The objective is to distort some values in T so that A becomes independent from B, and yet, their correlation with C is preserved as much as possible. CH is different from all the problems studied previously in the area of data privacy, in that CH demands complete elimination of the correlation between two sets of attributes, whereas the previous research focuses on partial elimination up to a certain level. A new operator called independence masking is proposed to solve the CH problem. Implementations of the operator with good worst case guarantees are described in the full version of this short note.
  • Keywords
    approximation theory; computational complexity; data encapsulation; data mining; minimisation; correlation hiding; data dissemination; data privacy; independence masking; relational table; Approximation algorithms; Ash; Association rules; Data mining; Data privacy; Databases; Government;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447849
  • Filename
    5447849