• DocumentCode
    751134
  • Title

    Random projection-based multiplicative data perturbation for privacy preserving distributed data mining

  • Author

    Liu, Kun ; Kargupta, Hillol ; Ryan, Jessica

  • Author_Institution
    Dept. of Comput. Sci. & Electr. Eng., Maryland Univ., Baltimore, MD, USA
  • Volume
    18
  • Issue
    1
  • fYear
    2006
  • Firstpage
    92
  • Lastpage
    106
  • Abstract
    This paper explores the possibility of using multiplicative random projection matrices for privacy preserving distributed data mining. It specifically considers the problem of computing statistical aggregates like the inner product matrix, correlation coefficient matrix, and Euclidean distance matrix from distributed privacy sensitive data possibly owned by multiple parties. This class of problems is directly related to many other data-mining problems such as clustering, principal component analysis, and classification. This paper makes primary contributions on two different grounds. First, it explores independent component analysis as a possible tool for breaching privacy in deterministic multiplicative perturbation-based models such as random orthogonal transformation and random rotation. Then, it proposes an approximate random projection-based technique to improve the level of privacy protection while still preserving certain statistical characteristics of the data. The paper presents extensive theoretical analysis and experimental results. Experiments demonstrate that the proposed technique is effective and can be successfully used for different types of privacy-preserving data mining applications.
  • Keywords
    data mining; data privacy; distributed databases; independent component analysis; independent component analysis; multiplicative data perturbation; privacy preserving distributed data mining; random projection; statistical computing; Aggregates; Data mining; Data privacy; Distributed computing; Euclidean distance; Independent component analysis; Medical services; Principal component analysis; Protection; Transaction databases; Index Terms- Random projection; multiplicative data perturbation; privacy preserving data mining.;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2006.14
  • Filename
    1549830