• DocumentCode
    2649674
  • Title

    Distributed Model Based Sampling Technique for Privacy Preserving Clustering

  • Author

    Xiao-dan, WU ; Dian-min, YUE ; Feng-li, LIU ; Chao-hsien, CHU

  • Author_Institution
    Hebei Univ. of Technol., Wuhan
  • fYear
    2007
  • fDate
    20-22 Aug. 2007
  • Firstpage
    192
  • Lastpage
    197
  • Abstract
    The sharing of data has been proven beneficial in data mining applications. However, privacy regulations and other privacy concerns may prevent data owners from sharing information for data analysis. To resolve this challenging problem, data owners must design a solution that meets privacy requirements and guarantees valid data clustering results. To achieve this dual goal, we introduce a new method for privacy-preserving clustering based on the probability distributed model for clustering. This paper proposes square wave-based clustering model, gauss distribution-based clustering model and the multivariate normal distribution-based clustering model based on the information provided by the K-means clustering result separately. And we mainly show the correctness and feasibility of the sampled data for clustering by experiment. This new preserving technique can be used not only for distributed clustering, but also for simultaneous clustering rule and data hiding.
  • Keywords
    Gaussian distribution; data analysis; data encapsulation; data mining; data privacy; normal distribution; pattern clustering; sampling methods; K-means clustering; data analysis; data clustering; data hiding; data mining; distributed model based sampling technique; gauss distribution-based clustering model; multivariate normal distribution-based; privacy preserving clustering; privacy requirements; probability distributed model; square wave-based clustering model; Clustering algorithms; Conference management; Data analysis; Data mining; Data privacy; Engineering management; Partitioning algorithms; Protection; Sampling methods; Technology management; density-based clustering; distribution model; privacy preserving clustering; sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Management Science and Engineering, 2007. ICMSE 2007. International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-7-88358-080-5
  • Electronic_ISBN
    978-7-88358-080-5
  • Type

    conf

  • DOI
    10.1109/ICMSE.2007.4421846
  • Filename
    4421846