• DocumentCode
    1565268
  • Title

    Private representative-based clustering for vertically partitioned data

  • Author

    Estivill-Castro, Vladimir

  • Author_Institution
    Inst. for Intelligent & Integrated Syst., Griffith Univ., Brisbane, Qld., Australia
  • fYear
    2004
  • Firstpage
    160
  • Lastpage
    167
  • Abstract
    This work studies how to construct a representative-based clustering algorithm under the scenario that the dataset is partitioned into at least two sections. One section of the data is owned by Alice while the other is owned by Bob. Both want to compute clusters from the union of the data but do not trust each other. Thus, they do not want the other party to learn anything about their share of the data except what can be inferred from the results. We present a protocol that allows Alice and Bob to carry this task under the k-medoids algorithm. Clustering with medoids (medians or other loss functions) is a more robust alternative that clustering with k-MEANS (the only method for which a privacy preserving protocol is known, but a methods that is statistically biased and statistically inconsistent with very low robustness to noise). Our approach highlights the necessary building blocks for extending our protocol to the family of representative-based clustering algorithms.
  • Keywords
    data mining; data privacy; pattern clustering; protocols; k-MEANS; k-medoids algorithm; loss functions; medians function; privacy preserving protocol; private representative-based clustering; vertically partitioned data; Clustering algorithms; Data analysis; Data mining; Data privacy; Delta modulation; Information analysis; Noise robustness; Perturbation methods; Protocols; Terrorism;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science, 2004. ENC 2004. Proceedings of the Fifth Mexican International Conference in
  • Print_ISBN
    0-7695-2160-6
  • Type

    conf

  • DOI
    10.1109/ENC.2004.1342601
  • Filename
    1342601