• DocumentCode
    2543898
  • Title

    Towards Privacy Preserving Mining over Distributed Cloud Databases

  • Author

    Ibrahim, Amin ; Hai Jin ; Yassin, Ali A. ; Deqing Zou

  • Author_Institution
    Cluster & Grid Comput. Lab., Huazhong Univ. of Sci. & Technol., Wuhan, China
  • fYear
    2012
  • fDate
    1-3 Nov. 2012
  • Firstpage
    130
  • Lastpage
    136
  • Abstract
    Due to great advances in computing and Internet technologies, organizations have been enabled to collect and generate a large amount of data. Most of these organizations tend to analyze their data to discover new patterns. Usually, analyzing such amount of data requires huge computational power and storage facilities that may not be available to these organizations. Cloud computing offers the best way to solve this problem. Storing the private data of different organizations in the same cloud server enhances the mining process, but at the same time, raises privacy concerns. Therefore, it is highly recommended to support privacy preserving data mining algorithms in the cloud environment. This paper introduces an efficient and accurate cryptography-based scheme for mining the cloud data in a secure way without loss of accuracy. Specifically, we address the problem of K-nearest neighbor (KNN) classification over horizontally distributed databases without revealing any unnecessary information. We have utilized the recently developed cryptography primitive, order preserving symmetric encryption (OPSE), to integrate securely the local classifications at a lower cost than the previously presented privacy preserving data mining schemes. Empirical results on real datasets demonstrate that the proposed scheme has similar performance with the naive mining systems in terms of classification accuracy.
  • Keywords
    cloud computing; cryptography; data mining; data privacy; distributed databases; pattern classification; Internet technology; K-nearest neighbor classification; KNN classification; OPSE cryptography primitive; classification accuracy; cloud computing; cloud server; computing technology; cryptography-based scheme; data collection; distributed cloud database; order preserving symmetric encryption; privacy preserving mining; private data storage; Accuracy; Cryptography; Data privacy; Distributed databases; Training; Vectors; KNN; cloud computing; data mining; encryption; privacy preserving; security;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud and Green Computing (CGC), 2012 Second International Conference on
  • Conference_Location
    Xiangtan
  • Print_ISBN
    978-1-4673-3027-5
  • Type

    conf

  • DOI
    10.1109/CGC.2012.86
  • Filename
    6382808