DocumentCode :
2543898
Title :
Towards Privacy Preserving Mining over Distributed Cloud Databases
Author :
Ibrahim, Amin ; Hai Jin ; Yassin, Ali A. ; Deqing Zou
Author_Institution :
Cluster & Grid Comput. Lab., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear :
2012
fDate :
1-3 Nov. 2012
Firstpage :
130
Lastpage :
136
Abstract :
Due to great advances in computing and Internet technologies, organizations have been enabled to collect and generate a large amount of data. Most of these organizations tend to analyze their data to discover new patterns. Usually, analyzing such amount of data requires huge computational power and storage facilities that may not be available to these organizations. Cloud computing offers the best way to solve this problem. Storing the private data of different organizations in the same cloud server enhances the mining process, but at the same time, raises privacy concerns. Therefore, it is highly recommended to support privacy preserving data mining algorithms in the cloud environment. This paper introduces an efficient and accurate cryptography-based scheme for mining the cloud data in a secure way without loss of accuracy. Specifically, we address the problem of K-nearest neighbor (KNN) classification over horizontally distributed databases without revealing any unnecessary information. We have utilized the recently developed cryptography primitive, order preserving symmetric encryption (OPSE), to integrate securely the local classifications at a lower cost than the previously presented privacy preserving data mining schemes. Empirical results on real datasets demonstrate that the proposed scheme has similar performance with the naive mining systems in terms of classification accuracy.
Keywords :
cloud computing; cryptography; data mining; data privacy; distributed databases; pattern classification; Internet technology; K-nearest neighbor classification; KNN classification; OPSE cryptography primitive; classification accuracy; cloud computing; cloud server; computing technology; cryptography-based scheme; data collection; distributed cloud database; order preserving symmetric encryption; privacy preserving mining; private data storage; Accuracy; Cryptography; Data privacy; Distributed databases; Training; Vectors; KNN; cloud computing; data mining; encryption; privacy preserving; security;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud and Green Computing (CGC), 2012 Second International Conference on
Conference_Location :
Xiangtan
Print_ISBN :
978-1-4673-3027-5
Type :
conf
DOI :
10.1109/CGC.2012.86
Filename :
6382808
Link To Document :
بازگشت