DocumentCode
2543898
Title
Towards Privacy Preserving Mining over Distributed Cloud Databases
Author
Ibrahim, Amin ; Hai Jin ; Yassin, Ali A. ; Deqing Zou
Author_Institution
Cluster & Grid Comput. Lab., Huazhong Univ. of Sci. & Technol., Wuhan, China
fYear
2012
fDate
1-3 Nov. 2012
Firstpage
130
Lastpage
136
Abstract
Due to great advances in computing and Internet technologies, organizations have been enabled to collect and generate a large amount of data. Most of these organizations tend to analyze their data to discover new patterns. Usually, analyzing such amount of data requires huge computational power and storage facilities that may not be available to these organizations. Cloud computing offers the best way to solve this problem. Storing the private data of different organizations in the same cloud server enhances the mining process, but at the same time, raises privacy concerns. Therefore, it is highly recommended to support privacy preserving data mining algorithms in the cloud environment. This paper introduces an efficient and accurate cryptography-based scheme for mining the cloud data in a secure way without loss of accuracy. Specifically, we address the problem of K-nearest neighbor (KNN) classification over horizontally distributed databases without revealing any unnecessary information. We have utilized the recently developed cryptography primitive, order preserving symmetric encryption (OPSE), to integrate securely the local classifications at a lower cost than the previously presented privacy preserving data mining schemes. Empirical results on real datasets demonstrate that the proposed scheme has similar performance with the naive mining systems in terms of classification accuracy.
Keywords
cloud computing; cryptography; data mining; data privacy; distributed databases; pattern classification; Internet technology; K-nearest neighbor classification; KNN classification; OPSE cryptography primitive; classification accuracy; cloud computing; cloud server; computing technology; cryptography-based scheme; data collection; distributed cloud database; order preserving symmetric encryption; privacy preserving mining; private data storage; Accuracy; Cryptography; Data privacy; Distributed databases; Training; Vectors; KNN; cloud computing; data mining; encryption; privacy preserving; security;
fLanguage
English
Publisher
ieee
Conference_Titel
Cloud and Green Computing (CGC), 2012 Second International Conference on
Conference_Location
Xiangtan
Print_ISBN
978-1-4673-3027-5
Type
conf
DOI
10.1109/CGC.2012.86
Filename
6382808
Link To Document