DocumentCode :
33759
Title :
A Random Decision Tree Framework for Privacy-Preserving Data Mining
Author :
Vaidya, Jaideep ; Shafiq, Basit ; Wei Fan ; Mehmood, Danish ; Lorenzi, David
Author_Institution :
MSIS Dept., Rutgers Univ., Newark, NJ, USA
Volume :
11
Issue :
5
fYear :
2014
fDate :
Sept.-Oct. 2014
Firstpage :
399
Lastpage :
411
Abstract :
Distributed data is ubiquitous in modern information driven applications. With multiple sources of data, the natural challenge is to determine how to collaborate effectively across proprietary organizational boundaries while maximizing the utility of collected information. Since using only local data gives suboptimal utility, techniques for privacy-preserving collaborative knowledge discovery must be developed. Existing cryptography-based work for privacy-preserving data mining is still too slow to be effective for large scale data sets to face today´s big data challenge. Previous work on random decision trees (RDT) shows that it is possible to generate equivalent and accurate models with much smaller cost. We exploit the fact that RDTs can naturally fit into a parallel and fully distributed architecture, and develop protocols to implement privacy-preserving RDTs that enable general and efficient distributed privacy-preserving knowledge discovery.
Keywords :
data mining; data privacy; decision trees; big data challenge; cryptography-based work; distributed data; distributed privacy-preserving knowledge discovery; organizational boundaries; privacy-preserving RDT; privacy-preserving collaborative knowledge discovery; privacy-preserving data mining; random decision tree framework; suboptimal utility; Data mining; Decision trees; Encryption; Protocols; Vectors; Vegetation; Privacy-preserving data mining; classification;
fLanguage :
English
Journal_Title :
Dependable and Secure Computing, IEEE Transactions on
Publisher :
ieee
ISSN :
1545-5971
Type :
jour
DOI :
10.1109/TDSC.2013.43
Filename :
6616536
Link To Document :
بازگشت