DocumentCode :
1047901
Title :
A Tree-Based Data Perturbation Approach for Privacy-Preserving Data Mining
Author :
Li, Xiao-Bai ; Sarkar, Sumit
Author_Institution :
Coll. of Manage., Massachusetts Univ., Lowell, MA
Volume :
18
Issue :
9
fYear :
2006
Firstpage :
1278
Lastpage :
1283
Abstract :
Due to growing concerns about the privacy of personal information, organizations that use their customers´ records in data mining activities are forced to take actions to protect the privacy of the individuals. A frequently used disclosure protection method is data perturbation. When used for data mining, it is desirable that perturbation preserves statistical relationships between attributes, while providing adequate protection for individual confidential data. To achieve this goal, we propose a kd-tree based perturbation method, which recursively partitions a data set into smaller subsets such that data records within each subset are more homogeneous after each partition. The confidential data in each final subset are then perturbed using the subset average. An experimental study is conducted to show the effectiveness of the proposed method
Keywords :
data mining; data privacy; perturbation theory; statistical databases; tree data structures; disclosure protection method; kd-tree based data perturbation approach; privacy-preserving data mining; Additive noise; Classification tree analysis; Computer Society; Data mining; Data privacy; Decision trees; Perturbation methods; Protection; Statistics; Storage area networks; Privacy; data mining; data perturbation; kd-trees.; microaggregation;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2006.136
Filename :
1661517
Link To Document :
بازگشت