DocumentCode :
2213979
Title :
Combined data distortion strategies for privacy-preserving data mining
Author :
Peng, Bo ; Geng, Xingyu ; Zhang, Jun
Author_Institution :
Coll. of Comput. Sci., Southwest Pet. Univ., Chengdu, China
Volume :
1
fYear :
2010
fDate :
20-22 Aug. 2010
Abstract :
The problem of privacy-preserving data mining has become more and more important in recent years. Many successful and efficient techniques have been developed. However, in collaborative data analysis, part of the datasets may come from different data owners and may be processed using different data distortion methods. Thus, combinations of datasets processed using different methods are of practical interests. In this paper, a class of novel data distortion strategies is proposed. Four schemes via attribute partition, with different combinations of singular value decomposition (SVD), nonnegative matrix factorization (NMF), discrete wavelet transformation (DWT), are designed to perturb submatrix of the original datasets for privacy protection. We use some metrics to measure the performance of the proposed new strategies. Data utility is examined by using a binary classification based on the support vector machine. Our experimental results indicate that, in comparison with the individual data distortion techniques, the proposed schemes are very efficient in achieving a good trade-off between data privacy and data utility, and provide a feasible solution for collaborative data analysis.
Keywords :
data analysis; data mining; data privacy; discrete wavelet transforms; groupware; pattern classification; singular value decomposition; support vector machines; attribute partition; binary classification; collaborative data analysis; data distortion strategies; data utility; discrete wavelet transformation; nonnegative matrix factorization; privacy preserving data mining; singular value decomposition; support vector machine; Accuracy; Conferences; Discrete wavelet transforms; Matrix decomposition; Support vector machines; World Wide Web; data distortation; data mining; privacy preservation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
Conference_Location :
Chengdu
ISSN :
2154-7491
Print_ISBN :
978-1-4244-6539-2
Type :
conf
DOI :
10.1109/ICACTE.2010.5578952
Filename :
5578952
Link To Document :
بازگشت