DocumentCode
2884205
Title
Non-linear Dimensionality Reduction for Privacy-Preserving Data Classification
Author
AlOtaibi, Khaled ; Rayward-Smith, V.J. ; Wenjia Wang ; de la Iglesia, B.
Author_Institution
Sch. of Comput. Sci., Univ. of East Anglia, Norwich, UK
fYear
2012
fDate
3-5 Sept. 2012
Firstpage
694
Lastpage
701
Abstract
Many techniques have been proposed to protect the privacy of data outsourced for analysis by external parties. However, most of these techniques distort the underlying data properties, and therefore, hinder data mining algorithms from discovering patterns. The aim of Privacy-Preserving Data Mining (PPDM) is to generate a data-friendly transformation that maintains both the privacy and the utility of the data. We have proposed a novel privacy-preserving framework based on non-linear dimensionality reduction (i.e. non-metric multidimensional scaling) to perturb the original data. The perturbed data exhibited good utility in terms of distance-preservation between objects. This was tested on a clustering task with good results. In this paper, we test our novel PPDM approach on a classification task using a k-Nearest Neighbour (k-NN) classification algorithm. We compare the classification results obtained from both the original and the perturbed data and find them to be much same particularly for the few lower dimensions. We show that, for distance-based classification, our approach preserves the utility of the data while hiding the private details.
Keywords
data mining; data privacy; data reduction; pattern classification; PPDM approach; data mining algorithm; data properties; data utility; data-friendly transformation; distance preservation; distance-based classification; external parties; k-NN classification algorithm; k-nearest neighbour classification algorithm; nonlinear dimensionality reduction; nonmetric multidimensional scaling; outsourcing; privacy-preserving data classification; Additives; Algorithm design and analysis; Data privacy; Noise; Stress; Uncertainty; Data Perturbation; Multidimensional Scaling; Privacy-Preserving Data Mining; k-NN Classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Privacy, Security, Risk and Trust (PASSAT), 2012 International Conference on and 2012 International Confernece on Social Computing (SocialCom)
Conference_Location
Amsterdam
Print_ISBN
978-1-4673-5638-1
Type
conf
DOI
10.1109/SocialCom-PASSAT.2012.76
Filename
6406295
Link To Document