DocumentCode :
3122032
Title :
Using Anonymized Data for Classification
Author :
Inan, Ali ; Kantarcioglu, Murat ; Bertino, Elisa
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
429
Lastpage :
440
Abstract :
In recent years, anonymization methods have emerged as an important tool to preserve individual privacy when releasing privacy sensitive data sets. This interest in anonymization techniques has resulted in a plethora of methods for anonymizing data under different privacy and utility assumptions. At the same time, there has been little research addressing how to effectively use the anonymized data for data mining in general and for distributed data mining in particular. In this paper, we propose a new approach for building classifiers using anonymized data by modeling anonymized data as uncertain data. In our method, we do not assume any probability distribution over the data. Instead, we propose collecting all necessary statistics during anonymization and releasing these together with the anonymized data. We show that releasing such statistics does not violate anonymity. Experiments spanning various alternatives both in local and distributed data mining settings reveal that our method performs better than heuristic approaches for handling anonymized data.
Keywords :
data handling; data mining; data privacy; pattern classification; probability; anonymization methods; anonymized data handling; data mining; privacy sensitive data sets; probability distribution; Classification algorithms; Computer science; Data engineering; Data mining; Data privacy; Drugs; Euclidean distance; Probability distribution; Statistical distributions; USA Councils;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.19
Filename :
4812423
Link To Document :
بازگشت