DocumentCode
638637
Title
A quantifying method for trade-off between privacy and utility
Author
Gu Yonghao ; Wu Weiming
Author_Institution
Sch. of Comput. Sci., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2013
fDate
27-29 April 2013
Firstpage
270
Lastpage
273
Abstract
Many anonymization methods have been used in data publishing and data mining. In the meantime, they reduce the utility of the dataset. So it is important to consider the tradeoff between privacy and utility. Quantifying the trade-off between usefulness and privacy of dataset has been the subject of much research in recent years. In this paper, we provide the concepts of privacy loss and utility loss and also give a method to quantify them using divergence distance in probability theory. And then, we evaluate our methodology on the Adult dataset from the UCI machine learning repository. Our result shows the relationship between privacy and utility, and also provide data users how to choose the right trade-off between privacy and utility. Finally, we conclude and show the future research direction on how to select best divergence measurement.
Keywords
data mining; data privacy; learning (artificial intelligence); publishing; security of data; UCI machine learning repository; anonymization methods; data mining; data publishing; divergence distance; divergence measurement; privacy loss; probability theory; quantifying method; utility loss; utility reduction; Divergence; Entropy; Privacy Loss; Trade-off; Utility Loss;
fLanguage
English
Publisher
iet
Conference_Titel
Information and Communications Technologies (IETICT 2013), IET International Conference on
Conference_Location
Beijing
Electronic_ISBN
978-1-84919-653-6
Type
conf
DOI
10.1049/cp.2013.0062
Filename
6617505
Link To Document