DocumentCode :
2081887
Title :
On optimal anonymization for l+-diversity
Author :
Liu, Junqiang ; Wang, Ke
Author_Institution :
Simon Fraser Univ., Burnaby, BC, Canada
fYear :
2010
fDate :
1-6 March 2010
Firstpage :
213
Lastpage :
224
Abstract :
Publishing person specific data while protecting privacy is an important problem. Existing algorithms that enforce the privacy principle called l-diversity are heuristic based due to the NP-hardness. Several questions remain open: can we get a significant gain in the data utility from an optimal solution compared to heuristic ones; can we improve the utility by setting a distinct privacy threshold per sensitive value; is it practical to find an optimal solution efficiently for real world datasets. This paper addresses these questions. Specifically, we present a pruning based algorithm for finding an optimal solution to an extended form of the l-diversity problem. The novelty lies in several strong techniques: a novel structure for enumerating all solutions, methods for estimating cost lower bounds, strategies for dynamically arranging the enumeration order and updating lower bounds. This approach can be instantiated with any reasonable cost metric. Experiments on real world datasets show that our algorithm is efficient and improves the data utility.
Keywords :
computational complexity; data privacy; optimisation; NP-hardness; cost lower bounds; l+-diversity; optimal anonymization; optimal solution; protecting privacy; publishing person specific data; Cost function; Data privacy; Indexing; Joining processes; Protection; Publishing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-5445-7
Electronic_ISBN :
978-1-4244-5444-0
Type :
conf
DOI :
10.1109/ICDE.2010.5447898
Filename :
5447898
Link To Document :
بازگشت