DocumentCode
3086903
Title
Systematic Clustering-Based Microaggregation for Statistical Disclosure Control
Author
Kabir, Md Enamul ; Wang, Hua
Author_Institution
Dept. of Math. & Comput., Univ. of Southern Queensland, Toowoomba, QLD, Australia
fYear
2010
fDate
1-3 Sept. 2010
Firstpage
435
Lastpage
441
Abstract
Microdata protection in statistical databases has recently become a major societal concern. Micro aggregation for Statistical Disclosure Control (SDC) is a family of methods to protect microdata from individual identification. Micro aggregation works by partitioning the microdata into groups of at least k records and then replacing the records in each group with the centroid of the group. This paper presents a clustering-based micro aggregation method to minimize the information loss. The proposed technique adopts to group similar records together in a systematic way and then anonymized with the centroid of each group individually. The structure of systematic clustering problem is defined and investigated and an algorithm of the proposed problem is developed. Experimental results show that our method attains a reasonable dominance with respect to both information loss and execution time than the most popular heuristic algorithm called Maximum Distance to Average Vector (MDAV).
Keywords
data privacy; pattern clustering; security of data; statistical databases; maximum distance to average vector; microdata protection; statistical databases; statistical disclosure control; systematic clustering-based microaggregation; Clustering algorithms; Data privacy; Equations; Loss measurement; Privacy; Sorting; Systematics; Disclosure control; Microaggregation; Microdata protection; Privacy; k-anonymity;
fLanguage
English
Publisher
ieee
Conference_Titel
Network and System Security (NSS), 2010 4th International Conference on
Conference_Location
Melbourne, VIC
Print_ISBN
978-1-4244-8484-3
Electronic_ISBN
978-0-7695-4159-4
Type
conf
DOI
10.1109/NSS.2010.66
Filename
5635822
Link To Document