DocumentCode :
2492672
Title :
A variable-MDAV-based partitioning strategy to continuous multivariate microaggregation with genetic algorithms
Author :
Solanas, Agusti ; González-Nicolás, Úrsula ; Martínez-Ballesté, Antoni
Author_Institution :
Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
fYear :
2010
fDate :
18-23 July 2010
Firstpage :
1
Lastpage :
7
Abstract :
Microaggregation is a Statistical Disclosure Control (SDC) technique that aims at protecting the privacy of individual respondents before their data are released. Optimally microaggregating multivariate data sets is known to be an NP-hard problem. Thus, using heuristics has been suggested as a possible strategy to tackle it. Specifically, Genetic Algorithms have been shown to be serious candidates that can find good solutions on small data sets. However, due to the very nature of these algorithms and the coding of the microaggregation problem, GA can hardly cope with large data sets. In order to apply them to large data sets, the latter have to be previously partitioned into smaller disjoint subsets that the GA can handle. In this article we summarise several proposals for partitioning data sets, in order to use GA to microaggregate them. In addition, we suggest a new partitioning strategy based on the variable-MDAV algorithm, and we compare it with the most relevant previous proposals. The experimental results show that our method outperforms the previous ones in terms of information loss.
Keywords :
computational complexity; data handling; genetic algorithms; statistical analysis; NP-hard problem; continuous multivariate microaggregation; genetic algorithms; statistical disclosure control technique; variable MDAV based partitioning strategy; Acquired immune deficiency syndrome; Cities and towns;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location :
Barcelona
ISSN :
1098-7576
Print_ISBN :
978-1-4244-6916-1
Type :
conf
DOI :
10.1109/IJCNN.2010.5596660
Filename :
5596660
Link To Document :
بازگشت