Title :
A variable-MDAV-based partitioning strategy to continuous multivariate microaggregation with genetic algorithms
Author :
Solanas, Agusti ; González-Nicolás, Úrsula ; Martínez-Ballesté, Antoni
Author_Institution :
Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
Abstract :
Microaggregation is a Statistical Disclosure Control (SDC) technique that aims at protecting the privacy of individual respondents before their data are released. Optimally microaggregating multivariate data sets is known to be an NP-hard problem. Thus, using heuristics has been suggested as a possible strategy to tackle it. Specifically, Genetic Algorithms have been shown to be serious candidates that can find good solutions on small data sets. However, due to the very nature of these algorithms and the coding of the microaggregation problem, GA can hardly cope with large data sets. In order to apply them to large data sets, the latter have to be previously partitioned into smaller disjoint subsets that the GA can handle. In this article we summarise several proposals for partitioning data sets, in order to use GA to microaggregate them. In addition, we suggest a new partitioning strategy based on the variable-MDAV algorithm, and we compare it with the most relevant previous proposals. The experimental results show that our method outperforms the previous ones in terms of information loss.
Keywords :
computational complexity; data handling; genetic algorithms; statistical analysis; NP-hard problem; continuous multivariate microaggregation; genetic algorithms; statistical disclosure control technique; variable MDAV based partitioning strategy; Acquired immune deficiency syndrome; Cities and towns;
Conference_Titel :
Neural Networks (IJCNN), The 2010 International Joint Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-6916-1
DOI :
10.1109/IJCNN.2010.5596660