• DocumentCode
    2492672
  • Title

    A variable-MDAV-based partitioning strategy to continuous multivariate microaggregation with genetic algorithms

  • Author

    Solanas, Agusti ; González-Nicolás, Úrsula ; Martínez-Ballesté, Antoni

  • Author_Institution
    Dept. of Comput. Eng. & Math., Univ. Rovira i Virgili, Tarragona, Spain
  • fYear
    2010
  • fDate
    18-23 July 2010
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Microaggregation is a Statistical Disclosure Control (SDC) technique that aims at protecting the privacy of individual respondents before their data are released. Optimally microaggregating multivariate data sets is known to be an NP-hard problem. Thus, using heuristics has been suggested as a possible strategy to tackle it. Specifically, Genetic Algorithms have been shown to be serious candidates that can find good solutions on small data sets. However, due to the very nature of these algorithms and the coding of the microaggregation problem, GA can hardly cope with large data sets. In order to apply them to large data sets, the latter have to be previously partitioned into smaller disjoint subsets that the GA can handle. In this article we summarise several proposals for partitioning data sets, in order to use GA to microaggregate them. In addition, we suggest a new partitioning strategy based on the variable-MDAV algorithm, and we compare it with the most relevant previous proposals. The experimental results show that our method outperforms the previous ones in terms of information loss.
  • Keywords
    computational complexity; data handling; genetic algorithms; statistical analysis; NP-hard problem; continuous multivariate microaggregation; genetic algorithms; statistical disclosure control technique; variable MDAV based partitioning strategy; Acquired immune deficiency syndrome; Cities and towns;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2010 International Joint Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-6916-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2010.5596660
  • Filename
    5596660