• DocumentCode
    3681070
  • Title

    Solving the Top-K problem for sequence counting using differential privacy

  • Author

    Sergiu Costea;Nicolae Ţăpuş

  • Author_Institution
    Computer Science and Engineering Department, University Politehnica of Bucharest
  • fYear
    2015
  • Firstpage
    208
  • Lastpage
    213
  • Abstract
    Online databases currently store large quantities of sensitive information. This information includes sequences, for example trajectories or network packet flows. While interesting conclusions might be drawn from analyzing it, the process is not trivial, as the results risk compromising the privacy of participating users. Differential privacy proposes a solution for the safe analysis of such data, using statistical means to perform anonymization. We look at two differential privacy algorithms for sequence counting and see how they perform when adapted to solve the top-K problem, which selects the greatest K values from a set. We analyze the performance of both algorithms in a variety of scenarios, and compute the precision and errors of the results. We conclude with a series of recommendations on which algorithm is best suited to solving the top-K problem depending on user goals.
  • Keywords
    Decision support systems
  • Publisher
    ieee
  • Conference_Titel
    RoEduNet International Conference - Networking in Education and Research (RoEduNet NER), 2015 14th
  • ISSN
    2068-1038
  • Print_ISBN
    978-1-4673-8179-6
  • Electronic_ISBN
    2247-5443
  • Type

    conf

  • DOI
    10.1109/RoEduNet.2015.7311996
  • Filename
    7311996