DocumentCode
3318017
Title
Effects of network trace sampling methods on privacy and utility metrics
Author
Fazio, Phil ; Tan, Keren ; Kotz, David
Author_Institution
Dartmouth Coll., Hanover, NH, USA
fYear
2012
fDate
3-7 Jan. 2012
Firstpage
1
Lastpage
8
Abstract
Researchers choosing to share wireless-network traces with colleagues must first anonymize sensitive information, trading off the removal of information in the interest of identity protection and the preservation of useful data within the trace. While several metrics exist to quantify this privacy-utility tradeoff, they are often computationally expensive. Computing these metrics using a sample of the trace could potentially save precious time. In this paper, we examine several sampling methods to discover their effects on measurement of the privacy-utility tradeoff when anonymizing network traces. We tested the relative accuracy of several packet and flow-sampling methods on existing privacy and utility metrics. We concluded that, for our test trace, no single sampling method we examined allowed us to accurately measure the tradeoff, and that some sampling methods can produce grossly inaccurate estimates of those values. We call for further research to develop sampling methods that maintain relevant privacy and utility properties.
Keywords
data privacy; radio networks; sampling methods; data preservation; flow-sampling method; identity protection; network trace sampling method; privacy-utility tradeoff measurement; wireless-network; Accuracy; Entropy; IP networks; Privacy; Sampling methods; Size measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Systems and Networks (COMSNETS), 2012 Fourth International Conference on
Conference_Location
Bangalore
Print_ISBN
978-1-4673-0296-8
Electronic_ISBN
978-1-4673-0297-5
Type
conf
DOI
10.1109/COMSNETS.2012.6151387
Filename
6151387
Link To Document