Title :
Obfuscatory obscanturism: Making workload traces of commercially-sensitive systems safe to release
Author :
Reiss, Charles ; Wilkes, John ; Hellerstein, Joseph L.
Author_Institution :
Univ. of California, Berkeley, Berkeley, CA, USA
Abstract :
Cloud providers such as Google are interested in fostering research on the daunting technical challenges they face in supporting planetary-scale distributed systems, but no academic organizations have similar scale systems on which to experiment. Fortunately, good research can still be done using traces of real-life production workloads, but there are risks in releasing such data, including inadvertently disclosing confidential or proprietary information, as happened with the Netflix Prize data. This paper discusses these risks, and our approach to them, which we call systematic obfuscation. It protects proprietary and personal data while leaving it possible to answer interesting research questions. We explain and motivate some of the risks and concerns and propose how they can best be mitigated, using as an example our recent publication of a month-long trace of a production system workload on a 11k-machine cluster.
Keywords :
cloud computing; data privacy; security of data; 11k-machine cluster; Google; Netflix Prize data; cloud providers; commercially-sensitive systems; confidential information; obfuscatory obscanturism; personal data protection; planetary-scale distributed systems; proprietary data protection; proprietary information; real-life production workload traces; systematic obfuscation; Companies; Google; Hardware; IP networks; Production; Software; Timing;
Conference_Titel :
Network Operations and Management Symposium (NOMS), 2012 IEEE
Conference_Location :
Maui, HI
Print_ISBN :
978-1-4673-0267-8
Electronic_ISBN :
1542-1201
DOI :
10.1109/NOMS.2012.6212064