DocumentCode
3095053
Title
Discovery of De-identification Policies Considering Re-identification Risks and Information Loss
Author
He-Ming Ruan ; Ming-Hwa Tsai ; Yen-Nun Huang ; Yen-Hua Liao ; Chin-Laung Lei
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2015
fDate
24-26 May 2015
Firstpage
69
Lastpage
76
Abstract
In data analysis, it is always a tough task to strike the balance between the privacy and the applicability of the data. Due to the demand for individual privacy, the data are being more or less obscured before being released or outsourced to avoid possible privacy leakage. This process is so called de-identification. To discuss a de-identification policy, the most important two aspects should be the re-identification risk and the information loss. In this paper, we introduce a novel policy searching method to efficiently find out proper de-identification policies according to acceptable re-identification risk while retaining the information resided in the data. With the UCI Machine Learning Repository as our real world dataset, the re-identification risk can therefore be able to reflect the true risk of the de-identified data under the de-identification policies. Moreover, using the proposed algorithm, one can then efficiently acquire policies with higher information entropy.
Keywords
data analysis; data privacy; entropy; learning (artificial intelligence); risk analysis; UCI machine learning repository; data analysis; deidentification policies; deidentified data; information entropy; information loss; privacy leakage; reidentification risks; Computational modeling; Data analysis; Data privacy; Lattices; Privacy; Synthetic aperture sonar; Upper bound; De-identification; HIPPA; Safe Harbor; data privacy;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Security (AsiaJCIS), 2015 10th Asia Joint Conference on
Conference_Location
Kaohsiung
Type
conf
DOI
10.1109/AsiaJCIS.2015.23
Filename
7153938
Link To Document