DocumentCode :
3290980
Title :
k-Anonymity via Clustering Domain Knowledge for Privacy Preservation
Author :
Li, Taiyong ; Tang, Changjie ; Wu, Jiang ; Luo, Qian ; Li, Shengzhi ; Lin, Xun ; Zuo, Jie
Author_Institution :
Sch. of Comput. Sci., Sichuan Univ., Chengdu
Volume :
4
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
697
Lastpage :
701
Abstract :
Preservation of privacy in micro-data release is a challenging task in data mining. The k-anonymity method has attracted much attention of researchers. Quasi-identifier is a key concept in k-anonymity. The tuples whose quasi-identifiers have near effect on the sensitive attributes should be grouped to reduce information loss. The previous investigations ignored this point. This paper studies k-anonymity via clustering domain knowledge. The contributions include: (a) Constructing a weighted matrix based on domain knowledge and proposing measure methods. It carefully considers the effect between the quasi-identifiers and the sensitive attributes. (b) Developing a heuristic algorithm to achieve k-anonymity via clustering domain knowledge based on the measure methods. (c) Implementing the algorithm for privacy preservation, and (d) Experiments on real data demonstrate that the proposed k-anonymous methods decrease 30% information loss compared with basic k-anonymity.
Keywords :
data mining; data privacy; pattern clustering; clustering domain knowledge; data mining; heuristic algorithm; k-anonymity method; microdata release; privacy preservation; quasi-identifiers; weighted matrix; Clustering algorithms; Computer science; Data engineering; Data mining; Data privacy; Finance; Fuzzy systems; Heuristic algorithms; Joining processes; Knowledge engineering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Jinan Shandong
Print_ISBN :
978-0-7695-3305-6
Type :
conf
DOI :
10.1109/FSKD.2008.428
Filename :
4666473
Link To Document :
بازگشت