DocumentCode :
1985704
Title :
Privacy Preserving Data Publishing for Recommender System
Author :
Chen, Xiaoqiang ; Huang, Vincent
Author_Institution :
Dept. of Inf. Technol., Uppsala Univ., Uppsala, Sweden
fYear :
2012
fDate :
16-20 July 2012
Firstpage :
128
Lastpage :
133
Abstract :
Driven by mutual benefits, exchange and publication of data among various parties is an inevitable trend. However, released data often contains sensitive user information thus direct publication violates individual privacy. Among many privacy models, k-anonymity framework is popular and well-studied, it protects information by constructing groups of anonymous records such that each record in the table released is covered by no fewer than k-1 other records. In this paper, we first investigate different privacy preserving technologies and then focus on achieving k-anonymity for large scale and sparse databases, especially recommender systems. We present a general process for anonymization of large scale database. A preprocessing phase strategically extracts preference matrix from original data by Singular Value Decomposition (SVD) and eliminates the high dimensionality and sparsity problem. We developed a new clustering based k-anonymity heuristic named Bisecting K-Gather (BKG) and it is proven to be efficient and accurate. To support customized user privacy assignments, we also proposed a new concept called customized k-anonymity along with a corresponding algorithm (BOKG). We use MovieLens database to assess our algorithms. The results show that we can efficiently release anonymized data without compromising the utility of data.
Keywords :
data privacy; database management systems; electronic data interchange; matrix algebra; pattern clustering; publishing; recommender systems; singular value decomposition; BKG; MovieLens database; SVD; anonymous records; bisecting k-gather; clustering based k-anonymity heuristic; customized user privacy assignments; data exchange; individual privacy; k-anonymity framework; large scale database; preference matrix; privacy preserving data publishing; recommender system; sensitive user information; singular value decomposition; sparse databases; Clustering algorithms; Data privacy; Databases; Feature extraction; Motion pictures; Privacy; Publishing; Bisecting K-Gather; Bisecting One-K-Gather; Customized K-Anonymity; K-Anonymity; Privacy Preserving Data Publishing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Software and Applications Conference Workshops (COMPSACW), 2012 IEEE 36th Annual
Conference_Location :
Izmir
Print_ISBN :
978-1-4673-2714-5
Electronic_ISBN :
978-0-7695-4758-9
Type :
conf
DOI :
10.1109/COMPSACW.2012.33
Filename :
6341563
Link To Document :
بازگشت