Title :
A privacy preserving Jaccard similarity function for mining encrypted data
Author :
Singh, Meena Dilip ; Krishna, Radha P. ; Saxena, Ashutosh
Author_Institution :
SETLabs, Infosys Technol. Ltd., Bangalore, India
Abstract :
Due to advances in data collection and increasing dependency on data mining experts, preserving privacy of the data is a major concern when mining the data. Most of the classifier implementations for data mining have the tradeoff between classification accuracy and maintenance of data privacy. Another important aspect in distance-based classifiers is to accurately compute distance (or similarity) between two or more data points. In privacy preserving data mining techniques, providing a suitable distance measure to classify the data while maintaining data privacy is a challenging task. In this paper, we present an approach to compute similarity between two encrypted data points. We augmented Jaccard similarity function with Private Equality Test protocol facilitating a semi honest third party to conduct the equality test. The proposed privacy preserving scheme provides an efficient mechanism for similarity computation with reduced communication cost for mining the data.
Keywords :
cryptography; data mining; data privacy; classifier implementations; encrypted data mining; encrypted data points; privacy preserving jaccard similarity function; private equality test protocol; similarity computation; Clustering algorithms; Costs; Cryptographic protocols; Cryptography; Data mining; Data privacy; Data security; Databases; Testing; Usability; Cryptography; Encrypted Data; Privacy Preserving Data Mining; Private Equality Test;
Conference_Titel :
TENCON 2009 - 2009 IEEE Region 10 Conference
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-4546-2
Electronic_ISBN :
978-1-4244-4547-9
DOI :
10.1109/TENCON.2009.5395869