Title :
Simultaneous pattern and data hiding in supervised learning
Author :
Pengpeng Lin ; Jun Zhang ; Xiwei Wang ; Shindhelm, A.
Author_Institution :
Comput. Sci. Dept., Univ. of Kentucky, Lexington, KY, USA
Abstract :
The ability to hide private data and confidential patterns from potential adversaries while still maintaining data mining value is an important aspect in privacy preserving data mining. In this paper, we study a nonnegative matrix factorization technique, where we show how to define objective functions and derive corresponding multiplicative update functions. We then use that knowledge to propose a data value perturbation scheme that hides data values but still keeps the data pattern to a large degree. Based on the proposed data value perturbation scheme, we develop a dual data hiding scheme which not only hides data but also hides individual sample´s class membership. The essential idea is to use an indicator matrix as a guide for the update process. The performance of the proposed schemes are examined on benchmark datasets for both utility value and data perturbation degree. The empirical results show that the data values are well perturbed and our schemes are capable of hiding a data sample´s class membership without side effects. At the end, we draw some interesting conclusions and layout potential future work.
Keywords :
data encapsulation; data mining; data privacy; learning (artificial intelligence); matrix decomposition; perturbation techniques; benchmark datasets; class membership; confidential patterns; data mining value; data pattern; data perturbation degree; data value perturbation scheme; dual data hiding scheme; indicator matrix; multiplicative update functions; nonnegative matrix factorization technique; objective functions; privacy preserving data mining; private data hiding; simultaneous pattern; supervised learning; update process; utility value; Clustering algorithms; Convergence; Data privacy; Linear programming; Matrix decomposition; USA Councils; Vectors; Classification; Indicator Matrix; NMF; PPDM;
Conference_Titel :
Information Reuse and Integration (IRI), 2012 IEEE 13th International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-2282-9
Electronic_ISBN :
978-1-4673-2283-6
DOI :
10.1109/IRI.2012.6303035