Title :
Privacy-oriented discovery of interesting pattern from numeric attributes
Author_Institution :
Nat. Univ. of Singapore, Singapore
Abstract :
The use and dissemination of the sensitive information is one of the major issues causing concern surrounding knowledge discovery. Existing mining algorithm use the discretization method to partition each numeric attribute into a set of interval during data prepossessing phase. However, not only can such method bring the problem of producing many irrelevant and uninteresting patterns, but also the information is disclosed. In this paper, we propose a new framework to address this issue. The new approach first perturbs and transforms the original data set based on a set of different belief level without information loss. After that, the transformed data are sent to the data mining consultancy, then rules under different belief levels are generated. After that, the interesting filter is used to eliminate the redundant rules. Rules are useful only in the context of partition performed by the data provider and there is no information disclosure. The proposed technique has been applied to a number of sensitive real life data sets. Experiments results show that our proposed technique is very effective especially when there are many numeric attributes in the data set.
Keywords :
belief networks; data mining; data privacy; fuzzy set theory; information dissemination; belief level; data mining algorithm; data prepossessing phase; data sets; discretization method; fuzzy set theory; knowledge discovery; numeric attributes; privacy oriented discovery; Credit cards; Data mining; Data privacy; Data security; Databases; Filters; Fuzzy systems; Information security; Remuneration; Risk analysis;
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
Print_ISBN :
0-7803-7952-7
DOI :
10.1109/ICSMC.2003.1244228