Title :
Privacy-Preserving Computation of Bayesian Networks on Vertically Partitioned Data
Author :
Yang, Zhiqiang ; Wright, Rebecca N.
Author_Institution :
Dept. of Comput. Sci., Stevens Inst. of Technol., Hoboken, NJ
Abstract :
Traditionally, many data mining techniques have been designed in the centralized model in which all data is collected and available in one central site. However, as more and more activities are carried out using computers and computer networks, the amount of potentially sensitive data stored by business, governments, and other parties increases. Different parties often wish to benefit from cooperative use of their data, but privacy regulations and other privacy concerns may prevent the parties from sharing their data. Privacy-preserving data mining provides a solution by creating distributed data mining algorithms in which the underlying data need not be revealed. In this paper, we present privacy-preserving protocols for a particular data mining task: learning a Bayesian network from a database vertically partitioned among two parties. In this setting, two parties owning confidential databases wish to learn the Bayesian network on the combination of their databases without revealing anything else about their data to each other. We present an efficient and privacy-preserving protocol to construct a Bayesian network on the parties´ joint data
Keywords :
belief networks; data mining; data privacy; distributed algorithms; learning (artificial intelligence); protocols; Bayesian network learning; confidential databases; distributed data mining algorithm; privacy regulation; privacy-preserving data mining; privacy-preserving protocol; vertically partitioned data; Bayesian methods; Computer networks; DNA; Data mining; Data privacy; Databases; Government; Hospitals; Partitioning algorithms; Protocols; Bayesian networks; Data privacy; privacy-preserving data mining.;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2006.147