DocumentCode
2437071
Title
Distributed Mining Core of Attributes on Horizontally Partitioned Data
Author
Hu, Dan ; Yu, Xianchuan ; Feng, Yuanfu
Author_Institution
Coll. of Inf. Sci. & Technol., Beijing Normal Univ., Beijing
Volume
2
fYear
2008
fDate
19-20 Dec. 2008
Firstpage
112
Lastpage
116
Abstract
The field of distributed data mining (DDM) has emerged as an active area in recent years because the key challenge in knowledge discovery is the extraction of knowledge from massive databases. Rough set theory (RST) is one of the powerful approaches in data mining, which has been demonstrated to have its usefulness in successfully solving a variety of problems. But there is almost no literature related to the distributed computation in RST. In this paper, the relation between the cores of partitioned data and global data are discussed. An useful proposition is obtained, which shows that the union of the cores of partitioned data is determinedly included in the core of global data. In following, two algorithms, DMC and PPDMC, are proposed for distributed mining of core on horizontally partitioned data. DMC concerns the reduction of time complexity while PPDMC focuses on privacy preserving. Experiment and propositions show the excellent function of DMC and PPDMC through practical and academic ways. Just because the pivotal status of core in RST, the algorithms proposed in this paper will show good foreground in distributed data mining.
Keywords
data analysis; data mining; distributed databases; information retrieval; rough set theory; security of data; distributed data mining; horizontally partitioned data; knowledge discovery; knowledge extraction; massive databases; privacy preservation; rough set theory; Computational intelligence; Computer industry; Computer networks; Conferences; Data analysis; Data mining; Distributed computing; Distributed decision making; Mining industry; Partitioning algorithms; Core; Distributed data mining; Rough set theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Industrial Application, 2008. PACIIA '08. Pacific-Asia Workshop on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3490-9
Type
conf
DOI
10.1109/PACIIA.2008.114
Filename
4756746
Link To Document