• DocumentCode
    2437071
  • Title

    Distributed Mining Core of Attributes on Horizontally Partitioned Data

  • Author

    Hu, Dan ; Yu, Xianchuan ; Feng, Yuanfu

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Beijing Normal Univ., Beijing
  • Volume
    2
  • fYear
    2008
  • fDate
    19-20 Dec. 2008
  • Firstpage
    112
  • Lastpage
    116
  • Abstract
    The field of distributed data mining (DDM) has emerged as an active area in recent years because the key challenge in knowledge discovery is the extraction of knowledge from massive databases. Rough set theory (RST) is one of the powerful approaches in data mining, which has been demonstrated to have its usefulness in successfully solving a variety of problems. But there is almost no literature related to the distributed computation in RST. In this paper, the relation between the cores of partitioned data and global data are discussed. An useful proposition is obtained, which shows that the union of the cores of partitioned data is determinedly included in the core of global data. In following, two algorithms, DMC and PPDMC, are proposed for distributed mining of core on horizontally partitioned data. DMC concerns the reduction of time complexity while PPDMC focuses on privacy preserving. Experiment and propositions show the excellent function of DMC and PPDMC through practical and academic ways. Just because the pivotal status of core in RST, the algorithms proposed in this paper will show good foreground in distributed data mining.
  • Keywords
    data analysis; data mining; distributed databases; information retrieval; rough set theory; security of data; distributed data mining; horizontally partitioned data; knowledge discovery; knowledge extraction; massive databases; privacy preservation; rough set theory; Computational intelligence; Computer industry; Computer networks; Conferences; Data analysis; Data mining; Distributed computing; Distributed decision making; Mining industry; Partitioning algorithms; Core; Distributed data mining; Rough set theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Industrial Application, 2008. PACIIA '08. Pacific-Asia Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3490-9
  • Type

    conf

  • DOI
    10.1109/PACIIA.2008.114
  • Filename
    4756746