• DocumentCode
    2258146
  • Title

    Distributed Mining Reducts of Attributes on Horizontally Partitioned Data

  • Author

    Hu, Dan ; Yu, Xianchuan ; Feng, Yuanfu

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Beijing Normal Univ., Beijing, China
  • Volume
    1
  • fYear
    2008
  • fDate
    20-22 Dec. 2008
  • Firstpage
    55
  • Lastpage
    59
  • Abstract
    The field of distributed data mining (DDM) has emerged as an active area in recent years because the key challenge in knowledge discovery is the extraction of knowledge from massive databases. Rough set theory (RST) is one of the powerful approaches in data mining, which has been demonstrated to have its usefulness in successfully solving a variety of problems. But there is almost no literature related to the distributed computation in RST. In this paper, the relation between the reducts of partitioned data and global data are discussed. An useful proposition is obtained, which shows that every reduct of global data determinedly has subsets as the elements in reducts of partitioned data. In following, two algorithms, DMR and PPDMR, are proposed for distributed mining of reducts on horizontally partitioned data. DMR concerns the reduction of time complexity while PPDMR focuses on privacy preserving. Experiments and propositions show the excellent function of DMR and PPDMR through practical and academic ways. Just because the pivotal status of reduct in RST, the algorithms proposed in this paper will show good foreground in distributed data mining.
  • Keywords
    computational complexity; data mining; data privacy; data reduction; distributed algorithms; rough set theory; very large databases; distributed data mining algorithm; horizontally partitioned data attribute reduction; knowledge discovery; knowledge extraction; massive database; privacy preservation; rough set theory; time complexity; Computer networks; Data analysis; Data mining; Deductive databases; Distributed computing; Distributed decision making; Information science; Information technology; Partitioning algorithms; Privacy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3497-8
  • Type

    conf

  • DOI
    10.1109/IITA.2008.398
  • Filename
    4739534