• DocumentCode
    2328581
  • Title

    Distributed data mining on the grid

  • Author

    Jiang, Wu-Shan ; Yu, Ji-Hui

  • Author_Institution
    Dept. of Electr. Eng., Chongqing Univ., China
  • Volume
    4
  • fYear
    2005
  • fDate
    18-21 Aug. 2005
  • Firstpage
    2010
  • Abstract
    Distributed data mining (DDM) is widely used in industrial, scientific and commercial applications to analyze large data sets maintained over geographically distributed sites, which makes DDM a major research issue in today's data mining systems. As the latest member of the distributed computing technology family, grid computing has come to play an increasingly important role as DDM technology has progressed in recent years. This paper analyzes the drawbacks of existing DDM systems and puts forward a service-oriented architecture for DDM on the grid. The mining algorithms and distributed data sets in the proposed framework are abstracted as Web service resources (WS-resources), which can cooperate dynamically to perform DDM as required. Finally, a grid based on a local area network was built with Globus Toolkit 4.0 Beta, and the algorithm WS-resource and dataset WS-resource for data mining on the grid were developed.
  • Keywords
    Internet; data mining; grid computing; DDM technology; Globus Toolkit 4.0 Beta; Web service resource; distributed data mining; service-oriented architecture; Data analysis; Distributed computing; Distributed decision making; Electronic mail; Machine learning; Machine learning algorithms; Mining industry; DDM; WS-Resource
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
  • Conference_Location
    Guangzhou, China
  • Print_ISBN
    0-7803-9091-1
  • Type

    conf

  • DOI
    10.1109/ICMLC.2005.1527275
  • Filename
    1527275