DocumentCode
2328581
Title
Distributed data mining on the grid
Author
Jiang, Wu-Shan ; Yu, Ji-Hui
Author_Institution
Dept. of Electr. Eng., Chongqing Univ., China
Volume
4
fYear
2005
fDate
18-21 Aug. 2005
Firstpage
2010
Abstract
Distributed data mining (DDM) is widely used in industrial, scientific and commercial applications to analyze large data sets maintained over geographically distributed sites, which makes DDM a major research issue on today´s data mining system. As a latest member in distributed computing technology family, the grid computing can play an increasingly important role with the progress of the DDM technology in recent years. This paper analyzed the drawback of existing DDM systems and put forward a service-oriented architecture of DDM on the grid. The mining algorithm and distributed data sets in the proposed framework are abstracted as Web service resource (WS-resource), which can cooperate to perform DDM as required dynamically. Finally, a grid based on local area network was built with Globus Toolkit 4.0Beta and the algorithm of WS-resource, dataset WS-resource for data mining on the grid are developed.
Keywords
Internet; data mining; grid computing; DDM technology; Globus Toolkit 4.0 Beta; Web service resource; distributed data mining; grid computing; service-oriented architecture; Data analysis; Data mining; Distributed computing; Distributed decision making; Electronic mail; Grid computing; Machine learning; Machine learning algorithms; Mining industry; Service oriented architecture; DDM; Grid computing; WS-Resource;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location
Guangzhou, China
Print_ISBN
0-7803-9091-1
Type
conf
DOI
10.1109/ICMLC.2005.1527275
Filename
1527275
Link To Document