DocumentCode :
1159852
Title :
Distributed data mining on grids: services, tools, and applications
Author :
Cannataro, Mario ; Congiusta, Antonio ; Pugliese, Andrea ; Talia, Domenico ; Trunfio, Paolo
Author_Institution :
Univ. di Catanzaro, Italy
Volume :
34
Issue :
6
fYear :
2004
Firstpage :
2451
Lastpage :
2465
Abstract :
Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called KNOWLEDGE GRID. This paper describes the KNOWLEDGE GRID framework and presents the toolset provided by the KNOWLEDGE GRID for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
Keywords :
data mining; grid computing; parallel processing; very large databases; distributed data mining; distributed knowledge discovery application; distributed system; geographically distributed sites; grid computing; grid programming; grid scheduling; parallel system; very large datasets; Algorithm design and analysis; Application software; Business; Computer industry; Concurrent computing; Data analysis; Data mining; Databases; Distributed computing; Grid computing; Grid computing; data mining; grid programming; grid scheduling; knowledge grid; Algorithms; Artificial Intelligence; Computer Communication Networks; Database Management Systems; Databases, Factual; Information Dissemination; Information Storage and Retrieval; Pattern Recognition, Automated; Software; Software Design; User-Computer Interface;
fLanguage :
English
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4419
Type :
jour
DOI :
10.1109/TSMCB.2004.836890
Filename :
1356036
Link To Document :
بازگشت