DocumentCode :
2130722
Title :
Service Oriented KDD: A Framework for Grid Data Mining Workflows
Author :
Lackovic, Marco ; Talia, Domenico ; Trunfio, Paolo
Author_Institution :
Univ. of Calabria, Rende
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
496
Lastpage :
505
Abstract :
Weka4WS is an extension of the Weka toolkit that supports remote execution of data mining tasks as grid services. A first version of Weka4WS, supporting concurrent execution of multiple data mining tasks on remote grid nodes, was presented in previous work. In this paper we present a new version that also supports the composition and execution of data mining workflows on a grid. This new version of Weka4WS extends the KnowledgeFlow component of Weka by allowing the data mining tasks of a workflow to run in parallel on different machines, thereby reducing execution time. Besides the performance improvement, designing data mining applications as workflows makes it possible to define typical patterns and reuse them in different contexts. We describe the architecture of the system, the functionality of the Weka4WS KnowledgeFlow, and some usage examples with their performance.
Keywords :
data mining; grid computing; KnowledgeFlow component; Weka toolkit; Weka4WS; concurrent execution; grid data mining workflows; grid services; service oriented KDD; Computer networks; Conferences; Data mining; Distributed computing; Grid computing; Inspection; Resource management; Service oriented architecture; Web services; Data mining; Grid; WSRF; Web services; Weka; Weka4WS; workflows;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3503-6
Electronic_ISBN :
978-0-7695-3503-6
Type :
conf
DOI :
10.1109/ICDMW.2008.28
Filename :
4733973