DocumentCode
562655
Title
Architectural perspective of parallelizing association rule mining
Author
Sumathi, R. ; Kirubakaran, E.
Author_Institution
Dept. of Comput. Sci. & Eng., J.J. Coll. of Eng. & Technol., Tiruchirappalli, India
fYear
2012
fDate
30-31 March 2012
Firstpage
437
Lastpage
442
Abstract
Association rule mining algorithms play a major role in the data mining research. Association rule mining algorithm is one of the data mining algorithms used to find the association between the items in the item set. Association rule mining is not only used widely for finding the patterns, it can also be extended for prediction. Its major drawback is that, it consumes a computational time. In order to reduce the execution time, the parallel and distributed based algorithms are brought into play. Hence in this paper, we highlight the architectural aspects related to parallel association rule mining (PARM). The ways of achieving parallelism and in turn reduced computational time by employing task and data parallelism in multiprocessing and multicomputing environment is analyzed in this paper. Another major issue encountered by PARM is load balancing. To overcome this, we suggest the incorporation of dynamic load balancing in PARM. Grid computing can be used both for compute intensive tasks and data intensive applications as they proffer resources and data access mechanisms. Grid can offer a computing and data management infrastructure for supporting decentralized and parallel data analysis. In the grid environment, mining system developers can focus on the mining applications and not the issues associated with distributed processing. In this paper, we have also given the infrastructure needed for grid based distributed association mining.
Keywords
data analysis; data mining; grid computing; parallel algorithms; resource allocation; software architecture; PARM; architectural aspect; architectural perspective; association rule mining algorithm; compute intensive task; data access mechanism; data intensive application; data management infrastructure; data mining; data parallelism; decentralized data analysis; distributed based algorithm; distributed processing; dynamic load balancing; execution time reduction; grid based distributed association mining; grid computing; grid environment; item association; item set; multicomputing environment; multiprocessing environment; parallel algorithm; parallel association rule mining; parallel data analysis; pattern finding; task parallelism; Itemsets; Outsourcing; Association rule mining; Distributed and Grid; Parallel;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location
Nagapattinam, Tamil Nadu
Print_ISBN
978-1-4673-0213-5
Type
conf
Filename
6215884
Link To Document