Title :
A weighted load-balancing parallel Apriori algorithm for association rule mining
Author :
Yu, Kun-Ming ; Zhou, Jia-Ling
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Chung Hua Univ., Hsinchu
Abstract :
Because of the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. One of the most important challenges for data mining is quickly and correctly finding the relationship between data. The Apriori algorithm is the most popular technique in association rules mining; however, when applying this method, a database has to be scanned many times and many candidate itemsets are generated. Parallel computing is an effective strategy for accelerating the mining process. In this paper, the weighted distributed parallel apriori algorithm (WDPA) is presented as a solution to this problem. In the proposed method, metadata are stored in TID forms, thus only a single scan to the database is needed. The TID counts are also taken into consideration, and therefore better load-balancing as well as reducing idle time for processors can be achieved. According to the experimental results, WDPA outperforms other algorithms while having lower minimum support.
Keywords :
data mining; parallel processing; resource allocation; WDPA; association rule mining; data mining; parallel computing; weighted distributed parallel apriori algorithm; weighted load-balancing parallel apriori algorithm; Acceleration; Association rules; Computer science; Concurrent computing; Data mining; Distributed computing; Information management; Itemsets; Parallel processing; Transaction databases;
Conference_Titel :
Granular Computing, 2008. GrC 2008. IEEE International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-2512-9
Electronic_ISBN :
978-1-4244-2513-6
DOI :
10.1109/GRC.2008.4664768