DocumentCode :
3306815
Title :
Parallel Implementation of Apriori Algorithm Based on MapReduce
Author :
Li, Ning ; Zeng, Li ; He, Qing ; Shi, Zhongzhi
Author_Institution :
Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
fYear :
2012
fDate :
8-10 Aug. 2012
Firstpage :
236
Lastpage :
241
Abstract :
Searching frequent patterns in transactional databases is considered as one of the most important data mining problems and Apriori is one of the typical algorithms for this task. Developing fast and efficient algorithms that can handle large volumes of data becomes a challenging task due to the large databases. In this paper, we implement a parallel Apriori algorithm based on MapReduce, which is a framework for processing huge datasets on certain kinds of distributable problems using a large number of computers (nodes). The experimental results demonstrate that the proposed algorithm can scale well and efficiently process large datasets on commodity hardware.
Keywords :
data mining; parallel databases; parallel processing; MapReduce; commodity hardware; data mining problems; distributable problems; frequent patterns searching; parallel apriori algorithm; parallel implementation; transactional databases; Algorithm design and analysis; Association rules; Computers; Itemsets; Program processors; Apriori algorithm; Frequent itemsets; Large database; MapReduce; Parallel implementation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking and Parallel & Distributed Computing (SNPD), 2012 13th ACIS International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-2120-4
Type :
conf
DOI :
10.1109/SNPD.2012.31
Filename :
6299286
Link To Document :
بازگشت