DocumentCode :
182978
Title :
An improved parallel association rules algorithm based on MapReduce framework for big data
Author :
Xinhao Zhou ; Yongfeng Huang
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
19-21 Aug. 2014
Firstpage :
284
Lastpage :
288
Abstract :
Association rules mining is one of the most popular and significant issue in data mining and intends to discovery interest relations between variables in database. In our paper, we implemented an improved parallel Apriori algorithm which realized both count and candidate generation steps under MapReduce framework, while existing parallel Apriori algorithm only considered count step. We analyzed the time complexity of our improved parallel algorithm and compared to the original parallel algorithm, which indicates advantages of our algorithm with massive candidate item sets. Based on our experiment result, we proved that our algorithm performs better under big data situation and achieves excellent speedup feature.
Keywords :
Big Data; computational complexity; data handling; data mining; parallel algorithms; MapReduce framework; association rule mining; big data; big data situation; candidate generation steps; improved parallel Apriori algorithm; improved parallel association rule algorithm; time complexity; Algorithm design and analysis; Association rules; Big data; Clustering algorithms; Databases; Time complexity; Apriori; Association Rules; Data Mining; Hadoop; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
Type :
conf
DOI :
10.1109/FSKD.2014.6980847
Filename :
6980847
Link To Document :
بازگشت