DocumentCode :
3200200
Title :
Association Rule Mining with the Micron Automata Processor
Author :
Ke Wang ; Yanjun Qi ; Fox, Jeffrey J. ; Stan, Mircea R. ; Skadron, Kevin
Author_Institution :
Dept. of Comp. Sci., Univ. of Virginia, Charlottesville, VA, USA
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
689
Lastpage :
699
Abstract :
Association rule mining (ARM) is a widely used data mining technique for discovering sets of frequently associated items in large databases. As datasets grow in size and real-time analysis becomes important, the performance of ARM implementation can impede its applicability. We accelerate ARM by using Micron´s Automata Processor (AP), a hardware implementation of non-deterministic finite automata (NFAs), with additional features that significantly expand the APs capabilities beyond those of traditional NFAs. The Apriori algorithm that ARM uses for discovering item sets maps naturally to the massive parallelism of the AP. We implement the multipass pruning strategy used in the Apriori ARM through the APs symbol replacement capability, a form of lightweight reconfigurability. Up to 129X and 49X speedups are achieved by the AP-accelerated Apriori on seven synthetic and real-world datasets, when compared with the Apriori single-core CPU implementation and Eclat, a more efficient ARM algorithm, 6-core multicourse CPU implementation, respectively. The AP-accelerated Apriori solution also outperforms GPU implementations of Eclat especially for large datasets. Technology scaling projections suggest even better speedups from future generations of AP.
Keywords :
data mining; finite automata; multiprocessing systems; 6-core multicourse CPU; APs symbol replacement capability; Eclat; NFAs; a priori ARM algorithm; a priori single-core CPU; association rule mining; data mining technique; micron automata processor; multipass pruning strategy; nondeterministic finite automata; technology scaling projections; Association rules; Automata; Graphics processing units; Itemsets; Optimization; Particle separators; Radiation detectors; Automata Processor; association rule mining; frequent set mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location :
Hyderabad
ISSN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2015.101
Filename :
7161556
Link To Document :
بازگشت