DocumentCode
3200200
Title
Association Rule Mining with the Micron Automata Processor
Author
Ke Wang ; Yanjun Qi ; Fox, Jeffrey J. ; Stan, Mircea R. ; Skadron, Kevin
Author_Institution
Dept. of Comp. Sci., Univ. of Virginia, Charlottesville, VA, USA
fYear
2015
fDate
25-29 May 2015
Firstpage
689
Lastpage
699
Abstract
Association rule mining (ARM) is a widely used data mining technique for discovering sets of frequently associated items in large databases. As datasets grow in size and real-time analysis becomes important, the performance of ARM implementation can impede its applicability. We accelerate ARM by using Micron´s Automata Processor (AP), a hardware implementation of non-deterministic finite automata (NFAs), with additional features that significantly expand the APs capabilities beyond those of traditional NFAs. The Apriori algorithm that ARM uses for discovering item sets maps naturally to the massive parallelism of the AP. We implement the multipass pruning strategy used in the Apriori ARM through the APs symbol replacement capability, a form of lightweight reconfigurability. Up to 129X and 49X speedups are achieved by the AP-accelerated Apriori on seven synthetic and real-world datasets, when compared with the Apriori single-core CPU implementation and Eclat, a more efficient ARM algorithm, 6-core multicourse CPU implementation, respectively. The AP-accelerated Apriori solution also outperforms GPU implementations of Eclat especially for large datasets. Technology scaling projections suggest even better speedups from future generations of AP.
Keywords
data mining; finite automata; multiprocessing systems; 6-core multicourse CPU; APs symbol replacement capability; Eclat; NFAs; a priori ARM algorithm; a priori single-core CPU; association rule mining; data mining technique; micron automata processor; multipass pruning strategy; nondeterministic finite automata; technology scaling projections; Association rules; Automata; Graphics processing units; Itemsets; Optimization; Particle separators; Radiation detectors; Automata Processor; association rule mining; frequent set mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
Conference_Location
Hyderabad
ISSN
1530-2075
Type
conf
DOI
10.1109/IPDPS.2015.101
Filename
7161556
Link To Document