• DocumentCode
    3200200
  • Title

    Association Rule Mining with the Micron Automata Processor

  • Author

    Wang, Ke ; Qi, Yanjun ; Fox, Jeffrey J. ; Stan, Mircea R. ; Skadron, Kevin

  • Author_Institution
    Dept. of Comp. Sci., Univ. of Virginia, Charlottesville, VA, USA
  • fYear
    2015
  • fDate
    25-29 May 2015
  • Firstpage
    689
  • Lastpage
    699
  • Abstract
    Association rule mining (ARM) is a widely used data mining technique for discovering sets of frequently associated items in large databases. As datasets grow in size and real-time analysis becomes important, the performance of ARM implementations can impede their applicability. We accelerate ARM by using Micron's Automata Processor (AP), a hardware implementation of non-deterministic finite automata (NFAs) with additional features that significantly expand the AP's capabilities beyond those of traditional NFAs. The Apriori algorithm that ARM uses for discovering item sets maps naturally to the massive parallelism of the AP. We implement the multipass pruning strategy used in Apriori ARM through the AP's symbol replacement capability, a form of lightweight reconfigurability. On seven synthetic and real-world datasets, the AP-accelerated Apriori achieves speedups of up to 129X over a single-core CPU implementation of Apriori and up to 49X over a 6-core multicore CPU implementation of Eclat, a more efficient ARM algorithm. The AP-accelerated Apriori solution also outperforms GPU implementations of Eclat, especially for large datasets. Technology scaling projections suggest even better speedups from future generations of the AP.
  • Keywords
    data mining; finite automata; multiprocessing systems; 6-core multicore CPU; AP's symbol replacement capability; Eclat; NFAs; Apriori ARM algorithm; Apriori single-core CPU; association rule mining; data mining technique; Micron Automata Processor; multipass pruning strategy; nondeterministic finite automata; technology scaling projections; Association rules; Automata; Graphics processing units; Itemsets; Optimization; Particle separators; Radiation detectors; Automata Processor; association rule mining; frequent set mining;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International
  • Conference_Location
    Hyderabad
  • ISSN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2015.101
  • Filename
    7161556