• DocumentCode
    3306815
  • Title

    Parallel Implementation of Apriori Algorithm Based on MapReduce

  • Author

    Li, Ning ; Zeng, Li ; He, Qing ; Shi, Zhongzhi

  • Author_Institution
    Key Lab. of Intell. Inf. Process., Inst. of Comput. Technol., Beijing, China
  • fYear
    2012
  • fDate
    8-10 Aug. 2012
  • Firstpage
    236
  • Lastpage
    241
  • Abstract
    Searching frequent patterns in transactional databases is considered as one of the most important data mining problems and Apriori is one of the typical algorithms for this task. Developing fast and efficient algorithms that can handle large volumes of data becomes a challenging task due to the large databases. In this paper, we implement a parallel Apriori algorithm based on MapReduce, which is a framework for processing huge datasets on certain kinds of distributable problems using a large number of computers (nodes). The experimental results demonstrate that the proposed algorithm can scale well and efficiently process large datasets on commodity hardware.
  • Keywords
    data mining; parallel databases; parallel processing; MapReduce; commodity hardware; data mining problems; distributable problems; frequent patterns searching; parallel apriori algorithm; parallel implementation; transactional databases; Algorithm design and analysis; Association rules; Computers; Itemsets; Program processors; Apriori algorithm; Frequent itemsets; Large database; MapReduce; Parallel implementation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel & Distributed Computing (SNPD), 2012 13th ACIS International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    978-1-4673-2120-4
  • Type

    conf

  • DOI
    10.1109/SNPD.2012.31
  • Filename
    6299286