• DocumentCode
    3263117
  • Title

    A weighted load-balancing parallel Apriori algorithm for association rule mining

  • Author

    Yu, Kun-Ming ; Zhou, Jia-Ling

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Chung Hua Univ., Hsinchu
  • fYear
    2008
  • fDate
    26-28 Aug. 2008
  • Firstpage
    756
  • Lastpage
    761
  • Abstract
    Because of the exponential growth in worldwide information, companies have to deal with an ever growing amount of digital information. One of the most important challenges for data mining is quickly and correctly finding the relationship between data. The Apriori algorithm is the most popular technique in association rules mining; however, when applying this method, a database has to be scanned many times and many candidate itemsets are generated. Parallel computing is an effective strategy for accelerating the mining process. In this paper, the weighted distributed parallel apriori algorithm (WDPA) is presented as a solution to this problem. In the proposed method, metadata are stored in TID forms, thus only a single scan to the database is needed. The TID counts are also taken into consideration, and therefore better load-balancing as well as reducing idle time for processors can be achieved. According to the experimental results, WDPA outperforms other algorithms while having lower minimum support.
  • Keywords
    data mining; parallel processing; resource allocation; WDPA; association rule mining; data mining; parallel computing; weighted distributed parallel apriori algorithm; weighted load-balancing parallel apriori algorithm; Acceleration; Association rules; Computer science; Concurrent computing; Data mining; Distributed computing; Information management; Itemsets; Parallel processing; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Granular Computing, 2008. GrC 2008. IEEE International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4244-2512-9
  • Electronic_ISBN
    978-1-4244-2513-6
  • Type

    conf

  • DOI
    10.1109/GRC.2008.4664768
  • Filename
    4664768