• DocumentCode
    2030937
  • Title

    Parallel sequential pattern mining by transaction decomposition

  • Author

    Wang, Xueqiang ; Wang, Jing ; Wang, Tengjiao ; Li, Hongyan ; Yang, Dongqing

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
  • Volume
    4
  • fYear
    2010
  • fDate
    10-12 Aug. 2010
  • Firstpage
    1746
  • Lastpage
    1750
  • Abstract
    Sequential pattern mining is an important and useful tool with broad applications, such as analyzing customer purchase behavior, recommending services to customers, and so on. It is challenging since explosive number of subsequences need to be examined and both the memory and computational cost are becoming extremely expensive when the sequence database grows huge. Many previous algorithms developed for efficient mining of sequential patterns encounter problems to deal with large scale data. In this paper, we propose a parallel sequential pattern mining method, called PTDS (i.e., Parallel Transaction-Decomposed Sequential pattern mining), which decomposes transactions to mine sequential patterns. PTDS greatly accelerates pattern growth and improves the efficiency of parallel algorithm on large scale data. We experiment on a large dataset consisting of 16 million service purchase sequences. Besides scalability, the empirical comparisons show that PTDS consistently outperforms both the PrefixSpan-based parallel method and serial algorithm.
  • Keywords
    business data processing; data mining; transaction processing; data mining; parallel algorithm; parallel sequential pattern mining; service purchase sequences; transaction decomposition; Algorithm design and analysis; Computers; Data mining; Databases; Scalability; Silicon; Sorting; data mining; parallel sequential pattern mining; transaction decomposition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
  • Conference_Location
    Yantai, Shandong
  • Print_ISBN
    978-1-4244-5931-5
  • Type

    conf

  • DOI
    10.1109/FSKD.2010.5569404
  • Filename
    5569404