Title :
Parallel sequential pattern mining by transaction decomposition
Author :
Wang, Xueqiang ; Wang, Jing ; Wang, Tengjiao ; Li, Hongyan ; Yang, Dongqing
Author_Institution :
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
Abstract :
Sequential pattern mining is an important and useful tool with broad applications, such as analyzing customer purchase behavior, recommending services to customers, and so on. It is challenging since explosive number of subsequences need to be examined and both the memory and computational cost are becoming extremely expensive when the sequence database grows huge. Many previous algorithms developed for efficient mining of sequential patterns encounter problems to deal with large scale data. In this paper, we propose a parallel sequential pattern mining method, called PTDS (i.e., Parallel Transaction-Decomposed Sequential pattern mining), which decomposes transactions to mine sequential patterns. PTDS greatly accelerates pattern growth and improves the efficiency of parallel algorithm on large scale data. We experiment on a large dataset consisting of 16 million service purchase sequences. Besides scalability, the empirical comparisons show that PTDS consistently outperforms both the PrefixSpan-based parallel method and serial algorithm.
Keywords :
business data processing; data mining; transaction processing; data mining; parallel algorithm; parallel sequential pattern mining; service purchase sequences; transaction decomposition; Algorithm design and analysis; Computers; Data mining; Databases; Scalability; Silicon; Sorting; data mining; parallel sequential pattern mining; transaction decomposition;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location :
Yantai, Shandong
Print_ISBN :
978-1-4244-5931-5
DOI :
10.1109/FSKD.2010.5569404