DocumentCode
2030937
Title
Parallel sequential pattern mining by transaction decomposition
Author
Wang, Xueqiang ; Wang, Jing ; Wang, Tengjiao ; Li, Hongyan ; Yang, Dongqing
Author_Institution
Sch. of Electron. Eng. & Comput. Sci., Peking Univ., Beijing, China
Volume
4
fYear
2010
fDate
10-12 Aug. 2010
Firstpage
1746
Lastpage
1750
Abstract
Sequential pattern mining is an important and useful tool with broad applications, such as analyzing customer purchase behavior, recommending services to customers, and so on. It is challenging since explosive number of subsequences need to be examined and both the memory and computational cost are becoming extremely expensive when the sequence database grows huge. Many previous algorithms developed for efficient mining of sequential patterns encounter problems to deal with large scale data. In this paper, we propose a parallel sequential pattern mining method, called PTDS (i.e., Parallel Transaction-Decomposed Sequential pattern mining), which decomposes transactions to mine sequential patterns. PTDS greatly accelerates pattern growth and improves the efficiency of parallel algorithm on large scale data. We experiment on a large dataset consisting of 16 million service purchase sequences. Besides scalability, the empirical comparisons show that PTDS consistently outperforms both the PrefixSpan-based parallel method and serial algorithm.
Keywords
business data processing; data mining; transaction processing; data mining; parallel algorithm; parallel sequential pattern mining; service purchase sequences; transaction decomposition; Algorithm design and analysis; Computers; Data mining; Databases; Scalability; Silicon; Sorting; data mining; parallel sequential pattern mining; transaction decomposition;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2010 Seventh International Conference on
Conference_Location
Yantai, Shandong
Print_ISBN
978-1-4244-5931-5
Type
conf
DOI
10.1109/FSKD.2010.5569404
Filename
5569404
Link To Document