Title :
An Efficient Mining Maximal Frequent Traversal Sequences Algorithm Based on Bidirectional Constraint
Author :
Ren, Jia-dong ; Zhang, Xiao-jian ; Peng, Hui-li
Author_Institution :
Coll. of Inf. Sci. & Eng., YanShan Univ., Qinhuangdao
Abstract :
Mining maximal frequent traversal sequence is a crucial application in Web usage mining, since users´ traversal pattern and motivation are latent in session sequence at some time segment. A Frequent Traversal Sequence Tree structure with Dwell time (FTSD-Tree) is designed. Utilizing FTSD-Tree to store, compress the session database that is constrained by the bidirectional dwell time, and simplify the configuration of dwell time thresholds during mining. A novel algorithm named maximal frequent traversal sequence mining (MFTSM) is presented, which quickly traverses FTSD-Tree and discovers maximal frequent traversal sequence from the session sequences. Experimental results show that MFTSM can significantly improve the execution time efficiency for mining maximal frequent traversal sequence as long as the decision-makers or users give the appropriate constraints. Our performance study at runtime shows that MFTSM is faster than the well-known algorithms GSP, SPAM, MSPS and SPADE in the time constraint environment
Keywords :
Internet; data mining; tree data structures; tree searching; FTSD-Tree in-memory data structure; MFTSM algorithm; Web usage mining; bidirectional constraint; bidirectional dwell time technique; maximal frequent traversal sequence mining algorithm; Cybernetics; Data mining; Databases; Educational institutions; Electronic mail; Information science; Machine learning; Machine learning algorithms; Runtime environment; Time factors; Tree data structures; Unsolicited electronic mail; Web server; Dwell time; FTSD-Tree; Maximal frequent traversal sequence;
Conference_Titel :
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location :
Dalian, China
Print_ISBN :
1-4244-0061-9
DOI :
10.1109/ICMLC.2006.258832