DocumentCode :
1925718
Title :
A Scalable Method of Mining Approximate Multidimensional Sequential Patterns on Distributed Systemts
Author :
Hu, Kong-fa ; Zhang, Chang-hai ; Chen, Ling
Author_Institution :
Yangzhou Univ., Yangzhou
Volume :
2
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
762
Lastpage :
766
Abstract :
A scalable and effective algorithm called AMGMSP (approximate mining of global multidimensional sequential patterns) is proposed to solve the problem of mining the multidimensional sequential patterns for large databases in the distributed environment. First, the multidimensional information is embedded into the corresponding sequences in order to convert the mining on the multidimensional sequential patterns to sequential patterns. Then the sequences are clustered, summarized, and analyzed on the distributed sites, and the local patterns could be obtained by the effective approximate sequential pattern mining method. Finally, the global multidimensional sequential patterns could be mined by high vote sequential patterns after collecting all the local patterns on one site. Both the theories and the experiments indicate that this method could simplify the problem of mining the multidimensional sequential patterns and avoid mining the redundant information. The global sequential patterns could be obtained effectively by the scalable method after reducing the cost of communication.
Keywords :
approximation theory; data mining; distributed databases; pattern clustering; very large databases; approximate mining; distributed database; global multidimensional sequential pattern mining; large databases; sequence analysis; sequence clustering; sequence summarization; Computer science; Cybernetics; Data engineering; Data mining; Distributed databases; Itemsets; Machine learning; Multidimensional systems; Pattern matching; Voting; Approximate sequential pattern mining; Distributed database; Multidimensional sequential patterns;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370246
Filename :
4370246
Link To Document :
بازگشت