Title :
Mining Interesting and Contiguous Maximal Sequential Patterns on High Dimensional Sequences
Author :
Jian Ding ; Meng Han
Author_Institution :
Beifang Univ. of Nat., Yinchuan, China
Abstract :
Previous methods have presented convincing arguments that mining complete set of patterns is huge for effective usage. A compact but high quality set of patterns, such as closed patterns and maximal patterns is needed. Most of the previously maximal sequential pattern mining algorithms on high dimensional sequence, such as biological data set, work under the same support. In this paper, an efficient algorithm MM-Prefix Span (Maximal and Multi-Support-based Prefix Span) for mining maximal patterns based on multi-support is proposed. Thorough performances on Beta-globin gene sequences have demonstrated that MM-Prefix Span consumes less memory usage and runtime than Prefix Span. It generates compressed results and two kinds of interesting patterns.
Keywords :
biology computing; data mining; Beta-globin gene sequences; MM-Prefix Span algorithm; closed patterns; contiguous maximal sequential pattern mining; high dimensional sequences; interesting pattern mining; maximal and multisupport-based prefix span algorithm; Algorithm design and analysis; Bioinformatics; DNA; Data mining; Databases; Runtime; data mining; high dimensional sequence; maximal pattern; multi support; sequential pattern mining;
Conference_Titel :
Measuring Technology and Mechatronics Automation (ICMTMA), 2013 Fifth International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-5652-7
DOI :
10.1109/ICMTMA.2013.173