Title :
An approach for fast subsequence matching through KMP algorithm in time series databases
Author :
Li, Ai-jun ; Liu, Yun-Hui ; Qi, Ying-jian ; Luo, Si-Wei
Author_Institution :
Dept. of Comput. Sci., Northern Jiaotong Univ., Beijing, China
Abstract :
Sequence matching in time series databases is one of the most important data mining applications. In this paper, we focus on subsequence matching. We propose an efficient approach to compare time series. To simplify the searching process, we first use the KMP algorithm to carry through rough sequence matching. As KMP is a typical algorithm for string matching, we must transform time series into 0-1 string inspired by literature (Keogh and Pazzani,1999); then we quickly. search all rough similar subsequences from major sequence and finally, to reduce the dimension of raw time series data, we use Harr wavelet transform to represent the sequence to be compared and use WT (Wavelet Transformations) coefficients to compute the similarity of two sequences. That we carry out rough matching at first may reduce the numbers of WT and quicken the whole subsequence matching process.
Keywords :
computational complexity; data mining; string matching; time series; wavelet transforms; Wavelet Transformations; rough sequence matching; sequence matching; similarity matching; string matching; subsequence matching; time series; time series databases; wavelet transform; Application software; Computer science; Data mining; Databases; Discrete wavelet transforms; Electronic mail; Euclidean distance; Partial response channels; Time measurement; Wavelet transforms;
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
DOI :
10.1109/ICMLC.2002.1167412