Title :
An online algorithm for segmenting time series
Author :
Keogh, Eamonn ; Chu, Selina ; Hart, David ; Pazzani, Michael
Author_Institution :
Dept. of Inf. & Comput. Sci., California Univ., Irvine, CA, USA
Abstract :
In recent years, there has been an explosion of interest in mining time-series databases. As with most computer science problems, representation of the data is the key to efficient and effective solutions. One of the most commonly used representations is piecewise linear approximation. This representation has been used by various researchers to support clustering, classification, indexing and association rule mining of time-series data. A variety of algorithms have been proposed to obtain this representation, with several algorithms having been independently rediscovered several times. In this paper, we undertake the first extensive review and empirical comparison of all proposed techniques. We show that all these algorithms have fatal flaws from a data-mining perspective. We introduce a novel algorithm that we empirically show to be superior to all others in the literature.
Keywords :
data mining; online operation; piecewise linear techniques; reviews; time series; association rule mining; classification; clustering; data representation; empirical comparison; indexing; online algorithm; piecewise linear approximation; review; time series segmentation; time-series database mining; Association rules; Change detection algorithms; Clustering algorithms; Computer science; Data mining; Databases; Explosions; Indexing; Piecewise linear approximation; Piecewise linear techniques
Conference_Title :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989531