Title :
Mining motifs in massive time series databases
Author :
Patel, Pranav ; Keogh, Eamonn ; Lin, Jessica ; Lonardi, Stefano
Author_Institution :
Comput. Sci. & Eng. Dept., California Univ., Riverside, CA, USA
Abstract :
The problem of efficiently locating previously known patterns in a time series database (i.e., query by content) has received much attention and may now largely be regarded as a solved problem. However, from a knowledge discovery viewpoint, a more interesting problem is the enumeration of previously unknown, frequently occurring patterns. We call such patterns "motifs", because of their close analogy to their discrete counterparts in computation biology. An efficient motif discovery algorithm for time series would be useful as a tool for summarizing and visualizing massive time series databases. In addition it could be used as a subroutine in various other data mining tasks, including the discovery of association rules, clustering and classification. In this paper we carefully motivate, then introduce, a nontrivial definition of time series motifs. We propose an efficient algorithm to discover them, and we demonstrate the utility and efficiency of our approach on several real world datasets.
Keywords :
data mining; database management systems; time series; classification; computation biology; knowledge discovery; massive time series databases; motifs mining; query by content; real world datasets; Association rules; Biology computing; Clustering algorithms; Computer science; Convergence; Data engineering; Data mining; Data visualization; Prototypes; Visual databases;
Conference_Titel :
Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
Print_ISBN :
0-7695-1754-4
DOI :
10.1109/ICDM.2002.1183925