• DocumentCode
    3249538
  • Title

    Mining motifs in massive time series databases

  • Author

    Patel, Pranav ; Keogh, Eamonn ; Lin, Jessica ; Lonardi, Stefano

  • Author_Institution
    Comput. Sci. & Eng. Dept., California Univ., Riverside, CA, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    370
  • Lastpage
    377
  • Abstract
    The problem of efficiently locating previously known patterns in a time series database (i.e., query by content) has received much attention and may now largely be regarded as a solved problem. However, from a knowledge discovery viewpoint, a more interesting problem is the enumeration of previously unknown, frequently occurring patterns. We call such patterns "motifs", because of their close analogy to their discrete counterparts in computation biology. An efficient motif discovery algorithm for time series would be useful as a tool for summarizing and visualizing massive time series databases. In addition it could be used as a subroutine in various other data mining tasks, including the discovery of association rules, clustering and classification. In this paper we carefully motivate, then introduce, a nontrivial definition of time series motifs. We propose an efficient algorithm to discover them, and we demonstrate the utility and efficiency of our approach on several real world datasets.
  • Keywords
    data mining; database management systems; time series; classification; computation biology; knowledge discovery; massive time series databases; motifs mining; query by content; real world datasets; Association rules; Biology computing; Clustering algorithms; Computer science; Convergence; Data engineering; Data mining; Data visualization; Prototypes; Visual databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183925
  • Filename
    1183925