• DocumentCode
    679535
  • Title

    Time Series Classification Using Compression Distance of Recurrence Plots

  • Author

    Silva, Diego F. ; De Souza, Vinicius M. A. ; Batista, Gustavo E. A. P. A.

  • Author_Institution
    Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Carlos, Brazil
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    687
  • Lastpage
    696
  • Abstract
    There is a huge increase of interest for time series methods and techniques. Virtually every piece of information collected from human, natural, and biological processes is susceptible to changes over time, and the study of how these changes occur is a central issue in fully understanding such processes. Among all time series mining tasks, classification is likely to be the most prominent one. In time series classification there is a significant body of empirical research that indicates that k-nearest neighbor rule in the time domain is very effective. However, certain time series features are not easily identified in this domain and a change in representation may reveal some significant and unknown features. In this work, we propose the use of recurrence plots as representation domain for time series classification. Our approach measures the similarity between recurrence plots using Campana-Keogh (CK-1) distance, a Kolmogorov complexity-based distance that uses video compression algorithms to estimate image similarity. We show that recurrence plots allied to CK-1 distance lead to significant improvements in accuracy rates compared to Euclidean distance and Dynamic Time Warping in several data sets. Although recurrence plots cannot provide the best accuracy rates for all data sets, we demonstrate that we can predict ahead of time that our method will outperform the time representation with Euclidean and Dynamic Time Warping distances.
  • Keywords
    data compression; data mining; image classification; time series; video coding; CK-1 distance; Campana-Keogh distance; Euclidean distance; Kolmogorov complexity-based distance; dynamic time warping; image similarity estimation; k-nearest neighbor rule; recurrence plot compression distance; similarity measurement; time series classification; time series mining task; video compression algorithms; Accuracy; Complexity theory; Equations; Euclidean distance; Mathematical model; Time series analysis; Training; Time series; classification; dstance measure; recurrence plot;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2013.128
  • Filename
    6729553