Title :
SAXO: An optimized data-driven symbolic representation of time series
Author :
Bondu, A. ; Boulle, Marc ; Grossin, B.
Author_Institution :
EDF R&D, Clamart, France
Abstract :
In France, the currently emerging “smart grid” and more particularly the 35 millions of “smart meters” will produce a large amount of daily updated metering data. The main french provider of electricity (EDF) is interested by compact and generic representations of time series which allow to accelerate the processing of data. This article proposes a new data-driven symbolic representation of time series named SAXO, where each symbol represents a typical distribution of data points. Furthermore, the time dimension is optimally discretized into intervals by using a parameter free Bayesian coclustering approach (MODL). SAXO is favorably compared with the SAX representation by evaluating a classifier trained from recoded datasets. Our experiments highlight a significant gap in performance between both approaches.
Keywords :
belief networks; data structures; pattern clustering; time series; MODL; SAXO; data points distribution; optimized data-driven symbolic representation; parameter free Bayesian coclustering approach; time series; Clustering algorithms; Data models; Integrated circuit modeling; Joints; Time series analysis; Transforms;
Conference_Titel :
Neural Networks (IJCNN), The 2013 International Joint Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4673-6128-6
DOI :
10.1109/IJCNN.2013.6706816