DocumentCode :
1791567
Title :
TRISTAN: Real-time analytics on massive time series using sparse dictionary compression
Author :
Marascu, Alice ; Pompey, Pascal ; Bouillet, Eric ; Wurst, Michael ; Verscheure, Olivier ; Grund, Martin ; Cudre-Mauroux, Philippe
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
291
Lastpage :
300
Abstract :
Large-scale critical infrastructures such as transportation, energy, or water distribution networks are increasingly equipped with smart sensor technologies. Low-latency analytics on the resulting times series would open the door to many exciting opportunities to improve our grasp on complex urban systems. However, sensor-generated time series often turn out to be noisy, non-uniformly sampled, and misaligned in practice, making them ill-suited for traditional data processing. In this paper, we introduce TRISTAN (massive TRIckletS Time series ANalysis), a new data management system for efficient storage and real-time processing of fine-grained time series data. TRISTAN relies on a dedicated, compressed sparse representation of the time series using a dictionary. In contrast to previous approaches, TRISTAN is able to execute most analytics queries on the compressed data directly, and supports efficient and approximate query answering based on the most significant atoms of the dictionary only. We present the overall architecture of our system and discuss its performance on several smarter city datasets, showing that TRISTAN can achieve up to 20:1 compression ratios and 250x speedup compared to a state-of-the-art system.
Keywords :
Big Data; intelligent sensors; real-time systems; storage management; time series; TRISTAN; data management system; data processing; data storage; fine-grained time series data; large-scale critical infrastructures; low-latency analytics; massive tricklets time series analysis; real-time analytics; real-time processing; smart sensor technologies; sparse dictionary compression; Cities and towns; Databases; Dictionaries; Matching pursuit algorithms; Noise measurement; Sensors; Time series analysis; dictionary compression; matching pursuit; tricklets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004244
Filename :
7004244
Link To Document :
بازگشت