DocumentCode :
3705420
Title :
Computing data quality indicators on Big Data streams using a CEP
Author :
Wenlu Yang;Alzennyr Da Silva;Marie-Luce Picard
Author_Institution :
Sorbonne Universit?s, UPMC Univ Paris 06, CNRS, LIP6 UMR 7606, 4 place Jussieu 75005, France
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
Big Data is often referred to as the 3Vs: Volume, Velocity and Variety. A 4th V (validity) was introduced to address the quality dimension. Poor data quality can be costly, lead to breaks in processes and invalidate the company´s efforts on regulatory compliance. In order to process data streams in real time, a new technology called CEP (complex event processing) was developed. In France, the current deployment of smart meters will generate massive electricity consumption data. In this work, we developed a diagnostic approach to compute generic quality indicators of smart meter data streams on the fly. This solution is based on Tibco StreamBase CEP. Visualization tools were also developed in order to give a better understanding of the inter-relation between quality issues and geographical/temporal dimensions. According to the application purpose, two visualization methods can be loaded: (1) StreamBase LiveView is used to visualize quality indicators in real time; and (2) a Web application provides a posteriori and geographical analysis of the quality indicators which are plotted on a map within a color scale (lighter colors indicate good quality and darker colors indicate poor quality). In future works, new quality indicators could be added to the solution which can be applied in an operational context in order to monitor data quality from smart meters.
Keywords :
"Smart meters","Data visualization","Smart grids","Real-time systems","Image color analysis","Indexes"
Publisher :
ieee
Conference_Titel :
Computational Intelligence for Multimedia Understanding (IWCIM), 2015 International Workshop on
Type :
conf
DOI :
10.1109/IWCIM.2015.7347061
Filename :
7347061
Link To Document :
بازگشت