DocumentCode
1791647
Title
Bridging high velocity and high volume industrial big data through distributed in-memory storage & analytics
Author
Williams, Jenny Weisenberg ; Aggour, Kareem S. ; Interrante, John ; McHugh, Justin ; Pool, Eric
Author_Institution
Knowledge Discovery Lab., GE Global Res., Niskayuna, NY, USA
fYear
2014
fDate
27-30 Oct. 2014
Firstpage
932
Lastpage
941
Abstract
With an exponential increase in time series sensor data generated by an ever-growing number of sensors on industrial equipment, new systems are required to efficiently store and analyze this “Industrial Big Data.” To actively monitor industrial equipment there is a need to process large streams of high velocity time series sensor data as it arrives, and then store that data for subsequent analysis. Historically, separate systems would meet these needs, with neither system having the ability to perform fast analytics incorporating both just-arrived and historical data. In-memory data grids are a promising technology that can support both near real-time analysis and mid-term storage of big datasets, bridging the gap between high velocity and high volume big time series sensor data. This paper describes the development of a prototype infrastructure with an in-memory data grid at its core to analyze high velocity (>100,000 points per second), high volume (TB´s) time series data produced by a fleet of gas turbines monitored at GE Power & Water´s Remote Monitoring & Diagnostics Center.
Keywords
Big Data; distributed memory systems; gas turbines; production engineering computing; sensors; time series; GE Power and Water´s Remote Monitoring and Diagnostics Center; big datasets; distributed in-memory analytics; distributed in-memory storage; gas turbines; high velocity analysis; high velocity big time series sensor data; high velocity industrial big data; high velocity time series sensor data; high volume big time series sensor data; high volume industrial big data; in-memory data grids; industrial equipment; mid-term storage; near real-time analysis; sensors; time series data; Big data; Data structures; Distributed databases; Hardware; Memory management; Real-time systems; Time series analysis; big data; distributed computing; in-memory data grids; remote monitoring and diagnostics; time series data;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location
Washington, DC
Type
conf
DOI
10.1109/BigData.2014.7004325
Filename
7004325
Link To Document