• DocumentCode
    1791647
  • Title

    Bridging high velocity and high volume industrial big data through distributed in-memory storage & analytics

  • Author

    Williams, Jenny Weisenberg ; Aggour, Kareem S. ; Interrante, John ; McHugh, Justin ; Pool, Eric

  • Author_Institution
    Knowledge Discovery Lab., GE Global Res., Niskayuna, NY, USA
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    932
  • Lastpage
    941
  • Abstract
    With an exponential increase in time series sensor data generated by an ever-growing number of sensors on industrial equipment, new systems are required to efficiently store and analyze this “Industrial Big Data.” To actively monitor industrial equipment there is a need to process large streams of high velocity time series sensor data as it arrives, and then store that data for subsequent analysis. Historically, separate systems would meet these needs, with neither system having the ability to perform fast analytics incorporating both just-arrived and historical data. In-memory data grids are a promising technology that can support both near real-time analysis and mid-term storage of big datasets, bridging the gap between high velocity and high volume big time series sensor data. This paper describes the development of a prototype infrastructure with an in-memory data grid at its core to analyze high velocity (>100,000 points per second), high volume (TB´s) time series data produced by a fleet of gas turbines monitored at GE Power & Water´s Remote Monitoring & Diagnostics Center.
  • Keywords
    Big Data; distributed memory systems; gas turbines; production engineering computing; sensors; time series; GE Power and Water´s Remote Monitoring and Diagnostics Center; big datasets; distributed in-memory analytics; distributed in-memory storage; gas turbines; high velocity analysis; high velocity big time series sensor data; high velocity industrial big data; high velocity time series sensor data; high volume big time series sensor data; high volume industrial big data; in-memory data grids; industrial equipment; mid-term storage; near real-time analysis; sensors; time series data; Big data; Data structures; Distributed databases; Hardware; Memory management; Real-time systems; Time series analysis; big data; distributed computing; in-memory data grids; remote monitoring and diagnostics; time series data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (Big Data), 2014 IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Type

    conf

  • DOI
    10.1109/BigData.2014.7004325
  • Filename
    7004325