• DocumentCode
    3525158
  • Title

    Dynamically Scaling Apache Storm for the Analysis of Streaming Data

  • Author

    van der Veen, Jan Sipke ; Van Der Waaij, Bram ; Lazovik, Elena ; Wijbrandi, Wilco ; Meijer, Robert J.

  • Author_Institution
    TNO, Groningen, Netherlands
  • fYear
    2015
  • fDate
    March 30 2015-April 2 2015
  • Firstpage
    154
  • Lastpage
    161
  • Abstract
    Stream processing platforms allow applications to analyse incoming data continuously. Several use cases exist that make use of these capabilities, ranging from monitoring of physical infrastructures to pre selecting video surveillance feeds for human inspection. It is difficult to predict how much computing resources are needed for these stream processing platforms, because the volume and velocity of input data may vary over time. The open source Apache Storm software provides a framework for developers to build processing applications that use the computing resources of all machines within an established cluster. Because of the varying processing needs of such applications, the platform should be able to automatically grow and shrink as needed. Unfortunately, the current Storm platform does not provide this capability. In this paper we describe the design and implementation of a tool that monitors several aspects of the Storm platform, the applications running on top of it, and external systems such as queues and databases. Based on this information, the tool decides whether extra servers are needed or machines may be decommissioned from the cluster, and then acts upon it by actually creating new virtual machines or shutting them down.
  • Keywords
    data analysis; inspection; video surveillance; virtual machines; Apache storm; human inspection; physical infrastructures; stream processing platforms; streaming data analysis; video surveillance; virtual machines; Fasteners; Measurement; Monitoring; Software; Storms; Topology; Virtual machining; Analysis Platform; Apache Storm; Big Data; Cloud Computing; Dynamic Scaling; Elasticity; Streaming Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data Computing Service and Applications (BigDataService), 2015 IEEE First International Conference on
  • Conference_Location
    Redwood City, CA
  • Type

    conf

  • DOI
    10.1109/BigDataService.2015.56
  • Filename
    7184876