• DocumentCode
    1999717
  • Title

    Resource Provisioning for Staging Components

  • Author

    Tuan Anh Nguyen ; Eisenhauer, Greg ; Schwan, Karsten ; Wolf, Michael ; Abbasi, Hasan ; Klasky, Scott ; Podhorszki, Norbert

  • Author_Institution
    Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    1947
  • Lastpage
    1953
  • Abstract
    To deal with the inordinate output data volumes of current and future high end simulations, researchers are turning to online methods in which multiple software components that implement desired data analytics and visualization are run on ´staging resources´ of the petascale machine, concurrently and coupled with the simulations producing these outputs. Efficient online execution of data analytics ´in the output stream´, however, requires careful provisioning of staging resources, to obtain delays for analytics processing that prevent applications from blocking on stalled output, while also bounding total required staging resources. This paper addresses the ´staging provisioning´ problem, assuming sets of components arranged as potentially multiple analytics/output pipelines that differ in runtime behavior and resource requirements. For such configurations, it then meets the throughput constraint of online analytics while also minimizing end-to-end pipeline latency, all based on runtime observations and predictions of component performance. Experimental evaluations demonstrate the algorithm´s utility. Its complexity for minimizing latency without violating throughput constraints is O(M), where M is the number of components in the staging area.
  • Keywords
    computational complexity; data visualisation; minimisation; object-oriented programming; resource allocation; data analytics; data visualization; end-to-end pipeline latency; inordinate output data volumes; latency minimization; multiple software components; resource provisioning; runtime observations; staging provisioning problem; staging resources; Analytical models; Data models; Pipelines; Principal component analysis; Program processors; Runtime; Throughput; resource management; resource provisioning; staging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
  • Conference_Location
    Cambridge, MA
  • Print_ISBN
    978-0-7695-4979-8
  • Type

    conf

  • DOI
    10.1109/IPDPSW.2013.152
  • Filename
    6651098