DocumentCode :
1999951
Title :
I/O Containers: Managing the Data Analytics and Visualization Pipelines of High End Codes
Author :
Dayal, Jai ; Jianting Cao ; Eisenhauer, Greg ; Schwan, Karsten ; Wolf, Michael ; Fang Zheng ; Abbasi, Hasan ; Klasky, Scott ; Podhorszki, Norbert ; Lofstead, Jay
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
2015
Lastpage :
2024
Abstract :
Lack of I/O scalability is known to cause measurable slowdowns for large-scale scientific applications running on high end machines. This is prompting researchers to devise ´I/O staging´ methods in which outputs are processed via online analysis and visualization methods to support desired science outcomes. Organized as online workflows and carried out in I/O pipelines, these analysis components run concurrently with science simulations, often using a smaller set of nodes on the high end machine termed ´staging areas´. This paper presents a new approach to dealing with several challenges arising for such online analytics, including: how to efficiently run multiple analytics components on staging area resources providing them with the levels of end-to-end performance they need and how to manage staging resources when analytics actions change due to user or data-dependent behavior. Our approach designs and implements middleware constructs that delineate and manage I/O pipeline resources called ´I/O Containers´. Experimental evaluations of containers with realistic scientific applications demonstrate the feasibility and utility of the approach.
Keywords :
data analysis; data visualisation; middleware; pipeline processing; resource allocation; software performance evaluation; I-O pipeline resource management; I-O scalability; I-O staging methods; I/O containers; data analytics; data visualization pipelines; data-dependent behavior; end-to-end performance; high end codes; high end machine; high end machines; large-scale scientific applications; middleware constructs; online analysis methods; online visualization methods; online workflows; staging area resources; Analytical models; Computational modeling; Containers; Data models; Data visualization; Monitoring; Pipelines; Data Analytics; Data Staging; Runtime Management; Scalable I/O; Visualization; in-Situ; resource sharing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.198
Filename :
6651106
Link To Document :
بازگشت