DocumentCode
668132
Title
FlexQuery: An online query system for interactive remote visual data exploration at large scale
Author
Zou, Hongbo ; Schwan, Karsten ; Slawinska, Magdalena ; Wolf, Michael ; Eisenhauer, Greg ; Zheng, Fang ; Dayal, Jai ; Logan, J. ; Liu, Qing ; Klasky, Scott ; Bode, Tanja ; Clark, Matthew ; Kinsey, Matt
Author_Institution
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2013
fDate
23-27 Sept. 2013
Firstpage
1
Lastpage
8
Abstract
The remote visual exploration of live data generated by scientific simulations is useful for scientific discovery, performance monitoring, and online validation of simulation results. Online visualization methods are challenged, however, by the continued growth in the volume of simulation output data that must be transferred from its source (the simulation running on the high-end machine) to where it is analyzed, visualized, and displayed. A specific challenge in this context is the limited communication bandwidth between data sources and sinks. Previous work places queries 'near' data sources, exploiting their data reduction capabilities, but such work does not address the common scenario in which scientists issue multiple different queries on the data being produced. This paper considers the general case in which science users are interested in different (sub)sets of the data produced by a high-end simulation. We offer FlexQuery, an online data query system that can deploy and execute data queries 'along' the I/O and analytics pipelines. FlexQuery carefully extends such analytics pipelines, using online performance monitoring and data location tracking, to realize data queries in ways that minimize additional data movement and offer low latency in data query execution. Using a real-world scientific application (the Maya astrophysics code and its analytics workflow), we demonstrate FlexQuery's ability to dynamically deploy queries for low-latency remote data visualization.
Keywords
data visualisation; query processing; FlexQuery system; Maya astrophysics code; analytics workflow; communication bandwidth; data location tracking; data movement; data reduction capabilities; data sink; data source; interactive remote visual data exploration; low-latency remote data visualization; online data query system; online performance monitoring; online visualization methods; scientific discovery; scientific simulations; simulation output data; Bandwidth; Contracts; Data models; Data visualization; Engines; Monitoring; Pipelines; data reduction; online query; remote visualization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Cluster Computing (CLUSTER), 2013 IEEE International Conference on
Conference_Location
Indianapolis, IN
Type
conf
DOI
10.1109/CLUSTER.2013.6702635
Filename
6702635