DocumentCode
2016435
Title
Hierarchical Caches for Grid Workflows
Author
Chiu, David ; Agrawal, Gagan
Author_Institution
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH
fYear
2009
fDate
18-21 May 2009
Firstpage
228
Lastpage
235
Abstract
From personal software to advanced systems, caching mechanisms have steadfastly been a ubiquitous means for reducing workloads. It is no surprise, then, that under the grid and cluster paradigms, middlewares and other large-scale applications often seek caching solutions. Among these distributed applications, scientific workflow management systems have gained ground towards mitigating the often painstaking process of composing sequences of scientific data sets and services to derive virtual data. In the past, workflow managers have relied on low-level system cache for reuse support. But in distributed query intensive environments, where high volumes of intermediate virtual data can potentially be stored anywhere on the grid, a novel cache structure is needed to efficiently facilitate workflow planning. In this paper, we describe an approach to combat the challenges of maintaining large, fast virtual data caches for workflow composition. A hierarchical structure is proposed for indexing scientific data with spatiotemporal annotations across grid nodes. Our experimental results show that our hierarchical index is scalable and outperforms a centralized indexing scheme by an exponential factor in query intensive environments.
Keywords
cache storage; grid computing; indexing; middleware; natural sciences computing; planning; query processing; workflow management software; workstation clusters; cluster paradigms; distributed query intensive environments; grid workflows; hierarchical caches; middlewares; personal software; scientific data indexing; scientific workflow management systems; spatiotemporal annotations; workflow composition; workflow planning; Application software; Computer science; Data mining; Grid computing; Indexing; Information retrieval; Middleware; Pervasive computing; Spatiotemporal phenomena; Workflow management software; grid workflows; scientific workflows; workflow cache; workflow management;
fLanguage
English
Publisher
ieee
Conference_Titel
Cluster Computing and the Grid, 2009. CCGRID '09. 9th IEEE/ACM International Symposium on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-3935-5
Electronic_ISBN
978-0-7695-3622-4
Type
conf
DOI
10.1109/CCGRID.2009.10
Filename
5071876
Link To Document