Title :
Latency tolerance through parallelization of time in scientific applications
Author :
Srinivasan, Ashok ; Chandra, Namas
Author_Institution :
Dept. of Comput. Sci., Florida State Univ., Tallahassee, FL, USA
Abstract :
Summary form only given. Distributed computing environments, such as the grid, promise enormous raw computational power, but involve high communication overheads. It is therefore considered that they are ideally suited for "embarrassingly parallel" applications, such as Monte Carlo, and for certain applications where the loosely-coupled nature of the science involved in the simulations leads to a coarse grained computation. In a typical application, this is not feasible. We discuss our solution strategy, based on scalable functional decomposition, which can be used to keep the computation coarse grained, even on a large number of processors. Such decomposition can be attempted through a variety of means. We discuss the use of time parallelization to achieve this. We demonstrate results with a model problem, and then discuss its implementation for an important problem in nanomaterials simulation. We also show that this technique can be extended to make it inherently fault-tolerant.
Keywords :
communication complexity; fault tolerant computing; grid computing; parallel processing; communication overhead; distributed computing; fault-tolerant computing; grid computing; latency tolerance; nanomaterials simulation; scalable functional decomposition; scientific applications; time parallelization; Computational efficiency; Concurrent computing; Costs; Delay; Differential equations; Distributed computing; Distributed processing; Grid computing; Parallel processing; Stability;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1303067