Title :
D2T: Doubly Distributed Transactions for High Performance and Distributed Computing
Author :
Lofstead, Jay ; Dayal, Jai ; Schwan, Karsten ; Oldfield, Ron
Abstract :
Current exascale computing projections suggest rather than a monolithic simulation running for the majority of the machine, a collection of components comprising the scientific discovery process will be employed in an online workflow. This move to an online workflow scenario requires knowledge that inter-step operations are completed and correct before the next phase begins. Further, dynamic load balancing or fault tolerance techniques may dynamically deploy or redeploy resources for optimal use of computing resources. These newly configured resources should only be used if they are successfully deployed. Our D2T system offers a mechanism to support these kinds of operations by providing database-like transactions with distributed servers and clients. Ultimately, with adequate hardware support, full ACID compliance is possible for the transactions. To prove the viability of this approach, we show that the D2T protocol has less than 1.2 seconds of overhead using 4096 clients and 32 servers with good scaling characteristics using this initial prototype implementation.
Keywords :
distributed processing; fault tolerant computing; natural sciences computing; protocols; resource allocation; ACID compliance; D2T protocol; database-like transactions; distributed computing; distributed servers; doubly distributed transactions; dynamic load balancing; exascale computing projections; fault tolerance techniques; high performance computing; inter-step operations; monolithic simulation; online workflow; scientific discovery process; Computational modeling; Couplings; Fault tolerance; Fault tolerant systems; Protocols; Scalability; Servers; HPC; IO; IO tuning; data staging; exascale; transaction;
Conference_Titel :
Cluster Computing (CLUSTER), 2012 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2422-9
DOI :
10.1109/CLUSTER.2012.79