Title :
Research and Application of Fault-Tolerance Based on Watershed Model Grid Platform
Author :
Wang, Zhijian ; Shang, Ling ; Xu, Feng
Author_Institution :
Coll. of Comput. & Inf. Eng., Hohai Univ., Nanjing
Abstract :
A systematic scheme to form the watershed computational platform was developed based on lightweight Grid technique in this paper. The scheme that takes advantage of widely deployed local network makes full use of the non dedicated distributed computing resources. To overcome the vagary of overall system, MPICH-T a trust model based fault tolerant model was adopted, and the checkpoint based on pessimistic log can ensure that process repeats in single node and task migration on multi-nodes. and the transplant of system is guaranteed on the watershed model Grid platform, lastly several experiments were made on this platform and the results show that this platform has better performance though has a slightly time delay and the fault-tolerance mechanism based on MPICHT model is a nice choice suiting to the watershed model Grid platform.
Keywords :
distributed algorithms; fault tolerance; grid computing; water resources; MPICHT model; fault tolerance; local network; nondedicated distributed computing resources; watershed model grid platform; Computer networks; Costs; Delay effects; Distributed computing; Fault tolerance; Fault tolerant systems; Grid computing; Large-scale systems; Personal communication networks; Water resources; Grid platform; fault tolerance; water resources computing; watershed model;
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
DOI :
10.1109/CSSE.2008.444