Title :
Eliminating replica selection - using multiple replicas to accelerate data transfer on grids
Author :
Feng, Jun ; Humphrey, Marty
Author_Institution :
Dept. of Comput. Sci., Virginia Univ., USA
Abstract :
Data-intensive, high-performance computing applications often require the efficient transfer of terabytes or even petabytes of data in wide-area, distributed computing environments. To increase the efficiency of wide area data movement, researchers have devised various techniques such as TCP tuning, multiple streams and asynchronous I/O. This paper adopts an approach to increase performance further by exploiting replica-level parallelism in grids. rFTP, a grid data transferring tool, improves the data transfer rate and reliability on grids by utilizing multiple replica sources concurrently. Experiments on the NPACI grid show as much as a 2.02× speedup over a single data source by adaptively retrieving partial data segments from 4 replicas using the data provided by NWS.
Keywords :
electronic data interchange; grid computing; parallel processing; wide area networks; NPACI grid; TCP tuning; asynchronous I/O; data transfer rate; data transfer reliability; distributed computing environment; grid data transferring tool; multiple replicas; multiple streams; partial data segment retrieval; rFTP; replica selection; replica sources; replica-level parallelism; single data source; wide area data movement; wide-area computing environment; Acceleration; Application software; Computer applications; Computer science; Distributed computing; Grid computing; Information retrieval; Leg; Throughput; Weather forecasting;
Conference_Titel :
Parallel and Distributed Systems, 2004. ICPADS 2004. Proceedings. Tenth International Conference on
Print_ISBN :
0-7695-2152-5
DOI :
10.1109/ICPADS.2004.1316115