DocumentCode
3215735
Title
Eliminating replica selection - using multiple replicas to accelerate data transfer on grids
Author
Feng, Jun ; Humphrey, Marty
Author_Institution
Dept. of Comput. Sci., Virginia Univ., USA
fYear
2004
fDate
7-9 July 2004
Firstpage
356
Lastpage
366
Abstract
Data-intensive, high-performance computing applications often require the efficient transfer of terabytes or even petabytes of data in wide-area, distributed computing environments. To increase the efficiency of wide area data movement, researchers have devised various techniques such as TCP tuning, multiple streams and asynchronous I/O. This paper adopts an approach to increase performance further by exploiting replica-level parallelism in grids. rFTP, a grid data transferring tool, improves the data transfer rate and reliability on grids by utilizing multiple replica sources concurrently. Experiments on the NPACI grid show as much as a 2.02× speedup over a single data source by adaptively retrieving partial data segments from 4 replicas using the data provided by NWS.
Keywords
electronic data interchange; grid computing; parallel processing; wide area networks; NPACI grid; TCP tuning; asynchronous I/O; data transfer rate; data transfer reliability; distributed computing environment; grid data transferring tool; multiple replicas; multiple streams; partial data segment retrieval; rFTP; replica selection; replica sources; replica-level parallelism; single data source; wide area data movement; wide-area computing environment; Acceleration; Application software; Computer applications; Computer science; Distributed computing; Grid computing; Information retrieval; Leg; Throughput; Weather forecasting;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Systems, 2004. ICPADS 2004. Proceedings. Tenth International Conference on
ISSN
1521-9097
Print_ISBN
0-7695-2152-5
Type
conf
DOI
10.1109/ICPADS.2004.1316115
Filename
1316115
Link To Document