DocumentCode :
1914516
Title :
Accelerating Data Movement Leveraging End-System and Network Parallelism
Author :
Jun Yi ; Kettimuthu, Rajkumar ; Vishwanath, Venkatram
fYear :
2012
fDate :
10-16 Nov. 2012
Firstpage :
516
Lastpage :
525
Abstract :
Data volumes produced by simulation, experimental and observational science is rapidly increasing. This data needs to be moved from its source to another resource for analysis, visualization and archival purposes. The destination resource could be either local or remote. The data intensive science is critically dependent upon the high-performance parallel file and storage end systems to read/write and high-speed networks to move their enormous data between local and remote computing and storage facilities. 100 Gigabit per second networks such as DOE´s Advanced Network Initiative (ANI), Internet2´s 100G network represent a major step forward in wide area network performance. Effective utilization of these networks requires substantial and pervasive parallelism, at the file system, end system, and network levels. Additional obstacles such as heterogeneity and time-varying conditions of network and end system arise that, if not adequately addressed, will render high performance storage and network systems extremely under-performed. In this paper, we propose a data movement system that dynamically and adaptively adjusts end systems and networks parallelisms in response to changing conditions of end systems and networks to sustain high-throughput for data transfers. We evaluate our system in multiple settings and show that (1) in a homogeneous configuration, the design can achieve better throughput for light and medium workload than GridFTP and achieve comparable throughput for heavy workload, (2) and in a heterogeneous configuration, the design can achieve several factors higher throughput for all workloads than GridFTP.
Keywords :
Internet; computer network performance evaluation; file organisation; grid computing; parallel processing; storage area networks; ubiquitous computing; wide area networks; ANI; DOE advanced network initiative; GridFTP; Internet2 100G network; data intensive science; data movement leveraging end-system; data volumes; destination resource; experimental science; heterogeneous configuration; high performance network systems; high performance storage systems; high-performance parallel file systems; high-performance parallel storage end systems; high-speed networks; network parallelism; observational science; pervasive parallelism; read-write networks; remote computing facilities; remote storage facilities; substantial parallelism; wide area network performance; Bulk data transfer; Data movement; Parallel data transfer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
Type :
conf
DOI :
10.1109/SC.Companion.2012.74
Filename :
6495856
Link To Document :
بازگشت