DocumentCode :
2580145
Title :
The second trans-Pacific Grid Datafarm testbed and experiments for SC2003
Author :
Tatebe, Osamu ; Ogawa, Hirotaka ; Kodoma, Y. ; Kudoh, Tomohiro ; Sekiguchi, Satoshi ; Matsuoka, Satoshi ; Aida, Kento ; Boku, Taisuke ; Sato, Mitsuhisa ; Morita, Youhei ; Kitatsuji, Yoshinori ; Williams, Jim ; Hicks, John
Author_Institution :
National Inst. of Adv. Ind. Sci. & Technol., Japan
fYear :
2004
fDate :
26-30 Jan. 2004
Firstpage :
602
Lastpage :
607
Abstract :
The Grid Datafarm architecture is designed for global petascale data-intensive computing. It provides a global parallel file system (Gfarm file system) with online petascale storage, scalable I/O bandwidth, and scalable parallel processing by federating thousands of local file systems in a grid of clusters securely using Grid security infrastructure. One of features is that it manages file replicas in filesystem metadata for fault tolerance and load balancing. Here, we present an overview of our planned experiment performed as the SC2003 Bandwidth Challenge at the Supercomputing 2003 site in Phoenix, Arizona, USA. In the experiment, five clusters in Japan and three clusters in US comprise a Gfarm file system, on which world-wide largescale data analysis is performed. In the Gfarm file system, a file is dispersed in several cluster nodes, each of which is replicated independently and in parallel by multiple third-party transfers between multiple cluster nodes. For the Challenge, terabyte-scale experimental data is replicated between US and Japan via APAN/TransPAC and SuperSINET (about 10,000 km or 6,000 miles). At the workshop we present the full detail of the experiment.
Keywords :
LAN interconnection; data analysis; file servers; parallel architectures; workstation clusters; Gfarm file system; Grid Datafarm architecture; clusters grid; fault tolerance; filesystem metadata; global parallel file system; global petascale data-intensive computing; grid security infrastructure; load balancing; multiple cluster nodes; online petascale storage; scalable I-O bandwidth; scalable parallel processing; trans-Pacific Grid Datafarm testbed; world-wide largescale data analysis; Bandwidth; Computer architecture; Data security; Fault tolerance; File systems; Grid computing; Parallel processing; Petascale computing; Secure storage; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications and the Internet Workshops, 2004. SAINT 2004 Workshops. 2004 International Symposium on
Print_ISBN :
0-7695-2050-2
Type :
conf
DOI :
10.1109/SAINTW.2004.1268694
Filename :
1268694
Link To Document :
بازگشت