DocumentCode
2721349
Title
Enhancing I/O throughput via efficient routing and placement for large-scale parallel file systems
Author
Dillow, David A. ; Shipman, Galen M. ; Oral, Sarp ; Zhang, Zhe ; Kim, Youngjae
fYear
2011
fDate
17-19 Nov. 2011
Firstpage
1
Lastpage
9
Abstract
As storage systems get larger to meet the demands of petascale systems, careful planning must be applied to avoid congestion points and extract the maximum performance. In addition, the large data sets generated by such systems makes it desirable for all compute resources to have common access to this data without needing to copy it to each machine. This paper describes a method of placing I/O close to the storage nodes to minimize contention on Cray´s SeaStar2+ network, and extends it to a routed Lustre configuration to gain the same benefits when running against a center-wide file system. Our experiments using half of the resources of Spider - the center-wide file system at the Oak Ridge Leadership Computing Facility - show that I/O write bandwidth can be improved by up to 45% (from 71.9 to 104 GB/s) for a direct-attached configuration and by 137% (47.6 GB/s to 115 GB/s) for a routed configuration. We demonstrated up to 20.7% reduction in run-time for production scientific applications. With the full Spider system, we demonstrated over 240 GB/s of aggregate bandwidth using our techniques.
Keywords
input-output programs; parallel machines; parallel processing; Cray SeaStar2+ network; I/O throughput; I/O write bandwidth; center wide file system; congestion points; full Spider system; large scale parallel file systems; oak ridge leadership computing facility; petascale systems; production scientific applications; routed Lustre configuration; routed configuration; Aggregates; Bandwidth; Fabrics; Portals; Routing; Servers; Testing; Lustre file systems; Network congestion; SeaStar network; Spider;
fLanguage
English
Publisher
ieee
Conference_Titel
Performance Computing and Communications Conference (IPCCC), 2011 IEEE 30th International
Conference_Location
Orlando, FL
ISSN
1097-2641
Print_ISBN
978-1-4673-0010-0
Type
conf
DOI
10.1109/PCCC.2011.6108062
Filename
6108062
Link To Document