Title :
On-demand file staging system for Linux clusters
Author :
Hori, Atsushi ; Kamoshida, Yoshikazu ; Matsuba, Hiroya ; Ohta, Kazuki ; Yasui, Takashi ; Sumimoto, Shinji ; Ishikawa, Yutaka
Author_Institution :
Inf. Technol. Center, Univ. of Tokyo, Tokyo, Japan
fDate :
Aug. 31 2009-Sept. 4 2009
Abstract :
An on-demand file staging system, Catwalk, is proposed. Catwalk is designed so that it can run on any Linux clusters without any special or additional hardware. By having hook functions on the system calls of file operations, a file staging system can be transparent from the view of users, and users can be free from having wrong file staging scripts. In Catwalk, the file copying is done via normal TCP protocol so that Catwalk can run over ordinary, widely-used Ethernet. The stage-in file copy is pipelined to maximize the bandwidth from single file server. The performance of Catwalk is evaluated and compared with NFS using synthetic but realistic workloads. The evaluations show the stage-in performance with the pipeline technique is much better than the performance of NFS. The stage-out performance is comparable with the NFS performance despite the extra copying of files, and the file server is lightly loaded with the Catwalk stage-out while NFS entails much heavier server loads. The biggest problems of NFS are its centralized design and lack of scheduling for the parallel workloads. The performance of Catwalk shows that remote file access performance can be improved much better if file accesses are scheduled in a proper way. Thus the proposed file staging system can be a strong complement to NFS, especially for small clusters often having no dedicated parallel file system.
Keywords :
Linux; file organisation; file servers; local area networks; transport protocols; Catwalk; Ethernet; Linux clusters; TCP protocol; file server; on-demand file staging system; parallel file system; system calls; Ethernet networks; File servers; File systems; Hardware; Information science; Information technology; Laboratories; Linux; Processor scheduling; Protocols; cluster file system; file staging; network file system; parallel file system;
Conference_Titel :
Cluster Computing and Workshops, 2009. CLUSTER '09. IEEE International Conference on
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-5011-4
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2009.5289189