Title :
A Comparison of Data-Access Platforms for the Computing of Large Hadron Collider Experiments
Author :
Bencivenni, M. ; Bonifazi, F. ; Carbone, A. ; Chierici, A. ; D'Apice, A. ; De Girolamo, D. ; Dell'Agnello, L. ; Donatelli, M. ; Donvito, G. ; Fella, A. ; Furano, F. ; Galli, D. ; Ghiselli, A. ; Italiano, A. ; Lo Re, G. ; Marconi, U. ; Martelli, B. ; Mazzu
Author_Institution :
Ist. Naz. di Fis. Nucleare, Bari
fDate :
6/1/2008 12:00:00 AM
Abstract :
Performance, reliability and scalability of data access are key issues for Grid computing and for High Energy Physics data processing and analysis applications, particularly given the large data volumes and I/O load that a Large Hadron Collider data centre has to support. In this paper we present the technical details and the results of a large-scale validation and performance measurement of four data-access platforms, namely CASTOR, dCache, GPFS and Scalla/Xrootd. The tests were performed at the CNAF Tier-1, the central computing facility of the Italian National Institute for Nuclear Physics (INFN). Our storage back-end was based on Fibre Channel disk-servers organized in a Storage Area Network, with the disk-servers connected to the computing farm via Gigabit LAN. We used 24 disk-servers, 260 TB of raw disk space and 280 worker nodes as computing clients, able to run up to about 1100 jobs concurrently. The aim of the tests was to perform sequential and random read/write accesses to the data, as well as more realistic access patterns, in order to evaluate the efficiency, availability, robustness and performance of the various data-access solutions.
Keywords :
data analysis; high energy physics instrumentation computing; information retrieval; CASTOR; GPFS; Large Hadron Collider experiments; Scalla/Xrootd; dCache; data-access platforms; fibre channel disk-servers; grid computing; high energy physics data analysis; high energy physics data processing; random read/write access; sequential read/write access; storage area network; Data analysis; Data processing; Grid computing; High performance computing; Large Hadron Collider; Large-scale systems; Performance analysis; Performance evaluation; Physics computing; Scalability; Computer facilities; computer input-output; data handling; data processing; mass memories; system analysis and design
Journal_Title :
Nuclear Science, IEEE Transactions on
DOI :
10.1109/TNS.2008.924087