Title :
Simulations of distributed systems in a computing centre
Author :
Dongiovanni, D. ; Ronchieri, E. ; Pra, S. Dal ; Agnello, L. Dell ; Ferrari, T. ; Antonelli, S. ; Cavalli, A. ; Gregori, D. ; Martelli, B. ; Prosperini, A. ; Ricci, P.P.
Author_Institution :
Nat. Inst. of Nucl. Phys., INFN-CNAF, Bologna, Italy
fDate :
Oct. 24 2009-Nov. 1 2009
Abstract :
Modern computing centres addressed to High Energy Physics user communities have to deal with rapidly hardware and software systems evolution. These centres normally face a variety of problems associated with the dimensioning and configuration of several services which must satisfy performance targets under different usage patterns. Therefore, the identification of key variables and the estimation of their impact on services performances is challenging. For example, given an hardware-software configuration for a considered service, how will service performance vary in relation to user dependent settings? Will it be able to support a certain number of requests per minute over the common parameter ranges? In addition, it is difficult to generalize the impact of same settings over different usage scenarios. Therefore, the design of a mathematical model able to relate services performance to key variables in the user computing patterns under common hardware-software settings can help to optimize the exploitation of computing resources. In the present work, starting from the analysis of a typical job of ATLAS as representative HEP user communities, we focus on how local data movement operations use hardware-software resources of INFN-CNAF computing centre and which variables affect data movement performances. As a result of this framework analysis we identify GridFTP protocol and GPFS data storage as core services whose performance to study in depen-dancy of typical user defined variables. We therefore decompose data movement commands in operations of increasing complexity i.e., cp, globus-url-copy with and without network, and after defining the mean throughput per file per unit size as target metric, we conduct a quantitative analysis of the contribution and relevance of considered variables across explored usage scenarios. Finally, we conduct a qualitative fit analysis of the behaviour of chosen throughput metric as a function of relevant user dependent variables. For - - each scenario and for each variables a best fit model function is selected according to R-square goodness of fit index.
Keywords :
data handling; high energy physics instrumentation computing; ATLAS; GPFS data storage; GridFTP protocol; HEP user communities; INFN-CNAF computing centre; R-square goodness; cp; data movement operations; distributed system simulations; fit index; framework analysis; globus-url-copy; hardware-software configuration; high energy physics; mathematical model; mean throughput; qualitative fit analysis; quantitative analysis; Computational modeling; Design optimization; Distributed computing; Hardware; Mathematical model; Performance analysis; Physics computing; Protocols; Software systems; Throughput;
Conference_Titel :
Nuclear Science Symposium Conference Record (NSS/MIC), 2009 IEEE
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-3961-4
Electronic_ISBN :
1095-7863
DOI :
10.1109/NSSMIC.2009.5402425