DocumentCode
3340192
Title
Simulations of distributed systems in a computing centre
Author
Dongiovanni, D. ; Ronchieri, E. ; Pra, S. Dal ; Agnello, L. Dell ; Ferrari, T. ; Antonelli, S. ; Cavalli, A. ; Gregori, D. ; Martelli, B. ; Prosperini, A. ; Ricci, P.P.
Author_Institution
Nat. Inst. of Nucl. Phys., INFN-CNAF, Bologna, Italy
fYear
2009
fDate
Oct. 24 2009-Nov. 1 2009
Firstpage
1048
Lastpage
1055
Abstract
Modern computing centres addressed to High Energy Physics user communities have to deal with rapidly hardware and software systems evolution. These centres normally face a variety of problems associated with the dimensioning and configuration of several services which must satisfy performance targets under different usage patterns. Therefore, the identification of key variables and the estimation of their impact on services performances is challenging. For example, given an hardware-software configuration for a considered service, how will service performance vary in relation to user dependent settings? Will it be able to support a certain number of requests per minute over the common parameter ranges? In addition, it is difficult to generalize the impact of same settings over different usage scenarios. Therefore, the design of a mathematical model able to relate services performance to key variables in the user computing patterns under common hardware-software settings can help to optimize the exploitation of computing resources. In the present work, starting from the analysis of a typical job of ATLAS as representative HEP user communities, we focus on how local data movement operations use hardware-software resources of INFN-CNAF computing centre and which variables affect data movement performances. As a result of this framework analysis we identify GridFTP protocol and GPFS data storage as core services whose performance to study in depen-dancy of typical user defined variables. We therefore decompose data movement commands in operations of increasing complexity i.e., cp, globus-url-copy with and without network, and after defining the mean throughput per file per unit size as target metric, we conduct a quantitative analysis of the contribution and relevance of considered variables across explored usage scenarios. Finally, we conduct a qualitative fit analysis of the behaviour of chosen throughput metric as a function of relevant user dependent variables. For - - each scenario and for each variables a best fit model function is selected according to R-square goodness of fit index.
Keywords
data handling; high energy physics instrumentation computing; ATLAS; GPFS data storage; GridFTP protocol; HEP user communities; INFN-CNAF computing centre; R-square goodness; cp; data movement operations; distributed system simulations; fit index; framework analysis; globus-url-copy; hardware-software configuration; high energy physics; mathematical model; mean throughput; qualitative fit analysis; quantitative analysis; Computational modeling; Design optimization; Distributed computing; Hardware; Mathematical model; Performance analysis; Physics computing; Protocols; Software systems; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
Nuclear Science Symposium Conference Record (NSS/MIC), 2009 IEEE
Conference_Location
Orlando, FL
ISSN
1095-7863
Print_ISBN
978-1-4244-3961-4
Electronic_ISBN
1095-7863
Type
conf
DOI
10.1109/NSSMIC.2009.5402425
Filename
5402425
Link To Document