• DocumentCode
    3340192
  • Title

    Simulations of distributed systems in a computing centre

  • Author

    Dongiovanni, D. ; Ronchieri, E. ; Pra, S. Dal ; Agnello, L. Dell ; Ferrari, T. ; Antonelli, S. ; Cavalli, A. ; Gregori, D. ; Martelli, B. ; Prosperini, A. ; Ricci, P.P.

  • Author_Institution
    Nat. Inst. of Nucl. Phys., INFN-CNAF, Bologna, Italy
  • fYear
    2009
  • fDate
    Oct. 24 2009-Nov. 1 2009
  • Firstpage
    1048
  • Lastpage
    1055
  • Abstract
    Modern computing centres addressed to High Energy Physics user communities have to deal with rapidly hardware and software systems evolution. These centres normally face a variety of problems associated with the dimensioning and configuration of several services which must satisfy performance targets under different usage patterns. Therefore, the identification of key variables and the estimation of their impact on services performances is challenging. For example, given an hardware-software configuration for a considered service, how will service performance vary in relation to user dependent settings? Will it be able to support a certain number of requests per minute over the common parameter ranges? In addition, it is difficult to generalize the impact of same settings over different usage scenarios. Therefore, the design of a mathematical model able to relate services performance to key variables in the user computing patterns under common hardware-software settings can help to optimize the exploitation of computing resources. In the present work, starting from the analysis of a typical job of ATLAS as representative HEP user communities, we focus on how local data movement operations use hardware-software resources of INFN-CNAF computing centre and which variables affect data movement performances. As a result of this framework analysis we identify GridFTP protocol and GPFS data storage as core services whose performance to study in depen-dancy of typical user defined variables. We therefore decompose data movement commands in operations of increasing complexity i.e., cp, globus-url-copy with and without network, and after defining the mean throughput per file per unit size as target metric, we conduct a quantitative analysis of the contribution and relevance of considered variables across explored usage scenarios. Finally, we conduct a qualitative fit analysis of the behaviour of chosen throughput metric as a function of relevant user dependent variables. For - - each scenario and for each variables a best fit model function is selected according to R-square goodness of fit index.
  • Keywords
    data handling; high energy physics instrumentation computing; ATLAS; GPFS data storage; GridFTP protocol; HEP user communities; INFN-CNAF computing centre; R-square goodness; cp; data movement operations; distributed system simulations; fit index; framework analysis; globus-url-copy; hardware-software configuration; high energy physics; mathematical model; mean throughput; qualitative fit analysis; quantitative analysis; Computational modeling; Design optimization; Distributed computing; Hardware; Mathematical model; Performance analysis; Physics computing; Protocols; Software systems; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Nuclear Science Symposium Conference Record (NSS/MIC), 2009 IEEE
  • Conference_Location
    Orlando, FL
  • ISSN
    1095-7863
  • Print_ISBN
    978-1-4244-3961-4
  • Electronic_ISBN
    1095-7863
  • Type

    conf

  • DOI
    10.1109/NSSMIC.2009.5402425
  • Filename
    5402425