• DocumentCode
    2109365
  • Title
    Management of Online Processing Farms in the ATLAS Experiment
  • Author
    Dobson, Marc ; Malik, Usman Ahmad ; Elejabarrieta, Hegoi Garitaonandia
  • Author_Institution
    CERN, Geneva
  • fYear
    2007
  • fDate
    April 29 2007-May 4 2007
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    The ATLAS experiment will use on the order of three thousand nodes for its online processing farms. Administering such a large cluster is a challenge. Major issues include the ability to quickly switch machines on and off, especially after a power cut, and the ability to remotely monitor hardware health whether a machine is on or off. To solve these problems, ATLAS has decided to use the Intelligent Platform Management Interface (IPMI) for its nodes wherever possible. This paper presents the mechanisms developed to distribute management and monitoring commands to many machines. These commands were run simultaneously on the prototype farm, taking into account the specificities of the different IPMI versions and implementations, as well as the network topology. Results from timing measurements for the distribution of commands to many nodes, and for booting and shutting down the nodes, are shown with an extrapolation to the final cluster size.
  • Keywords
    data mining; high energy physics instrumentation computing; on-off control; position sensitive particle detectors; prototypes; system monitoring; ATLAS experiment; Intelligent Platform Management Interfaces; booting; extrapolation; management distribution; monitoring commands distribution; network topology; on-off machines; online processing farms; prototype farm; remote monitor; Condition monitoring; Energy management; Hardware; Machine intelligence; Personal communication networks; Physics; Pipelines; Prototypes; Remote monitoring; Sensor phenomena and characterization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Real-Time Conference, 2007 15th IEEE-NPSS
  • Conference_Location
    Batavia, IL
  • Print_ISBN
    978-1-4244-0866-5
  • Electronic_ISBN
    978-1-4244-0867-2
  • Type
    conf
  • DOI
    10.1109/RTC.2007.4382772
  • Filename
    4382772