DocumentCode :
2109365
Title :
Management of Online Processing Farms in the ATLAS Experiment
Author :
Dobson, Marc ; Malik, Usman Ahmad ; Elejabarrieta, Hegoi Garitaonandia
Author_Institution :
CERN, Geneva
fYear :
2007
fDate :
April 29 2007-May 4 2007
Firstpage :
1
Lastpage :
5
Abstract :
The ATLAS experiment will use of order three thousand nodes for the online processing farms. The administration of such a large cluster is a challenge. The ability to quickly turn on/off machines, especially after a power cut, and the ability to remote monitor the hardware health whether the machine be on or off are some of the major issues. To solve these problems ATLAS has decided wherever possible to use Intelligent Platform Management Interfaces (IPMI) for its nodes. This paper will present the mechanisms which were developed to allow the distribution of management and monitoring commands to many machines. These commands were run simultaneously on the prototype farm, by taking into account the specificities of the different IPMI versions and implementations, and the network topology. Results from timing measurements for the distribution of commands to many nodes, for booting and for shutting down of the nodes will be shown with an extrapolation to the final cluster size.
Keywords :
data mining; high energy physics instrumentation computing; on-off control; position sensitive particle detectors; prototypes; system monitoring; ATLAS experiment; Intelligent Platform Management Interfaces; booting; extrapolation; management distribution; monitoring commands distribution; network topology; on-off machines; online processing farms; prototype farm; remote monitor; Condition monitoring; Energy management; Hardware; Machine intelligence; Personal communication networks; Physics; Pipelines; Prototypes; Remote monitoring; Sensor phenomena and characterization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Real-Time Conference, 2007 15th IEEE-NPSS
Conference_Location :
Batavia, IL
Print_ISBN :
978-1-4244-0866-5
Electronic_ISBN :
978-1-4244-0867-2
Type :
conf
DOI :
10.1109/RTC.2007.4382772
Filename :
4382772
Link To Document :
بازگشت