DocumentCode
1064907
Title
Dynamic Error Recovery in the ATLAS TDAQ System
Author
Sloper, John Erik ; Miotto, Giovanna Lehmann ; Hines, Evor
Author_Institution
CERN, Geneva
Volume
55
Issue
1
fYear
2008
Firstpage
405
Lastpage
410
Abstract
This paper describes the new dynamic recovery mechanisms in the ATLAS Trigger and DataAcQuisition (TDAQ) system. The purpose of the new recovery mechanism is to minimize the impact certain errors and failures have on the system. The new recovery mechanisms are capable of analyzing and recovering from a variety of errors, both software and hardware, without stopping the data-gathering operations. An expert system is incorporated to perform the analysis of the errors and to decide what measures are needed. Due to the wide array of sub-systems there is also a need to optimize the way similar errors are handled for the different sub-systems. The main focus of the paper is to consider the design and implementation of the new recovery mechanisms and how expert knowledge is gathered from the different sub-systems and implemented in the recovery procedures.
Keywords
data acquisition; high energy physics instrumentation computing; nuclear electronics; position sensitive particle detectors; system recovery; trigger circuits; ATLAS trigger system; TDAQ; dynamic error recovery; trigger data acquisition system; Detectors; Erbium; Error analysis; Expert systems; Hardware; Helium; Performance analysis; Performance evaluation; Production systems; ATLAS; TDAQ; error recovery; expert system;
fLanguage
English
Journal_Title
Nuclear Science, IEEE Transactions on
Publisher
ieee
ISSN
0018-9499
Type
jour
DOI
10.1109/TNS.2007.913472
Filename
4448537
Link To Document