DocumentCode :
1033382
Title :
Failure related dataflow dynamics in a highly parallel processor for L1 triggering
Author :
Cancelo, G. ; Gottschalk, E. ; Pavlicek, V. ; Wang, M. ; Wu, J.
Author_Institution :
Fermilab Nat. Accel. Lab., Batavia, IL, USA
Volume :
51
Issue :
3
fYear :
2004
fDate :
6/1/2004 12:00:00 AM
Firstpage :
1158
Lastpage :
1162
Abstract :
This paper studies how processor failures affect the dataflow of the Level 1 Trigger in the BTeV experiment proposed to run at Fermilab´s Tevatron. The failure analysis is crucial for a system with over 2500 processing nodes and a number of storage units and communication links of the same order of magnitude. This paper is based on models of the L1 Trigger architecture and shows the dynamics of the architecture´s dataflow. The dataflow analysis provides insight into how system variables are affected by single component failures and provides key information to the implementation of error recovery strategies. The analysis includes both short-term failures from which the system can recover quickly and long-term failures which imply a more drastic error-recovery strategy. The modeling results are supported by behavioral simulations of the L1 Trigger processing BTeV´s GEANT Monte Carlo data.
Keywords :
Monte Carlo methods; data flow analysis; high energy physics instrumentation computing; parallel processing; semiconductor counters; trigger circuits; BTeV experiment; GEANT Monte Carlo data; L1 trigger architecture; architectures dataflow; communication links; error recovery strategies; failure related dataflow dynamics; parallel processor; processing nodes; single component failures; storage units; Data analysis; Detectors; Failure analysis; Fault tolerant systems; Hardware; Information analysis; Mesons; Queueing analysis; Steady-state; Switches;
fLanguage :
English
Journal_Title :
Nuclear Science, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9499
Type :
jour
DOI :
10.1109/TNS.2004.829653
Filename :
1312034
Link To Document :
بازگشت