DocumentCode :
1565662
Title :
A "flight data recorder" for enabling full-system multiprocessor deterministic replay
Author :
Xu, Min ; Bodik, Rastislav ; Hill, Mark D.
Author_Institution :
Comput. Sci. Dept. & ECE Dept., Wisconsin Univ., Madison, WI, USA
fYear :
2003
Firstpage :
122
Lastpage :
133
Abstract :
Debuggers have been proven indispensable in improving software reliability. Unfortunately, on most real-life software, debuggers fail to deliver their most essential feature - a faithful replay of the execution. The reason is nondeterminism caused by multithreading and nonrepeatable inputs. A common solution to faithful replay has been to record the nondeterministic execution. Existing recorders, however, either work only for data-race-free programs or have prohibitive overhead. As a step towards powerful debugging, we develop a practical low-overhead hardware recorder for cache-coherent multiprocessors, called flight data recorder (FDR). Like an aircraft flight data recorder, FDR continuously records the execution, even on deployed systems, logging the execution for post-mortem analysis. FDR is practical because it piggybacks on the cache coherence hardware and logs nearly the minimal thread-ordering information necessary to faithfully replay the multiprocessor execution. Our studies, based on simulating a four-processor server with commercial workloads, show that when allocated less than 7% of system´s physical memory, our FDR design can capture the last one second of the execution at modest (less than 2%) slowdown.
Keywords :
data recording; multi-threading; multiprocessing systems; performance evaluation; program debugging; software reliability; system monitoring; FDR; cache coherent multiprocessor; data race free program; flight data recorder; full-system multiprocessor deterministic replay; multithreading; post mortem analysis; software debugging; software reliability; Bandwidth; Clocks; Computer architecture; Concrete; Core dumps; Hardware; Image restoration; Protocols; Software performance; Timing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 2003. Proceedings. 30th Annual International Symposium on
ISSN :
1063-6897
Print_ISBN :
0-7695-1945-8
Type :
conf
DOI :
10.1109/ISCA.2003.1206994
Filename :
1206994
Link To Document :
بازگشت