DocumentCode
1689763
Title
Trace recovery: a distributed computing application for perturbation tracking
Author
Gannon, J.A. ; Williams, K.J. ; Andersland, M.S. ; Casavant, T.L.
Author_Institution
Dept. of Electr. & Comput. Eng., Iowa Univ., Iowa City, IA, USA
Volume
3
fYear
1994
Firstpage
2621
Abstract
Execution monitoring plays a central role in most software development tools for parallel and distributed computer systems. However, such monitoring may induce delays that corrupt event timing. In this paper we introduce a perturbation analyses-like algorithm that, given a safe timed Petri net model of the monitored software, can recover the uncorrupted event timings, i.e., those that would have been observed had the delays not been present. Monitoring conditions sufficient to ensure correct operation of the algorithm, and examples illustrating the algorithm´s applicability to message-passing systems are also presented. This is part of a larger effort aimed at identifying cost-effective software alternatives to hardware monitoring
Keywords
Petri nets; distributed processing; message passing; program diagnostics; software tools; system recovery; cost-effective software; delays; distributed computing; event timing; execution monitoring; hardware monitoring; message-passing systems; monitored software; parallel systems; perturbation analyses; perturbation tracking; safe timed Petri net model; software development tools; trace recovery; uncorrupted event timings; Algorithm design and analysis; Application software; Computerized monitoring; Concurrent computing; Delay; Distributed computing; Programming; Software algorithms; Software safety; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 1994., Proceedings of the 33rd IEEE Conference on
Conference_Location
Lake Buena Vista, FL
Print_ISBN
0-7803-1968-0
Type
conf
DOI
10.1109/CDC.1994.411541
Filename
411541
Link To Document