Title :
Debugging parallel programs using incomplete information
Author :
Huband, Simon ; McDonald, Chris
Author_Institution :
Dept. of Comput. Sci., Western Australia Univ., Nedlands, WA, Australia
Abstract :
Many parallel programs employ regular topological structures to support their computation. This topological information is exploitable in the debugging process. Communications not normally part of a topology ones that are either missing or unexpected, are immediately recognisable. Furthermore, animations used to assist the debugging may be enhanced by arranging representations of the executing tasks with reference to the program´s topology. However direct topology support is lacking in many environments, including workstation clusters, where popular language extensions such as the Parallel Virtual Machine (PVM) and the Message Passing Interface (MPI) are common. Programmers are required to implement topology support themselves. Moreover debugger support that exploits topological information is lacking; without explicit knowledge, determining a program´s topology is difficult. This paper presents a methodology to identify program topologies using only standard trace facilities. This methodology uses the concept of distance between graphs. To demonstrate the feasibility of the approach, several generic algorithms are implemented, and results on five different types of topologies reported
Keywords :
computer animation; message passing; parallel programming; program debugging; workstation clusters; animations; generic algorithms; incomplete information; parallel programs debugging; program topologies; regular topological structures; trace facilities; workstation clusters; Animation; Computer science; Costs; Debugging; Message passing; Parallel algorithms; Power engineering computing; Programming profession; Supercomputers; Topology;
Conference_Titel :
Cluster Computing, 1999. Proceedings. 1st IEEE Computer Society International Workshop on
Conference_Location :
Melbourne, Vic.
Print_ISBN :
0-7695-0343-8
DOI :
10.1109/IWCC.1999.810897