Title :
Architecture for an Offline Parallel Debugger
Author :
Lindekugel, Karl ; DiGirolamo, Anthony ; Stanzione, Dan
Author_Institution :
Arizona State Univ., AZ, USA
Abstract :
This paper provides and overview of the {it GDBase} framework for offline parallel debuggers. The framework was designed to become the basis of debugging tools which scale successfully on systems with tens to hundreds of thousands of cores. With several systems coming online at more than 50,000 cores in the past year, debuggers which can run at these scales are now required. The proposed framework offers two features not found in current generation debugging tools: the ability to debug "offline\´\´, and a central database to act as a repository of debugging information. These two features enable the GDBase debugger to offer several advantages. The debugger can be used in conjunction with modern batch systems with low overhead, with user interaction taking place after the parallel system resources are freed. The use of a database and a simple API allows for multiple interfaces and data mining tools to be implemented to provide novel ways of viewing and analyzing debugging data. The database also enables cross-run analysis, and the combination of debugging, performance, and system health information. Evidence is provided of the scalability of the framework, as well as output from several simple analysis tools that have been implemented.
Keywords :
application program interfaces; data mining; parallel programming; program debugging; software architecture; API; GDBase framework; batch systems; data mining; offline parallel debugger; parallel system resources; Application software; Computer bugs; Data analysis; Data mining; Debugging; Distributed processing; Large-scale systems; Performance analysis; Production systems; Spatial databases; hpc parallel debugging;
Conference_Titel :
Parallel and Distributed Processing with Applications, 2008. ISPA '08. International Symposium on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3471-8
DOI :
10.1109/ISPA.2008.125