DocumentCode
2789574
Title
Decomposing Partial Order Execution Graphs to Improve Message Race Detection
Author
Schaeli, Basile ; Gerlach, Sebastian ; Hersch, Roger D.
Author_Institution
Sch. of Comput. & Commun. Sci., Ecole Polytech. Fed. de Lausanne
fYear
2007
fDate
26-30 March 2007
Firstpage
1
Lastpage
8
Abstract
In message-passing parallel applications, messages are not delivered in a strict order. In most applications, the computation results and the set of messages produced during the execution should be the same for all distinct orderings of messages delivery. Finding an ordering that produces a different outcome then reveals a message race. Assuming that the partial order execution graph (POEG) capturing the causality between events is known for a reference execution, the present paper describes techniques for identifying independent sets of messages and within each set equivalent message orderings. Orderings of messages belonging to different sets may then be re-executed independently from each other, thereby reducing the number of orderings that must be tested to detect message races. We integrated the presented techniques into the dynamic parallel schedules parallelization framework, and applied our approach on an image processing, a linear algebra, and a neighborhood-dependent parallel computation. In all cases, the number of possible orderings is reduced by several orders of magnitudes. In order to further reduce this number, we describe an algorithm that generates a subset of orderings that are likely to reveal existing message races.
Keywords
graph theory; message passing; parallel programming; scheduling; POEG decomposition; dynamic parallel schedules parallelization framework; image processing; linear algebra; message race detection; message-passing parallel applications; neighborhood-dependent parallel computation; parallel program; partial order execution graphs; reference execution; Application software; Computer applications; Concurrent computing; Debugging; Dynamic scheduling; Finite element methods; Image processing; Linear algebra; Processor scheduling; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location
Long Beach, CA
Print_ISBN
1-4244-0910-1
Electronic_ISBN
1-4244-0910-1
Type
conf
DOI
10.1109/IPDPS.2007.370409
Filename
4228137
Link To Document