DocumentCode
1914253
Title
Improving the Accuracy and Efficiency of Time-Independent Trace Replay
Author
Desprez, Frederic ; Markomanolis, G.S. ; Suter, Frederic
Author_Institution
LIP, ENS Lyon, Lyon, France
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
446
Lastpage
455
Abstract
Simulation is a popular approach to obtain objective performance indicators on platforms that are not at one´s disposal. It may help the dimensioning of compute clusters in large computing centers. In a previous work, we proposed a framework for the off-line simulation of MPI applications. Its main originality with regard to the literature is to rely on time-independent execution traces. This allows us to completely decouple the acquisition process from the actual replay of the traces in a simulation context. Then we are able to acquire traces for large application instances without being limited to an execution on a single compute cluster. Finally our framework is built on top of a scalable, fast, and validated simulation kernel. In this paper, we detail the performance issues that we encountered with the first implementation of our trace replay framework. We propose several modifications to address these issues and analyze their impact. Results show a clear improvement on the accuracy and efficiency with regard to the initial implementation.
Keywords
application program interfaces; computer centres; message passing; performance evaluation; workstation clusters; MPI applications; acquisition process; compute cluster; computing centers; message passing interface; off-line simulation; performance indicators; performance issues; simulation context; simulation kernel; time-independent execution traces; time-independent trace replay accuracy; time-independent trace replay efficiency; trace replay framework; MPI; Performance prediction; Simulation;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location
Salt Lake City, UT
Print_ISBN
978-1-4673-6218-4
Type
conf
DOI
10.1109/SC.Companion.2012.64
Filename
6495846
Link To Document