DocumentCode :
3293022
Title :
Toward a more accurate understanding of the limits of the TLS execution paradigm
Author :
Ioannou, Nikolas ; Singer, Jeremy ; Khan, Salman ; Xekalakis, Polychronis ; Yiapanis, Paraskevas ; Pocock, Adam ; Brown, Gavin ; Luján, Mikel ; Watson, Ian ; Cintra, Marcelo
Author_Institution :
Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
fYear :
2010
fDate :
2-4 Dec. 2010
Firstpage :
1
Lastpage :
12
Abstract :
Thread-Level Speculation (TLS) facilitates the extraction of parallel threads from sequential applications. Most prior work has focused on developing the compiler and architecture for this execution paradigm. Such studies often narrowly concentrated on a specific design point. On the other hand, other studies have attempted to assess how well TLS performs if some architectural/ compiler constraint is relaxed. Unfortunately, such previous studies have failed to truly assess TLS performance potential, because they have been bound to some specific TLS architecture and have ignored one or another important TLS design choice, such as support for out-of-order task spawn or support for intermediate checkpointing. In this paper we attempt to remedy some of the shortcomings of previous TLS limit studies. To this end a characterization approach is pursued that is, as much as possible, independent of specific architecture configurations. High-level TLS architectural support is explored in one common framework. In this way, a more accurate upper-bound on the performance potential of the TLS execution paradigm is obtained (as opposed to some particular architecture design point) and, moreover, relative performance gains can be related to specific high-level architectural support. Finally, in the spirit of performing a comprehensive study, applications from a variety of domains and programming styles are evaluated. Experimental results suggest that TLS performance varies significantly depending on the features provided by the architecture. Additionally, the performance of these systems is not only hindered by data dependences, but also by load imbalance and limited coverage.
Keywords :
checkpointing; data handling; parallel programming; program compilers; resource allocation; TLS execution paradigm; architectural constraint; compiler development; data dependence; load imbalance; parallel thread; programming style; thread-level speculation; Benchmark testing; Instruction sets; Java; Optimization; Programming; Synchronization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Workload Characterization (IISWC), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-9297-8
Electronic_ISBN :
978-1-4244-9296-1
Type :
conf
DOI :
10.1109/IISWC.2010.5649169
Filename :
5649169
Link To Document :
بازگشت