Title :
A Dynamically Adaptive Approach for Speculative Loop Execution in SMT Architectures
Author :
Meirong Li ; Yinliang Zhao
Author_Institution :
Dept. of Comput. Sci. & Technol., Xi´an Jiaotong Univ., Xi´an, China
Abstract :
Simultaneous multithreading allows the exploitation of thread-level speculation on the same processor. Due to the contention for shared processor resources, the performance of speculative threads often suffers from the potential of inter-thread interference, which is hard to be statically estimated by the compiler. Thus we propose an approach to dynamically determine and extract speculative threads from parallel regions until runtime. It relies on a cycle counter architecture to collect the performance profiles of each parallelized loop and uncover the potential of loop-level parallelism. These performance profiles are obtained from the relative single-threaded execution time prediction for speculative threads using thread execution cycle breakdown. The performance of different loop levels is dynamically evaluated by the prediction and only the best loop level will be chosen to parallelize. Several performance tuning policies are also examined. The best policy can achieve an average speedup of 1.45 using SPEC CPU2000 benchmarks, and it outperforms the static loop selection by 33%.
Keywords :
multi-threading; program compilers; program control structures; SMT architectures; SPEC CPU2000 benchmarks; average speedup; compilers; cycle counter architecture; dynamic adaptive approach; dynamic performance evaluation; interthread interference; loop-level parallelism; parallel regions; parallelized loop; performance profile collection; performance tuning policies; relative single-threaded execution time prediction; shared processor resource contention; simultaneous multithreading; speculative loop execution; speculative threads; thread execution cycle breakdown; thread-level speculation; Electric breakdown; Instruction sets; Interference; Message systems; Parallel processing; Performance evaluation; Radiation detectors; Loop-level parallelism; Performance prediction; Simultaneous multithreading; Thread-level speculation;
Conference_Titel :
High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on
Print_ISBN :
978-1-4799-6122-1
DOI :
10.1109/HPCC.2014.171