Title :
Compiler optimization of memory-resident value communication between speculative threads
Author :
Zhai, Antonia ; Colohan, Christopher B. ; Steffan, J. Gregory ; Mowry, Todd C.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
Efficient inter-thread value communication is essential for improving performance in thread-level speculation (TLS). Although several mechanisms for improving value communication using hardware support have been proposed, there is relatively little work on exploiting the potential of compiler optimization. Building on recent research on compiler optimization of scalar value communication between speculative threads, we propose compiler techniques for the optimization of memory-resident values. In TLS, data dependences through memory-resident values are tracked by the underlying hardware and preserved by re-executing any speculative thread that violates a dependence; however, re-execution incurs a large performance penalty and should be used only to resolve data dependences that are infrequent. In contrast, value communication for frequently-occurring data dependences must be very efficient. We propose using the compiler to first identify frequently-occurring memory-resident data dependences, then insert synchronization for communicating values to preserve these dependences. We find that by synchronizing frequently-occurring data dependences we can significantly improve the efficiency of parallel execution. A comparison between compiler-inserted and hardware-inserted memory synchronization reveals that the two techniques are complementary, with each technique benefitting different benchmarks.
Keywords :
benchmark testing; multi-threading; optimising compilers; storage management; synchronisation; benchmarks; compiler optimization; compiler techniques; compiler-inserted memory synchronization; data dependences; hardware-inserted memory synchronization; memory-resident value communication; parallel execution; thread-level speculation; Computer science; Costs; Data structures; Hardware; Optimizing compilers; Program processors; Runtime; Turning; Yarn;
Conference_Title :
Code Generation and Optimization, 2004. CGO 2004. International Symposium on
Print_ISBN :
0-7695-2102-9
DOI :
10.1109/CGO.2004.1281662