Title :
Task sampling: computer architecture simulation in the many-core era
Author :
Sujon, Majedul Haque ; Whaley, R. Clint ; Qing Yi
Author_Institution :
Dept. of Comput. Sci., Univ. of TX at San Antonio, San Antonio, TX, USA
Abstract :
Modern architectures increasingly rely on SIMD vectorization to improve performance for floating point intensive scientific applications. However, existing compiler optimization techniques for automatic vectorization are inhibited by the presence of unknown control flow surrounding partially vectorizable computations. In this paper, we present a new approach, speculative vectorization, which speculates past dependent branches to aggressively vectorize computational paths that are expected to be taken frequently at runtime, while simply restarting the calculation using scalar instructions when the speculation fails. We have integrated our technique in an iterative optimizing compiler and have employed empirical tuning to select the profitable paths for speculation. When applied to optimize 9 floating-point benchmarks, our optimizing compiler has achieved up to 6.8X speedup for single precision and 3.4X for double precision kernels using AVX, while vectorizing some operations considered not vectorizable by prior techniques.
Keywords :
optimisation; parallel processing; program compilers; software architecture; vectors; SIMD vectorization; compiler optimization techniques; floating point intensive scientific applications; floating-point benchmarks; modern architectures; Algorithm design and analysis; Benchmark testing; Kernel; Optimization; Optimizing compilers; Safety; Vectors; many-core; sampling; simulation; task-based programs;
Conference_Titel :
Parallel Architectures and Compilation Techniques (PACT), 2013 22nd International Conference on
Conference_Location :
Edinburgh
Print_ISBN :
978-1-4799-1018-2
DOI :
10.1109/PACT.2013.6618831