DocumentCode :
1772600
Title :
Performance modeling for highly-threaded many-core GPUs
Author :
Lin Ma ; Chamberlain, Roger D. ; Agrawal, Kunal
Author_Institution :
Dept. of Comput. Sci. & Eng., Washington Univ. in St. Louis, St. Louis, MO, USA
fYear :
2014
fDate :
18-20 June 2014
Firstpage :
84
Lastpage :
91
Abstract :
Highly-threaded many-core GPUs can provide high throughput for a wide range of algorithms and applications. Such machines hide memory latencies via the use of a large number of threads and large memory bandwidth. The achieved performance, therefore, depends on the parallelism exploited by the algorithm, the effectiveness of latency hiding, and the utilization of multiprocessors (occupancy). In this paper, we extend previously proposed analytical models, jointly addressing parallelism, latency-hiding, and occupancy. In particular, the model not only helps to explore and reduce the configuration space for tuning kernel execution on GPUs, but also reflects performance bottlenecks and predicts how the runtime will trend as the problem and other parameters scale. The model is validated with empirical experiments. In addition, the model points to at least one circumstance in which the occupancy decisions automatically made by the scheduler are clearly sub-optimal in terms of runtime.
Keywords :
graphics processing units; multi-threading; multiprocessing systems; performance evaluation; analytical models; highly threaded manycore GPU; jointly addressing parallelism; large memory bandwidth; latency hiding; memory latencies; multiprocessors utilization; performance bottlenecks; performance modeling; tuning kernel execution; Algorithm design and analysis; Analytical models; Computational modeling; Hidden Markov models; Instruction sets; Mathematical model; Runtime; All-pairs Shortest Paths (APSP); GPGPU; Performance Model; Threaded Many-core Memory (TMM) Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application-specific Systems, Architectures and Processors (ASAP), 2014 IEEE 25th International Conference on
Conference_Location :
Zurich
Type :
conf
DOI :
10.1109/ASAP.2014.6868641
Filename :
6868641
Link To Document :
بازگشت