DocumentCode
3716798
Title
Design and Verification of Heterogeneous Streaming Parallel Mechanisms on Kepler CUDA
Author
Kailong Zhang;Shaoli Zhou;Liang Hu;Hang Su;Xiao Wu;Kejian Miao
Author_Institution
Sch. of Comput. Sci. &
fYear
2015
Firstpage
2256
Lastpage
2262
Abstract
In many-core parallel computing, how to optimally allocate and schedule computing cores according to the characteristics of a parallel application is a typical and fundamental problem that bears closely on computing performance. After analyzing the features and mechanisms of the Kepler CUDA architecture, three heterogeneous streaming parallel computing modes, together with their corresponding constraints and mechanisms, are studied and described in detail. Considering the performance differences between the processing steps of a single parallel task, a novel mechanism for balancing the resources and performance of the whole task is further studied. Finally, we present typical implementation methods on a Kepler CUDA processor, and implement typical matrix-processing algorithms and complicated target-detection algorithms with these three computing modes. Experiments show that these modes can adapt to different types of applications, and that the pipelining parallel computing mode usually performs better.
Keywords
"Graphics processing units","Synchronization","Parallel processing","Instruction sets","Computer architecture","Kernel","Algorithm design and analysis"
Publisher
IEEE
Conference_Titel
Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/CIT/IUCC/DASC/PICOM.2015.333
Filename
7363379
Link To Document