• DocumentCode
    3716798
  • Title

    Design and Verification of Heterogeneous Streaming Parallel Mechanisms on Kepler CUDA

  • Author

    Kailong Zhang;Shaoli Zhou;Liang Hu;Hang Su;Xiao Wu;Kejian Miao

  • Author_Institution
    Sch. of Comput. Sci. &
  • fYear
    2015
  • Firstpage
    2256
  • Lastpage
    2262
  • Abstract
    In many-core based parallel computing field, how to optimally allocate and schedule computing core resources according to characteristics of parallel applications is one typical and fundamental problem, which touches closely to computing performances. After analyzing features and mechanisms of Kepler CUDA architecture, three heterogeneous streaming parallel computing modes and corresponding constraints, and mechanisms are studied and described in detail. Considering the performance differences between different processing steps of one parallel task, a novel mechanism for balancing resource and performance of one whole task is further studied. Finally, we present typical implementation methods on Kepler CUDA processor, and implement typical matrix-processing algorithms and complicated target-detecting algorithms with these three different computing modes. Experiments show that these modes can adapt different types of applications, and the performance of pipelining parallel computing mode is usually better.
  • Keywords
    "Graphics processing units","Synchronization","Parallel processing","Instruction sets","Computer architecture","Kernel","Algorithm design and analysis"
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/CIT/IUCC/DASC/PICOM.2015.333
  • Filename
    7363379