• DocumentCode
    1240310
  • Title

    Compute Unified Device Architecture Application Suitability

  • Author

    Hwu, Wen-Mei ; Rodrigues, Christopher ; Ryoo, Shane ; Stratton, John

  • Author_Institution
    Univ. of Illinois, Urbana, IL
  • Volume
    11
  • Issue
    3
  • fYear
    2009
  • Firstpage
    16
  • Lastpage
    26
  • Abstract
    Graphics processing units (GPUs) can provide excellent speedups on some, but not all, general-purpose workloads. Using a set of computational GPU kernels as examples, the authors show how to adapt kernels to utilize the architectural features of a GeForce 8800 GPU and what finally limits the achievable performance.
  • Keywords
    microprocessor chips; Nvidia GeForce 8800 GTX GPU; general-purpose workloads; graphics processing units; unified device architecture; Central Processing Unit; Computer architecture; Costs; Graphics; Hardware; Kernel; Multicore processing; Parallel processing; Phased arrays; Yarn; CUDA; GPGPU; benchmarks; compute unified device architecture; computer architecture; general-purpose computing on GPU; software optimization;
  • fLanguage
    English
  • Journal_Title
    Computing in Science & Engineering
  • Publisher
    ieee
  • ISSN
    1521-9615
  • Type

    jour

  • DOI
    10.1109/MCSE.2009.48
  • Filename
    4814979