• DocumentCode
    3774669
  • Title

    NVIDIA GTX200: TeraFLOPS visual computing

  • Author

    John Tynefield

  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    19
  • Abstract
    This article consists of a collection of slides from the author´s conference presentation. Some of the specific conclusions presented/discussed include: Rebalanced architecture to workload trends; Scaled from 128 to 240 processors; Hardware manages thousands of threads; Zero software overhead; Hides huge latencies; High achieved utilization; Natively Scalar; No swizzling or vectorization overhead; Coalescing for high bandwidth memory I/O; Software architecture allows 2X scaling on customer C code with no modification.
  • Keywords
    "Multithreading","Graphics processing units","Software architecture","Processor scheduling","Multiprocessing systems","Computer architecture","Parallel processing"
  • Publisher
    ieee
  • Conference_Titel
    Hot Chips 20 Symposium (HCS), 2008 IEEE
  • Type

    conf

  • DOI
    10.1109/HOTCHIPS.2008.7476559
  • Filename
    7476559