Title :
NVIDIA GTX200: TeraFLOPS visual computing
Abstract :
This article consists of a collection of slides from the author's conference presentation. Specific conclusions presented and discussed include: architecture rebalanced to workload trends; scaled from 128 to 240 processors; hardware manages thousands of threads; zero software overhead; hides huge latencies; high achieved utilization; natively scalar; no swizzling or vectorization overhead; coalescing for high-bandwidth memory I/O; software architecture allows 2X scaling on customer C code with no modification.
Keywords :
"Multithreading","Graphics processing units","Software architecture","Processor scheduling","Multiprocessing systems","Computer architecture","Parallel processing"
Conference_Title :
Hot Chips 20 Symposium (HCS), 2008 IEEE
DOI :
10.1109/HOTCHIPS.2008.7476559