DocumentCode
3774669
Title
NVIDIA GTX200: TeraFLOPS visual computing
Author
John Tynefield
fYear
2008
Firstpage
1
Lastpage
19
Abstract
This article consists of a collection of slides from the author´s conference presentation. Some of the specific conclusions presented/discussed include: Rebalanced architecture to workload trends; Scaled from 128 to 240 processors; Hardware manages thousands of threads; Zero software overhead; Hides huge latencies; High achieved utilization; Natively Scalar; No swizzling or vectorization overhead; Coalescing for high bandwidth memory I/O; Software architecture allows 2X scaling on customer C code with no modification.
Keywords
"Multithreading","Graphics processing units","Software architecture","Processor scheduling","Multiprocessing systems","Computer architecture","Parallel processing"
Publisher
ieee
Conference_Titel
Hot Chips 20 Symposium (HCS), 2008 IEEE
Type
conf
DOI
10.1109/HOTCHIPS.2008.7476559
Filename
7476559
Link To Document