DocumentCode :
1565906
Title :
Overcoming the limitations of conventional vector processors
Author :
Kozyrakis, Christos ; Patterson, David
Author_Institution :
Dept. of Electr. Eng., Stanford Univ., CA, USA
fYear :
2003
Firstpage :
399
Lastpage :
409
Abstract :
Despite their superior performance for multimedia applications, vector processors have three limitations that hinder their widespread acceptance. First, the complexity and size of the centralized vector register file limits the number of functional units. Second, precise exceptions for vector instructions are difficult to implement. Third, vector processors require an expensive on-chip memory system that supports high bandwidth at low access latency. We introduce CODE, a scalable vector microarchitecture that addresses these three shortcomings. It is designed around a clustered vector register file and uses a separate network for operand transfers across functional units. With extensive use of decoupling, it can hide the latency of communication across functional units and provides 26% performance improvement over a centralized organization. CODE scales efficiently to 8 functional units without requiring wide instruction issue capabilities. A renaming table makes the clustered register file transparent at the instruction set level. Renaming also enables precise exceptions for vector instructions at a performance loss of less than 5%. Finally, decoupling allows CODE to tolerate large increases in memory latency at sublinear performance degradation without using on-chip caches. Thus, CODE can use economical, off-chip, memory systems.
Keywords :
exception handling; file organisation; instruction sets; performance evaluation; random-access storage; storage allocation; system-on-chip; vector processor systems; CODE; clustered organization for decoupled execution; conventional vector processor limitations; instruction set; memory latency; multimedia application; on-chip memory system; performance improvement; scalable vector microarchitecture; vector instructions precise exception; vector register file; Bandwidth; CMOS technology; Computer architecture; Computer science; Delay; Microarchitecture; Performance loss; Registers; System-on-a-chip; Vector processors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 2003. Proceedings. 30th Annual International Symposium on
ISSN :
1063-6897
Print_ISBN :
0-7695-1945-8
Type :
conf
DOI :
10.1109/ISCA.2003.1207017
Filename :
1207017
Link To Document :
بازگشت