DocumentCode
2054490
Title
Simultaneous multithreaded vector architecture: merging ILP and DLP for high performance
Author
Espasa, Roger ; Valero, Mateo
Author_Institution
Dept. d´´Arquitectura de Computadors, Univ. Politecnica de Catalunya, Barcelona, Spain
fYear
1997
fDate
18-21 Dec 1997
Firstpage
350
Lastpage
357
Abstract
Shows that instruction-level parallelism (ILP) and data-level parallelism (DLP) can be merged in a single simultaneous vector multithreaded architecture to execute regular vectorizable code at a performance level that cannot be achieved using either paradigm on its own. We show that the combination of the two techniques yields very high performance at a low cost and a low complexity. We show that this architecture achieves a sustained performance on numerical regular codes that is 20 times the performance that can be achieved with today´s superscalar microprocessors. Moreover, we show that the architecture can tolerate very large memory latencies, of up to a 100 cycles, with a relatively small performance degradation. This high performance is independent of working set size or of locality considerations, since the DLP paradigm allows very efficient exploitation of a high-performance flat memory bandwidth
Keywords
parallel architectures; performance evaluation; vector processor systems; complexity; cost; data-level parallelism; flat memory bandwidth; high-performance computing; instruction-level parallelism; locality considerations; memory latencies; numerical regular codes; regular vectorizable code; simultaneous vector multithreaded architecture; working set size; Computer architecture; Costs; Delay effects; High performance computing; Instruction sets; Merging; Microprocessors; Parallel processing; Supercomputers; Wire;
fLanguage
English
Publisher
ieee
Conference_Titel
High-Performance Computing, 1997. Proceedings. Fourth International Conference on
Conference_Location
Bangalore
Print_ISBN
0-8186-8067-9
Type
conf
DOI
10.1109/HIPC.1997.634514
Filename
634514
Link To Document