Title :
Simultaneous multithreaded vector architecture: merging ILP and DLP for high performance
Author :
Espasa, Roger ; Valero, Mateo
Author_Institution :
Dept. d´´Arquitectura de Computadors, Univ. Politecnica de Catalunya, Barcelona, Spain
Abstract :
Shows that instruction-level parallelism (ILP) and data-level parallelism (DLP) can be merged in a single simultaneous vector multithreaded architecture to execute regular vectorizable code at a performance level that cannot be achieved using either paradigm on its own. We show that the combination of the two techniques yields very high performance at a low cost and a low complexity. We show that this architecture achieves a sustained performance on numerical regular codes that is 20 times the performance that can be achieved with today´s superscalar microprocessors. Moreover, we show that the architecture can tolerate very large memory latencies, of up to a 100 cycles, with a relatively small performance degradation. This high performance is independent of working set size or of locality considerations, since the DLP paradigm allows very efficient exploitation of a high-performance flat memory bandwidth
Keywords :
parallel architectures; performance evaluation; vector processor systems; complexity; cost; data-level parallelism; flat memory bandwidth; high-performance computing; instruction-level parallelism; locality considerations; memory latencies; numerical regular codes; regular vectorizable code; simultaneous vector multithreaded architecture; working set size; Computer architecture; Costs; Delay effects; High performance computing; Instruction sets; Merging; Microprocessors; Parallel processing; Supercomputers; Wire;
Conference_Titel :
High-Performance Computing, 1997. Proceedings. Fourth International Conference on
Conference_Location :
Bangalore
Print_ISBN :
0-8186-8067-9
DOI :
10.1109/HIPC.1997.634514