Title :
Effects of MSHR and Prefetch Mechanisms on an On-Chip Cache of the Vector Architecture
Author :
Musa, Akihiro ; Sato, Yoshiei ; Soga, Takashi ; Egawa, Ryusuke ; Takizawa, Hiroyuki ; Okabe, Koki ; Kobayashi, Hiroaki
Author_Institution :
Tohoku Univ., NEC Corp. Sendai, Sendai, Japan
Abstract :
Vector supercomputers have been encountering the memory wall problem and their memory bandwidth per flop/s rate has decreased. To cover the insufficient memory bandwidth per flop/s rate, an on-chip vector cache has been proposed for the vector processors. Although vector caching is effective to increase the sustained performance to a certain degree, it still needs software and hardware supporting mechanisms to extract its potential. To this end, we propose miss status handling registers (MSHR) and a prefetch mechanism. This paper evaluates the performance of the vector cache with the MSHR and the prefetch mechanism on the vector supercomputer across three leading scientific applications. The MSHR is an effective mechanism for handling subsequent vector loads of the same data, which frequently appear in different schemes. The experimental results indicate that the MSHR can improve the computational performance of scientific applications by 1.45Ã. Moreover, we examine the performance of the prefetch mechanism on the vector cache. The prefetch mechanism increases the computational performance by 1.6Ã. Accordingly, the MSHR and the prefetching mechanism are very effective optimization options for vector caching of future vector supercomputers even if the vector supercomputers cannot maintain the current memory bandwidth per flop/s rate.
Keywords :
cache storage; parallel machines; MSHR; memory bandwidth; memory wall problem; miss status handling registers; onchip cache; prefetch mechanisms; vector architecture; vector supercomputers; Bandwidth; Delay; Distributed processing; Hardware; National electric code; Prefetching; Registers; Supercomputers; System-on-a-chip; Vector processors; MSHR; Memory system; Performance characterization; Prefetch; Vector architecture; Vector cache;
Conference_Titel :
Parallel and Distributed Processing with Applications, 2008. ISPA '08. International Symposium on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3471-8
DOI :
10.1109/ISPA.2008.100