DocumentCode :
1829681
Title :
Impact of sharing-based thread placement on multithreaded architectures
Author :
Thekkath, Radhika ; Eggers, Susan J.
Author_Institution :
Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
fYear :
1994
fDate :
18-21 Apr 1994
Firstpage :
176
Lastpage :
186
Abstract :
Multithreaded architectures context switch between instruction streams to hide memory access latency. Although this improves processor utilization, it can increase cache interference and degrade overall performance. One technique to reduce the interconnect traffic is to co-locate threads that share data on the same processor. Multiple threads sharing in the cache should reduce compulsory and invalidation misses, thereby improving execution rime. To test this hypothesis, the authors compared a variety of thread placement algorithms via trace-driven simulation of fourteen coarse- and medium-grain parallel applications on several multithreaded architectures. The results contradict the hypothesis. Rather than decreasing, compulsory and invalidation misses remained neatly constant across all placement algorithms, for all processor configurations, even with an infinite cache. That is, sharing-based-placement had no (positive) effect on execution time. Instead load balancing was the critical factor that affected performance. The results were due to one or both of the following reasons: (1) the sequential and uniform access of shared data by the application´s threads and (2) the insignificant number of data references that require interconnect access, relative to the total number of instructions
Keywords :
buffer storage; memory architecture; parallel architectures; performance evaluation; shared memory systems; cache; coarse-grain parallel applications; context switch; execution time; medium-grain parallel applications; multithreaded architectures; overall performance; processor utilization; sharing-based thread placement; sharing-based-placement; thread placement algorithms; trace-driven simulation; Computer architecture; Computer science; Degradation; Delay; Load management; Processor scheduling; Switches; Telecommunication traffic; Testing; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 1994., Proceedings the 21st Annual International Symposium on
Conference_Location :
Chicago, IL
Print_ISBN :
0-8186-5510-0
Type :
conf
DOI :
10.1109/ISCA.1994.288151
Filename :
288151
Link To Document :
بازگشت