DocumentCode :
2279938
Title :
The performance of the Cedar multistage switching network
Author :
Torrellas, Josep ; Zhang, Zheng
Author_Institution :
Center for Supercomput. Res. & Dev., Illinois Univ., Champaign, IL, USA
fYear :
1994
fDate :
14-18 Nov 1994
Firstpage :
265
Lastpage :
274
Abstract :
While multistage switching networks for vector multiprocessors have been studied extensively, detailed evaluations of their performance are rare. Indeed, analytical models, simulations with pseudo-synthetic loads, studies focused on average-value parameters, and measurements of networks disconnected from the machine all provide limited information. In this paper, instead, we present an in-depth empirical analysis of a multistage switching network in a realistic setting: we use hardware probes to examine the performance of the omega network of the Cedar shared-memory machine executing real applications. The machine is configured with 16 vector processors. The analysis suggests that the performance of multistage switching networks is limited by traffic non-uniformities. We identify two major non-uniformities that degrade Cedar´s performance and are likely to slow down other networks too. The first one is the contention caused by the return messages in a vector access as they converge from the memories to one processor port. This traffic convergence penalizes vector reads and, more importantly, causes tree saturation. The second non-uniformity is the uneven contention delays induced by even a relatively fair scheme to resolve message collisions. Based on our observations, we argue that intuitive optimizations for multistage switching networks may not be cost-effective. Instead, we suggest changes to increase the network bandwidth at the root of the traffic convergence tree and to delay traffic convergence up until the final stages of the network
Keywords :
convergence; delays; multistage interconnection networks; optimisation; performance evaluation; probes; telecommunication traffic; vector processor systems; Cedar multistage switching network; hardware probes; intuitive optimizations; message collisions; message contention; message convergence; network bandwidth; omega network; performance evaluation; return messages; traffic convergence; traffic nonuniformities; tree saturation; uneven contention delays; vector access; vector multiprocessor; vector reads; Analytical models; Convergence; Degradation; Delay; Hardware; Performance analysis; Probes; Road accidents; Telecommunication traffic; Vector processors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Supercomputing '94., Proceedings
Conference_Location :
Washington, DC
Print_ISBN :
0-8186-6605-6
Type :
conf
DOI :
10.1109/SUPERC.1994.344286
Filename :
344286
Link To Document :
بازگشت