DocumentCode
1381170
Title
Architecture scalability of parallel vector computers with a shared memory
Author
Dekker, Eskil
Author_Institution
Fac. of Inf. Technol. & Syst., Delft Univ. of Technol., Netherlands
Volume
47
Issue
5
fYear
1998
fDate
5/1/1998 12:00:00 AM
Firstpage
614
Lastpage
624
Abstract
Based on a model of a parallel vector computer with a shared memory, its scalability properties are derived. The processor-memory interconnection network is assumed to be composed of crossbar switches of size b×b. This paper analyzes sustainable peak performance under optimal conditions, i.e., no memory bank conflicts, sufficient processor-memory bank pathways, and no interconnection network conflicts. It will be shown that, with fully vectorizable algorithms and no communication overhead, the sustainable peak performance does not scale up linearly with the number of processors p. If the interconnection network is unbuffered, the number of memory banks must increase at least with O(p logb p) to sustain peak performance. If the network is buffered, this bottleneck can be alleviated; however, the half performance vector length still increases with O(logb p). The paper confirms the validity of the model by examining the performance behavior of the LINPACK benchmark
Keywords
parallel architectures; shared memory systems; LINPACK benchmark; crossbar switches; interconnection network; parallel vector computer; performance behavior; processor-memory interconnection network; scalability; shared memory; sustainable peak performance; vectorizable algorithms; Computer aided manufacturing; Computer architecture; Concurrent computing; Kernel; Multiprocessor interconnection networks; Parallel machines; Parallel processing; Scalability; Switches; Throughput;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/12.677257
Filename
677257
Link To Document