Title :
The LINPACK benchmark on the Fujitsu FAP 1000
Author :
Brent, Richard P.
Author_Institution :
Comput. Sci. Lab., Australian Nat. Univ., Canberra, ACT, Australia
Abstract :
The author describes an implementation of the LINPACK benchmark on the Fujitsu AP 1000. Design considerations include communication primitives, data distribution, use of blocking to reduce memory references, and effective use of the cache. The LINPACK benchmark results show that the AP 1000 is a good machine for numerical linear algebra, and that one can consistently achieve close to 80 percent of its theoretical peak performance on moderate to large problems. The main reason for this is the high ratio of communication speed to floating-point speed compared to machines such as the Intel Delta and nCUBE 2. The high-bandwidth hardware row/column broadcast capability of the T-net (xbrd, ybrd) and the low latency of the synchronous communication routines are significant
Keywords :
linear algebra; mathematics computing; performance evaluation; software packages; Fujitsu FAP 1000; Intel Delta; LINPACK benchmark; T-net; cache; communication primitives; communication speed; data distribution; floating-point speed; high-bandwidth hardware row/column broadcast capability; low latency; memory references; nCUBE 2; numerical linear algebra; Australia; DRAM chips; Equations; Financial advantage program; Floating-point arithmetic; Hardware; Indexing; Laboratories; Network topology; Routing;
Conference_Titel :
Frontiers of Massively Parallel Computation, 1992., Fourth Symposium on the
Conference_Location :
McLean, VA
Print_ISBN :
0-8186-2772-7
DOI :
10.1109/FMPC.1992.234897