Title :
Optimizing user-level communication patterns on the Fujitsu AP3000
Author :
Dawson, Jeremy ; Strazdins, Peter
Author_Institution :
Dept. of Comput. Sci., Australian Nat. Univ., Acton, ACT, Australia
Abstract :
We present techniques and algorithms to improve the performance of various communication patterns on message passing platforms where, for reasons of safety, user level communications must be buffered in (special) memory on both the send and the receive. These algorithms can not only minimize message copying but overlap the copying to/from the special memory with the actual transfer enabling full bandwidth to be achieved. These patterns include tree broadcast and reductions, (ring based) multiple broadcasts and reductions, pipelined broadcast and buffered point-to-point sends. In each case, the messages have a simple stride. All of these patterns are used in dense linear algebra applications, although they are also used it many other contexts. These algorithms are implemented and their performance evaluated on the Fujitsu AP3000, a message passing multicomputer having many characteristics of the cluster model. Some aspects, such as the performance characteristics of the special memory are specific to the AP3000; however the algorithms still apply to any platform using a similar mode of user level communications. Worthwhile performance increases are obtained, especially for patterns involving moderate-large number of processors
Keywords :
broadcasting; linear algebra; message passing; multiprocessing systems; Fujitsu AP3000; buffered point-to-point sends; cluster model; dense linear algebra applications; message copying; message passing multicomputer; message passing platforms; performance characteristics; pipelined broadcast; ring based multiple broadcasts; special memory; tree broadcast; user level communications; user-level communication patterns; Bandwidth; Broadcasting; Communication networks; Computer networks; Computer science; Costs; Libraries; Message passing; Operating systems; Reduced instruction set computing;
Conference_Titel :
Cluster Computing, 1999. Proceedings. 1st IEEE Computer Society International Workshop on
Conference_Location :
Melbourne, Vic.
Print_ISBN :
0-7695-0343-8
DOI :
10.1109/IWCC.1999.810814