DocumentCode :
2174977
Title :
On the Efficient Implementation of Reductions on the Cell Broadband Engine
Author :
Strey, Alfred
Author_Institution :
Inst. of Comput. Sci., Univ. of Innsbruck, Innsbruck, Austria
fYear :
2010
fDate :
17-19 Feb. 2010
Firstpage :
223
Lastpage :
228
Abstract :
For a high-performance parallel implementation of many scientific algorithms, efficient realizations of combining communication patterns such as reduce or all-reduce are important. On the Cell Broadband Engine in particular, a low-latency realization of such operations is not obvious. This paper therefore discusses several algorithms for implementing reductions and proposes efficient implementations on the Cell. Detailed performance results are presented for reductions of vectors of various sizes on a Cell blade consisting of two interconnected Cell processors. It is shown that the new reduction algorithms are in most cases faster than previously published implementations.
Keywords :
computer architecture; multiprocessing systems; cell blade; cell broadband engine; communication patterns; interconnected cell processors; reductions algorithms; Bandwidth; Blades; Clocks; Clustering algorithms; Computer science; Concurrent computing; Distributed computing; Engines; Synchronization; Yarn; Cell Broadband Engine; Reductions;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel, Distributed and Network-Based Processing (PDP), 2010 18th Euromicro International Conference on
Conference_Location :
Pisa
ISSN :
1066-6192
Print_ISBN :
978-1-4244-5672-7
Electronic_ISBN :
1066-6192
Type :
conf
DOI :
10.1109/PDP.2010.59
Filename :
5452463