• DocumentCode
    2174977
  • Title

    On the Efficient Implementation of Reductions on the Cell Broadband Engine

  • Author

    Strey, Alfred

  • Author_Institution
    Inst. of Comput. Sci., Univ. of Innsbruck, Innsbruck, Austria
  • fYear
    2010
  • fDate
    17-19 Feb. 2010
  • Firstpage
    223
  • Lastpage
    228
  • Abstract
    For high-performance parallel implementations of many scientific algorithms, efficient realizations of combining communication patterns such as reduce or all-reduce are important. On the Cell Broadband Engine in particular, a low-latency realization of such operations is not obvious. This paper therefore discusses several algorithms for implementing reductions and proposes efficient implementations on the Cell. Detailed performance results are presented for reductions of vectors of various sizes on a Cell blade consisting of two interconnected Cell processors. It is shown that the new reduction algorithms are in most cases faster than previously published implementations.
  • Keywords
    computer architecture; multiprocessing systems; cell blade; cell broadband engine; communication patterns; interconnected cell processors; reductions algorithms; Bandwidth; Blades; Clocks; Clustering algorithms; Computer science; Concurrent computing; Distributed computing; Engines; Synchronization; Yarn; Cell Broadband Engine; Reductions
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2010 18th Euromicro International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1066-6192
  • Print_ISBN
    978-1-4244-5672-7
  • Electronic_ISBN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2010.59
  • Filename
    5452463