• DocumentCode
    2484355
  • Title

    Transitive closure on the cell broadband engine: A study on self-scheduling in a multicore processor

  • Author

    Vinjamuri, Sudhir ; Prasanna, Viktor K.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    In this paper, we present a mapping methodology and optimizations for solving transitive closure on the Cell multicore processor. Using our approach, it is possible to achieve near peak performance for transitive closure on the Cell processor. We first parallelize the Standard Floyd Warshall algorithm and show through analysis and experimental results that data communication is a bottleneck for performance and scalability. We parallelize a cache optimized version of Floyd Warshall algorithm to remove the memory bottleneck. As is the case with several scientific computing and industrial applications on a multicore processor, synchronization and scheduling of the cores plays a crucial role in determining the performance of this algorithm. We define a self-scheduling mechanism for the cores of a multicore processor and design a self-scheduler for Blocked Floyd Warshall algorithm on the Cell multicore processor to remove the scheduling bottleneck. We also present optimizations in scheduling order to remove synchronization points. Our implementations achieved up to 78GFLOPS.
  • Keywords
    microprocessor chips; processor scheduling; storage management; 78GFLOPS; cell broadband engine; mapping methodology; memory bottleneck; multicore processor; scheduling bottleneck; self-scheduling; standard Floyd Warshall algorithm; synchronization; synchronization points; transitive closure; Algorithm design and analysis; Communication standards; Data communication; Engines; Job shop scheduling; Multicore processing; Optimization methods; Performance analysis; Processor scheduling; Scheduling algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5161072
  • Filename
    5161072