• DocumentCode
    2791519
  • Title
    Efficient Scheduling of Nested Parallel Loops on Multi-Core Systems
  • Author
    Kejariwal, Arun ; Nicolau, Alexandru ; Veidenbaum, Alexander V. ; Banerjee, Utpal ; Polychronopoulos, Constantine D.
  • Author_Institution
    Center for Embedded Comput. Syst., Univ. of California, Irvine, CA, USA
  • fYear
    2009
  • fDate
    22-25 Sept. 2009
  • Firstpage
    74
  • Lastpage
    83
  • Abstract
    Parallel loops, such as a parallel DO loop in Fortran, account for a large percentage of the total execution time. Given this, we focus on the problem of how to efficiently schedule nested perfect/non-perfect parallel loops on emerging multi-core systems. In this regard, one of the key aspects is how to determine the profitability of parallel execution and how to efficiently capture the cache behavior, as the cache subsystem is often the main performance bottleneck in multi-core systems. In this paper, we present a novel profile-guided compiler technique for cache-aware scheduling of the iteration spaces of such loops. Specifically, we propose a technique for iteration space scheduling which captures the effect of variation in the number of cache misses across the iteration space. Subsequently, we propose a general approach to capture the variation of both the number of cache misses and computation across the iteration space. We demonstrate the efficacy of our approach on a dedicated 4-way Intel® Xeon®-based multiprocessor using several kernels from industry-standard benchmarks.
  • Keywords
    parallel processing; scheduling; cache aware scheduling; cache behavior; cache subsystem; iteration space scheduling; multicore system; nested nonperfect parallel loops scheduling; profile guided compiler technique; Computer aided instruction; Computer science; Concurrent computing; Embedded computing; Job shop scheduling; Laboratories; Parallel processing; Processor scheduling; Profitability; Research and development; Cache misses; Cost modeling; Load balancing; Multithreading;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Parallel Processing, 2009. ICPP '09. International Conference on
  • Conference_Location
    Vienna
  • ISSN
    0190-3918
  • Print_ISBN
    978-1-4244-4961-3
  • Electronic_ISBN
    0190-3918
  • Type
    conf
  • DOI
    10.1109/ICPP.2009.19
  • Filename
    5361821