• DocumentCode
    3144661
  • Title

    Reducing Shared Cache Contention by Scheduling Order Adjustment on Commodity Multi-cores

  • Author

    Wang, Yingxin ; Cui, Yan ; Tao, Pin ; Fan, Haining ; Chen, Yu ; Shi, Yuanchun

  • Author_Institution
    Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    984
  • Lastpage
    992
  • Abstract
    Due to the limitation of power and processor complexity on traditional single core processors, multi-core processors have become the mainstream. One key feature on commodity multi-cores is that the last level cache (LLC) is usually shared. However, the shared cache contention can affect the performance of applications significantly. Several existing proposals demonstrate that task co-scheduling has the potential to alleviate the contention, but it is challenging to make co-scheduling practical in commodity operating systems. In this paper, we propose two lightweight practical cache-aware co-scheduling methods, namely static SOA and dynamic SOA, to solve the cache contention problem on commodity multi-cores. The central idea of the two methods is that the cache contention can be reduced by adjusting the scheduling order properly. These two methods are different from each other mainly in the way of acquiring the process´s cache requirement. The static SOA (static scheduling order adjustment) method acquires the cache requirement information statically by offline profiling, while the dynamic SOA (dynamic scheduling order adjustment) captures the cache requirement statistics by using performance counters. Experimental results using multi-programmed NAS workloads suggest that the proposed methods can greatly reduce the effect of cache contention on multi-core systems. Specifically, for the static SOA method, the execution time can be reduced by up to 15.7%, the number of cache misses can be reduced by up to 11.8%, and the performance improvement remains obvious across the cache size and the length of time slice. For the dynamic SOA method, the execution time reduction can achieve up to 7.09%.
  • Keywords
    cache storage; processor scheduling; shared memory systems; statistical analysis; cache requirement statistics; cache-aware coscheduling; commodity multicore; dynamic SOA; dynamic scheduling order adjustment; last level cache; multicore processor; multicore system; multiprogrammed NAS workload; offline profiling; shared cache contention; static SOA; static scheduling order adjustment; task coscheduling; Arrays; Degradation; Kernel; Measurement; Radiation detectors; Scheduling; Semiconductor optical amplifiers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW), 2011 IEEE International Symposium on
  • Conference_Location
    Shanghai
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-425-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.248
  • Filename
    6008947