• DocumentCode
    1647522
  • Title

    Improving the throughput of synchronization by insertion of delays

  • Author

    Rajwar, Ravi ; Kagi, Alain ; Goodman, James R.

  • Author_Institution
    Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA
  • fYear
    2000
  • fDate
    6/22/1905 12:00:00 AM
  • Firstpage
    168
  • Lastpage
    179
  • Abstract
    Efficiency of synchronization mechanisms can limit the parallel performance of many shared-memory applications. In addition, the ever increasing performance gap between processor and interprocessor communication may further compromise the scalability of these primitives. Ideally, synchronization primitives should provide high performance under both high and low contention without requiring substantial programmer effort and software support. QOLR has been shown to offer substantial speedups and to outperform other synchronization primitives consistently, but at the cost of software support and protocol complexity. This paper proposes the use of speculation and delays to implement a purely hardware-based queueing mechanism called Implicit QOLB. Making use of the pervasiveness of the Load-Linked/Store-Conditional primitives, we present a series of hardware mechanisms to optimize performance for sharing patterns exhibited by locks and associated data. The mechanisms do not require any change to existing software or instruction sets. IQOLB sits alongside the cache-coherence protocol and guides the decisions the protocol makes with respect to lock (and associated data) transfers. Preliminary evaluations indicate that IQOLB may perform as well as, if not better than, QOLB without the additional software and protocol complexity
  • Keywords
    concurrency control; shared memory systems; IQOLB; Implicit QOLB; delays; parallel performance; shared-memory applications; speculation; synchronization; Application software; Costs; Delay; Hardware; Instruction sets; Programming profession; Protocols; Scalability; Software performance; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High-Performance Computer Architecture, 2000. HPCA-6. Proceedings. Sixth International Symposium on
  • Conference_Location
    Touluse
  • Print_ISBN
    0-7695-0550-3
  • Type

    conf

  • DOI
    10.1109/HPCA.2000.824348
  • Filename
    824348