• DocumentCode
    1267412
  • Title

    An advanced compiler framework for non-cache-coherent multiprocessors

  • Author

    Paek, Yunheung ; Navarro, Angeles ; Zapata, Emilio ; Hoeflinger, Jay ; Padua, David

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Korean Adv. Inst. of Sci. & Technol., Taejon, South Korea
  • Volume
    13
  • Issue
    3
  • fYear
    2002
  • fDate
    3/1/2002 12:00:00 AM
  • Firstpage
    241
  • Lastpage
    259
  • Abstract
    The Cray T3D and T3E are non-cache-coherent (NCC) computers with a NUMA structure. They have been shown to exhibit a very stable and scalable performance for a variety of application programs. Considerable evidence suggests that they are more stable and scalable than many other shared-memory multiprocessors. However, the principal drawback of these machines is a lack of programmability, caused by the absence of the global cache coherence that is necessary to provide a convenient shared view of memory in hardware. This forces the programmer to keep careful track of where each piece of data is stored, a complication that is unnecessary when a pure shared-memory view is presented to the user. We believe that a remedy for this problem is advanced compiler technology. In this paper, we present our experience with a compiler framework for automatic parallelization and communication generation that has the potential to reduce the time-consuming hand-tuning that would otherwise be necessary to achieve good performance with this type of machine. From our experiments, we learned that our compiler performs well for a variety of applications on the T3D and T3E and we found a few sophisticated techniques that could improve performance even more once they are fully implemented in the compiler
  • Keywords
    cache storage; parallel programming; parallelising compilers; shared memory systems; array privatization; compiler; dependence analysis; global cache coherence; multiprocessors; noncache-coherent multiprocessors; noncoherent caches; programmability; shared-memory multiprocessors; shared-memory programming; Application software; Hardware; Programming profession;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.993205
  • Filename
    993205