• DocumentCode
    2497360
  • Title

    Memory Scalability Evaluation of the Next-Generation Intel Bensley Platform with InfiniBand

  • Author

    Koop, Matthew J. ; Huang, Wei ; Vishnu, Abhinav ; Panda, Dhabaleswar K.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH
  • fYear
    2006
  • fDate
    23-25 Aug. 2006
  • Firstpage
    52
  • Lastpage
    60
  • Abstract
    As multi-core systems gain popularity for their increased computing power at low-cost, the rest of the architecture must be kept in balance, such as the memory subsystem. Many existing memory subsystems can suffer from scalability issues and show memory performance degradation with more than one process running. To address these scalability issues, fully-buffered DIMMs have recently been introduced. In this paper we present an initial performance evaluation of the next-generation multi-core Intel platform by evaluating the FB-DIMM-based memory subsystem and the associated InfiniBand performance. To the best of our knowledge this is the first such study of Intel multi-core platforms with multi-rail InfiniBand DDR configurations. We provide an evaluation of the current-generation Intel Lindenhurst platform as a reference point. We find that the Intel Bensley platform can provide memory scalability to support memory accesses by multiple processes on the same machine as well as drastically improved inter-node throughput over InfiniBand. On the Bensley platform we observe a 1.85 times increase in aggregate write bandwidth over the Lindenhurst platform. For inter-node MPI-level benchmarks we show bi-directional bandwidth of over 4.55 GB/sec for the Bensley platform using 2 DDR InfiniBand host channel adapters (HCAs), an improvement of 77% over the current generation Lindenhurst platform. The Bensley system is also able to achieve a throughput of 3.12 million MPI messages/sec in the above configuration
  • Keywords
    memory architecture; microprocessor chips; InfiniBand; InfiniBand host channel adapters; bi-directional bandwidth; current-generation Intel Lindenhurst platform; fully-buffered DIMM; inter-node throughput; memory performance degradation; memory scalability evaluation; multirail InfiniBand DDR configurations; next-generation Intel Bensley platform; next-generation multicore Intel platform; Aggregates; Bandwidth; Bidirectional control; Computer networks; Hardware; Next generation networking; Power engineering computing; Scalability; Sun; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High-Performance Interconnects, 14th IEEE Symposium on
  • Conference_Location
    Stanford, CA
  • ISSN
    1550-4794
  • Print_ISBN
    0-7695-2654-3
  • Type

    conf

  • DOI
    10.1109/HOTI.2006.19
  • Filename
    1690198