• DocumentCode
    2525280
  • Title

    Efficient Utilization of Memory Mapped NICs onto Clusters using Pipelined Schedules

  • Author

    Sotiropoulos, Aristidis ; Tsoukalas, Georgios ; Koziris, Nectarios

  • fYear
    2002
  • fDate
    21-24 May 2002
  • Firstpage
    238
  • Lastpage
    238
  • Abstract
    This paper describes the performance benefits attained using enhanced network interfaces to achieve low latency communication. We make use of DMA communication mode, to send data to other nodes, while the CPU performs useful calculations. Zero-copy communication is achieved through pinned-down physical memory regions, provided by NIC´s driver modules. Our testbed concerns the parallel execution of tiled nested loops onto a Linux PC cluster with PCI-SCI NICs (Dolphin D330). Tiles are essentially exchanging data and should also have large Computational grain, so that their parallel execution becomes beneficial. We schedule tiles much more efficiently by exploiting the inherent overlapping between communication and computation phases among successive, atomic tile executions. The applied nonblocking schedule resembles a pipelined data-path where computation phases are overlapped with communication ones, instead of being interleaved with them. Experimental evaluation illustrates that when using enhanced communication features such as DMA transfers, memory-mapped interfaces and zero-copy mechanisms, overall performance is considerably improved compared to using conventional, CPU and kernel bounded, communication primitives.
  • Keywords
    Computer interfaces; Computer networks; Concurrent computing; Delay; Linear programming; Network interfaces; Processor scheduling; Shape; Testing; Tiles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2002. 2nd IEEE/ACM International Symposium on
  • Print_ISBN
    0-7695-1582-7
  • Type

    conf

  • DOI
    10.1109/CCGRID.2002.1017132
  • Filename
    1540461