• DocumentCode
    2312514
  • Title

    Reliable Multicast Based on Erasure Resilient Codes over InfiniBand

  • Author

    Wang, Xigui ; Xiao, Zifeng ; Han, Jizhong ; Han, Chengde

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing
  • fYear
    2006
  • fDate
    25-27 Oct. 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Many distributed applications and systems, e.g., an efficient implementation of a distributed cache coherence protocol in distributed shared-memory systems, require efficient, reliable and scalable multicast capabilities from the low-level interconnect. However, the InfiniBand network, a high-performance interconnect with low latency and high bandwidth, lacks the necessary reliable hardware multicast capability. To avoid inefficient multicast emulation with one-to-many point-to-point messages and ACKs, this paper proposes an efficient algorithm that provides reliable multicast based on erasure resilient codes over InfiniBand. The algorithm not only avoids the feedback implosion problem caused by point-to-point multicast emulation messages, but also achieves lower latency and better scalability compared with automatic-request retransmission (ARQ). Moreover, the algorithm can be optimized with a message pipeline mechanism to achieve the same level of latency as unreliable InfiniBand hardware multicast. Performance analysis demonstrates that the probability of failing to recover a message is less than 1.4times10 even for a system with 1000 message receivers.
  • Keywords
    error correction codes; multi-agent systems; multicast protocols; telecommunication network reliability; ARQ; InfiniBand network; automatic-request retransmission; distributed cache coherence protocol; distributed shared-memory systems; erasure resilient codes; failure probability; message pipeline mechanism; one-to-many point-to-point messages; reliable multicast; Automatic repeat request; Bandwidth; Delay; Emulation; Feedback; Hardware; Multicast algorithms; Multicast protocols; Pipelines; Scalability; Erasure Resilient Codes; InfiniBand; Multicast; Reed-Solomon code;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communications and Networking in China, 2006. ChinaCom '06. First International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    1-4244-0463-0
  • Electronic_ISBN
    1-4244-0463-0
  • Type

    conf

  • DOI
    10.1109/CHINACOM.2006.344802
  • Filename
    4149767