DocumentCode
25703
Title
Reliable Multicast in Data Center Networks
Author
Dan Li ; Mingwei Xu ; Ying Liu ; Xia Xie ; Yong Cui ; Jingyi Wang ; Guihai Chen
Author_Institution
Tsinghua Univ., Beijing, China
Volume
63
Issue
8
fYear
2014
fDate
Aug. 2014
Firstpage
2011
Lastpage
2024
Abstract
Multicast benefits data center group communication in both saving network traffic and improving application throughput. Reliable packet delivery is required in data center multicast for data-intensive computations. However, existing reliable multicast solutions for the Internet are not suitable for the data center environment, especially with regard to keeping multicast throughput from degrading upon packet loss, which is norm instead of exception in data centers. We present RDCM, a novel reliable multicast protocol for data center network. The key idea of RDCM is to minimize the impact of packet loss on the multicast throughput, by leveraging the rich link resource in data centers. A multicast-tree-aware backup overlay is explicitly built on group members for peer-to-peer packet repair. The backup overlay is organized in such a way that it causes little individual repair burden, control overhead, as well as overall repair traffic. RDCM also realizes a window-based congestion control to adapt its sending rate to the traffic status in the network. Simulation results in typical data center networks show that RDCM can achieve higher application throughput and less traffic footprint than other representative reliable multicast protocols. We have implemented RDCM as a user-level library on Windows platform. The experiments on our test bed show that RDCM handles packet loss without obvious throughput degradation during high-speed data transmission, gracefully respond to link failure and receiver failure, and causes less than 10% CPU overhead to data center servers.
Keywords
Internet; computer centres; computer network reliability; multicast protocols; overlay networks; peer-to-peer computing; telecommunication traffic; Internet; RDCM; Windows platform; control overhead; data center group communication; data center networks; data-intensive computations; high-speed data transmission; multicast throughput; multicast-tree-aware backup overlay; network traffic saving; packet loss; peer-to-peer packet repair; reliable multicast protocol; reliable packet delivery; repair traffic; user-level library; window-based congestion control; Data center networks; backup overlay; reliable multicast;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2013.91
Filename
6504457
Link To Document