DocumentCode :
2869249
Title :
Slingshot: Time-CriticalMulticast for Clustered Applications
Author :
Balakrishnan, Mahesh ; Pleisch, Stefan ; Birman, Ken
Author_Institution :
Dept. of Comput. Sci., Cornell Univ., Ithaca, NY
fYear :
2005
fDate :
27-29 July 2005
Firstpage :
205
Lastpage :
214
Abstract :
Datacenters are complex environments consisting of thousands of failure-prone commodity components connected by fast, high capacity interconnects. The software running on such datacenters typically uses multicast communication patterns involving multiple senders. We examine the problem of time-critical multicast in such settings, and propose Slingshot, a protocol that uses receiver-based FEC to recover lost packets quickly. Slingshot offers probabilistic guarantees on timeliness by having receivers exchange FEC packets in an initial phase, and optional complete reliability on packets not recovered in this first phase. We evaluate an implementation of Slingshot against SRM, a well-known multicast protocol, and show that it achieves two orders of magnitude faster recovery in datacenter settings
Keywords :
forward error correction; multicast protocols; telecommunication network reliability; workstation clusters; Slingshot protocol; clustered application; datacenter; forward error correction; multicast communication pattern; multicast protocol; packet recovery; probabilistic guarantee; receiver-based FEC; time-critical multicast; Application software; Computer science; Counting circuits; Degradation; Fault tolerance; Hardware; Multicast communication; Multicast protocols; Time factors; Timing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Network Computing and Applications, Fourth IEEE International Symposium on
Conference_Location :
Cambridge, MA
Print_ISBN :
0-7695-2326-9
Type :
conf
DOI :
10.1109/NCA.2005.49
Filename :
1565954
Link To Document :
بازگشت