• DocumentCode
    826995
  • Title

    Improving the Fault Resilience of Overlay Multicast for Media Streaming

  • Author

    Tan, Guang ; Jarvis, Stephen A.

  • Author_Institution
    Dept. of Comput. Sci., Warwick Univ., Coventry
  • Volume
    18
  • Issue
    6
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    721
  • Lastpage
    734
  • Abstract
    A key technical challenge for overlay multicast is that the highly dynamic multicast members can make data delivery unreliable. In this paper, we address this issue in the context of live media streaming by exploring 1) how to construct a stable multicast tree that minimizes the negative impact of frequent member departures on an existing overlay and 2) how to efficiently recover from packet errors caused by end-system or network failures. For the first problem, we identify two layout schemes for the tree nodes, namely, the bandwidth-ordered tree and the time-ordered tree, which represent two typical approaches to improving tree reliability, and conduct a stochastic analysis on their properties regarding reliability and tree depth. Based on the findings, we propose a distributed reliability-oriented switching tree (ROST) algorithm that minimizes the failure correlation among tree nodes. Compared with some commonly used distributed algorithms, the ROST algorithm significantly improves tree reliability and reduces average service delay, while incurring only a small protocol overhead; furthermore, it features a mechanism that prevents cheating or malicious behaviors in the exchange of bandwidth/time information. For the second problem, we develop a simple cooperative error recovery (CER) protocol that helps recover from packet errors efficiently. Recognizing that a single recovery source is usually incapable of providing the timely delivery of the lost data, the protocol recovers from data outages using the residual bandwidths from multiple sources, which are identified using a minimum-loss-correlation algorithm. Extensive simulations demonstrate the effectiveness of the proposed schemes
  • Keywords
    computer network reliability; media streaming; multicast communication; multicast protocols; peer-to-peer computing; stochastic processes; trees (mathematics); bandwidth-ordered tree; cooperative error recovery protocol; distributed reliability-oriented switching tree algorithm; fault resilience; media streaming; minimum-loss-correlation algorithm; overlay multicast; packet error; stochastic analysis; time-ordered tree; Bandwidth; Delay effects; Distributed algorithms; Multicast algorithms; Peer to peer computing; Protocols; Resilience; Shape; Stochastic processes; Streaming media; Reliability; fault resilience; media streaming; multicast; overlay.; peer-to-peer;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2007.1054
  • Filename
    4180341