• DocumentCode
    3618180
  • Title

    High-availability algorithms for distributed stream processing

  • Author

    J.-H. Hwang;M. Balazinska;A. Rasin;U. Cetintemel;M. Stonebraker;S. Zdonik

  • Author_Institution
    Brown Univ., Providence, RI, USA
  • fYear
    2005
  • fDate
    6/27/1905 12:00:00 AM
  • Firstpage
    779
  • Lastpage
    790
  • Abstract
    Stream-processing systems are designed to support an emerging class of applications that require sophisticated and timely processing of high-volume data streams, often originating in distributed environments. Unlike traditional data-processing applications that require precise recovery for correctness, many stream-processing applications can tolerate and benefit from weaker recovery guarantees. In this paper, we study various recovery guarantees and pertinent recovery techniques that can meet the correctness and performance requirements of stream-processing applications. We discuss the design and algorithmic challenges associated with the proposed recovery techniques and describe how each can provide different guarantees with proper combinations of redundant processing, checkpointing, and remote logging. Using analysis and simulations, we quantify the cost of our recovery guarantees and examine the performance and applicability of the recovery techniques. We also analyze how the knowledge of query network properties can help decrease the cost of high availability.
  • Keywords
    "Condition monitoring","Costs","Remote monitoring","Digital signal processing","Runtime","Algorithm design and analysis","Checkpointing","Performance analysis","Analytical models","Availability"
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.72
  • Filename
    1410192