an empirical study of high availability in stream
play

An Empirical Study of High Availability in Stream Processing Systems - PowerPoint PPT Presentation

IBM Research An Empirical Study of High Availability in Stream Processing Systems Yu Gu, Zhe Zhang , Fan Ye, Hao Yang, Minkyong Kim, Hui Lei, Zhen Liu 12/3/2009 IBM Research Stream Processing Model software operators (PEs)


  1. IBM Research An Empirical Study of High Availability in Stream Processing Systems Yu Gu, Zhe Zhang , Fan Ye, Hao Yang, Minkyong Kim, Hui Lei, Zhen Liu 12/3/2009

  2. IBM Research Stream Processing Model software operators ∆ ∞ (PEs) ∩ ∫ Ω ∑ … … subjob … …  Unexpected machine failures … – Loss of data and internal state deployment machines – Disruption to normal processing  Challenge: how to preserve data / state and minimize disruption? 2 An Empirical Study of High Availability in DSPS 12/3/2009

  3. IBM Research Existing approaches: vs. Passive Standby Active Standby Passive Standby Active Standby ∆ ∆ ∆ ∩ ∩ ∩ ∑ ∑ ∑ ∆ ∆ ∆ ∩ ∩ ∩ ∑ ∑ ∑ 3 An Empirical Study of High Availability in DSPS 12/3/2009

  4. IBM Research Basic Tradeoff between AS and PS  Active Standby – Overhead: double processing load; at least double message load – Recovery delay: almost zero  Passive Standby – Overhead: checkpoint messages – Recovery delay: failure detection + deploy new job + recover state 4 12/3/2009

  5. IBM Research Motivation  Tradeoffs of AS & PS not fully understood – Only systematic comparison: [Hwang ICDE05] • Used a variant of PS with high overhead • Evaluated in simulations rather than real systems  Our contributions – A sweeping checkpointing method • Reducing checkpoint overhead by one order of magnitude • Proof of consistency – A real prototype distributed stream processing system – Comprehensive and empirical evaluation of AS and PS 5 An Empirical Study of High Availability in DSPS 12/3/2009

  6. IBM Research Outline  Background and Motivation  Design and Implementation – Sweeping Checkpointing – System Architecture  Performance Evaluation  Related Work  Conclusions 6 An Empirical Study of High Availability in DSPS 12/3/2009

  7. IBM Research Overview of Sweeping Checkpointing  What to include recoverable from – Input queues upstream output queues – Internal states dominating ckpt size – Output queues with high data rates  When to trim  Checkpointing Multiple PEs  Proof of consistency 7 An Empirical Study of High Availability in DSPS 12/3/2009

  8. IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 1 1 1 1 8 An Empirical Study of High Availability in DSPS 12/3/2009

  9. IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 1 1 1 9 An Empirical Study of High Availability in DSPS 12/3/2009

  10. IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 2 1 2 1 10 An Empirical Study of High Availability in DSPS 12/3/2009

  11. IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D checkpoint upstream downstream node U node D ≡ √ ≡ 5 4 3 2 2 1 2 2 1 1 11 An Empirical Study of High Availability in DSPS 12/3/2009

  12. IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D ≡ 2 1 checkpoint upstream downstream node U node D √ ≡ 5 4 3 2 2 1 2 1 1 and 2 have been processed and checkpointed 12 An Empirical Study of High Availability in DSPS 12/3/2009

  13. IBM Research Checkpointing Multiple PEs – Synchronous  Freeze all PEs, then snapshot of the checkpoint all state, whole sub job then resume all PEs checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ ≡ √ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 13 An Empirical Study of High Availability in DSPS 12/3/2009

  14. IBM Research Checkpointing Multiple PEs – Individual  Freeze / checkpoint / resume each PE individually checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 14 An Empirical Study of High Availability in DSPS 12/3/2009

  15. IBM Research Checkpointing Multiple PEs – Sweeping  Checkpoint a PE immediately after receipt of acknowledgement and output queue trimming checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 15 An Empirical Study of High Availability in DSPS 12/3/2009

  16. IBM Research Sketch of Proof for Consistency  Scenario: single node failure (N i ) only trimmed to reflect – Actions for recovery latest checkpoint of N i • Recovering operator state • Recovering input queue from output queues of upstream • Reprocessing affected elements  Scenario: multiple concurrent node failures – Actions for recovery • Finding and recovering most upstream failed node • Recovering other nodes recursively 16 An Empirical Study of High Availability in DSPS 12/3/2009

  17. IBM Research System Architecture  Remote Execution Coordinator – manage HA protection for distributed jobs  Node Job Management – manage job deployment  Checkpoint Manager FM REC – manage checkpoint tasks according to assigned checkpoint mechanism CM  Failover Manager JMN monitor other nodes and initiate recovery –  Jobs and Processing Nodes – take data from upstream, execute processing tasks, and send results to downstream Job ∩  Features: – A distributed job consists of multiple subjobs, each of which can choose its own specific HA mechanism (AS, PS) ∆ ∑ – The system coordinates the deployment and protection of subjobs among all machines Job 17 An Empirical Study of High Availability in DSPS 12/3/2009

  18. IBM Research Outline  Background and Motivation  Design and Implementation  Performance Evaluation – Experiment Setup – Overhead and Delay Results  Related Work  Conclusions 18 An Empirical Study of High Availability in DSPS 12/3/2009

  19. IBM Research Experiment Setup  Testbed: a cluster environment – Dual Xeon 3.06GHz CPUs, 800MHz, 512KB L2 caches, 4GB memory, 80GB disk – 1Gbps LAN – A distributed job containing 4 subjobs, each having 2 processing nodes running on one machine  Metrics – Recovery delay – Message overhead 19 An Empirical Study of High Availability in DSPS 12/3/2009

  20. IBM Research Avg. Checkpoint Queue Size Comparison 3000 elements/second Sweeping reduces checkpoint size by about 96% 20 20 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

  21. IBM Research Checkpoint Time Comparison checkpoint interval = 500 ms Sweeping reduces checkpoint time by about 75% 21 21 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

  22. IBM Research Message Overhead Comparison AS-AS incurs almost 4 times message overhead vs. PS 22 22 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

  23. IBM Research Recovery Delay Decomposition Detection delay becomes dominant with large heartbeat interval 23 23 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

  24. IBM Research Outline  Background and Motivation  Design and Implementation  Performance Evaluation  Related Work  Conclusions 24 An Empirical Study of High Availability in DSPS 12/3/2009

  25. IBM Research Related Work  Borealis 1. “ Fault tolerance in the Borealis distributed stream processing system ” (SIGMOD ‘05) A variant of AS  Achieving flexible trade-off between availability and consistency by introducing tentative data  concept 2. “ Fast and reliable stream processing over wide area networks ” (ICDE ’07) A variant of AS  Most expensive variant; upstream sending to all downstream replicas  No switch required when failure occurs  3. “ A cooperative, self-configuring high-availability solution for stream processing ” (ICDE ‘07) A variant of PS  Novel checkpoint scheduling and backup assignment  Balances recovery load over multiple servers  4. “ Borealis-R: a replication-transparent stream processing system for wide-area monitoring applications ” (SIGMOD ‘08) A variant of AS  Same technique as in [2]  Novel mechanism to allow replicas execute without coordination but still produce consistent  results 25 25 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

  26. IBM Research Related Work  System S 5. “Towards automatic fault recovery in System-S” (ICAC ‘07) Checkpoint state  Recovery of JMN, not jobs  6. “Failure recovery in cooperative data streaming analysis” (ARES ’07) How to select a backup site on demand, not recovery technique  7. “Online failure forecast for fault-tolerant data stream processing” (ICDE ‘08) Prediction of potential failures, a monitoring technique  Leverages varies system metrics (system productivity, available CPU, etc.) to predict failures  before they occur  Comparison of AS and PS 8. “High-availability algorithms for distributed stream processing” (ICDE ‘05) Valuable summaries of basic tradeoffs  PS variant has large overhead  Evaluation mainly based on simulations  26 26 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend