Fault-tolerance demo & reconfiguration - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... of reconfiguration • Ensure result correctness • reconfiguration mechanism often relies on fault-tolerance mechanism • State re-partitioning and migration • minimize communication • keep duration short
41 pages | 4.09 MB | 2 years ago
Exactly-once fault-tolerance in Apache Flink - CS 591 K1: Data Stream Processing and Analytics Spring
Exactly-once fault-tolerance in Apache Flink Vasiliki (Vasia) Kalavri vkalavri@bu.edu ... Go read his PhD thesis: http://kth.diva-portal.org/smash/get/diva2:1240814/FULLTEXT01.pdf ## Fault-tolerance approaches ...
81 pages | 13.18 MB | 2 years ago
High-availability, recovery semantics, and guarantees - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... guarantees Vasiliki (Vasia) Kalavri vkalavri@bu.edu ## Today's topics • High-availability and fault-tolerance in distributed stream processing • Recovery semantics and guarantees • Exactly-once processing ... you think of an operator that will diverge? ## Fault-tolerance trade-offs ## Steady-state overhead • How is performance affected by the fault-tolerance mechanism under normal, failure-free operation ...
49 pages | 2.08 MB | 2 years ago
Course introduction - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... Systems Architecture and design Scheduling and load management Scalability and elasticity Fault-tolerance and guarantees State management Operator semantics Window optimizations Filtering, counting ... updates Debugging Order Processing guarantees Retractions & results amendment Progress Fault-tolerance & high-availability ## Building a stream processor ...
... High Availability Fault-tolerance for masters, workers, and etcd nodes ...












