Streaming optimizations - CS 591 K1: Data Stream Processing and Analytics Spring 2020optimizations • plan translation alternatives • Runtime optimizations • load management, scheduling, state management • Optimization semantics, correctness, profitability Topics covered in this There may exist several ways to execute a computation • query plans, e.g. order of operators • scheduling and placement decisions • different algorithms, e.g. hash-based vs. broadcast join • What does data structures • sorting vs hashing • indexing, pre-fetching • minimize disk access • scheduling Objectives • optimize resource utilization or minimize resources • decrease latency, increase0 码力 | 54 页 | 2.83 MB | 1 年前3
Course introduction - CS 591 K1: Data Stream Processing and Analytics Spring 2020representing, summarizing, and analyzing data streams Systems Algorithms Architecture and design Scheduling and load management Scalability and elasticity Fault-tolerance and guarantees State management management • Analysis of real-time vehicle locations to improve traffic conditions • Provide real-time scheduling information for public transport • Optimize transport network flow and recommend alternative routes0 码力 | 34 页 | 2.53 MB | 1 年前3
PyFlink 1.15 DocumentationYARN Apache Hadoop YARN is a cluster resource management framework for managing the resources and scheduling jobs in a Hadoop cluster. It’s supported to submit PyFlink jobs to YARN for execution. Set up0 码力 | 36 页 | 266.77 KB | 1 年前3
PyFlink 1.16 DocumentationYARN Apache Hadoop YARN is a cluster resource management framework for managing the resources and scheduling jobs in a Hadoop cluster. It’s supported to submit PyFlink jobs to YARN for execution. Set up0 码力 | 36 页 | 266.80 KB | 1 年前3
共 4 条
- 1













