Fault-tolerance demo & reconfiguration - CS 591 K1: Data Stream Processing and Analytics Spring 2020University 2020 • Change parallelism • scale out to process increased load • scale in to save resources • Fix bugs or change business logic • Optimize execution plan • Change operator placement predict their effects, and decide which and when to apply • Allocate new resources, spawn new processes or release unused resources, safely terminate processes • Adjust dataflow channels and network connections predict their effects, and decide which and when to apply • Allocate new resources, spawn new processes or release unused resources, safely terminate processes • Adjust dataflow channels and network connections0 码力 | 41 页 | 4.09 MB | 1 年前3
Streaming optimizations - CS 591 K1: Data Stream Processing and Analytics Spring 2020minimize disk access • scheduling Objectives • optimize resource utilization or minimize resources • decrease latency, increase throughput • minimize monetary costs (if running in the cloud) Safety • Ensure resource kinds: all resources required by a fused operator should remain available. • Ensure resource amounts: the total amount of resources required by the fused operator must be | Boston University 2020 35 Safety • Ensure resource availability: the host must have enough resources for all assigned operators • Ensure security constraints: what are the trusted hosts for each0 码力 | 54 页 | 2.83 MB | 1 年前3
监控Apache Flink应用程序(入门).............................................................................. 22 4.14 System Resources................................................................................................ decreasing the number of task slots per TaskManager (in case of a Standalone setup), by providing more resources to the TaskManager (in case of a containerized setup), or by providing more TaskManagers. In general ease-1.7/monitoring/metrics.html#system-resources 10 https://ci.apache.org/projects/flink/flink-docs-release-1.7/monitoring/metrics.html 4.14 System Resources In addition to the JVM metrics above, it0 码力 | 23 页 | 148.62 KB | 1 年前3
Introduction to Apache Flink and Apache Kafka - CS 591 K1: Data Stream Processing and Analytics Spring 2020file:///home/user/wordcount_out 19 Flink commands Vasiliki Kalavri | Boston University 2020 Resources • Documentation • https://flink.apache.org/ • Community • https://flink.apache.org/community failures without losing any records committed to the log. Vasiliki Kalavri | Boston University 2020 Resources • Documentation • https://kafka.apache.org/ • Community • https://kafka.apache.org/contact0 码力 | 26 页 | 3.33 MB | 1 年前3
Flow control and load shedding - CS 591 K1: Data Stream Processing and Analytics Spring 2020Scale resource allocation: • Addresses the case of increased load and additionally ensures no resources are left idle when the input load decreases. ??? Vasiliki Kalavri | Boston University 2020 Load system processing capacity H: headroom factor, i.e. a conservative estimate of the percentage of resources required by the system at steady state Load(N(I)): the load as a fraction of the total capacity0 码力 | 43 页 | 2.42 MB | 1 年前3
Elasticity and state migration: Part I - CS 591 K1: Data Stream Processing and Analytics Spring 2020predict their effects, and decide which and when to apply • Allocate new resources, spawn new processes or release unused resources, safely terminate processes • Adjust dataflow channels and network connections0 码力 | 93 页 | 2.42 MB | 1 年前3
PyFlink 1.15 Documentation1.1.1.4 YARN Apache Hadoop YARN is a cluster resource management framework for managing the resources and scheduling jobs in a Hadoop cluster. It’s supported to submit PyFlink jobs to YARN for execution talking with Kubernetes and allocating and de-allocating TaskManagers depending on the required resources. ./bin/flink run-application \ --target kubernetes-application \ --parallelism 8 \ -Dkubernetes0 码力 | 36 页 | 266.77 KB | 1 年前3
PyFlink 1.16 Documentation1.1.1.4 YARN Apache Hadoop YARN is a cluster resource management framework for managing the resources and scheduling jobs in a Hadoop cluster. It’s supported to submit PyFlink jobs to YARN for execution talking with Kubernetes and allocating and de-allocating TaskManagers depending on the required resources. ./bin/flink run-application \ --target kubernetes-application \ --parallelism 8 \ -Dkubernetes0 码力 | 36 页 | 266.80 KB | 1 年前3
Course introduction - CS 591 K1: Data Stream Processing and Analytics Spring 2020joins • Examples • Real-time statistics, e.g. weather maps • Monitor conditions to adjust resources, e.g. power generation • Monitor energy consumption for billing purposes 22 Vasiliki Kalavri0 码力 | 34 页 | 2.53 MB | 1 年前3
State management - CS 591 K1: Data Stream Processing and Analytics Spring 2020Dive on Apache Flink State - Seth Wiesman: https:// www.youtube.com/watch?v=9GF8Hwqzwnk Further resources 240 码力 | 24 页 | 914.13 KB | 1 年前3
共 12 条
- 1
- 2













