Graph streaming algorithms - CS 591 K1: Data Stream Processing and Analytics Spring 2020
[Figure residue: example edge stream] ... Vasiliki Kalavri | Boston University 2020 ... A component is a subgraph in which every vertex is reachable from all other vertices in the subgraph. Connected ... State: the graph and a component ID per vertex • initially equal to vertex ID • Iterative step: For each vertex • choose the min of neighbors’ component IDs and own component ID as the new ID • if the component ID changed since the last iteration, notify neighbors ... Batch Connected Components (i=0) ...
0 points | 72 pages | 7.77 MB | 1 year ago
PyFlink 1.15 Documentation
Table Creation: Table is a core component of the Python Table API. A Table object describes a pipeline of data transformations. It does not ... DataStream Creation: DataStream is a core component of the Python DataStream API. A DataStream object describes a pipeline of data transformations.
0 points | 36 pages | 266.77 KB | 1 year ago
PyFlink 1.16 Documentation
Table Creation: Table is a core component of the Python Table API. A Table object describes a pipeline of data transformations. It does not ... DataStream Creation: DataStream is a core component of the Python DataStream API. A DataStream object describes a pipeline of data transformations.
0 points | 36 pages | 266.80 KB | 1 year ago
State management - CS 591 K1: Data Stream Processing and Analytics Spring 2020
Operator state ... Keyed state ... State types ... Vasiliki Kalavri | Boston University 2020 ... A pluggable component that determines how state is stored, accessed, and maintained. State backends are responsible ...
0 points | 24 pages | 914.13 KB | 1 year ago
Windows and triggers - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... trigger, evictor, evaluation function, result stream ... Custom windows • Describe each component ... Vasiliki Kalavri | Boston University 2020 ... [Figure residue: example input stream] Window max over 5 last elements ...
0 points | 35 pages | 444.84 KB | 1 year ago
Flow control and load shedding - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... Boston University 2020 ... Implementation • Load shedding is commonly implemented by a standalone component integrated with the stream processor • The load shedder continuously monitors input rates or ...
0 points | 43 pages | 2.42 MB | 1 year ago
Streaming optimizations - CS 591 K1: Data Stream Processing and Analytics Spring 2020
Apache Flink • The TaskManagers ship data from sending tasks to receiving tasks. • The network component of a TaskManager collects records in buffers before they are shipped, i.e., records are not shipped ...
0 points | 54 pages | 2.83 MB | 1 year ago
Filtering and sampling streams - CS 591 K1: Data Stream Processing and Analytics Spring 2020
... Synopsis: a lossy, compact summary of the input stream ... [Figure residue: input stream, synopsis maintenance component, user queries, approximate results] ... Vasiliki Kalavri | Boston University 2020 ... A simple and ...
0 points | 74 pages | 1.06 MB | 1 year ago
8 results in total