Cardinality and frequency estimation - CS 591 K1: Data Stream Processing and Analytics Spring 2020f[j] = ci,j return min(f[1], f[2], …, f[p]) ??? Vasiliki Kalavri | Boston University 2020 24 Computing top-k ??? Vasiliki Kalavri | Boston University 2020 24 • Additional to the array of counter, we elements seen so far • a heap X* of up to k potential heavy hitters and their frequency estimations Computing top-k ??? Vasiliki Kalavri | Boston University 2020 24 • Additional to the array of counter, we frequency estimations • We use a frequency threshold f*=N/k to decide whether an element is popular Computing top-k ??? Vasiliki Kalavri | Boston University 2020 24 • Additional to the array of counter, we0 码力 | 69 页 | 630.01 KB | 1 年前3
PyFlink 1.15 Documentationcentral context for creating Table and SQL API programs. Flink is an unified streaming and batch computing engine, which provides unified streaming and batch API to create a TableEnvironment. TableEnvironment and central concept for creating DataStream API programs. Flink is an unified streaming and batch computing engine, which provides unified streaming and batch API to create a StreamExecutionEnvironment.0 码力 | 36 页 | 266.77 KB | 1 年前3
PyFlink 1.16 Documentationcentral context for creating Table and SQL API programs. Flink is an unified streaming and batch computing engine, which provides unified streaming and batch API to create a TableEnvironment. TableEnvironment and central concept for creating DataStream API programs. Flink is an unified streaming and batch computing engine, which provides unified streaming and batch API to create a StreamExecutionEnvironment.0 码力 | 36 页 | 266.80 KB | 1 年前3
Flow control and load shedding - CS 591 K1: Data Stream Processing and Analytics Spring 2020discard by relying on the notion of a window-based concept drift. • The metric is defined by computing a similarity metric across windows. 18 ??? Vasiliki Kalavri | Boston University 2020 How many0 码力 | 43 页 | 2.42 MB | 1 年前3
Streaming optimizations - CS 591 K1: Data Stream Processing and Analytics Spring 2020University 2020 53 • Martin Hirzel et. al. A Catalog of Stream Processing Optimizations. (ACM Computing Surveys 2014). • Ron Avnur and Joseph M. Hellerstein. Eddies: continuously adaptive query processing0 码力 | 54 页 | 2.83 MB | 1 年前3
Exactly-once fault-tolerance in Apache Flink - CS 591 K1: Data Stream Processing and Analytics Spring 2020-algorithm/ • A video lecture on global snapshots: https://www.coursera.org/lecture/ cloud-computing/1-2-global-snapshot-algorithm-hndGi 520 码力 | 81 页 | 13.18 MB | 1 年前3
共 6 条
- 1













