Streaming optimizations - CS 591 K1: Data Stream Processing and Analytics Spring 2020
Non-fused operators can run on different threads • The optimizer can interact with the scheduler and fuse operators according to the number of available cores / threads • Fused operators can share the address address space but use separate threads of control • avoid communication cost without losing pipeline parallelism • use a shared buffer for communication • Fused filters / projections at the source0 码力 | 54 页 | 2.83 MB | 1 年前3Stream processing fundamentals - CS 591 K1: Data Stream Processing and Analytics Spring 2020
cyclic dataflows and iterations on streams • Operators are data-parallel • distributed workers (threads) execute one parallel instance of one of more operators on disjoint data partitions 36 Vasiliki0 码力 | 45 页 | 1.22 MB | 1 年前3Scalable Stream Processing - Spark Streaming and Flink
spark._ import org.apache.spark.streaming._ // Create a local StreamingContext with two working threads and batch interval of 1 second. val conf = new SparkConf().setMaster("local[2]").setAppName("NetworkWordCount")0 码力 | 113 页 | 1.22 MB | 1 年前3
共 3 条
- 1