Sliding Window Join
Window joins and window functions are implemented by Flink, Spark, and other big data tools. I give an use-case involving 2 streams “metrics” and “events” below. In an ideal situation, machine-generated data would automatically alert the right set of people when a problem occurs. To do so, we need to do various “aggregations” and “joins” to connect a metric to the relevant employee. Window computations can make big data pipelines CPU bound or memory bound, based on my experience. I do a quick investigation of why that is the case here. ...