Publication
Efficient Window Aggregation with General Stream Slicing
Jonas Traub; Philipp Grulich; Alejandro Rodríguez Cuéllar; Sebastian Breß; Asterios Katsifodimos; Tilmann Rabl; Volker Markl
In: 22nd International Conference on Extending Database Technology (EDBT). International Conference on Extending Database Technology (EDBT-2019), 22nd, March 26-29, Lisbon, Portugal, OpenProceedings, 2019.
Abstract
Window aggregation is a core operation in data stream processing. Existing aggregation techniques focus on reducing latency, eliminating redundant computations, and minimizing memory usage. However, each technique operates under different assumptions with respect to workload characteristics such as properties of aggregation functions (e.g., invertible, associative), window types (e.g., sliding, sessions), windowing measures (e.g., time- or count-based), and stream (dis)order. Violating the assumptions of a technique can render it unusable or drastically reduce its performance.
In this paper, we present the first general stream slicing technique for window aggregation. General stream slicing automatically adapts to workload characteristics to improve performance without sacrificing its general applicability. As a prerequisite, we identify workload characteristics which affect the performance and applicability of aggregation techniques. Our experiments show that general stream slicing outperforms alternative concepts by up to one order of magnitude.
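To illustrate the basic idea behind slicing, the following minimal Python sketch divides a stream into non-overlapping slices, keeps one partial aggregate per slice, and combines those partials to answer overlapping sliding windows. It is not the implementation from the paper. It assumes integer timestamps starting at 0, time-based sliding windows whose size is a multiple of the slide, and an associative aggregation function; the names `slice_aggregates` and `sliding_windows` are hypothetical.

```python
from collections import defaultdict


def slice_aggregates(events, slice_len, combine, identity):
    """Fold each (timestamp, value) event into the partial aggregate of its slice."""
    slices = defaultdict(lambda: identity)
    for ts, value in events:
        idx = ts // slice_len  # each event belongs to exactly one slice
        slices[idx] = combine(slices[idx], value)
    return dict(slices)


def sliding_windows(events, size, slide, combine, identity):
    """Return {(start, end): aggregate} for windows [k*slide, k*slide + size).

    Assumption of this sketch: size is a multiple of slide, so every
    window is an exact union of slices of length `slide`.
    """
    if size % slide != 0:
        raise ValueError("this sketch requires size to be a multiple of slide")
    slices = slice_aggregates(events, slide, combine, identity)
    if not slices:
        return {}
    slices_per_window = size // slide
    results = {}
    for first in range(max(slices) + 1):
        agg = identity
        # Combine the shared per-slice partials instead of re-reading events.
        for i in range(first, first + slices_per_window):
            if i in slices:
                agg = combine(agg, slices[i])
        results[(first * slide, first * slide + size)] = agg
    return results


events = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 6)]
sums = sliding_windows(events, size=4, slide=2,
                       combine=lambda a, b: a + b, identity=0)
print(sums)  # {(0, 4): 10, (2, 6): 18, (4, 8): 11}
```

Each event is aggregated once, into its slice, and overlapping windows reuse the same partial aggregates. The sketch covers only one combination of workload characteristics and does not adapt to others.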