























Abstract:Temporal online analytical processing (OLAP) analyzes past states of data whose values change over time. Such histories are naturally stored as interval histories, in which each row records the period during which a value remained valid. Because temporal analyses typically arrive in infrequent, intermittent bursts, serverless execution that launches functions only at query time offers a cost advantage over always-on clusters. Splitting a computation that a single process performs as a whole across independent serverless functions, however, breaks correctness in two ways. A function may not receive the rows that determine the state of its time range, and naively summing partial results yields incorrect answers for duration-weighted and cumulative-threshold queries. Existing SQL engines and serverless analytics do not address both problems together.
This paper presents Vivace, a serverless system for exact temporal OLAP over interval histories. Vivace resolves the two problems in separate stages. Before any query arrives, a pre-query layout step partitions the interval history, replicating boundary-crossing intervals so each function computes its range completely from a single file. At query time, a merge step combines partial results under operator-specific rules. Associative aggregates merge intermediate values, and ranking re-orders candidates within each time range. We prove that this partitioned execution matches single-process computation up to canonical form. Evaluated on AWS Lambda with real-world datasets, Vivace reduces latency and monetary cost by up to 82% and 84%, respectively, against an equivalent SQL baseline that queries the history directly, demonstrating robust generality and efficiency.
From: Taeyoon Kim [view email]
[v1]
Fri, 12 Jun 2026 03:31:51 UTC (1,279 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。