TY - GEN
T1 - Analytics on fast data
T2 - 20th International Conference on Extending Database Technology, EDBT 2017
AU - Kipf, Andreas
AU - Braun, Lucas
AU - Pandey, Varun
AU - Neumann, Thomas
AU - Böttcher, Jan
AU - Kemper, Alfons
N1 - Publisher Copyright:
© 2017, Copyright is with the authors.
PY - 2017
Y1 - 2017
N2 - Today’s streaming applications demand increasingly high event throughput rates and are often subject to strict latency constraints. To allow for more complex workloads, such as window-based aggregations, streaming systems need to support stateful event processing. This introduces new challenges for streaming engines as the state needs to be maintained in a consistent and durable manner and simultaneously accessed by complex queries for real-time analytics. Modern streaming systems, such as Apache Flink, do not allow for efficiently exposing the state to analytical queries. Thus, data engineers are forced to keep the state in external data stores, which significantly increases the latencies until events are visible to analytical queries. Proprietary solutions have been created to meet data freshness constraints. These solutions are expensive, error-prone, and difficult to maintain. Main-memory database systems, such as HyPer, achieve extremely low query response times while maintaining high update rates, which makes them well-suited for analytical streaming workloads. In this paper, we identify potential extensions to database systems to match the performance and usability of streaming systems.
AB - Today’s streaming applications demand increasingly high event throughput rates and are often subject to strict latency constraints. To allow for more complex workloads, such as window-based aggregations, streaming systems need to support stateful event processing. This introduces new challenges for streaming engines as the state needs to be maintained in a consistent and durable manner and simultaneously accessed by complex queries for real-time analytics. Modern streaming systems, such as Apache Flink, do not allow for efficiently exposing the state to analytical queries. Thus, data engineers are forced to keep the state in external data stores, which significantly increases the latencies until events are visible to analytical queries. Proprietary solutions have been created to meet data freshness constraints. These solutions are expensive, error-prone, and difficult to maintain. Main-memory database systems, such as HyPer, achieve extremely low query response times while maintaining high update rates, which makes them well-suited for analytical streaming workloads. In this paper, we identify potential extensions to database systems to match the performance and usability of streaming systems.
KW - Main-memory database systems
KW - Stream processing
UR - http://www.scopus.com/inward/record.url?scp=85045700806&partnerID=8YFLogxK
U2 - 10.5441/002/edbt.2017.06
DO - 10.5441/002/edbt.2017.06
M3 - Conference contribution
AN - SCOPUS:85045700806
T3 - Advances in Database Technology - EDBT
SP - 49
EP - 60
BT - Advances in Database Technology - EDBT 2017
A2 - Mitschang, Bernhard
A2 - Markl, Volker
A2 - Bress, Sebastian
A2 - Andritsos, Periklis
A2 - Sattler, Kai-Uwe
A2 - Orlando, Salvatore
PB - OpenProceedings.org
Y2 - 21 March 2017 through 24 March 2017
ER -