Evaluating Task-Level CPU Efficiency for Distributed Stream Processing Systems

Johannes Rank, Jonas Herget, Andreas Hein, Helmut Krcmar

Research output: Contribution to journalArticlepeer-review

Abstract

Big Data and primarily distributed stream processing systems (DSPSs) are growing in complexity and scale. As a result, effective performance management to ensure that these systems meet the required service level objectives (SLOs) is becoming increasingly difficult. A key factor to consider when evaluating the performance of a DSPS is CPU efficiency, which is the ratio of the workload processed by the system to the CPU resources invested. In this paper, we argue that developing new performance tools for creating DSPSs that can fulfill SLOs while using minimal resources is crucial. This is especially significant in edge computing situations where resources are limited and in large cloud deployments where conserving power and reducing computing expenses are essential. To address this challenge, we present a novel task-level approach for measuring CPU efficiency in DSPSs. Our approach supports various streaming frameworks, is adaptable, and comes with minimal overheads. This enables developers to understand the efficiency of different DSPSs at a granular level and provides insights that were not previously possible.

Original languageEnglish
Article number49
JournalBig Data and Cognitive Computing
Volume7
Issue number1
DOIs
StatePublished - Mar 2023

Keywords

  • CPU efficiency
  • big data
  • distributed stream processing
  • flink
  • performance
  • profiling
  • spark
  • task-level measurement

Fingerprint

Dive into the research topics of 'Evaluating Task-Level CPU Efficiency for Distributed Stream Processing Systems'. Together they form a unique fingerprint.

Cite this