Logo des Repositoriums
 

Predicting Scaling Efficiency of Distributed Stream Processing Systems via Task Level Performance Simulation

dc.contributor.authorRank, Johannes
dc.contributor.authorBarnert, Maximilian
dc.contributor.authorHein, Andreas
dc.contributor.authorKrcmar, Helmut
dc.contributor.editorHerrmann, Andrea
dc.date.accessioned2024-02-22T10:37:53Z
dc.date.available2024-02-22T10:37:53Z
dc.date.issued2023
dc.description.abstractStream processing systems (SPS) are a special class of Big Data systems that firms employ in (near) real time business scenarios. They ensure low-latency processing through a high degree of parallelization and elasticity. However, firms often do not know which scaling direction: horizontally, vertically, or mixed, is the best strategy in terms of CPU performance to scale those systems. Especially in cloud deployments with a pay-per-use model and cluster sizes that can span dozens of cores and machines, firms would profit from more accurate measurement-based approaches. In this paper, we show how to predict the CPU consumption of Apache Flink for different scaling scenarios using the Palladio Component Model. Our approach models the individual streaming tasks that make up the application and parametrizes it with fine grained CPU metrics obtained by combining BPF pro filing and querying the CPU’s performance measurement unit. Through this “task-level model approach”, we can achieve highly accurate predictions, despite using a simple model and only requiring a few mea surements for parametrization. Our experiment also shows that we achieve more accurate results than an alternative approach based on regression analysis.en
dc.identifier.issn0720-8928
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/43654
dc.language.isoen
dc.pubPlaceBonn
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofSoftwaretechnik-Trends Band 43, Heft 1
dc.relation.ispartofseriesSoftwaretechnik-Trends
dc.subjectStream processing
dc.subjectBig Data
dc.subjectparallelization
dc.subjectscaling
dc.subjectprediction
dc.subjectperformance
dc.subjectlatency
dc.titlePredicting Scaling Efficiency of Distributed Stream Processing Systems via Task Level Performance Simulationen
dc.typeText/Conference Paper
mci.conference.date7.-9.11.2022
mci.conference.locationStuttgart
mci.conference.sessiontitle13th Symposium on Software Performance (SSP)
mci.reference.pages14-16

Dateien

Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
SSP_22_paper_209.pdf
Größe:
726.56 KB
Format:
Adobe Portable Document Format