Logo des Repositoriums
 

Computation Offloading in JVM-based Dataflow Engines

dc.contributor.authorGavriilidis, Haralampos
dc.contributor.editorMeyer, Holger
dc.contributor.editorRitter, Norbert
dc.contributor.editorThor, Andreas
dc.contributor.editorNicklas, Daniela
dc.contributor.editorHeuer, Andreas
dc.contributor.editorKlettke, Meike
dc.date.accessioned2019-04-15T11:40:33Z
dc.date.available2019-04-15T11:40:33Z
dc.date.issued2019
dc.description.abstractState-of-the-art dataflow engines, such as Apache Spark and Apache Flink scale out on large clusters for a variety of data-processing tasks, including machine learning and data mining algorithms. However, being based on the JVM, they are unable to apply optimizations supported by modern CPUs. On the contrary, specialized data processing frameworks scale up by exploiting modern CPU characteristics. The goal of this thesis is to find the sweet spot between scale-out and scale-up systems by offloading computation from dataflow engines to specialized systems. We propose two computation offloading methods, reason about their applicability, and implement a prototype based on Apache Spark. Our evaluation shows that for compute-intensive tasks, computation offloading leads to performance improvements of up to a factor of 2.5x. For certain UDF scenarios, computation offloading performs worse by up to a factor of 3x: our microbenchmarks show that 80% of the time is spent on serialization operations. By employing data exchange without serialization, computation offloading achieves performance improvements by up to 10x.en
dc.identifier.doi10.18420/btw2019-ws-20
dc.identifier.isbn978-3-88579-684-8
dc.identifier.pissn1617-5468
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/21807
dc.language.isoen
dc.publisherGesellschaft für Informatik, Bonn
dc.relation.ispartofBTW 2019 – Workshopband
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) – Proceedings, Volume P-290
dc.subjectdataflow engines
dc.subjectcomputation offloading
dc.subjectdata exchange
dc.subjectnative execution
dc.titleComputation Offloading in JVM-based Dataflow Enginesen
gi.citation.endPage204
gi.citation.startPage195
gi.conference.date4.-8. März 2019
gi.conference.locationRostock
gi.conference.sessiontitleStudierendenprogramm

Dateien

Originalbündel
1 - 1 von 1
Lade...
Vorschaubild
Name:
D1-4.pdf
Größe:
282.21 KB
Format:
Adobe Portable Document Format