Logo des Repositoriums
 

Workload-Driven Data Placement for Tierless In-Memory Database Systems

dc.contributor.authorHurdelhey, Ben
dc.contributor.authorWeisgut, Marcel
dc.contributor.authorBoissier, Martin
dc.contributor.editorKönig-Ries, Birgitta
dc.contributor.editorScherzinger, Stefanie
dc.contributor.editorLehner, Wolfgang
dc.contributor.editorVossen, Gottfried
dc.date.accessioned2023-02-23T13:59:49Z
dc.date.available2023-02-23T13:59:49Z
dc.date.issued2023
dc.description.abstractHigh main memory consumption is a significant cost factor for in-memory database systems. Tiering, i.e., placing parts of the data on memory or storage devices other than DRAM, reduces the main memory footprint. A controlled data placement can assign rarely accessed data to slow devices while frequently used data remains on fast devices, such as main memory, to maintain acceptable query latencies. We present an automatic data placement decision system for the in-memory database Hyrise. The system organizes the memory and storage devices in a tierless pool, with no fixed device class categorization or performance order. The system supports data placement use cases, such as minimizing end-to-end query latencies and making cost-optimal purchase recommendations in cloud environments. In this paper, we introduce an efficient calibration process to derive cost models for various storage devices. To determine data placements, we introduce a linear programming-based approach, which yields optimal configurations, and an efficient heuristic. With a set of main memory and SSD devices, we can reduce the main memory consumption for base table data of the TPC-DS benchmark by 74 percent when accepting a workload latency increase of 52 percent. In a comparison of data placement algorithms and cost models, we find that simplistic algorithms (e.g., greedy algorithms) can present viable alternatives to optimal linear programming algorithms, especially under cost prediction inaccuracies.en
dc.identifier.doi10.18420/BTW2023-02
dc.identifier.isbn978-3-88579-725-8
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/40324
dc.language.isoen
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofBTW 2023
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-331
dc.subjectTiering
dc.subjectData Placement
dc.subjectIn-Memory Database Systems
dc.subjectLinear Programming
dc.subjectCost Models
dc.titleWorkload-Driven Data Placement for Tierless In-Memory Database Systemsen
dc.typeText/Conference Paper
gi.citation.endPage70
gi.citation.publisherPlaceBonn
gi.citation.startPage47
gi.conference.date06.-10. März 2023
gi.conference.locationDresden, Germany

Dateien

Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
B1-2.pdf
Größe:
804.4 KB
Format:
Adobe Portable Document Format