Learn What Really Matters: A Learning-to-Rank Approach for ML-based Query Optimization

Behr, Henriette; Markl, Volker; Kaoudi, Zoi

Learn What Really Matters: A Learning-to-Rank Approach for ML-based Query Optimization

dc.contributor.author	Behr, Henriette
dc.contributor.author	Markl, Volker
dc.contributor.author	Kaoudi, Zoi
dc.contributor.editor	König-Ries, Birgitta
dc.contributor.editor	Scherzinger, Stefanie
dc.contributor.editor	Lehner, Wolfgang
dc.contributor.editor	Vossen, Gottfried
dc.date.accessioned	2023-02-23T13:59:51Z
dc.date.available	2023-02-23T13:59:51Z
dc.date.issued	2023
dc.description.abstract	Query optimization is crucial for any data management system to achieve good performance. Recent advancements in Machine Learning (ML) have led to several efforts in the database research community that aim at improving query optimization with the help of ML. In particular, many works propose replacing the cost model used during plan enumeration with an ML model. The goal of these works is to learn a regression model from previously executed query plans that estimates the runtime of a given plan. Interestingly, it is well-known that what really matters in query optimization is the order of the query plans and not their actual cost or runtime. We thus take a learning-to-rank approach and propose a novel neural network model architecture that considers a plan in comparison with other equivalent plans that belong to the same query. We use our model architecture together with a loss function that incorporates ranking metrics into the learning process to highlight the learning-to-rank objective.To enable training, we first extract features from query plans by adapting a state-of-the-art deep learning approach so that all features are independent of the input dataset schema. Second, we devise two score functions that map the runtime of plans to scores which are then used as labels. We integrate the trained model into an adapted bottom-up plan enumeration algorithm that finds the best possible execution plan for a given query. We evaluate our approach against two state-of-the-art ML models and the highly tuned cost model of a commercial database and measure the runtime of the plans chosen in each case when executed in the database. We show that our approach achieves up to an order of magnitude better query performance than the comparison models and is able to either match (for short and medium-running queries) or outperform the commercial database (up to 5x for long-running queries).	en
dc.identifier.doi	10.18420/BTW2023-25
dc.identifier.isbn	978-3-88579-725-8
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/40330
dc.language.iso	en
dc.publisher	Gesellschaft für Informatik e.V.
dc.relation.ispartof	BTW 2023
dc.relation.ispartofseries	Lecture Notes in Informatics (LNI) - Proceedings, Volume P-331
dc.subject	query optimization
dc.subject	learning-to-rank
dc.subject	cost model
dc.title	Learn What Really Matters: A Learning-to-Rank Approach for ML-based Query Optimization	en
dc.type	Text/Conference Paper
gi.citation.endPage	554
gi.citation.publisherPlace	Bonn
gi.citation.startPage	535
gi.conference.date	06.-10. März 2023
gi.conference.location	Dresden, Germany

Dateien

Originalbündel

1 - 1 von 1

Name:: B5-2.pdf
Größe:: 712.64 KB
Format:: Adobe Portable Document Format

Herunterladen

Sammlungen

P331 - BTW2023- Datenbanksysteme für Business, Technologie und Web