Logo des Repositoriums
 

Recursive SQL and GPU-Support for In-Database Machine Learning

dc.contributor.authorSchüle, Maximilian Emanuel
dc.contributor.editorKönig-Ries, Birgitta
dc.contributor.editorScherzinger, Stefanie
dc.contributor.editorLehner, Wolfgang
dc.contributor.editorVossen, Gottfried
dc.date.accessioned2023-02-23T14:00:12Z
dc.date.available2023-02-23T14:00:12Z
dc.date.issued2023
dc.description.abstractBut to retrieve the latest data from a database, time-consuming extraction is necessary as database systems have rarely been used for operations such as matrix algebra and gradient descent.In this work, we demonstrate that SQL with recursive tables makes it possible to express a complete machine learning pipeline out of data preprocessing, model training and its validation.To facilitate the specification of loss functions, we extend the code-generating database system Umbra by an operator for automatic differentiation for use within recursive tables:With the loss function expressed in SQL as a lambda function, Umbra generates machine code for each partial derivative.We further use automatic differentiation for a dedicated gradient descent operator, which generates LLVM code to train a user-specified model on GPUs.We fine-tune GPU kernels at hardware level to allow a higher throughput and propose non-blocking synchronisation of multiple units.In our evaluation, automatic differentiation accelerated the runtime by the number of cached subexpressions compared to compiling each derivative separately.Our GPU kernels with independent models allowed maximal throughput even for small batch sizes, making machine learning pipelines within SQL more competitive.en
dc.identifier.doi10.18420/BTW2023-62
dc.identifier.isbn978-3-88579-725-8
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/40371
dc.language.isoen
dc.publisherGesellschaft für Informatik e.V.
dc.relation.ispartofBTW 2023
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-331
dc.titleRecursive SQL and GPU-Support for In-Database Machine Learningen
dc.typeText/Conference Paper
gi.citation.endPage931
gi.citation.publisherPlaceBonn
gi.citation.startPage931
gi.conference.date06.-10. März 2023
gi.conference.locationDresden, Germany

Dateien

Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
C3-02.pdf
Größe:
133.8 KB
Format:
Adobe Portable Document Format