Flexible data partitioning schemes for parallel merge joins in semantic web queries
Abstract
In the context of the Semantic Web, large amounts of data must be preprocessed and stored so that they can be queried efficiently later. The key technology in this topic are triple stores, in which all information is stored in the form of (subject, predicate and object) triple patterns. Depending on the triple patterns used within the queries, very different value distributions can be observed within these datasets. Currently, these properties are only exploited implicitly during join optimization in the form of histograms or similar technologies. This paper proposes a new way to take advantage of these different distributions using different partitioning schemes at runtime. This means that an optimal partitioning scheme can be used depending on the data access in order to improve query performance. In the experiments we achieve speedups up to a factor of 5.92 in comparison to no partitioning, and a performance improvement of up to 81% compared to a not optimal number of partitions.
- Citation
- BibTeX
Warnke, B., Rehan, M. W., Fischer, S. & Groppe, S.,
(2021).
Flexible data partitioning schemes for parallel merge joins in semantic web queries.
In:
, ., , . & , .
(Hrsg.),
BTW 2021.
Gesellschaft für Informatik, Bonn.
(S. 237-256).
DOI: 10.18420/btw2021-12
@inproceedings{mci/Warnke2021,
author = {Warnke, Benjamin AND Rehan, Muhammad Waqas AND Fischer, Stefan AND Groppe, Sven},
title = {Flexible data partitioning schemes for parallel merge joins in semantic web queries},
booktitle = {BTW 2021},
year = {2021},
editor = {Kai-Uwe Sattler AND Melanie Herschel AND Wolfgang Lehner} ,
pages = { 237-256 } ,
doi = { 10.18420/btw2021-12 },
publisher = {Gesellschaft für Informatik, Bonn},
address = {}
}
author = {Warnke, Benjamin AND Rehan, Muhammad Waqas AND Fischer, Stefan AND Groppe, Sven},
title = {Flexible data partitioning schemes for parallel merge joins in semantic web queries},
booktitle = {BTW 2021},
year = {2021},
editor = {Kai-Uwe Sattler AND Melanie Herschel AND Wolfgang Lehner} ,
pages = { 237-256 } ,
doi = { 10.18420/btw2021-12 },
publisher = {Gesellschaft für Informatik, Bonn},
address = {}
}
Sollte hier kein Volltext (PDF) verlinkt sein, dann kann es sein, dass dieser aus verschiedenen Gruenden (z.B. Lizenzen oder Copyright) nur in einer anderen Digital Library verfuegbar ist. Versuchen Sie in diesem Fall einen Zugriff ueber die verlinkte DOI: 10.18420/btw2021-12
Haben Sie fehlerhafte Angaben entdeckt? Sagen Sie uns Bescheid: Send Feedback
More Info
DOI: 10.18420/btw2021-12
ISBN: 978-3-88579-705-0
ISSN: 1617-5468
xmlui.MetaDataDisplay.field.date: 2021
Language:
(en)
