Fast Approximated Nearest Neighbor Joins For Relational Database Systems
Zusammenfassung
K nearest neighbor search (kNN-Search) is a universal data processing technique and a fundamental operation for word embeddings trained by word2vec or related approaches. The benefits of operations on dense vectors like word embeddings for analytical functionalities of RDBMSs motivate an integration of kNN-Joins. However, kNN-Search, as well as kNN-Joins, have barely been integrated into relational database systems so far. In this paper, we develop an index structure for approximated kNN-Joins working well on high-dimensional data and provide an integration into PostgreSQL. The novel index structure is efficient for different cardinalities of the involved join partners. An evaluation of the system based on applications on word embeddings shows the benefits of such an integrated kNN-Join operation and the performance of the proposed approach.
- Vollständige Referenz
- BibTeX
Günther, M., Thiele, M. & Lehner, W.,
(2019).
Fast Approximated Nearest Neighbor Joins For Relational Database Systems.
In:
Grust, T., Naumann, F., Böhm, A., Lehner, W., Härder, T., Rahm, E., Heuer, A., Klettke, M. & Meyer, H.
(Hrsg.),
BTW 2019.
Gesellschaft für Informatik, Bonn.
(S. 225-244).
DOI: 10.18420/btw2019-15
@inproceedings{mci/Günther2019,
author = {Günther, Michael AND Thiele, Maik AND Lehner, Wolfgang},
title = {Fast Approximated Nearest Neighbor Joins For Relational Database Systems},
booktitle = {BTW 2019},
year = {2019},
editor = {Grust, Torsten AND Naumann, Felix AND Böhm, Alexander AND Lehner, Wolfgang AND Härder, Theo AND Rahm, Erhard AND Heuer, Andreas AND Klettke, Meike AND Meyer, Holger} ,
pages = { 225-244 } ,
doi = { 10.18420/btw2019-15 },
publisher = {Gesellschaft für Informatik, Bonn},
address = {}
}
author = {Günther, Michael AND Thiele, Maik AND Lehner, Wolfgang},
title = {Fast Approximated Nearest Neighbor Joins For Relational Database Systems},
booktitle = {BTW 2019},
year = {2019},
editor = {Grust, Torsten AND Naumann, Felix AND Böhm, Alexander AND Lehner, Wolfgang AND Härder, Theo AND Rahm, Erhard AND Heuer, Andreas AND Klettke, Meike AND Meyer, Holger} ,
pages = { 225-244 } ,
doi = { 10.18420/btw2019-15 },
publisher = {Gesellschaft für Informatik, Bonn},
address = {}
}
Sollte hier kein Volltext (PDF) verlinkt sein, dann kann es sein, dass dieser aus verschiedenen Gruenden (z.B. Lizenzen oder Copyright) nur in einer anderen Digital Library verfuegbar ist. Versuchen Sie in diesem Fall einen Zugriff ueber die verlinkte DOI: 10.18420/btw2019-15
Haben Sie fehlerhafte Angaben entdeckt? Sagen Sie uns Bescheid: Feedback abschicken