Combining Programming-by-Example with Transformation Discovery from large Databases
Abstract
Data transformation discovery is one of the most tedious tasks in data preparation. In particular, generating transformation programs for semantic transformations is difficult because additional sources for look-up operations are required. Current systems for semantic transformation discovery face one of two major problems: either they follow a program synthesis approach that scales only to a small set of input tables, or they rely on extracting transformation functions from large corpora, which requires identifying exact transformations in those resources and is prone to noisy data. In this paper, we try to combine both approaches to benefit from large corpora and from the sophistication of program synthesis. To do so, we devise a retrieval and pruning strategy ensemble that extracts the most relevant tables for a given transformation task. The extracted resources can then be processed by a program synthesis engine to generate more accurate transformation results than state-of-the-art approaches.
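To make the retrieve-then-prune idea concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the names `coverage` and `retrieve_and_prune`, the thresholds `min_coverage` and `top_k`, and the toy corpus are all hypothetical. It scores each table by how many input-output examples co-occur in a single row, then keeps only the best-covering tables as candidates for a downstream synthesis engine.

```python
from typing import Dict, List, Tuple

Table = List[Tuple[str, ...]]  # a table is a list of rows (hypothetical model)


def coverage(table: Table, examples: List[Tuple[str, str]]) -> int:
    """Count example pairs whose input and output appear in the same row."""
    covered = 0
    for source, target in examples:
        if any(source in row and target in row for row in table):
            covered += 1
    return covered


def retrieve_and_prune(
    corpus: Dict[str, Table],
    examples: List[Tuple[str, str]],
    min_coverage: int = 2,  # assumed pruning threshold
    top_k: int = 3,         # assumed cap on tables passed to synthesis
) -> List[str]:
    """Return the names of up to top_k tables covering >= min_coverage examples."""
    scored = [(coverage(table, examples), name) for name, table in corpus.items()]
    kept = [(score, name) for score, name in scored if score >= min_coverage]
    # Highest coverage first; ties broken by table name for determinism.
    kept.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in kept[:top_k]]


# Toy corpus: only "airports" maps the example inputs to the example outputs.
corpus = {
    "airports": [("TXL", "Berlin"), ("CDG", "Paris"), ("JFK", "New York")],
    "capitals": [("Germany", "Berlin"), ("France", "Paris")],
    "noise": [("TXL", "taxi"), ("CDG", "card")],
}
examples = [("TXL", "Berlin"), ("CDG", "Paris")]
selected = retrieve_and_prune(corpus, examples)
print(selected)
```

Exact cell matching is used here only for brevity; a realistic system would need to tolerate noisy or partially matching values, which is the difficulty the abstract points to.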
Citation
Özmen, A., Esmailoghli, M. & Abedjan, Z. (2021). Combining Programming-by-Example with Transformation Discovery from large Databases. In: Sattler, K.-U., Herschel, M. & Lehner, W. (Eds.), BTW 2021. Gesellschaft für Informatik, Bonn. (pp. 313-324). DOI: 10.18420/btw2021-16
BibTeX
@inproceedings{mci/özmen2021,
author = {Özmen, Aslihan AND Esmailoghli, Mahdi AND Abedjan, Ziawasch},
title = {Combining Programming-by-Example with Transformation Discovery from large Databases},
booktitle = {BTW 2021},
year = {2021},
editor = {Kai-Uwe Sattler AND Melanie Herschel AND Wolfgang Lehner},
pages = {313--324},
doi = {10.18420/btw2021-16},
publisher = {Gesellschaft für Informatik},
address = {Bonn}
}
If no full text (PDF) is linked here, it may be available only in another digital library for various reasons (e.g., licensing or copyright). In that case, try accessing it via the linked DOI: 10.18420/btw2021-16
More Info
DOI: 10.18420/btw2021-16
ISBN: 978-3-88579-705-0
ISSN: 1617-5468
Date: 2021
Language: English (en)
