Logo des Repositoriums
 

Kurz erklärt: Measuring Data Changes in Data Engineering and their Impact on Explainability and Algorithm Fairness

dc.contributor.authorKlettke, Meike
dc.contributor.authorLutsch, Adrian
dc.contributor.authorStörl, Uta
dc.date.accessioned2022-01-27T13:27:55Z
dc.date.available2022-01-27T13:27:55Z
dc.date.issued2021
dc.description.abstractData engineering is an integral part of any data science and ML process. It consists of several subtasks that are performed to improve data quality and to transform data into a target format suitable for analysis. The quality and correctness of the data engineering steps is therefore important to ensure the quality of the overall process. In machine learning processes requirements such as fairness and explainability are essential. The answers to these must also be provided by the data engineering subtasks. In this article, we will show how these can be achieved by logging, monitoring and controlling the data changes in order to evaluate their correctness. However, since data preprocessing algorithms are part of any machine learning pipeline, they must obviously also guarantee that they do not produce data biases. In this article we will briefly introduce three classes of methods for measuring data changes in data engineering and present which research questions still remain unanswered in this area.de
dc.identifier.doi10.1007/s13222-021-00392-w
dc.identifier.pissn1610-1995
dc.identifier.urihttp://dx.doi.org/10.1007/s13222-021-00392-w
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/38047
dc.publisherSpringer
dc.relation.ispartofDatenbank-Spektrum: Vol. 21, No. 3
dc.relation.ispartofseriesDatenbank-Spektrum
dc.subjectData bias
dc.subjectData engineering pipelines
dc.subjectDegree of data changes
dc.subjectExplainability
dc.subjectReliability
dc.titleKurz erklärt: Measuring Data Changes in Data Engineering and their Impact on Explainability and Algorithm Fairnessde
dc.typeText/Journal Article
gi.citation.endPage249
gi.citation.startPage245

Dateien