
The Data Lake Architecture Framework

dc.contributor.author: Giebler, Corinna
dc.contributor.author: Gröger, Christoph
dc.contributor.author: Hoos, Eva
dc.contributor.author: Eichler, Rebecca
dc.contributor.author: Schwarz, Holger
dc.contributor.author: Mitschang, Bernhard
dc.contributor.editor: Kai-Uwe Sattler
dc.contributor.editor: Melanie Herschel
dc.contributor.editor: Wolfgang Lehner
dc.date.accessioned: 2021-03-16T07:57:10Z
dc.date.available: 2021-03-16T07:57:10Z
dc.date.issued: 2021
dc.description.abstract: During recent years, data lakes emerged as a way to manage large amounts of heterogeneous data for modern data analytics. Although various work on individual aspects of data lakes exists, there is no comprehensive data lake architecture yet. Concepts that describe themselves as a “data lake architecture” are only partial. In this work, we introduce the data lake architecture framework. It supports the definition of data lake architectures by defining nine architectural aspects, i.e., perspectives on a data lake, such as data storage or data modeling, and by exploring the interdependencies between these aspects. The included methodology helps to choose appropriate concepts to instantiate each aspect. To evaluate the framework, we use it to configure an exemplary data lake architecture for a real-world data lake implementation. This final assessment shows that our framework provides comprehensive guidance in the configuration of a data lake architecture. [en]
dc.identifier.doi: 10.18420/btw2021-19
dc.identifier.isbn: 978-3-88579-705-0
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/35802
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik, Bonn
dc.relation.ispartof: BTW 2021
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-311
dc.subject: Data Lake
dc.subject: Data Lake Architecture
dc.subject: Framework
dc.title: The Data Lake Architecture Framework [en]
gi.citation.endPage: 370
gi.citation.startPage: 351
gi.conference.date: 13.-17. September 2021
gi.conference.location: Dresden
gi.conference.sessiontitle: (Industrial) Use Cases & Applications

Files

Original bundle
Name: A4-1.pdf
Size: 449.22 KB
Format: Adobe Portable Document Format