Exploring Existing Tools for Managing Different Types of Research Data

dc.contributor.author: Freund, Adrian
dc.contributor.author: Hajiabadi, Hamideh
dc.contributor.author: Koziolek, Anne
dc.contributor.editor: Klein, Maike
dc.contributor.editor: Krupka, Daniel
dc.contributor.editor: Winter, Cornelia
dc.contributor.editor: Gergeleit, Martin
dc.contributor.editor: Martin, Ludger
dc.date.accessioned: 2024-10-21T18:24:19Z
dc.date.available: 2024-10-21T18:24:19Z
dc.date.issued: 2024
dc.description.abstract: Data management is important for the reproducibility of scientific research. One important aspect of data management is version control. In software development, version control tools like Git are commonly used to track source code changes and releases, reproduce earlier versions, find defects, and simplify their repair. In scientific research, scientists often have to manage large amounts of data, while also trying to achieve reproducibility of results and wanting to identify and repair defects in the data. Version control software like Git is specialized for managing source code and other textual files, making it often unsuitable for managing other types of data. This creates a need for version control tools specialized for dealing with research data. This paper establishes requirements for version control tools for research data and evaluates Git Large File Storage, Neptune, Pachyderm, DVC, and Snowflake according to those requirements. We found that none of the evaluated tools fulfill all of our requirements, but we still recommend DVC, Git LFS, and Pachyderm for the use cases they do support.
dc.identifier.doi: 10.18420/inf2024_189
dc.identifier.isbn: 978-3-88579-746-3
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/45168
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V.
dc.relation.ispartof: INFORMATIK 2024
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-352
dc.subject: Data management
dc.subject: Version control
dc.subject: Reproducibility
dc.subject: FRBR model
dc.title: Exploring Existing Tools for Managing Different Types of Research Data
dc.type: Text/Conference Paper
gi.citation.endPage: 2193
gi.citation.publisherPlace: Bonn
gi.citation.startPage: 2181
gi.conference.date: 24-26 September 2024
gi.conference.location: Wiesbaden
gi.conference.sessiontitle: Hochschule 2034 / HS2034

Files

Name: Freund_et_al_Exploring_Existing_Tools.pdf
Size: 368.53 KB
Format: Adobe Portable Document Format