Exploring Existing Tools for Managing Different Types of Research Data
dc.contributor.author | Freund, Adrian | |
dc.contributor.author | Hajiabadi, Hamideh | |
dc.contributor.author | Koziolek, Anne | |
dc.contributor.editor | Klein, Maike | |
dc.contributor.editor | Krupka, Daniel | |
dc.contributor.editor | Winter, Cornelia | |
dc.contributor.editor | Gergeleit, Martin | |
dc.contributor.editor | Martin, Ludger | |
dc.date.accessioned | 2024-10-21T18:24:19Z | |
dc.date.available | 2024-10-21T18:24:19Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Data management is important for the reproducibility of scientific research. One important aspect of data management is version control. In software development, version control tools like Git are commonly used to track source code changes and releases, reproduce earlier versions, find defects, and simplify their repair. In scientific research, scientists often have to manage large amounts of data, while also trying to achieve reproducibility of results and wanting to identify and repair defects in the data. Version control software like Git is specialized for managing source code and other textual files, making it often unsuitable for managing other types of data. This creates a need for version control tools specialized for dealing with research data. This paper establishes requirements for version control tools for research data and evaluates Git Large File Storage, Neptune, Pachyderm, DVC, and Snowflake according to those requirements. We found that none of the evaluated tools fulfill all of our requirements, but we still recommend DVC, Git LFS, and Pachyderm for the use cases they do support. | en |
dc.identifier.doi | 10.18420/inf2024_189 | |
dc.identifier.isbn | 978-3-88579-746-3 | |
dc.identifier.pissn | 1617-5468 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/45168 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | INFORMATIK 2024 | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Proceedings, Volume P-352 | |
dc.subject | Data management | |
dc.subject | Version control | |
dc.subject | Reproducibility | |
dc.subject | FRBR model | |
dc.title | Exploring Existing Tools for Managing Different Types of Research Data | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | 2193 | |
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 2181 | |
gi.conference.date | 24.-26. September 2024 | |
gi.conference.location | Wiesbaden | |
gi.conference.sessiontitle | Hochschule 2034 / HS2034 |
Files
Original bundle
- Name: Freund_et_al_Exploring_Existing_Tools.pdf
- Size: 368.53 KB
- Format: Adobe Portable Document Format