ATDLLMD: Acceptance test-driven LLM development

Faragó, David

dc.contributor.author	Faragó, David
dc.contributor.editor	Herrmann, Andrea
dc.date.accessioned	2024-07-26T10:37:42Z
dc.date.available	2024-07-26T10:37:42Z
dc.date.issued	2024
dc.description.abstract	The capabilities of Large Language Models (LLMs) have increased massively in recent years, making many new LLM-based applications possible. However, these applications also pose new challenges for LLM development. This article proposes an acceptance test-driven development (ATDD) style, named ATDLLMD, in which the LLM’s training and test sets are extended in each iteration with data obtained from validating the previous iteration’s LLM and the system around it. The validation phase thus supplies the additional or updated data for training and verifying the LLM. ATDLLMD is made possible by two major solutions: applying the CPMAI process, and applying our own verification tool, LM-Eval. Together they lead to a red-train-green cycle for LLM development, which resembles ATDD but integrates data science best practices.	en
dc.identifier.uri	https://dl.gi.de/handle/20.500.12116/44202
dc.language.iso	en
dc.pubPlace	Bonn
dc.publisher	Gesellschaft für Informatik e.V.
dc.relation.ispartof	Softwaretechnik-Trends Volume 44, Issue 2
dc.relation.ispartofseries	Softwaretechnik-Trends
dc.subject	Large Language Model
dc.subject	LLM
dc.subject	development process
dc.subject	test-first
dc.subject	LLM evaluation
dc.subject	LLM testing
dc.subject	data-centric AI
dc.subject	business-centric AI
dc.title	ATDLLMD: Acceptance test-driven LLM development	en
dc.type	Text/Conference Paper
mci.conference.date	15-16 February 2024
mci.conference.location	Gummersbach
mci.conference.sessiontitle	Treffen der GI-Fachgruppe Test, Analyse und Verifikation von Software (TAV 49)
mci.reference.pages	13-17

Files

Original bundle

Name: 2_TAV49_Farago.pdf
Size: 1 MB
Format: Adobe Portable Document Format

Collections

Softwaretechnik-Trends 44(2) - 2024