Computer-Assisted Short Answer Grading Using Large Language Models and Rubrics

Grading student answers and providing feedback are essential yet time-consuming tasks for educators. Recent advancements in Large Language Models (LLMs), including ChatGPT, Llama, and Mistral, have paved the way for automated support in this domain. This paper investigates the efficacy of instruction-following LLMs in adhering to predefined rubrics for evaluating student answers and delivering meaningful feedback. Leveraging the Mohler dataset and a custom German dataset, we evaluate various models, from commercial ones like ChatGPT to smaller open-source options like Llama, Mistral, and Command R. Additionally, we explore the impact of temperature parameters and techniques such as few-shot prompting. Surprisingly, while few-shot prompting enhances grading accuracy closer to ground truth, it introduces model inconsistency. Furthermore, some models exhibit non-deterministic behavior even at near-zero temperature settings. Our findings highlight the importance of rubrics in enhancing the interpretability of model outputs and fostering consistency in grading practices.

Metzler, Tim; Plöger, Paul G.; Hees, Jörn (2024): Computer-Assisted Short Answer Grading Using Large Language Models and Rubrics. INFORMATIK 2024. DOI: 10.18420/inf2024_121. Bonn: Gesellschaft für Informatik e.V.. PISSN: 1617-5468. ISBN: 978-3-88579-746-3. pp. 1383-1393. AI@WORK. Wiesbaden. 24.-26. September 2024

Schlagwörter

Natural Language Processing , Automatic Short Answer Grading , Large Language Models , Rubrics , Mistral , ChatGPT , Llama

DOI

10.18420/inf2024_121

Sammlungen

P352 - INFORMATIK 2024 - Lock in or log out? Wie digitale Souveränität gelingt

Komplettanzeige

Computer-Assisted Short Answer Grading Using Large Language Models and Rubrics

Volltext URI

Dokumententyp

Dateien

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen