Learning Normative Behaviour Through Automated Theorem Proving

Neufeld, Emery A.

Zeitschriftenartikel

Learning Normative Behaviour Through Automated Theorem Proving

Dokumententyp

Text/Journal Article

Datum

2024

Autor:innen

Neufeld, Emery A.

Quelle

KI - Künstliche Intelligenz: Vol. 38, No. 0

Verlag

Springer

Zusammenfassung

Reinforcement learning (RL) is a powerful tool for teaching agents goal-directed behaviour in stochastic environments, and many proposed applications involve adopting societal roles which have ethical, legal, or social norms attached to them. Though multiple approaches exist for teaching RL agents norm-compliant behaviour, there are limitations on what normative systems they can accommodate. In this paper we analyse and improve the techniques proposed for use with the Normative Supervisor (Neufeld, et al., 2021)—a module which uses conclusions gleaned from a defeasible deontic logic theorem prover to restrict the behaviour of RL agents. First, we propose a supplementary technique we call violation counting to broaden the range of normative systems we can learn from, thus covering normative conflicts and contrary-to-duty norms. Additionally, we propose an algorithm for constructing a “normative filter”, a function that can be used to implement the addressed techniques without requiring the theorem prover to be run at each step during training or operation, significantly decreasing the overall computational overhead of using the normative supervisor. In order to demonstrate these contributions, we use a computer game-based case study, and thereafter discuss remaining problems to be solved in the conclusion.

Neufeld, Emery A. (2024): Learning Normative Behaviour Through Automated Theorem Proving. KI - Künstliche Intelligenz: Vol. 38, No. 0. DOI: 10.1007/s13218-024-00844-x. Springer. ISSN: 1610-1987

Schlagwörter

Defeasible deontic logic , Ethical reinforcement learning , Theorem proving

DOI

10.1007/s13218-024-00844-x

Sammlungen

Künstliche Intelligenz 38(1-2) - August 2024

Komplettanzeige

Learning Normative Behaviour Through Automated Theorem Proving

Volltext URI

Dokumententyp

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen