Averaging rewards as a first approach towards Interpolated Experience Replay

Pilar von Pilchau, Wenzel

Conference paper

Document type

Text/Conference Paper

Files

paper11_03.pdf (275.51 KB)

Date

2019

Authors

Pilar von Pilchau, Wenzel

Source

INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge)

Organic Computing Doctoral Dissertation Colloquium

Publisher

Gesellschaft für Informatik e.V.

Abstract

Reinforcement learning, and deep reinforcement learning in particular, is a research area attracting growing attention. Interpolation is a mathematical method for estimating values at points where only neighboring samples are known, which makes it a promising extension of experience replay, a major component of many deep reinforcement learning methods. Interpolated experiences stored in the experience replay could speed up learning in the early phase and reduce the overall amount of exploration needed. A first approach, averaging rewards in a setting with an unstable transition function and very low exploration, is implemented and shows promising results that encourage further investigation.
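The abstract does not say how the rewards are averaged, so the following is only a minimal sketch of one possible reading: a replay buffer that keeps a running mean of the observed reward for each (state, action) pair and substitutes that mean when sampling. The class `AveragingReplayBuffer`, its method names, and the choice of (state, action) as the averaging key are assumptions made for illustration, not the author's implementation.

```python
import random
from collections import deque


class AveragingReplayBuffer:
    """Hypothetical replay buffer that averages rewards per (state, action).

    Assumption: states and actions are hashable (e.g. integers in a
    small discrete environment). This is an illustration only.
    """

    def __init__(self, capacity=10_000, seed=None):
        self.buffer = deque(maxlen=capacity)  # oldest transitions drop out first
        self.reward_sum = {}                  # (state, action) -> summed reward
        self.reward_count = {}                # (state, action) -> observations
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state, done):
        """Store a real transition and update the reward statistics."""
        self.buffer.append((state, action, reward, next_state, done))
        key = (state, action)
        self.reward_sum[key] = self.reward_sum.get(key, 0.0) + reward
        self.reward_count[key] = self.reward_count.get(key, 0) + 1

    def mean_reward(self, state, action):
        """Mean of all rewards observed so far for (state, action)."""
        key = (state, action)
        if key not in self.reward_count:
            raise KeyError(f"no reward observed for {key}")
        return self.reward_sum[key] / self.reward_count[key]

    def sample(self, batch_size):
        """Sample stored transitions, replacing each reward by its mean.

        Note: the statistics are not reduced when old transitions are
        evicted from the deque; a fuller version would handle that.
        """
        k = min(batch_size, len(self.buffer))
        batch = self.rng.sample(list(self.buffer), k)
        return [(s, a, self.mean_reward(s, a), s2, d)
                for (s, a, _r, s2, d) in batch]
```

With a stochastic transition function, the same (state, action) pair can yield different rewards on different visits; under the assumptions above, the sampled transitions would carry the mean of those rewards instead of a single noisy observation.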

Pilar von Pilchau, Wenzel (2019): Averaging rewards as a first approach towards Interpolated Experience Replay. INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge). DOI: 10.18420/inf2019_ws53. Bonn: Gesellschaft für Informatik e.V. PISSN: 1617-5468. ISBN: 978-3-88579-689-3. pp. 493-506. Organic Computing Doctoral Dissertation Colloquium. Kassel. 23-26 September 2019

Keywords

Experience Replay, Deep Q-Network, Deep Reinforcement Learning, Interpolation, Machine Learning, Organic Computing

DOI

10.18420/inf2019_ws53

Collections

P295 - INFORMATIK 2019 - 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge)
