Logo des Repositoriums
 
Konferenzbeitrag

Averaging rewards as a first approach towards Interpolated Experience Replay

Lade...
Vorschaubild

Volltext URI

Dokumententyp

Text/Conference Paper

Zusatzinformation

Datum

2019

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Verlag

Gesellschaft für Informatik e.V.

Zusammenfassung

Reinforcement learning and especially deep reinforcement learning are research areas which are getting more and more attention. The mathematical method of interpolation is used to get information of data points in an area where only neighboring samples are known and thus seems like a good expansion for the experience replay which is a major component of a variety of deep reinforcement learning methods. Interpolated experiences stored in the experience replay could speed up learning in the early phase and reduce the overall amount of exploration needed. A first approach of averaging rewards in a setting with unstable transition function and very low exploration is implemented and shows promising results that encourage further investigation.

Beschreibung

Pilar von Pilchau, Wenzel (2019): Averaging rewards as a first approach towards Interpolated Experience Replay. INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge). DOI: 10.18420/inf2019_ws53. Bonn: Gesellschaft für Informatik e.V.. PISSN: 1617-5468. ISBN: 978-3-88579-689-3. pp. 493-506. Organic Computing Doctoral Dissertation Colloquium. Kassel. 23.-26. September 2019

Zitierform

Tags