Auflistung nach Autor:in "Pilar von Pilchau, Wenzel"
1 - 1 von 1
Treffer pro Seite
Sortieroptionen
- KonferenzbeitragAveraging rewards as a first approach towards Interpolated Experience Replay(INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft (Workshop-Beiträge), 2019) Pilar von Pilchau, WenzelReinforcement learning and especially deep reinforcement learning are research areas which are getting more and more attention. The mathematical method of interpolation is used to get information of data points in an area where only neighboring samples are known and thus seems like a good expansion for the experience replay which is a major component of a variety of deep reinforcement learning methods. Interpolated experiences stored in the experience replay could speed up learning in the early phase and reduce the overall amount of exploration needed. A first approach of averaging rewards in a setting with unstable transition function and very low exploration is implemented and shows promising results that encourage further investigation.