Keras implementation of Upside Down Reinforcement Learning

Jan 19, 2022

keras_udrl

This implementation is meant to be as small as possible for educational purposes, so it is not as flexible as the authors’ PyTorch implementation. At least not yet…
The behavior function model assumes a gated (multiplicative) interaction between the state and the commands in the first few layers. This should be easy to change in Keras (have fun playing around).
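To make the gating idea concrete, here is a minimal sketch of what a gated behavior function could look like in Keras. It is not the repository's actual model: the function name `build_behavior_function`, the layer sizes, the activations, and the two-element command (desired return, desired horizon) are all assumptions chosen for illustration.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers


def build_behavior_function(state_dim, command_dim, n_actions, hidden=64):
    """Hypothetical gated behavior function: state and command embeddings
    are combined by element-wise multiplication, not by concatenation."""
    state_in = keras.Input(shape=(state_dim,), name="state")
    command_in = keras.Input(shape=(command_dim,), name="command")

    # Embed the state and the command separately (sizes are assumptions).
    state_emb = layers.Dense(hidden, activation="tanh")(state_in)
    command_gate = layers.Dense(hidden, activation="sigmoid")(command_in)

    # The gate: each state feature is scaled by a command-dependent factor.
    gated = layers.Multiply()([state_emb, command_gate])

    x = layers.Dense(hidden, activation="relu")(gated)
    logits = layers.Dense(n_actions, name="action_logits")(x)
    return keras.Model(inputs=[state_in, command_in], outputs=logits)
```

Swapping `layers.Multiply()` for `layers.Concatenate()` would turn this into a plain (non-gated) network, which is the kind of change the note above has in mind.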
Command scaling is hard-coded for this environment.
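For readers unfamiliar with the term, command scaling usually means multiplying the raw command values by fixed constants so the network sees inputs of moderate size. The sketch below only illustrates the idea: the function name `scale_command` and the default scale factors are hypothetical and are not the values used in the repository.

```python
def scale_command(desired_return, desired_horizon,
                  return_scale=0.02, horizon_scale=0.01):
    """Scale a raw (desired return, desired horizon) command.

    The default scale factors are placeholders (assumptions); suitable
    values depend on the reward range and episode length of the
    environment being used.
    """
    return (desired_return * return_scale, desired_horizon * horizon_scale)
```

Exposing the scale factors as parameters, as done here, is one way to avoid hard-coding them when moving to a different environment.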
Run with: python main.py
Tested with:

conda 4.10.3
tensorflow 2.6.0
gym 0.21.0

