Average-Reward Reinforcement Learning with Trust Region Methods

Jul 10, 2022 1 min read

Average-Reward PPO

TBD

References

@inproceedings{ma2021average-reward,
    title={Average-Reward Reinforcement Learning with Trust Region Methods},
    author={Ma, Xiaoteng and Tang, Xiaohang and Xia, Li and Yang, Jun and Zhao, Qianchuan},
    journal={International Joint Conferences on Artificial Intelligence},
    pages={2797--2803},
    year={2021}

Also, original implementation from the authors.

GitHub

View Github

reinforcement learning

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

Average-Reward Reinforcement Learning with Trust Region Methods

Average-Reward PPO

References

GitHub

John

Revealing Single Frame Bias for Video-and-Language Learning

Stock News Alert Project

Average-Reward PPO

References

GitHub

Revealing Single Frame Bias for Video-and-Language Learning

Stock News Alert Project

You might also like...