PPO PyTorch C++
This is an implementation of the proximal policy optimization (PPO) algorithm using the C++ API of PyTorch. It uses a simple TestEnvironment to test the algorithm. The environment in which the algorithm is tested is visualized below.
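For readers unfamiliar with PPO, the core of the algorithm is the clipped surrogate objective. The following is a minimal sketch in plain C++ (standard library only, no libtorch), not the code used in this repository. The function names `clipped_surrogate` and `ppo_policy_loss` and the parameter `epsilon` are hypothetical and chosen for illustration only.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Clipped surrogate term for one sample (hypothetical helper, for illustration).
//   ratio     : pi_new(a|s) / pi_old(a|s)
//   advantage : advantage estimate for the sampled action
//   epsilon   : clip range (0.2 is a commonly used value)
double clipped_surrogate(double ratio, double advantage, double epsilon) {
    assert(epsilon >= 0.0);
    // Restrict the probability ratio to [1 - epsilon, 1 + epsilon].
    const double clipped_ratio =
        std::max(1.0 - epsilon, std::min(ratio, 1.0 + epsilon));
    // Take the more pessimistic of the unclipped and clipped terms.
    return std::min(ratio * advantage, clipped_ratio * advantage);
}

// Policy loss for a batch: the negative mean of the clipped surrogate.
// The ratio is computed from log-probabilities for numerical stability.
double ppo_policy_loss(const std::vector<double>& new_log_probs,
                       const std::vector<double>& old_log_probs,
                       const std::vector<double>& advantages,
                       double epsilon) {
    assert(!advantages.empty());
    assert(new_log_probs.size() == advantages.size());
    assert(old_log_probs.size() == advantages.size());

    double sum = 0.0;
    for (std::size_t i = 0; i < advantages.size(); ++i) {
        const double ratio = std::exp(new_log_probs[i] - old_log_probs[i]);
        sum += clipped_surrogate(ratio, advantages[i], epsilon);
    }
    return -sum / static_cast<double>(advantages.size());
}
```

With a positive advantage, the clipping removes any incentive to push the ratio above `1 + epsilon`; with a negative advantage, it removes the incentive to push it below `1 - epsilon`. In a libtorch implementation the same arithmetic would be applied to tensors so that gradients can flow through the ratio.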
Build the project with CMake, pointing CMAKE_PREFIX_PATH at your libtorch installation:

    mkdir build
    cd build
    cmake -DCMAKE_PREFIX_PATH=/absolute/path/to/libtorch ..
    make

Run the executable with

    cd build
    ./train_ppo

It should produce output similar to that shown below.
Once trained, the algorithm can also be run in test mode. To do so, run

    cd build
    ./test_ppo

The results are saved to data/data.csv and can be visualized by running