Deep_RL_with_pytorch

Deep Reinforcement Learning with PyTorch.

  • [x] 1. Dynamic Programming (Updated: 13. 2. 2019)

    1. Conditional GAN

[Figure: cde_with_gan]

    2. Policy Iteration & Value Iteration

[Figure: pong_result]

  • [x] 3. Policy Based Methods (Updated: 23. 2. 2019)

    1. A2C

    2. PPO

[Figure: ppo_pong_result]

  • [x] 4. Off-policy Policy Based Methods (Updated: 10. 3. 2019)

    1. SAC

    2. SIL (with SAC, not with A2C or PPO)

[Figure: ssac_breakout_result]

[Figure: breakout_only_intrinsic]

[Figure: iqn_breakout_result]
