This is a reinforcement learning project aimed at testing VPG, DDPG, TRPO and PPO in Cartpole and the Bipedal Walker Environments.
This project uses specific algorithms or methods, e.g VPG, DDPG, TRPO and PPO to achieve specific tasks in the Open AI gym
Python 3
OpenAI Gym