Simulated target reaching tasks using the MuJoCo physics engine. The setup is adapted from [1] end-to-end learning setup for solving pixel-driven control of Jaco arm where learning is accomplished using Asynchronous Advantage Actor-Critic (A3C)[2] method with sparse rewards.
Run with python main.py <options>
.
Obtain a 30-day free trial on the MuJoCo website or free license if you are a student.
- @kaixhin for ACER
- @ikostrikov for pytorch-a3c