A case study on solving RL tasks posed as inference problems. Following [Levine 2018], we formulate our tasks as structured variational inference problems and solve them with Pyro, a probabilistic programming language.
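To make the "RL as inference" idea concrete, here is a minimal sketch for a one-step task, assuming the optimality-variable formulation from [Levine 2018], where p(O=1 | a) = exp(r(a)). It uses plain Python rather than Pyro and computes the posterior exactly instead of variationally. The function name and reward values are hypothetical and not taken from this project.

```python
import math


def optimality_posterior(rewards, prior=None):
    """Posterior over actions given optimality: p(a | O=1) is proportional to p(a) * exp(r(a)).

    Hypothetical helper for illustration only. Rewards are assumed to be <= 0
    so that exp(r(a)) is a valid probability.
    """
    n = len(rewards)
    if prior is None:
        prior = [1.0 / n] * n  # uniform action prior (an assumption)
    # Likelihood of the optimality variable: p(O=1 | a) = exp(r(a)).
    unnormalized = [p * math.exp(r) for p, r in zip(prior, rewards)]
    evidence = sum(unnormalized)  # p(O=1)
    return [u / evidence for u in unnormalized]


# Hypothetical rewards for three actions.
rewards = [-2.0, -1.0, 0.0]
posterior = optimality_posterior(rewards)
```

Conditioning on optimality shifts probability mass toward higher-reward actions. In sequential tasks the exact posterior is generally intractable, which is where variational inference comes in.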
For a more in-depth description of the project, please refer to the following documents:
pip3 install -r requirements.txt