This is the official research code for the paper Faccio et al. 2021: "Parameter-Based Value Functions".
Install the following dependencies (in a virtualenv preferably)
pip3 install wandb gym[all] mujoco_py>=2 torch==1.6.0 numpy
This code uses wandb for data logging and visualization
Change the default configuration in each file and run:
PSSVF:
python3 pssvf.py
PSVF:
python3 psvf.py
PAVF:
python3 pavf.py
Please, cite our paper if you use our code or reimplement our method:
@inproceedings{faccio2020parameter,
title={Parameter-Based Value Functions},
author={Faccio, Francesco and Kirsch, Louis and Schmidhuber, J{\"u}rgen},
booktitle={International Conference on Learning Representations},
year={2020}
}