Improving Exploration in SAC with Normalizing Flows Policies

This codebase was used to generate the results reported in the paper "Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies" by Patrick Nadeem Ward*¹², Ariella Smofsky*¹², and Avishek Joey Bose¹². INNF Workshop, ICML 2019.

* Equal contribution, ¹ McGill University, ² Mila
Correspondence to:
- Patrick Nadeem Ward <GitHub: NadeemWard, [email protected]>
- Ariella Smofsky <GitHub: asmoog, [email protected]>

Requirements

Run Experiments

Gaussian policy on Dense Gridworld environment with REINFORCE:

TODO

Gaussian policy on Sparse Gridworld environment with REINFORCE:

TODO

Gaussian policy on Dense Gridworld environment with reparametrization:

python main.py --namestr=G-S-DG-CG --make_cont_grid --batch_size=128 --replay_size=100000 --hidden_size=64 --num_steps=100000 --policy=Gaussian --smol --comet --dense_goals --silent

Gaussian policy on Sparse Gridworld environment with reparametrization:

python main.py --namestr=G-S-CG --make_cont_grid --batch_size=128 --replay_size=100000 --hidden_size=64 --num_steps=100000 --policy=Gaussian --smol --comet --silent

Normalizing Flow policy on Dense Gridworld environment:

TODO

Normalizing Flow policy on Sparse Gridworld environment:

TODO

To run an experiment with a different policy distribution, modify the --policy flag.
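
The two reparametrization commands above differ only in --namestr and --dense_goals, so a small launcher can make variations easier to script. The following is a minimal sketch, not part of the repository: build_command and COMMON_FLAGS are hypothetical names, and every flag is copied from the commands above. Only --policy=Gaussian is confirmed by this README; any other policy value is an assumption that should be checked against the argument parser in main.py.

```python
import shlex

# Flags shared by both documented reparametrization runs,
# copied verbatim from the commands in this README.
COMMON_FLAGS = [
    "--make_cont_grid",
    "--batch_size=128",
    "--replay_size=100000",
    "--hidden_size=64",
    "--num_steps=100000",
]


def build_command(namestr, policy="Gaussian", dense_goals=False):
    """Return the argv list for one run of main.py (hypothetical helper).

    Only policy="Gaussian" appears in the README; other values are
    assumptions and must match what main.py actually accepts.
    """
    cmd = [
        "python", "main.py", f"--namestr={namestr}", *COMMON_FLAGS,
        f"--policy={policy}", "--smol", "--comet",
    ]
    if dense_goals:
        # Dense Gridworld variant; omit this flag for the sparse environment.
        cmd.append("--dense_goals")
    cmd.append("--silent")
    return cmd


sparse_cmd = build_command("G-S-CG")
dense_cmd = build_command("G-S-DG-CG", dense_goals=True)

# Print the commands so they can be inspected or pasted into a shell.
print(shlex.join(sparse_cmd))
print(shlex.join(dense_cmd))
```

The returned list can be passed directly to subprocess.run from the directory that contains main.py. Keeping the command as a list rather than a single string avoids shell quoting issues if a run name ever contains spaces.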

References

The SAC implementation is based on PyTorch SAC.

Repository: joeybose/FloRL