[ci skip] Update README.md
araffin authored Feb 6, 2024
1 parent 61f6d63 commit 0f9163d
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -107,7 +107,7 @@ if __name__ == "__main__":

## Note about DroQ

-[(DroQ)](https://openreview.net/forum?id=xCVJMsPv3RT) is a special configuration of SAC.
+[DroQ](https://openreview.net/forum?id=xCVJMsPv3RT) is a special configuration of SAC.

To have the algorithm with the hyperparameters from the paper, you should use (using RL Zoo config format):
```yaml
@@ -120,7 +120,7 @@ HalfCheetah-v4:
policy_kwargs: "dict(dropout_rate=0.01, layer_norm=True)"
```
-and then `python train.py --algo sac --env HalfCheetah-v4 -P`
+and then using the RL Zoo script defined above: `python train.py --algo sac --env HalfCheetah-v4 -c droq.yml -P`.

We recommend playing with the `policy_delay` and `gradient_steps` parameters for better speed/efficiency.
Having a higher learning rate for the q-value function is also helpful: `qf_learning_rate: !!float 1e-3`.
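
As a rough sketch of the tuning advice above, the overrides could sit in the same `HalfCheetah-v4` entry of `droq.yml`. Only `qf_learning_rate` and `policy_kwargs` are taken from this README; the `gradient_steps` and `policy_delay` values are placeholders chosen for illustration, not the paper's settings, and should be tuned per task:

```yaml
HalfCheetah-v4:
  # Placeholder values (assumption): lower them for faster wall-clock
  # training, raise them for better sample efficiency.
  gradient_steps: 4
  policy_delay: 4
  # From the README: a higher learning rate for the q-value function.
  qf_learning_rate: !!float 1e-3
  # From the README: dropout and layer normalization.
  policy_kwargs: "dict(dropout_rate=0.01, layer_norm=True)"
```

The run command stays the same as above, since `-c droq.yml` picks up whatever the file contains.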