Merge branch 'master' into prioritized-experience-replay
araffin authored Jul 17, 2024
2 parents 150b09a + 1a69fc8 commit 5c0c79d
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions docs/guide/examples.rst
@@ -179,9 +179,9 @@ Multiprocessing with off-policy algorithms
 vec_env = make_vec_env("Pendulum-v0", n_envs=4, seed=0)
-# We collect 4 transitions per call to `ènv.step()`
-# and performs 2 gradient steps per call to `ènv.step()`
-# if gradient_steps=-1, then we would do 4 gradients steps per call to `ènv.step()`
+# We collect 4 transitions per call to `env.step()`
+# and performs 2 gradient steps per call to `env.step()`
+# if gradient_steps=-1, then we would do 4 gradients steps per call to `env.step()`
 model = SAC("MlpPolicy", vec_env, train_freq=1, gradient_steps=2, verbose=1)
 model.learn(total_timesteps=10_000)
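
The arithmetic in the comments of this hunk can be sketched in plain Python for the `train_freq=1` case shown. The helper name `updates_per_env_step` is hypothetical and not part of the library. It only mirrors what the comments state: each call to `env.step()` yields one transition per environment, and `gradient_steps=-1` means one gradient step per collected transition.

```python
from typing import Tuple


def updates_per_env_step(n_envs: int, gradient_steps: int) -> Tuple[int, int]:
    """Return (transitions collected, gradient steps) per call to env.step().

    Hypothetical helper for illustration only; assumes train_freq=1,
    as in the example in this hunk.
    """
    if n_envs < 1:
        raise ValueError("n_envs must be at least 1")
    if gradient_steps == 0 or gradient_steps < -1:
        raise ValueError("gradient_steps must be positive or -1")

    # One transition per environment for every call to env.step().
    transitions = n_envs
    # -1 is taken to mean "as many gradient steps as transitions collected".
    steps = transitions if gradient_steps == -1 else gradient_steps
    return transitions, steps


if __name__ == "__main__":
    print(updates_per_env_step(n_envs=4, gradient_steps=2))
    print(updates_per_env_step(n_envs=4, gradient_steps=-1))
```

With `n_envs=4`, the first call corresponds to the `gradient_steps=2` setting used in the example, and the second to the `gradient_steps=-1` case mentioned in the last comment line.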