From 1a69fc831414626cbbcf13343c6e78d9accb9104 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Quentin=20Gallou=C3=A9dec?= <45557362+qgallouedec@users.noreply.github.com> Date: Mon, 15 Jul 2024 23:57:24 +0200 Subject: [PATCH] Update examples.rst (#1969) --- docs/guide/examples.rst | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/guide/examples.rst b/docs/guide/examples.rst index 9f5423162..32158172b 100644 --- a/docs/guide/examples.rst +++ b/docs/guide/examples.rst @@ -179,9 +179,9 @@ Multiprocessing with off-policy algorithms vec_env = make_vec_env("Pendulum-v0", n_envs=4, seed=0) - # We collect 4 transitions per call to `ènv.step()` - # and performs 2 gradient steps per call to `ènv.step()` - # if gradient_steps=-1, then we would do 4 gradients steps per call to `ènv.step()` + # We collect 4 transitions per call to `env.step()` + # and performs 2 gradient steps per call to `env.step()` + # if gradient_steps=-1, then we would do 4 gradients steps per call to `env.step()` model = SAC("MlpPolicy", vec_env, train_freq=1, gradient_steps=2, verbose=1) model.learn(total_timesteps=10_000)