Stable-Baselines3 v2.1.0: Float64 actions, Gymnasium 0.29 support and bug fixes

araffin released this 20 Aug 12:13


SB3 Contrib (more algorithms): https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
RL Zoo3 (training framework): https://github.com/DLR-RM/rl-baselines3-zoo
Stable-Baselines Jax (SBX): https://github.com/araffin/sbx

To upgrade:

pip install stable_baselines3 sb3_contrib --upgrade

or simply (RL Zoo depends on SB3 and SB3 Contrib):

pip install rl_zoo3 --upgrade
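
After upgrading, it can be useful to confirm which versions actually ended up in the environment. The snippet below is a minimal sketch using only the standard library; the helper name `installed_versions` is hypothetical and not part of SB3, and the distribution names are assumed to match the ones used in the pip commands above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for illustration; not part of Stable-Baselines3.
    """
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


if __name__ == "__main__":
    # Distribution names taken from the pip commands in these notes.
    names = ["stable_baselines3", "sb3_contrib", "rl_zoo3"]
    for name, ver in installed_versions(names).items():
        print(f"{name}: {ver or 'not installed'}")
```

A missing package is reported as `None` rather than raising, so the check can run before RL Zoo or SB3 Contrib has been installed.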

Breaking Changes:

Removed Python 3.7 support
SB3 now requires PyTorch >= 1.13
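
The two requirements above can be checked before upgrading. The following is a rough sketch, not an official SB3 utility: the helper names are hypothetical, the Python floor of 3.8 is inferred from the removal of 3.7 support, and the version parsing only looks at the leading numeric part of the string (so local suffixes such as `+cu117` are ignored and pre-releases are treated like final releases).

```python
import re
import sys

MIN_PYTHON = (3, 8)   # assumption: 3.7 was removed, so 3.8 is the floor
MIN_TORCH = (1, 13)   # "PyTorch >= 1.13" from the notes above


def release_tuple(version_string):
    """Parse the leading numeric part, e.g. '1.13.1+cu117' -> (1, 13, 1)."""
    match = re.match(r"\d+(?:\.\d+)*", version_string)
    if match is None:
        raise ValueError(f"unrecognised version string: {version_string!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_requirements(python_version, torch_version):
    """Return True if both interpreter and PyTorch versions are new enough.

    python_version: tuple such as sys.version_info or (3, 11)
    torch_version:  string such as torch.__version__
    """
    python_ok = tuple(python_version[:2]) >= MIN_PYTHON
    torch_ok = release_tuple(torch_version)[:2] >= MIN_TORCH
    return python_ok and torch_ok


if __name__ == "__main__":
    try:
        import torch  # only needed for the live check
    except ImportError:
        print("PyTorch is not installed")
    else:
        ok = meets_requirements(sys.version_info, torch.__version__)
        print("compatible with SB3 v2.1.0" if ok else "upgrade Python or PyTorch first")
```

Keeping the comparison in a function that takes plain values makes it easy to test without PyTorch installed.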

New Features:

Added Python 3.11 support
Added Gymnasium 0.29 support (@pseudo-rnd-thoughts)

SB3-Contrib

Fixed MaskablePPO ignoring stats_window_size argument
Added Python 3.11 support

RL Zoo

Upgraded to Huggingface-SB3 >= 2.3
Added Python 3.11 support

Bug Fixes:

Relaxed a check in the logger that was causing issues on Windows with colorama
Fixed off-policy algorithms with continuous float64 actions (see #1145) (@tobirohrer)
Fixed env_checker.py warning messages for out of bounds in complex observation spaces (@Gabo-Tor)

Others:

Updated GitHub issue templates
Fixed typo in gym patch error message (@lukashass)
Refactored test_spaces.py tests

Documentation:

Fixed callback example (@BertrandDecoster)
Fixed policy network example (@kyle-he)
Added mobile-env as new community project (@stefanbschneider)
Added DeepNetSlice to community projects (@AlexPasqua)

Full Changelog: v2.0.0...v2.1.0

Contributors

pseudo-rnd-thoughts, kyle-he, and 6 other contributors
