SB3 v1.5.0: Bug fixes, early stopping callback

@araffin released this 25 Mar 13:57
· 254 commits to master since this release
30772aa

SB3 Contrib: https://github.com/Stable-Baselines-Team/stable-baselines3-contrib

Breaking Changes:

  • Switched minimum Gym version to 0.21.0.

New Features:

  • Added StopTrainingOnNoModelImprovement to callback collection (@caburu)
  • Made the length of keys and values in HumanOutputFormat configurable,
    depending on the desired maximum width of the output.
  • Allowed PPO to turn off advantage normalization (see PR #763) (@vwxyzjn)
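
The early stopping idea behind StopTrainingOnNoModelImprovement can be sketched in plain Python. This is only an illustration of the general technique, not the SB3 implementation: the class `NoImprovementStopper`, its method `should_continue`, and the helper `run` are hypothetical names, and the exact counting rules in the library may differ.

```python
from typing import List


class NoImprovementStopper:
    """Hypothetical helper: stop once the best mean reward has not improved
    for more than `max_no_improvement_evals` consecutive evaluations."""

    def __init__(self, max_no_improvement_evals: int, min_evals: int = 0) -> None:
        self.max_no_improvement_evals = max_no_improvement_evals
        self.min_evals = min_evals  # evaluations to wait before counting stalls
        self.best_mean_reward = float("-inf")
        self.no_improvement_evals = 0
        self.n_evals = 0

    def should_continue(self, mean_reward: float) -> bool:
        """Record one evaluation and return False when training should stop."""
        self.n_evals += 1
        improved = mean_reward > self.best_mean_reward
        if improved:
            self.best_mean_reward = mean_reward
        if self.n_evals <= self.min_evals:
            return True  # still in the warm-up phase, never stop here
        if improved:
            self.no_improvement_evals = 0
        else:
            self.no_improvement_evals += 1
        return self.no_improvement_evals <= self.max_no_improvement_evals


def run(rewards: List[float], max_no_improvement_evals: int, min_evals: int = 0) -> List[bool]:
    """Feed a sequence of evaluation rewards and collect the decisions."""
    stopper = NoImprovementStopper(max_no_improvement_evals, min_evals)
    return [stopper.should_continue(r) for r in rewards]
```

With `max_no_improvement_evals=2`, two evaluations in a row without a new best are tolerated and the third one ends training.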

SB3-Contrib

Bug Fixes:

  • Fixed a bug in VecMonitor. The monitor did not consider the info_keywords during stepping (@ScheiklP)
  • Fixed a bug in HumanOutputFormat. Distinct keys truncated to the same prefix would overwrite each other's values,
    so only one of them was output. This now raises an error (it should only affect a small fraction of use cases
    with very long keys).
  • Routed all nn.Module calls through the implicit call (module(x)) rather than an explicit forward(), as per PyTorch guidelines (@manuel-delverme)
  • Fixed a bug in VecNormalize where an error occurred when norm_obs was set to False for an environment with dictionary observations (@buoyancy99)
  • Set default env argument to None in HerReplayBuffer.sample (@qgallouedec)
  • Fixed batch_size typing in DQN (@qgallouedec)
  • Fixed sample normalization in DictReplayBuffer (@qgallouedec)
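
The HumanOutputFormat key collision described above can be reproduced with a small standalone sketch. It assumes keys longer than a maximum width are cut and suffixed with "..."; the functions `truncate` and `build_table` are hypothetical and only mimic the behaviour described in the note, not the library's code.

```python
from typing import Any, Dict


def truncate(text: str, max_length: int) -> str:
    """Shorten text to max_length characters, marking the cut with '...'."""
    if len(text) > max_length:
        return text[: max_length - 3] + "..."
    return text


def build_table(values: Dict[str, Any], max_length: int) -> Dict[str, Any]:
    """Map truncated keys to values, refusing to silently overwrite entries."""
    table: Dict[str, Any] = {}
    for key, value in values.items():
        short_key = truncate(key, max_length)
        if short_key in table:
            # Previously the later value would have replaced the earlier one.
            raise ValueError(
                f"Key '{key}' truncated to '{short_key}' collides with another key; "
                "consider increasing max_length."
            )
        table[short_key] = value
    return table
```

For example, `rollout/ep_rew_mean` and `rollout/ep_len_mean` both shorten to `rollout/e...` at a width of 12, so the sketch raises instead of dropping one of them; a larger width keeps both keys intact.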

Others:

  • Fixed pytest warnings
  • Removed the remove_time_limit_termination parameter from off-policy algorithms since it was dead code (@Gregwar)

Documentation:

  • Added doc on Hugging Face integration (@simoninithomas)
  • Added Furuta pendulum project to project list (@Armandpl)
  • Fixed indentation (2 spaces to 4 spaces) in the custom env documentation example (@Gautam-J)
  • Updated MlpExtractor docstring (@gianlucadecola)
  • Added explanation of the logger output
  • Updated the "Directly Accessing The Summary Writer" section in the TensorBoard integration docs (@xy9485)

Full Changelog: v1.4.0...v1.5.0