(v3.6.3) - Normalization calibration improvement and load_agent.py fix #451
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This patch introduces two minor changes in Sinergym:
Fixed an issue in one of the scripts included in the repository (this does not affect the tool's functionality as it is just an example/functional script). The script was loading the model but, instead of using it to evaluate the agent, it used random behavior. The issue originated from a bad commit I made in version 3.5.5 of the tool.
Improved the calibration-saving system when using the normalization wrapper for observations. For better security, calibration states are now saved episode by episode in their respective folders. Additionally, these calibrations are saved and overwritten in the root of the output folder to track the latest valid calibrations.
For DRL evaluations using
LoggerEvalCallback
, these calibrations are saved each time the best model is updated, ensuring improved evaluation quality when using normalization.The documentation has been updated with this information in the wrapper section.
Types of changes
Checklist:
autopep8
second level aggressive.isort
.cd docs && make spelling && make html
pass (required if documentation has been updated.)pytest tests/ -vv
pass. (required).pytype -d import-error sinergym/
pass. (required)Changelog: