(v3.6.3) - Normalization calibration improvement and load_agent.py fix #451

AlejandroCN7 · 2024-10-04T09:16:20Z

Description

This patch introduces two minor changes in Sinergym:

Fixed an issue in one of the scripts included in the repository (this does not affect the tool's functionality as it is just an example/functional script). The script was loading the model but, instead of using it to evaluate the agent, it used random behavior. The issue originated from a bad commit I made in version 3.5.5 of the tool.
Improved the calibration-saving system when using the normalization wrapper for observations. For better security, calibration states are now saved episode by episode in their respective folders. Additionally, these calibrations are saved and overwritten in the root of the output folder to track the latest valid calibrations.
For DRL evaluations using LoggerEvalCallback, these calibrations are saved each time the best model is updated, ensuring improved evaluation quality when using normalization.
The documentation has been updated with this information in the wrapper section.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Improvement (of an existing feature)
Others

I've read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests.
I have updated the documentation accordingly.
I have reformatted the code using autopep8 second level aggressive.
I have reformatted the code using isort.
I have ensured cd docs && make spelling && make html pass (required if documentation has been updated.)
I have ensured pytest tests/ -vv pass. (required).
I have ensured pytype -d import-error sinergym/ pass. (required)

scripts/eval/load_agent.py: fix bug in example script (random agent loop instead of using loaded model).
Normalization calibration save: latest calibration saved in rootoutput folder and intermediate states in episode folders.
LoggerEvalCallback: Normalization calibration saved in evaluation folder (if normalization wrapper is applied).
Documentation: Indicate this changes in normalization wrapper

…oop instead of using loaded model)

…t folder and intermediate states in episode folders

…der (if normalization wrapper is applied)

AlejandroCN7 added 5 commits October 4, 2024 09:01

scripts/eval/load_agent.py: fix bug in example script (random agent l…

f9003a5

…oop instead of using loaded model)

Normalization calibration save: latest calibration saved in rootoutpu…

594b4be

…t folder and intermediate states in episode folders

LoggerEvalCallback: Normalization calibration saved in evaluation fol…

b76fb01

…der (if normalization wrapper is applied)

Documentation: Indicate this changes in normalization wrapper

cf43106

Updated Sinergym version from 3.6.2 to 3.6.3

1a94493

AlejandroCN7 merged commit 6397357 into main Oct 4, 2024
6 checks passed

AlejandroCN7 deleted the feat/evaluate-model branch October 4, 2024 09:27