
[Question] Resume of training from saved model does not give similar result #326

Closed
kmsgnnew opened this issue Feb 22, 2021 · 3 comments

Labels
custom gym env (Issue related to Custom Gym Env) · more information needed (Please fill the issue template completely) · question (Further information is requested) · RTFM (Answer is the documentation)

Comments


kmsgnnew commented Feb 22, 2021

Question

Does resuming training from a saved model give similar results in Stable Baselines3?

Additional context

A model is trained for 10 epochs: it is saved at epoch 5, loaded with DQN.load(), and training is then continued with model.learn() for another 5 epochs. For me the resumed run does not match the uninterrupted one. Does it match for anyone, or is this still not supported?

Note: one epoch covers the entire dataset.

I have already seen comments on similar issues in stable-baselines:
hill-a/stable-baselines#692
hill-a/stable-baselines#301

@kmsgnnew kmsgnnew added the question Further information is requested label Feb 22, 2021
@kmsgnnew kmsgnnew changed the title [Question] Resume training from saved model [Question] Resume of training from saved model does not give similar result Feb 22, 2021
@araffin araffin added more information needed Please fill the issue template completely RTFM Answer is the documentation labels Feb 22, 2021
araffin (Member) commented Feb 22, 2021

Hello,

- have you saved the replay buffer? (cf. doc)
- have you made sure that the exploration factor (epsilon) starts at the same value it ended at?

It is hard to compare training policies; it is better to compare the deterministic ones, usually used for evaluation (cf. doc).

Please provide a minimal working example (cf. the issue template).

@araffin araffin added the custom gym env Issue related to Custom Gym Env label Feb 22, 2021
kmsgnnew (Author) commented Feb 22, 2021

Thanks, araffin, for the quick reply.

As I understand it, the following two changes should help:

1. "Have you saved the replay buffer?" Call save_replay_buffer() while training; when resuming, first load the model with DQN.load(), then load the buffer from the pickle file with load_replay_buffer().

2. "Have you made sure that the exploration factor (epsilon) starts at the same value it ended at?" To ensure this: save the dictionary returned by model.get_parameters(), then after DQN.load() call set_parameters() with the saved dictionary.

araffin (Member) commented Mar 5, 2021
