In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped. #10

wadx2019 · 2021-07-16T05:03:42Z

In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped and I suggest that you'd better use RMSprop as the optimizer and reduce the learning rate to make these RL model easier to converge.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped. #10

In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped. #10

wadx2019 commented Jul 16, 2021

In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped. #10

In PPO.ipynb, the position of action loss epoch and value loss epoch need to be swapped. #10

Comments

wadx2019 commented Jul 16, 2021