Skip to content

Commit

Permalink
add reference
Browse files Browse the repository at this point in the history
  • Loading branch information
WWWonderer committed Feb 18, 2024
1 parent 6fc88b2 commit 533d4c0
Showing 1 changed file with 3 additions and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -182,10 +182,12 @@ After 1000 episodes of training, we do have a reasonable taxi driver who is not
Your browser does not support the video tag.
</video>
</div>
<br>


Reference: [Reinforcement Learning - An Introduction][sutton_book] by Richard S. Sutton and Andrew G. Barto (chapters 3 and 6)


[sutton_book]: https://www.andrew.cmu.edu/course/10-703/textbook/BartoSutton.pdf
[convergence_SARSA]: https://link.springer.com/content/pdf/10.1023/A:1007678930559.pdf
[convergence_Qlearning]: https://arxiv.org/pdf/2108.02827.pdf
[gymnasium]: https://gymnasium.farama.org/index.html
Expand Down

0 comments on commit 533d4c0

Please sign in to comment.