This is a proof of concept on how MCTS can be implemented on top of TorchRL
- A Simple Alpha(Go) Zero Tutorial - Stanford
- Stanford CS234: Reinforcement Learning | Winter 2019 | Lecture 16 - Monte Carlo Tree Search
- Monte-Carlo Tree Search (MCTS) - Tim Miller
- Alpha Zero General - Github
- David Silver: Simulation-Based Search - Youtube
- Silver, David, Richard S. Sutton, and Martin Müller. "Temporal-difference search in computer Go."
- AlphaGo Papers:
- AlphaZero Papers:
- MuZero Papers: