# MuZero PyTorch

Implementation of *Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model* by DeepMind, applied to the CartPole-v0 environment.

MuZero + naive tree search is working.
MuZero + Monte Carlo tree search (MCTS) is now working.

Possible improvements: additional tricks/hacks to further stabilize MCTS training.

## MCTS results

*Figure: training_mcts (MCTS training curve)*

## Naive tree search results

The tree is fully expanded to depth n, and the node with the maximum discounted value (plus the discounted rewards accumulated along its path) is located. The chosen action is the first action on the path from the root to that node, as sketched below.
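A minimal sketch of this naive search, assuming a MuZero-style model that exposes hypothetical `initial_inference(observation)` and `recurrent_inference(state, action)` calls returning a latent state, a predicted reward, and a predicted value (names are illustrative, not necessarily this repository's exact API):

```python
import itertools
import random


class DummyMuZeroNet:
    """Toy stand-in for the learned model; replace with the repository's network.

    `initial_inference` maps an observation to a latent state (plus reward/value);
    `recurrent_inference` rolls the latent state forward by one action.
    """

    def initial_inference(self, observation):
        return observation, 0.0, 0.0          # latent state, reward, value

    def recurrent_inference(self, state, action):
        return state, random.random(), 0.0    # next latent state, reward, value


def naive_tree_search(network, observation, num_actions, depth, gamma=0.997):
    """Fully expand the tree to `depth` and return the first action on the path
    with the maximum discounted return (discounted rewards + discounted leaf value)."""
    root_state, _, _ = network.initial_inference(observation)

    best_return = float("-inf")
    best_first_action = 0

    # Enumerate every action sequence of length `depth` (the fully expanded tree).
    for actions in itertools.product(range(num_actions), repeat=depth):
        state = root_state
        discounted_return = 0.0
        discount = 1.0
        value = 0.0
        for action in actions:
            state, reward, value = network.recurrent_inference(state, action)
            discounted_return += discount * reward
            discount *= gamma
        # Add the discounted value predicted at the leaf of this path.
        discounted_return += discount * value
        if discounted_return > best_return:
            best_return = discounted_return
            best_first_action = actions[0]

    return best_first_action


if __name__ == "__main__":
    net = DummyMuZeroNet()
    action = naive_tree_search(net, observation=[0.0] * 4, num_actions=2, depth=4)
    print("chosen action:", action)
```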

*Figures: cartpole_naive_tree_search, training_naive_tree_search*