Skip to content

CynapticsAI/ML14_MarioAI_SOC23

Repository files navigation

Proximal Policy Optimization (PPO) for Gym Super Mario Bros

By using Proximal Policy Optimization (PPO) algorithm introduced in the paper Proximal Policy Optimization Algorithms paper, we've trained a Mario playing agent in gym environment. With the script, you can Test the model through python test.py. e.g. python test.py --world 1 --stage 2.

Note: If the render is too fast, append env.render() in test.py with time.sleep(0.5). It is recommended to use gym environment that supports old step-API as well.

P.s.: We shared our work of algorithms that we tried earlier in the report.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages