Proximal Policy Optimization (PPO) for Gym Super Mario Bros

Using the Proximal Policy Optimization (PPO) algorithm introduced in the paper Proximal Policy Optimization Algorithms, we trained a Mario-playing agent in a gym environment. You can test the trained model by running test.py, e.g. python test.py --world 1 --stage 2.
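For readers unfamiliar with PPO, the core of the algorithm is the clipped surrogate objective from the paper. The following is a minimal, dependency-free sketch of that objective and not the code in train.py; the function name clipped_surrogate and the default clip_eps=0.2 are assumptions made for illustration, and the clipping value actually used in this repository is not stated here.

```python
def clipped_surrogate(ratios, advantages, clip_eps=0.2):
    """Return the mean PPO clipped surrogate objective over a batch.

    ratios:     probability ratios pi_new(a|s) / pi_old(a|s), one per sample
    advantages: advantage estimates, one per sample
    clip_eps:   clipping range (0.2 is an assumed illustrative default)
    """
    if len(ratios) != len(advantages) or not ratios:
        raise ValueError("ratios and advantages must be non-empty and equal in length")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the ratio to the interval [1 - eps, 1 + eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)
```

Training maximizes this quantity (or minimizes its negative). Taking the minimum removes the incentive to move the new policy far from the old one within a single update.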

Note: If rendering is too fast, add time.sleep(0.5) right after the env.render() call in test.py. We also recommend using a gym version that still supports the old step API, in which env.step() returns four values (observation, reward, done, info).
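The note above can be sketched as follows. This is an illustrative loop under stated assumptions, not the contents of test.py: step_compat and play_episode are hypothetical helper names, policy is any callable mapping an observation to an action, and env is any object exposing gym-style reset(), step() and render() methods.

```python
import time


def step_compat(env, action):
    """Call env.step() and always return the old-style 4-tuple.

    The old step API returns (obs, reward, done, info); the newer one returns
    (obs, reward, terminated, truncated, info).
    """
    result = env.step(action)
    if len(result) == 5:
        obs, reward, terminated, truncated, info = result
        return obs, reward, bool(terminated or truncated), info
    obs, reward, done, info = result
    return obs, reward, bool(done), info


def play_episode(env, policy, delay=0.5, max_steps=1000):
    """Run one rendered episode, pausing `delay` seconds after each frame."""
    out = env.reset()
    # Newer gym versions return (obs, info) from reset(); older ones return obs.
    if isinstance(out, tuple) and len(out) == 2 and isinstance(out[1], dict):
        obs = out[0]
    else:
        obs = out

    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(obs)
        obs, reward, done, info = step_compat(env, action)
        total_reward += reward
        env.render()
        time.sleep(delay)  # slow the display down so the run is watchable
        if done:
            break
    return total_reward
```

Lowering delay speeds playback up again; a value of 0 disables the pause.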

P.S.: The report also describes the algorithms we tried earlier.

Repository contents:
demo_video
gym
results
scene
src
trained_models
Copy of presentation.pdf
README.md
index.html
plot.png
script.js
style.css
test.py
train.py

CynapticsAI/ML14_MarioAI_SOC23
