Skip to content

Minimal implementation of the PPO algorithm in Julia

Notifications You must be signed in to change notification settings

gladisor/PPO.jl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PPO.jl

A minimal implementation of the PPO algorithm written in the Julia programming language. Both actor and critic networks are simple MLPs. The value function is learned with the temporal difference bootstrap update.

PPO paper: https://arxiv.org/abs/1707.06347

About

Minimal implementation of the PPO algorithm in Julia

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages