LeonardoCordoba/reinforcement-learning-ic

reinforcement-learning-ic

Gastón Bujía - Leonardo Córdoba

This repo contains code written for a postgraduate Reinforcement Learning course at the University of Buenos Aires, taught by Matthieu Jonckheere.

The code is organized as follows:

  • recycling_robots: exercise in which Bellman's equation is used to solve the Recycling Robot problem (Sutton).

  • bandits: the traditional multi-armed bandit problem, solved using different algorithms (Sutton).

  • model_free: Monte Carlo, SARSA and Q-learning algorithms are used to solve a variation of the Perudo game.
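To give a flavour of the first exercise, here is a minimal sketch of value iteration on the Recycling Robot MDP, which repeatedly applies the Bellman optimality equation until the state values stop changing. It is not the code from the `recycling_robots` folder. The transition structure follows the textbook formulation, while the names (`TRANSITIONS`, `action_value`, `value_iteration`) and every numeric value (`ALPHA`, `BETA`, the rewards, `GAMMA`) are assumptions chosen for illustration.

```python
# Illustrative sketch only: numeric values below are assumed, not taken from the repo.
ALPHA, BETA = 0.9, 0.4      # assumed: prob. of keeping the battery level while searching
R_SEARCH, R_WAIT = 2.0, 1.0  # assumed: expected rewards for searching and waiting
R_RESCUE = -3.0              # penalty when the battery runs out and the robot is rescued
GAMMA = 0.9                  # assumed discount factor

# TRANSITIONS[state][action] -> list of (probability, next_state, reward)
TRANSITIONS = {
    "high": {
        "search": [(ALPHA, "high", R_SEARCH), (1 - ALPHA, "low", R_SEARCH)],
        "wait": [(1.0, "high", R_WAIT)],
    },
    "low": {
        "search": [(BETA, "low", R_SEARCH), (1 - BETA, "high", R_RESCUE)],
        "wait": [(1.0, "low", R_WAIT)],
        "recharge": [(1.0, "high", 0.0)],
    },
}


def action_value(state, action, values):
    """Expected return of taking `action` in `state`, then following `values`."""
    return sum(
        prob * (reward + GAMMA * values[next_state])
        for prob, next_state, reward in TRANSITIONS[state][action]
    )


def value_iteration(tol=1e-10):
    """Apply the Bellman optimality backup until the largest change is below `tol`."""
    values = {state: 0.0 for state in TRANSITIONS}
    while True:
        new_values = {
            state: max(action_value(state, a, values) for a in TRANSITIONS[state])
            for state in TRANSITIONS
        }
        delta = max(abs(new_values[s] - values[s]) for s in TRANSITIONS)
        values = new_values
        if delta < tol:
            break
    # Greedy policy with respect to the converged values.
    policy = {
        state: max(TRANSITIONS[state], key=lambda a: action_value(state, a, values))
        for state in TRANSITIONS
    }
    return values, policy


if __name__ == "__main__":
    values, policy = value_iteration()
    print(values)
    print(policy)
```

With these assumed parameters the greedy policy searches when the battery is high and recharges when it is low; other parameter choices can change that.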
