reinforcement-learning-ic

Gastón Bujía - Leonardo Córdoba

In this repo we uploaded some code related to a Reinforcement Learning postgraduate course at University of Buenos Aires, taught by Matthieu Jonckheere.

The code is organized as follows:

recycling_robots: exercise in which Bellman's equation is used to solve Recyling Robot problem (Sutton).
bandits: traditional multi-armed bandits problem solved using different algorithms (Sutton).
model_free: Montecarlo, SARSA and Q-learning algorithms are used to solve a variation of Perudo game.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
atari		atari
bandits		bandits
model_free		model_free
recycling_robots		recycling_robots
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

reinforcement-learning-ic

Gastón Bujía - Leonardo Córdoba

About

Releases

Packages

Contributors 2

Languages

LeonardoCordoba/reinforcement-learning-ic

Folders and files

Latest commit

History

Repository files navigation

reinforcement-learning-ic

Gastón Bujía - Leonardo Córdoba

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages