Reinforcement Learning - approximate method in Pacman

Introduction

This project experiments with several approximate reinforcement learning methods: approximate Q-learning, episodic semi-gradient SARSA, and true online SARSA. It is based on the Pacman projects developed at UC Berkeley.
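
As a rough illustration of how the first two methods differ, the sketch below applies a linear function approximation update with each bootstrap target. It is not taken from qlearningAgents.py: the function names, the feature names ("bias", "closest-food"), and the default alpha and gamma values are all assumptions chosen for the example.

```python
from collections import defaultdict


def q_value(weights, features):
    """Linear approximation: Q(s, a) = sum_i w_i * f_i(s, a)."""
    return sum(weights[name] * value for name, value in features.items())


def q_learning_update(weights, features, reward, next_q_values,
                      alpha=0.2, gamma=0.8):
    """Off-policy update: bootstrap from the best next action.

    next_q_values holds Q(s', a') for every legal a'; an empty list
    means s' is terminal, so the bootstrap term is 0.
    """
    best_next = max(next_q_values, default=0.0)
    td_error = reward + gamma * best_next - q_value(weights, features)
    for name, value in features.items():
        weights[name] += alpha * td_error * value
    return td_error


def sarsa_update(weights, features, reward, next_features,
                 alpha=0.2, gamma=0.8):
    """On-policy update: bootstrap from the action actually chosen next.

    next_features describes (s', a'); pass None when s' is terminal.
    """
    next_q = q_value(weights, next_features) if next_features is not None else 0.0
    td_error = reward + gamma * next_q - q_value(weights, features)
    for name, value in features.items():
        weights[name] += alpha * td_error * value
    return td_error


if __name__ == "__main__":
    # Hypothetical feature vector for one (state, action) pair.
    features = {"bias": 1.0, "closest-food": 0.5}
    weights = defaultdict(float)
    q_learning_update(weights, features, reward=10.0, next_q_values=[])
    print(dict(weights))
```

The only difference between the two updates is the target: Q-learning takes the maximum over next actions, while SARSA uses the value of the next action the agent actually selects.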

Directory Structure

-- ML_approximate_method
  - qlearningAgents.py : Contains the Q-learning, approximate Q-learning, episodic semi-gradient SARSA, and true online SARSA classes.
  - learningAgents.py : Contains the training and test control logic.

-- Analysis
  - Analysis.ipynb : Analyzes the algorithms by comparing their Pacman scores and execution times.
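
For readers unfamiliar with the true online SARSA class mentioned for qlearningAgents.py, here is a minimal sketch of the textbook true online SARSA(lambda) update with a dutch eligibility trace over sparse feature dictionaries. It is an assumption-laden illustration, not the repository's implementation: the class name TrueOnlineSarsaSketch, its method names, and the default hyperparameters are hypothetical.

```python
from collections import defaultdict


def dot(features, table):
    """Sparse dot product between a feature dict and a weight/trace table."""
    return sum(value * table.get(name, 0.0) for name, value in features.items())


class TrueOnlineSarsaSketch:
    """Hypothetical helper; hyperparameter defaults are illustrative only."""

    def __init__(self, alpha=0.1, gamma=0.9, lam=0.8):
        self.alpha, self.gamma, self.lam = alpha, gamma, lam
        self.weights = defaultdict(float)
        self.start_episode()

    def start_episode(self):
        # The trace and the previous Q estimate are reset every episode.
        self.trace = defaultdict(float)
        self.q_old = 0.0

    def update(self, features, reward, next_features):
        """Apply one step; pass next_features=None when the next state is terminal."""
        next_features = next_features or {}
        q = dot(features, self.weights)
        q_next = dot(next_features, self.weights)
        td_error = reward + self.gamma * q_next - q

        # Dutch trace: z <- gamma*lam*z + (1 - alpha*gamma*lam * z.x) * x
        decay = self.gamma * self.lam
        z_dot_x = dot(features, self.trace)  # uses the trace before decay
        for name in list(self.trace):
            self.trace[name] *= decay
        correction = 1.0 - self.alpha * decay * z_dot_x
        for name, value in features.items():
            self.trace[name] += correction * value

        # w <- w + alpha*(delta + Q - Q_old)*z - alpha*(Q - Q_old)*x
        q_diff = q - self.q_old
        for name, z in self.trace.items():
            self.weights[name] += self.alpha * (td_error + q_diff) * z
        for name, value in features.items():
            self.weights[name] -= self.alpha * q_diff * value

        self.q_old = q_next
        return td_error
```

Calling start_episode() at the beginning of every game keeps traces from one episode from leaking into the next.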

Executing

To run an algorithm:

python pacman.py -p ApproximateQAgent -a extractor=SimpleExtractor -x 50 -n 60 -l mediumGrid
- Algorithm
  - The command above runs ApproximateQAgent on the mediumGrid layout for 60 games in total (-n 60), of which the first 50 are training episodes (-x 50) and the remaining 10 are test games.
  - Approximate Q-learning algorithm => ApproximateQAgent
  - Episodic semi-gradient SARSA algorithm => SemiGradientSarsaAgent
  - True online SARSA algorithm => TrueOnlineSarsaAgent
- Environment
  - smallGrid
  - mediumGrid
  - mediumClassic
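
The agent and environment names listed above can be combined into command lines programmatically. The sketch below only builds and prints the commands; build_command and all_commands are hypothetical helpers, and it assumes (the README does not confirm this) that the two SARSA agents accept the same extractor argument as ApproximateQAgent.

```python
import itertools

AGENTS = ["ApproximateQAgent", "SemiGradientSarsaAgent", "TrueOnlineSarsaAgent"]
LAYOUTS = ["smallGrid", "mediumGrid", "mediumClassic"]


def build_command(agent, layout, training_games=50, test_games=10,
                  extractor="SimpleExtractor"):
    """Return the pacman.py argument list for one agent/layout pair.

    -x is the number of training games and -n is the total number of
    games, so -n must be training_games + test_games.
    """
    total_games = training_games + test_games
    return [
        "python", "pacman.py",
        "-p", agent,
        "-a", f"extractor={extractor}",  # assumed valid for every agent
        "-x", str(training_games),
        "-n", str(total_games),
        "-l", layout,
    ]


def all_commands():
    """Build one command per (agent, layout) combination."""
    return [build_command(agent, layout)
            for agent, layout in itertools.product(AGENTS, LAYOUTS)]


if __name__ == "__main__":
    for command in all_commands():
        print(" ".join(command))
```

Each printed line can be run from the directory that contains pacman.py, or the argument lists can be passed to subprocess.run directly.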

Running Analysis.ipynb : Place the result files from the Q-learning, episodic semi-gradient SARSA, and true online SARSA runs in the same directory as the notebook.

Output

The output directory contains the output files.

Modified by

Yeonjung LEE
