LinearRL - Python Implementation

This repository is a Python implementation of "Linear reinforcement learning in planning, grid fields, and cognitive control" by Piray and Daw (2021).

The detour task is the only task reproduced one-to-one as presented in the paper. For the other two tasks I drew inspiration from the great paper "Predictive representations can link model-based reinforcement learning to model-free mechanisms" by Russek et al. Although the latent learning and change-in-goal tasks are not presented this way in the original LinearRL paper, the equations I used come from the original paper and still hold up.

Author Note: Although I believe this to be a faithful reimplementation of the original paper, it is not an official implementation. For the official implementation, please visit this repository.

Introduction

The code converts the LinearRL framework introduced by Piray et al. into Python, and adapts it to work with environments that follow the structure of OpenAI's Gym reinforcement learning environments.
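At its core, LinearRL replaces iterative value computation with a closed-form linear solve: under a control cost relative to a default policy, the exponentiated values z = exp(v/λ) of the nonterminal states satisfy a linear system in the default transition matrix. A minimal sketch of that solve, assuming a tabular MDP split into nonterminal and terminal states (the function and variable names here are illustrative, not the repository's API):

```python
import numpy as np

def linear_rl_values(P, r, terminal, lam=1.0):
    """Closed-form LinearRL values for a tabular MDP.

    P        : (n, n) default-policy transition matrix over all states
    r        : (n,) reward vector
    terminal : (n,) boolean mask of terminal (absorbing) states
    lam      : control-cost temperature lambda
    """
    N = ~terminal
    # z at terminal states follows directly from their reward
    z_term = np.exp(r[terminal] / lam)
    # Linear system for nonterminal states:
    #   (diag(exp(-r_N/lam)) - P_NN) z_N = P_NT z_T
    L = np.diag(np.exp(-r[N] / lam)) - P[np.ix_(N, N)]
    z_nonterm = np.linalg.solve(L, P[np.ix_(N, terminal)] @ z_term)
    z = np.empty(len(r))
    z[N], z[terminal] = z_nonterm, z_term
    return lam * np.log(z)  # v = lambda * log(z)

# Two-step deterministic chain: 0 -> 1 -> goal, step reward -1, goal reward 0
P = np.array([[0., 1., 0.],
              [0., 0., 1.],
              [0., 0., 1.]])
r = np.array([-1., -1., 0.])
v = linear_rl_values(P, r, terminal=np.array([False, False, True]))
print(v)  # → approximately [-2., -1., 0.]
```

On this deterministic chain the solve recovers the expected costs-to-go exactly; with stochastic default dynamics the same system yields the soft-optimal values in a single matrix inversion.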

Although the code is structured to handle gym-like function calls, it will not be compatible with arbitrary environments. It has been specifically designed for the tabular environments I have created; see gym-env/ for more details.
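The gym-like pattern in question is the familiar reset/step loop. The sketch below shows that interface shape with a hypothetical stub environment (it is a stand-in for illustration, not one of the repository's environments, and uses the classic 4-tuple step return rather than gymnasium's 5-tuple):

```python
class TinyChainStub:
    """Hypothetical tabular environment with a gym-like interface."""

    def __init__(self, n_states=4):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        # Deterministic chain: any action moves one state to the right
        self.state = min(self.state + 1, self.n_states - 1)
        terminated = self.state == self.n_states - 1
        reward = 0.0 if terminated else -1.0
        return self.state, reward, terminated, {}

env = TinyChainStub()
obs = env.reset()
total, done = 0.0, False
while not done:
    obs, reward, done, info = env.step(0)
    total += reward
print(total)  # → -2.0
```

Any environment exposing reset and step in this shape, with a small discrete state space, matches the tabular setting the code targets.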

Conda & Gym Environments

Conda Environment

I recommend creating a conda environment for use with this repository, especially because we will be installing some custom-built environments. You can create the conda environment from the yml file I have provided:

conda env create -f env.yml

Gym Environments

Because we are using custom gym environments, you need to install them locally so that gymnasium recognizes them when we construct our environment variables. To install the environments, just run:

pip install -e gym-env

Usage

Notebooks

Examples of using the model can be found in the notebooks. There are three: one containing a version of the Tolman detour task, one containing a reward revaluation task, and one containing a change-of-goal task.

The notebook with the reward revaluation task can be found in src/linear-rl-reward.ipynb.
The notebook with the Tolman detour task can be found in src/linear-rl-det.ipynb.
The notebook with updating the reward of a terminal state (change of goal) can be found in src/linear-rl-change-goal.ipynb.
