
dynamics-learn

Working directory for my work on model-based reinforcement learning for novel robots, best suited to robots with high test costs and difficult-to-model dynamics. Contact: [email protected] Project website: https://sites.google.com/berkeley.edu/mbrl-quadrotor/

This repository works towards implementing, on real robots, many model-based approaches that have so far been shown in simulation. For the current state of the art in simulation, see this work from Prof. Sergey Levine's group: Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models.

Future implementations work towards controlled flight of the ionocraft (with a recent publication in Robotics and Automation Letters) and, down the line, transfer learning of dynamics on the Crazyflie 2.0 platform.

Some potentially notable implementations include:

  • a probabilistic neural network in PyTorch (see the first sketch after this list)
  • a Gaussian negative log-likelihood loss function for said probabilistic neural network
  • a random shooting MPC implementation with a customizable cost / reward function (see cousin repo: https://github.com/natolambert/ros-crazyflie-mbrl, and the second sketch after this list)
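As a reference for the first two items, here is a minimal sketch of a probabilistic dynamics network trained with a Gaussian negative log-likelihood loss. The names (ProbabilisticNN, gaussian_nll) and the dimensions are illustrative, not the repo's actual model_* code:

```python
# Sketch: a network that outputs a Gaussian over next-state deltas,
# trained by minimizing the negative log-likelihood of observed deltas.
import torch
import torch.nn as nn

class ProbabilisticNN(nn.Module):
    def __init__(self, state_dim, action_dim, hidden=100):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # Separate heads for the mean and log-variance of the prediction.
        self.mean = nn.Linear(hidden, state_dim)
        self.logvar = nn.Linear(hidden, state_dim)

    def forward(self, state, action):
        h = self.net(torch.cat([state, action], dim=-1))
        # Clamp log-variance for numerical stability during training.
        return self.mean(h), self.logvar(h).clamp(-10.0, 5.0)

def gaussian_nll(mean, logvar, target):
    """Negative log-likelihood of target under N(mean, exp(logvar))."""
    inv_var = torch.exp(-logvar)
    return 0.5 * ((target - mean) ** 2 * inv_var + logvar).mean()

# Usage: one gradient step on a batch of (s, a, s') transitions.
model = ProbabilisticNN(state_dim=12, action_dim=4)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
s, a, s_next = torch.randn(64, 12), torch.randn(64, 4), torch.randn(64, 12)
mean, logvar = model(s, a)
loss = gaussian_nll(mean, logvar, s_next - s)  # predict state deltas
opt.zero_grad()
loss.backward()
opt.step()
```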
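And a minimal sketch of random shooting MPC over such a model: sample many random action sequences, roll each out through the model's mean predictions, and execute the first action of the lowest-cost sequence. The cost function and action bounds are illustrative placeholders, and `model` reuses the ProbabilisticNN from the sketch above:

```python
import torch

def random_shooting_mpc(model, state, cost_fn, horizon=10, n_samples=500,
                        action_dim=4, act_low=-1.0, act_high=1.0):
    """Pick the first action of the lowest-cost random action sequence."""
    with torch.no_grad():
        # Sample candidate action sequences uniformly within bounds.
        actions = torch.empty(n_samples, horizon, action_dim)
        actions.uniform_(act_low, act_high)
        states = state.expand(n_samples, -1)   # broadcast current state
        total_cost = torch.zeros(n_samples)
        for t in range(horizon):
            mean, _ = model(states, actions[:, t])
            states = states + mean             # use the mean delta prediction
            total_cost += cost_fn(states, actions[:, t])
        best = torch.argmin(total_cost)
        return actions[best, 0]

# Example quadratic cost: penalize deviation from a hover state at the
# origin plus control effort (illustrative, not the repo's cost).
def hover_cost(states, actions):
    return (states ** 2).sum(dim=-1) + 0.1 * (actions ** 2).sum(dim=-1)

action = random_shooting_mpc(model, torch.zeros(1, 12), hover_cost)
```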

File Breakdown:

  • utils/ holds utility files for reading data (reach out for the dataset if you would like to collaborate), plotting tools, and some neural network functions that were written before being added to PyTorch.
  • execute_gen_PID and execute_train_nn are used to generate either a PID controller or a dynamics model from data. Expect another to be added for model-free policies.
  • gymenv's take a dynamics model and provide a wrapper for it to be used as a standard gym environment (see the sketch after this list).
  • model_'s are our various implementations of different dynamics models and policies.
  • pid contains the PID class, which is not currently used.
  • plot_'s contain plotting code used for papers and could be a good reference for plotting dynamics model predictions.
  • policy_'s contain implementations of different state-of-the-art model-free algorithms in coordination with rlkit (https://github.com/vitchyr/rlkit), but there is a strong chance we will branch from this.
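For reference, here is a minimal sketch of what a gym wrapper around a learned dynamics model can look like, assuming the ProbabilisticNN interface from the sketch above. The reward function, initial state, and episode length are illustrative placeholders, not the repo's gymenv code:

```python
import gym
import numpy as np
import torch

class LearnedDynamicsEnv(gym.Env):
    """Expose a learned dynamics model through the standard gym API."""

    def __init__(self, model, state_dim=12, action_dim=4, horizon=100):
        self.model = model
        self.horizon = horizon
        self.observation_space = gym.spaces.Box(
            -np.inf, np.inf, (state_dim,), np.float32)
        self.action_space = gym.spaces.Box(
            -1.0, 1.0, (action_dim,), np.float32)

    def reset(self):
        self.t = 0
        self.state = np.zeros(self.observation_space.shape, dtype=np.float32)
        return self.state

    def step(self, action):
        with torch.no_grad():
            s = torch.as_tensor(self.state).unsqueeze(0)
            a = torch.as_tensor(action, dtype=torch.float32).unsqueeze(0)
            mean, _ = self.model(s, a)             # predicted state delta
        self.state = self.state + mean.squeeze(0).numpy()
        self.t += 1
        reward = -float((self.state ** 2).sum())   # placeholder reward
        done = self.t >= self.horizon
        return self.state, reward, done, {}
```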

Requirements:

TBD; the code currently runs on PyTorch 0.4.1, but we plan to update to 1.0.

Future Implementations:

Feel free to request feature additions if you are interested in this line of work. Items in our pipeline include:

  • Implementation of a regularization term for training policies on a learned neural network dynamics model, as introduced here: http://proceedings.mlr.press/v80/parmas18a.html
  • Additional tools for training dynamics models on two-step predictions, an improved ensemble implementation, and simpler models
  • PID tuning with Gaussian processes (sketched below)
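A rough sketch of what GP-based PID tuning could look like, framed as a simple Bayesian-optimization loop with scikit-learn. Here evaluate_pid is a hypothetical stand-in for a function that runs a rollout with the given gains and returns a cost; none of this is in the repo yet:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def tune_pid(evaluate_pid, n_iters=20, n_candidates=1000, seed=0):
    """Fit a GP to (gains -> cost) and evaluate the best LCB candidate."""
    rng = np.random.default_rng(seed)
    X = rng.uniform(0.0, 10.0, size=(5, 3))        # initial (Kp, Ki, Kd)
    y = np.array([evaluate_pid(g) for g in X])     # measured rollout costs
    gp = GaussianProcessRegressor(normalize_y=True)
    for _ in range(n_iters):
        gp.fit(X, y)
        cand = rng.uniform(0.0, 10.0, size=(n_candidates, 3))
        mu, sigma = gp.predict(cand, return_std=True)
        best = cand[np.argmin(mu - 1.0 * sigma)]   # lower-confidence-bound
        X = np.vstack([X, best])
        y = np.append(y, evaluate_pid(best))
    return X[np.argmin(y)]                         # best gains found
```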

Literature Breakdown:

For the current state of the art, see K. Chua et al., as mentioned above. This paper covers the design choices between deterministic and probabilistic neural networks for learning dynamics, along with a discussion of ensemble learning. It then covers a new MPC technique, coined Trajectory Sampling, needed for systems with higher-dimensional state. Especially for our goal of implementing this on real robots, other recent papers that cover their own implementations can prove more useful, such as Bansal et al. learning trajectories on the Crazyflie or Nagabandi et al. with millirobots. A more theoretical framework for model-based learning includes the PILCO algorithm, among others; I will update this section with what I feel is relevant.

For some general reinforcement learning references, see the lecture series by DeepMind's David Silver, the Deep RL Bootcamp 2017 lectures from various researchers, the Deep Learning Book from Goodfellow et al. (MIT Press), or Berkeley's own Deep RL Course.
