XuanJing is a benchmark library of decision-making algorithms, covering reinforcement learning, imitation learning, multi-agent learning, and planning.
In both supervised learning and reinforcement learning, an algorithm consists of two main components: the data and the update formula. XuanJing abstracts these two parts so that reinforcement learning algorithms can be trained in the same way as supervised learning models, as sketched below.
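To make the analogy concrete, here is a minimal, self-contained sketch of this two-component view. The names (`train`, `random_batches`) are hypothetical illustrations, not XuanJing's actual API: the same generic loop works whether the data source is a fixed dataset (supervised learning) or a stream of agent-environment interactions (reinforcement learning).

```python
# Hypothetical sketch, NOT XuanJing's actual API: training reduces to
# "a data source + an update formula", for both SL and RL.
import random
from typing import Any, Callable, Iterable, Iterator


def train(data_source: Iterable[Any], update: Callable[[Any], float], steps: int) -> None:
    """Generic loop: pull a batch from the data source, apply the update rule."""
    it: Iterator[Any] = iter(data_source)
    for step in range(steps):
        batch = next(it)
        loss = update(batch)  # e.g. cross-entropy loss (SL) or a TD error (RL)
        if step % 100 == 0:
            print(f"step {step}: loss = {loss:.4f}")


def random_batches(batch_size: int = 8) -> Iterator[list]:
    """Toy data source standing in for a dataset or an experience stream."""
    while True:
        yield [random.random() for _ in range(batch_size)]


train(random_batches(), update=lambda b: sum(b) / len(b), steps=201)
```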
WIP. Not released yet.
Env is responsible for parallelizing and wrapping the environment. Interacting with the environment is the actor's task, and the data produced by that interaction is stored in the buffer (if needed). enhancement is used to enhance the data in the buffer, and the learner is in charge of managing the data and the algorithm: it updates the model parameters using the (enhanced) data. utils is a collection of useful helper functions. The sketch below shows how these pieces fit together.
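The following is a minimal, self-contained sketch of that pipeline. All class and function names here (`ToyEnv`, `Actor`, `Buffer`, `enhance`, `Learner`) are hypothetical stand-ins chosen for illustration; XuanJing's real interfaces may differ.

```python
# Hypothetical pipeline sketch, NOT XuanJing's actual API:
# env -> actor -> buffer -> enhancement -> learner.
import random


class ToyEnv:
    """Stand-in for a wrapped (and possibly parallelized) environment."""

    def reset(self) -> float:
        return 0.0

    def step(self, action: int) -> tuple[float, float]:
        return random.random(), 1.0 if action == 1 else 0.0


class Actor:
    """Interacts with the environment using the current policy (random here)."""

    def act(self, obs: float) -> int:
        return random.choice([0, 1])


class Buffer:
    """Stores the transitions produced by actor-environment interaction."""

    def __init__(self) -> None:
        self.data: list[tuple] = []

    def push(self, transition: tuple) -> None:
        self.data.append(transition)


def enhance(buffer: Buffer) -> list[tuple]:
    """Placeholder for data enhancement, e.g. normalization or relabeling."""
    return buffer.data[-32:]  # here: simply keep the most recent transitions


class Learner:
    """Consumes (enhanced) data and applies the algorithm's update formula."""

    def update(self, batch: list[tuple]) -> float:
        return sum(r for (_, _, r, _) in batch) / max(len(batch), 1)


env, actor, buffer, learner = ToyEnv(), Actor(), Buffer(), Learner()
obs = env.reset()
for step in range(200):
    action = actor.act(obs)
    next_obs, reward = env.step(action)
    buffer.push((obs, action, reward, next_obs))
    obs = next_obs
    if step % 50 == 49:  # periodically hand enhanced data to the learner
        print(f"step {step}: learner metric = {learner.update(enhance(buffer)):.3f}")
```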
TODO
TODO
Supported algorithms are as follows (a minimal sketch of the DQN update appears after the list):
- Deep Q-Network (DQN)
- Double DQN
- Dueling DQN
- Proximal Policy Optimization (PPO)
- Soft Actor-Critic (SAC)
- Cross-Entropy Method (CEM)
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning (ES)
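To give a flavor of what these algorithms implement, here is a minimal, self-contained sketch of the temporal-difference update at the core of DQN, the first algorithm in the list. It assumes PyTorch and uses toy networks and random data; it is not XuanJing's actual implementation.

```python
# Hypothetical DQN-style update sketch (assumes PyTorch is installed);
# NOT XuanJing's actual implementation.
import torch
import torch.nn as nn

q_net = nn.Linear(4, 2)       # toy Q-network: 4-dim state, 2 actions
target_net = nn.Linear(4, 2)  # periodically-synced target network
target_net.load_state_dict(q_net.state_dict())
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)
gamma = 0.99

# One gradient step on a dummy batch of transitions (s, a, r, s', done).
s = torch.randn(32, 4)
a = torch.randint(0, 2, (32, 1))
r = torch.randn(32, 1)
s_next = torch.randn(32, 4)
done = torch.zeros(32, 1)

q_sa = q_net(s).gather(1, a)  # Q(s, a) for the actions actually taken
with torch.no_grad():         # TD target: r + gamma * max_a' Q_target(s', a')
    td_target = r + gamma * (1 - done) * target_net(s_next).max(1, keepdim=True).values
loss = nn.functional.mse_loss(q_sa, td_target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```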
This project exists thanks to all the people who contribute.
MIT © tinyzqh
If you find XuanJing useful, please cite it in your publications.
@software{XuanJing,
  author       = {Zhiqiang He},
  title        = {XuanJing},
  year         = {2022},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/tinyzqh/XuanJing}},
}