GitHub - ali-sharifi-23/CustomMuJoCoEnviromentForRL: This is a very simple example of creating and training your own MuJoCo environment using RL algorithms through the Gymnasium.

How to create Gymnasium enviroment from your MuJoCo model?

This example shows how to create a simple custom MuJoCo model and train a reinforcement learning agent using the Gymnasium shell and algorithms from StableBaselines.

To reproduce the result you will need python packages MuJoCo, Gymnasium and StableBaselines3 with the appropriate versions:

pip install mujoco==3.1.4
pip install gymnasium==0.29.1
pip install stable-baselines3==2.3.0

Results of custom model

A simple ball balancing environment was implemented as a starting point. The observation consisted of 10 values (position and speed for rotating the platform along two axes and moving the ball along three axes). Each step the agent received a reward of 1 until he fell off the platform or the episode ended (500 steps by default). The Soft Actor Critic algorithm learned an acceptable policy in 150,000 steps.

What to change for your own environment?

The model in MJCF format is located in the assets folder. Place the xml file of your own model there. Configure it with the necessary joints and actuators.
By running the model_viewer.py file, you can test all actuators, as well as the physics of the model in general, in a convenient interactive mode. To do this, insert the name of your model file in this line:

model = mujoco.MjModel.from_xml_path("<PATH OF YOUR XML MODEL FILE>")

Create your own environment class similar to BallBalanceEnv.

In the __init__ method, replace the model path with your own, and insert your observation shape into observation_space (size of observation).
In the step method, define the reward for each step, as well as the condition for the end of the episode.
In the reset_model method, define what should happen at the beginning of each episode (when the model is reset). For example, initial states, adding noise, etc.
In the _get_obs method, return your observations, such as the velocities and coordinates of certain joints.

In the file learn.py, create an instance of your class (instead of BallBalanceEnv). Then you can choose a different algorithm or use your own, now your environment has all the qualities of the Gym environment.
In the file test.py you can test your agent by specifying the path to the model saved after training. You can also create a GIF from frames (commented code) :)

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
__pycache__		__pycache__
assets		assets
media		media
ppo_ball_balance_tensorboard		ppo_ball_balance_tensorboard
sac_ball_balance_tensorboard		sac_ball_balance_tensorboard
README.md		README.md
ball_balance1.gif		ball_balance1.gif
ball_balance_env.py		ball_balance_env.py
ball_balance_env1.py		ball_balance_env1.py
learn.py		learn.py
learn1.py		learn1.py
model_viewer.py		model_viewer.py
ppo_ball_balance.zip		ppo_ball_balance.zip
ppo_ball_balance1.gif		ppo_ball_balance1.gif
sac_ball_balance.zip		sac_ball_balance.zip
test.py		test.py
test1.py		test1.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How to create Gymnasium enviroment from your MuJoCo model?

Results of custom model

What to change for your own environment?

About

Releases

Packages

Languages

ali-sharifi-23/CustomMuJoCoEnviromentForRL

Folders and files

Latest commit

History

Repository files navigation

How to create Gymnasium enviroment from your MuJoCo model?

Results of custom model

What to change for your own environment?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages