ECE276C

This project, which is a final project of the ECE276C course in UCSD, aims to train a robot to juggle a table tennis ball.

Dependencies' installation guide

We assume the system is Ubuntu 18.04 and the Python version is 3.7.9 in this part.

MuJoCo

Get a trial license or a students license from https://www.roboti.us/license.html. An email with a mjkey.txt attachment will be sent to your mailbox.
Download MuJoCo 200 and unzip it to the ~/.mujoco/ folder. After unzipping, the folder hierarchy should be similar to the following one.
```
~/.mujoco
└── mujoco200
    ├── bin
    ├── doc
    ├── include
    ├── model
    └── sample
```

Copy the mjkey.txt to two folders: ~/.mujoco/ and ~/.mujoco/mujoco200/bin. After copying, the folder hierarchy should be as follows.

~/.mujoco
├── mjkey.txt
└── mujoco200
    ├── bin
    │   ├── mjkey.txt
    │   └── ...
    ├── doc
    ├── include
    ├── model
    └── sample

Append the following line to the end of ~/.bashrc.

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$HOME/.mujoco/mujoco200/bin

mujoco-py

Install some packages with apt-get.

sudo apt-get install libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf libglew-dev

Append the following line to the end of ~/.bashrc.

export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so

Install mujoco-py with pip. You can also install it with the other packages.

Other Python packages

I prepare a requirements.txt that specifies all the necessary Python packages with their versions fixed. You can install with the following command.

pip install -r requirements.txt

Verify installation

If everything is installed properly, you can test the JuggleEnv with the following demo code.

from juggle_env import JuggleEnv
import numpy as np 
import time

env = JuggleEnv()
ob = env.reset()
for _ in range(500):
    env.step(np.zeros((7, )))
    env.render()
    time.sleep(0.02)
env.close()

Environment design

The JuggleEnv is composed of two parts: an 7-dof IIWA robot and a ping-pong ball. We adopt the gym's API, and the details of the methods are explained below.

reset

During reset, two operation are taken.

Set the manipulator's joints back to the initial values with some noise.

The initial values are [0.0, 0.7, 0.0, -1.4, 0.0, -0.56, 0.0]. The noises on the 7 joints are Gaussian and independent, with mu=0 and sigma=0.02.
Place the ball randomly in a initial region.

The position of the ball is uniformly distributed in the AABB [[0.76, -0.06, 1.96], [0.84, 0.04, 2.04]].

The observation is an ordered dict containing the following numpy arrays.

"robot0_joint_pos"
"robot0_joint_vel"
"robot0_eef_pos"
"robot0_eef_quat"
"robot0_gripper_qpos"
"robot0_gripper_qvel"
"robot0_robot_state"
"pingpong_pos"

step

We use a joint velocity controller within the environment, so the input to the step function should be the velocity of each joints. The control frequency is 50Hz, and the maximum time span of an episode is 20s. The input should be in the range [-1, 1].

The reward consists of two parts: the state part and the control part. The state part is the distance on the x-y plane between the end-effector and ping-pong, added with a score that is achieved every time the ping-pong pass through the z=0.8 plane from downside to upside. The action part is to prevent the robot from taking rapid movement.

render & close

We provide two kinds of rendering: "human" and "rgb_array". You can use opencv to generate a video by concatanating all the images returned by env.render("rgb_array"). One defect of our rendering functions is that it pops a window regardless the type you choose.

Training result

The results are put in the data folder.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
juggle_env		juggle_env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert_format.sh		convert_format.sh
ddpg_juggle.py		ddpg_juggle.py
draw_curve.py		draw_curve.py
requirements.txt		requirements.txt
setup.cfg		setup.cfg
td3_juggle.py		td3_juggle.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ECE276C

Dependencies' installation guide

MuJoCo

mujoco-py

Other Python packages

Verify installation

Environment design

reset

step

render & close

Training result

About

Releases

Packages

Languages

License

JiangengDong/ECE276C

Folders and files

Latest commit

History

Repository files navigation

ECE276C

Dependencies' installation guide

MuJoCo

mujoco-py

Other Python packages

Verify installation

Environment design

reset

step

render & close

Training result

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages