Skip to content

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

License

Notifications You must be signed in to change notification settings

supersglzc/ddiffpg

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deep Diffusion Policy Gradient (DDiffPG)

This repository provides a PyTorch implementation of DDiffPG.


⚙️ Installation

Install DDiffPG

  1. Clone the package:

    git clone [email protected]:ddiffpg.git
    cd ddiffpg
  2. Create Conda environment and install dependencies:

    conda env create -f environment.yml
    conda activate ddiffpg
  3. Install MuJoCo 210.

  4. Install D4RL.

  5. Install panda-gym

  6. Install DDiffPG.

    pip install -e .

📜 Usage

✏️ Logging

We use Weights & Biases (W&B) for logging.

  1. Get a W&B account from https://wandb.ai/site

  2. Get your API key from https://wandb.ai/authorize

  3. set up your account in terminal

    export WANDB_API_KEY=$API Key$

💡 Train with DDiffPG

Run DDiffPG on AntMaze tasks.

python scripts/ddiffpg_main.py algo=ddiffpg_algo env.name=antmaze-v1

🔖 Baselines

Run DIPO baseline

python scripts/baselines_main.py algo=dipo_algo env.name=antmaze-v1

Run SAC baseline

python scripts/baselines_main.py algo=sac_algo env.name=antmaze-v1

💾 Saving and Loading

Checkpoints are automatically saved as W&B Artifacts.

About

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages