Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient

This is the code for implementing the M3DDPG (mmmaddpg) algorithm.

For Multi-Agent Particle Environments (MPE) installation, please refer to https://github.com/openai/multiagent-particle-envs

python train.py --scenario simple

--scenario: defines which environment in the MPE is to be used (default: "simple")
--max-episode-len maximum length of each episode for the environment (default: 25)
--num-episodes total number of training episodes (default: 60000)
--num-adversaries: number of adversaries in the environment (default: 0)
--good-policy: algorithm used for the 'good' (non adversary) policies in the environment (default: "maddpg"; options: {"mmmaddpg", "maddpg", "ddpg"})
--adv-policy: algorithm used for the adversary policies in the environment (default: "maddpg"; options: {"mmmaddpg", "maddpg", "ddpg"})

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
experiments		experiments
maddpg		maddpg
LICENSE.txt		LICENSE.txt
README.md		README.md
setup.py		setup.py

Provide feedback