Skip to content
/ VBC Public
forked from saizhang0218/VBC

pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

License

Notifications You must be signed in to change notification settings

jyericlin/VBC

 
 

Repository files navigation

Pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

  1. This code is developed based on the source code of QMIX paper, which is available at: https://github.com/oxwhirl/pymarl

  2. To run the code, please install the SMAC (StarCraft Multi-Agent Challenge) first, which is available at: https://github.com/oxwhirl/smac, please then follow the instructions to install the StarCraft II client. Download StarCraft II into the 3rdparty folder and copy the maps necessary to run over.

  3. The requirements.txt file can be used to install the necessary packages into a virtual environment.

  4. To run the code, use the following command: python3 src/main.py --config=xxx_xxx --env-config=sc2 with env_args.map_name=xxx

--config can be one of the following four options: vdn_6h_vs_8z,vdn_corridor,qmix_6h_vs_8z,qmix_corridor (corridor is 6z_vs_24zerg scenario). For example 'vdn_6h_vs_8z' means 6h_vs_8z map with VDN as the mixing network.

--env_args.map_name can be one of the following two options:6h_vs_8z,corridor (corridor is the 6z_vs_24zerg scenario)

  1. All the hyperparameters can be found at: src/config/default.yaml, src/config/algs/.yaml and src/config/envs/.yaml

  2. The execution of the agent network can be found at: src/controllers/basic_controller_xxx_vbc.py, where xxx is the name of the map. For example, the execution of the agent network for 6h_vs_8z can be found at: src/controllers/basic_controller_6h_vs_8z_vbc.py

  3. The training of the agent network can be found at: src/learners/q_learner_xxx_vbc.py, where xxx is the name of the map. For example, the training of the agent network for 6h_vs_8z can be found at: src/learners/q_learner_6h_vs_8z_vbc.py

  4. To load model, specifying the path of the saved model by filling in the checkpoint_path parameter in the default.yaml.

  5. The test accuracy will be saved in the 'xxx_accuracy_list.txt', where xxx is the local_results_path parameter in default.yaml.

  6. Communication overhead \beta will be saved in the 'xxx_comm_overhead.txt', where xxx is the local_results_path parameter in default.yaml.

About

pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 96.3%
  • Shell 1.9%
  • Dockerfile 1.8%