Skip to content

Latest commit

 

History

History
25 lines (13 loc) · 2.09 KB

README.md

File metadata and controls

25 lines (13 loc) · 2.09 KB

Pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"

  1. This code is developed based on the source code of QMIX paper, which is available at: https://github.com/oxwhirl/pymarl

  2. To run the code, please install the SMAC (StarCraft Multi-Agent Challenge) first, which is available at: https://github.com/oxwhirl/smac, please then follow the instructions to install the StarCraft II client. Download StarCraft II into the 3rdparty folder and copy the maps necessary to run over.

  3. The requirements.txt file can be used to install the necessary packages into a virtual environment.

  4. To run the code, use the following command: python3 src/main.py --config=xxx_xxx --env-config=sc2 with env_args.map_name=xxx

--config can be one of the following four options: vdn_6h_vs_8z,vdn_corridor,qmix_6h_vs_8z,qmix_corridor (corridor is 6z_vs_24zerg scenario). For example 'vdn_6h_vs_8z' means 6h_vs_8z map with VDN as the mixing network.

--env_args.map_name can be one of the following two options:6h_vs_8z,corridor (corridor is the 6z_vs_24zerg scenario)

  1. All the hyperparameters can be found at: src/config/default.yaml, src/config/algs/.yaml and src/config/envs/.yaml

  2. The execution of the agent network can be found at: src/controllers/basic_controller_xxx_vbc.py, where xxx is the name of the map. For example, the execution of the agent network for 6h_vs_8z can be found at: src/controllers/basic_controller_6h_vs_8z_vbc.py

  3. The training of the agent network can be found at: src/learners/q_learner_xxx_vbc.py, where xxx is the name of the map. For example, the training of the agent network for 6h_vs_8z can be found at: src/learners/q_learner_6h_vs_8z_vbc.py

  4. To load model, specifying the path of the saved model by filling in the checkpoint_path parameter in the default.yaml.

  5. The test accuracy will be saved in the 'xxx_accuracy_list.txt', where xxx is the local_results_path parameter in default.yaml.

  6. Communication overhead \beta will be saved in the 'xxx_comm_overhead.txt', where xxx is the local_results_path parameter in default.yaml.