Added the single agent PPO checkpoint

GAIGResearch · Dec 3, 2020 · 4539976 · 4539976
1 parent f6c4c79
commit 4539976
Show file tree

Hide file tree

Showing 10 changed files with 427 additions and 273 deletions.
diff --git a/examples/README.md b/examples/README.md
@@ -24,7 +24,7 @@ We also provided non-notebook versions of these guides, which contain less expla
 If you have any issues with running Malmo check the [FAQ](FAQ.md) as it might cover the issues.
 
 ## Baseline results
-Single-agent PPO
+**Single-agent PPO**
 
 We trained PPO in single and multi-agent setups on the Mob chases tasks. The tensorboard learning curves are shown below from a run of 1 million agent-env interactions. The checkpoint is available in the ```examples/checkpoints/``` package.
 ![Single Agent PPO learning curves](imgs/PPO_single_agent_mobchase.png)

diff --git a/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/.is_checkpoint b/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/.is_checkpoint
diff --git a/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/checkpoint-209 b/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/checkpoint-209
diff --git a/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/checkpoint-209.tune_metadata b/examples/checkpoints/PPO_malmo_single_agent/checkpoint_209/checkpoint-209.tune_metadata
diff --git a/examples/checkpoints/PPO_malmo_single_agent/events.out.tfevents.1606943053.nxg1 b/examples/checkpoints/PPO_malmo_single_agent/events.out.tfevents.1606943053.nxg1
diff --git a/examples/checkpoints/PPO_malmo_single_agent/params.json b/examples/checkpoints/PPO_malmo_single_agent/params.json
@@ -0,0 +1,7 @@
+{
+  "env": "malmo",
+  "ignore_worker_failures": true,
+  "lr": 5e-05,
+  "num_gpus": 1,
+  "num_workers": 6
+}
diff --git a/examples/checkpoints/PPO_malmo_single_agent/params.pkl b/examples/checkpoints/PPO_malmo_single_agent/params.pkl
diff --git a/examples/checkpoints/PPO_malmo_single_agent/progress.csv b/examples/checkpoints/PPO_malmo_single_agent/progress.csv
diff --git a/examples/checkpoints/PPO_malmo_single_agent/result.json b/examples/checkpoints/PPO_malmo_single_agent/result.json
diff --git a/examples/notebooks/rllib_evaluate_checkpoint.ipynb b/examples/notebooks/rllib_evaluate_checkpoint.ipynb