From 7fcbbd79077b5772b8a8c228ecd942bda96935c1 Mon Sep 17 00:00:00 2001
From: Jama Hussein Mohamud
Date: Thu, 6 Jun 2024 12:28:55 -0400
Subject: [PATCH] minor typo fix

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index f4337af85..7cf1553f6 100644
--- a/README.md
+++ b/README.md
@@ -134,7 +134,7 @@ So far, we've discussed how to manually set actions or use random actions in the
 As the agent interacts with the environment, it collects data about the outcomes of its actions. This data is used to train a policy network, which models the probability distribution of possible actions given the current state. Over time, the policy network learns to favor actions that lead to more successful outcomes with higher reward, optimizing the agent's performance.
 
-8. Sample a batch of trajectories from a trained agent
+9. Sample a batch of trajectories from a trained agent
 ```python
 batch, _ = gflownet.sample_batch(n_forward=3, train=False)