When I change the size to 12 (and set mode = "player"), the agent no longer learns. It always moves toward the borders, i.e., it keeps taking the action that moves it toward a border even when it is already at the border.
Is it because there is no penalty for such an action?
I added the following penalty and it is still not working:

pos = self.board.components['Player'].pos
if (pos[0] < 0 or pos[1] < 0 or
        pos[0] > self.board.size - 1 or pos[1] > self.board.size - 1):
    return -10
This is likely because, with a larger grid, it becomes increasingly rare for the agent to hit the goal by chance, and it needs to do so a few times before the algorithm can learn anything. When the grid is too large, a random walk will almost never reach the goal. To solve large grids with very sparse rewards you will have to implement some of the more advanced techniques, like curiosity, that are covered later in the book.
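You can see the effect directly with a quick Monte Carlo estimate. The sketch below is a hypothetical toy model, not the book's Gridworld code: the agent and goal sit in opposite corners, moves are clamped at the walls, and we count how often a purely random policy ever touches the goal within a step budget, for a 4x4 vs. a 12x12 board.

```python
import random

def random_hit_rate(size, episodes=2000, max_steps=50, seed=0):
    """Fraction of purely random episodes that reach the goal at all.

    Toy model (assumption, not the book's environment): agent starts at
    (0, 0), goal is at the opposite corner, and stepping into a wall
    leaves the agent in place.
    """
    rng = random.Random(seed)
    goal = (size - 1, size - 1)
    hits = 0
    for _ in range(episodes):
        x, y = 0, 0
        for _ in range(max_steps):
            dx, dy = rng.choice([(0, 1), (0, -1), (1, 0), (-1, 0)])
            # Clamp moves so the agent never leaves the board.
            x = min(max(x + dx, 0), size - 1)
            y = min(max(y + dy, 0), size - 1)
            if (x, y) == goal:
                hits += 1
                break
    return hits / episodes

print("4x4 :", random_hit_rate(4))
print("12x12:", random_hit_rate(12))
```

On a 4x4 board a random walk stumbles onto the goal in a sizable fraction of episodes, while on 12x12 the rate collapses toward zero, so the agent almost never sees a positive reward and has no gradient signal to learn from. The border penalty does not change this, because it says nothing about where the goal is.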