
Change size to 30 and agent is unable to reach the goal #2

Open
daniel-xion opened this issue Jun 16, 2022 · 5 comments



daniel-xion commented Jun 16, 2022

Hi, thanks for the code. I changed the grid size to 30 and it seems the agent is unable to learn to reach the goal:
[screenshot omitted]

It seems Q-learning is unable to handle large grids; is DQN needed to solve them?


qihongl commented Jun 16, 2022

Thanks for the data point, daniel!

I think the number of training epochs needs to be much, much larger, since the state space increased 36-fold: (30 × 30) / (5 × 5) = 36. Q-learning requires the agent, which starts out with an essentially random policy, to bump into the target many times. This "target visitation probability" drops off rapidly as the state space gets larger.
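To make that concrete, here's a quick back-of-the-envelope simulation (just a sketch, not code from this repo -- the corner start/goal, the four-move action set, and the 100-step episode cap are all assumptions on my part) that estimates how often a uniformly random policy reaches the goal within a single episode:

```python
import random

def goal_hit_prob(n, max_steps=100, n_episodes=2000, seed=0):
    """Monte Carlo estimate: how often does a uniformly random policy
    reach the goal within one episode on an n x n grid?
    Start (0, 0), goal (n-1, n-1), 100-step cap -- assumptions, not the repo's setup."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_episodes):
        x = y = 0
        for _ in range(max_steps):
            dx, dy = rng.choice([(1, 0), (-1, 0), (0, 1), (0, -1)])
            # clamp moves that would leave the grid
            x = min(max(x + dx, 0), n - 1)
            y = min(max(y + dy, 0), n - 1)
            if (x, y) == (n - 1, n - 1):
                hits += 1
                break
    return hits / n_episodes

for n in (5, 30):
    print(f"{n}x{n}: P(goal reached) ~ {goal_hit_prob(n):.3f}")
```

With these made-up settings, the 5 × 5 estimate comes out well above zero while the 30 × 30 estimate is essentially zero, so on the big grid the agent almost never receives the reward it needs to bootstrap from.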

@daniel-xion

Thanks for the prompt reply! For large grids, and to keep things scalable (i.e., a reasonable number of training episodes), do you think DQN or some other reinforcement learning method would work?


daniel-xion commented Jun 23, 2022

FYI, I tried the DQN from the following notebook and it is still not working. The agent keeps bumping into the walls (even with a grid size of only 12):
DeepReinforcementLearning/DeepReinforcementLearningInAction#38


qihongl commented Jun 24, 2022

Thanks for the data point -- that's really interesting. I'm not sure what the next simplest thing to try would be...

@daniel-xion

No problem. By the way, what do you think of using a transformer in RL to solve the maze game? For example, using the transformer to predict a sequence of actions that leads to a sequence of high rewards?
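What I have in mind is roughly return-conditioned action prediction, i.e. the Decision Transformer idea. Here's a minimal sketch of the shape of such a model (assuming PyTorch; the integer cell-id states, layer sizes, and all names are made up for illustration, not taken from either repo):

```python
import torch
import torch.nn as nn

class TinyActionTransformer(nn.Module):
    """Sketch of return-conditioned action prediction for a grid maze.
    States are integer cell ids; rtg is the desired return-to-go."""

    def __init__(self, n_states, n_actions, d_model=64, max_len=64):
        super().__init__()
        self.state_emb = nn.Embedding(n_states, d_model)
        self.rtg_proj = nn.Linear(1, d_model)          # return conditioning
        self.pos_emb = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, n_actions)

    def forward(self, states, rtg):
        # states: (B, T) ints; rtg: (B, T, 1) floats
        T = states.size(1)
        pos = torch.arange(T, device=states.device)
        x = self.state_emb(states) + self.rtg_proj(rtg) + self.pos_emb(pos)
        causal = torch.triu(torch.ones(T, T, dtype=torch.bool,
                                       device=states.device), diagonal=1)
        h = self.encoder(x, mask=causal)               # no peeking at the future
        return self.head(h)                            # (B, T, n_actions) logits

model = TinyActionTransformer(n_states=30 * 30, n_actions=4)
states = torch.randint(0, 900, (8, 20))                # fake logged trajectories
rtg = torch.rand(8, 20, 1)
logits = model(states, rtg)                            # (8, 20, 4)
```

Training would just be cross-entropy against the actions in logged trajectories; at rollout time you'd feed a high desired return-to-go plus the states seen so far and act on the last step's logits.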
