Replies: 3 comments 4 replies
-
Well, if I understand the rules of the game correctly, this is the best tactic to win - the agent will never lose. It might be that the game has no solution and the agent has worked out it can't win, and therefore simply tries not to lose in the simplest way. |
Beta Was this translation helpful? Give feedback.
-
Hi @callix2, are you still experiencing this issue? You may try to put breakpoint to MCTS search method. Add an int variable depth, which will be increased in every nested call of the search. Try to stop the program, when depth reaches more than 10 for example. |
Beta Was this translation helpful? Give feedback.
-
Hi @callix2 Maybe you should check how did the game end. Just print the result, and you may see more into this. Not only if 1 or -1 won, but also the reason. Maybe they win because opponent got to 100th move. Add this print-out here, so it will print you the reason after every game ended during Arena (or add it to Coach too if you want) If you experience 20/20 wins - is it only White winning? Maybe if first 20 games are won by White (1st model), and second 20 games also won by White (2nd model) - this may tell you that learning process is only good for White, and you have to change something. Keep in mind - as I just stated, models rotate after first half of the games. For ex. first model plays as white, and then plays 20 games as black. Be aware that sometimes when we ask about result for black, and sometimes for wait (player parameter for getGameEnded). |
Beta Was this translation helpful? Give feedback.
-
Hi @ALL,
i run into a infinite loop with the MCTS. In my game you lose if you have no more pieces in your reserve. You can get your pieces back from the board by placing them in a row of four. (This is also the way you remove pieces from your opponent, if he has pieces that form a direct extension of these 4). But if your own reserve is nearing its end, you can just stay alive with this tactic.
So the program puts four pieces one after the other into a row and gets them back again. Now the board looks exactly as it did four rounds before and the corresponding node in the search tree is used again ..again ..and again...
Do you guys have any ideas on how to avoid this behavior?
Beta Was this translation helpful? Give feedback.
All reactions