Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DeepCFR on Leduc Question #2

Open
rpSebastian opened this issue Nov 16, 2022 · 0 comments
Open

DeepCFR on Leduc Question #2

rpSebastian opened this issue Nov 16, 2022 · 0 comments

Comments

@rpSebastian
Copy link

Hi,

I am running DeepCFR_Leduc experiment. I use the same hyperparameters recommended by the configuration file deep_cfr_leduc.cfg. When the number of iterations reaches 100, the exploitability reaches 0.31, which is higher than the corresponding value 0.1 in the paper.

I find that the hyperparameters of kuhn poker and leduc poker are the same. The performance on kuhn poker roughly matches, but I am not able to replicate the performance on leduc poker. Are you using different hyperparameters on the leduc poker?

Do you have any idea why this would happen? It would be nice if you could provide the source code including hyperparameters and random seeds to reproduce the same result.

Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant