Deep Monte-Carlo (DMC) is a highly effective algorithm for card games. It is the only algorithm in the toolkit that has demonstrated human-level performance on complex games such as Dou Dizhu.
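The core idea of DMC can be summarized in a few lines. Below is a minimal sketch (hypothetical shapes and names, not the toolkit's actual classes): play a full episode, compute the Monte-Carlo return at every step, and regress a Q-network toward those returns.

```python
import torch
import torch.nn as nn

# Illustrative network: 8-dim state-action features -> scalar Q-value.
q_net = nn.Sequential(nn.Linear(8, 64), nn.ReLU(), nn.Linear(64, 1))
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)

def dmc_update(episode, gamma=1.0):
    """episode: list of (state_action_tensor, reward) pairs from one full game."""
    g = 0.0
    inputs, targets = [], []
    for sa, r in reversed(episode):
        g = r + gamma * g          # Monte-Carlo return from this step onward
        inputs.append(sa)
        targets.append(g)
    inputs = torch.stack(inputs)
    targets = torch.tensor(targets).unsqueeze(1)
    # Regress Q(s, a) toward the observed returns
    loss = nn.functional.mse_loss(q_net(inputs), targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```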
Deep Q-Learning (DQN) [paper] is a basic reinforcement learning (RL) algorithm. We wrap DQN as an example to show how RL algorithms can be connected to the environments. In the DQN agent, the following classes are implemented (a minimal sketch of how they fit together appears after the list):

*   `DQNAgent`: The agent class that interacts with the environment.
*   `Memory`: A memory buffer that manages the storing and sampling of transitions.
*   `Estimator`: The neural network that is used to make predictions.
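The class names above follow the toolkit's documentation, but the method signatures, network shape, and hyperparameters below are illustrative assumptions, not the toolkit's exact API:

```python
import random
from collections import deque, namedtuple

import torch
import torch.nn as nn

Transition = namedtuple('Transition', ['state', 'action', 'reward', 'next_state', 'done'])

class Memory:
    """Fixed-size buffer that stores and samples transitions."""
    def __init__(self, capacity=10000):
        self.buffer = deque(maxlen=capacity)

    def save(self, *args):
        self.buffer.append(Transition(*args))

    def sample(self, batch_size):
        return random.sample(self.buffer, batch_size)

class Estimator(nn.Module):
    """Neural network that predicts a Q-value for every action."""
    def __init__(self, state_dim, num_actions):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(),
                                 nn.Linear(64, num_actions))

    def forward(self, state):
        return self.net(state)

class DQNAgent:
    """Picks epsilon-greedy actions and learns from sampled transitions."""
    def __init__(self, state_dim, num_actions, epsilon=0.1, gamma=0.99):
        self.q = Estimator(state_dim, num_actions)
        self.memory = Memory()
        self.num_actions = num_actions
        self.epsilon, self.gamma = epsilon, gamma
        self.optimizer = torch.optim.Adam(self.q.parameters(), lr=1e-3)

    def step(self, state):
        # Epsilon-greedy exploration over the Estimator's Q-values
        if random.random() < self.epsilon:
            return random.randrange(self.num_actions)
        with torch.no_grad():
            return int(self.q(state).argmax())

    def train(self, batch_size=32):
        batch = self.memory.sample(batch_size)
        states = torch.stack([t.state for t in batch])
        next_states = torch.stack([t.next_state for t in batch])
        actions = torch.tensor([t.action for t in batch])
        rewards = torch.tensor([t.reward for t in batch], dtype=torch.float32)
        done = torch.tensor([t.done for t in batch], dtype=torch.float32)
        # One-step TD target: r + gamma * max_a' Q(s', a') on non-terminal steps
        target = rewards + self.gamma * (1 - done) * self.q(next_states).max(1).values
        pred = self.q(states).gather(1, actions.unsqueeze(1)).squeeze(1)
        loss = nn.functional.mse_loss(pred, target.detach())
        self.optimizer.zero_grad()
        loss.backward()
        self.optimizer.step()
```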
Neural Fictitious Self-Play (NFSP) [paper] is an end-to-end approach to solving card games with deep reinforcement learning. NFSP combines an inner RL agent with a supervised agent that is trained on the data generated by the RL agent. In the toolkit, we use DQN as the RL agent.
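A minimal sketch of the NFSP control flow (hypothetical names, not the toolkit's actual classes): with some anticipatory probability `eta` the agent acts with its inner RL best-response policy and logs that (state, action) pair as a supervised target; otherwise it acts with the average-policy network trained on those logged pairs.

```python
import random

import torch
import torch.nn as nn

class NFSPAgent:
    def __init__(self, rl_agent, state_dim, num_actions, eta=0.1):
        self.rl_agent = rl_agent            # e.g. the DQNAgent sketched above
        self.eta = eta
        self.avg_policy = nn.Sequential(nn.Linear(state_dim, 64), nn.ReLU(),
                                        nn.Linear(64, num_actions))
        self.optimizer = torch.optim.Adam(self.avg_policy.parameters(), lr=1e-3)
        # Buffer of best-response (state, action) pairs; NFSP uses reservoir
        # sampling here, simplified to a plain list for brevity.
        self.sl_buffer = []

    def step(self, state):
        if random.random() < self.eta:
            # Best-response mode: act with the RL agent and record the action
            # as a supervised target for the average policy.
            action = self.rl_agent.step(state)
            self.sl_buffer.append((state, action))
            return action
        # Average-policy mode: sample from the supervised network's output.
        with torch.no_grad():
            probs = torch.softmax(self.avg_policy(state), dim=-1)
            return int(torch.multinomial(probs, 1))

    def train_sl(self, batch_size=32):
        batch = random.sample(self.sl_buffer, batch_size)
        states = torch.stack([s for s, _ in batch])
        actions = torch.tensor([a for _, a in batch])
        # Cross-entropy pushes the average policy toward past best responses.
        loss = nn.functional.cross_entropy(self.avg_policy(states), actions)
        self.optimizer.zero_grad()
        loss.backward()
        self.optimizer.step()
```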
Counterfactual Regret Minimization (CFR) [paper] is a regret-minimization method for solving imperfect-information games.
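The regret-matching step at the heart of CFR is shown below as a tabular sketch (not the toolkit's implementation): each information set keeps cumulative counterfactual regrets, and the current strategy is the positive part of those regrets, normalized.

```python
import numpy as np

class InfoSetNode:
    def __init__(self, num_actions):
        self.regret_sum = np.zeros(num_actions)     # cumulative counterfactual regret
        self.strategy_sum = np.zeros(num_actions)   # accumulator for the average strategy

    def current_strategy(self):
        # Regret matching: play in proportion to positive cumulative regret,
        # falling back to uniform when no action has positive regret.
        positive = np.maximum(self.regret_sum, 0.0)
        total = positive.sum()
        if total > 0:
            return positive / total
        return np.full(len(self.regret_sum), 1.0 / len(self.regret_sum))

    def update(self, action_utilities, my_reach, opp_reach):
        """action_utilities: counterfactual value of each action at this info set;
        my_reach / opp_reach: reach probabilities of this player / the others."""
        strategy = self.current_strategy()
        node_utility = float(strategy @ action_utilities)
        # Regrets are weighted by the opponents' reach probability;
        # the average strategy by the player's own reach probability.
        self.regret_sum += opp_reach * (action_utilities - node_utility)
        self.strategy_sum += my_reach * strategy

    def average_strategy(self):
        # The average strategy over all iterations is what converges
        # to a Nash equilibrium in two-player zero-sum games.
        total = self.strategy_sum.sum()
        return self.strategy_sum / total if total > 0 else self.current_strategy()
```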