| code-thing      | status |
| --------------- | ------ |
| master          | [![Build Status](https://travis-ci.org/fedden/poker_ai.svg?branch=master)](https://travis-ci.org/fedden/poker_ai) |
| develop         | [![Build Status](https://travis-ci.org/fedden/poker_ai.svg?branch=develop)](https://travis-ci.org/fedden/poker_ai) |
| maintainability | [![Maintainability](https://api.codeclimate.com/v1/badges/c5a556dae097b809b4d9/maintainability)](https://codeclimate.com/github/fedden/poker_ai/maintainability) |
| coverage        | [![Test Coverage](https://api.codeclimate.com/v1/badges/c5a556dae097b809b4d9/test_coverage)](https://codeclimate.com/github/fedden/poker_ai/test_coverage) |
| license         | [![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0) |

# 🤖 Poker AI

_Made with love from the developers [Leon](https://www.leonfedden.co.uk) and Colin._
_A special thank you to [worldveil](https://github.com/worldveil) for originally writing [this awesome hand evaluator python2 module](https://github.com/worldveil/deuces), which was ported to python3 and [maintained here](https://github.com/fedden/poker_ai/tree/master/poker_ai/poker/evaluation)._

## Join the Community

[https://thepoker.ai](https://thepoker.ai)

## Prerequisites

This repository assumes Python 3.7 or newer is used.

## Installing

Either install from `pypi`:

```bash
python3 -m pip install poker_ai
```

Or if you want to dev on our code, install the Python package from source by cloning this repo and installing it with `pip`:

```bash
git clone https://github.com/fedden/poker_ai.git # Though really we should use ssh here!
cd /path/to/poker_ai
pip install .
```

We have a CLI that will be installed when you pip install the package. To get help on any option, just add the `--help` flag when invoking the CLI.

How to get a list of commands that can be run:

```bash
poker_ai --help
```

You will need to produce some lookup tables that cluster the various information sets. Here is more information on that:

```bash
poker_ai cluster --help
```
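
Under the hood, clustering is a form of information abstraction: many distinct card situations are mapped to a much smaller set of buckets, and the strategy is then learned over those buckets. The sketch below is purely illustrative and is not the poker_ai clustering pipeline; it assumes scikit-learn as an extra dependency and uses made-up equity histograms, but it shows the basic idea of bucketing similar situations with k-means:

```python
# Illustrative only: bucket "hands" with similar (fake) equity histograms
# together, so downstream training only has to reason about bucket ids.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(42)
# Pretend each row is a hand's estimated equity histogram over future outcomes.
equity_histograms = rng.random((1000, 10))
equity_histograms /= equity_histograms.sum(axis=1, keepdims=True)

kmeans = KMeans(n_clusters=50, n_init=10, random_state=42)
bucket_ids = kmeans.fit_predict(equity_histograms)
print(bucket_ids[:10])  # each hand is now represented by a small bucket id
```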

How to get information on training an agent:

```bash
poker_ai train start --help
```

How to get information on resuming training:

```bash
poker_ai train resume --help
```

Once you have an agent, and want to play against it, you can do the following:

```bash
poker_ai play --help
```

You'll save the combinations of public information in a file called `card_info_lut`.

### Train your bot

We use MCCFR to learn strategies. The MCCFR algorithm uses iterative self-play to adjust strategy based on regret.

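At the core of a CFR-style update is regret matching: the next strategy plays each action in proportion to its accumulated positive regret. The snippet below is a minimal, illustrative sketch of that single step; the function and variable names are made up and this is not poker_ai's internal API:

```python
# Illustrative only: turn accumulated regrets into a strategy.
from typing import Dict


def regret_matching(cumulative_regret: Dict[str, float]) -> Dict[str, float]:
    """Return action probabilities proportional to positive regret."""
    positive = {action: max(regret, 0.0) for action, regret in cumulative_regret.items()}
    total = sum(positive.values())
    if total > 0:
        return {action: regret / total for action, regret in positive.items()}
    # No positive regret yet: fall back to a uniform strategy.
    return {action: 1.0 / len(cumulative_regret) for action in cumulative_regret}


strategy = regret_matching({"fold": -1.0, "call": 2.0, "raise": 6.0})
# -> {"fold": 0.0, "call": 0.25, "raise": 0.75}
```

To start training:
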
```bash
poker_ai train start
```

Finally, you can play your bot with the following command:

```bash
poker_ai play
```

You'll create a `results.yaml` file in `~/.poker/`. So be sure to see how you stack up against your bot.
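
For example, a quick way to inspect those results from Python (this is just a convenience snippet; it only assumes the default location mentioned above):

```python
from pathlib import Path

# Print the results written out after playing against the bot.
results_path = Path.home() / ".poker" / "results.yaml"
print(results_path.read_text())
```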

## Running tests

We are working hard on testing our components, but contributions here are always welcome. You can run the tests by cloning the code, changing directory to this repository's root directory (i.e. `poker_ai/`) and calling the Python test library `pytest`:

```bash
cd /path/to/poker_ai
pip install pytest
pytest
```

See below on how to run the tests from the docker image.

## Building the docker image

We use a custom docker image for our testing suite.

You'll need to have computed the pickled card information lookup tables first (the `poker_ai cluster` command). We build the images as below; in this case the LUTs are in `./research/blueprint_algo`. First we build the parent image, with all of the dependencies.

```bash
docker build --build-arg LUT_DIR=research/blueprint_algo -f ParentDockerfile -t pokerai .
```

Then we build the test image.

```bash
docker build -t pokeraitest .
```

We can then run the tests with:

```bash
docker run -it pokeraitest pytest
```

This is just a note for the developers, but we can push the parent image to the registry with the following (please ensure the version tag that comes after the colon is correct). We want to do this because we need various dependencies for the remote tests, and Travis builds the `pokeraitest` image with the current git commit that has just been pushed.

```bash
docker tag pokerai pokerai/pokerai:1.0.0rc1
docker push pokerai/pokerai:1.0.0rc1
```

## Building documentation

Documentation is hosted, but you can build it yourself if you wish:

```bash
# Build the documentation.
cd /path/to/poker_ai/docs
make html
cd ./_build/html
# Run a webserver and navigate to localhost and the port (usually 8000) in your browser.
python -m http.server
```

## Repository Structure

Below is a rough structure of the codebase.

```sh
├── applications     # Larger applications like the state visualiser server.
├── paper            # Main source of info and documentation :)
├── assets           # Images used in the README.
├── bin
├── docs             # Documentation
├── poker_ai         # Main Python library.
│   ├── ai           # Stub functions for ai algorithms.
│   ├── cli          # Command line interface
│   ├── clustering   # Code for clustering similar situations
│   ├── games        # Implementations of poker games as node based objects
│   │                # that can be traversed in a depth-first recursive manner.
│   ├── poker        # WIP general code for managing a hand of poker.
│   ├── terminal     # Code to play against the AI from your console.
│   └── utils        # Utility code like seed setting.
├── research         # A directory for research/development scripts
│                    # to help formulate understanding and ideas.
└── test             # Python tests.
    ├── functional   # Functional tests that test multiple components
    │                # together.
    └── unit         # Individual tests for functions and objects.
```

To invoke the code, either call the `run_terminal_app` method directly from the `poker_ai.terminal.runner` module, or run that module from the command line as shown below:

```bash
cd /path/to/poker_ai/dir
python -m poker_ai.terminal.runner \
--agent offline \
--pickle_dir ./research/blueprint_algo \
--strategy_path ./research/blueprint_algo/offline_strategy_285800.gz
```

### Web visualisation code

We are also working on code to visualise a given instance of the `ShortDeckPokerState`, which looks like this:

<p align="center">
<img src="https://github.com/fedden/poker_ai-poker-AI/blob/develop/assets/visualisation.png">
</p>

It is so we can visualise the AI as it plays, and also debug particular situations visually. The idea, as it stands, is a live web-visualisation server similar to TensorBoard: you push your current poker game state to it, and this is reflected in the visualisation, so you can see what the agents are doing.

[_The frontend code is based on this codepen._](https://codepen.io/Rovak/pen/ExYeQar)

Here is an example of how you could plot the poker game state:

```python
from plot import PokerPlot
from poker_ai.games.short_deck.player import ShortDeckPokerPlayer
# ... further imports (e.g. ShortDeckPokerState) and setup such as n_players
# are not shown here ...

def get_state() -> ShortDeckPokerState:
    players = [
        ShortDeckPokerPlayer(...)  # constructor arguments not shown
        for player_i in range(n_players)
    ]
    return ShortDeckPokerState(
        players=players,
        pickle_dir="../../research/blueprint_algo/"
    )

# Construct the plot, build a state, and push it to the visualisation
# (assumed usage of the PokerPlot helper imported above).
pp = PokerPlot()
state = get_state()
pp.update_state(state)
```

There are two parts to this repository: the code to manage a game of poker, and the code to train an AI algorithm to play the game of poker. A low level thing to do first is to implement a poker engine class that can manage a game of poker.

The poker engine is implemented because it is useful to have a well-integrated poker environment available during the development of the AI algorithm, in case there are tweaks that must be made to accommodate things like the history of state or the replay of a scenario during MCCFR.

The following code shows how one might program a deterministic round of poker using the engine. This engine is now the first pass that will be used to support self-play.

```python
# (Imports and earlier setup are omitted here.)
big_blind_amount = 100

# Create the pot.
pot = Pot()
# Instantiate six players that will make random moves, make sure
# they can reference the pot so they can add chips to it.
players = [
    RandomPlayer(
        ...  # player arguments omitted
    )
    # ... remaining players omitted ...
]

# ... engine construction omitted ...
engine.play_one_round()
```

## Roadmap

The following todo will change dynamically as my understanding of the algorithms and the poker_ai project evolves.

At first, the goal is to prototype in Python as iteration will be much easier and quicker. Once there is a working prototype, write in a systems level language like C++ and optimise for performance.

### 1. Game engine iteration

_Implement a working multi-player heads up no limit poker game engine to support the self-play._

- [x] Lay down the foundation of game objects (player, card etc).
- [x] Add poker hand evaluation code to the engine.
- [x] Support a player going all in during betting.
- [x] Support a player going all in during payouts.
- [x] Lots of testing for various scenarios to ensure logic is working as expected.

### 2. AI iteration

_Iterate on the AI algorithms and the integration into the poker engine._

- [x] Integrate the AI strategy to support self-play in the multi-player poker game engine.
- [x] In the game-engine, allow the replay of any round of the current hand to support MCCFR.
- [x] Implement the creation of the blueprint strategy using MCCFR.
- [x] Add the real-time search for better strategies during the game.

### 3. Game engine iteration

_Strengthen the game engine with more tests and allow users to see live visualisation of game state._

- [x] Start work on a visualisation server to allow a game state to be displayed.
- [ ] Triple check that the rules are implemented in the poker engine as described in the supplementary material.
- [ ] Work through the coverage, adding more tests; you can never have enough.

## Contributing

This is an open effort and help, criticisms and ideas are all welcome.

First of all, please check out the [CONTRIBUTING](/CONTRIBUTING.md) guide.

Feel free to start a discussion on the github issues or to reach out to me at leonfedden at gmail dot com.

## License

The code is provided under the copy-left GPL licence. If you need it under a more permissive license then please contact me at leonfedden at gmail dot com.

## Stargazers over time

We appreciate you getting this far in the README file! If you like what we are doing, please give us a star and share with your friends!

[![Stargazers over time](https://starchart.cc/fedden/poker_ai.svg)](https://starchart.cc/fedden/poker_ai)