Factored action spaces? #79

Open
rdgain opened this issue Jun 3, 2020 · 0 comments
Labels: discussion (Further information and input is requested), enhancement (New feature or request)

Collaborator

rdgain commented Jun 3, 2020

Reading a paper on this, and considering this as a future development.

We could additionally provide the option to get from the game state a list of action "shells", i.e. all the action types available with no variables assigned (if there is more than one option for the variables in an action). Then, each action shell would provide:

  • a list of valid options for each variable
  • a way to correctly construct the action from one value per variable (generic, with no need to know the types of the variables; the normal constructors would stay public as well)

Example: a "Move token" action would provide:

  • options for variables:
    • token id: {57, 65, 80}
    • direction: {up, down, left, right}
  • a constructor that takes 2 integers, each an index into the corresponding variable's options array, and assigns the correct values
  • a validity check: the resulting action is valid if it exists in the full list of actions normally provided by the game state
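
A rough sketch of what the "Move token" shell could look like, written in Python for brevity. All names here (`ActionShell`, `MoveToken`, `build`, `is_valid`) are hypothetical and not part of any existing API, and the full action list is hard-coded as a stand-in for whatever the game state would normally return.

```python
from dataclasses import dataclass
from typing import Any, Callable, Sequence


@dataclass(frozen=True)
class MoveToken:
    """Hypothetical concrete action; its normal constructor stays public."""
    token_id: int
    direction: str


@dataclass(frozen=True)
class ActionShell:
    """Hypothetical action type with no variables assigned yet."""
    name: str
    options: Sequence[Sequence[Any]]   # one list of valid options per variable
    constructor: Callable[..., Any]    # builds the concrete action from values

    def build(self, *indices: int) -> Any:
        """Construct the action from one index per variable, without
        needing to know the variables' types."""
        if len(indices) != len(self.options):
            raise ValueError("expected one index per variable")
        values = [opts[i] for opts, i in zip(self.options, indices)]
        return self.constructor(*values)


def is_valid(action: Any, full_action_list: Sequence[Any]) -> bool:
    """Valid if the action appears in the full list from the game state."""
    return action in full_action_list


move_shell = ActionShell(
    name="Move token",
    options=[[57, 65, 80], ["up", "down", "left", "right"]],
    constructor=MoveToken,
)

# Assumed stand-in for the full list of actions the game state provides.
full_actions = [MoveToken(57, "up"), MoveToken(65, "left"), MoveToken(80, "down")]

action = move_shell.build(1, 2)  # index 1 -> token 65, index 2 -> "left"
print(action, is_valid(action, full_actions))
```

The shell only ever sees indices and option lists, so an agent can work with any action type the same way, while the membership check against the full list guards against combinations that are not legal in the current state.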

This could allow more granular/hierarchical planning.
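
To illustrate the granularity point with the "Move token" numbers above: a flat view has one decision over every joint assignment, while a factored view makes one smaller decision per variable. This is only a sketch under assumptions; `choose_factored` and the `pick` callback are hypothetical names, and a real planner would replace `pick` with its own selection policy.

```python
from itertools import product

# Variable options taken from the "Move token" example.
options = [[57, 65, 80], ["up", "down", "left", "right"]]

# Flat view: a single decision over every joint assignment.
flat = list(product(*options))

# Factored view: one decision per variable, each with a smaller branching factor.
branching_factored = [len(opts) for opts in options]


def choose_factored(options, pick):
    """Assign one variable at a time.

    pick(level, candidates) is a hypothetical policy that returns the index
    of the chosen candidate for the variable at the given level.
    """
    indices = [pick(level, candidates) for level, candidates in enumerate(options)]
    return tuple(opts[i] for opts, i in zip(options, indices))


# Trivial policy for demonstration: always take the first candidate.
choice = choose_factored(options, lambda level, candidates: 0)
print(len(flat), branching_factored, choice)
```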

rdgain added the enhancement and discussion labels on Jun 3, 2020