[Exercise 3.26] The expression of q* should not have max over the action a #11

Open
anonymous-pusher opened this issue Apr 4, 2022 · 0 comments

Hello,

I might be wrong, but since q* is a function of both the state s and the action a, I think it should not be equal to an expression that maximizes over the action a. The suggested solution looks more like the expression for v* in 3.19.
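
To make the distinction concrete, here is a sketch of the two expressions side by side. It assumes the exercise asks for q* in terms of v* and the four-argument dynamics function p, and uses the usual notation (discount factor \gamma, next state s', reward r), none of which is spelled out in this issue. The first line is the Bellman optimality equation for v*, where the max over a is appropriate because a is not an argument. The second is what I would expect for q*, where a is a free argument on the left-hand side and so no max over a should appear on the right.

```latex
\begin{align}
v_*(s)   &= \max_{a} \sum_{s', r} p(s', r \mid s, a)\,\bigl[ r + \gamma\, v_*(s') \bigr] \\
q_*(s,a) &= \sum_{s', r} p(s', r \mid s, a)\,\bigl[ r + \gamma\, v_*(s') \bigr]
\end{align}
```

Comparing the two gives v_*(s) = \max_a q_*(s,a), which is consistent with the max belonging to v* rather than to q*.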

Thanks and keep up the good work!
