You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I might be wrong, but I think that since q* is a function of the state s and action a, it should not be equal a term that is maximizing over the action a. The suggested solution looks more like the expression of v* in 3.19.
Thanks and keep up the good work !
The text was updated successfully, but these errors were encountered:
Hello,
I might be wrong, but I think that since q* is a function of the state s and action a, it should not be equal a term that is maximizing over the action a. The suggested solution looks more like the expression of v* in 3.19.
Thanks and keep up the good work !
The text was updated successfully, but these errors were encountered: