Evaluating the reward function for arbitrary states and actions #326
Unanswered
jamesheald
asked this question in
Q&A
Replies: 1 comment 7 replies
-
If you're using the Env API, the reward would be stored in the State: Line 36 in b373f5a and if you want Transitions, you could use the actor_step function: Lines 31 to 51 in b373f5a |
Beta Was this translation helpful? Give feedback.
7 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm implementing model predictive control and I want to be able to evaluate the reward associated with a rollout performed under a learned model. Is there a generic way (that works for any environment) to evaluate the reward associated with a state-action(-next state, if applicable) tuple?
Beta Was this translation helpful? Give feedback.
All reactions