You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have not figured out why this is happening yet. The .copy() is the orange curve that actually trains. This is just on picknplace with the fetch robotics task.
I'm using a previous version of SAC (with separate Q and V value function) but noticed something strange.
there is this line in twin_sac.py:
self.target_vf = vf.copy()
Changing it to a deepcopy results in very different training curves (see curves attached):
vf.copy() is defined as such:
It looks like it should do essentially the same thing as deepcopy, so what's causing the difference..
The text was updated successfully, but these errors were encountered: