Copy vs Deepcopy in SAC #74

richardrl · 2019-08-09T02:13:48Z

I'm using a previous version of SAC (with separate Q and V value function) but noticed something strange.

there is this line in twin_sac.py:
self.target_vf = vf.copy()

Changing it to a deepcopy results in very different training curves (see curves attached):

        from copy import deepcopy
        self.target_vf = deepcopy(vf)

vf.copy() is defined as such:

  def copy(self):
       copy = Serializable.clone(self)
       ptu.copy_model_params_from_to(self, copy)
       return copy

It looks like it should do essentially the same thing as deepcopy, so what's causing the difference..

The text was updated successfully, but these errors were encountered:

vitchyr · 2019-08-10T00:20:51Z

Yeah, that's really odd. Can you check if deepcopy copies the weight values as well?

vitchyr · 2019-09-06T06:57:30Z

@richardrl Did you ever get around to seeing what's going on? Also, which curve is which?

richardrl · 2019-10-22T21:44:55Z

I have not figured out why this is happening yet. The .copy() is the orange curve that actually trains. This is just on picknplace with the fetch robotics task.

vitchyr · 2019-10-26T12:55:41Z

Did you check if deepcopy copies the weights over (as reference or as value)?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copy vs Deepcopy in SAC #74

Copy vs Deepcopy in SAC #74

richardrl commented Aug 9, 2019

vitchyr commented Aug 10, 2019

vitchyr commented Sep 6, 2019

richardrl commented Oct 22, 2019

vitchyr commented Oct 26, 2019

Copy vs Deepcopy in SAC #74

Copy vs Deepcopy in SAC #74

Comments

richardrl commented Aug 9, 2019

vitchyr commented Aug 10, 2019

vitchyr commented Sep 6, 2019

richardrl commented Oct 22, 2019

vitchyr commented Oct 26, 2019