
Not normalizing the expert states when computing gradient penalty. #27

Open
EGalahad opened this issue Sep 27, 2024 · 3 comments

@EGalahad

```python
grad_pen_loss = self.discriminator.compute_grad_pen(
```

When state normalization is enabled, the `sample_amp_expert` tuple is not normalized before being passed to `compute_grad_pen`.
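For reference, here is a minimal sketch of what I mean. The `EmpiricalNormalization` class and the variable names are illustrative stand-ins, not the repo's actual code; the point is that the policy and expert AMP observations should go through the *same* running statistics before they reach the discriminator / gradient penalty:

```python
import numpy as np

class EmpiricalNormalization:
    """Toy running mean/std normalizer (stand-in for the repo's amp_normalizer)."""
    def __init__(self, shape, eps=1e-8):
        self.mean = np.zeros(shape)
        self.var = np.ones(shape)
        self.count = eps

    def update(self, batch):
        # Batched (Welford-style) update of the running mean and variance.
        b_mean = batch.mean(axis=0)
        b_var = batch.var(axis=0)
        b_count = batch.shape[0]
        delta = b_mean - self.mean
        tot = self.count + b_count
        self.mean = self.mean + delta * b_count / tot
        m_a = self.var * self.count
        m_b = b_var * b_count
        self.var = (m_a + m_b + delta ** 2 * self.count * b_count / tot) / tot
        self.count = tot

    def normalize(self, x):
        return (x - self.mean) / np.sqrt(self.var + 1e-8)

# Hypothetical batches, for illustration only.
rng = np.random.default_rng(0)
policy_state = rng.normal(5.0, 2.0, size=(64, 8))
expert_state = rng.normal(5.0, 2.0, size=(64, 8))

normalizer = EmpiricalNormalization(8)
normalizer.update(policy_state)

# Normalize BOTH sources before the gradient penalty; otherwise the
# expert inputs are on a different scale than the policy inputs.
policy_in = normalizer.normalize(policy_state)
expert_in = normalizer.normalize(expert_state)
```

With only the policy side normalized, the discriminator sees two inputs on different scales, which distorts the interpolated samples used by the gradient penalty.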

@EGalahad
Author

```python
if self.amp_normalizer is not None:
```

Also, shouldn't the normalizer's statistics be updated before the samples are normalized, rather than after?

@apirrone

apirrone commented Oct 4, 2024

@EGalahad What effect do you think this has?

@EGalahad
Copy link
Author

It just seems odd to pass already-normalized values into the normalizer's update. I think the un-normalized values should be passed instead, so that the normalizer can track the true running mean of the data.
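A small self-contained demo of why this matters (the `RunningMean` class here is a toy tracker I made up for illustration, not the repo's normalizer). Feeding normalized values back into the update makes the tracked mean settle at the wrong value, because each update sees data that has already had the current estimate subtracted:

```python
import numpy as np

class RunningMean:
    """Toy running-mean tracker (illustrative, not the repo's normalizer)."""
    def __init__(self, shape):
        self.mean = np.zeros(shape)
        self.count = 0

    def update(self, batch):
        n = batch.shape[0]
        self.count += n
        self.mean = self.mean + (batch.mean(axis=0) - self.mean) * n / self.count

    def normalize(self, x):
        return x - self.mean

rng = np.random.default_rng(0)
stream = rng.normal(10.0, 1.0, size=(5, 32, 3))  # 5 batches, true mean = 10

correct, wrong = RunningMean(3), RunningMean(3)
for batch in stream:
    correct.update(batch)                  # update with RAW values: mean -> ~10
    wrong.update(wrong.normalize(batch))   # update with normalized values: mean stalls
```

After the stream, `correct.mean` converges near the true mean of 10, while `wrong.mean` gets stuck roughly halfway, since from the second batch on it is averaging in data that already had the estimate subtracted.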
