Models: fix preprocessing transforms #1166

adamjstewart · 2023-03-09T19:20:44Z

Our pre-trained models come with preprocessing transforms that should be used to normalize and reshape images used with them. However, these transforms don't actually run. This PR fixes them and adds tests to ensure that all transforms actually work.

adamjstewart · 2023-03-09T19:21:54Z

torchgeo/models/resnet.py

-                818.86747235,
-            ]
-        ),
+        mean=torch.tensor([590.23569706, 614.21682446, 429.9430203]),


We only have weights for RGB images

adamjstewart · 2023-03-09T19:26:10Z

torchgeo/models/resnet.py

-            ]
-        ),
+        mean=torch.tensor([590.23569706, 614.21682446, 429.9430203]),
+        std=2 * torch.tensor([675.88746967, 582.87945694, 572.41639287]),


SeCo and SSL4EO use 2x the std dev for normalization:

https://github.com/ServiceNow/seasonal-contrast/blob/8285173ec205b64bc3e53b880344dd6c3f79fa7a/datasets/bigearthnet_dataset.py#L112,L113

https://github.com/zhu-xlab/SSL4EO-S12/blob/d2868adfada65e40910bfcedfc49bc3b20df2248/src/benchmark/pretrain_ssl/datasets/SSL4EO/ssl4eo_dataset.py#L37,L38

https://github.com/ServiceNow/seasonal-contrast/blob/8285173ec205b64bc3e53b880344dd6c3f79fa7a/datasets/bigearthnet_dataset.py#L111 Is this also different from the Kornia Normalize method, as they use some min max operation and also clip to range [0, 255]? Or is this something like percentiles used to normalize? Never seen this tbh. I guess my question is does the difference matter?

True, this is actually quite different. They substract (mean - 2 * std), not mean, and they divide by (4 * std), not std. I have no idea why they scale by 255, seems odd unless you're trying to plot something. And we obviously aren't clamping (again, only useful for plotting).

EDIT: oh, is it because they need to load it as a PIL image? Maybe that removes the 255 when converting to a Tensor.

Not sure how they're even using PIL here since it only supports RGB, not MSI.

(I guess that's why SeCo only had RGB weights?)

Ah yes, they divide by 10K * 255 for S1 but not for S2?

The more I look at these, the less sure I am that they are correct. May want to reach out to the authors.

Once we figure this out, we should also fix the SeCO datamodule I recently added. According to @wangyi111, it actually doesn't matter that much what the exact normalization is.

Opened an issue to get to the bottom of this. I actually think we're using the wrong normalization (this was only used for BigEarthNet, not for any other dataset). My guess is the SeCo normalization, but we'll see if we get a response. If I don't hear anything back, let's just assume SeCo normalization.

ServiceNow/seasonal-contrast#19

torchgeo/models/resnet.py

torchgeo/datamodules/seco.py

* Models: fix preprocessing transforms * Fix normalization of SeCo std dev * black * Fix SeCo transforms * Add comment explaining source of transforms

adamjstewart added this to the 0.4.1 milestone Mar 9, 2023

github-actions bot added models Models and pretrained weights testing Continuous integration testing labels Mar 9, 2023

adamjstewart commented Mar 9, 2023

View reviewed changes

torchgeo/models/resnet.py Show resolved Hide resolved

adamjstewart requested a review from nilsleh March 9, 2023 19:26

isaaccorley previously approved these changes Mar 10, 2023

View reviewed changes

adamjstewart marked this pull request as draft March 18, 2023 16:49

adamjstewart added 4 commits March 18, 2023 17:36

Models: fix preprocessing transforms

ab0635e

Fix normalization of SeCo std dev

4baa81a

black

9b5f617

Fix SeCo transforms

397265a

adamjstewart dismissed isaaccorley’s stale review via 397265a March 18, 2023 23:01

adamjstewart force-pushed the fixes/model-transforms branch from 858cc65 to 397265a Compare March 18, 2023 23:01

github-actions bot added the datamodules PyTorch Lightning datamodules label Mar 18, 2023

adamjstewart marked this pull request as ready for review March 18, 2023 23:02

nilsleh previously approved these changes Mar 19, 2023

View reviewed changes

calebrob6 reviewed Mar 22, 2023

View reviewed changes

torchgeo/datamodules/seco.py Show resolved Hide resolved

calebrob6 reviewed Mar 22, 2023

View reviewed changes

torchgeo/datamodules/seco.py Show resolved Hide resolved

calebrob6 previously approved these changes Mar 22, 2023

View reviewed changes

Add comment explaining source of transforms

5471224

adamjstewart dismissed stale reviews from calebrob6 and nilsleh via 5471224 March 22, 2023 16:59

calebrob6 approved these changes Mar 23, 2023

View reviewed changes

calebrob6 merged commit 88df515 into main Mar 23, 2023

calebrob6 deleted the fixes/model-transforms branch March 23, 2023 04:59

calebrob6 pushed a commit that referenced this pull request Apr 10, 2023

Models: fix preprocessing transforms (#1166)

cfe4541

* Models: fix preprocessing transforms * Fix normalization of SeCo std dev * black * Fix SeCo transforms * Add comment explaining source of transforms

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Models: fix preprocessing transforms #1166

Models: fix preprocessing transforms #1166

adamjstewart commented Mar 9, 2023

adamjstewart Mar 9, 2023

adamjstewart Mar 9, 2023

nilsleh Mar 16, 2023 •

edited

Loading

adamjstewart Mar 16, 2023

adamjstewart Mar 16, 2023

adamjstewart Mar 16, 2023

adamjstewart Mar 16, 2023

adamjstewart Mar 16, 2023

adamjstewart Mar 17, 2023

adamjstewart Mar 18, 2023

Models: fix preprocessing transforms #1166

Models: fix preprocessing transforms #1166

Conversation

adamjstewart commented Mar 9, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilsleh Mar 16, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilsleh Mar 16, 2023 •

edited

Loading