Feature/mask NaNs in training loss function #56

sahahner · 2024-10-02T14:28:36Z

Missing values that the imputer fills in should not be considered in the training loss.

The NaN masks are prepared in the imputer. The remapper gains a new function to remap the NaN masks coming from the imputer.

This PR goes together with ecmwf/anemoi-training#72.
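
To illustrate the idea for reviewers, here is a minimal pure-Python sketch of masking imputed positions out of a loss. It is not the code in this PR: the helper names `build_loss_mask`, `impute` and `masked_mse` are hypothetical, the real implementation works on tensors inside the imputer and the training loss, and normalising by the number of unmasked entries is an assumption made for this example.

```python
import math


def build_loss_mask(batch):
    """Return 1.0 where the raw value is present and 0.0 where it is NaN (hypothetical helper)."""
    return [[0.0 if math.isnan(v) else 1.0 for v in row] for row in batch]


def impute(batch, fill_value=0.0):
    """Replace NaNs with a constant; stands in for whatever the imputer actually does."""
    return [[fill_value if math.isnan(v) else v for v in row] for row in batch]


def masked_mse(pred, target, mask):
    """Mean squared error over unmasked entries only (assumed normalisation)."""
    total = 0.0
    count = 0.0
    for p_row, t_row, m_row in zip(pred, target, mask):
        for p, t, m in zip(p_row, t_row, m_row):
            total += m * (p - t) ** 2
            count += m
    return total / count if count else 0.0


raw_target = [[1.0, math.nan], [2.0, 4.0]]
loss_mask = build_loss_mask(raw_target)   # computed before imputation
target = impute(raw_target)               # NaN replaced by the fill value
prediction = [[1.5, 9.0], [2.0, 3.0]]
print(masked_mse(prediction, target, loss_mask))
```

The prediction of 9.0 at the imputed position has no effect on the result, because the corresponding mask entry is 0.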

codecov-commenter · 2024-10-02T14:40:39Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.85%. Comparing base (0e03d33) to head (15cf7b9).
Report is 1 commit behind head on develop.

@@           Coverage Diff            @@
##           develop      #56   +/-   ##
========================================
  Coverage    99.85%   99.85%           
========================================
  Files           23       23           
  Lines         1350     1374   +24     
========================================
+ Hits          1348     1372   +24     
  Misses           2        2

floriankrb · 2024-10-15T07:39:38Z

This functionality seems to be related to ecmwf/anemoi-training#79.
Perhaps the masks.py created by @JPXKQX should move to anemoi-models, and a [refactored version of] OutputMask could be used here?

JPXKQX · 2024-10-15T10:02:15Z

I see some similarities between the output masking and the post-processors, but one part doesn't fit: the post-processors are only applied at the end of the rollout, whereas the masking is called not only at the end but also between all the rollout steps (to roll out the boundary forcing). So I don't know whether it's better to include it as a special post-processor or to leave it in anemoi-training.

I would say that we can do the loss masking here, similar to the imputer, but I think the output masking should remain in anemoi-training.
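
To make the difference concrete, here is a rough sketch of a rollout in which the mask is applied after every step rather than once at the end. This is only an illustration under assumptions: `apply_output_mask`, `rollout`, `model_step` and `inside_domain` are hypothetical names, and overwriting the values outside the domain with the true state is one assumed reading of "rolling out the boundary forcing", not the actual OutputMask code.

```python
def apply_output_mask(prediction, truth, inside_domain):
    """Keep the model output inside the domain and take the truth elsewhere (assumed behaviour)."""
    return [p if keep else t for p, t, keep in zip(prediction, truth, inside_domain)]


def rollout(model_step, initial_state, truths, inside_domain):
    """Advance the state once per entry in truths, masking after every step."""
    state = initial_state
    states = []
    for truth in truths:
        state = model_step(state)
        # Masking between steps: the next step starts from forced boundary values.
        state = apply_output_mask(state, truth, inside_domain)
        states.append(state)
    return states


def toy_step(state):
    """Toy model used only for this example: add 1 to every grid point."""
    return [v + 1.0 for v in state]


inside_domain = [False, True, False]          # only the middle point is predicted
truths = [[10.0, 99.0, 20.0], [30.0, 99.0, 40.0]]
print(rollout(toy_step, [0.0, 0.0, 0.0], truths, inside_domain))
# [[10.0, 1.0, 20.0], [30.0, 2.0, 40.0]]
```

A post-processor applied only to the final state could not reproduce this, since the boundary values feed back into every intermediate step.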

jakob-schloer

After testing and discussing with @sahahner, I approve the changes.

sahahner added 7 commits September 23, 2024 14:10

make preprocessors iterable

38d78c6

feat: calculate nan mask for loss function in imputer forward pass

e0c6067

remove iterators from baseprocessors

cf22b5e

transform loss nan mask in remapper

fa16cb2

Merge branch 'develop' into feature/mask-NaNs-in-training-loss-function

27d682d

use internal model indices for bounding

3551992

Merge branch 'develop' into feature/mask-NaNs-in-training-loss-function

3c4d93c

sahahner mentioned this pull request Oct 2, 2024

Feature/mask NaNs in training loss function ecmwf/anemoi-training#72

Merged

changelog

87647b7

sahahner added 3 commits November 11, 2024 14:05

Merge branch 'develop' into feature/mask-NaNs-in-training-loss-function

709e7ac

tests

29669a6

Merge branch 'develop' into feature/mask-NaNs-in-training-loss-function

a083d31

sahahner marked this pull request as ready for review November 13, 2024 09:09

floriankrb reviewed Nov 13, 2024

src/anemoi/models/preprocessing/remapper.py (outdated, resolved)

remove obsolete enumerate

94f0d52

sahahner self-assigned this Nov 13, 2024

sahahner added the enhancement label Nov 13, 2024

HCookie reviewed Nov 15, 2024

src/anemoi/models/preprocessing/imputer.py (outdated, resolved)

Update src/anemoi/models/preprocessing/imputer.py

fcc4bcd

Co-authored-by: Harrison Cook <[email protected]>

sahahner requested a review from jakob-schloer November 15, 2024 11:30

sahahner added 2 commits November 18, 2024 14:11

nan mask and loss mask calculate together

7aefe56

Merge branch 'feature/mask-NaNs-in-training-loss-function' of github.com:ecmwf/anemoi-models into feature/mask-NaNs-in-training-loss-function

15cf7b9

jakob-schloer approved these changes Nov 19, 2024

sahahner merged commit fd2bcf1 into develop Nov 22, 2024
121 checks passed
