Lazy rectilinear interpolator #6084

fnattino · 2024-07-25T09:08:28Z

🚀 Pull Request

Description

Different take to enable the rectilinear interpolator to run lazily #6002 .

Trying to address the same issue as #6006, but there I have made the underlying _RegularGridInterpolator from _scipy_interpolate to run on lazy data, which required switching from scipy.sparse to sparse (not ideal since it would add numba as a dependency).

Here I have tried instead to implement a similar approach as used for regridding, which works on lazy data as well. The downside is that the chunks in the dimensions we are interpolating over need to be merged, but at least we could run interpolation in parallel over the chunks in the other dimensions (and we do not need to add extra dependencies to iris).

lib/iris/analysis/_interpolation.py

bouweandela · 2024-08-26T12:09:38Z

Nice to see this progressing @fnattino! Did you notice that CI is failing on this pull request?

codecov · 2024-08-27T20:39:27Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.83%. Comparing base (d76a4d8) to head (3481e46).
Report is 4 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #6084   +/-   ##
=======================================
  Coverage   89.82%   89.83%           
=======================================
  Files          88       88           
  Lines       23150    23180   +30     
  Branches     5043     5043           
=======================================
+ Hits        20794    20823   +29     
+ Misses       1624     1622    -2     
- Partials      732      735    +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

lib/iris/tests/unit/analysis/interpolation/test_RectilinearInterpolator.py

trexfeathers · 2024-09-12T07:57:40Z

@fnattino just want to reassure you that I have been looking at this, but since I have never worked with our interpolators before it is slow progress. Having another go this afternoon with help from some coffee ☕

fnattino · 2024-09-12T08:07:25Z

No worries @trexfeathers, but thanks for the heads-up! :)

trexfeathers

Hi @fnattino, thank you for your hard work on this.

Here is a partial review. I have left myself a couple of TODO comments. But the suggestions I have already might take some time, and may change the code significantly - mainly #6084 (comment) - so it seemed important to get these suggestions to you as soon as possible.

Also thank you to @HarriKeenan for helping me with this review last week 🤜🤛

lib/iris/analysis/_interpolation.py

trexfeathers · 2024-09-05T14:04:50Z

lib/iris/analysis/_interpolation.py

+def _interpolated_dtype(dtype, method):
+    """Determine the minimum base dtype required by the underlying interpolator."""
+    if method == "nearest":
+        result = dtype
+    else:
+        result = np.result_type(_DEFAULT_DTYPE, dtype)
+    return result


If this needs to stay (see my other comment about args=[self] - #6084 (comment)), then I'd be interested in us unifying this function with RectilinearInterpolator._interpolated_dtype().

Do you mean to put this back as a (static)method of the RectilinearInterpolator? Or to merge the body of this function with RectilinearInterpolator._interpolate?

I mean find any appropriate way for there to be only 1 _interpolated_dtype() in this file, which can be used by both:

RectilinearInterpolator._interpolate()

RectilinearInterpolator._points()

Up to you how this is achieved.

lib/iris/analysis/_interpolation.py

lib/iris/tests/unit/analysis/interpolation/test_RectilinearInterpolator.py

Co-authored-by: Martin Yeo <[email protected]>

…iris into lazy-rectilinearinterpolator-2

fnattino

@trexfeathers thanks a lot for the review and apologies for the scattered response.

As I have tried to explain in reply to your comments, I am a bit hesitant to implement the solution that would copy the current instance of the RectilinearInterpolation - but I am very curious to hear your thoughts on this!

lib/iris/analysis/_interpolation.py

fnattino · 2024-09-20T12:45:42Z

lib/iris/analysis/_interpolation.py

+def _interpolated_dtype(dtype, method):
+    """Determine the minimum base dtype required by the underlying interpolator."""
+    if method == "nearest":
+        result = dtype
+    else:
+        result = np.result_type(_DEFAULT_DTYPE, dtype)
+    return result


Do you mean to put this back as a (static)method of the RectilinearInterpolator? Or to merge the body of this function with RectilinearInterpolator._interpolate?

trexfeathers

OK I got my head back in the space and I understand your logic better now. Nearly there.

If you're busy with other things please let me know and I can try actioning the remaining stuff myself.

lib/iris/analysis/_interpolation.py

trexfeathers · 2024-11-14T16:52:45Z

lib/iris/analysis/_interpolation.py

+def _interpolated_dtype(dtype, method):
+    """Determine the minimum base dtype required by the underlying interpolator."""
+    if method == "nearest":
+        result = dtype
+    else:
+        result = np.result_type(_DEFAULT_DTYPE, dtype)
+    return result


I mean find any appropriate way for there to be only 1 _interpolated_dtype() in this file, which can be used by both:

RectilinearInterpolator._interpolate()

RectilinearInterpolator._points()

Up to you how this is achieved.

lib/iris/analysis/_interpolation.py

lib/iris/tests/unit/analysis/interpolation/test_RectilinearInterpolator.py

lib/iris/analysis/_interpolation.py

trexfeathers · 2024-11-15T16:18:58Z

lib/iris/analysis/_interpolation.py

I tried writing a benchmark to demonstrate the benefits of this but everything either stayed the same or got slower.

Could you share with me the results you have seen? Especially if it's something I/we could turn into a benchmark. Thanks

trexfeathers

Clicked the wrong button with my previous review!

fnattino · 2024-11-15T16:26:34Z

Thanks a lot for the effort of getting back to this @trexfeathers ! I should have time to get back to this next week, but I will keep you posted!

lazy interpolation using map_complete_blocks

b77af1d

fnattino commented Jul 25, 2024

View reviewed changes

lib/iris/analysis/_interpolation.py Show resolved Hide resolved

fnattino commented Jul 25, 2024

View reviewed changes

lib/iris/analysis/_interpolation.py Show resolved Hide resolved

fnattino mentioned this pull request Jul 25, 2024

Lazy rectilinear interpolator, with sparse #6006

Open

pre-commit fixes

0021cb0

fnattino marked this pull request as ready for review July 25, 2024 09:44

replace test on interpolation with lazy data

18908ae

fnattino commented Aug 27, 2024

View reviewed changes

lib/iris/tests/unit/analysis/interpolation/test_RectilinearInterpolator.py Show resolved Hide resolved

trexfeathers self-assigned this Sep 4, 2024

Merge branch 'main' into lazy-rectilinearinterpolator-2

47a8599

trexfeathers requested changes Sep 12, 2024

View reviewed changes

fnattino and others added 6 commits September 13, 2024 11:47

Update lib/iris/analysis/_interpolation.py

555f3c7

Co-authored-by: Martin Yeo <[email protected]>

Update lib/iris/analysis/_interpolation.py

7a08108

Co-authored-by: Martin Yeo <[email protected]>

Merge branch 'lazy-rectilinearinterpolator-2' of github.com:fnattino/…

c453e01

…iris into lazy-rectilinearinterpolator-2

resume local import

3814383

add entry to latest.rst

0c5dc9a

add author name to list

3481e46

fnattino commented Sep 20, 2024

View reviewed changes

fnattino requested a review from trexfeathers September 20, 2024 13:05

trexfeathers approved these changes Nov 15, 2024

View reviewed changes

trexfeathers requested changes Nov 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lazy rectilinear interpolator #6084

Lazy rectilinear interpolator #6084

fnattino commented Jul 25, 2024

bouweandela commented Aug 26, 2024

codecov bot commented Aug 27, 2024 •

edited

Loading

trexfeathers commented Sep 12, 2024

fnattino commented Sep 12, 2024

trexfeathers left a comment •

edited

Loading

trexfeathers Sep 5, 2024 •

edited

Loading

fnattino Sep 20, 2024

trexfeathers Nov 14, 2024

fnattino left a comment

fnattino Sep 20, 2024

trexfeathers left a comment •

edited

Loading

trexfeathers Nov 14, 2024

trexfeathers Nov 15, 2024

trexfeathers left a comment

fnattino commented Nov 15, 2024

Lazy rectilinear interpolator #6084

Are you sure you want to change the base?

Lazy rectilinear interpolator #6084

Conversation

fnattino commented Jul 25, 2024

🚀 Pull Request

Description

bouweandela commented Aug 26, 2024

codecov bot commented Aug 27, 2024 • edited Loading

Codecov Report

trexfeathers commented Sep 12, 2024

fnattino commented Sep 12, 2024

trexfeathers left a comment • edited Loading

Choose a reason for hiding this comment

trexfeathers Sep 5, 2024 • edited Loading

Choose a reason for hiding this comment

fnattino Sep 20, 2024

Choose a reason for hiding this comment

trexfeathers Nov 14, 2024

Choose a reason for hiding this comment

fnattino left a comment

Choose a reason for hiding this comment

fnattino Sep 20, 2024

Choose a reason for hiding this comment

trexfeathers left a comment • edited Loading

Choose a reason for hiding this comment

trexfeathers Nov 14, 2024

Choose a reason for hiding this comment

trexfeathers Nov 15, 2024

Choose a reason for hiding this comment

trexfeathers left a comment

Choose a reason for hiding this comment

fnattino commented Nov 15, 2024

codecov bot commented Aug 27, 2024 •

edited

Loading

trexfeathers left a comment •

edited

Loading

trexfeathers Sep 5, 2024 •

edited

Loading

trexfeathers left a comment •

edited

Loading