-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Speed evaluation of the smmregrid tool #2
Comments
Commit b08b045 establish a good starting point:
Few points:
IMPORTANT: this numbers does not take into account the loading of the data |
Numbers are less incredible if we take into account the loading of the data into memory i.e.
We are still faster than CDO for single DataArray, but the speedup is small. |
Conversely, we get to very bad scaling when we use
|
Last commit in #2 suggest significant improvements. Considering that we are not using dask yet, this can be considered as a success.
|
This issue is to keep track of the speed tests that I have been to see what is the optimal configuration for the regridder based on #1
The tests are based on files on different grids (curvilinear, gaussian, gaussian reduced, lonlat and unstructured) to cover all the possibilities, with 2D files, files with mask (i.e. ocean files) and files with pressure levels. We also tested the access of the entire xarray.Dataset versus working on the single xarray.DataArray. The writing of the NetCDF output is also assessed. All tests are run with conservative remapping.
The tests can be found in the playground notebook, and are based on multiple repetition (usually 20 fo each operation). https://github.com/jhardenberg/smmregrid/blob/devel/extend/playground.ipynb
The text was updated successfully, but these errors were encountered: