-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
floating point exception in LDAS_DEBUGCONUS/model test when using ESMA_env v4.26.0 #726
Comments
Hmm. There definitely was no change in netcdf-fortran in the newer Baselibs. And while HDF5 did update, if that caused it, every netcdf open would fall apart. |
Such an odd traceback:
It's almost like the file is odd. I think we need to know what file was trying to be opened and take a look at it. I wonder if it's something that HDF5 1.10 let through but HDF5 1.14 is a bit more sensitive or exacting with? |
@mathomp4, I think the perhaps more useful part of the backtrace is ~line 2567 in GEOSldas_err_txt, see below for an excerpt. The run is trying to read a MERRA-2 file. Here's the corresponding log entry from the successful CONUS run with standard optimization:
Excerpt from GEOSldas_err_txt of failed run:
|
@gmao-rreichle This might be a moot issue. We've discovered some other issues with HDF5 1.14 in some of our testing. So I might be moving back our HDF5 to 1.10 for now. Weirdly, the issues we see in Baselibs with HDF5 1.14 don't seem to be happening with Spack + 1.14, so I'm...perplexed. |
I guess this issue is solved by this |
@weiyuan-jiang No. That was an attempt to work around it. I'm currently trying to build Baselibs 7.20.0 everywhere and then I'll make a new ESMA_env which reverts to HDF5 1.10 |
The
LDAS_DEBUGCONUS/model
test crashes with a floating point exception when using ESMA_env v4.26.0. The test runs ok with ESMA_env v4.23.0. All other tests (incl. GNUDEBUGCONUS) are ok.Note that ESMA_env v4.26.0 uses a new version of HDF5.
The GEOSldas "err" and "log" files from the run that crashed are:
GEOSldas_err_txt.txt
GEOSldas_log_txt.txt
The log file suggests that the floating point exception occurs when opening an GEOS nc file (Line 5376 of LDAS_Forcing.F90) using
nf90_open()
.I overlooked this problem when testing for #713, where I probably only ran the standard tests and not the debug tests.
I suspect the problem is not within ESMA_env v4.26.0 but rather poor coding in LDAS that is exposed with the DEBUG build .
cc: @mathomp4 @weiyuan-jiang @biljanaorescanin
The text was updated successfully, but these errors were encountered: