Add WW3 recipe #57

Closed
wants to merge 18 commits into from

Conversation

jbusecke
Contributor

@jbusecke jbusecke commented Oct 17, 2023

@jbusecke
Contributor Author

Recipe is running:
image

I'll report back after this finishes.

@jbusecke
Contributor Author

pre-commit.ci autofix

@jbusecke
Contributor Author

Got some strange errors during the first two runs:

File "/srv/conda/envs/notebook/lib/python3.9/site-packages/gcsfs/retry.py", line 104, in validate_response
    raise HttpError({"code": status, "message": msg})  # text-like
gcsfs.retry.HttpError: Request range not satisfiable, 416

@leap-stc/data-management-devs, does this look familiar to anyone?

It seems to fail in the rechunk stage:
image
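
For context on the 416 status in the traceback: under HTTP range-request semantics, a byte range is unsatisfiable when its first byte lies at or beyond the end of the object. Here is a minimal sketch of that rule. `range_is_satisfiable` is a hypothetical helper, not part of gcsfs, and nothing here confirms that a mismatched file size is the actual cause in this job.

```python
def range_is_satisfiable(first_byte: int, object_size: int) -> bool:
    """Return True if a byte range starting at `first_byte` can be served.

    A range is satisfiable only if it starts inside the object, i.e. its
    first byte position is smaller than the current object length.
    """
    return 0 <= first_byte < object_size


# Hypothetical object of 1000 bytes (valid byte positions are 0..999).
object_size = 1000

inside = range_is_satisfiable(500, object_size)         # starts inside the object
at_end = range_is_satisfiable(1000, object_size)        # starts exactly at the end
empty_object = range_is_satisfiable(0, 0)               # nothing to read at all
```

A reader that believes the object is larger than it really is (for example, because of a truncated or replaced file) would request a range like the second case and get a 416 back.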

@jbusecke
Contributor Author

Hmm, repeat runs also fail, but each with a slightly different error.

Wondering if we are running out of memory?
image

dataflow job

@jbusecke
Contributor Author

Perhaps we should use Dataflow Prime for the ingestion as well?

@jbusecke
Contributor Author

Added Dataflow Prime to the config for this job. But it's late, and I'll check on this tomorrow.

@jbusecke
Contributor Author

That does not seem to have helped. Any clues that I might have missed, @cisaacstern?

@jbusecke
Contributor Author

Still fails... I have extended the year range to rule out a problem with the two particular files we downloaded previously.

@jbusecke
Contributor Author

I was debugging this with @cisaacstern, and he noticed that the files are heavily compressed (3 GB vs 17 GB!), which might overwhelm our workers' memory.

Trying to deactivate Prime and use a big machine for testing now.
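
A rough back-of-the-envelope check of the memory concern, assuming the 3 GB figure is the compressed size on disk and 17 GB is the decompressed size of a single file. The worker memory values and the `copies_in_flight` factor are illustrative assumptions, not measurements from the Dataflow job.

```python
def estimate_peak_memory_gb(uncompressed_gb: float, copies_in_flight: int = 2) -> float:
    """Rough peak memory if a worker holds several decompressed copies of one file.

    `copies_in_flight` is a hypothetical fudge factor for intermediate copies
    (e.g. one decoded array plus one rechunked copy).
    """
    return uncompressed_gb * copies_in_flight


def fits_on_worker(uncompressed_gb: float, worker_memory_gb: float,
                   copies_in_flight: int = 2) -> bool:
    """Return True if the estimated peak fits within the worker's memory."""
    return estimate_peak_memory_gb(uncompressed_gb, copies_in_flight) <= worker_memory_gb


compressed_gb = 3.0     # size on disk, from the comment above
uncompressed_gb = 17.0  # decompressed size, from the comment above

compression_ratio = uncompressed_gb / compressed_gb

# Hypothetical worker sizes, chosen only to illustrate the comparison.
small_worker_ok = fits_on_worker(uncompressed_gb, worker_memory_gb=16.0)
big_worker_ok = fits_on_worker(uncompressed_gb, worker_memory_gb=64.0)
```

Under these assumptions a single decompressed file already exceeds a 16 GB worker, which would be consistent with the runs failing at different points each time.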

@jbusecke
Contributor Author

pre-commit.ci autofix

@jbusecke
Contributor Author

So I have done some more investigation on this case. See this gist.

Just deployed another version that incorporates lessons learned here: 5672faa

Let's see how that fares...

@jbusecke
Contributor Author

@cisaacstern it seems like we are running into some issues with the new schema validation for this recipe.

If you have a minute, could you look into this? Does the current way we set up this repo not conform to the schema, or is there some other problem?

@jbusecke jbusecke mentioned this pull request Dec 11, 2023
@jbusecke
Contributor Author

I am very sorry for the long delay here. I have started working on this again as an example for our new (to be announced) structure of data management here: https://github.com/leap-stc/ww3_feedstock.

@jbusecke
Contributor Author

jbusecke commented May 2, 2024

@jbusecke jbusecke closed this May 2, 2024
Development

Successfully merging this pull request may close these issues.

New Dataset [Ifremer WW3 global wave hindcasts]