You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on May 1, 2023. It is now read-only.
🤖 This is the current blocker for a full hyperparameter search.
Currently have to write the manuscript, so unable to debug in detail. We need a permanent solution shared across projects, therefore the issue here.
Is this related to the latest internet shutdown? Maybe so, but we already open the URLs I could find from wandb
https://api.wandb.ai
.train_models_in_parallel
script--multirun
Appears we might be hitting rate limiting, since training a single model worked fine the first time, and then didn't work?
Proposed next steps for debugging:
Check if we can train and upload even a single model (note that syncing continuous after
If we can, but cannot train in parallel, appears it's a wandb problem. Potential next steps:
If anyone is up for debugging, they're more than welcome to go ahead and collect thoughts here! @HLasse, @sarakolding, @signekb, @bokajgd, @erikperfalk.
The text was updated successfully, but these errors were encountered: