The way make_jobs.py is structured, it won't send any fits until every model has been run for every target, or until one of the jobs times out. This is a problem because occasionally one model gets stuck on an object (or one object gets stuck on all the models), and that holds up the results for everything else. I'm not sure exactly how to revise the code, but it would be good to have some way to identify the jobs that are holding the overall script back, and possibly post the other fits to Slack ahead of time with a message saying that one or more jobs are taking too long.
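As a rough illustration of the "identify the jobs holding things back" part, here is a minimal sketch. Everything in it is hypothetical: the `FitJob` record, the state strings, the `soft_limit_hours` threshold, and the message wording are assumptions for illustration, not how make_jobs.py currently tracks jobs.

```python
from dataclasses import dataclass


@dataclass
class FitJob:
    # Hypothetical record; make_jobs.py may track jobs differently.
    target: str
    model: str
    state: str            # assumed values: "running", "done", "failed"
    elapsed_hours: float


def split_stragglers(jobs, soft_limit_hours=4.0):
    """Return (finished, stragglers).

    finished:   jobs whose fits are ready to post.
    stragglers: jobs still running past the (assumed) soft limit.
    """
    finished = [j for j in jobs if j.state == "done"]
    stragglers = [
        j for j in jobs
        if j.state == "running" and j.elapsed_hours > soft_limit_hours
    ]
    return finished, stragglers


def straggler_note(stragglers):
    """Build a short message naming the slow jobs, or '' if there are none."""
    if not stragglers:
        return ""
    names = ", ".join(f"{j.target}/{j.model}" for j in stragglers)
    return f"Still waiting on {len(stragglers)} slow fit job(s): {names}"
```

The idea would be to post the `finished` fits right away and attach the note from `straggler_note`, so the channel knows which target/model pairs are still outstanding.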
Partially addressed in 56774c1: the job time limit for the different models has been reduced from 12 to 8 hours, and the timeout flag in make_jobs.py is now more likely to actually work. A rework would still be beneficial, though. One option is to structure the jobs so that all instances of one model, or all instances of one candidate, are pushed to the Slack channel as soon as that group finishes, rather than waiting for every model to finish for every candidate. This would require reworking the way make_jobs.py submits and checks on fit jobs.
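A minimal sketch of the per-group release idea, assuming job status can be collected into a dict keyed by `(candidate, model)`. The function name, the state strings, and the `already_posted` bookkeeping are all hypothetical and not taken from make_jobs.py.

```python
from collections import defaultdict

# Assumed set of states that mean a job will make no further progress.
TERMINAL = {"done", "failed", "timeout"}


def ready_groups(job_states, already_posted, by="candidate"):
    """Return the sorted group names whose jobs have all reached a terminal state.

    job_states:     dict mapping (candidate, model) -> state string.
    already_posted: set of group names that were pushed to Slack earlier.
    by:             "candidate" groups by candidate; anything else groups by model.
    """
    index = 0 if by == "candidate" else 1
    groups = defaultdict(list)
    for key, state in job_states.items():
        groups[key[index]].append(state)
    return sorted(
        name
        for name, states in groups.items()
        if name not in already_posted and all(s in TERMINAL for s in states)
    )
```

The polling loop could call this on each pass, push whatever groups come back, and add them to `already_posted`, so a single stuck job only delays its own candidate (or its own model) instead of the whole batch.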