
how does resuming a batch output job work? #172

Open
ctb opened this issue Jan 12, 2025 · 3 comments

@ctb (Contributor) commented Jan 12, 2025:

something died with OOM and then resuming didn't work - it just restarted from scratch. Any tips or tricks?

@bluegenes (Collaborator) commented:

Were you using --batch-size? Can you give me the output from the resumed run?

In short, if you are using batches you get (unknown).n.zip zipfiles, where n is the batch number and (unknown).zip is the specified output. If we find any (unknown).n.zip files on a subsequent run, we read all that we can, ignoring incomplete batches, and continue forward with batch n+1.

If you are not using batches, we do not resume, because as far as I know Rust zip utilities can't append to existing zips, and incomplete zipfiles are not readable. With the current strategy, if we were to read (unknown).zip, we would count those sketches as 'done', but we would then overwrite (unknown).zip with the new sketches (meaning we would lose the old ones). An alternative would be to read that file and copy all the old sketches into memory before writing them out again, together with the new ones, into the same output.
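The batch-detection step described above can be sketched roughly as follows. This is a hypothetical illustration, not the plugin's actual Rust code: the function name, the directory-scan approach, and the omission of per-zip validity checking are all assumptions made for the example.

```python
import re
from pathlib import Path

def next_batch_index(output_zip: str, dirpath: str = ".") -> int:
    """Scan dirpath for existing batch zipfiles named like <stem>.<n>.zip
    and return the batch index to resume from (highest batch found + 1).

    Sketch only: a real implementation would also open each zip and skip
    incomplete/unreadable batches before counting them as done.
    """
    stem = Path(output_zip).stem  # "out" for "out.zip"
    pattern = re.compile(rf"^{re.escape(stem)}\.(\d+)\.zip$")
    found = []
    for p in Path(dirpath).iterdir():
        m = pattern.match(p.name)
        if m:
            found.append(int(m.group(1)))
    return max(found) + 1 if found else 1
```

So with `out.1.zip` and `out.2.zip` on disk, a resumed run would pick up at batch 3.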

Happy to modify if I'm missing something about rust zip writing or you have other strategy suggestions.

@ctb (Contributor, Author) commented Jan 13, 2025:

I was using batches, but resuming didn't pick them up. Maybe I got something wrong; I'll give it another try!

For the bigger databases, I'm also thinking of doing a manual split of the input CSV to get to a small chunk size and then using snakemake on that. Animal genomes are all really big!
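The manual split mentioned above could look something like this. It's a generic sketch, not part of any existing tool: the function name, chunk naming scheme, and default chunk size are all assumptions; each chunk repeats the header so it remains a valid standalone input CSV for a per-chunk snakemake job.

```python
import csv
from pathlib import Path

def split_csv(in_csv: str, out_dir: str, chunk_size: int = 100) -> list:
    """Split in_csv into files of at most chunk_size data rows each,
    repeating the header row in every chunk. Returns the chunk paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with open(in_csv, newline="") as fh:
        reader = csv.reader(fh)
        header = next(reader)
        rows = list(reader)
    paths = []
    for i in range(0, len(rows), chunk_size):
        path = out / f"chunk_{i // chunk_size}.csv"
        with open(path, "w", newline="") as ofh:
            writer = csv.writer(ofh)
            writer.writerow(header)
            writer.writerows(rows[i:i + chunk_size])
        paths.append(str(path))
    return paths
```

A snakemake workflow could then treat each `chunk_*.csv` as an independent input, so an OOM kill only loses one chunk's worth of work.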

@bluegenes (Collaborator) commented Jan 13, 2025:

I also think using the NCBI REST API links instead might help, especially since we could increase the number of simultaneous downloads by providing an API key. I'll make an issue for that.

It is much faster with simultaneous downloads, especially since genome sizes vary and the biggest ones take a lot of time.
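Bounded simultaneous downloads of the kind discussed above can be sketched with a worker pool. This is a hypothetical illustration, not the plugin's code: `fetch_one` is a placeholder for whatever performs a single genome download, and the worker bound is an assumption that would in practice be set from NCBI's rate limits (which, per the discussion, can be raised with an API key).

```python
from concurrent.futures import ThreadPoolExecutor, as_completed

def fetch_all(accessions, fetch_one, max_workers=3):
    """Run fetch_one(accession) concurrently for every accession, with at
    most max_workers downloads in flight. Returns {accession: result},
    recording exceptions instead of raising so one failure doesn't
    abort the whole run (failed items can be retried on resume)."""
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(fetch_one, acc): acc for acc in accessions}
        for fut in as_completed(futures):
            acc = futures[fut]
            try:
                results[acc] = fut.result()
            except Exception as exc:
                results[acc] = exc
    return results
```

Because genome sizes vary widely, a pool like this keeps the small downloads flowing while the biggest ones are still in progress.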
