🗂️ Allow to submit analyses by batches #29

Open
juanesarango opened this issue Jan 20, 2023 · 0 comments
Labels
enhancement New feature or request

juanesarango commented Jan 20, 2023

New feature

Sometimes submitting hundreds of analyses is very time-consuming, and querying is slow because the API queries get very large. It would be good to allow submitting jobs in batches:

How it works now:

$ isabl apps-grch37 bwa-mem-0.7.17.r1188 -fi projects 100
Retrieving 2000 from experiments API endpoint...

RUNNING 2000 TUPLES FOR CICERO 1.9.2 GRCH37
----------------------------------------

Checking for existing analyses...
Retrieving 0 from analyses API endpoint...
Creating analyses for 2000 tuples...		  [####################################]  100%

IDENTIFIER          PROJECTS            TARGETS                                 REFERENCES          MESSAGE
493756              100 (TD)            IID_H100004_T01_01_TD01                 0                   READY FOR SUBMISSION /work/isabl/data/analyses/37/56/493756
[...]

RAN 2000 | SKIPPED 0 | INVALID 0

Add --commit to proceed.

This can take tens of minutes to run ⌛️😴💤, as the queried object is very large.

Suggestion:

$ isabl apps-grch37 bwa-mem-0.7.17.r1188 -fi projects 100 --batch 400
BATCH MODE: Running 1-400 of 2000
Retrieving 400 from experiments API endpoint...

RUNNING 400 TUPLES FOR CICERO 1.9.2 GRCH37
----------------------------------------

Checking for existing analyses...
Retrieving 0 from analyses API endpoint...
Creating analyses for 400 tuples...		  [####################################]  100%

IDENTIFIER          PROJECTS            TARGETS                                 REFERENCES          MESSAGE
493756              100 (TD)            IID_H100004_T01_01_TD01                 0                   READY FOR SUBMISSION /work/isabl/data/analyses/37/56/493756
[...]

RAN 400 | SKIPPED 0 | INVALID 0


BATCH MODE: Running 401-800 of 2000
Retrieving 400 from experiments API endpoint...

RUNNING 400 TUPLES FOR CICERO 1.9.2 GRCH37
----------------------------------------

Checking for existing analyses...
Retrieving 0 from analyses API endpoint...
Creating analyses for 400 tuples...		  [####################################]  100%

IDENTIFIER          PROJECTS            TARGETS                                 REFERENCES          MESSAGE
493756              100 (TD)            IID_H100004_T01_01_TD01                 0                   READY FOR SUBMISSION /work/isabl/data/analyses/37/56/493756
[...]

RAN 400 | SKIPPED 0 | INVALID 0


[...]


BATCH MODE: Running 1601-2000 of 2000
Retrieving 400 from experiments API endpoint...

RUNNING 400 TUPLES FOR CICERO 1.9.2 GRCH37
----------------------------------------

Checking for existing analyses...
Retrieving 0 from analyses API endpoint...
Creating analyses for 400 tuples...		  [####################################]  100%

IDENTIFIER          PROJECTS            TARGETS                                 REFERENCES          MESSAGE
493756              100 (TD)            IID_H100004_T01_01_TD01                 0                   READY FOR SUBMISSION /work/isabl/data/analyses/37/56/493756
[...]

RAN 400 | SKIPPED 0 | INVALID 0

Add --commit to proceed.

This way it might be ⚡️ faster, as API responses are smaller, and the first batches of analyses can start running while the others are still being created.
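For illustration only, here is a minimal Python sketch of how the batching could be done. It is not taken from the isabl codebase: `iter_batches`, `run_in_batches`, and the `process` callback are hypothetical names, and `process` stands in for whatever step creates and submits the analyses for one batch.

```python
from typing import Callable, Iterator, List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def iter_batches(
    items: Sequence[T], batch_size: int
) -> Iterator[Tuple[int, int, Sequence[T]]]:
    """Yield (first, last, chunk), where first/last are 1-based inclusive positions."""
    if batch_size < 1:
        raise ValueError("batch_size must be a positive integer")
    for start in range(0, len(items), batch_size):
        chunk = items[start:start + batch_size]
        yield start + 1, start + len(chunk), chunk


def run_in_batches(
    items: Sequence[T],
    batch_size: int,
    process: Callable[[Sequence[T]], None],
) -> List[str]:
    """Process items one batch at a time and return the printed batch headers.

    `process` is a hypothetical callback (an assumption of this sketch) that
    would create and submit the analyses for a single batch.
    """
    total = len(items)
    headers = []
    for first, last, chunk in iter_batches(items, batch_size):
        header = f"BATCH MODE: Running {first}-{last} of {total}"
        print(header)
        headers.append(header)
        # Each batch is fully handled before the next one is touched, so
        # earlier batches can already be running while later ones wait.
        process(chunk)
    return headers


if __name__ == "__main__":
    # 2000 placeholder items split into batches of 400, as in the suggestion.
    run_in_batches(list(range(2000)), 400, lambda chunk: None)
```

Handling each batch to completion before moving on is what would let the first analyses start while the rest are still being created. Whether the API queries themselves get smaller depends on how the experiments are retrieved, which this sketch does not cover.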

@juanesarango juanesarango added the enhancement New feature or request label Jan 20, 2023