Determine configurations to improve process runtimes #21

Open
cmroughan opened this issue Oct 25, 2024 · 0 comments

cmroughan commented Oct 25, 2024

Testing with Selenium has been in progress, focusing on file uploads/deletions and segmentation recognition (as per issue #12).

As of 2024-10-25, these rounds of testing have been run on the following VM configurations:

  • 2 CPUs, 16 GB memory
  • 4 CPUs, 16 GB memory
  • 4 CPUs, 32 GB memory

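However, these CPU and memory increases have not led to any real improvement in runtimes: import averages stayed between roughly 27 and 30 minutes, and segmentation stayed at about an hour.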

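| CPUs | Memory | Import test average runtime (h:mm:ss) | Segmentation (lines) runtime (h:mm:ss) | Segmentation (lines + regions) runtime (h:mm:ss) |
| --- | --- | --- | --- | --- |
| 2 | 16 GB | 0:27:32 | 1:05:00 | - |
| 4 | 16 GB | 0:29:24 | - | 0:58:00 |
| 4 | 32 GB | 0:28:32 | 1:01:00 | 1:01:00 |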
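
To put a number on "no real improvement", the import averages above can be compared against the 2 CPU / 16 GB run. This is a minimal sketch; the helper names (`parse_runtime`, `percent_change`) are made up for illustration, and the only data used is the three import averages from the table.

```python
from datetime import timedelta


def parse_runtime(text: str) -> timedelta:
    """Convert an 'h:mm:ss' string from the table into a timedelta."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def percent_change(baseline: timedelta, other: timedelta) -> float:
    """Relative change of `other` versus `baseline`, in percent."""
    return (other - baseline) / baseline * 100.0


# Import test average runtimes, copied from the table.
IMPORT_RUNTIMES = {
    "2 CPUs, 16 GB": "0:27:32",
    "4 CPUs, 16 GB": "0:29:24",
    "4 CPUs, 32 GB": "0:28:32",
}

baseline = parse_runtime(IMPORT_RUNTIMES["2 CPUs, 16 GB"])
changes = {
    config: round(percent_change(baseline, parse_runtime(runtime)), 1)
    for config, runtime in IMPORT_RUNTIMES.items()
}

for config, change in changes.items():
    # Positive values mean the run was slower than the baseline.
    print(f"{config}: {change:+.1f}% vs. 2 CPUs, 16 GB")
```

Both larger configurations come out slightly slower than the baseline on the import test, which fits the reading that CPU count and memory are not the limiting factor here.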

Also ran the segmentation test (4 CPUs, 32 GB, lines) a second time on a fresh set of images, in case runtime differed between documents that had already been segmented and those that had not. The runtime was exactly the same.

Next step: look at the Celery worker configuration first and see whether changes there make a difference.
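
As a starting point for that, here is a sketch of the worker settings that seem most worth varying, written in the style of a Celery configuration module. `worker_concurrency` and `worker_prefetch_multiplier` are real Celery settings; the values below and the assumption that this deployment reads its settings from such a module are not confirmed and would need checking against the actual setup. If concurrency is currently pinned to a fixed number (for example via the worker's `--concurrency` option), that could be one reason the extra CPUs made no difference, but this is only a guess.

```python
import os

# Number of worker processes. Celery defaults to the number of CPUs it
# detects, so this only matters if the deployment overrides it somewhere.
# (Assumption: one process per available CPU is a sensible first test.)
cpu_count = os.cpu_count() or 1
worker_concurrency = cpu_count

# How many tasks each worker process reserves in advance (Celery's default
# is 4). For long-running tasks such as segmentation, a value of 1 keeps
# one process from holding tasks that an idle process could pick up.
worker_prefetch_multiplier = 1
```

Rerunning the same Selenium tests after changing one setting at a time would show whether either of these affects the runtimes.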
