Low-memory support #13

Open
2 tasks
bittremieux opened this issue Mar 15, 2021 · 0 comments


Running ANN-SoLo can lead to excessive memory requirements:

  • The candidate mask takes up O(num_candidates x num_library_spectra) memory. For the default batch size of 16,384 and a spectral library of 4 million spectra, this requires more than 8 GB even in the best case of 1-bit booleans (16,384 x 4,000,000 bits ≈ 8.2 GB). The ANN mask duplicates this memory requirement. A potential solution would be to also iterate over batches of library candidates.

  • The ANN index needs to fit into GPU memory, which is problematic for large spectral libraries or low-memory GPUs. A potential solution is to shard the index, with the additional benefit that the shards can be processed on multiple GPUs.
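
To make the candidate-mask arithmetic concrete, and to illustrate what iterating over batches of library candidates could look like, here is a minimal sketch. It is not ANN-SoLo's code: the function names `mask_bytes` and `iter_candidate_chunks` are hypothetical, and the sketch assumes that candidates are selected by a precursor m/z tolerance window over a library sorted by precursor m/z.

```python
import bisect


def mask_bytes(num_queries, num_library, bits_per_entry=1):
    """Bytes needed for a dense num_queries x num_library boolean mask."""
    return num_queries * num_library * bits_per_entry // 8


def iter_candidate_chunks(query_mzs, library_mzs, tol, chunk_size):
    """Yield (start, mask) for one library chunk at a time.

    Assumes library_mzs is sorted in ascending order (hypothetical setup).
    mask[i][j] is True if library spectrum start + j lies within
    +/- tol of query i, so only a num_queries x chunk_size mask is
    held in memory at any time.
    """
    for start in range(0, len(library_mzs), chunk_size):
        chunk = library_mzs[start:start + chunk_size]
        mask = []
        for mz in query_mzs:
            # Binary search for the window boundaries inside this chunk.
            lo = bisect.bisect_left(chunk, mz - tol)
            hi = bisect.bisect_right(chunk, mz + tol)
            mask.append([lo <= j < hi for j in range(len(chunk))])
        yield start, mask


# Dense mask for the numbers in this issue, at 1 bit and at 8 bits per entry.
print(mask_bytes(16_384, 4_000_000))       # 8192000000
print(mask_bytes(16_384, 4_000_000, 8))    # 65536000000
```

With one byte per boolean instead of one bit, the same mask would need roughly 65.5 GB, which is why the 8 GB figure is the best case. In the chunked version, peak mask memory scales with `chunk_size` rather than with the full library size, at the cost of an outer loop over library chunks.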
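
The sharding idea for the ANN index can be sketched in the same spirit. The example below uses a brute-force dot-product search as a stand-in for a real ANN index, and the names `search_shard` and `search_sharded` are hypothetical. It only shows the general pattern: query each shard separately, keep the top k hits per shard, then merge.

```python
import heapq


def search_shard(shard, offset, query, k):
    """Top-k (score, global_id) pairs in one shard by dot product.

    Brute-force stand-in for querying one ANN index shard.
    """
    scored = (
        (sum(q * v for q, v in zip(query, vec)), offset + i)
        for i, vec in enumerate(shard)
    )
    return heapq.nlargest(k, scored)


def search_sharded(vectors, query, k, shard_size):
    """Search shard by shard and merge the per-shard top-k results."""
    partial = []
    for offset in range(0, len(vectors), shard_size):
        # Only this slice would need to be resident in (GPU) memory.
        shard = vectors[offset:offset + shard_size]
        partial.extend(search_shard(shard, offset, query, k))
    return heapq.nlargest(k, partial)


library = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.5, 0.5], [0.8, 0.2]]
hits = search_sharded(library, [1.0, 0.0], k=2, shard_size=2)
print([idx for _, idx in hits])  # [0, 2]
```

Because each shard is searched independently, the loop body could in principle be dispatched to different GPUs. With an exact per-shard search the merged result equals the unsharded top k; with an approximate index the merged result is only as good as the per-shard results.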

bittremieux self-assigned this Mar 15, 2021
bittremieux added this to the 0.4 milestone Mar 15, 2021
bittremieux removed their assignment Oct 7, 2022
bittremieux removed this from the 0.4 milestone Oct 7, 2022