Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: use HashIntoType and custom hashing instead of u64 #87

Open
wants to merge 17 commits into
base: parallel_consume
Choose a base branch
from

Conversation

ctb
Copy link
Contributor

@ctb ctb commented Nov 1, 2024

Brief description:

  • implement threaded consume that chunks sequences into some num chunks, each of which is processed individually, and then combined at the end;
  • add criterion benchmarking;
  • build new k-mer hash type HashIntoType and provide custom (no-op) hashing for HashIntoType;
  • pre-allocate tables with default capacity (currently 100_000);
  • replace String with &str where possible, to avoid copying/cloning;

notes for later:

fix lib when doing cargo bench:

ln -s ~/miniforge3/envs/py311/lib/libpython3.11.dylib target/release/deps/

relevant pyo3 issue:

PyO3/pyo3#4155

TODO notes:

  • consumed calculation/validation?
  • custom hashing fn for kmers?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant