-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question: Migrating from Khmer #88
Comments
Hi David, Atm there are three ways to manually set the counts in a
Oxli supports serialisation of
from oxli import KmerCountTable
# Create new count table
kct = KmerCountTable(ksize=4) # Note: Use "store_kmers=True" only if you need to retrieve a list of all kmers in table. This option slows counting.
# Manually add new kmer and set count
kct['GGGG'] = 1000
# Only canonical kmer is stored
kct.get('GGGG')
>>> 1000
kct.get('CCCC')
>> 1000
This might be useful if you only have hashes for canonical kmers stored.
I could add support for bulk kmer + count upload from a tab delimited file if that would be useful. See #77. Do you need to store kmers and their reverse complement separately? Oxli currently stores counts under the canonical kmer. |
@ctb might be worth adding a khmer migration tutorial to the wiki. lmk what you think is the most efficient way to do this. |
@Adamtaranto thank you for the info- I believe the Khmer uses an old version of oxli right? Will the hash function be the same? |
I believe the hashing should be the same, mumurhash64 in both cases. @ctb? |
Hi,
My team is currently using Khmer and we want to upgrade to Oxli. We currently use khmer's Nodetable to store binary present/not-present kmer sets. We have a few large precomputed nodetables.
My questions:
Thank you for your help!
The text was updated successfully, but these errors were encountered: