Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HNSW for biology #23

Open
jianshu93 opened this issue Aug 13, 2021 · 1 comment
Open

HNSW for biology #23

jianshu93 opened this issue Aug 13, 2021 · 1 comment

Comments

@jianshu93
Copy link

Dear HNSW author,

This is Jianshu, a bioinformatics phd student at Georgia Tech. I am writing to you to ask your interest to form a new collaboration. Specifically, applying HNSW into genome classification problems, that is to find the closest genome in a big genome database to see the close related ones in the database so that enivronmental microbiologist can tell what taxonomy the query genome is. This will have a very big impact on the field and will definitely have a lot of citations. I want to completely rely on rust for this project without using any other language considering the advantages compared to C++ and python. I am not an expert in rust but have been using it for 2 years. The biology needed and taxonomy related information will be my strong part. I also know all the classification software in this field and have benign using/modifying them for my master and half of my Ph.D I am confident that HNSW in rust will greatly improved the speed of genome classifiers. Hopefully we can come up with a paper in the end. My email is [email protected]

Let me know if you are interest and if something I mentioned above is not clear.

Many thanks,

Jianshu

@vadixidav
Copy link
Member

Hey @jianshu93. Typically we communicate over the Rust CV Discord (https://discord.gg/d32jaam). If you still want to use email, I can send one to you, but I would prefer to collaborate on there. You can also find my email on my GitHub profile as well.

I also recommend taking a look at the hgg crate crate, which is a data structure I created more recently roughly based on the HNSW design which is giving me better performance for binary features using hamming distance in computer vision.

Let me know how you want to get in contact and I would love to see how I can help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants