Why does the recall rate drop significantly after normalization? #570
Comments
Hi @shaozhixue, whether or not to normalize your vectors depends on your application and vector representations. I don't have all the details, but the optimal parameters for HNSW depend on the data. If you are using the same data for both brute force and HNSW and the recall still drops, it is probably worth increasing
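To make the "same data for brute force and HNSW" check concrete, here is a minimal sketch of how recall can be computed from two sets of neighbor labels. The function name `recall_at_k` and the toy label lists are hypothetical; in practice the two lists would come from an approximate index and an exact search that were both built from the same vectors (both normalized, or both left as-is).

```python
def recall_at_k(approx_labels, exact_labels):
    """Fraction of exact top-k neighbors that the approximate search also returned.

    Both arguments are lists of per-query label lists. This is an illustrative
    helper, not part of any library.
    """
    hits = 0
    total = 0
    for approx, exact in zip(approx_labels, exact_labels):
        hits += len(set(approx) & set(exact))  # order within the top-k is ignored
        total += len(exact)
    return hits / total if total else 0.0


# Hypothetical labels for two queries with k = 3.
exact = [[0, 1, 2], [3, 4, 5]]
approx = [[0, 2, 9], [3, 4, 5]]  # first query misses label 1

recall = recall_at_k(approx, exact)  # 5 of the 6 exact neighbors were found
```

If the ground truth is computed on unnormalized vectors while the approximate index holds normalized ones (or the other way round), this number can drop even when the index itself is working correctly.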
Hi yurymalkov,
import os
class RandomSelfTestCase(unittest.TestCase):
if __name__ == '__main__':
Problem: After removing the normalization part of the code, the recall rate reaches around 96%.
Hello,
When testing the recall rate with the bindings_test_recall.py script, I found that recall drops significantly after normalizing the vectors. The metric used is inner product. In this case, do the vectors need to be normalized? Why does the recall rate drop significantly after normalization?
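As background for the question, a small pure-Python sketch shows that normalization changes which item is the nearest neighbor under inner product: on raw vectors a large norm can dominate, while on unit vectors inner product equals cosine similarity. The helper names and the toy data are hypothetical and are not taken from bindings_test_recall.py; the sketch only illustrates that the exact ground truth differs between the two setups, and does not by itself explain the recall drop reported here.

```python
import math


def normalize(v):
    """Scale v to unit L2 norm; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def inner_product(a, b):
    return sum(x * y for x, y in zip(a, b))


def top_k_by_inner_product(query, data, k):
    """Exact (brute-force) indices of the k items with the largest inner product."""
    order = sorted(range(len(data)),
                   key=lambda i: inner_product(query, data[i]),
                   reverse=True)
    return order[:k]


# Toy data (hypothetical): item 0 has a large norm, item 1 points the same way as the query.
data = [[10.0, 0.0], [1.0, 1.0], [0.0, 3.0]]
query = [1.0, 1.0]

# On raw vectors the large-norm item wins.
raw_top = top_k_by_inner_product(query, data, 1)

# On unit vectors the item with the smallest angle to the query wins.
unit_data = [normalize(v) for v in data]
unit_top = top_k_by_inner_product(normalize(query), unit_data, 1)
```

Here `raw_top` and `unit_top` point at different items, so a recall measurement is only meaningful when the exact search and the approximate index see identically preprocessed vectors.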
code snippet: