Zilliz streaming solution #268
30ed3df to 82244bb
82244bb to fb97191
The code installs without issues. Results on Azure:
Note that the first three runs had issues and did not get good recall. The log file is too large to post here, so here is a sample of the errors (taken from the Ls=450 run):
@ingberam Ok. I'll try to fix it.
@ingberam Code updated, can you run it again?
OK, I had to run the 4 parameter sets separately, but the results look good (even a tiny bit better than your results):
Please ack, and fix the conflicts so that we can merge.
expected results:
Our solution is based on a Vamana graph and SQ8 quantization. The main search runs on high-performance 8-bit integer computation. To recover recall, a floating-point refine step is used. To stay within the 8 GB memory limit, we refine with fp16 instead of fp32.
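For readers unfamiliar with the quantize-then-refine idea, here is a minimal NumPy sketch of it. It is not the code in this PR: a brute-force scan over the 8-bit codes stands in for the Vamana graph traversal, the quantizer uses a single global range (an assumption made so that integer distances stay proportional to L2), and the names `sq8_train`, `sq8_encode`, `search`, and `n_candidates` are hypothetical.

```python
import numpy as np

def sq8_train(x):
    # Assumption: one global [lo, hi] range for all dimensions.
    lo = float(x.min())
    scale = (float(x.max()) - lo) / 255.0
    return lo, (scale if scale > 0 else 1.0)

def sq8_encode(x, lo, scale):
    # Map floats to 8-bit codes in [0, 255]; out-of-range values are clipped.
    return np.clip(np.rint((x - lo) / scale), 0, 255).astype(np.uint8)

def search(query, codes, vecs_fp16, lo, scale, k=10, n_candidates=100):
    # Coarse stage: squared L2 in code space, using integer arithmetic only.
    # (A brute-force scan replaces the graph search here.)
    q_code = sq8_encode(query[None, :], lo, scale)[0].astype(np.int32)
    diff = codes.astype(np.int32) - q_code
    coarse = (diff * diff).sum(axis=1)
    n_candidates = min(n_candidates, len(codes))
    cand = np.argpartition(coarse, n_candidates - 1)[:n_candidates]
    # Refine stage: re-rank the candidates with the fp16 copies of the vectors.
    refined = vecs_fp16[cand].astype(np.float32) - query
    exact = (refined * refined).sum(axis=1)
    return cand[np.argsort(exact)[:k]]
```

In this sketch the fp16 array is what the refine step reads, so keeping it instead of an fp32 copy halves the memory spent on full-precision vectors, at the cost of a small rounding error in the re-ranked distances.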