Question about usage of embed instead of query_embed in query method in QdrantFastembedMixin #796

Open
martinkozle opened this issue Sep 30, 2024 · 0 comments

Comments

@martinkozle

sparse_vector = list(sparse_embedding_model_inst.embed(documents=query_text))[0]

Hello, I have a question.

For the Fastembed mixin, why is embed used for sparse embeddings instead of query_embed? From what I read, for bm25 specifically at least, query_embed doesn't use term frequency and document length, but instead sets a uniform weight of 1.0 for each token. What is the effect of that difference here? Is query_embed for sparse embeddings not meant to be used with Qdrant?
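
To make the distinction concrete, here is a minimal plain-Python sketch of the two weighting styles described above. It does not call fastembed or qdrant-client, and it is not their actual implementation: the function names are hypothetical, tokenization is a naive whitespace split, and the k, b and avg_len values are assumed for illustration only.

```python
from collections import Counter


def doc_style_weights(text, k=1.2, b=0.75, avg_len=256.0):
    """Hypothetical stand-in for document-side BM25 weighting.

    Each token's weight depends on its term frequency and on the text
    length relative to an assumed average length (k, b, avg_len are
    illustrative values, not taken from any library).
    """
    tokens = text.lower().split()
    counts = Counter(tokens)
    # Length normalization term of the BM25 term-frequency component.
    norm = k * (1.0 - b + b * len(tokens) / avg_len)
    return {tok: tf * (k + 1.0) / (tf + norm) for tok, tf in counts.items()}


def query_style_weights(text):
    """Hypothetical stand-in for query-side weighting: 1.0 per distinct token."""
    return {tok: 1.0 for tok in set(text.lower().split())}


query_text = "qdrant sparse sparse vectors"
as_document = doc_style_weights(query_text)
as_query = query_style_weights(query_text)

# Document-style weights vary with term frequency; query-style weights do not.
for token in sorted(as_query):
    print(token, round(as_document[token], 3), as_query[token])
```

In this sketch the repeated token gets a larger weight than the others under document-style weighting, and no token ends up with a weight of exactly 1.0, whereas the query-style weights are all 1.0. How much that changes the final ranking in Qdrant is the open question here.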
