Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[similarity] use a vector db for embeddings #294

Open
marcorosa opened this issue Feb 15, 2024 · 1 comment
Open

[similarity] use a vector db for embeddings #294

marcorosa opened this issue Feb 15, 2024 · 1 comment
Labels
proposal Ask/propose a new feature

Comments

@marcorosa
Copy link
Member

Postgres now offers an extension to store vectors (pgvector).
We could leverage it to store embeddings for our similarity feature, because that's what vector dbs can do best.

Why pgvector? what about another vector db?
We already have Postgres in place, so it would be reasonable not to add another component (our stack is already complicated). Nevertheless, pgvector would require the installation of postgres for all users, also for those fostering sqlite.
So, we have 2 options here: (i) either we integrate vectordb capabilities only for PgClient users (leaving SqliteClient users storing the embeddings as text in sqlite), or (ii) we add a local vector db (like chromadb or FAISS)

Note to myself: option (i) is the more conservative choice, and it could be the starting point

@marcorosa marcorosa added the proposal Ask/propose a new feature label Feb 15, 2024
@marcorosa
Copy link
Member Author

Does this issue require #246 to be resolved first?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
proposal Ask/propose a new feature
Projects
None yet
Development

No branches or pull requests

1 participant