A package for visualising vector embedding collections as part of the Chroma vector database.
Uses Flask, Vite, and react-three-fiber to host a live 3D view of the data in a web browser, and should perform well up to 10k+ documents. Dimensionality reduction is performed using PCA for colors down to 50 dimensions, followed by t-SNE down to 3.
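The two-stage reduction described above can be sketched roughly as follows. This is not chromaviz's own code: it assumes scikit-learn and NumPy are installed, and the function name `reduce_to_3d` and its parameters are hypothetical, chosen only for illustration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE


def reduce_to_3d(embeddings, pca_dims=50, seed=0):
    """Reduce (n_samples, n_features) embeddings to 3D: PCA first, then t-SNE.

    Hypothetical helper, not part of chromaviz.
    """
    X = np.asarray(embeddings, dtype=float)
    n_samples, n_features = X.shape

    # PCA cannot produce more components than min(n_samples, n_features).
    n_pca = min(pca_dims, n_samples, n_features)
    X_pca = PCA(n_components=n_pca, random_state=seed).fit_transform(X)

    # t-SNE requires perplexity to be smaller than the number of samples.
    perplexity = min(30.0, max(1.0, n_samples - 1.0))
    tsne = TSNE(n_components=3, perplexity=perplexity, random_state=seed)
    return tsne.fit_transform(X_pca)
```

Running PCA first keeps the t-SNE step cheaper on high-dimensional embeddings, which is probably why the two are chained in this order.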
Install from PyPI:
pip install chromaviz
or directly from GitHub:
pip install git+https://github.com/mtybadger/chromaviz/
After installing from pip, simply call visualize_collection with a valid ChromaDB collection, and chromaviz will do the rest.
from chromaviz import visualize_collection
visualize_collection(chromadb.Collection)
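For a fuller picture, here is a minimal end-to-end sketch that builds a small in-memory collection and passes it to visualize_collection. The collection name `chromaviz_demo`, the helper `make_demo_records`, and the random embeddings are all made up for illustration; real embeddings would come from your own model.

```python
import random


def make_demo_records(n=200, dim=32, seed=0):
    """Return (ids, documents, embeddings) filled with random demo data."""
    rng = random.Random(seed)
    ids = [f"doc-{i}" for i in range(n)]
    documents = [f"Demo document number {i}" for i in range(n)]
    embeddings = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]
    return ids, documents, embeddings


def main():
    # Imported here so the helper above stays usable without these packages.
    import chromadb
    from chromaviz import visualize_collection

    client = chromadb.Client()  # in-memory client, nothing is persisted
    collection = client.create_collection(name="chromaviz_demo")  # hypothetical name

    ids, documents, embeddings = make_demo_records()
    collection.add(ids=ids, documents=documents, embeddings=embeddings)

    # Hand the populated collection to chromaviz.
    visualize_collection(collection)
```

Calling `main()` populates the collection and then hands it over to chromaviz.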
It also works with LangChain + Chroma, as in:
from langchain.vectorstores import Chroma
vectordb = Chroma.from_documents(data, embeddings, ids)
from chromaviz import visualize_collection
visualize_collection(vectordb._collection)
- More dimensional reduction options and flexibility
- Refactor extremely shoddy React code
- Improve UX