diff --git a/README.md b/README.md index ec240e4e..acf06192 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ Dive into building GenAI applications! This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects. - These are built using LanceDB, a free, open-source, serverless vectorDB that **requires no setup**. -- It **integrates into python data ecosystem** so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc. +- It **integrates into Python data ecosystem** so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc. - LanceDB has **native Typescript SDK** using which you can **run vector search** in serverless functions! @@ -15,10 +15,10 @@ Join our community for support - Discord --- -This repository is divided into 3 sections: +This repository is divided into 2 sections: - [Examples](#examples) - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! - [Applications](#projects--applications) - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools -- [Tutorials](#tutorials) - A curated list of tutorials, blogs, Colabs and courses to get you started with GenAI in greater depth. + ## Examples Applied examples that get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! @@ -27,46 +27,137 @@ Examples are available as: * **Python scripts** - for cases where you'd like directly to use the file or snippets to integrate in your application * **JS/TS scripts** - Some examples are written using lancedb's native js library! These script/snippets can also be directly integrated in your web applications. -If you're looking for in-depth tutorial-like examples, checkout the [tutorials](#tutorials) section! +The following examples are organized into different tables to make similar types of examples easily accessible. -| Example   | Notebook & Scripts   | Read The Blog!       
| -|-------- | ------------- | ------------- | -| | | | -| [Youtube transcript search bot](/examples/Youtube-Search-QA-Bot/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/Youtube-Search-QA-Bot/main.py) [![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/Youtube-Search-QA-Bot/index.js) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|| -| [Langchain: Code Docs QA bot](/examples/Code-Documentation-QA-Bot/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/Code-Documentation-QA-Bot/main.py) [![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/Code-Documentation-QA-Bot/index.js) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|| -| [Databricks DBRX Website Bot](./examples/databricks_DBRX_website_bot/) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/databricks_DBRX_website_bot/main.py) [![Databricks LLM](https://img.shields.io/badge/databricks-api-red)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| -| [CLI-based SDK Manual Chatbot with Phidata](/examples/CLI-SDK-Manual-Chatbot-Locally/) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/CLI-SDK-Manual-Chatbot-Locally/assistant.py) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| -| [TransformersJS Embedding example](./examples/js-transformers/) |[![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/js-transformers/index.js) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| | -| [Inbuilt Hybrid Search](/examples/Inbuilt-Hybrid-Search) |Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|| -| [Audio Search](./examples/audio_search/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/audio_search/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | -| [Multi-lingual search](/examples/multi-lingual-wiki-qa) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/multi-lingual-wiki-qa/main.py) [![LLM](https://img.shields.io/badge/cohere-api-pink)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | -| [Hybrid search BM25 & lancedb ](./examples/Hybrid_search_bm25_lancedb/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/hybrid-search-combining-bm25-and-semantic-search-for-better-results-with-lan-1358038fe7e6)| -| [Search Within 
Images](/examples/search-within-images-with-sam-and-clip/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/search-within-images-with-sam-and-clip/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/search-within-an-image-331b54e4285e)| -| [Accelerate Vector Search Applications Using OpenVINO](/examples/Accelerate-Vector-Search-Applications-Using-OpenVINO/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Accelerate-Vector-Search-Applications-Using-OpenVINO/clip_text_image_search.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/accelerate-vector-search-applications-using-openvino-lancedb/)| +### Build from Scratch + +Build applications/examples using LanceDB for efficient vector-based document retrieval. + +| Build from Scratch    | Interactive Notebook & Scripts   | +|-------- | -------------: | +||| +| [Build RAG from Scratch](./tutorials/RAG-from-Scratch) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/RAG-from-Scratch/RAG_from_Scratch.ipynb) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +| [Local RAG from Scratch with Llama3](./tutorials/Local-RAG-from-Scratch) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./tutorials/Local-RAG-from-Scratch/rag.py) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +|||| + +### MultiModal + +Create a multimodal search application using LanceDB for efficient vector-based retrieval of text and image data. Input text or image queries to find the most relevant documents and images from your corpus. 
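+A minimal sketch of the flow described above, not taken from any specific example in this table: it embeds a few images with a CLIP model and retrieves them with a text query. The model name, image paths, and table name are illustrative assumptions; see the linked examples for the full recipes.
+
+```python
+# Text-to-image search sketch: CLIP embeddings stored and queried in LanceDB.
+# Assumes: pip install lancedb sentence-transformers pillow, plus a few local images (placeholder paths).
+import lancedb
+from PIL import Image
+from sentence_transformers import SentenceTransformer
+
+clip = SentenceTransformer("clip-ViT-B-32")  # embeds images and text into the same vector space
+image_paths = ["cat.jpg", "beach.jpg"]       # placeholder image files
+
+db = lancedb.connect("./lancedb")
+table = db.create_table(
+    "images",
+    data=[{"path": p, "vector": clip.encode(Image.open(p)).tolist()} for p in image_paths],
+    mode="overwrite",
+)
+
+# A text query goes through the same model, so it can be matched against the image vectors.
+hits = table.search(clip.encode("a sleeping cat").tolist()).limit(1).to_list()
+print(hits[0]["path"])
+```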
+ +| Multimodal    | Interactive Notebook & Scripts   | Blog | +| --------- | -------------------------- | ----------- | +|||| | [Multimodal CLIP: DiffusionDB](/examples/multimodal_clip_diffusiondb/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/multimodal_clip_diffusiondb/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/multi-modal-ai-made-easy-with-lancedb-clip-5aaf8801c939/)| | [Multimodal CLIP: Youtube videos](/examples/multimodal_video_search/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/multimodal_video_search/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/multi-modal-ai-made-easy-with-lancedb-clip-5aaf8801c939/)| | [Multimodal Image + Text Search](/examples/multimodal_search/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/multimodal_search/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/multi-modal-ai-made-easy-with-lancedb-clip-5aaf8801c939/)| -| [Movie Recommender](/examples/movie-recommender/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/movie-recommender/main.py) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | -| [Product Recommender](./examples/product-recommender/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/product-recommender/main.py)[![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| | -| [Arxiv paper recommender](/examples/arxiv-recommender) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/arxiv-recommender/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +|||| + +### RAG + +Develop a Retrieval-Augmented Generation (RAG) application using LanceDB for efficient vector-based information retrieval. Input text queries to retrieve relevant documents and generate comprehensive answers by combining retrieved information. 
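+Before the table, here is a minimal sketch of that retrieve-then-generate loop. It assumes a LanceDB table with `text` and `vector` columns already exists and that `OPENAI_API_KEY` is set; the table name, model names, and prompt are illustrative, not taken from any specific example below.
+
+```python
+# RAG sketch: retrieve context from LanceDB, then ask an LLM to answer with it.
+# Assumes: pip install lancedb sentence-transformers openai
+import lancedb
+from openai import OpenAI
+from sentence_transformers import SentenceTransformer
+
+encoder = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder embedding model
+db = lancedb.connect("./lancedb")
+table = db.open_table("docs")  # placeholder table with "text" and "vector" columns
+
+question = "What is LanceDB?"
+hits = table.search(encoder.encode(question).tolist()).limit(3).to_list()
+context = "\n".join(h["text"] for h in hits)
+
+client = OpenAI()
+response = client.chat.completions.create(
+    model="gpt-3.5-turbo",
+    messages=[{"role": "user", "content": f"Answer using only this context:\n{context}\n\nQuestion: {question}"}],
+)
+print(response.choices[0].message.content)
+```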
+ +| RAG    | Interactive Notebook & Scripts | Blog | +| --------- | -------------------------- | ----------- | +|||| | [Improve RAG with Re-ranking](/examples/RAG_Reranking/) | Open In Colab [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/simplest-method-to-improve-rag-pipeline-re-ranking-cf6eaec6d544)| -| [Improve RAG with FLARE](/examples/Advanced-RAG-with-FLARE) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/Advanced-RAG-with-FLARE/app.py) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/better-rag-with-active-retrieval-augmented-generation-flare-3b66646e2a9f)| +| [Instruct-Multitask](./examples/instruct-multitask) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/instruct-multitask/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/multitask-embedding-with-lancedb-be18ec397543)| | [Improve RAG with HyDE](/examples/Advance-RAG-with-HyDE/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/advanced-rag-precise-zero-shot-dense-retrieval-with-hyde-0946c54dfdcb)| | [Improve RAG with LOTR ](/examples/Advance_RAG_LOTR/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/better-rag-with-lotr-lord-of-retriever-23c8336b9a35)| | [Advanced RAG: Parent Document Retriever](/examples/parent_document_retriever/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/modified-rag-parent-document-bigger-chunk-retriever-62b3d1e79bc6)| +| [Corrective RAG with Langgraph](./tutorials/Corrective-RAG-with_Langgraph/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Corrective-RAG-with_Langgraph/CRAG_with_Langgraph.ipynb) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/implementing-corrective-rag-in-the-easiest-way-2/)| +| [Contextual-Compression-with-RAG](/examples/Contextual-Compression-with-RAG/) | [![Open In 
Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Contextual-Compression-with-RAG/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/enhance-rag-integrate-contextual-compression-and-filtering-for-precision-a29d4a810301/) | +| [Improve RAG with FLARE](./examples/better-rag-FLAIR) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/better-rag-FLAIR/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/better-rag-with-active-retrieval-augmented-generation-flare-3b66646e2a9f/) | | [Query Expansion and Reranker ](/examples/QueryExpansion&Reranker/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/improving-rag-with-query-expansion-reranking-models/)| | [RAG Fusion](/examples/RAG_Fusion/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| -| [Contextual-Compression-with-RAG](/examples/Contextual-Compression-with-RAG/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Contextual-Compression-with-RAG/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/enhance-rag-integrate-contextual-compression-and-filtering-for-precision-a29d4a810301/) | -| [Instruct-Multitask](./examples/instruct-multitask) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/instruct-multitask/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/multitask-embedding-with-lancedb-be18ec397543)| +| [Agentic RAG ](/tutorials/Agentic_RAG/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| +|||| + +### Vector Search + +Build a vector search application using LanceDB for efficient vector-based document retrieval. Input text queries to find the most relevant documents from your corpus. 
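+The sketch below shows the basic embed-store-query flow with placeholder data and model names; it is a minimal illustration under those assumptions, not a particular recipe from this table.
+
+```python
+# Vector search sketch: embed a tiny corpus, store it in LanceDB, query by meaning.
+# Assumes: pip install lancedb sentence-transformers
+import lancedb
+from sentence_transformers import SentenceTransformer
+
+model = SentenceTransformer("all-MiniLM-L6-v2")  # placeholder embedding model
+docs = [
+    "LanceDB is a serverless vector database.",
+    "Hybrid search combines BM25 with semantic search.",
+]
+
+db = lancedb.connect("./lancedb")
+table = db.create_table(
+    "docs",
+    data=[{"text": t, "vector": model.encode(t).tolist()} for t in docs],
+    mode="overwrite",
+)
+
+# The query is embedded with the same model and matched by vector similarity.
+results = table.search(model.encode("what is lancedb?").tolist()).limit(2).to_list()
+print([r["text"] for r in results])
+```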
+ +| Vector Search    | Interactive Notebook & Scripts   | Blog | +| --------- | -------------------------- | ----------- | +|||| +| [Inbuilt Hybrid Search](/examples/Inbuilt-Hybrid-Search) |Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|| +| [Hybrid search BM25 & lancedb ](./examples/Hybrid_search_bm25_lancedb/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#) |[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/hybrid-search-combining-bm25-and-semantic-search-for-better-results-with-lan-1358038fe7e6)| +| [NER powered Semantic Search](./tutorials/NER-powered-Semantic-Search) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/NER-powered-Semantic-Search/NER_powered_Semantic_Search_with_LanceDB.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/ner-powered-semantic-search-using-lancedb-51051dc3e493) | +| [Audio Search](./examples/audio_search/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/audio_search/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +| [Multi-lingual search](/examples/multi-lingual-wiki-qa) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/multi-lingual-wiki-qa/main.py) [![LLM](https://img.shields.io/badge/cohere-api-pink)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +| [Facial Recognition](./examples/facial_recognition) | Open In Colab [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| +[Sentiment Analysis : Analysing Hotel Reviews](/examples/Sentiment-Analysis-Analyse-Hotel-Reviews/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Sentiment-Analysis-Analyse-Hotel-Reviews/Sentiment_Analysis_using_LanceDB.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/sentiment-analysis-using-lancedb-2da3cb1e3fa6)| +| [Imagebind demo app](./examples/imagebind_demo/) | hf spaces [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| +| [Search Within Images](/examples/search-within-images-with-sam-and-clip/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/search-within-images-with-sam-and-clip/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| 
[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/search-within-an-image-331b54e4285e)| +| [Vector Search with TransformersJS](./examples/js-transformers/) |[![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/js-transformers/index.js) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| | +| [Accelerate Vector Search Applications Using OpenVINO](/examples/Accelerate-Vector-Search-Applications-Using-OpenVINO/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Accelerate-Vector-Search-Applications-Using-OpenVINO/clip_text_image_search.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/accelerate-vector-search-applications-using-openvino-lancedb/)| +|||| + +### Chatbot + +Create a chatbot application using LanceDB for efficient vector-based response generation. Input user queries to retrieve relevant context and generate coherent, context-aware replies. + +| Chatbot    | Interactive Notebook & Scripts   | Blog  | +| --------- | -------------------------- | ----------- | +|||| +| [Databricks DBRX Website Bot](./examples/databricks_DBRX_website_bot/) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/databricks_DBRX_website_bot/main.py) [![Databricks LLM](https://img.shields.io/badge/databricks-api-red)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| +| [CLI-based SDK Manual Chatbot with Phidata](/examples/CLI-SDK-Manual-Chatbot-Locally/) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/CLI-SDK-Manual-Chatbot-Locally/assistant.py) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| +| [Youtube transcript search bot](/examples/Youtube-Search-QA-Bot/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/Youtube-Search-QA-Bot/main.py) [![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/Youtube-Search-QA-Bot/index.js) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|| +| [Langchain: Code Docs QA bot](/examples/Code-Documentation-QA-Bot/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/Code-Documentation-QA-Bot/main.py) [![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/Code-Documentation-QA-Bot/index.js) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|| +| [Context-Aware Chatbot using Llama 2 & LanceDB](./tutorials/chatbot_using_Llama2_&_lanceDB) | [![Open In 
Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/chatbot_using_Llama2_&_lanceDB/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/context-aware-chatbot-using-llama-2-lancedb-as-vector-database-4d771d95c755) | +|||| + + +### Evaluation + +Develop an evaluation application. Input reference and candidate texts to measure their performance on various metrics. + +| Evaluation    | Interactive Notebook & Scripts   | Blog | +| --------- | -------------------------- | ----------- | +|||| | [Evaluating Prompts with Prompttools](/examples/prompttools-eval-prompts/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| | +| [Evaluating RAG with RAGAs](./examples/Evaluating_RAG_with_RAGAs/) | Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| | +|||| + +### AI Agents + +Design an AI agents coordination application with LanceDB for efficient vector-based communication and collaboration. Input queries to enable AI agents to exchange information, coordinate tasks, and achieve shared goals effectively. + +| AI Agents    | Interactive Notebook & Scripts   | Blog | +| --------- | -------------------------- | ----------- | +|||| | [AI Agents: Reducing Hallucination](/examples/reducing_hallucinations_ai_agents/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/reducing_hallucinations_ai_agents/main.py) [![JS](https://img.shields.io/badge/javascript-%23323330.svg?style=for-the-badge&logo=javascript&logoColor=%23F7DF1E)](./examples/reducing_hallucinations_ai_agents/index.js) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#) |[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/how-to-reduce-hallucinations-from-llm-powered-agents-using-long-term-memory-72f262c3cc1f/)| | [AI Trends Searcher with CrewAI](./examples/AI-Trends-with-CrewAI/) |Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/track-ai-trends-crewai-agents-rag/)| | [SuperAgent Autogen](/examples/SuperAgent_Autogen) |Open In Colab [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)|| -[Sentiment Analysis : Analysing Hotel Reviews](/examples/Sentiment-Analysis-Analyse-Hotel-Reviews/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/examples/Sentiment-Analysis-Analyse-Hotel-Reviews/Sentiment_Analysis_using_LanceDB.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| 
[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/sentiment-analysis-using-lancedb-2da3cb1e3fa6)| -| [Facial Recognition](./examples/facial_recognition) | Open In Colab [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| -| [Imagebind demo app](/examples/imagebind_demo/) | hf spaces [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| +|||| + +### Recommender Systems + +Create a recommender system application with LanceDB for efficient vector-based item recommendation. Input user preferences or item features to generate personalized recommendations and enhance user experience. + +| Recommender Systems | Interactive Notebook & Scripts   | Blog | +| --------- | -------------------------- | ----------- | +|||| +| [Movie Recommender](/examples/movie-recommender/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/movie-recommender/main.py) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +| [Movie Recommender with Genre](./examples/movie-recommendation-with-genres/) | Open In Colab [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/movie-recommendation-system-using-lancedb-and-doc2vec/)| +| [Product Recommender](./examples/product-recommender/) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/product-recommender/main.py)[![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| | +| [Arxiv paper recommender](/examples/arxiv-recommender) | Open In Colab [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./examples/arxiv-recommender/main.py) [![LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +|||| +### Concepts +Check out concepts of the LLM application pipeline that help ensure accurate information retrieval. 
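+As a quick taste of the first concept below (text chunking), here is a minimal sketch using the same `RecursiveCharacterTextSplitter` that appears in this repo's notebooks; the sample text and chunk sizes are placeholders.
+
+```python
+# Text-chunking sketch: split a long document into overlapping chunks before embedding.
+# Assumes: pip install langchain
+from langchain.text_splitter import RecursiveCharacterTextSplitter
+
+long_text = "LanceDB is a serverless vector database. " * 200  # placeholder document
+
+splitter = RecursiveCharacterTextSplitter(
+    chunk_size=1000,    # maximum characters per chunk (placeholder value)
+    chunk_overlap=200,  # overlap preserves context across chunk boundaries
+)
+chunks = splitter.split_text(long_text)
+print(len(chunks), len(chunks[0]))
+```
+
+Overlapping chunks trade a little extra storage for better recall: a sentence split at a boundary still appears whole in at least one chunk.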
+ +| Concepts | Interactive Notebook | Blog | +| --------- | -------------------------- | ----------- | +| | | | +| [A Primer on Text Chunking and its Types](./tutorials/different-types-text-chunking-in-RAG) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/different-types-text-chunking-in-RAG/Text_Chunking_on_RAG_application_with_LanceDB.ipynb) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/a-primer-on-text-chunking-and-its-types-a420efc96a13) | +| [Langchain LlamaIndex Chunking](./tutorials/Langchain-LlamaIndex-Chunking) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Langchain-LlamaIndex-Chunking/Langchain_Llamaindex_chunking.ipynb) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/chunking-techniques-with-langchain-and-llamaindex/) | +| [Comparing Cohere Rerankers with LanceDB](./tutorials/cohere-reranker) | [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/benchmarking-cohere-reranker-with-lancedb/) | +| [Product Quantization: Compress High Dimensional Vectors](https://blog.lancedb.com/benchmarking-lancedb-92b01032874a-2/) |[![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#) | [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/benchmarking-lancedb-92b01032874a-2/) | +| [LLMs, RAG, & the missing storage layer for AI](https://blog.lancedb.com/llms-rag-the-missing-storage-layer-for-ai-28ded35fa984) | [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/llms-rag-the-missing-storage-layer-for-ai-28ded35fa984/) | +| [Fine-Tuning LLM using PEFT & QLoRA](./tutorials/fine-tuning_LLM_with_PEFT_QLoRA) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/fine-tuning_LLM_with_PEFT_QLoRA/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/optimizing-llms-a-step-by-step-guide-to-fine-tuning-with-peft-and-qlora-22eddd13d25b) | +| [Extracting Complex tables-text from PDFs using LlamaParse ](./tutorials/Advace_RAG_LlamaParser) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Advace_RAG_LlamaParser/main.ipynb) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![LlamaCloud](https://img.shields.io/badge/Llama-api-pink)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | +|||| ## Projects & Applications These are ready to use applications built using LanceDB 
serverless vector database. You can explore these open source projects, use parts of them in your projects or build your applications on top of these. @@ -89,27 +180,7 @@ These are ready to use applications built using LanceDB serverless vector databa | [ Fastapi RAG template ](https://github.com/lancedb/vectordb-recipes/tree/main/applications/Chatbot_RAG_with_FASTAPI) | FastAPI based RAG template with Websocket support | ![image](./assets/chatbot_fastapi.png)| | [ GTE MLX RAG ](https://github.com/lancedb/vectordb-recipes/tree/main/applications/GTE_mlx_RAG) | mlx based RAG model using lancedb api support | ![image](./assets/rag-mlx.png)| | [ Healthcare Chatbot ](https://github.com/lancedb/vectordb-recipes/tree/main/applications/Healthcare_chatbot/) | Healthcare chatbot using domain specific LLM & Embedding model | ![image](./assets/chatbot_medical.png)| - - - -## Tutorials -Looking to get started with LLMs, vectorDBs, and the world of Generative AI? These in-depth tutorials and courses cover these concepts with practical follow along colabs where possible. -| Tutorial | Interactive Environment | Blog Link | -| --------- | -------------------------- | ----------- | -| | | | -| [Build RAG from Scratch](./tutorials/RAG-from-Scratch) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/RAG-from-Scratch/RAG_from_Scratch.ipynb) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | -| [Local RAG from Scratch with Llama3](./tutorials/Local-RAG-from-Scratch) | [![Python](https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54)](./tutorials/Local-RAG-from-Scratch/rag.py) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| | -| [A Primer on Text Chunking and its Types](./tutorials/different-types-text-chunking-in-RAG) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/different-types-text-chunking-in-RAG/Text_Chunking_on_RAG_application_with_LanceDB.ipynb) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/a-primer-on-text-chunking-and-its-types-a420efc96a13) | -| [Langchain LlamaIndex Chunking](./tutorials/Langchain-LlamaIndex-Chunking) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Langchain-LlamaIndex-Chunking/Langchain_Llamaindex_chunking.ipynb) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/chunking-techniques-with-langchain-and-llamaindex/) | -| [Comparing Cohere Rerankers with LanceDB](./tutorials/cohere-reranker) | [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/benchmarking-cohere-reranker-with-lancedb/) | -| [NER powered Semantic Search](./tutorials/NER-powered-Semantic-Search) | [![Open In 
Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/NER-powered-Semantic-Search/NER_powered_Semantic_Search_with_LanceDB.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![beginner](https://img.shields.io/badge/beginner-B5FF33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/ner-powered-semantic-search-using-lancedb-51051dc3e493) | -| [Product Quantization: Compress High Dimensional Vectors](https://blog.lancedb.com/benchmarking-lancedb-92b01032874a-2/) |[![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#) | [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/benchmarking-lancedb-92b01032874a-2/) | -| [Corrective RAG with Langgraph](./tutorials/Corrective-RAG-with_Langgraph/) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Corrective-RAG-with_Langgraph/CRAG_with_Langgraph.ipynb) [![LLM](https://img.shields.io/badge/openai-api-white)](#) [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/implementing-corrective-rag-in-the-easiest-way-2/)| -| [LLMs, RAG, & the missing storage layer for AI](https://blog.lancedb.com/llms-rag-the-missing-storage-layer-for-ai-28ded35fa984) | [![intermediate](https://img.shields.io/badge/intermediate-FFDA33)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/llms-rag-the-missing-storage-layer-for-ai-28ded35fa984/) | -| [Fine-Tuning LLM using PEFT & QLoRA](./tutorials/fine-tuning_LLM_with_PEFT_QLoRA) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/fine-tuning_LLM_with_PEFT_QLoRA/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/optimizing-llms-a-step-by-step-guide-to-fine-tuning-with-peft-and-qlora-22eddd13d25b) | -| [Context-Aware Chatbot using Llama 2 & LanceDB](./tutorials/chatbot_using_Llama2_&_lanceDB) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/chatbot_using_Llama2_&_lanceDB/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)| [![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/context-aware-chatbot-using-llama-2-lancedb-as-vector-database-4d771d95c755) | -| [Better RAG with FLARE](./tutorials/better-rag-FLAIR) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/better-rag-FLAIR/main.ipynb) [![local LLM](https://img.shields.io/badge/local-llm-green)](#) 
[![LLM](https://img.shields.io/badge/openai-api-white)](#) [![advanced](https://img.shields.io/badge/advanced-FF3333)](#)|[![Ghost](https://img.shields.io/badge/ghost-000?style=for-the-badge&logo=ghost&logoColor=%23F7DF1E)](https://blog.lancedb.com/better-rag-with-active-retrieval-augmented-generation-flare-3b66646e2a9f/) | - +|||| **🌟 New! 🌟 Applied GenAI and VectorDB course on Udacity** diff --git a/assets/imagebind-demo.png b/assets/imagebind-demo.png new file mode 100644 index 00000000..c1e28ae1 Binary files /dev/null and b/assets/imagebind-demo.png differ diff --git a/assets/movie-recommendation-with-genre.png b/assets/movie-recommendation-with-genre.png new file mode 100644 index 00000000..6b26c968 Binary files /dev/null and b/assets/movie-recommendation-with-genre.png differ diff --git a/assets/rag_evaluation_flow.png b/assets/rag_evaluation_flow.png new file mode 100644 index 00000000..4b74b6cb Binary files /dev/null and b/assets/rag_evaluation_flow.png differ diff --git a/assets/superagent-autogen.png b/assets/superagent-autogen.png index d41d2484..9908c624 100644 Binary files a/assets/superagent-autogen.png and b/assets/superagent-autogen.png differ diff --git a/examples/Code-Documentation-QA-Bot/lancedb_cloud/README.md b/examples/Code-Documentation-QA-Bot/lancedb_cloud/README.md new file mode 100644 index 00000000..88b6eca7 --- /dev/null +++ b/examples/Code-Documentation-QA-Bot/lancedb_cloud/README.md @@ -0,0 +1,33 @@ +# Code documentation Q&A bot example with LangChain + +![imgonline-com-ua-twotoone-RaRlTe66ft3RUvK](https://github.com/lancedb/vectordb-recipes/assets/15766192/4682b39d-62f4-4722-bc64-f45d45ec8a22) + + +This Q&A bot will allow you to query your own documentation easily using questions. We'll also demonstrate the use of LangChain and LanceDB Cloud using the OpenAI API. In this example we'll use the **Numpy 1.26** documentation, but this could be replaced with your own docs as well. +Colab walkthrough - Open In Colab + + +### Set credentials +If you would like to set the API key through an environment variable: +``` +export LANCEDB_API_KEY="sk_..." +``` +or +``` +import os +import getpass + +os.environ["LANCEDB_API_KEY"] = getpass.getpass("Enter Your LANCEDB API Key:") +``` + +Replace the following lines in main.py with your project slug and API key: +``` +db_url="db://your-project-slug-name" +api_key="sk_..." +region="us-east-1" +``` + +### Run the script +```bash +OPENAI_API_KEY=... python main.py --query "what is a vectordb?" +``` \ No newline at end of file diff --git a/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.ipynb b/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.ipynb new file mode 100644 index 00000000..5b40c699 --- /dev/null +++ b/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.ipynb @@ -0,0 +1,484 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "id": "13cb272e", + "metadata": {}, + "source": [ + "# Code documentation Q&A bot example with LangChain\n", + "![picture](https://lancedb.github.io/lancedb/assets/ecosystem-illustration.png)\n", + "\n", + "This Q&A bot will allow you to query your own documentation easily using questions. 
We'll also demonstrate the use of LangChain and LanceDB using the OpenAI API.\n", + "\n", + "In this example we'll **Numpy 1.26** documentation, but, this could be replaced for your own docs as well" + ] + }, + { + "cell_type": "markdown", + "id": "9a0e829a", + "metadata": { + "id": "wgPbKbpumkhH" + }, + "source": [ + "### Credentials\n", + "\n", + "Copy and paste the project name and the api key from your project page.\n", + "These will be used later to [connect to LanceDB Cloud](#scroll-to=5q8m6GMD7sGu)" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "id": "6553603f", + "metadata": { + "id": "rqEXT5-fmofw" + }, + "outputs": [], + "source": [ + "project_slug = \"your-project-slug\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "id": "36ef9c45", + "metadata": { + "id": "5LYmBomPmswi" + }, + "outputs": [], + "source": [ + "api_key = \"sk_...\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "markdown", + "id": "33ba6af1", + "metadata": { + "id": "Xs6tr6CMnBrr" + }, + "source": [ + "You can also set the LANCEDB_API_KEY as an environment variable. More details can be found **here**." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "Le27BWs2vDbB" + }, + "source": [ + "Since we will be using OPENAI API, let us set the OPENAI API KEY as well." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "-2-fyVPKu9fl" + }, + "outputs": [], + "source": [ + "openai_api_key = \"sk-...\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "markdown", + "id": "1991331f-4316-417a-b693-e2f27cbe9ea7", + "metadata": {}, + "source": [ + "### Installing dependencies" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "e8a49c31", + "metadata": {}, + "outputs": [], + "source": [ + "! pip install -U langchain langchain-openai langchain-community" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "66638d6c", + "metadata": { + "id": "QR9W53zStdlz" + }, + "outputs": [], + "source": [ + "! pip install -qq tiktoken unstructured pandas lancedb" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "0QQL4lm8lTzg" + }, + "source": [ + "### Importing libraries" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "vP6d6JUShgqo" + }, + "outputs": [], + "source": [ + "import openai\n", + "import os\n", + "import re\n", + "import pickle\n", + "import requests\n", + "import zipfile\n", + "from pathlib import Path\n", + "\n", + "from langchain.document_loaders import UnstructuredHTMLLoader\n", + "from langchain.text_splitter import RecursiveCharacterTextSplitter\n", + "from langchain.vectorstores import LanceDB\n", + "from langchain_openai import OpenAI, OpenAIEmbeddings\n", + "from langchain.chains import RetrievalQA\n", + "\n", + "os.environ[\"OPENAI_API_KEY\"] = openai_api_key\n", + "assert openai.models.list() is not None" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "8eKRYd2F7v5n" + }, + "source": [ + "### Get the data\n", + "To make this easier, we've downloaded Numpy documentation and stored the raw HTML files for you to download. Once the docs are downloaded, we then use LangChain's HTML document readers to parse them and store them in LanceDB as a vector store, along with relevant metadata.\n", + "By default we use numpy docs, but you can replace this with your own docs as well." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "l0ezDr7suAf_" + }, + "outputs": [], + "source": [ + "numpy_docs = requests.get(\"https://numpy.org/doc/1.26/numpy-html.zip\")\n", + "with open(\"numpy-html.zip\", \"wb\") as f:\n", + " f.write(numpy_docs.content)\n", + "\n", + "file = zipfile.ZipFile(\"numpy-html.zip\")\n", + "file = file.extractall(path=\"numpy_docs\")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "HJf8xZmX8VJC" + }, + "source": [ + "We'll create a simple **helper function** that can help to extract metadata, so it can used later when querying with filters. In this case, we want to keep the lineage of the uri or path for each document that has been processed:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "5aljyqpUiViE" + }, + "outputs": [], + "source": [ + "# Pre-processing and loading the documentation\n", + "\n", + "# Next, let's pre-process and load the documentation. To make sure we don't need to do this repeatedly if we were updating code,\n", + "# we're caching it using pickle so we can retrieve it again (this could take a few minutes to run the first time you do it).\n", + "# We'll also add some more metadata to the docs here such as the title and version of the code:\n", + "\n", + "\n", + "def get_document_title(document_list):\n", + " titles = []\n", + " for doc in document_list:\n", + " if \"metadata\" in doc and \"source\" in doc[\"metadata\"]:\n", + " m = str(doc[\"metadata\"][\"source\"])\n", + " title = re.findall(\"numpy_docs(.*).html\", m)\n", + " print(title)\n", + " if title:\n", + " titles.append(title[0])\n", + " else:\n", + " titles.append(\"\")\n", + " else:\n", + " titles.append(\"\")\n", + " return titles" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "PCufm9Xr8eWp" + }, + "source": [ + "### Pre-processing and loading the documents\n", + "\n", + "Next, let's pre-process and load the documents. To make sure we don't need to do this repeatedly while updating code, we're caching it using pickle so it can be retrieved again (this could take a few minutes to run the first time you do it). We'll also add extra metadata to the docs here such as the title and version of the code:\n", + "\n", + "*Note*: This step might take up to 10 minutes to run!\n", + "*Note*: If there is some issue with nltk package, kindly try using\n", + "```\n", + "import nltk\n", + "nltk.download('punkt')\n", + "```\n", + "or try to manually install the [nltk_data](https://github.com/nltk/nltk_data/tree/gh-pages) package and unzip the **punkt tokenizer** zip and the **averaged_perceptron_tagger** zip file in the packages folder." 
+ ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 443 + }, + "id": "964Z2sZA247g", + "outputId": "236df468-a630-4691-85a4-886835cfc02d" + }, + "outputs": [], + "source": [ + "from tqdm import tqdm\n", + "\n", + "docs = []\n", + "docs_path = Path(\"docs.pkl\")\n", + "for p in tqdm(Path(\"numpy_docs\").rglob(\"*.html\")):\n", + " if p.is_dir():\n", + " continue\n", + " loader = UnstructuredHTMLLoader(p)\n", + " raw_document = loader.load()\n", + " # docs.append(raw_document)\n", + " title = get_document_title(raw_document)\n", + " m = {\"title\": title}\n", + " if raw_document:\n", + " raw_document[0].metadata.update(m)\n", + " raw_document[0].metadata[\"source\"] = str(raw_document[0].metadata[\"source\"])\n", + " docs.extend(raw_document)\n", + "\n", + "\n", + "if docs:\n", + " with open(docs_path, \"wb\") as fh:\n", + " pickle.dump(docs, fh)\n", + "else:\n", + " with open(docs_path, \"rb\") as fh:\n", + " docs = pickle.load(fh)\n", + "\n", + "len(docs)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "cntAuaUU_TER" + }, + "source": [ + "### Generating emebeddings from our docs\n", + "\n", + "Now that we have our raw documents loaded, we need to pre-process them to generate embeddings:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "dHw2DSAj3u9B" + }, + "outputs": [], + "source": [ + "text_splitter = RecursiveCharacterTextSplitter(\n", + " chunk_size=1000,\n", + " chunk_overlap=200,\n", + ")\n", + "documents = text_splitter.split_documents(docs)\n", + "embeddings = OpenAIEmbeddings()" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "IiM4DJvC_2dV" + }, + "source": [ + "### Store data in LanceDB Cloud\n", + "\n", + "Let's connect to LanceDB so we can store our documents, It requires 0 setup !" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "GV77SSi-AK0v" + }, + "outputs": [], + "source": [ + "uri = \"db://\" + project_slug\n", + "table_name = \"langchain_vectorstore\"\n", + "\n", + "vectorstore = LanceDB(\n", + " embedding=embeddings,\n", + " uri=uri, # your remote database URI\n", + " api_key=api_key,\n", + " region=\"us-east-1\",\n", + " table_name=table_name, # Optional, defaults to \"vectors\"\n", + " mode=\"overwrite\", # Optional, defaults to \"overwrite\"\n", + ")\n", + "\n", + "doc_ids = vectorstore.add_documents(documents=documents)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "sZOUxfqzXr1m" + }, + "source": [ + "Now let's create our RetrievalQA chain using the LanceDB vector store:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "4nDltKClAhhU" + }, + "outputs": [], + "source": [ + "qa = RetrievalQA.from_chain_type(\n", + " llm=OpenAI(), chain_type=\"stuff\", retriever=vectorstore.as_retriever()\n", + ")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "xoS-WKXMXvvR" + }, + "source": [ + "And thats it! We're all setup. 
The next step is to run some queries, let's try a few:" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "7SKSlyq2iwpK" + }, + "source": [ + "### Query" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "6aSZr8fCXx9s", + "outputId": "ac5b5663-d45f-48c0-9f0a-f272e1a3ec2d" + }, + "outputs": [ + { + "data": { + "text/plain": [ + "{'query': 'tell me about the numpy library?',\n", + " 'result': ' The NumPy library is an open source Python library that is used for working with numerical data in Python. It contains multidimensional array and matrix data structures, and provides methods for efficient operations on these arrays. It is widely used in various fields of science and engineering and is a core component of the scientific Python and PyData ecosystems. It also offers a large library of high-level mathematical functions for working with arrays and matrices. '}" + ] + }, + "execution_count": 14, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "query = \"tell me about the numpy library?\"\n", + "qa.invoke(query)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "EtBw5EH7lv9_", + "outputId": "1745f881-fa15-44b5-e692-b702babce734" + }, + "outputs": [ + { + "data": { + "text/plain": [ + "{'query': \"What's the current version of numpy?\",\n", + " 'result': '\\nThe current version of numpy is 1.16.4.'}" + ] + }, + "execution_count": 15, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "query = \"What's the current version of numpy?\"\n", + "qa.invoke(query)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "fR4CmF9ylvzw", + "outputId": "1b33bb78-4b3f-4dea-addd-75f56eb4e5e6" + }, + "outputs": [ + { + "data": { + "text/plain": [ + "{'query': 'What kind of linear algebra related operations can be done in numpy?',\n", + " 'result': ' The numpy package provides various operations related to linear algebra, such as decompositions, matrix eigenvalues, norms, solving equations and inverting matrices, and performing linear algebra on several matrices at once. 
It also has support for logic functions, masked array operations, mathematical functions, matrix library, miscellaneous routines, padding arrays, polynomials, random sampling, set routines, sorting, searching, counting, statistics, and window functions.'}" + ] + }, + "execution_count": 16, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "query = \"What kind of linear algebra related operations can be done in numpy?\"\n", + "qa.invoke(query)" + ] + } + ], + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.12.1" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.py b/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.py new file mode 100644 index 00000000..e9eb9605 --- /dev/null +++ b/examples/Code-Documentation-QA-Bot/lancedb_cloud/main.py @@ -0,0 +1,136 @@ +# %% [markdown] +# # Code documentation Q&A bot example with LangChain +# +# This Q&A bot will allow you to query your own documentation easily using questions. We'll also demonstrate the use of LangChain and LanceDB using the OpenAI API. +# +# In this example we'll use Pandas 2.0 documentation, but, this could be replaced for your own docs as well + +import argparse +import os +import pickle +import re +import zipfile +from pathlib import Path + +import openai +import requests +from langchain.chains import RetrievalQA +from langchain.text_splitter import RecursiveCharacterTextSplitter +from langchain_community.document_loaders import BSHTMLLoader +from langchain_community.vectorstores import LanceDB +from langchain_openai import OpenAI, OpenAIEmbeddings + + +def get_document_title(document_list): + titles = [] + for doc in document_list: + if "metadata" in doc and "source" in doc["metadata"]: + m = str(doc["metadata"]["source"]) + title = re.findall("numpy_docs(.*).html", m) + if title: + titles.append(title[0]) + else: + titles.append("") + else: + titles.append("") + return titles + + +def arg_parse(): + default_query = "tell me about the numpy library?" + # default_query = "What's the current version of numpy?" + + parser = argparse.ArgumentParser(description="Code Documentation QA Bot") + parser.add_argument( + "--query", type=str, default=default_query, help="query to search" + ) + parser.add_argument("--openai-key", type=str, help="OpenAI API Key") + args = parser.parse_args() + + if not args.openai_key: + if "OPENAI_API_KEY" not in os.environ: + raise ValueError( + "OPENAI_API_KEY environment variable not set. 
Please set it or pass --openai_key" + ) + else: + openai.api_key = args.openai_key + + return args + + +def pre_process(): + from tqdm import tqdm + + docs = [] + docs_path = Path("docs.pkl") + for p in tqdm(Path("numpy_docs").rglob("*.html")): + if p.is_dir(): + continue + # loader = UnstructuredHTMLLoader(p) + loader = BSHTMLLoader(p, open_encoding="utf8") + raw_document = loader.load() + # docs.append(raw_document) + title = get_document_title(raw_document) + m = {"title": title} + if raw_document: + raw_document[0].metadata.update(m) + raw_document[0].metadata["source"] = str(raw_document[0].metadata["source"]) + docs.extend(raw_document) + + if docs: + with open(docs_path, "wb") as fh: + pickle.dump(docs, fh) + else: + with open(docs_path, "rb") as fh: + docs = pickle.load(fh) + + return docs + + +if __name__ == "__main__": + args = arg_parse() + + numpy_docs = requests.get("https://numpy.org/doc/1.26/numpy-html.zip") + with open("numpy-html.zip", "wb") as f: + f.write(numpy_docs.content) + + file = zipfile.ZipFile("numpy-html.zip") + file = file.extractall(path="numpy_docs") + + docs = pre_process() + + print("Loaded {} documents".format(len(docs))) + text_splitter = RecursiveCharacterTextSplitter( + chunk_size=1000, + chunk_overlap=200, + ) + documents = text_splitter.split_documents(docs) + embeddings = OpenAIEmbeddings() + + db_url = "db://your-project-slug" + api_key = "sk_..." + region = "us-east-1" + table_name = "langchain_vectorstore" + + vectorstore = LanceDB( + embedding=embeddings, + uri=db_url, # your remote database URI + api_key=api_key, + region="us-east-1", + table_name=table_name, # Optional, defaults to "vectors" + mode="overwrite", # Optional, defaults to "overwrite" + ) + + # insert documents in batches + batch_size = 10000 + for i in range(0, len(documents), batch_size): + print(f"ingesting batch of {i} : {i+batch_size}") + batch = documents[i : i + batch_size] + vectorstore.add_documents(batch) + + qa = RetrievalQA.from_chain_type( + llm=OpenAI(), chain_type="stuff", retriever=vectorstore.as_retriever() + ) + + result = qa.run(args.query) + print(result) diff --git a/examples/Code-Documentation-QA-Bot/lancedb_cloud/requirements.txt b/examples/Code-Documentation-QA-Bot/lancedb_cloud/requirements.txt new file mode 100644 index 00000000..79d2c50b --- /dev/null +++ b/examples/Code-Documentation-QA-Bot/lancedb_cloud/requirements.txt @@ -0,0 +1,8 @@ +argparse +openai +langchain-community +langchain-openai +lancedb +unstructured +tiktoken +polars diff --git a/examples/Evaluating_RAG_with_RAGAs/Evaluating_RAG_with_RAGAs.ipynb b/examples/Evaluating_RAG_with_RAGAs/Evaluating_RAG_with_RAGAs.ipynb new file mode 100644 index 00000000..f69e55d3 --- /dev/null +++ b/examples/Evaluating_RAG_with_RAGAs/Evaluating_RAG_with_RAGAs.ipynb @@ -0,0 +1,1158 @@ +{ + "nbformat": 4, + "nbformat_minor": 0, + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "name": "python3", + "display_name": "Python 3" + }, + "language_info": { + "name": "python" + }, + "widgets": { + "application/vnd.jupyter.widget-state+json": { + "91f4187ef74b4c0791fa9058899f7454": { + "model_module": "@jupyter-widgets/controls", + "model_name": "HBoxModel", + "model_module_version": "1.5.0", + "state": { + "_dom_classes": [], + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "HBoxModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/controls", + "_view_module_version": "1.5.0", + "_view_name": "HBoxView", + "box_style": "", + 
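The `lancedb_cloud/main.py` script above targets a remote LanceDB Cloud database rather than a local directory: it hands a `db://` project URI, an API key and a region to the LangChain `LanceDB` vectorstore and then ingests the split documents in batches. A condensed sketch of that pattern follows; as in the script, the project slug, API key and table name are placeholders, not real credentials.

```python
# Condensed sketch of the LanceDB Cloud ingestion pattern from main.py above.
# The db:// slug, API key and table name are placeholders only.
from langchain_community.vectorstores import LanceDB
from langchain_openai import OpenAIEmbeddings


def ingest(documents, batch_size=10_000):
    vectorstore = LanceDB(
        embedding=OpenAIEmbeddings(),
        uri="db://your-project-slug",    # remote LanceDB Cloud URI
        api_key="sk_...",                # LanceDB Cloud API key
        region="us-east-1",
        table_name="langchain_vectorstore",
        mode="overwrite",                # recreate the table on each run
    )
    # Add documents in batches so each request stays reasonably small.
    for start in range(0, len(documents), batch_size):
        vectorstore.add_documents(documents[start:start + batch_size])
    return vectorstore
```

The retriever returned by `vectorstore.as_retriever()` then plugs into `RetrievalQA.from_chain_type(...)` exactly as the script does at the end.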
"children": [ + "IPY_MODEL_d9fb5f1092e24ba59cb768842ff8f828", + "IPY_MODEL_6441a4ce0f644c51aa6c140de43ed31d", + "IPY_MODEL_a6b459a38e5c4386b85ee7ebc0e302a4" + ], + "layout": "IPY_MODEL_cf96076499974020b541a541648028f4" + } + }, + "d9fb5f1092e24ba59cb768842ff8f828": { + "model_module": "@jupyter-widgets/controls", + "model_name": "HTMLModel", + "model_module_version": "1.5.0", + "state": { + "_dom_classes": [], + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "HTMLModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/controls", + "_view_module_version": "1.5.0", + "_view_name": "HTMLView", + "description": "", + "description_tooltip": null, + "layout": "IPY_MODEL_82cd48cdf6f144e19cb3ef0a0553b689", + "placeholder": "​", + "style": "IPY_MODEL_8e537aa094004828b08a55a94cbd7dff", + "value": "Evaluating: 100%" + } + }, + "6441a4ce0f644c51aa6c140de43ed31d": { + "model_module": "@jupyter-widgets/controls", + "model_name": "FloatProgressModel", + "model_module_version": "1.5.0", + "state": { + "_dom_classes": [], + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "FloatProgressModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/controls", + "_view_module_version": "1.5.0", + "_view_name": "ProgressView", + "bar_style": "success", + "description": "", + "description_tooltip": null, + "layout": "IPY_MODEL_55a45baafaad4a4ca2cad720da0a80aa", + "max": 27, + "min": 0, + "orientation": "horizontal", + "style": "IPY_MODEL_7f2a940f7f114e439f63e5e900748511", + "value": 27 + } + }, + "a6b459a38e5c4386b85ee7ebc0e302a4": { + "model_module": "@jupyter-widgets/controls", + "model_name": "HTMLModel", + "model_module_version": "1.5.0", + "state": { + "_dom_classes": [], + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "HTMLModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/controls", + "_view_module_version": "1.5.0", + "_view_name": "HTMLView", + "description": "", + "description_tooltip": null, + "layout": "IPY_MODEL_50282721d29447c89bc05559a99183dd", + "placeholder": "​", + "style": "IPY_MODEL_23c5a724d0fa40efac45f1fe8fc0b9c5", + "value": " 27/27 [00:23<00:00,  4.90s/it]" + } + }, + "cf96076499974020b541a541648028f4": { + "model_module": "@jupyter-widgets/base", + "model_name": "LayoutModel", + "model_module_version": "1.2.0", + "state": { + "_model_module": "@jupyter-widgets/base", + "_model_module_version": "1.2.0", + "_model_name": "LayoutModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "LayoutView", + "align_content": null, + "align_items": null, + "align_self": null, + "border": null, + "bottom": null, + "display": null, + "flex": null, + "flex_flow": null, + "grid_area": null, + "grid_auto_columns": null, + "grid_auto_flow": null, + "grid_auto_rows": null, + "grid_column": null, + "grid_gap": null, + "grid_row": null, + "grid_template_areas": null, + "grid_template_columns": null, + "grid_template_rows": null, + "height": null, + "justify_content": null, + "justify_items": null, + "left": null, + "margin": null, + "max_height": null, + "max_width": null, + "min_height": null, + "min_width": null, + "object_fit": null, + "object_position": null, + "order": null, + "overflow": null, + "overflow_x": null, + "overflow_y": null, + "padding": null, + "right": null, + "top": null, + "visibility": null, + "width": null + } + }, + 
"82cd48cdf6f144e19cb3ef0a0553b689": { + "model_module": "@jupyter-widgets/base", + "model_name": "LayoutModel", + "model_module_version": "1.2.0", + "state": { + "_model_module": "@jupyter-widgets/base", + "_model_module_version": "1.2.0", + "_model_name": "LayoutModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "LayoutView", + "align_content": null, + "align_items": null, + "align_self": null, + "border": null, + "bottom": null, + "display": null, + "flex": null, + "flex_flow": null, + "grid_area": null, + "grid_auto_columns": null, + "grid_auto_flow": null, + "grid_auto_rows": null, + "grid_column": null, + "grid_gap": null, + "grid_row": null, + "grid_template_areas": null, + "grid_template_columns": null, + "grid_template_rows": null, + "height": null, + "justify_content": null, + "justify_items": null, + "left": null, + "margin": null, + "max_height": null, + "max_width": null, + "min_height": null, + "min_width": null, + "object_fit": null, + "object_position": null, + "order": null, + "overflow": null, + "overflow_x": null, + "overflow_y": null, + "padding": null, + "right": null, + "top": null, + "visibility": null, + "width": null + } + }, + "8e537aa094004828b08a55a94cbd7dff": { + "model_module": "@jupyter-widgets/controls", + "model_name": "DescriptionStyleModel", + "model_module_version": "1.5.0", + "state": { + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "DescriptionStyleModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "StyleView", + "description_width": "" + } + }, + "55a45baafaad4a4ca2cad720da0a80aa": { + "model_module": "@jupyter-widgets/base", + "model_name": "LayoutModel", + "model_module_version": "1.2.0", + "state": { + "_model_module": "@jupyter-widgets/base", + "_model_module_version": "1.2.0", + "_model_name": "LayoutModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "LayoutView", + "align_content": null, + "align_items": null, + "align_self": null, + "border": null, + "bottom": null, + "display": null, + "flex": null, + "flex_flow": null, + "grid_area": null, + "grid_auto_columns": null, + "grid_auto_flow": null, + "grid_auto_rows": null, + "grid_column": null, + "grid_gap": null, + "grid_row": null, + "grid_template_areas": null, + "grid_template_columns": null, + "grid_template_rows": null, + "height": null, + "justify_content": null, + "justify_items": null, + "left": null, + "margin": null, + "max_height": null, + "max_width": null, + "min_height": null, + "min_width": null, + "object_fit": null, + "object_position": null, + "order": null, + "overflow": null, + "overflow_x": null, + "overflow_y": null, + "padding": null, + "right": null, + "top": null, + "visibility": null, + "width": null + } + }, + "7f2a940f7f114e439f63e5e900748511": { + "model_module": "@jupyter-widgets/controls", + "model_name": "ProgressStyleModel", + "model_module_version": "1.5.0", + "state": { + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "ProgressStyleModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "StyleView", + "bar_color": null, + "description_width": "" + } + }, + "50282721d29447c89bc05559a99183dd": { + "model_module": "@jupyter-widgets/base", + "model_name": "LayoutModel", + 
"model_module_version": "1.2.0", + "state": { + "_model_module": "@jupyter-widgets/base", + "_model_module_version": "1.2.0", + "_model_name": "LayoutModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "LayoutView", + "align_content": null, + "align_items": null, + "align_self": null, + "border": null, + "bottom": null, + "display": null, + "flex": null, + "flex_flow": null, + "grid_area": null, + "grid_auto_columns": null, + "grid_auto_flow": null, + "grid_auto_rows": null, + "grid_column": null, + "grid_gap": null, + "grid_row": null, + "grid_template_areas": null, + "grid_template_columns": null, + "grid_template_rows": null, + "height": null, + "justify_content": null, + "justify_items": null, + "left": null, + "margin": null, + "max_height": null, + "max_width": null, + "min_height": null, + "min_width": null, + "object_fit": null, + "object_position": null, + "order": null, + "overflow": null, + "overflow_x": null, + "overflow_y": null, + "padding": null, + "right": null, + "top": null, + "visibility": null, + "width": null + } + }, + "23c5a724d0fa40efac45f1fe8fc0b9c5": { + "model_module": "@jupyter-widgets/controls", + "model_name": "DescriptionStyleModel", + "model_module_version": "1.5.0", + "state": { + "_model_module": "@jupyter-widgets/controls", + "_model_module_version": "1.5.0", + "_model_name": "DescriptionStyleModel", + "_view_count": null, + "_view_module": "@jupyter-widgets/base", + "_view_module_version": "1.2.0", + "_view_name": "StyleView", + "description_width": "" + } + } + } + } + }, + "cells": [ + { + "cell_type": "markdown", + "source": [ + "## Evaluating RAG with RAGAs using GPT-4o\n", + "\n", + "Ragas is a **framework for evaluating Retrieval Augmented Generation (RAG) pipelines**.\n", + "\n", + "Ragas provides you with the tools/metrics based on the latest research for evaluating LLM-generated text to give you insights about your RAG pipeline. 
Ragas can be integrated with your CI/CD to provide continuous checks to ensure performance.\n", + "\n", + "GPT4-o is used as an LLM to generate responses out of semantically close context chunks.\n", + "\n", + "![flow.png](data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAABSYAAAJUCAYAAAAIH/gPAAAABHNCSVQICAgIfAhkiAAAABl0RVh0U29mdHdhcmUAZ25vbWUtc2NyZWVuc2hvdO8Dvz4AAAAtdEVYdENyZWF0aW9uIFRpbWUAVHVlc2RheSAyOCBNYXkgMjAyNCAwMjozODoxOCBQTS8GbXgAACAASURBVHic7N13XJV1/8fx12FvkD0OIO69TcgsUyutrDRNK73bZcv0ltavvNvz1rRdNtSs1HCUd8OtWSSO3HsLB0EERPY65/z+QI4SuBUQ3s/Ho0eHc13X9/pcB/Tx4O3n+/0arFarFREREREREREREZFqZFfTBYiIiIiIiIiIiEj9o2BSREREREREREREqp2CSREREREREREREal2CiZFRERERERERESk2imYFBERERERERERkWqnYFJERERERERERESqnYJJERERERERERERqXYKJkVERERERERERKTaKZgUERERERERERGRaqdgUkRERERERERERKqdgkkRERERERERERGpdgomRUREREREREREpNopmBQREREREREREZFqp2BSREREREREREREqp2CSREREREREREREal2CiZFRERERERERESk2imYFBERERERERERkWqnYFJERERERERERESqnYJJERERERERERERqXYKJkVERERERERERKTaKZgUERERERERERGRaqdgUkRERERERERERKqdgkkRERERERERERGpdg41XYCIiIiIiIiIiNROJpOJuLg4AJKTk0lKSqpwzGQy1VRp1cpoNFZ4HR4ebvu6W7duGI1GYmJiaqK0y5rBarVaa7oIERERERERERGpWSaTiaSkJBISEmz/nYlvqH81VFbzMg+ln/Eco9FoCy1vv/12BZVnQcGkiIiIiIiIiEg9Vt4VOXHixErHfEP9ib7tSnxD/fEL8z/+np/tdX2TkVwWUGYeyrB9nXkond1rdrJ7zc4K5xqNRgYNGsTgwYMrdFzKCQomRURERERERETqoaoCyfIgsmnXFjTt2rwGq7s8ZSSfCClX/RQPYJvmrS7KyhRMioiIiIiIiIjUM7Gxsba1IwFufOwWut3avd52Ql4KGcnprPopnl8/mWd7Lzo6mpkzZ9ZgVbWLgkkRERERERERkXrCZDIxZswYEhISbN2RNz52a02XVaeVB5QJP/5F5qF0jEYjM2fO1PRuFEyKiIiIiIiIiNQLK1euZOjQoUDZlO2nJj+tDslqdHIHZfn6k6NHj67psmqU/csvv/xyTRchIiIiIiIiIiKXTlxcHA8//DAA3W7tzqgpz+Dm5VbDVdUvbl5uNO3aAoC/l6yx7Xpen9edVMekiIiIiIiIiEgddnKn5I2P3aKp27VARnI679/3XzIPpTNq1Kh62zmpYFJEREREREREpI4ymUx0794dgKcmP6OdtmuRjOR0XrrhWYxGI+PGjauXnZN2NV2AiIiIiIiIiIhcGmPGjAGgadfmCiVrGb8wf4a9fj8mk4nY2FhMJlNNl1TtFEyKiIiIiIiIiNRBQ4YMISEhgaZdm/PU5GdquhypQvRt3el2a3fbbun1jYJJEREREREREZE6ZuXKlbbNVbSmZO1242O30LRrcxISEli5cmVNl1OtFEyKiIiIiIiIiNQxs2fPBsp24NYU7trNL8zfFh5PnDixhqupXgomRURERERERETqmLi4OKCsG09qP99Qv3rZNalgUkRERERERESkDomNjQXKuiX9wvxruBo5G35h/nS7tWz39PJu1/pAwaSIiIiIiIiISB2ibsnLU/mU+5UrV9abrkkFkyIiIiIiIiIidUR5KKluyctP2VqTt2AymWwbF9V1CiZFREREREREROoIk8kEoA1vLlNNu7YAUDApIiIiIiIiIiKXl/JAS8Hk5ck31A84ETDXdQomRURERERERETqiPJgUtO4L09+Yf407dock8lUL8JJBZMiIiIiIiIiInVA+YYp5bs7y+UtKSmppku45BRMioiIiIiIiIjUAfWhw64+uPGxW4H6sc6kgkkRERERERERkTpg1apVwNmvL2lnLsU5PxPHwmN4p2y+lKVVq64z7iFg7+81XcYFqw/BpENNFyAiIiIiIiIiIhfP2awvGbR7MY1WfoZj4TEASly8SRg2E6ud/aUu75Jzzk/HNzGBI42vqelSzkv5Bjj1gYJJEREREREREZF6pFHC5xg3zSLXvwmHm11PwJ4lOOdnErRzAaktb6zp8i6cxYLrMROeR3bgkp2KXWkRpc6eZDSMAQw1Xd1Zqw9T8xVMioiIiIiIiIjUAeWbpZyu4y5ozxKMm2aRHtWD7X1exGqwI61JLzrNeZSwbT9dlsGkXWkRwbsW4pJ9CNfsQ9hZSvFK20HHuU/azinyCOJYSDtKnT1qsNKzU592VFcwKSIiIiIiIiJSB5ypw85gMRO5Zip5/o3Zce1zWA1lW4/k+jehwDsM16zz3wXa70A8/gf+wrEomxz/ZqQ37E6eX6PjR62EbZ6Dd+oWLPaO5AQ040iT3hS7NqgwhktOCkG7loDVzNGIbmQHtqh0H6eCo3imbqPUzYdjQa0AA2FbfiRq9ZcVzitx8eZQ61vJCWpBrm9jit18T/u5nGoK+5lqcs0+hP++FfglrcYtcz9mR3c23PIeRR6BtnPszKU0SFpNwP4/8TiyA5fcNExtB3Kg6/2n+UTrBwWTIiIiIiIiIiJ1yKk67gL2rcAlJ4UtV76GxcGpwrGDne/B8/DWStd4H95KxNppeB/egsXeiVz/xqS07M+RRlefuN/+eFotfgWD1QqA78EEItZ/x8FOw0nsdBeNV35O2ObZtvMD9ywjYv10dvV8moyIaAAi1n9PxLpp2JlLAYhc9y27eoyu0MEZtGcJTf74APuSfACyQ9qyue/rZER0wyN9F3l+TcgObEGL5e+Q7xPOwc7DKz2PwWohfMNMUptdT7G7H41XfkrY5jkkdryzUlB4upqc8jNpteR1vI5vGmR2cCXPvzElzp5Y7U7EbWFbfyJy7VQcinIAyPcJ51hQG0r+Ecr+k2+ov6Zyi4iIiIiIiIjI5eFMQZbX4S1Y7BzIMnapdCytybWkNbnW9rXBaqHpigkE71wAWMkJaE5uQFO8Dm+j5eLXaNDyRvbEPI7FwZFGq7/EarBnS783yAptj2NRNsE7fiN02zzMTm6EbplLoWcIm296mwKvENyyTBg3/kDzZe/y9+2fEbBvBQ3XTCYrrAP7r3gQi50jbRa8SPimH2zBZNjWH2kc/zElLl4kt7kL94y9+CWuImj3Eg616s
/2PmNttRd6BGFfXFDlZ2Bfkk/DNV+XTem2lnVyAkSsn8Hh5n0p8AoFwLhp1mlr8k1abQslkzoM5WDne7DYV47ZjBtn4lCUQ4mLNxtu+8A2/pn4hfmReSgdk8mE0Wg8q2suRwomRURERERERETqCN/Q069PaLBawGo54zh2pYUE75wPgKn9Hey/4kGshrKNY/wSV9FiyZsYLGaS2g/B9ZiJ5DYDOGrsBECxawMSO95FYse7CNn+Mwarhb3Rj9hCuXyfcHZdM4Zd14zBKS+dhmsnU+rsSXLrARS5+eOVtg2HojwKvL0BcM49QqO/PqHQM4QNt31AsasPduYSOs96GLODc6XarfaOOBTlVv1gZU2dBO5ahOeRHWQHtiClVX+aL/8vvomrSG4z4KxqSmvSG88jOwne/gvGTXE4FmVzsOPdFaZwA+y49nmaL38Xl5xU2vz2Agc7/4sjjXvaPsv6TsGkiIiIiIiIiEg9kN6wB6Fb5xGx/tszrm9odnQDINe/Kfu6PVThWEZENzIiu+G3/09Sm10PQKFXSJXjOOUfPe3xiA3TMZhLKPYKpfXCl2zvW+wd2H/FAwD4HVyJwWplb8wIil19jh93ZM2QyVWOabBaygLYk3gd3kZ2UEucCsrq8UrbTqFnMNtueI1iV28arv4Kl5zDZ12Txd6R3Vc9RUqLGwnb8iOBu5cSuGshaU36YGo/mHyfCACOhbRl7eCvCNnxC8E759Ni6ZtErJuGqf0dpDXpU2WXZX1Sv59eRERERERERKSeyArrQEbD7kSsn45jYQ77r3jgtLtUWxycMDu5VXrf7egBfBNXkx3YyrZ2ol1pcZVjlB+3N1d93H/fH2Q07M723i8SsHcpPoc2UezuR1rjnuQ3aAhg25jGK20bGQ2vPONzFrs2wCXrxLR294x9dPjpKdYP/Bj7ovzjz+bMln5v2ILO3MCWuB09eNY1lcv1b8rOnk+zN+ZRwrb8SOjWuQTvXICp3e3si37E9jkmtxlAcpsB+BzaQPj6GTT7fTwR675j6w2vkucbdcZnqqsUTIqIiIiIiMhFZzKZiIuLIyEhwfZ1fdjIoa45eW07o9FIdHQ00dHRxMTE1GBVciG29RlLo4TPCd36E0G7FpAZEY3Z3gmXvDQcC3M40vgaDnYaBsDhZtcTsu1nWix5k6zQDthZSvE6vAX//X+Q3yCK7de9iG/SWgCcCjKrvJ/BagbAMb/q4w7FOdiVFmO1syet6XWkNb2u0jnpUT2IXP8d4Rtm4nY0kSONryUruA3FHgFVjlnkHohTQRZOBVmUuHgRtvVHLA7O5HsbaWBaB0BS+ztsXY1QtjO5cWMcdubSs6oJIGzzLHxSNrM3ZgSFniEc7DyMpPaDiFozBeOmWeT6NSI96ho6/vQkKS1uIqVVf7JCO5AV2gGvtB20XPwqrRe8xOo7v6ly/PpAwaSIiIiIiIhcFOVh5MSJE095Tl3exKEuOjlMNplMtqDZaDQSExPDqFGj9D29zFjt7Nl75WMktx1A2KbZeKXtwC0rCbOzO9kBzTlq7Gw7d2/0I1gM9vgfTCBw7zIsDk7k+DVl9zWxHGl0DRY7B3L9GmNxcMJi71Tl/XICW8LWeZV2AbcdD2pNA9NavFM2cSykXZXnlLh4se7WD2j2+3v4HVyJ38GVABR5BJId2IK0Jr3IaNjddv6x4FaEbZ5F57gHMVgtOBTlcKDr/Zgd3choeCX7oh/hUKv+Fe6R0rwfQTsXYF+Sf1Y1AZS4+eF7cCUNEleTHdKGEhdvDBYLnkd2AcensdvZAdAk/kOMm+LI82+Mxc4Rx8JjOOYfxeLoil1p8Sk/n7rOYLUe38tdRERERERE5DxNmDDBFkg6ODjQrVs323/h4eGEhYXVcIVyIZKTkwFISEhg9erVzJgxAygLKAcNGsTo0aNrsjw5LjIyEt9Qf15d+M4lGL08Pqq8aYvnkZ0UeIdR6lT1tPAGpnUcNXas8lqvtO20nzcas4Mr+6If5kjja7DaOeF+dB+Bu5fhdyCew017c7DLPQC4ZSXhfWgjXmnbcc/ch0tOKoeb3cDemBG2MQ1WK60WvoTPoY1khnchtcVNto15zsa51OR+9CDG9dPxStuOc94RLPZOFHkFk9rsBlJa9sdi74CduZSgXfMJ3rEAl5xD2JcUUOLsSVZoR5LbDSLXv0mlGt6/7112r9lJfHx8nQ7/FUyKiIiIiIjIBRkyZIitk27UqFEKqeqBf3bHGo1GZs6cWacDlMvBpQ0mL50GprW0WPoWjoXZx98xUB6EFnkEsvOaWLLCOtarmhRMioiIiIiIiJxBeShpNBoZN26c1h6sZ0wmE0OGDMFkMmE0GomPj6/pkuq1yzWYBLArLSJg3++4ZSVhsJgp8A7lWEh78n3C62VN9SWY1BqTIiIiIiIicl4mTJhgCyUVSNVP5Z2SY8aMISEhgdjYWMaNG1fTZcllyOLgzOFm19d0GRXUxprqGruaLkBEREREREQuT+XTeBVE1W9Go5Hx48cDEBcXx8qVK2u4IhG5XCiYFBERERERkXM2YcIEAAYPHqzp22Kbyg+cdld2EZGTKZgUERERERGRc1YePo0aNaqGK5HaojygNplM6pqUWmfhF7+wb/2emi5D/kHBpIiIiIiIiJyTuLg4oKxbsi5vyiDnxmg0MnjwYEwmU02XInVIUX4h+dn5FzzOuoVrWfvrqotQkVxMCiZFRERERETkvHTr1q2mS5Ba5vbbbwdg9uzZNVyJ1AWm7Yn8X88xPHPlk3z170/Jzcw577GsFiuFuQUXsTq5GLQrt4iIiIiIiJyTVavKuo4upFty8+bNtG3b9mKVJLVEeHg4gKZy13KWUjMHtx5g34Y9HNi0jyMHD1OYV0TDtlEMe/1+HJxqR1w0f9LPFOUXEtbcyKZl60nansiTX47BL8z/nMeymC0Y7NSfV9voOyIiIiIiIiLnJCkpCTgRQp2rwsJCnnziKWbMmHExy5JaQFP7a6/9G/ey8Itf+Pjh93g65knG3/0mP46L48CmfTi6OOPp58Xm5Rs5tPvCp+IfPpDKs91HkpmScUHjHNy8n1ZXteX52a/w9PSxAEy89x3Sk9LOeSxzqRlnN+cLqkcuvtoRgYuIiIiIiMhlo3wNwQsJoXy97Zk1cyrFRcX8655/XazSpBYwGo1aZ7KWWfTVb/w0YRYALh6utOvVkVY92tLyytZ4+HrazivKL8TZzeWsx92xciupe1O45u7eGAwG2/sF2fnkHctjV8J2ogdcVem6zcs34hfqR2iz0/8dkpOZzTXdegNgbBFO7Pf/xwcPjOP9+/7L09NfxCvA+6xrtZSacXJVMFnbKJgUERERERGRamexWJnzgS9DxnyPFSv33HNPTZckUmcFNwoBoH3vjtz77iM4OjtWed65hJIACyb9wu41O8nPzuPGx261vW+1WABI3HYQP2MAWYePUlpSSnCjEKLaN2b/hj2s/XUV97378CnHLsovpLS4FDcvd9t7Hg08eeLzf/P24FdYPHk+A58Zcta1mkvNuKhjstZRMCkiIiIiIiLnxGQyX
Zwpu1aYOd6PobFlU7oVTtYN5R2TF+3nRC5Y067NAQhrHn7KUPJ83PvOwyydupBdq3dwRf8YNi3dQLrpCIlbDwCwYvpSVkxfCoDBYKBVj7Y8+slTODg5cnh/ylnd46RGTAC8ArzpcF0nXDxcz6nWkuISHF2dzukaufQUTIqIiIiIiEjNscKM8b70GDYVs9nC/fffV9MVidRZph1JJG0/SFFeEQe37idx60GSdyaRk5FN535XcMcLdwNlG8XEz1rB1hWbsJSa8QluQP+nbsfzpGnfAN6BPgx4+g4A/v5tNUnbDxLVvjHNu7Xki1Efc/MTt9F3RP9KdXg08MC0PZHCvEJc3Mu6NAvzCnF0csTe0Z5vx04mYe6fACz88ldMO5IIbhRCcONQQhqHcscLwwDYuHgdpp2J9BtxC3b2FbdRyc3MsU1Tt1qt5B3NxSfAx/Z8i7/+jW63dsc70IfZ78xg2bRFXP/QTdzy1MAK48z//Gd++2we5hKz7b07X76H7oOuJjv9GF+P+Yw9f+8CyjpOjS3CcfN2586X/oWXf9lU8xXTl/LLRz+SdywPgMCGwfiG+NHmmnb0HNbnnL6HdY2CSREREREREalR//kwn/69fFi2cCaAwkmRS2TT0vVsWrre9rVXgDctY1oT1iKc1j3aAVBSVMKUZyaxael6IttGEdI4lIxDGXw6YgKPfDQS70CfCmN+O3Yy/Ub0p3O/K+jc7woAW4hXmFdYZR1OLmWdiyWFxbi4u/DHjGXMens6PkENeH7OK1z/4I2Yi0tZ80sCRxLT+P37JRWub3VVW25/bii/fvoTyTtNXHNXbzwanAhNczKyeaHXGEZ/8xxR7RtTXFiM1Wq1dVkW5Rcy7/05uHq6YbVaWTZtEQCLvvyVKwdehX94IABLpy7k5w/n0qxbC24dPQgHR0c+e+J9lkyeT/dBV7Ptj822UPL6B2/kpsdvw97RvtLzLvr6N/KO5eHp68mY7/7PNr4omBQREREREZGaZICIEEceHOgIBleGPzeT4uIiRowYUdOVidQ57j4eBEQE0umGrjTr1pKw5sYKm9YATH76c7as2MSIT56idY+2APzwxnesmL6UeRNnM/zNByqcv37BWvzD/Ct0Rto72mMwGCgqKKqyDsfyYLKohLnjfmDJlAW4uLuQkZzOrx//xMBnhnDPOw9xcMt+vPy9uX/cCHKP5nD4QCqpe1PIO5aLnZ0dKXtTCGoYXCGUBNi4ZB0Ws4X84x2KxflldZRvfmO1lp23at5fHNyyn4btGtFjSE+mvfA1W1Zsoufdfcg6fJT/fTAHd293et7dB5/ABuzfuJf8Y/kERJYFi11uiubglv38Gfc7S6YsIC8rl74j+tMg2LdCPfe89RDTXviKjOR0Pn30fW587BY69b0Cg90/5qnXQ3ZnPkVERERERETkErFSFkoefz3tbT/+9+NsPvvss0tyuxUrVtC6dWtat24NQH5+PlFRURgMBgwGA717967yunnz5tGiRQvbeeX/ubu789577531/Z9//vlKY0SER7Bs2TIA2rVrV6G+quTm5nLvvfdWGsdgMNC3b18yMzPPWMc999xT6dqBAweSl5d31s8iInKhFEyKiIiIiIhIrTF9fjEhQU4krPiRTz/99KKPn5+fz7Zt29i2bRtWqxV3d3cOHDiAn68fAEuXLuXjjz+ucE1kZCS33norJpOJFStWYLVasVqtJCUl4e/vz5gxYwgKCsJa3oZVhdzcXOzt7Xn77bcB6N27N+PHj2fSpEkEBQfRq1cvYmNj2bx5s62+qkyaNAlPT0+mTp3KY489RnZ2NlarFbPZzPvvv8+CBQvw8/Nj/vz5VV5fWlqKvb0933zzDa+++io5OTkUFxezcOFCfvvtNzw8PIiNjT2fj1YuAx36dCL2+xfodc/1GFuEV+qWPLhlP5uWrmdg7B22bsmi/EI2LP4bgNU/ryR5Z1KFa9y83EjakVjhPYvZUuH/AMWFxZh2HL/2+J+VWW9PZ8mUBUQPuIo3lo3H2CKchB//tP1ZcnB2BEPZlPPQZkY6Xt+Ffo/2Z9Bzd+Ib4oel1ExAZFCFexcVFLHgi18AaNy5me0ZANvGPzkZ2QAc2LQP3xA/Rnw0kituuRKvAG8ykzOAsrUtS0tK8Q70YdLIj3ih1xi+HP0JpSUl3DLq9rL6nBwYMnY4z8wcS5eboln7yype7vcc346dTOq+E5v7NOnSjBfnvc6g5+/EwdmByc9M4vVbXyRh7p8V1q6sjxRMioiIiIiISK2xP8nMl6/4MOVNPzau+YVPPrn44WS5//73vzz99NNYrVbSM9LJzc3Fx8eHxx9/3HaOr68viYmJNGrUiNzcXHr06GE7ZjQaOXjwIABpaWmMGjXqlPfy8vLCYikLaQoKCli8eDH//ve/eeihh1izZg2///4748ePx8/P75RjrF69mkceeQSAv/76i48//hhPz7IprHZ2dowcOZKxY8cC0K9fP0pLSyuNERgYiMViITk5mbFjx+Lh4YGjoyPXXXcdBQUFdOrUifHjx3Ps2LGz/RjlMlJaXPln4mRHU8q6bRt1agqUrRH5dezn5KRn0+3W7mAtCxNPHsfL35vkHRXDSjt7O9x9PCjMLbC999fsFUz4V1kwX1xYDJStedl90NUMe+0+nN1c6NT3CvKz80nZcwgA3xA/jqVlVVmrvaM9nn5eHNi0j9yjOWX15hbw1ahPbM9Rcvw+5f9mkJeVC2Ab08nFiUc/HYWHrycGg4Godo1J3Vt27/UL19KuV0eejXuJ4W8+QPSAq+j7yM08F/cSrbq3qVBLeMtIhr9xP68tGUffh/uzedkG3rh1LHP+O9N2jqOzIz3v7sPzs1/hqa+fpkGwH9+OncyrN/8fh3aZTvt9qcsUTIqIiIiIiEit8X8PuoIVsMJn//Fm5R8/88H7H1ySe40fP553333X9rW7uztHjx6tcM7hw4cB2LFjxynH6du3LwAffFB1nSEhIbYOsLy8PFxcXCqdc/XVV/Pwww+TkZFxyvtcccUVvPnmmzzzzDPExMRUec6rr75qe71o4aJKx8ufLzQ0tMrrV6xYAcBzzz13yjrkMnS8MzInM+e0p0W2jcLByYFJT37I509+yEvXP8OOv7Yy7I37Gf7G/QyIvYPda3by0UPjyUwp+1n1DvShtKRy4Nkg2JdDu5OxWq0U5hawcs4fGFuEA1BwPLAMaRLGoOfutF3TvncnDAYDu9eU/XkztggnIzmdkqKSKuu98dFbyD2aw7g732Dqs1/w5oCX2LFqO11vjgZgzc8JwIlOyfIuxoKcfAD63N+X4EYhtvHCW0awd/0ezCVm8rPzKC0qwd7Bnm63XMmw1+7j5icHENIkrEINS6YuYNLIj0g3HcHNy41+j/bn1UXvcu3w61g6dSGr5v1FSVEJb93+EiumL8VqsdL0ihY88cW/if3+BSwWC5NGfnTa70tdps1vREREREREpFbavBsyjpaya+sSJkywMHr0qTsSz8e33357xnMcHR1PO0Ub4JFHHrFNnS4tLcXBoeKv2qmpqQA0bNgQNze3
U47zwQcfMGnSpNPe6/nnnz9jzeUWLlpIvxv7VXnMYrFgZ1e5V8nd3f2MzyuXHxd3F6I6NKFl91OvXQplYeK97zzMnP/OZOfKbRhbRtB/5ECadm0OQK97ric/J58Fk35m1lvTefiDJ+hwXWdb8HeyqA6NWTF9Kf+57hlyj+ZgNlt48osxQNkmPPYO9gx/437bRjgAQVHBdOrb1dZB2OWmaDYtXU9xYXGV9+gx9FrcfTyYO/4HNi5ZR6sebbnvnuuJaBOF1WIl5Xj3o09QA/yNAaxfuJbe995A22s7MPDpIVw1pGeF8WIG9mDl3D8pzCugUYcmbP9rK7vX7LQ9f1W8A3zYvHwDW1Zsoknnpng08MRitpC45QBQNm3c3t4OMPDDG9+xZMoCjC3CcXByJPdoDseOHMPV3YWSopIqn7GuM1j1N46IiIiIiIicg8jISIxGI/Hx8ed1fWFhIUMH3cyPHwad9ryn3s7lvec8sTdYefyNXJq06sno0aPP657l5s+fT79+ZWHdufw6XFpayn/+8x82bNiAwWAgJiaGZ599FkdHR3bv3k2zZmVr2eXk5ODh4WG77tNPP+Oxxx4FID09/bRTtYEKa/6drr74+HjGjx9Pfn4+np6ePPfcc3Tu3LnCGDfddBM///xzleMHBgayf//+0wal52vIkCEkJCQQHx+P0Wi86OPLqUVGRuIb6s+r33VpMwAAIABJREFUC9+5pPcpyMnHxd31tLtKZx0+yocPjqMgt4AOfTpzzV29CYoKth3Py8rF3cej0nXZ6cfIPJRBw3aNLmrNRxLTyE4/RuPj09TP5MCmfUz419s4uToz8Ok76HhDVxydHEnelcTfv65m45J1dLk5mpufuI3UvYdY8MUv7N+4j6zDmTg4OeJv9Cf6tqvoMeRa7B3tMZeYWTn3DxLmxnMkKY2i/ELcvN1p3q0lve+5AWPLiAr3f/++d9m9Zmed/3OkjkkRERERERGpld5/zgOwghU+fsGDx99YzttvFfHc89U7zXj48OEVuiu7dOnC559/ztixY+nRowdTpkyxHfvnuo5vvfWm7bWXl9cF15KYmEjLli3Jzy+biurn50ezZs3o0qULTk5O7Ny5E19fXzIzM8lIr7w794ABA5g7dy5paWm4u7vj6+vLyJEjeemlly64tvrCZDIxZswYoqOjiY6OPuW0+rrM1fPMgbZPUAPG/u+NUx6vKpSEsjUrvfy9z7u2UwmICCQgIvCsz2/YrhGPfDySqc9+wXf/mcJ3/5mCwWCw/YNBg2BfmnYp66QMbhzKPW8/dNrx7B3tueqOnlx1R8/zfoa6SMGkiIiIiIiI1Hp5Rfas3ZyNnd1KXnvtdcaOfbFa7nvDDTewcOFCAKZPn87QoUNtx4qLi2nevDlXX3217b3yDW7KlZScWBuvqqnT5yI1NZXIyEigbOOdXbt24erqajs+Z84coqKicHIqmxqbnVN5A5s5c+YwYMAAfvzxRwAyMzN5+eWXefnll3F1deX111/n3//+9wXVWdclJSWRkJBAQkLZ+oVGo5FBgwYxePDgOt3ZVh+16t6G1xb/l/UL1nB4fyrmUjMBEYE0u6IFgQ2DzzyAnJE2vxEREREREZFa4+bHj3Aos/Kvqo+9ls2SyUY+fN6DI6Y1vPLKq1VcfXHt2LHDFkq++uqrFUJJACcnJ/bv309mZuXOxHIXc/W0li1b2l4nJiZWCCUBBg4cyJdffklxcdlOxGazucpx5s6dS1ZWFsOHD6/wfkFBAWPGjLEFm1K1mJgY4uPjGTVqFEajEZPJxMSJE+nevTtDhgwhLi6upkuUi8jJxYlut3bnllG3MyD2Dq66o6dCyYtIwaSIiIiIiIjUGgVFBu4YmUjK0Yq/rk59zR0PFwtY4YPnPchKXXvJpx+//PIrttdjx4495Xnl4WVVTg75/tlNeS4KCwvJysoCYOTIkRXWojzZAw88UGnznap4e3vzzTffYLVaSUtLq7B2Z0lJCc7Ozudda31gNBoZPXo08fHxFULKhIQEYmNj6d69O7GxsQopRc5AwaSIiIiIiIjUGmazmYWLfmPAY/tJzjjFxhp2sG5bHmlJ63ju2Uu33uSsWWcXKoWGhp7y2HPPnagvPT39vGvZtm277fU/Ox3/6VzXsgwICOC9997DYrHQvHnZmnnFxcUUFRWde6H1UHlIOXPmTGbMmMHgwYMxmUzExcUppPwHrz+34vvz6pouQ2oRBZMiIiIiIiJSq7i5ubF8+WIGPXGAQ5mVw8k7YzOZ8k4wn/7Hi2PpO3n66acvSR0n7659vh599FHb69atW5/23NMFgW5uJ6Ztn7xu5bn48ssv+eijj055H4PBwPbtJwLQrVu3ntd96iuj0UhMTAzjxo0jPj6ecePGnTKkXLlyZU2XWyM8NuzDf/afNV2G1CIKJkVERERERKTWcXFxYfnvS7j98QOkHK0YTk5/15eoIMAKn451ozRvD0899dRFr+Ghh07ssvvP3bZP9swzz5zymMFgwNfXF4CjR4+edjr33XcNO+Wx8k5GgIkTJ57yPOCUa16+++67PPnkkxV2GK+q3qruKefGaDQyePDgU4aUQ4cOrZ8hpcWKfW5BTVchtYiCSREREREREamVnJ2dWfHHMgY+foDE9JPCyZP3k7HCll05HMs4wJNPPnlR7//aa6/ZXoeHh1d5zoIFC5k9e/Zpxzl5Cre9vX2V53z//ffMnjPLFmL+k8Fg4JZbbgHghx9+YOfOnVWe5+7ufso6Nm3aBMCDDz5IXl5eledkZ2ef1Vhy9qoKKaOjoyuFlBMmTKg1IaVz4hFCPpyHU9KRizquwWIGg6IoOeHMK+KKiIiIiIiI1BBHR0f++GM5V/foycwPIgn3r3j8zmeP8sKIIHp2gSfeNPHYY4/zyScfX5R7u7i4MH/+fPr27Utqairu7u5s2bKFqKgoiouL6dKlC5s3b2bu3LkMGDDglOMYDAZycnLw9PS0fe3n58f1119PcXExv/32G/n5+fz444/cdtttpxxn7ty5tmCzRYsWDB8+nG+++QaA2bNnM3ToUPz8/HBzcyc9vXKg5OLiwpo1a+jatSseHh707duXX3/91dYl+fnnnzNixAgADh8+fH4f2kVmMpnOeE5SUtIFjXE290hOTr6gGs50n/KdvaEsyIyPjz/jeJdSyKRf8fpjKwGz/qTE34uiiECyr25DVu8OlHq5nfe4BrMVs6t2fZcTFEyKiIiIiIhIrebg4MCf8Su45upezHjfSJjviZbJ6e80KHthhY+ed+OxNw4xYsQIPvvss4ty7xtuuIG8vDwCAwPJy8ujUaNGtmMvvfQS69ev5+DBg2ccx8PDA4vFwosvjuXNN98gIyOD6dOn257v8OHDBAYGnnYMOzs7rFYr/333HZ559jmmTZvGtGnTAGjQoAEHDx4kNDQUb29vAAoKKk+Z7dKlCyUlJTz44INMnToVO7uK3WsRERHs3r27wm7il9qQIUNISEiotvvVdjExMTVdAgDFIb4URgVhcXXGOfEIoe//RMhH/yMnpgUZt8WQ06XZuQ9aasb
qqh3f5QQFkyIiIiIiIlLr2dnZ8fuKpfS8phffTwjB6Ff1dNA9ibn4NTAzfPhwW2h3sr59+2K1Wqu48tTc3NzIzc0FwGKxYDAYKqzF2KhRo7Ma02Aw8MYbr/PGG69jtVqxWq2VgsFyp9tZ++lnnuXpZ5613fPkWgCOHTt22jocHByYMmUKU6ZMsdXxz2eqTuHh4WfsWjQajWcc43TCwsLOWMeZ7nGm42dTh9FotE3hPnmtUKPRyKBBgxg8ePBZ3edSCpi5Aq8/tnLg7fvIjmlpe9858Qh+c+LxXbAOrz+2kvav3qQ+cMM5jW0oKcXs6nixS5bLmIJJERERERERqTVKSix0v/KqUx53cHBk2JjDLPgiDGfHihvJjJ9WytWdG/Diw4489U4ed911F99///1Fre9UQeK5OlMQGBAQcFZjXOo6qsO4ceNq9P7VoTyMnDVrVoUQ1mg0MmrUKAYPHlyD1VVkdSj7GfdYt7dCMFkUEcChUbeR+nA/GsV+SeA3S8hvGUH2lS1PNVQlhlIzFpdz78Y1lJixOla9Pqtc3hRMioiIiIiISK0R/23IWZ5ZeXfrMcNP/Ir7/rOuPPlWPoMHDyYuLu4iVVd9arprTi6O03VHjh49ugYrO7VS37JuXbO7M+7r9+Kxfi8Zt8VQ6nt8jVSrFbOHKwB2+YUVrjVYrHgv3YDrnhSKQn3Jur5ThSDSrrAEi7tLlfd1SsmkwYJ1YLGQE9OS/JZl3afuG/bRaMwXJL14J1nXtqt0XeA3S7C4OZM+6NT/oHE2DGYLVvvK//BgV1SC+4a9AOR2bILV6cTfM27bk7DLLSC3a8Vp7Y5HjmGfV0hhw6ALqqk+UDApIiIiIiIidZKHuwNpOzMZOGAQc+bOqtFazGYzEydOZMyYMac8p7S01Pb6u+++q46y5BKo7VO1z+j4z6HF2ZGAH1bg9dd2Amb8TrHRH0OpGaeUTAzFpWT078axXh1slzmlZNLw/6bgsi/V9l5A3B/s/vxJLG5lYaR9QRGlAd6Vbhk4bSlBUxdhKDEDEDR1MabY28ns3w2n1EwMpWY8V+2oFEw6J6YRNHkhmTddYXvPc+0uGvy8GofsfEp9PDgy+CoKWkZUuM5gsRLw/TKO9u1Cib8XoR/Ow3/Wn6QN60XqQ31t57nuSibile9wNqUDUOLvxf7xD9kCx4iXv8Vghe0/PH9ibLOFJo99jGNaFlt/fRXzKYJYKaM92kVERERERKTO2bATVm04yu9TggkPMjN40JAaq8VqteLg4EBsbCx5eXmnPG/kyJEABAcHn9WaiFJ7mEwmJkyYwJAhQ+jevXuFHbZHjRpFfHw8o0ePrv2hJGB3PBwEyLg1BouzI3ZFJbjsTcHqaE/Wte3ZN+FhkmNvx2pXtgyAfVYejUZPwtmUzqGnbmX7nLGkDeuFc+IRvP7abhvPUFCM2a3i5jcBM1cQ/OV88to2ZM/nT7Lrq9GUBHoTMON3AI71bEeprycuB9Iq1Ro0ZTFWFyfS/tUbgMDvlxM15kvc9qRQ6uOB1dGBqOen4L5xf8VnzC8k+Iv5eP25Fb+5f+E/68+y679bhvOhDADctiXS+MlPcUrJJPPWaDIGxOCYnk3glEVlz1JcimNaVqWw1H3dHhzTsshrH6VQ8iyoY1JERERERETqnA7NYenXZV1N7z/rxjPj8xg65C5mzLy4a06eDYPBQPv27dm4cSMeHh6sW7eOjh072o6bzWaeffZZPv30U+zt7UlJSan2GuX8XPbdkVUwFB/v3LVYyYluwZ5Pn6DhC1NxSskk65p2thDwZEFTF+OUksnRfl3I7dgYu8JinBPLgsSTd+G2LyiqMLXb8cgxgr+cj9nLjfTbr6LE3xu3rQexzy2i1OhRVoaLE0f7dcF3zl9gtcLxNVE91u3FZ+lGkscMpCTQB/+4Pwj+/FfShvci9cGyrkfv3zfTYP5ajG//wK5psVgdjq9TeXyvqgYL/sZ1RxL5LcPJuC2G8Ld+wHPldopuv4rw16djsFjY9/4I8to2BMAutxCrU9nmPQ5ZeRgsVopC/Sp8Fr6/rQUg5ZEbz/dbUK8omBQREREREZGaUY17rrwb687zE3K5+67hfPd95d26L7UNGzaQnp5OkyZN6NSpU6XjBoOBzp07s3bt2mqvTc7dhAkTKm1kM2rUKIxGY63ayOZ8GI7v9m5fWAxAYeMQdn/xFJGvfEfwVwtwNqVjir3dttaiY1oWfj+vorBhED4L19HgtxM/w7mdGlfYQMeusATLSWs0Bn63DErNlPh70fCFqbb3rY72pDzcz/Z1drcWBHy3DPcN+8jr2BiHY3mEv/0DuR0bk3HzFRgsVoK/Xkh2j9a2UBKw1eJ8KAO//60ifcCVADgczQXKuiKLQ3w58NZ9lPq4EzzpN5xSjuKyPxXn5AzSB11lCyUBkl6880SNxzcJsrOcWO/WbVsiPks3cqxXe/JbR57T515fKZgUERERERGRamUwGMjOg+53H6r2ezs75fHll1/x4IMPVPu9/f39ycrKAmDr1q1kZWXh5+dHixYtqr0WOX8mk6nCVO3avJHN+Sj1KetUdEw9anvP7OnK/ncfIGjSbwROX45TSib737kPi5sLnit3YCguJemluzE72uP76xocjuWT17YhWdd1tE33BsBqxSE73/al9++byb6qNYkvD8Nn8XrcN+yj1N+LrF7tKYwKtp2X3yaSUl9Pgr9eSNL/DSH8nTjssvMxvf8IGAzY5+Rjl19EXtsooGwNycCpi/FauZ3sHq1xX7+PoCmLONa9FSWBPjimZwNl62juf+d+ShuUPXNBq0hcDhzGcryz0nVvCobi0gob3tg+Ew9XzK7O+M5LIK91JA7ZeYS/PgOsVtKGV+4qlaopmBQREREREZFq5ezszNJli2u6jBrVunXrmi7hkjm5i7AuMhqNjBs3jpiYmMtyqvaZlASU7cptX1Bc4X2rnYHUETdS2DiY8Ld+IOLV6Rx4+z4ccsqCRkNxCcWNgkk9zRRmi5MDzgdPrBVpn5OPXXEpVns7jt7QmaM3dK7yOqu9HYfvv56wcbNpMfRtMBhIHHsnxSG+AJR6ulLYMIigrxbgvvkALnsO2daGTB41ALctB2g05guaPP4xiS/ciX1uAQBH7upJUWSg7T4FTUMJmLGCkmBfcq5ojufqnTQd8SHpt8WQ37Zh2aY3x6eSW50cOHL3tQR/OZ+mD71vG6OwYRAFjYKRs6PNb0REREREROScGI3GOh8+yYWri6Fduct1/cizURTqh9nVmZzOTas8nnVdJ5LG3oVzctkmMXltGgIQ/OUCDBbracfObxeF59pd2OUXln3dpiEeq3fivmHfGevK6N+NjAFXUurrSdILQ8nqfWJHcAwGkl66m8KoYDxW78Tq6EBy7O2Y/j0Qq52BvHZRJL48DPvcQiJf+Y7cK5qR8tjNHLmzZ4V7ZN50BaXebtgXFHHgzXtJH3AlzklHMI6fQ7N736N1/5eJevorAqcvByBteC+SnruDnC5Nye3UBIBjPdue8VnkBIPVaj39T42IiIiIiIjISYYMGUJCQg
Lx8fF1NpyR89e9e3dMJhMHDx6s6VLqncjISHxD/Xl14TsXNI7BYq04BbsqJ21E0/C5yWXTpmNakjriRooiArE/lofnqh00WLgOl32p7Jz+LJRacN98gJzoFljtDGU7Xz/xCRZXZ1Ieu4msa9tjdXTAdV8KPos34PXHFo5e34nD919/Qc9Tzq6oBKvBUOXU7FNxOJqLx9+7cduWhOtOE04pGZQE+rD348ex2p/o92vw21rC3/6BHd8+TXF4wAXX+v5977J7zc46//espnKLiIiIiIjIeUlKSqrTvzDL+TGZTPq5uMydMZQEWygJkPjKMIxvzcRn2Sa8Vm4vO3a8D87qaM+xq9ticXTA6mpP9pUnNsPJbxXB/nfuJ+LV7zG+Owvju7MqXFsS6ENe+0YX7bkszo7nfE1pAw+y+nQkq0/H057nHb+V/FYRFyWUrE8UTIqIiIiIiMg5iY6OJiEhoabLkFooLi4OgJiYmBquRKqTxdmRxJeHkTYsBa+E7dgfzcXs40FB01DyOjTC4uJ0ymtzuzZjx6wX8F62EZfEI2C2UBTmR16HRhRFBJ7yutrEPisPz4QdHHrilpou5bKjYFJERERERETOSXR0NACzZ89WACVVCgsLq+kSpAYUNgmhsEnIOV9ncXbkaN8ul6Ci6tFg0ToAsnq1r+FKLj/a/EZERERERETOSXh4OAArV66s4Uqktpk1axZQtze+EfmnBr+uIadLM8xebjVdymVHwaSIiIiIiIicE6PRSHR0NCaTyTZ1V2TlypW2Kf6DBw+u4WpEqofrrmRc96WS17ZhTZdyWVIwKSIiIiIiIuds1KhRAEycOLGGK5HaovxnofxnQy5/VquVLb9v5EhiWk2XUmvZH8sDoDjMr4YruTwpmBQREREREZFzFh4ebuuajI2NrelypIZNmDDB1i05evToGq5GLpak7Yl89vgHvHbLi3w15lNW/RRP9pFjNV1WrZLfLorUh/qSfWWrmi7lsqTNb0REREREROScGY1Gxo8fT/fu3YmLiyMsLEyBVD0VFxdn65acMWNGDVcjF1NIoxCCooI5vD+V9QvWsn7BWgBCmxnp0KcTVw/thYevZ7XXtWPlVlL3pnDN3b0xGAxndU1xYTFLJi+gw3WdCGly8TZnsjg7kjas10Ubr75Rx6SIiIiIiIicF6PRyLhx44CyabxDhgzBZDLVcFVSXcq7Zcs7ZkeNGqVd2usYRxcnet/bF4DON3bjoYmPc/WdvSgpLObXT+bxYp9Yfv5wLlartVrrWjDpF2a9PZ3fPp131tcc3LyfXz7+kY8efo9jaVmXsDo5F+qYFBERERERkfM2ePBgjEYjsbGxJCQkMGTIEGJiYrj99tsVUtVR5Zsenby+6Lhx47ThTR1VkJMPwHX39cXYMoL2fToBcDQ1kz9mLGPBF7+QnnSE4W8+gL2DfbXUdO87D7N06kJ2rd7BDQ/ffFb3bdq1OQ+Mf5TVP69k8/INXHVHz0tfqJyRgkkRERERERG5IDExMcycOdMWVsXFxREXF4fRaMRoNBIeHk5YWBhGo7GmS5VzdHIHbEJCgm0dyXLR0dHqlKzjDu1OxsnFidBmFf/8Ngj2pcfQa9m/cS9rf11Ft9u60/LK1uRn57Poq185tMuET1ADoto3puvNMdg7VgwP96zdxbb4Lbh6uNLlxm40CPGtdO/D+1M5tMtEWHMjgQ2Dbe97B/ow4Ok7Kp2/f+NeNiz8m52rt3PkwGEi2jTkqcnP2I53vKELHW/ocqEfCeZSc7WFsHWdgkkRERERERG5YEajkdGjRzN48GDi4uJsIZbJZKoUZsnlzWg0MmjQIFu3rNRth3aZsHOwZ867M3H38SAn8xiH96eSui/FNiW6c78raH5FC8wlZj4ZMYEDm/bZro+ftYKl0xbxwHuPEtQwmJKiEqY+9wUbFv1tO2f+Z/9j9LTnMbYIB8BSambe+3NYMmWBbZr4tcOv4/Znh1ZZ4561u/juP5Ntu4d7BXgT0TYKf2PAGZ9v/4Y9/PLxPPat342jsyNhLcLpcce1tgDTYraw+Ovf6HZrd7wDfZj9zgyWTVvE9Q/dxC1PDTyPT1ROpmBSRERERERELprygBLKuu2SkpIqdN1pDcrLT3n4aDQa1RlZz1jMFlL3HaKkqITl3y0GwMXdhYCIIJp0bkZEm4Y07tiUyDZRGOwMrP11FQc27aN9n04M/c+/cPd2Z++63cz//H98NfoTnp/zCl+P+ZTNyzdy9Z3X0vveviTvTGLSyI/4Y8ZS7nz5HqwWK5OfncT6BWsJaRJG575dWbdgDcumLaL74GsIbhQCwLdjJ9NvRH/8wvz5a/YKWyj5yEcjaXNNuyo3xdm1agf7Nuyh7yM3YzFb+P7lqaz6MR6r1UpkmygiWkeyb8NevhrzKd1XXs2g5++itLiEee/PwdXTDavVyrJpiwBY9OWvXDnwKvzDA6vpu1E3KZgUERERERGRS6J8KreIXJ6OJKZRUlRC6x5tueqOnoS3isQnqMEpz9+8fCP2jvYMeXEYnsd3627atTlNuzYHYOPidWxevpHINlF0vSkaewd7Du0q+8cKZzcXANb+uor1C9bSvncn7h8/AnsHe9pe24GPHhqPnd2JsHH9grX4h/nTd0R/+j81kKOpmexes5Np//clfe7ryzXD+uDs6lyhvqQdB5n/+f+4/oF+FBeVkDD3TwD63NeXW0cPwnB8/K0rNvF17OdYzBYGPD0EgFXz/uLglv00bNeIHkN6Mu2Fr9myYhM97+5zMT7qekvBpIiIiIiIiIiIVJJ28DAA19zdm1ZXtT3j+dlHsnB2dcbL37vSMavVyi8f/4hXgDcZh9IZP+wt2zHvQB963XM9AJuWrgdg2Ov32dZxDGsezlsrJlYYz83LjaQdiUDZepdPTX6GjUvWE//Dcv734VyWTF3INXf15uo7r8Wjgefxa9wpLS4ldV+Kbc3M8JaR3Dam4sZNra9uR5ue7Vm/6G/63N8PgAOb9uEX5s+Ij0bi3sCDnybOJjM544yfiZyegkkREREREREREakkNzMbgNLi0rM6Pz87j5KikiqPpR04zKHdyQz+v7voenMMf81eQcqeQ4Q1M9K1f4ytw7I8jNy/ce9pw1Avf2+SdyRVeK997460792RtIOHWfTVbyz44meWTJ7P3a/dR6e+XW2BqWlnEqHNjDg6O+Li4VJp7JQ9yWxdsYlGHZrY1tF0cnHi0U9H4XG8zqh2jUnde+isPhc5NQWTIiIiIiIiIiJSSU5GWTCZnnTkrM43l1ooKSqhMLcAFw/XCsfys/MAKCkqwc3LjT739a1yjKvu6MmGxX/z2eMfcPXQXrTq0YZGHZpUGs870IdjR7JsX09++nM8/bzoP3IAgZFB3P3qvdz46C1Me/Erpj7/BeGtIvEO8AFOBK3Rt3Xnj5nLmfLMJJoe37xn7/rdbFi0ltAmRu4fP4KdK7cB0Of+vrb1LQHCW0aw6Ov5mEvMlXYcl7OnYFJERERERERERCrx9PMCwM/of1bnN2zXiAzTEdtO2icLaRyKi4cry79dTLdbr
rSN/U9NujTjiUljmPn6tyz/bjHLv1uMwc5ASONQIts2oufdvQlrHk6H6zrj6Oxou87Ny43l3y5m9by/iOrQBGc3Z0oKi0naloi5xEzesVzCW0ZibBmBsWUEALfF3oHBzo4tyzey9tdVODo7YmwZybDX7qdz367YHV/fcuDTQ7hqSM8KdcYM7MHKuX9SmFeAu4/HWX0+UpnBWtVPi4iIiIiIiIiIXFYiIyPxDfXn1YXvXJTxLGYLOxK20ap7m7M6v6igiEM7k4jq0KTK40unLmTOf2cSFBXMHS8Mo3GnpphLS9m7bjer561k1+od/OutB2gR0xqr1cr+DXvZt343+zft4/C+FLLTj3HnS/fQ8YYuVY6/LX4Ly6YuJGXvIbIzsnHzdCOsRThXD/1/9u48LKry/eP4e4ABBBFBFGVR3BVXzAXcVzT3Bco0Nf2mmUthufRLM5f65pqaaRl+1dRyQTO1TExcUgTDNTVTc4NRURZBAYFhZn5/DHMEwV0ZlPt1XV7MOXPmnIdhkfnM/Tx3W+q183ng2E3xWH7dvM1hweBZnIs6Q3h4+EvdREwqJoUQQgghhBBCCCFEHhaWFo8cSgLYFLO5bygJ0HaQP1mZWn5ZtJmFb89BpVLlCgQr1KmIa8VyynYlnypU8rn/+e7l3az2Y403p8ISSBY1EkwKIYQQQgghhBBmtGPHDhITE+nbt6+5hyLEc+c/tAu+PZtzZEcUN68morZVU7ayGzV8vZXGMqLokGBSCCGEEEIIIYQwo4MHD7Jv3z4JJkWRUaK0I637tzf3MEQhYGHuAQghhBBCCCGEEDklJyebewgFSqfTcfv2bXMPQwghCpwEk0IIIYQQQgghnsr58+eZOnUqFy5ceOpzzZo1i/r163Py5MlnMLIXg06nk/XthBBFkgSTQgghhBBCCCGeysyZM1m2bBlt2rShcePG9O3bl++//56bN28+1nkMBgNbtmxBr9c9KIrIAAAgAElEQVSTnp7+0OPXr1/PpEmTSEhIeNKhFwo6nQ47O7sCu17Lli355ZdfCux6QghxPxJMCiGEEEIIIYR4auXLl6d9+/b4+vpy+/ZtPv30Uxo2bMiwYcPYt2/fI50jPDycmJgYAJycnO573IkTJ+jTpw/jxo1j1apV7N2796nHn5CQwMqVKxkyZAhNmjShevXqdOrUidOnTz/1uR9Gq9Vib2//3K9jEhsbS1hYWIFdT4jCbPs+S85H343H9HpYvdWKWylmHFQRIs1vhBBCCCGEEEI8seDgYEJDQ1m2bBnt2rVT9p8/f57vv/+eDRs2EBoayujRoxk7duwDz7Vy5UrltqOjY577dTods2bN4rvvvkOv1wPQq1cvunXr9kRjnzBhAsePHyc1NZWYmBhUKhVeXl54e3tjaWlJREQEISEhTJ48+YnO/6i0Wm2BVkzqdDouXrzIsWPHiImJ4c6dO5QsWZIOHTrIlHJR5Bw6ZcnNW3oqlzf+TsnQqth3yBLvynoaeOvNPLqXnwSTQgghhBBCCCGemJWV8WVlRERErmCycuXKTJs2jfHjxzNgwAAWLlxI/fr1ad8+/068u3fvJjQ0FHt7e1JTU3FwcMh1f1JSEiNGjCA8PJyaNWsycuRIfHx88PDweOKxHzhwgOjoaMqVK8fUqVPp1q0bzs7Oyv0xMTFYWloq21qtFrVa/cTXu5/nHUymp6cTEhLC5cuXuXz5MllZWRw9epQePXoox7i7u9OkSZN8A2EhXmZ6A6Rn3N02GIwf09MlpC8IEkwKIYQQQgghhHhiZcqUAcDe3p7IyEjCw8MZOHAgpUuXBozrRpYoUQKA1NTUfM+RmprKxIkTqV27Nn5+fgQHB+cKAC9cuMDgwYOJjo7mk08+4T//+c8zqexr2rQpMTEx7Nq1K99g0NPTU7l9+/ZtWrZsSdeuXZk+fXqeY3fs2MHOnTuZNWvWY48jLS0tTxD7LC1fvpwZM2bk2ufs7MygQYOoX78+tWrVUr5ejys0NJQdO3aQlJREnTp16NixIzVr1nwWwxYvsNh4FXujLGndWIdrKYO5h/NABj25fp/o9dnjlcUPC4QEk0IIIYQQQgghnphWqwXAzs6O4OBgdu7cSXBwMF5eXmi1WmJiYsjIyKBfv373nXI9YcIEEhISWL58OVu2bAHAwsKYCty5c4e3334bjUbDkiVLyMzMZODAgfj7+9OuXTvc3NyeeOzOzs7Y29sroaRWq+XPP//k2LFjnDlzhtu3b+Pg4MDgwYMpXbo0iYmJ7N69O895srKymDZtWp4wdcmSJcTExODo6Ej79u3p06dPvuNIS0ujXLlyyvb27dv5+++/ef/993NVbIJxLcxSpUrl2rdv3z7WrFlDUlISLi4uDBkyhPr16yv3t23blhMnTuDt7Y2Pjw9jxoyhcuXKBAUF5Tseg8HATz/9RHBwMOfPn6dkyZL4+vryzjvvULt2beW40NBQhg8frkyr37lzJwsXLiQoKIhRo0bJtPAi7Oedlhw9bcmuSEscHQyUdYEG3joa1dZhX3CrFjwSnR5srA25tgFsnn1xtMiHBJNPYcqUKbm2bUuX53qqnthUPbEpudchKFa6PACu9sb/XMsWtyA9Lvrutv39o/hLly7l2Td83OQHPkYIIYQQQgghCkJmZqZy+8033yQ8PJw7d+5w+vRpatasSdeuXQkMDMTPzy/fxy9fvpytW7cyc+ZMqlevjkqlUqaHg7Hj9/nz55k3bx7+/v589NFH/PHHH/zxxx9MmjQJLy8vKlWqhJeXFy1atKBp06bY2to+0tjj4+NJSUmhV69eyjqTaWlp2NvbU7FiRezt7UlOTiY0NJSPPvqI9u3bExYWRlpaWq4Ky/Xr1xMTE8PSpUsB2L9/P8OGDcPGxoZGjRrh5ubGJ598QkpKCoMGDcozDtM1TebPn8/p06d56623ck0tj4+Pp3HjxmzYsIEGDRoA8M033zBjxgwqVapEnTp1UKvVDBkyhG+++YYmTZoAUL16dRYvXqycx93d/b7Vq7GxsYwaNYqoqCgA/P39cXR0ZN++fWzfvp3JkyczYMAADAYDM2bMwMLCglWrVuHr60tSUhJr1qxh1apVeHp60rNnz0f6Oohnx8PDA41GY+5hAFDayUC5MnpsrOF6vAVrt1mxfrsVdavpaNVYR81KhaOSUq8Da+uc28ZAPWdYaQ4JVxIAnmq5iheBBJNPaPDgwaz9dRdlW7yRY+9Z0uOjcx2XHheNbXYoadq+V877lX0uefcBJJ3eT9LpcGYuXs6myLN0qmTzZJ+AEEIIIYQQQjwDGRnGxdl0Oh1t2rRh06ZNDBs2jOjoaLp06cLo0aPv+9jjx4/z2Wef0aNHD/r27UtKSgqXLl3CYDCwevVqunXrxrp166hVqxa9e/cGYMSIEdjb27N//37++ecfLl26pEzHXrZsGTVq1GDDhg2PNDU6Otr4+uzIkSO4urrSv39/2rdvT8OGDXOFoyYDBw5k586d/Pvvv9StWxcwVjDOnj2bHj160KFDB44ePcpbb72Fn58fS5Yswc7OjsTERJYuXcq0adNo
0aIFlSpVynXe1NRUJZjMysri3LlzVKpUKVcoCcYKRZ1OR1JSEgBLly5lxowZuRoL/fbbb2zYsIFx48axa9eufD8Pa2tr4uPj831OTpw4QVRUFNbW1ixYsIDOnTsr4/ruu++YNGkSdnZ2+Pj4KFPsmzdvDoCLiwujR49+4NdcvPx+P2CslhzZX0vdaneLtmLjVew+aEnkMeP9XVrp6N42y4wjNcrSq7BVG3JsGz+aO5gsKiSYfAIrVqxgxYoVtPkh0QxXn8DFjTO59NNMalhcA7zMMAYhhBBCCCGEMDJN401LSwOgZs2a/PLLL4waNYo5c+Zw8eJFZsyYgXXOkiSMgd7w4cPJysoiKioKPz8/YmNjlfNt3LiRBg0akJaWRvHixZXHlS9fnk8++QSdTsdrr73GoUOHOHv2LHFxccyYMYOff/6Z7du3ExgY+NCxJyQk0KhRI0aNGkXz5s3zDfFy8vX1xdbWlrCwMOrWrYvBYGDcuHGoVCplRt2CBQtwdXVl0aJFSlXlhg0bAGO4N2vWLL799ttc501LS1OeH4PBQFZWFhUrVsxzzNdffw1A48aN0el0zJs3j44dO+bqdh4SEgLA5cuX+eGHH/Kt0NTr9crzbHLkyBF8fHyUQHfQoEFKKAnGJkcjRoxg2bJlbN++HXd3dwAqVKjwwOdMFCxTxWTClXhKubuYZQym1QfOXrDIFUyWdTHwRpcserbX8dUqNb/utcTL3UDd6jqzjNMkK8uAtU3ObeNHG+v8jy8oiVfzf/PgZSPB5BPy6j3BbNcu1/INLv00Ey8vL7ONQQghhBBCCCEAZb3DK1euKPscHR1ZsWKFEsLFxMSwYsUKpSowOTmZIUOGEBsbi5WVFY6OjjRo0IDatWtz/Phx1q5dy+TJk/H29qZy5cpERUXx+++/06FDB+Ua6enp6PV6rK2tUalUlCtXjiZNmvDzzz8rzXYeRqPR0Lx5c5o2bcqtW7dITU3lxo0b3Lx5E71ej5OTE05OTlSsWBFLS0tsbGxo27YtK1eupHPnzmzYsIGwsDAWLVqkVDdevXqV6tWrK2MICwtj7ty5eHt7o1Kp+O2339i7dy+tWrVSxmEwGLh58yYAarUaFxcXjh49SmJiIs7Ozty+fZuRI0dy9epV5XPXarWkpKTQqFEjwFixunDhQsLCwujYsSMREREsWLAAf3//XOtXgrGy8cKFC8r26dOn6dWrF1u3bsXGxpjQ5AyDTTZt2kRcXBwNGjQgOTlZGYsofBKvJpgtmLQvZqw0dHAwsOhHK/46Y4m12kDpUsZp0/E3VWizoEVDHXVyBJdxN1V8s0bNlet31yYt62Lg/97JxDY7JLydqmL5T1ac+te4tJ2lpYEhvbNoWDt30H49XsWOA5bE31RhXwzqVNfjVy9vAGowQEqaCqfid6sjb6UYr+/oYP6KyZd9GjdIMPlE8lvzsSDlN/VbCCGEEEIIIczBFHrdu2ahpaUl//d//0fNmjX58MMPGT16NMuWLSMuLo7+/ftz6dIlli5dStu2bXM1SVGr1axdu5azZ8/i4+PDf//7X95++22GDh1K8+bN8fDw4MaNGxw4cID09HQmTJigNIgJDQ3FyspKWVvxQVJSUrhz5w7Lly9n+fLlDzx23rx5ylTyoKAgunTpgr+/PwADBgyga9euyrGNGzdm1apVDBgwgFu3bnHs2DF8fHxYsWIFWVlZ9O7dm7fffptPP/2U/v37o1KpsLW15d9//1XOERQUxKRJk+jRowcNGjQgKiqK69ev06tXLzZt2sSmTZt4++23qVq1KnPmzCEqKorTp08THR3Nm2++yfTp0zl06BBvvvkmvXv3Zt68efj6+irnd3d3JyEhgYSEBEqWLMmKFSuwtbWlcuXK2NnZUaVKFRYvXkxKSgrVq1dXmv5ERETQt29f3n33XX799VcA4uLiHvpci4Lj6elJZGSkWcegy7r789yykZ5/LliQqVVxJRY8yuppWEuPn4+O6hXvBn+3U1XMX6EmOUVF385ZNPDWs+ugJdv3WfLXGQsa19GTkgazl6m5Hq+ibjU9HmX17I2yZO02KxrWvrvW7T/nLVi8Vo3aCqpW0OFUAtb8YkVGBrRunDuczMwyhpM2OSomM439vLA148p5CVeM1ZISTAohhBBCCCGEEA9QoUIF7O3tlXUG79WzZ0+srKyYO3cuAHPnziUxMZFVq1blGyCaunHHxMQAxunTW7du5dNPPyUyMpJDhw5Rrlw5unXrxjvvvEOVKlUA+Ouvv9izZw+dOnWiZMmSDx23Wq3GycmJlJQUtFotbm5u9OrVC39/f7y9vUlLSyMrK4vixYvnaqZTvXp1Zs6cyeeff06vXr2YOHFirvOOGzeO+Ph4/vjjD4oXL86wYcP44IMPKFasGIASWk6dOpVatWrh4+ND48aN2bRpk7LW5IABA3BycuLzzz8nNDSU1q1bM3ToUOrVq4der+fcuXOoVCq+/vprxo8fz969e3Fzc+OLL76gX79+gDEgXbRokdIhe//+/crn8corrxAcHEz79u3R6/UkJSUxfvx4paI1ODiY2bNnExISQlJSEiVKlMDPz49169YpAae3tzc2NjZKhaUQJtrs7M+ghzpV9Xz0tpZv1qqJu6migbeeLq3yVi7+uteS+CQVTevrqF5RT6YWricYA05TteRPvxtDyT7+Wfg3M57DuSTsCL/buf6ixoKvfrCiupee4W9osVFDShrsjLBk/W9W1Kysx7XU3UA0e4ncXNO2MzJBpQJrScwKhDzNQgghhBBCCCGemIuLCydPnlQCxfx07dqVLl26ANCnTx/Gjx+fp7GLSZ8+fUhOTqZ169bKvooVK7Jy5coHjuPs2bMASmXjw9jY2BAeHk5mZiaZmZm4urrmuv/eNTHvHWOfPn3yvc/R0THPGpI5VahQgV27dpGenq5Ml540aRIdO3ZUwkswPmc5KzFNvvrqK+V2jRo12LJly32v1aFDBw4fPoxKpcoVIHbs2JEOHToQERFBq1at6NevX65guVKlSnzzzTeAcT3K/L62FStWJCQkJM9amMK8TGt/nov6h6qNqpt5NOJJnYs6AxgrYF92EkwKIYQQQgghhHgqDwolTUzTtU1rIj7IkCFDHnsMBoOBypUr07Zt20d+jL29vVIlWJCsrKxyreHo6OhI+/btn8u1clZ7mlhYWLB06dJHevyDvrb16tV74nGJ58NU0WoKtszBkF2QmKk1/sy7lzXw8XAtweut2LLLirgEFW92z8LUa+pmMuw7bEm50gYO/mXJgWN3KyBrVNJTu6px/cjj/1jgWU6vVEsCtHhFR4tX7m7/useSkg4w7DVjtSTAgaPG8+n08PNOK955Xascn5E9xpx9rzIyjQ18cqwwUeBMjW8eZVmKF50Ek4/AtKak6WNSUhKpmoukx0UDsuajEEIIIYQQQphbQEAAnTp1Qq1Wm3soQpiNqcIu4UqC2cZQ3M6YTCYk3d1nZ2tg9JtaNoVZsWO/JfFJFox609jU5q+zlmRlwdBALVZWEH7EgpQ0C6pU0NOkjg5TNm5paSDploq
bySqcHPNvTJN4S4V7GQPFsvP4v85asGW3FZ7l9ICKI39bcOpfC2pVyW6Wk32a1LS75zAYVGRlQYYWJdwsaKavn5+fn3kGUIAkmMzh0qVLrFixgr179wKwZ8+eBx4f92feknkrO0fjR3tHbF08UTu4YGn79O/A2bp4otPq0KWnkHwmEvsKjQg9nULHmnk7pQkhhBBCCCFEUaNSqXBwcDD3MIQwKw8PD3x9fYmMjCThSrxZOnM7Z4eG6Zm5Sw4tLKBPhyw8XPV8v0nN/0LUjOyv5U52Y3dtFri7GujdQQfkXYeyna+en363ZPq31vg3zaKal54KbsbA0qRKeWNDnAUr1dxJV3HxiopKHnpGvZmFTmdg1lJrFq9R81qnLFo21KHOTsWuxamoU814W602ni82zoIKbvp7h1EgDm4OB6T5TZEyZcoUpk6dmme/pX1Z5bYuNfaB9wNkZR+TlZZMelw0lnZO2Hn4UKL6o08nyCkz+QoAqfG3sveosPPyI+6Pb7hzR3v/BwohhBBCCCGEEKLISryaYJZg0sXJgI011KiYf6jXpK4eSwstW3YbI6lKnsYgcHOYFaPf1HK/1QM6Ns/C1sbA1t2WbNppfKzaCjzLGajorqdb2yx6ttdxO1XFqX8tsLUx4N9MR7c2Oqyzw8b3B2lZ8L2a9dutKF/OQEUPPaWdDBz920KZIl7J04ClBRz52zzBpGkafmBgYIFf2xxUBoMh//rXIsa03km7Ln05GG2NdZn6WLs+/noZutRYdCnXuXMxlMzrx9GlxqIu6Ua1UaHPdLynPqvDpsOJ9Gzg9EzPK4QQQgghhBBCiBdXSEgIY8eOpfOI7nQe0cMsYzAYVKhUD46bDIa76zgu+kHNX2ctqFNNTx//LMq6GLtpnzynIvK4JVeuq/g8KBMba2Nl5ekLFlyMUXFRY0FcogqdQcUHgzIpU+rhEZdOb1z/spiN8di4RBXJKSqqlL8bQl6IUWFvR64O3gUl8udwVk9aRmBgIHPmzCnw6xc0qZgEVqxYodx29B1PccebT3wuS/uyWNqXxdq1HpnXj5MY9gHapKukXo7CvsLDF3l+HO5O9+8SJ4QQQgghhBBCiKLHtC5h5M8HzBZMPiyUNB5z9/aw17Qs32TF4VOWnDhrjUp1t4mOpaWBBt56rLJ74qitoG41PXWrQX5Tvh/G0gIllAQo7WygtHPu8ZqqOM3BNI27KDS+AQkmAfDy8lI+Hjj95KHkvSyLuyq3tUlXocIzOzUAbiVlUWchhBBCCCGEEELclXOdycifw/Ht2czcQ3ootRqGvZaFJlbPyXPGCkYHe/Asp6eal95sTWgK2rmoM5yLOoOHh0eRmcp9n5n7RYspmIyJS3/ic1Rztyd8ri8rPqyr7LO0L4t1GeN0cHVJt6caoxBCCCGEEEIIIcSjCAoKAmDb4rxNewszj7J6OrXQ8fqrWXRumUWdqkUnlATjNG64+/UrCiSY5G4wmZNDMSsWj6zF0E6ej3SOrk3KUKmsHQ0ql8BCpXr4A54BmcothBBCCCGEEEKIe3l6euLr60vi1XilmYoo/EzTuItKtSRIMJlLzq7b/du40aupK2P7VHykx3ZsYOx09c2v0eiln5AQQgghhBBCPBcRERE0a9aMiIgIcw9FiELLw8ODgIAAALYt3mzm0YhHsWriMqBoVUuCBJN5mMJJe1vLR35Mo2qO1K3oQHJqFivDrtxzvuvPdHxCCCGEEEIIUZRt3LgRjUbD/PnzzT0UIQo1UxOcc1FnJJws5M5FnSmS1ZIgwaTCNJ3b0r4sADdTtI/8WNN075VhV0hNf/yOUEIIIYQQQgghHo2pUjIyMlKqJoV4AA8PD+bMmQMYO3TLlO7CKeFKPAsGzwJg7dq1eHh4mHlEBUu6ct/HjiPxfD6oWp79i0bWolLZYnSfegRtlh6HYlb4N3DBYIAfdl/Nc7ypAjN2xywsbR0eawxqR2mYI4QQQgghhBAmISEhaDQaZXv+/PlKVZgQIq/AwEClwnjVxGW8v3wcpdxdzD0skcPqScYp3IGBgUXy95kEk/dxLTEjz1qRVdzs6N3UFYBa5Ytz7MItuvuWwUZtwd4TiVy+cee+57NJv4ZaFw9AfHx8nvtdXPL5xZB6Mdf+nLebTZjwWJ+PEEIIIYQQQrxsTFWT5nox//rrr6PRaAgPDzfL9YV4FIGBgURGRhIZGcmCwbOZtmOmuYcksq2auIxzUWfw9fVVqluLGgkms126dCnXtk5vIPG2FodiVqhUYDDAW+3vltNm6Q1YWqgY0bU8AKt25V5bEnI300m3q0ZmMWcAipXIe/3U+4wrXe2q3NYk392/f+ZMZsyY8ZDPSgghhBBCCCFeHgcPHlRue3h4oNFo2Lhxo1mCyZCQECIjIwHMGo4K8TAeHh7MnTtXCdIn+0+QykkzS7gSz+pJy5Tp9XPnzjXziMxHgskHiEvOxKWENZ4uxShmY8HAdnenVqel6+jhW4ZKZe24kpDOjsN5qyBzsvd+A2vXes9sbCknVj6zcwkhhBBCCCHEiyAkJCTPvoiICLMEgzlD0sjISAkmRaHm4eHBunXrlHByweDZ+PZsSucRPcw9tCLHuKbkbBKvxivrgBa1dSVzkuY35K2WNEm4ZWyAU9XdjoXveqO2slCmd9/J1PFeDy8Avtp8Ga3OkOfxpkY6QgghhBBCCCGejimU9PX1VfYFBQUpVZPmGg/Ahg0bCvz6QjwuUzgZFBRE4tV4ti3eIt26C9i2xZv5tOMEEq/G4+vrS3h4eJF/U0OCyQeIv5UJwKIRtajj5cDCLZc5fO4WAF+P8Ka6hz2Xrt9hzd5r9z2HdZlnVyUphBBCCCGEEEWVqelNQECAsi8wMBAwVk3mbIrzvN1buanRaKRDuHgheHh4MGbMGIKCggDYtngLk/0nsG3xZhKuPHgmqHgyCVfi2bZ4c/bzvAUwvqmybt06M4+scJBgEvDy8gLyVjhaWxmfHkd7K/b8lciM9RfYccT4g9q0phMAH39/Fm2WvuAGK4QQQgghhBBFkGk9Rz8/P3x9fZUgMmfX4YKSX4WkOao2hXhSY8aMITw8nMDAQKV6csHg2UozFvH0TIHkpx2NgaRp6vbatWsZM2aMuYdXaMgakw9w7moqUJq/o1MY/vVJ9AYDS0NjaFTNEf8GLiz+JZrdxxPMPUwhhBBCCCGEeOmZgsmca7HFxMQQFBRESEhIgVUsajQaZSw5ScWkeNGY1jc0/QzNnz+fg5vjObg5HGc3F6o2qk4p91JUbVSDqo2qm3u4hV7CFeNzByiVkSZBQUEEBgYW6bUk70eCyRx0qbFkXj+uNKmZveEiWyNvcO5aGplaY1VkeqaeQXP/wqWEtTLV+0EybxxXzg3PZlp3zm7fQgghhBBCCPGyM02dNk3d9vX1JTIyEo1Gg5+fH4GBgYSEhDB27FjmzJnzXMdiqsw0XdO05mVkZKR05xYvJNP0btP3dGRkJJGRkRzcbJravQVnN2MH71LupZTb+W2/7BKvGp+ThCsJubbzqzL18PAgICBAqiMfQoJJYM+ePcrt5MhZlO
7xAwA6vYFT0Sn5PuZhoaQuNZbEnR8CYO1aj2KVOj6bwQohhBBCCCFEEWPqgN2kSZNc+03TuQuyatJ0jT59+iiBaUBAAJGRkWzcuFGCSfHCMgWUYPzZiomJUUJKU5WwMYiTqd75MQWRvr6+8nvgMUgwyd01JsEYKMZt7k+xSv4UrzPoic6XcuJ7Uk6sVLaL136y89yPdPsWQgghhBBCFCWmMND0Yj9nlSIYAwFTFWVISIhSWfmshYSEoNFoCAwMxNPTU9lvGpdM5xYvCw8PDzw8PHIFbKY3AmJiYnJtF2TjqcLA9NwAeHp6yvTspyTBJMZgsnXr1krlpC41lpQTK7lzYQfF6wx8rCAwOXJWrqnWFrbOJEfOyvfYnMfldw1Le1f0mXcrNg1Z6Rgyb2NnLc12hBBCCCEKu4iICCU0yVltIp6O6QWg6cVykyZN8PPzkxeGLzlT8PGgr/PcuXNp1qwZ8+fPf27BpKnpzb2VmzmDUZnOLV5Wpp8/+X0rniUJJrMtX76cFStWsHfv3lwB5f1CxUdhWdwdy+L3/4G1dKiQa9vCxinPMRbFcty2dQagvL0Ga/2tJx6XEEII8Sg0Go2yzlBRfUdcFIycL3D8/Pxo0qTJcwsVCoJGo+HDDz/MN4gsSutwPS+m30OmKbQhISGyjtdL7t71JQGlWjHn/0sFUTVp+rk2dQLPSaZzCyHE45NgMpuXlxdTpkwB4NKlS8q/vXv3cunSJWV/zo/3srB1xtq1MQDWZZsoQeKzpkmvivP175/LuYUQQoiIiAjGjh2bbwjpUbq0GUYkXnY5v9dCQkKUzqABAQEvXAfLefPmKY0xnN1c8O3ZlKqNauDsVopS7hJKPisJV+42G0i8Gs+2xVuYP38+GzZsUDqfipdHfutLmn4v3Pt/VVBQEH379n0uVZP5BaQ5yXRuIYR4fCqDwWAw9yBeNFOmTGHmioPYer2KPj3xuQWQD5K0933kSyeEEOJZyxmqAAQFBuJZpgwepUvjW6uWGUcmXmaauDjjxxs30MTFEXHqFBuyZ7B4eHiwbt26FyKcHDt2rBJcdB7Rnc4jeph5REVHwpV4Dm4OZ9viLXh4eDBnzhypWHuJNGvWDI1GQ3h4eK7fBffb//rrrxMZGcnatWuf6ffBvdeLiIigb9+++Pr6sm7duud6bSGEeFlZmHsALzpzhJJCCCHE85AzlPStVYvLISGMee01Alq3llBSPFcepUsr4XdA69bMHTmSuSNH4lG6NLmYX6gAACAASURBVBqNhtdff73QLyMQERGhhJJTQ2dKKFnASrm70HlED978bAgajSbXGyzixXe/9SVN26ZGHCZBQUGA8c2CZ8XU9MbX1zfPOHI2wQkICACQ70EhhHhEEkwKIYQQgoiICOVF1LopU1iXvbyJEOYS0Lo166ZOxbdWLSWcLMxMPz+dR3SXKdtmVLVRdao2qq6sMShefA+aPp0zEMzJz88PX1/fZ1qxaJpObgoe78d0zcL+ZooQQhQWEkwKIYQQQglVpDpSFCYepUszd+RIwPgiv7Cu22ZqEuXs5iKVkmZmqpwEqVh7WZgCvj59+jzW49atW8ecOXOe2Tgetr6kiakBT2H+nSWEEIWJBJNPSJ+RaO4hCCGEEM9ERESE0mV0zGuvmXk0QuSWM5x8ltMyn4fOI7qbewgCY9Wks5uLVKy9JAIDA82+XuOjhpImpqnkEo4LIcTDSTD5BFq3bk1m7J+kX/qtwK+tT08k5fhCrEpWKfBrCyGEeDmZXrwHtG4tXbdFoVTYq3hNUzxF4VHKvRQg3ZFfBh4eHmZvIrNhwwbg0as2TVPMJRwXQoiHszL3AF5EXl5etG7dmj17tuepnNSn562kvHdffsfkx9RYJ+fHzNg/sa3QCVuvV4mOvUX5siWe5FMQQgghFKZQxa+Qhz+i6DI1xok8dYqIiAizhxT3MjXeqNqouplHIkyqNqrOuagz5h6GeAloNBplVsGj/u4xTeeOjIwslL+zhBCiMJFg8gl4eXmxfPly9uzZ88jHP6lLly7l2q5UIxhNkhXR1ySUFEII8WyYKooKe1WaEFA4K5BMY5KmN4VPZGSkhELiqZimY5umZ+f0oN9HQUFB9O3bl/nz58v3oBBCPIAEk0/Iy8uLt956y9zDEEIIIYQoEny9vYk8darQBpPObhJKFiby9SgaTNXK9+vO/SyY3rx71PUlTWQ6txBCPBpZY1IIIYQo4kwvmmR9SVGYeZYpY+4hFChdlo5Zr09ndt/PzD0UIQot5f8vD4/ncv6QkBA0Gg2BgYEPvIa7u3uefTm7c5ua5wghhMhLgkkhhBBCvHSh5I6oKNaGhZl7GEI8seiTF4k+dYnYC9fMPRSzSU1K4ecvN3ArLtncQxGFlEajeW6hZE6P2vTmXqbp39IgSxQVWVlZz+37vWXLlvzyyy/P5dzCvCSYFEIIIcRL5+Dff7Ns2zZzD0OIJxa+cR8AxZ2Km3kk5qHN0PL10C/Zuew3kuKSzD0cUQiZqhCf5/qNgYGBXL58+Ymv4efnx5w5c5gzZ84zHpkQhdPevXt57bXXcn3Pnzp1ipo1a7Jy5cqnOndsbCxhT/im8759+1i2bBkGg+GpxiCeDwkmhRBCCPHS0en13E5LM/cwhHgiGWnpHN0eBYBdCXszj8Y81k5bRczpy+YehijETFVZTZo0MfNIHuxx16YU4kV27NgxAL799ltu3boFwOHDh0lLS1MaST0pnU7HxYsXOXbsGFu3bmX9+vXs2LHjkcLGr7/+mqlTpz71GMTzIc1vzGTs2LFEREQQHh5u7qEIIYQQTy01PZ3UO3co4+Rk7qEAxj9eVeYehBBPaNviLWTcyQDA1qGYmUdT8A5tO8jBzfI3sniwJ21KI4R4fs6cOQOAVqslLCyMXr16cfLkSQASEhIe61zp6emEhIRw+fJlLl++TFZWFkePHqVHjx7KMe7u7jRp0gRHR8cHnuurr74iODiYiIgIRo8ejZWVRGGFiXw1zCQiIkI6tAkhhHihZWq1bP/zT9bt2kXEyZPoDQaWjh9P+4YNzT00dHo9dra2z/06izZtYvm2baitrAgKDOS1Nm1QqQpPJHrh6lV6TZzIttmzcXcpfF2K09LT+XTZMlr7+NDlOU7HfJFoTkeze9Xv1PCrxYWj57C2tX6i88TH3OCvXceIPnWJ24m3aPlGO+q183nGo30+7EsWp/OIHty4FMuhbbI2n8grZ1Mac5LXc0Lk9s8//yi3t2/fjr+/P6Ghoco+nU6HpaVlnsdFR0ezadMmsrKyaNu2LT4+PixfvpwZM2bkOs7Z2ZlBgwZRv359atWqRel71kgPDQ1lx44dJCUlUadOHTp27EjNmjVxdXVl0qRJea5rMBg4dOgQCQkJNGrUiFKlSuW6X6vVsnv3brZv387Ro0e5du0aQ4YMYfz48U/0/Ij8STAphBBCiMdyNT6eJVu28PO+fSSlpOBgZ8drbduiovB0TtZmZWFf7PlWmu08dIhZP/6obI//5hs27NnDl6NGFZrnITk1laSUFMJPnOC1Nm3y3L/vr784p9Ew+
NVXzRKojl28mF8jIqjm6Vng1y6MtBlaVk36H7b2tgz4fAjTuk7E0irvCziTw7/9SfKNJFr1a4el2njcH2t2EblpP9F/554GXcLFMVcwqcvScTM2ERePBze+uvTXBb5772sCP+6Hj7/xTYerZzXsD9nLndtpVG5QFb9eLZTrA1w6cYHNX24g5nQ0JV2dqFi3Em0H+VOuSt7Oxfmp2bQWNZvWYuPMtY90vCh6TNMxn7QpzbNWEA14hCjsLly4wOXLl6lZsya3b99m7969fPHFFyQl3V0n+NatWzjdM7tm4cKFLFiwAK1WCxirG2fMmEHbtm05ceIE3t7e+Pj4MGbMGCpXrqw0lbpXaGgow4cPR6/XA7Bz504WLlxIUFAQo0aNyvN3TkJCAh988AF79uwBQK1WM3/+fLp27QrA999/z5dffqmMv1KlSjRs2BCXQvhG74tOgkkzkXfXhBBCvGguXL1K8NatbNy7lwytlrLOzkwcOJD+HTpgXwDViY9Dq9NhZ2Pz3M5/4+ZNZvzwAwBThwyhW7NmhOzezew1a+g7ZQqbPv+8UExrN/1xfvLCBcqXKcO1hAS0Oh1V3N1pUK0aX2/cSOTff5OcksKY114r0LEFb93Kr9lTMYXR+s9/4MoZDUMXjMKxTElUFipUFndfSC0ePh/7kvYMmjGUPat3smHGGgCS45LoNfY1jmyPYv3nxu/LMl5ladWvLTWa1sKxdEls7HL/PHzeczJx0df5v41Tcauaf2CYfCOJb0YswKDXU6FORQwGAxtnrmXvj2EY9MY1vaJ+ieT4ziOMWDIGlUpF4tUEvhk+n9TkVNS21qTcvE3Epv38+UsEfca/Qcs38gbk95Oemg6A2lpesoi75s2bp1RLPs/GN0KIx7N9+3bAuLzCzZs3WbhwIatWrUKlUtGmTRt27dpFXFxcrmAyODiYOXPm0LRpUz766COsra0ZMmQIS5YsYc+ePSxevFg51t3dndTU1HyvbTAYmDFjBhYWFqxatQpfX1+SkpJYs2YNq1atwtPTk/DwcN577z08PT1JTEwkICCACxcu0K5dO7y9vVm1ahWTJ09Wgslvv/2WpKQkSpUqxaZNm6hQocJzfPaKNvlf3ozknTUhhBAvkgGffYYmLo7yrq58+PrrdG3aFKt8puMUBtqsrOc2lVsTF0fg5MlcjY+nibc33Zs1w7lECYb36IFH6dKMnDePGT/8wJejRj2X6z/Mlfh4tkVEEH3jBifOnwfg++3b+T77BYNKpaKNjw/L/+//+CooiOCtW4k4dYrROl2BfT1v3r7N7DVrKOXoSEJycoFcs7A7uDmciJ/20WZAB6Wy0aA35KqYzNJquXziIn/vP8FPc9ZT3MmBtFup7F+3h66je+Fe3RPb4sVIT7kDgGvFsrh6lc1zrYy0dG5cigUgJfF2vuMx6A2smriMtFupvLv4fZzLlSLkvz+y98cwKjeoStfRvXB2K8XqScs4feAU185dwa2aB7tW7iA1OZUWfdsQ+NEbWFhZcjM2kR1LtxEa/AuVG1TFvfqj/Q2caVpns3jRW2dT5C8iIkKplrxf1ZQQouDp9XrWrl2LlZUVnTt3BmDp0qXcuXMHf39/evbsya5du7h48SLVqlUDjF22Z8+eTcmSJRk8eDBly5blyJEjJCcn4+Xlleca1tbWxMfH53v9ixcvcuHCBQYPHkzz5s0BcHFxYfTo0YwePRqAjz/+GE9PT9577z1mzJjBhQsXmDhxIsOGDQOMweeSJUuUc86fP58PP/yQmJgY3nrrLcaMGUPXrl2xsJAe0s+aBJNCCCGEeCR1K1dGExdHbGIiKXfumH0txbT0dKzV6nzDtLT0dBzs7PJ93MG//2bvsWM42NnRo3lz3B5jSk6mVsuAzz7janw8/x02jP4dOuS6v2vTpvx28CCb9u3j4wEDcHnIYuz3E339Opv27SNLp6Ntgwb4VK2a6/7LsbH8GhHB7qNHORMdTfFixdgwfTpuLi58/N137Dl6NNfxNStUILBNG+pWrkzNChUonj3N3dXJiUkDBz7RGHMydcR81O8JJwcHjvzvf8TcuEGnsWOf+vovutjzV1k3fTXlqrjTdpA/V85ouPTXeTLS0vnnwCnWfbaanh8GkpGaQfKNJL5772usba15f/k4joRG8ds3W7l84iJVGlbj401T2ThjLcfDjvD10C+p2rgGXUf1pHKDu99DNna2ytqV5Wt7cSsumYNbDlCrRR3cqhlDw18Xb+afiFN0f7833s3rcPH4efb+GIaDswNtB3WkdPkyxMfEcfNaInA3PDy59y/sHe3p+UEAFtmhqlNZZ16f9CavT3rzsZ6XjDRjMGllrX66J1i8FCIiIujbty9gDCWlyEOIwiMsLIzLly/Ts2dPypUrB8D06dOZPn06o0ePxi77b7KzZ8/SsWNHABYtWoRWq8XLy4uhQ4cq51Kr1UyYMCHPNfR6vTITxOTIkSP4+Phw48YNgAdWNTo6OnLq1CkAfv/9d7y9vZVQEuCNN97gjTfeULabNGlCWFgYP/74I+vWrWP06NHMnz+f4cOH06tXL9Rq+b/pWZFgUgghhBCPZN7o0XiULs3K0FAmBgfz7ebNDOvWjdfbtcMmnz/OLly9ypItW4i5cQNHe3vaN2xIn1atnnoct1JT+WL1atbu2kUJOzsCWrcmKDAwVxCZlpFBuXsWMM/Qanl/wQJ+O3i3mcZXGzeycfp0vPN5Zz4/a3ft4sLVqwzs1ClPKGnyapMm/HLgAAdOnqR7s2aciY5m3a5dDOvenbLOzrmOTU5Nxd7WNle4unDjRhZs2IA2K8s4xg0bmPHOO7zRvj1xSUmM+PJL/jx9GgB7W1u8vbwoWbw46uwOkwGtWuHk4IBP1aq4Ojnxzpw5dPb15T9dujzS53g1Pp4v16/n96goMrOyqFiuHN2aNmVot27KOCP//pu4mzfp1qwZ/165Qr+pU9Hp9WydMeORg97ixYqResdY2ZffQvhFRdqtNJaMXkhmeibX/r3CJ+3H5bo/404GZSuVQ22tJjkuiYw7GVhaWTJ03gjKVXGnXpae377ZSvKNmwA4lyvF0AUjiT1/lQMb9xG+4Q/mDZxBlVeq8eq73anuWxMAA+BR3RNbe1tWjP+Ok3uP83f4Sd5fNo6oXyIJXfIL9do3oMPbxsqXn+eGoLZRY+/kQPD7X+caY4f/vIqzm/Hn7VZcEs5upbCxe/qK5cz07GBSXXS/P4RRzlAyMDCQMWPGmHlEQgiThIQEpkyZApAr6AsMDKRPnz5KhaGrqyuHDh1S7t+2bRv+/v4sWrSIzZs3ExkZiaurK927d1eqKnNycXHhwoULyvbp06fp1asXW7duJTl79kV6evp9x1m6dGn+/vtvAKysrLh+/TpXr17Fzc3tvo+xsbFh8ODBDB48mIiICBYtWsS4ceP46quvWLp0KTVq1HiEZ0g8jNSgCiGEEOKR2FpbM3HgQCK//Zbx/fqRqdXyyf/+h9/w4eyIisp17P6//qLrhAnsiIqieLFiuLm48MnSpcp04pxuZTdoAQg7fJgPvv6aq9lT
dX7cuRO/d9/l70uXlOM/Dg7mx5070ev1WKvVfL99OwGffMKNmzeVY9LS03Ote2kwGBjx5Zf8dvAgAzt2ZP+iRQSPH09aejorc3SLfJi9x45RsnhxJg8alGt/cmoq89avJzU9/W71YPZ9a8PC+N+vv3L03Lk85wucPJklW7Yo28FbtzJn7Voa1ajBli++YPucObi5uCjH7D56VAklR/TqxfHly9kwfTpLJ0ygdMmSAHRr1oz5o0czqFMn2r3yCgAp2QFgfsYtXkxMdqXByu3baTdmDCG7d2Nna0ufVq1wtLdn1po1BE6ezLWEBABCDx7k02XLSE5NZeisWVy/eZP45GRmr1nzyM8lQGr2C4jCtkZpQUm7lcY3784nLvoGTuWc8apbiXaDOjJ41jA+/mkqrl5lsba1oVW/dhgMBm7FG1949Rr7GtX9vAEoV8UNtY2aK2eN65drM7RoM7SUrexG7/GvM+33Wbz6bjc0Z2L4euhc9qzeCYCFhQXWdjZE/RrJyb3HAYg+eYldK3ewetIyPGqUZ+AXb6NSqUi6fpPzR87h17sFH2+cwsAv3savdwvaDvLnvf+No8eYAOO10zPJTM9Em6F9Js+PNt14HosHNAASLzeNRsO8efNyVUrOmTPHzKMSQuT0448/otFoaN68ObVq1cp1X85pzy1btuTAgQOkpaUBkJycTEZGBlZWVvTp04fZs2czduzYfENJME61TkhIICEhAZ1Ox4oVK7C1taVy5cpkZb+ZGxcXd99xurq6kpmZCcCQIUNISEigc+fOLF68mMOHDyvNd0wyMjLo1KkTK1euRK/X4+fnx+rVq/n555/R6XS5qjzF05GKSTMwNb6R6QdCCCFeRE4ODozs1Yt3unfnlwMH+O+qVbwzZw5TBw9mYKdOHD13jre++AK/WrVYMnYsdra2JN66xdJffmHaihW0qFuXSjnenV7x22/siIpizaefMv6bb4hPTqZ2pUqUKlGCj7/7DoPBwJItW1jw3ntcjo1la3g4AD5Vq7Lm00859u+/9J82jZHz5hEybRpgDLxyduUO/fNPdh46RL0qVejZogVqKyv+uWzsWvw4odiVuDhKlyypVCearN6xg/khIVyKjUWlUmFhYUHdypUBOJ19nVfu+UP7wtWrnImOJum2cY2/2MREZq9ZQ8nixRncuTNlS5XiyNmzJKek4JU9Lapn8+Yc//dffty5k++2bOHm7du816fPfasU1VZWqFQq0jIy7vs5/RoRgWeZMrwXEMDaXbtIS0+nWZ06fPvhh5Swtwfg/JUrDJs9m75TprBrwQIMwO07d+g3dSoXr11j7siRrNqxg7DDhx/5uQS4kz2uYs+xUVFhdTvhFgvfnsO1f6/y2sT+tHyjbZ5jijnYcf1SLDdjEylWvBgqlYqyld1yHWtpZYl7NU/O/vkPAPvW7Wbbos30GBNA04CW2Dva02VkT1q/2YG5/f/LpjnradCpEQ6lSnD5xEXORZ1BbWtNMYdi3IpL5qdZ6yjp6sTwRe9hU8z4dUlNMjYb0GZosbCypHE3Pxp3y9t0RJelA+BWwq1n8hyZKibVannJUhTNmzdPWU8SYM6cOQQGBppxRHlduXLF3EMQwuzat2/PmjVr+PDDDx94XLt27QgJCWHHjh307NmTV155hT/++IPIyEh8fX0fep1XXnmF4OBg2rdvj16vJykpifHjx2Nvb4+3tzc2NjbYPODviVdffVW5/91338XBwYEvv/ySmTNnAsbqyNq1a1O/fn3GjBlDsey/Iz/55BO+++47vL29sba2JjExkRs3blC8eHEyMjIeeE3xaOR/eSGEEEI8EStLS3q2aIFf7dr0/PhjZvzwA/06dGBBSAiuTk4sGjNGaUCzYe9eALJ0Omb9+CPf5lhXUG8w8PelS4ycN4/47Kk4kadOsfvoUepVqYLBYODQP8bQ5eK1a8rjhvfoQTEbG/xq1WJgp04s37aNq/HxuLm4GNefzA4PDQYDX65fTxknJzQ3btB70iTlHK5OTgzt1u2RP+cqHh5sP3iQawkJuaaKe5U1Nhj5ed8+ADo0bEiF7H0ZWi32trZ5unSbuno39jZWvi3atAmtToeXszNDZ81SjlNbWTGhXz8ArNVqPh86lDfat2f5tm1s3r+fDXv20KtFC97p0YMq7rm7K+uy12LS6XTKvjsZGVy8dk2Zvu5YvDinsitSHYoVw9LCIlcoCVDZ3Z0hXbrw8XffcSY6mvjkZDK1Wk5evMjEgQMJaN2a9MxMJgYHk3LnjrKG5cOYgklrq6L3J+lv324h4UoCg2YOpWHnJvkeY6q+TbgST5VXqvH2/JE4lXXGwjL3pCf/oZ3ZsXQbABXrVcZggHWfreaXhZso5e6ChZUlt+KSSbyWgLWtNQa9gbpt6rNr5Q6KOzvwzsLRGPQGvh25gJKuTgyaMRTHMiWV85euUIYSLo4c2naQFq+3pnwtr3zHa2lliV0JOzLSMtBn6Z660jErIwsLK0upmCwiNBoNMTExREZG5gokg4KCZOq2EIVYzZo1OXDgwEOPa9euHfXq1ePrr7+mZ8+efPTRRwQEBDB06FAmTZpEly5dsLGx4fTp02zZsoXQ0FB69uypBJ4dO3akQ4cORERE0KpVK/r166c0uqlYsSIhISFUrFjxvtfv3bs3vXv3VrbffPNNAgMD2b9/P0ePHuXYsWNcunSJbdu2MWDAACpWrMjWrVtZv34969ev588//yQ1NZWSJUvSrVs3hg4dKqHkM1L0/goUQgghxGP7aMkS/rl8mbkjR1L5nvCrVIkSuDo5EZeURKZWy9WEBKqXL68EW2GHDzN37Vq8vbxQqVT8dvAge48do1X9+so5dHo9e48d401/f1bv2EHon3/SsEYNVk+axHdbtyrTpC9fv648Judt9+yKwdvZ04MMBgM3sysRTZWJU4cMoVfLlqwNC+NsTAw1ypend8uWlHqMBjXtGjRga3g4/5k5k8+HDlWa0jSsUYPSJUsSl5QEwKBXX1Ue4+biwpGzZ4k8dQrfWrUwGAzMWbuW0D//BCA9O5zbFhGBf6NGLBozhs379xN56hSuzs50b9aMap6eucZRu2JF5o4cyeS33mLFb7+xfNs2Qvbs4e2uXXM1s7G0sMDJwYHbOaZyrw0LY/aaNfy9ahUApUuWVKbK29rYYK1WY33PmqFJKSms27ULVycn3EuX5nqiseHJa23aMCw72H2lenUAzmk0eZr13E969pSqwtrd/Xmq29aHtgP9cfEsc99j3po1jPCQvUoQWKd1vfueq25bYyfvivUqM+33WexeuYOTf/zFjegb6LJ02Dva06R7U1r2a4tjmZJ0e783nrUqULVhdUq6GkPzGX/MzzcEtLa1pseYPqyauIxFw76kxweB1O/wCrb2tiRciefE7mMcCY0i8VoCA794m+sXY59JmPjaxP7cvH7z4QeKF5JGoyEiIgKNRkNkZCSRkZG57vf19WXu3LkvxCyzF2GMQpibtbU1a9as4XT2kjQ+Pj4sW7aM999/n/HjxzN+/HhUKpXyppybm1uuSkoLCwuWLl1
63/PXq5f//5EPYmNjQ7t27WjXrl2+96vVavr370///v0f+9zi0UkwaQYxMTHmHoIQQgjxWCxUKo6eO0f7Dz6gbuXKlHN2xgDcuHmT05cvcycjg0kDB2Jna0vjmjVZFRrKgM8+41ZaGsfOncOnalVWfPwxWTodvSdN4u1Zs/j0rbdyNZBp4u3N8B49WL1jB+VKlWLJ2LEUs7GhUY0aGAwGzsbEcCUuDrWVFW4uLsz68UfikpIoVaIE327ejJODA2Wzqxhtra35N3uKXXKqcRpqhlaLo70973Tv/sTPQ6+WLfl5/372HD1Kz48/xtXJCWu1mmsJCWTlqEo0BXdgrOwMO3yYobNn07JePWJu3OD4v//Su2VLtoSHs2nfPro1a0ZyaioZmZlYWVrSp1Wr+zYKCt66lT9Pn+aTQYMo7+rK+wEBDO3albnr1hG8dSs1K1TI9Vi3UqU4Ex2NwWAg5c4d1u3alavZj6uTkzLegFat2HP0KH2nTKFH8+YUs7Hh1MWL/BoZSVZWFuunTcPR3p5bqak42tszMUcIWtXDA2u1moiTJyWYfAQ1/Go99JjS5cvQ88PHn7pq72hP19G96Dq6132PUduoadQl99S5B4WJTXo0Q6/Ts/7zH/jx0xX8+OmKXC8gLSwteOXVxtRqUZfarR7/xWF+TOtoiheTqQJSo9EoS1lduXJFqYq8l4eHBwEBxvVKpUJSiJeTvb09DRs2VLZbtWpFREQEv/76K+fPnycrKwsvLy/8/PyoVKmSGUcqCpIEk2bkeU/1gxBCCFFYTR0yhNqVKvFbZCRHz53j5IUL6PR6bK2tqVu5Mu907640Whn3xhvEJyXxx/HjFC9WjGHduvHB668r6wiumjiRAZ9/ztQVK6hVsSJeZctSwt6eL0eNwq1UKd4LCCCwdWtcsisZfb29aVSjhhL+2dnYsOnzzxn55ZcEb90KQOOaNZn+9ts4ZldpNq5Zk0379pGank5VDw8c7OxY9uuv9GnVSjnvk/rfhAms3rGDVaGhRN+4gbODA/WqVKGzry/N69Zl6KxZfLF6NY1r1qS8qyt1KlXi5//+l3GLF7P94EEaVKvGvNGj6d2yJd5eXmzYswcwrkH5x/HjSmXl/ZRxcmLn4cPsOnKExjVr4lyiBDqdjr/OnwdQpsObvFK9Ot9v307TESNISE5Gp9fzwyefKPe/6uuLjbU1YGyccyMpiZDdu5n2/fcAVCpXjv906UL/Dh0oWbw4ADOGDyc5JUXZBmO4+J8uXR64nuW9MrIXmnfMcR5RePn1bkHt1vU5uDmca/9eJTM9A+dypajkU5XqTWpgW/zRpvCLwssUIOZ0b1FFzqAR7oaN+d33IB4eHvj5+eHu7o6vry9+fnnXLRVCvPyKFSumvCkhiiaVwfQ2pygwERER9O3bl8DAQOkqJ4QQwuwqVKiAR+nShC9eXGDXzNLpSM/MVNYiNBgMqFSqhzwKpq1Yweb9+zmcPZUnOTUVW2trbO6ZepycmkrU6dO0bdAACwsLgrdu5bOVlQw3AgAAIABJREFUK6ns7s70//yHRjVqoNXpiDp9mp/++IMDJ08yb/RoWtSt+9Sf2/krV+g1cSIlHRzYOH260i37YY6eO0fAJ59gZ2vLpIED6eLnh41azenLl9kSHk7on3/Ss0ULPnz9dc5pNCzatIkjZ89yLSEBG7UazzJlCGzThgH+/rma88QmJtJv2jRup6XxapMmDOrUKc90/PwYDAYMBkOujprPWlxSElvCwxn86qsPvc6GPXv48P/Zu/OwKMu2j+NfVlHc2EThRtyX9FFzyUHKJbVFc0sI60mz0hbNwiTrMSsreytDwUotLTO1UgdzKU1NzY1mlMwVzciVAZVNUNlkmfePcSZGUJFtZuD8HIeHAjNzX4wMM/O7z/M6582zyv3m/P39cffx5L0tH1t6KeI67dpolk9fXOqfl5CQkJtW8VW0irzNkydPkpuby4MPPljsayUFirf6uKIZv09FUfDz8zMFkH5+ftWm9TksLAy1Ws2KFSskWBVCiDKSikkhhBBCVDlHBwezASmlCSWNigZYDYoMaCmqgasrA4q0Co0fMoRr+fnMWbmSJ957z6wF1c7Oji6tWtGyyKTw8mjp68vC115j7IcfMnrmTH766KNiU7xLcnfr1ix+4w1e+fRTpi5YwNQFC8z3WvL0RHV9UE5rRSFy0qRSraexuzvbiwySKC07O7s7+n8pC6+GDXl28OBKPYYQpeHn51ditV9lhHeVcZtqtbrCb9OoaIho/Lex88u3yEkO415w1Sl4FEIIUfkkmLSAyj47KYQQQlRXbvXqkXH1apmuO3HECIL79mWDRkNCSgouzs60VhTu7dQJj/r1K3Sdqg4d+G3uXDZoNHcU7vXp0gXNggVs0Gg4mZho2GupSRMCOnSgRQUFp0KI4iqri+lOX/ff6V70L774IpcuXTJb/52GgiVtLyXBYunI7AAhhCg/CSYtyLcUbVRCCCFEeeh0OkJCQlAUhZUrV1p6OeXm7e5Obl4eKRkZZdorspGbG08PGlQJKyuuiYcH4x555I6vV7tWLYL69q34BYlKo1Kp0Gq1pCak4OHraenlCCAtMQWwfMB2p8e/08u3bdsWrVZLQECAxb/XmkxmBwghRNlV3qZBQgghhLAYnU5HWFgYgYGB1apSv1ubNgCs2bXLwisRori0xFRLL0Fcl5pg+L+Qff+EEEII6ybBZCWrTm8GhRBCWD+dTkdERASBgYGo1WoURSE0NPSW1ZKKoqBLTq7CVZZdS19f2vj58cvevZZeihAmxr31UhNSLLwSYRQXc8LSS6hS0lIshBDCVkkwWYmMbww1Gk2pLm9st6vMzauFEEJUT0UDycjrg05CQ0OJjo4u9QRjbWxsZS6xwnz43HM8/fDDll6GqGKa6z+f1tiualxTTQvDbIE1/rxUJGkhFkIIYeskmKxExrPnkTdMwjRWUd74QikyMhKtVls1ixNCCFEtVEQgaWtv3Lu3a8eQwEBLL0NUMWuu6jW2C8fFnJBw0gose3MxaYkpBAcHW3opQgghhLgNCSYrkfEMZmnbuY2VkrIXjhBCiNu5XSB5J2Gj8USaxkYqJkXNpEtKAqzzdZJxy4S0xBQ2zl9n6eXUaKkJKexdFw0YfidWd8ZhmrJ9lGXI/S6EEOUnwWQlUhQFlUqFTqe7bTu3MZQMDg62ucoVIYQQVaciA0kjYzCpPXasQtcqREXRxsaaKiat9XWS8TVcXMwJCSctJDUhheXTFwMQHh5utT8rlUECMiGEELZKgslKZjxTW7SdOyEhodjloqKiABg5cmTVLEwIIYRNqYxA0sjPzw9FUdDGxhK1Y0cFrViIihNR5ASutVIUhfDwcAA2zl/P2w+8LgFlFUlNMFSqvvPg68TFnEClUln1z0pFqknhqzWT/wchhCg7R0svoLorTTu3RqMx7S1pje1JQgghLCsiIsLsBFdoaGiFVtgb21DDwsKIWLWKoL59K+R2hagIUTt2mAYzWXtrbkBAANHR0ajVaiIjI9k4fz3atb8D0LpHWwA8fD0suc
Z6enmi1WjZs2MCmTZtYunQpJ06c4NKlS/Tu3RsLCwtjX4YQQohHrLb/HRdCCCH+ziSYvIOr5y/j6tUUK9vyoyCtbK3w8m9n+N7S2hKA4sJizh4+zQ9Tv+F65nXiN8Qy7ed30GlLSYg+Ttue7csdp6igiF+nL2FY6HN49+lEWqKGw9sOsC58FW6eSi4mX+CPlbsJencsAKfiEzkVn4hLi8ZMCv8HnZ7wrdL12NjbMGVeMAA3Mq9z7Pc/iVsbxdcTwxg9fRx+w3pV+TEStdutow70ow1SU1PLT42q5aMQ9G9W9P+vq1PWVSoV/v7+BAUFGUZP1rZ1Jx0dHWnWrBmlpaWsW7eOF154gfr16/Pss8+W2+7PP/9kw4YNmJmZMWvWLKKiolAqlbRo0QJ3d3eKi4uxtLQ00lUIIYQQQgghhKgsCSbvoHErV47tOULO1WzsGzrccTszi7KHMCHqGOtmr8LVU0mA6nE2f7+BE3uPofRqSklRCe4dy3ftjlsbTVFBEa6eSnKzcgFQf/ILTVq7ErzkbdZ+qSblz+Syc5iZGvbr/9LAKoeSt6rXwA6/oQH4DQ0g9rcoVsxYRjPvFrh6uN3XcUXtdmuAJ2offRMcfQfT2rbuJICvry/16tVjx44deHt707NnTwoLC7G3LxuBnpGRwYEDB4iPj8fZ2ZnTp0+TkZGBo6Mjx44d49ChQyQkJODp6cnw4cMxM5M/c0IIIYQQQghRU8miXHfQe1RfTExN+Gbyl2SkXb3jdkX5RUBZqOjl346QJW/x9GuDsbaz4VR8Ipa2VigUCjLT/zqGJuE86+esAspGMuZczQLAzsme174LxsrWitZdvci8UNbwRleqo1l7d5RtmrJp3jqyLl17YNfpNzSAdr28iV2z94EdUwhhPPpwMjg42LCeqD6orA08PT0JCwujSZMmvPHGG7z66qssX76cr776ih07dpCfn8/27duxt7enZ8+eeHt7Y2FhQWZmJgsWLOCzzz5j8eLFJCUlkZOTA4BOp0On01FcXGzkqxNCCCGEEEIIcTMZSnIHtg62/Ound/j+9a/48Nl36NjPB/eOLcm+lEXO1Wza9e5A90H+FOQVANCgiRMvfDwBk/+NbmzfuwOn4xOx+tdIfAd2I3rVH5iYmWJla8Wen3fg6qEk5dhZjv/+J/UalI0EGvbmczg2bgBAs/bu3Lh2neN//MmJqGP4DetF4KRnmDlqBmHPf0z73h145vWhdx3NqXd09xFKtVraBXhjblV+/bXrGTlcOnsRB2fHB/nwCSGMSN8UR6VSERQURGxsbK0ZPWlqaoqXlxcvvPAC7du3JykpiYULF2Jtbc3Jkyd56aWXaNWqFWPGjMHPz4/x48ej0+nIz8/H398fpVLJmTNnuH79Oo6OjhQXF5OXl8fu3bsxMTHB1NSUTp064eYmI8SFEEIIIYQQwtgkmLyLxi2b8O66j4jfEMu+9dHsXLKVovwilG2aUs+xHgAWVhaYmJnywscTsLazMew74OWnCHv+Y3Q6HeM+mYhDo/oc2rYfU1NT+o8PZODkQaz/eg2ak6mMmj6Oaxcy6fp0D8P+jZo50y7Am8LcQqzqWdN37AAaNXPmHz/8iyVvLyAq4nc69velfe8O97yOQ1vi2bc+BnMrC1zcXXBSNsLG3pbsy1mcPXIGnU5Hr+f6PPDHTwhhXPrRk2q1mvDwcMLDw0lLSyM4OLhGT9m3tLSkZ8+e+Pj4kJqayvTp04mKiqJJkya0adOG7t27U1hYSHZ2NkOHDqVnz56MGjUKJycntm3bRkJCAj179iQhIYFdu3axefNmABISEvD39+fAgQMMHDgQHx8fmeothBBCCCGEEEak0Ol0OmMXURXNmzengWtDPtz6ubFLMSi4kY9VPevbbj+9P4lWXTxQKO7dNftutCVaTG9aZ7K4sJi0pFTcO7Ss1P6FeQWcPnCK9CQNF06ncePaDfKv52FtZ0PTds3oPrgnLu6N76vGB2XZf38kbm0Uy5cvr7NNSoQwhpiYGEJDQ9FoNIau5DV99KSeTqe74+/RGzduAGBjY0N+fj4rV65kyZIlLF++nCVLlrB69WpGjRqFt7c37u7uREdHs379enx9fQkJCcHa+vbf3UIIIYQQQgghHg0ZKvIAVBRKArTu6vlAjn9zKAlgbmle6VASwNLGiva9O1RqdKUQom7y9/e/bfRkREQEwcHBhs7sNdXdPtypV6+e4WsbGxs0Gg1DhgzhxIkTpKSk0K1bNyZNmoStrS0KhQIHBwc2bdrE0aNHSU1NxdPzr9/TpaWlmJjI0stCCCGEEEII8ajIOzAhhPib0K89GRUVhZ+fHxqNhtDQ0FrVHOduFAoFL730EgMGDKBHjx7k5eVx+vRpVq9ezYkTJ4iIiGDmzJns27ePU6dOYWVlRWlpqWH/48ePo9FoKCkpuee5kpOT+f3330lPT3+YlySEEEIIIYQQdZoEk0II8TejX3syLCwMpVJpaI4ze/ZsY5d235o2bYq3tzcKhYI+ffpw+fJlfv31V4YPH863337L8ePHadKkCY6OjiQlJRlGSGo0GqZNm8bvv/9OYWEhycnJJCcnG45bVFRETk4O+tVPNm3axHfffce1a9eMcp1CCCGEEEIIURfIVG4hRJ1WXFzM5cuXpQtzBVQqFf7+/rdN765N60/eiY2NDS+99BI9e/Zk69at2Nra0qBBA7p27cqWLVv47LPPsLe3R6vVYmpqyooVK7CwsODgwYOUlpayZcsWLly4QHBwMAMHDmTmzJk4ODjQsmVLsrOzOXjwIPn5+Xh4eBj7UoUQQgghhBCi1pIRk0KIalm1ahUDBw5Eq9Uau5QKxcbGEhwcTKdOnejZsycff/yx4b7k5GQ6depEWlqaESusGW6e3h0cHIxGoyE8PJyAgIA6MYLS09OT119/nTFjxvDss8/SpEkTmjVrhpOTE3v37qWgoIDk5GQWL17MmTNn2LFjB3PmzCEjI4MzZ85w+vRpTp48yZEjR5g/fz4fffQRCxYsIDY2FldXVzIzM8nOzq7U9G8hhBDiUdFoNMyePZvmzZvXib/nQggh6q5aF0wqlUoy068auwzxkOh/tk2bNjVyJbVHamoqW7ZsueP9U6dOxcfH54GvhXf58mUSEhJqVLiXl5fHvHnzePzxxwkKCmL9+vX0K3vSkQAAIABJREFU6tWL8ePH4+PjY9guOzubrKwsoqKijFhtzXJzQKlSqepcQGlhYYGpqSkKhQJfX1+cnJw4duwYqampzJs3j/z8fJo3b07fvn359NNPCQkJwcTEhAYNGtCiRQteffVVxowZQ5s2bbh06RJarZb4+HieeuopvvnmG86ePUtxcTElJSVkZGSQn59v7EsWQgjxNzR79myCgoIICAggPDwcoEa9VhNCCCFuVeumciuVSjQaDafiE/Ho5mXscsQDlpGWAZT9nEXlzJgxgy1btrBs2TIee+yxcvclJSWxdu1a+vXrR+PGjR/oefUjJePi4khOTiYjI4PS0lL8/PweebCcm5vLvHnzWL58OZcvX8bGxobx48czadKkCqdw
6xueHDt2jGbNmnHhwgWKi4tp3bo1vr6+j7T2mkapVBIWFkZwcDDh4eGGad51ZYq3QqHA0dGRsLAwNm/eTG5uLteuXaNXr15MnjwZLy8v7O3tWb16Nd26dcPGxgZbW1v69etHv379WLBgAYcPH2bEiBHUr1+f3Nxcjh8/zltvvcXMmTNJTU1l1apVvP7663h4eEiXbyGEEA+dRqMx/L2uSFhY2COuSAghhKi8WhdM+vn5ERsba+wyxEOSmX5VQskqatasGQBRUVG3BZMLFizA1taW2bNnP5CA5ODBg0RFRXH+/Hni4uIACA0NNdxvZmbGtGnTmDJlyn2fqyqmT5+OWq3G2tqat99+m9GjR1O/fv1y26SlpREZGcn58+c5evQoAEuWLGHJkiVAWWDVt29fFi1a9Ehrr6nqckCpUCjw8PDAw8ODgoIC/v3vf1OvXj2aNGkCYGh+Y25uTseOHQ37paSkcPHiRRo1asTo0aNp06YNFy9epKCgAChb17K4uJhz584RERHBf/7zH6NcnxBCiLpPH0ZGRESg0WjuuF1wcPAjrEoIIYSouloXTOqdij8pIybrmIy0smnc/v7+Rq6kdpkyZQqLFy/m4MGD5W6/cuUKa9asYdy4cbeFdHFxcezZswc7OzuGDBmCq6vrbcc9c+YMJ0+epG3btrRs2RKAUaNGUVhYWG67J554gsDAQLy9vfHw8MDc3LzK15Cdnc13333HyZMnady4Mb6+vgwbNqzSx+rYsSNqtZr8/HzS0tIq3O+dd95h9+7d5W5r27YtKpWKjh070rZtW+rVq1fl2nU6HQsXLiQ+Ph4LCws6duzIsGHDaNiwYbntzp8/z5o1aygpKaFfv37lppbXZHU1oDQ1NQXA1tYWDw8PQ7dtgMzMTDIzM3FycqJ169aG25OTk9mxYwfDhw+nQYMGALeNRO7cuTMBAQGYmZmRnZ2No6PjI7gaIYQQfxcVjY5UKpWMHDmS2NjYcgM49Mu0CCGEEDWZQnfzu7FaICYmhlGjRtHAtSEfbv3c2OWIB2jO+C84FZ+ISqWSKSdV9Pzzz7Nv3z6OHDmCra0tAF988QXz5s3jjz/+MExnLiws5I033mDTpk2GfW1sbFi1ahXt2rUDoKSkhC+++IIffvjBENa8/PLLvPfee8ycOZPs7Gx8fHxITU1l9uzZqNVqunfvXmFd6enpfPnll2zbto2ioiJatGjBoEGDmDRpEmZmZZ+LFBcXo1KpOHToULl927Rpw3fffWcIRe9l+fLlzJo1i8uXL2NnZ8eYMWOYOHEijRo1AmD9+vXs2LEDHx8fXFxceOWVV5g2bRpTp06t8Hj5+fnMmzcPtVrNlStXaNKkCX379mXq1Kk4OTkZtpsxYwYLFiwot6+joyOzZs2if//+AHz99dfMmTOH4uJiwzafffYZo0ePrtS11ST6tSfVajXw15shlUpVZ0Y7FxQU8PLLLzNhwgQee+wxzM3NuXTpEj///DMbN24kLCyswmBZ3+EbICcnB3t7+0dduhBCiDroTqMjg4OD8fPzw9/fn6CgIGJjYw3LXkHZFG6VSmWssoUQQohKqXWLX/n7++Pn50dm+lWW/fdHY5cjHpBT8Ymcik8EZMpJdfTr14/i4mL27t0LQFZWFosXLyYwMNAQSup0OqZMmcKmTZsYN24ce/fuZf78+eTl5bF06VKgbO3FqVOn8v333+Ph4cG0adPw8vJi4cKFnD59mjfffJOPPvqIESNGGNZizM3NrbCmpUuX0r9/f9RqNTY2NowYMQIHBwe++OILVCoVFy5cACAyMpJDhw4xcOBADh48SHJyMitWrMDJyYlXX3210o/BqFGjiI6OZvbs2TRr1ox58+bRs2dPZs+ejU6nY9CgQYSHh/Piiy8aAsMbN25UeKy4uDj69+9PeHg4mZmZDB06lDZt2qBWq3nqqafYt28fUBa8Llq0iGbNmrFnzx7OnTvHzp07GTBgAP/6179IT09n/vz5hIWF0a1bN9atW8fmzZtxdXXl+++/r/S11ST6EZS3dvEOCgoiNDSUmJgYY5d436ysrJg9ezbdu3c3jL4tLS1l27Zt9O3b1xB2FxYWGqZxa7Vajh49yrFjxygpKZFQUgghxH3Td9bWN7LRaDQolUqCg4NJSUkhJCSEpk2blgsl9a+j/fz8JJQUQghRK9S6YBJg1qxZQFmYFfubdNWt7TLSrhI5dy1Q9sluXRl19Sj169cPKAv5oGxtydzcXCZMmGDYZsuWLWzfvp1OnToxdOhQzM3NOXnyJIBhlOXatWvZuHEjgYGBbNq0ialTpzJnzhycnJxuW6PSwsICuHMwuXz5cvLy8ggICGDLli189NFH/Prrr2zfvp2cnBxGjRqFVqtl+/btmJub89FHH+Hk5ISpqSl+fn788ssvbN26tUqPg7m5OcOHDycyMpKVK1fi7e1NeHg4b7zxRrmpuubm5igUCvLy8io8ztatW0lLS8PV1ZV169Yxc+ZMfvjhB6Kjo+nSpQujR48mJSWFXbt2odVqeffdd3F3d0ehUNCqVStmzpzJkSNHMDExYebMmdSvX5/x48fTuHFjzp07R3Z2NjY2NlW6tprm5i7e+oBSrVYzatSoOhFQOjs7Y2dnZ/j+xIkTXLlyBRcXF5RKJfn5+cydO5fExES0Wi1Hjhxh2bJl7N27FzMzM7RaLbm5uSQnJ3P06FGysrJuWwZBCCGEuNXNYeTNnbWDg4OJiooiKirKMD1bo9EQEBBAbGwsfn5+REVFldteCCGEqA1q5RqT+hE7oaGh/PTuj2SmX6XHkACc3Bree2dRo0TOXUvk3HWAfLJ7P1q2bEnbtm3ZtGkTkyZNYuHChbRr145u3boBZaMlv/zyS5ydndFoNAwfPtywr4uLC5MmTQIwBIFhYWGGqdZt27a9bf1K+Kuztb47N8DFixfRarW4ublhZ2eHqakp8+bNKzd6rFWrVkyYMIF33nmHxMREQxdt/Si0B6VHjx6o1WomTJjA2rVrGT9+vGH6rb7mm2vPz8/n7NmztGvXzhBIzZgxA09PT8M29evX58033yQyMpLt27dz/fp14K8GRLf69ttvKS4uxt3d3fAYQ1kw+tZbbz3Q6zUWfUCpUqkM08zUajVqtdowcqMu/LsuKSnB2tqaxYsXc+3aNYqLi4mJicHZ2Rlvb29SUlKIiorimWee4fr162zfvp3o6GiSk5NJSUmhdevWTJ48mb59+6JQKIx9OUIIIWqYu60dWdE6kfrlraDsNfSKFStQq9VoNBrD9G4hhBCiNqiVwSSASqUyTCGMnLuO2N+iDc1wPLp54eTWkAauTvc4injUTsUnkple1uQm9rdow9eyruT9U6lUfPjhhzz33HPk5eXx2muvGe5LTk4mMTGRDz74gGHDhrF8+XKSkpJo06YNw4cPN6yZqJ+2evDgQfr06XPX8+mbu+jDOYD33nuP4uJiFi1ahJWVFRYWFoaRlXpZWVmsWLECFxcX3NzcyM7ONkyHrY6IiAjCw8OZPn06AwYMKHefmZkZnp6e7Nmzp9y0bVNTUxwdHcvVvnz
5cmbOnMmJEyewtLQEuK0ZjlarZcGCBZiZmdGxY0fDCNU71R8ZGcmTTz7Jt99+y9q1a4mNjcXFxYXBgweXCzzrgpsDypiYGMOUs9DQUMLDw2t1oxyADh060KVLF86ePcu+ffsoLCzkxRdfxN/fn4sXL3L06FEaNWqEs7Mz69evZ8aMGdSrV4+ePXtiY2NDWloaX331Ffb29nTt2tXYlyOEEKIG0Y98hMqt3XxzKHnza2gZLSmEEKI2qrXBJFBulE54eDhxa8tCrri1Mr27ttCPfpVPde/fsGHD+OSTT8jNzaVZs2Y888wzhvuys7OBsjXxHBwceOWVVyo8xpgxY9i0aRMTJkxg3Lhx9OnThy5dupSb0qrn6uqKQqEgMbFsbdCkpCT27t1rmD4+cuRIdu/ezahRoxgyZAjW1tYcP36cjRs3UlJSwsqVK3FwcKCkpITCwkKuX79e4XnuxcLCgtTUVCZOnEjLli3x8PDAxMSEnJwcEhISyMzMZMCAAfTu3fu2+hMTE9HpdNy4cYMVK1YYGgANHjyYL7/8kuDgYMaOHYuLiwspKSlERkZy9uxZZs2aZVgzEso6oFckOzubwsJCzMzMGDFiBCNGjKjy9dU2SqUSlUqFv78/MTExREREEBsbW66Td21slOPq6kp4eDjJycnk5eXh7OxMo0aNUCgUXLhwgV27dvHMM89w9uxZtmzZgo+PD5MnT8bX1xdzc3Pmz5/PV199xY0bN8o1yRFCCCHgrzDxXh/iqdVqQkNDgfLNbWbPno1GozH8DRZCCCFqi1rXlftONBoNqampaDQa4uLiAEhNTTVyVeJWfn5+hv83bdq01oUTNd2LL77I7t27mTBhAtOnTzfcfv36dfz9/bG1tWXjxo2G0Y4ViYuL49133yUpKQkAExMTPDw86Ny5M+PHj6dt27aGbZ988kkSExNxd3cnJSUFFxcXIiMjDSMwFy5ciFqtNoSXLVu2ZPjw4YwZM4b69esD8Oabb7J27Vr2799f7YYhu3btYs2aNURFRZGdnU1xcTFmZmZ4eHgwYsQIxo8fb5iarvfee++xZMkSXF1dycjIQKvV8vPPPxueo3v27GHu3LkcOnSIwsJCmjRpwoABA5gwYQItWrQAYPXq1YSEhLB06VIef/zx2+oKCgoiPj6eX375xXDcv6OKOnn7+/szYsSIWvvmSafTGaZknzhxgrlz5zJp0iTq16/P8OHDGTBgACEhIbi4uKDVatm1axc//PADH3zwQbl/Q9U5nxBCiL+n0NBQw9/S5cuXl/sb2rx5cwCioqLk9bUQQohapc4Ek0IIOHr0KCtXruStt966bRry/Pnz+eijj2jVqhUzZsygW7duFBcXEx8fz+rVqw0drXv37o1Op+PAgQPs37+fQ4f+v717D4iyzvc4/h6ugoKoIIpD3k3Ba1oOUR3bSsvSLB3RLN2OWe1u5RC0dcpObrV2EYWycsvKyjpBg10sLbEyTQS3zNLCtNSUB29oCiqIXOb8wc4EigoKDJfP6x91eOZ5vvOICJ/5/n7fDfz666/k5uby1FNPVerEXLNmDX/7299o3749o0eP5rbbbnMFjhU5HA4cDscpA3QACgoK2Lx5M4MHD679G3IGe/fu5ZZbbuHIkSNcd911TJkyhe7du1d5bFlZWZW1Q/k9iI6OrjI02rBhA+PGjcPf358ZM2Zw/fXX4+vry+bNm1myZAnLly9nzJgxxMXF1epra6hOt39WU9iH8tixY3h6enL06FEee+wxNmzYwMMPP8yVV15Jy5YtOXHiBAUFBXh5eZ3yb/N09u3bR25uLrt27aJjx47k5uYSHBxMUFAQx44dIzIykqKiIlq0aKHQUkSkias4efvk1UbOwFJbI4mISGOkYFKkGXnxxReZO3cuJSUlmEwm16Rqk8nEwIEDeemllwgLC3NzlU3LqlWrmD59OocOHQKodN/DwsJISEhw7SvVXBiGUWmZN5x5g//GpLi4mMWLF/PPf/6TwYMHM336dPr163dKx+7ZlJSUsHLlSv73f/8Xb29vjhw5Qps2bSgsLCQoKIiAgAAKCwu5/vrrmTZtmpaGi4g0UYZhEBcXd9pQsuL+lOqWFBGRxkjBpEgzs3//fpYuXUpOTg4tWrSgZ8+eXHbZZa7l11L7CgsLWbp0Kdu2baOkpIQuXboQFRVFt27d3F2a2zkneVcMKKOiorDZbI32h6vCwkJWrFjBG2+8wU8//UR8fDwTJkyo0R6qRUVF2O12Hn30UVq3bs3EiRMpKyvjwIEDtGvXjuzsbI4dO4aXlxe33HILV199dR2+IhERcYeTQ8n09FP30Xd2Utpstkb/5p6IiDRPCiZFRMTtqlrmbbVaG+0+lAcPHuTLL78kNTWVwsJCrrzySm699VZCQkKqfY6srCxmzJhBXl4e9957L1dddRX+/v4UFBQQEBDA999/z6FDhxgwYABt27atw1cjIiL1rWInpMViISUl5ZRjKk7n3rlzZ73WJyIiUls8Z86cOdPdRYiISPMWGBhIVFQUVquVwMBADMMgMzOT1NRUUlNTyc/Pb1QBpb+/P5GRkfTr14/s7Gw8PT3p27dvjQY8BQcH06JFC77++muys7MZMGAAoaGh+Pr6AtChQwe6du2Kn59fXb0MERFxg4yMDEaOHAmUv0m3YMGCKo+Lj4/HMAwSEhKIjIyszxJFRERqjTomRUSkwTnTPpRWq7XRLfP+/fffz9rVWFJSQmFhYaUl37m5uSxatIilS5cyYsQI/vKXv9RoSbiIiDQudrud+Ph4gDMuz3Z2S55uibeIiEhjoWBSREQaNMMwSEpKwm63A3/sQ9lYl3mfzoEDB3jnnXcwmUxMnDjRtey7rKyMWbNm8e677xIbG8vEiRNp2bKlm6sVEZHalpiY6NrSJCEhAavVetpjnXtLJicnN6n/C0VEpPnxcHcBIiIiZ+KcQpqeno7NZnPtRzlhwgSio6NdgWVT8OGHH7Js2TJ++OEH1/R2h8PByJEj6dy5M1999RUlJSVurlJERGpbfHy8K5RMTk4+Yyhpt9vJzMzEYrEolBQRkUZPe0yKiEijUNU+lIZhkJaW1ij3oTxZWVkZoaGhrF+/nmXLllFWVkZwcDBt27YlNDSUffv2sXHjRoKDg+nduzcAxcXF5Ofnk5uby65duwgNDXXzqxARkZqKiYkhLS0Ns9nMK6+8ctb/y+68807y8/NJSEggPDy8nqoUERGpGwomRYSCggKeeeYZPDw8MJvNmEwmd5ckclrOgHLEiBFERESQn59PVlaWa1hOfn4+4eHhNRo00xD4+PhgNptxOBykp6ezfft2Nm7cyG+//caJEyfYsmULn3/+ORMnTsRsNpObm8sHH3zAO++8Q2pqKsuXL+fTTz8lJCSEzp07u/vliIjIWRiGwbRp08jMzMRsNpOSknLWITZ2u53U1FQsFstp958UERFpTLTHpIiwcOFCnO9RBAQE0L17dy6//HJuvvlmunXrdl7n3r59OzfddBPLli2jU6dOtVCtyKma2j6Uubm5LFiwgIULF2I2m8nPz6ddu3b069
ePSZMm4eHhwb/+9S9WrFhBZGQkl156KYZh8OuvvxIeHk5cXJyrq1JERBoewzCIiYnBMAwsFgspKSnVfl5cXBw2m61R/v8mIiJyMgWTIsLChQuZN28evXv3JjQ0FMMw+OGHHygqKuKiiy5i/PjxxMTE4OFR821pN2zYwJgxY5g9ezbjx4+v8fPffvttOnbsyFVXXVXj50rz49x/MjU1FcMwgD+meTfGzpKDBw+ybNkyiouLad++PZdeein79u1j1qxZlJWVcezYMfbv3899993HDTfcwKxZs0hOTmbRokUMHToULy8vd78EERE5iXOiNlCjUFJERKQpUjAp0sxt2LCBcePGcfvttzNjxgzX47///rsr4Ni9ezfDhg1j/vz5+Pv71+j869ev5+abb2bKlCmMHDmSPXv2UFxcTI8ePbjooovO+NwTJ07Qt29fSktLefPNN7nsssvO6TVK82MYBhkZGaSmppKZmQk07i7KkpISPDw8KCkpYfny5Tz66KM89NBDdO3alddff519+/Zx9OhRfHx88PX15e9//ztDhw6lrKyMvLw8jh8/TlFR0Xl3QIuIyPmpGEparVYSEhLcXJGIiIh7KZgUaeY2b97MtddeS9++fVm6dOkpHy8tLeWJJ55g4cKFTJs2rVJ4eTo5OTksW7aMXbt2sWnTJjZs2FDp4yaTiSuvvJKFCxee9VwbN27kjTfeoEWLFsyaNavar6u4uBhvb+9qHy9N18nLvKE8pLTZbGecetoQORwO5s6dy+LFi3nwwQcZPXo069ev57333uPAgQN4enoyadIk+vXrx7Zt21i3bh1btmzhwIEDhIeH06tXL4YNG0bPnj3d/VJERJodu91OfHw8ADabrVF28ouIiNQ2rfESaebat28PQKtWrTAMg0WLFjF69OhKm687h4gcPXr0lOevW7eOVatWERAQwI033khYWBgPP/wwX331VaXj+vTpg9VqpX///vTp04dWrVq5PuZwOHjttdf45ptv8PHxoX///tx0000EBwfTv39/5s6dW2Xtu3bt4oMPPqCkpIQ//elPDBo0CIDMzEwmTZrEc889xw033HDK8+bNm4e/vz9Tp04967mcli9fTlpaGocPH6Zfv36MGDGCPn36nOnWSgNhNptJSEjAZrNVWuYdHx9PUlIS48aNw2q1Yjab3V3qWZWWltKlSxf27dvHgQMHKC0tZciQIQwZMoTs7GxatmyJn58fv/76K7Gxsfj4+DB06FB69erF+vXrOXz4MC1atMBsNuPn5+fulyMi0mwkJiaSlJQEQHJycqPr3BcREakr6pgUaeb27t3L0KFDufLKK4mOjubJJ5/EZDLRq1cvAPbs2UN+fj6DBw/mlVdeITg4GICioiKmT5/Op59+6jqXv78/ixcvZtu2bXzxxRcMGjSI0NBQ7rrrLuLi4rjvvvuqrOGJJ57g1VdfrfRYmzZtmDNnzmn3lpw3bx7PPfccxcXFrseefvppJk6cSGpqKnFxcYwfP57Zs2dXet727du5+uqriYmJ4amnnjrruaA8lLz77rspKytzfdzLywubzcY999yjKeaNTGNf5v3DDz/wz3/+Ex8fHx5//HE6d+6Mh4eH6/OwuLiYhx56iIMHDzJ27FiuueYafHx8OH78OE899RSbNm1i9uzZdO/e/Zz2jRURkZqJiYlx/X+jUFJERKQy/UQi0sydOHHC9ftrr72W8PBwHA4HW7ZsoaCggEsuuYQ5c+aQmprqCiUdDgd//etf+fTTT5k8eTJr1qxhwYIFFBQU8NZbbzFq1CiSkpKYMmWKK1isqtsSYPfu3SxcuJALLriAVatW8dtvv/Hll19yzTXXcP/99/Pjjz8SFxdHSUmJ6zkLFiwgISGBiy++mCVLlvDZZ58RFhbGyy+/DMDIkSMJCQlh69atp1wvKSkJPz8/V0h6tnM5HA6efvppPDw8eOedd9i2bRvr16/HZrOxaNEiPvroo1r4W5D6ZDabsVqtpKSkkJ6ejs1mcw3NmTBhAtHR0SQmJrq7zNMaMGAAkydPJjc3l7vuuouUlBSysrL45ptvOHHiBHl5eWzfvp28vDzCw8Px8fHBMAwWL17Mpk2b2L9/Pzt27DgllDx48CBbt27l+++/rxTCi4jIuXOGkmazWaGkiIhIFbSUW6SZcwaTZWVlhIeH88knn3DvvfeyevVqLrzwQl544YVTlnwuX76czz//nAEDBjBmzBi8vb35+eefAWjZsmWlY729vTGZTBQUFFR5/ZUrV1JaWsqMGTPo0qULAN27d2f27NnMnj2bH3/8kdTUVGJiYrjkkkvYu3cvs2fPJigoiNtvv50OHTrw3XffkZeX53q+v78/VquVN998E4fD4eokW7t2LUuWLGHWrFl07NixWufasWMH27dv5/bbb3cN3wkODubee+/l3nvvPe/7L+5lNpuJjY3FarVW6qJMSkoiNTW1wS7zjoqKoqioiNWrV7NgwQKOHz9Ohw4dKC4u5pFHHuG//uu/+PDDD3n66adp164dJ06cYM2aNQQFBXHgwAEcDofr30ZpaSlbtmzhgw8+YPny5ZjNZrp168aYMWMYPHiwOoJFRM6BYRjExcW5Qsn09HR3lyQiItIgKZgUaeZKS0sBXMFhUFAQb775Js8++yzz58/HarXy6quv0qFDB+CP4Rvt27fHMAxuvvlm17lCQ0OZNm1aled3/gpQWFjIjh07iIiIIDc3F4ALLrigyvpat24NQFZWFpdccgkvvvgixcXFdOnSpdK1vL29efDBB11/HjZsGC+99BLr1q3DYrHw+++/Ex8fj8VicS3Rrs659u/fD0Dnzp2rd0OlUXJ2UVqt1krDcpKSkkhKSsJisbhCyoagXbt2jB07lj59+mAYBvv27WPdunUEBQUxcOBA+vXrh4eHB8uWLSMwMBB/f39iY2PZunUrK1eupKSkxBU4ZmdnM3fuXDZt2kTv3r3Jy8tj6dKllJaW0rlzZ0JCQtz8akVEGhfDMIiOjgbAYrGQkpLi5opEREQaLgWTIs2cc3l2Tk6O6zEPDw8eeughIiMjeeCBBxg1ahTvvvsuPXr0YPv27WzZsoV//OMf3HTTTSQnJ7N161Z69+7NzTffTLt27Sqd39PTkzZt2nDkyBHXY8nJycyePZusrCzy8vIAOH78+Bnr++mnnwBYtmwZw4cP58UXX+Sjjz4iMzOT0NBQRo8e7doXE2Dw4MGEhIQwZ84c5s6dS3x8PIcPHyY5OdkVyFTnXGerT5qeqoblZGZmujopx40b12AmqUZERBAREUFRURGTJk0iLy8PX19fPDw8uO+++5g6dSp79+6lW7dulJSUkJGRwU8//UR2djZlZWXk5uayYsUKvvvuO26//XbGjBlDSEgIn376KS+//DILFy5k2rRptGnT5ox1lJSU4OWlbylERDIyMpgwYQKgUFJERKTuADrKAAAeI0lEQVQ69FOESDPXtm1bvL29q9wDctSoUfTo0YNbb72VSZMm8cUXX7iCuqKiIlq3bs1dd9111muEhYWxZcsWHA4HR48eJSUlhYiICADX3
pHOzsmT+fn5ERAQ4FpynpeXR1FREV5eXowdO5axY8dW+TwvLy/uv/9+/ud//ofLLrsMk8nE888/X6kzszrnOlt90nSdbZl3QxqW4+vrC3BKgNiyZUu6d+8OlHcC9+rVi/bt27NixQpuuukmtm7dypIlS7jiiisYNmwY4eHhQHnH8a5du1i3bh2FhYW0atUKwzDo2LEjDoej0vYODoeDvXv34unpSUhIiAJKEWm2KoaSNputwbyJJSIi0pBp+I1IM2cymejduzdXXHFFlR/v06cPycnJlJaWkp+fT8+ePQkICOD111/nwIED1brG4MGD2bp1K5deeimDBw/ml19+IT4+HoBBgwYBfwQrVRkzZgz9+/d3nWv16tWu6ZZncssttzB58mRCQkJITExk9OjRp9R1tnNFRETg6+t7xvqkaavOsBy73e7uMqslODiY4cOHc/z4cdauXcv3339PdnY2/fv358ILL3Qd16ZNG6ZPn87o0aNZvnw5Dz30EI899hg33ngj7777bqVhVNnZ2XzwwQd88sknCiVFpNlKTEx0hZIJCQkKJUVERKrJ5HA4HO4uQkTcq+KAmOocs2DBAp588km6d+/OE088wcUXX0xxcTHffPMN77//PmvXriUxMZHLL78cgL1793LLLbdw5MgRrrvuOqZMmeLq4gJYs2YN0dHR1RqysWHDBsaNG4e/vz8zZszg+uuvx9fXl82bN7NkyRKWL1/OmDFjiIuLq7VzXX311XTt2pXAwMCznlOaB2cwmZqaimEYQHmA2VCH5VR0/Phx1xsNs2bNori4mNtuu821H1pFsbGxZGRk0LFjR0pLSykqKuLw4cPcfPPN3Hvvvfj5+bF+/XoefPBBZs6c6fo3LyLSnMTHx7veoNLkbRERkZpRMCki5+TFF19k7ty5riEazi8lJpOJgQMH8tJLLxEWFlYn1161ahXTp0/n0KFDrms6rx8WFkZCQkKVIUtdn0uap4yMDBYvXuz6odRsNjeoZd6nU1JSwt69e8nNzSUyMhIfH59KH7fb7cyZM4chQ4YwefJkhgwZwtdff81jjz1Gnz59mDFjBj4+PrzzzjusWrWKt956i4CAADe9GhER94iJiXFN3k5ISGjQX/dFREQaIgWTInLO9u/fz9KlS8nJyaFFixb07NmTyy677JQBOHWhsLCQpUuXsm3bNkpKSujSpQtRUVF069bNreeS5svZRZmUlOR6zNlF2VCX9JWVlWEymarsVp41axYff/wxDz/8MCNHjsTT05N9+/YxZ84cCgsLmTdvHjt27OCOO+7AarUyYcIEgoKC3PAqRETqn2EYxMXFKZQUERE5TwomRUREapFhGJWG5UDj6aKs6OOPP+bxxx9n2LBh3H///a7BN0ePHmX37t106tSJlJQU3nvvPdeWDtXZjkFEpLGrGEpq8raIiMj5UTApIiJSRwzDICkpqdJwHIvF4tqLsiHbsmULTz75JLt37+bPf/4zo0aNIigoyLXf7P79+7ntttu44ooruOaaa2jZsiU7d+6kffv2DBkyxN3li4jUCcMwXFu8KJQUERE5f54zZ86c6e4iREREmqLAwECGDx+O1WolMDAQwzDIysoiLS2N1NRU8vPzCQ8Pb5CDldq2bUtUVBRbtmxh4cKFHDx4kO7du9OmTRsOHTrEhx9+yPLlywkKCuLHH39k3rx5rFmzhrVr19KrVy86duyoDkoRaVIyMjIYOXIkAFarlQULFri5IhERkcZPwaSIiEgdCwwMJCoqihEjRhAREUF+fj5ZWVlkZmaSlpbG5s2bCQgIIDw83N2luphMJnx9fQkJCaGkpISNGzeyceNGWrVqhclkYvbs2eTm5nL8+HGCgoKIiooiODiYn376iUGDBtG7d28FkyLSZNjtdu68804AbDYbjz32mJsrEhERaRq0lFtERMQNGtOwnNzcXJKTk/nwww+ZOnUqpaWlzJ8/n65duzJt2jQGDx6Ml5cXb775Jm+99RbTp08nJibG3WWLiNSKxMRE19fqhISEBr8Vh4iISGOiYFJERMSNGtOwnL179+Ln54fD4WDjxo1ccMEFdOrUCW9vb44dO8bixYt5+umnWbZsGZ07d1bHpIg0ejExMa6vzcnJyQ3qa7KIiEhToGBSRESkgWgsXZRlZWV4eHhUeqy4uJgnnniCnJwcHnjgAXr37u2m6kREzl/Fydtms5mEhASFkiIiInVAwaSIiEgD05i6KJ2OHTvGCy+8wOHDh4mPj6ddu3buLklE5JwYhkFMTAyGYWA2m0lPT3d3SSIiIk2WgkkREZEGzDAMkpKSsNvtrsfMZjM2m61B7XNWUlKCYRjs2bOnQQanIiLVkZGRwYQJEwCwWCykpKS4uSIREZGmTcGkiIhII+Bc5p2amophGMAfy7ytVitms9nNFZYv5zaZTHh5ebm7FBGRGrPb7cTHxwNgtVpJSEhwc0UiIiJNn4JJERGRRiYjI4PFixdX6qK0WCyukFJERGpGk7dFRETcQ8GkiIhII9UYuihFRBo6Td4WERFxHwWTIiIijVxjHJYjIuJumrwtIiLifgomRUREmhBnF6VzSSL80UUZGxvrxspERBqOipO3NeRGRETEfRRMioiINEHqohQRqZomb4uIiDQcCiZFRESaOHVRioiUqzh522az6WugiIiImymYFBERaSbURSkizZkmb4uIiDQ8CiZFRESaIcMwSEpKwm63ux4zm83YbDb9sC4iTY5z8raG3IiIiDQsCiZFRESaMecy79TUVAzDAP5Y5m2xWPTDu4g0apq8LSIi0rApmBQRERGgfCDE4sWL1UUpIk2CYRhER0cDGnIjIiLSUCmYFBERkUrO1EVptVoxm81urlBE5Mw0eVtERKRxUDApIiIiVdKwHBFpjDR5W0REpPFQMCkiIiJn5eyidE60BXVRikjDo8nbIiIijYuCSREREam2+uyizMjIwDCMBhcs2O121xL3zMxM130QqchsNmM2mwkPD2fo0KGufydSNzTkRkREpHFSMCkiIiLn5ExdlLWxdDI6OhrDMBrMUsyMjAzi4+NdoWRF5rDWbqhIGjJjd94pj1mtVmw2mzqMa5lhGMTExGAYhkJJERGRRkbBpIiIiJyXuuqirDhR153hZMVOLCgPIceN6k/U4M6Yw4IUSkqVnMGksfswxp48El9ejbE7r1bDe9GQGxERkcZOwaSIiIjUGsMwSEpKwm63ux4zm83YbLZzWpJdsRPKarWSkJBQm+VWS0xMTPny0P8EkrF3XVHvNUjjZ+zOw/7xDyS9/DWg/Q9rQ8X9JBtKZ7WIiIjUjIJJERERqXXOZd4V92A81y7Kk5dppqen11XZp6jYjbXzu0fq7brSdBm784i+4YV6/1xuapxvGAAkJydr6baIiEgjpWBSRERE6lRtdFGeHE6mpKTUyz59zn0u5/xjFONG9a/z60nzEPfYx6R+vNFtXcCNmYbciIiINC0KJkVERKReOLsoU1NTXQNknPvtWa3WswaNFQPO+ggn7XY78fHxWAZ3JmXBrXV2HWl+jN15xNy5CGN3Hunp6RqG
U03aT1JERKTp8Zw5c+ZMdxchIiIiTV9gYCBRUVGMGDGCiIgIWrdu7VrqnZaWxubNm8nLyyMyMvK0z4+IiCAwMJC0tDTS0tKIiIggPDy8TupNS0sjMzOT2LuvIOLC0Dq5hjRPgQEtSFu5FWNPHsOHD6+zz+GmJDExkfj4eKB8P8k5c+a4uSIRERGpDQomRUREpF4FBgYSGRnJ8OHDsVqtBAYGkpWV5QooU1NTyc/PJzw8nMDAwFOe63zcGRzm5+fXyVLOxYsXk5WVxYgrL1QwKbXO2HOYzPW7sFgspw3jpVxMTAypqalA+X6SGhokIiLSdCiYFBEREbc5uYsyPz+/Uki5efNmAgICKnWUOZ8D5V2NhmHUSTj52muvYRgGj8UPJzCgRa2eW8QEpH68kYiICO2ReBqGYTBt2jTXfpKvvPKK7pWIiEgT4+HuAkRERETMZjNWq5WUlBTS09Ox2WyuPSknTJhAdHQ0iYmJlZ4TGxvrOi4pKemUj58v1z6YYa3P6zxlZQ4W2ddTXFxaG2VJE+OcLC2VZWRkEB0dTWZmJhaLhfT0dIWSIiIiTZCCSREREWlQzGYzsbGxpKenk5CQgMVicYWP0dHRxMfHk5GRAZSHk86pxrUdThqGcd6hJMChvAJmPPUZ6f/+7fyLkibDHBbk7hIaLOcbEoDrDQsRERFpmrzcXYCIiIhIVZxdlFar1dU96ZzK7ZzMPW7cOGJjY4mKiiI6OpqkpCSgPLBsKByO8l8LCovdW4hIIxAfH4/dbgcgISFB+0mKiIg0ceqYFBERkQbvbF2USUlJJCQkYDabSUpKIiYmxt0lu5SWllX6VUROZRgGMTExrjcdNORGRESkeVAwKSIiIo3GyXtRVuymjI+Pd+0LmZmZSXR0tOvP9aHoRAm79+af8nhpaXnLpL+/d73Vcr5ipr3Nhk05tX7e092j2lZX9Z/JPxO/4I3kb+v1mk2F9pMUERFpvrSUW0RERNyq4r6QZrO5yt8DrsnczsfNZjMJCQnYbDbsdjupqamVgkhnB1ZKSsop56otq9Zu46WFGWzM2oO3twedzW34+O3/rnRMyX86JVv6+9ZJDXUhc/1OPlj2I4P6dTrvc1XnHtW22qy/ujZm7WZVxjb+PGFIvV2zKUhMTHRtwWCz2RrUNgwiIiJS9xRMioiIiFs5Q4lzcbbAsS7DyflvZPDMvJVc0CmIq67owYqvtrIxaw/rfzAYPOCPa5WUlAeTfi0ax7ddZWXlHZ45e/LYu/8Ixu48Co8X0zE0gB5dg2t0rureo9pUm/XX7LqQe/AYuQePkbMnj6PHivD382FQvzBMJlOdXbexMgyDuLg4MjMzXW8yqEtSRESk+Wkc3yGLiIhIk5WcnOzqdHT+mpNTeRludna26/cnd0WejXMvSuf07tqQ9tVWnn7+S66M7s7Lc8bh6+PF0hWb+euD7/PKokxeHjDOdWxxcSkA/n4+tXb92lZaWsbnq39hV85hftv1OwCfr/6Fz1f/4jqmXRt/1n9uq3bIVpN71BDrr45/b9jFz7/kkp1zmF9/O8DvhwoYcs0fQbuXlwcfv/3fRPQKrbVrNgXOrRcALBaLpm6LiIg0YwomRURExK1qq0vq5JDSGWYahlGrQzTKyhw8M28lF/YI4aVnb8bXp/zbqRFXXkhLfx+2/Jpb6fiCwhMAtPSveTB5oriUlWt+ZdnnP/P9j7vZf+Ao9999BdNuG1qr53rrvfXMnJ1W6fg2QX7ceG0kAyLDiOgVSo+uwdUO9Wp6j85XbddfHZu37sM6dVGlx0wmE2Nv6MeAyI707d2RPr3a49ei8ewtWtcqdkmClm6LiIiIgkkRERFpIk5eql1X+0qmfbWVX3cc4NVEa6UuSC8vD+6ZGs2+3COVji8oLAYqB5PHCk4w/40MUj/eyO+HCujUsTVXXd6D+6ZdRmBACwAWLFrHcwu+5sjRIkwmExG92mMZfAHt2vpXWdcv2w/w7AsrWbPuN9q28eO/J17C1EmXVOtcA/uGETWkM337dGBgZBgPz/qUiF6h/OPvI+rlHjkcDuxLNvLqO+vYsfN32rbxJ2pIZ/7239H07Fa+/Dr34DFWZ2znppF9OV5UzF/+/j5bt+XyauL4c67/q/RtzJm/iq3bcunepR0P3fcnrojq5vr49z/u5p+JXzB/9s0Et23peryszMEPP+2hfXArPD1NxN71X7y/dBPffJ/NnH+MOqd71tRlZGQwYcIEAC3dFhERERcFkyIiIiI1sPab3/Dx9mTYpd1P+dhfb7/0lMeOFVTumMxcvwvbjI/Ysy+fgFa+jL9xAPsPHOXt1O9YvnIL82ePpW/vDix4ex1HjhbRPrgVSxbdTsfQwNPWtGrtNu6ItePl5cmw6G5kbdnP43NWENYhkOuu6n3Wcw3q14nkV251/fmF19dy4j9L0M9FTe7R7r35/O2h9/luYw4eHiZuuCYCX18vVq3dxvKVW3lqxnWMua4v336fzf3/u4RLBoUz+8Wv+Cp9GwCPPv0Z7y+cUuP6E19eTdLLX9M+uBXDh13I6szt3BFrZ/WSv9KhfQAAO3b9zr837OKOWDspC27F18eLtd/8xmPPprF1Wy4XDwrngb8NY+hFF7D5l31krt9JaWkZnp4e53zvmpqTuyStVmutbqsgIiIijZuCSREREZEacA5Xqe6y4ILCE/j6eOHlVR5WrVi1lT378ulsbsOilybS2dwGgAO/HyP20SVYpy5i7dJ7mPfUGGIfXULOnjwm/eX/iP/rMK676sJTrrsr5zB3xqXSq3sIrz83ntCQADZsymHMlDfYtHkv113Vu9rncvLx9nQN7TkXNblHP23Zy3cbc2jh68W/EsZxZXR5mHmiuJQ5L61i+iMf0aa1Hx4e5ed66MllrFm3gykxQ9h/4CiffbmFg4cKaNfmj07Ss9X/0Wc/kfTy14y9oR/PPHo93t6evLRwLc/MW8nWbbmuYHLMdZF8vvoXPknLYvojH9GujT9vp35Hn57t+b9/TSL6ki6VrglQWubA07Nm96upqjhxW12SIiIiUhW9nSsiIiJSA1df0ZMTxaUseHtdtY4/VnACH58/kqqAlr4APPk/17pCSYDgti25Z2o0BYUnWPvNbwy96AK+fP9uZtx/NQB/+ftirpv4Gh999hOlpX+Ebs/MW0lxSRk//7qfyfckc8vd7xAz7W28vDwYeXVvgGqfy6mszOEKF6F8qfXmX/ZX9xbV6B4FtCq/H1MnDXWFklAe9D3wt2F4e3vy+epfyT14DIA163YwfFgvZj4wnNsnXozD4WBn9qFq1190ooSnn/8ST08Pln3+MzF3vs1Nf36DZ1/4ik4dW1eaFm4ymZj7+Cj69u7Ap1/8zNup3xEzZgAfLbq9UijpvCZQ6X5u+TX3vALexiojI4Po6GhXKGmz2UhPT1coKSIiIqdQMCkiIiJSA8Oiu3O5pStPP/8lTz33pWu4zek4HHD02AlXQOXrW75gpdV/Akqn4uJS3kz5Fr8W3vTu2R6AFr5eTLt1KF++fzcLn4/B38+b+x7+kJG3vMbuvfnsyz3CJ2l
Z3DM1mtcSxxMWGsieffn86fIevLfgNvr27uA6/9nOVVG7tv7kHznu+nP6v3/j2pgFrnCwNu+RczBOS/9Th8S8mfItxcWlDOwbxp595TX26RXK87PG4OFhYmDfMLy8PPh1x4Fq1//Zl1vYvTefN+dN4JHYqzCZyv9+Jt40kPcXTjllSFHmtzvZuu2PYT3XXdXHVXNFwe3K96B0XrfweDHXT3qNZV9sPuO9akoMwyAmJoYJEyZgGAZms5nk5GQNuBEREZHT0lJuERERkRpa+FwMM57+jFcWZbLIvp7hw3rh4WEiZ08+xwpOMHn8YMbfOAAoDwQdDgfbdx6kV/cQRg2PYM78Vdz3yIdMHj+Y4LYt2b7zIJ+kbSZ792FeTRxP9y7tKDxejHXqIiaPH4x1dH/+dFkP/nRZD1ZnbOdvD32AbcZH/P2eKwHoGBrAsOjuDIs+dU9HoFrneu/V21zHh4cFsWbdDo4VlC9Dfzv1O0JDAgg+zeCd87lHA/uG0a1zW154bS0FhcX06BrM/gNH+XzVL/x7wy7umRrN2Bv68cisTwF4/MHhrknXvj5e9OoWwr83ZLvu99nq32WUd1eaw1pzuaUrt1kHV1l/SUkZC5O/4dl5K+nWpR3Tp13Osy+s5M777UyOGcKdtw0lNCTAdby5Y2sAfv5lP6EhAfzf4g2UlJQReWGHKs/flBiGgd1ur7Rs22azYbVa3VyZiIiINHQKJkVERERqyNvbk2cevZ47b7Pw2jv/ZmPWHrbvPEhQaz+GDDAzqF8n17EX9e+Ep6cHy1duoVf3EMxhrXllzjhefD2d2S98xYniUjp1bM3Iq3pz+8SL6fSfgMvT04NjBUU88I9PmP/GWiIu7ICXpwf7Dxzl6LEi9uUepXuXdrT092HBonWMGh5Jq5Y+VdZbnXNVdFH/Tiyyr2fYmPmcKC7lcF4h858dW+19NWtyj0wmEwvmWnn2ha9467315B85Tts2/lx2SVc+fPPPruOm33k5F/ZozyWDLqh0nTtuHcqKVVurXf+AyDAAEv+1mudnjTlt/Xc/sJgVq7YyIDKMd+bfQkArX6Iv6cL0Rz7kbft6BkaGMWpEhOv4gX074eFh4i9/f5+g1n7k7MljSswQundpV+171thkZGSQlJTkGmxjNpsZN26cOiRFRESk2kwOh8Nx9sNEREREmpfOnTtjDmtN+if3nPe5vv3eoH1IKy7oFHTKx8rKHK7BLicrOlHCu+9/z+JPNrLTOMTx4yW0CfJjWHR37pg0lJ7dgl1DW/r16cjMB4YzsG8Yx4tKSP/3DhZ/solvv8/G/tpkzGGtz3oup5KSMv58XzI//LSHP13WgykxQ7iof6cqa6xtZ7of1XW2+qfcm8xX6dsYPSKSuL9ewQWd2nDg92OsWLWV95du4uDvBfTsFsxO4xApr9xGmyC/al034aWveO2dbxg8oBPjbxzAqOERpw1zjd15RN/wAhaLhZSUlPN6vfXJ2R2ZmpqKYRiAAkkRERE5dwomRURERKpQm8FkXZv3ajpJr6ympKQMk8mE89s7T08PLrd0Zd6sMQQGtHBzlQ1H4fFiYh9dwqdf/AxQ6Z75tfDmlrGDeMR2FVB+D+tCYwomnWFkZmamqzsStGRbREREzp+WcouIiIg0cvfeEc34G/vz6Rdb2Jd7hJb+PvTsFkzUkM4KJKvg18Kbf80eS9bWfazO2M6hw4UEt2tJ394dGDLAjLe359lP4gaJiYnk5ORUeqxTp06YzWbM5vJp4uHh4a7fnytnEAlU6oyEP7ojrVbreV9HRERERMGkiIiISBMQGhLAnycMcXcZjUpEr1AieoW6u4xqOzkkPBNnaGg2mwkPDwdg6NChpzy/YtCZnZ1dqSOy4rnGjRuHxWIhKirqXMsXEREROYWCSRERERGRRiAlJYXs7OxKjxmGgWEY5OTkuD7mfMz5e2fY6OyCPBNn96XFYsFsNhMVFaXOSBEREakzCiZFREREqmA2m6vdnSZSU5nrdwK4uhmro+KS7epwfv5mZ2dXCiqd56r4K6BuSBEREal3CiZFREREquAMJjO/3YllSGd3lyNNVKdOdTftvKrwUURERKQhqZsxgyIiIiKNnMVicXcJ0oRlfFveManQUERERJozBZMiIiIiZ5DxnyW3InVBy6dFRESkOVMwKSIiIlIFZ8dk6scb3VyJNDXG7jx9XomIiIigYFJERESkSlFRUVgsFozdecQ99rG7y5EmxPn5ZLPZtJRbREREmjUFkyIiIiKnMWfOHKB8grI63KQ2ZH67k8z1OzGbzcTGxrq7HBERERG3MjkcDoe7ixARERFpqOx2O/Hx8QDY7roc66gBmMNau7kqaWycnbeZ/9mzNDk5WftLioiISLOnYFJERETkLBITE0lKSgLAHNYay+DOAEQN6Yy5Y2vMYUHuLE8aIGP3YdfgpIp7SprNZhISEhRKioiIiKBgUkRERKRaDMPAbre7AkqRmrLZbFq+LSIiIlKBgkkRERGRGjAMg+zsbAzDYN26dQBkZ2e7uSppaMLDw+nUqRNmsxmz2awOSREREZEqKJgUERERERERERGReqep3CIiIiIiIiIiIlLvFEyKiIiIiIiIiIhIvVMwKSIiIiIiIiIiIvVOwaSIiIiIiIiIiIjUOwWTIiIiIiIiIiIiUu8UTIqIiIiIiIiIiEi9UzApIiIiIiIiIiIi9U7BpIiIiIiIiIiIiNQ7BZMiIiIiIiIiIiJS7xRMioiIiIiIiIiISL1TMCkiIiIiIiIiIiL1TsGkiIiIiIiIiIiI1DsFkyIiIiIiIiIiIlLvFEyKiIiIiIiIiIhIvVMwKSIiIiIiIiIiIvVOwaSIiIiIiIiIiIjUOwWTIiIiIiIiIiIiUu8UTIqIiIiIiIiIiEi9UzApIiIiIiIiIiIi9U7BpIiIiIiIiIiIiNQ7BZMiIiIiIiIiIiJS7xRMioiIiIiIiIiISL1TMCkiIiIiIiIiIiL1TsGkiIiIiIiIiIiI1DsFkyIiIiIiIiIiIlLvFEyKiIiIiIiIiIhIvVMwKSIiIiIiIiIiIvVOwaSIiIiIiIiIiIjUu/8HP+5u075hZbwAAAAASUVORK5CYII=)" + ], + "metadata": { + "id": "AwhxwHTf4VZp" + } + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "1mr4fXemsYci", + "outputId": "dd51d890-7da2-45ec-b14f-d96169bb8bdf" + }, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m973.5/973.5 kB\u001b[0m 
\u001b[31m10.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m320.6/320.6 kB\u001b[0m \u001b[31m38.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m18.9/18.9 MB\u001b[0m \u001b[31m71.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m84.1/84.1 kB\u001b[0m \u001b[31m11.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m308.5/308.5 kB\u001b[0m \u001b[31m38.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m122.8/122.8 kB\u001b[0m \u001b[31m15.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m75.6/75.6 kB\u001b[0m \u001b[31m11.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m22.8/22.8 MB\u001b[0m \u001b[31m58.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m542.0/542.0 kB\u001b[0m \u001b[31m46.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.1/1.1 MB\u001b[0m \u001b[31m51.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.1/2.1 MB\u001b[0m \u001b[31m48.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m71.1/71.1 kB\u001b[0m \u001b[31m8.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m77.9/77.9 kB\u001b[0m \u001b[31m10.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m58.3/58.3 kB\u001b[0m \u001b[31m6.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m53.0/53.0 kB\u001b[0m \u001b[31m8.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m142.5/142.5 kB\u001b[0m \u001b[31m20.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m98.7/98.7 kB\u001b[0m \u001b[31m13.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m116.3/116.3 kB\u001b[0m \u001b[31m18.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m194.1/194.1 kB\u001b[0m \u001b[31m22.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m134.8/134.8 kB\u001b[0m \u001b[31m21.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m49.3/49.3 kB\u001b[0m \u001b[31m8.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25h" + ] + } + ], + "source": [ + "!pip install langchain openai lancedb ragas -q" + ] + }, 
+ { + "cell_type": "markdown", + "source": [ + "### Setup `OPENAI_API_KEY` as an environment variable" + ], + "metadata": { + "id": "z8hT0Jn74ZmT" + } + }, + { + "cell_type": "code", + "source": [ + "import os\n", + "\n", + "os.environ[\"OPENAI_API_KEY\"] = \"sk-proj-...\"" + ], + "metadata": { + "id": "YHgQd_1rI04R" + }, + "execution_count": null, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "### Load .txt file and convert them into chunks" + ], + "metadata": { + "id": "mM0tf_vo6GbI" + } + }, + { + "cell_type": "code", + "source": [ + "import requests\n", + "from langchain.document_loaders import TextLoader\n", + "from langchain.text_splitter import CharacterTextSplitter\n", + "\n", + "url = \"https://raw.githubusercontent.com/hwchase17/chroma-langchain/master/state_of_the_union.txt\"\n", + "res = requests.get(url)\n", + "with open(\"state_of_the_union.txt\", \"w\") as f:\n", + " f.write(res.text)\n", + "\n", + "# Load the data\n", + "loader = TextLoader(\"./state_of_the_union.txt\")\n", + "documents = loader.load()\n", + "\n", + "# Chunk the data\n", + "text_splitter = CharacterTextSplitter(chunk_size=200, chunk_overlap=10)\n", + "chunks = text_splitter.split_documents(documents)" + ], + "metadata": { + "id": "IkLbg-_1I3Rt", + "colab": { + "base_uri": "https://localhost:8080/" + }, + "outputId": "4248c952-c719-4a30-ee7e-06d2f1b17449" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "stream", + "name": "stderr", + "text": [ + "WARNING:langchain_text_splitters.base:Created a chunk of size 215, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 232, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 242, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 219, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 304, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 205, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 332, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 215, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 203, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 281, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 201, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 250, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 325, which is longer than the specified 200\n", + "WARNING:langchain_text_splitters.base:Created a chunk of size 242, which is longer than the specified 200\n" + ] + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "### Setup Retriever\n", + "\n", + "Retriever utilizes **LanceDB** for scalable vector search and advanced retrieval in RAG, delivering blazing fast performance for searching large sets of embeddings." 
+ ], + "metadata": { + "id": "pgetSLZXEJ2Q" + } + }, + { + "cell_type": "code", + "source": [ + "from langchain.embeddings import OpenAIEmbeddings\n", + "from langchain.vectorstores import LanceDB\n", + "import lancedb\n", + "\n", + "openai_embed = OpenAIEmbeddings()\n", + "\n", + "# Setup lancedb\n", + "db = lancedb.connect(\"/tmp/lancedb\")\n", + "table = db.create_table(\n", + " \"raga_eval\",\n", + " data=[{\"vector\": openai_embed.embed_query(\"Hello World\"), \"text\": \"Hello World\"}],\n", + " mode=\"overwrite\",\n", + ")\n", + "\n", + "# Populate vector database\n", + "vectorstore = LanceDB.from_documents(\n", + " client=table, documents=chunks, embedding=openai_embed, by_text=False\n", + ")\n", + "\n", + "# Define vectorstore as retriever to enable semantic search\n", + "retriever = vectorstore.as_retriever()" + ], + "metadata": { + "id": "2PYhU_vvJC0P" + }, + "execution_count": null, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "### Setup RAG Pipeline with Prompt template" + ], + "metadata": { + "id": "9CFnNEfuExj7" + } + }, + { + "cell_type": "code", + "source": [ + "from langchain.chat_models import ChatOpenAI\n", + "from langchain.prompts import ChatPromptTemplate\n", + "from langchain.schema.runnable import RunnablePassthrough\n", + "from langchain.schema.output_parser import StrOutputParser\n", + "\n", + "# Define LLM\n", + "llm = ChatOpenAI(model_name=\"gpt-4o\", temperature=0)\n", + "\n", + "# Define Prompt template\n", + "template = \"\"\"You are an assistant for question-answering tasks.\n", + "Use the following pieces of retrieved context to answer the question.\n", + "If you don't know the answer, just say that you don't know.\n", + "Use two sentences maximum and keep the answer concise.\n", + "Question: {question}\n", + "Context: {context}\n", + "Answer:\n", + "\"\"\"\n", + "\n", + "prompt = ChatPromptTemplate.from_template(template)\n", + "\n", + "# Setup RAG pipeline\n", + "rag_chain = (\n", + " {\"context\": retriever, \"question\": RunnablePassthrough()}\n", + " | prompt\n", + " | llm\n", + " | StrOutputParser()\n", + ")" + ], + "metadata": { + "id": "-TiQhbNyLSKv" + }, + "execution_count": null, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "### Sample Questions with their Expected Answers\n", + "\n", + "Define a set of questions with their answers for creating dataset including ground truth, generated answers with their context using which they are generated." 
+ ], + "metadata": { + "id": "Ge8JtkNXFXhI" + } + }, + { + "cell_type": "code", + "source": [ + "from datasets import Dataset\n", + "\n", + "questions = [\n", + " \"What did the president say about Justice Breyer?\",\n", + " \"What did the president say about Intel's CEO?\",\n", + " \"What did the president say about gun violence?\",\n", + "]\n", + "ground_truth = [\n", + " \"The president said that Justice Breyer has dedicated his life to serve the country and thanked him for his service.\",\n", + " \"The president said that Pat Gelsinger is ready to increase Intel's investment to $100 billion.\",\n", + " \"The president asked Congress to pass proven measures to reduce gun violence.\",\n", + "]\n", + "answers = []\n", + "contexts = []\n", + "\n", + "# Inference\n", + "for query in questions:\n", + " answers.append(rag_chain.invoke(query))\n", + " contexts.append(\n", + " [docs.page_content for docs in retriever.get_relevant_documents(query)]\n", + " )\n", + "\n", + "# To dict\n", + "data = {\n", + " \"question\": questions,\n", + " \"answer\": answers,\n", + " \"contexts\": contexts,\n", + " \"ground_truth\": ground_truth,\n", + "}\n", + "\n", + "# Convert dict to dataset\n", + "dataset = Dataset.from_dict(data)" + ], + "metadata": { + "id": "PGiU57QJMP0J" + }, + "execution_count": null, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "### RAGA Evaluation Pipeline\n", + "\n", + "Simple pipeline of RAGA for evaluation with the listed metrics to understand and evaluate the RAG system.\n", + "\n", + "**Metrics** on which we will evaulate are answer_correctness,\n", + "faithfulness,\n", + "answer_similarity,\n", + "context_precision,\n", + "context_utilization,\n", + "context_recall,\n", + "context_relevancy,\n", + "answer_relevancy, and\n", + "context_entity_recall" + ], + "metadata": { + "id": "szBZ1nwkFruF" + } + }, + { + "cell_type": "code", + "source": [ + "from ragas import evaluate\n", + "from ragas.metrics import (\n", + " answer_correctness,\n", + " faithfulness,\n", + " answer_similarity,\n", + " context_precision,\n", + " context_utilization,\n", + " context_recall,\n", + " context_relevancy,\n", + " answer_relevancy,\n", + " context_entity_recall,\n", + ")\n", + "\n", + "\n", + "# evaluating dataest on listed metrics\n", + "result = evaluate(\n", + " dataset=dataset,\n", + " metrics=[\n", + " answer_correctness,\n", + " faithfulness,\n", + " answer_similarity,\n", + " context_precision,\n", + " context_utilization,\n", + " context_recall,\n", + " context_relevancy,\n", + " answer_relevancy,\n", + " context_entity_recall,\n", + " ],\n", + ")\n", + "\n", + "\n", + "df = result.to_pandas()" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 49, + "referenced_widgets": [ + "91f4187ef74b4c0791fa9058899f7454", + "d9fb5f1092e24ba59cb768842ff8f828", + "6441a4ce0f644c51aa6c140de43ed31d", + "a6b459a38e5c4386b85ee7ebc0e302a4", + "cf96076499974020b541a541648028f4", + "82cd48cdf6f144e19cb3ef0a0553b689", + "8e537aa094004828b08a55a94cbd7dff", + "55a45baafaad4a4ca2cad720da0a80aa", + "7f2a940f7f114e439f63e5e900748511", + "50282721d29447c89bc05559a99183dd", + "23c5a724d0fa40efac45f1fe8fc0b9c5" + ] + }, + "id": "Samkm2TnMUQA", + "outputId": "f50e72d4-55fd-4f74-f0f6-334af8353ae2" + }, + "execution_count": null, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "Evaluating: 0%| | 0/27 [00:00\n", + "
\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + "
questionanswercontextsground_truthanswer_correctnessfaithfulnessanswer_similaritycontext_precisioncontext_utilizationcontext_recallcontext_relevancyanswer_relevancycontext_entity_recall
0What did the president say about Justice Breyer?The president honored Justice Stephen Breyer a...[And I did that 4 days ago, when I nominated C...The president said that Justice Breyer has ded...0.4154871.00.9119481.01.01.00.2000000.8415890.500000
1What did the president say about Intel's CEO?The president said that Intel’s CEO, Pat Gelsi...[Intel’s CEO, Pat Gelsinger, who is here tonig...The president said that Pat Gelsinger is ready...0.6199980.00.9801031.01.01.00.0909090.8970840.750000
2What did the president say about gun violence?The president called on Congress to pass prove...[And I ask Congress to pass proven measures to...The president asked Congress to pass proven me...0.6062301.00.9248951.01.01.00.2500000.9148880.666667
\n", + "
\n", + "
\n", + "\n", + "
\n", + " \n", + "\n", + " \n", + "\n", + " \n", + "
\n", + "\n", + "\n", + "
\n", + " \n", + "\n", + "\n", + "\n", + " \n", + "
\n", + "\n", + "
\n", + " \n", + " \n", + " \n", + "
\n", + "\n", + "
\n", + " \n" + ], + "application/vnd.google.colaboratory.intrinsic+json": { + "type": "dataframe", + "variable_name": "df", + "summary": "{\n \"name\": \"df\",\n \"rows\": 3,\n \"fields\": [\n {\n \"column\": \"question\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 3,\n \"samples\": [\n \"What did the president say about Justice Breyer?\",\n \"What did the president say about Intel's CEO?\",\n \"What did the president say about gun violence?\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"answer\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 3,\n \"samples\": [\n \"The president honored Justice Stephen Breyer as an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court, and thanked him for his service. He also mentioned that Circuit Court of Appeals Judge Ketanji Brown Jackson, who he nominated, will continue Justice Breyer\\u2019s legacy of excellence.\",\n \"The president said that Intel\\u2019s CEO, Pat Gelsinger, told him they are ready to increase their investment from $20 billion to $100 billion.\",\n \"The president called on Congress to pass proven measures to reduce gun violence, including universal background checks and banning assault weapons and high-capacity magazines. He also questioned why individuals on a terrorist list should be able to purchase a weapon and advocated for repealing the liability shield that protects gun manufacturers from being sued.\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"contexts\",\n \"properties\": {\n \"dtype\": \"object\",\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"ground_truth\",\n \"properties\": {\n \"dtype\": \"string\",\n \"num_unique_values\": 3,\n \"samples\": [\n \"The president said that Justice Breyer has dedicated his life to serve the country and thanked him for his service.\",\n \"The president said that Pat Gelsinger is ready to increase Intel's investment to $100 billion.\",\n \"The president asked Congress to pass proven measures to reduce gun violence.\"\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"answer_correctness\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.11430741128276067,\n \"min\": 0.4154869951100285,\n \"max\": 0.6199979663207625,\n \"num_unique_values\": 3,\n \"samples\": [\n 0.4154869951100285,\n 0.6199979663207625,\n 0.6062297668831289\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"faithfulness\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.5773502691896258,\n \"min\": 0.0,\n \"max\": 1.0,\n \"num_unique_values\": 2,\n \"samples\": [\n 0.0,\n 1.0\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"answer_similarity\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.03619563595630723,\n \"min\": 0.911947980440114,\n \"max\": 0.9801032234987175,\n \"num_unique_values\": 3,\n \"samples\": [\n 0.911947980440114,\n 0.9801032234987175\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"context_precision\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 2.8867515847990063e-11,\n \"min\": 0.9999999999,\n \"max\": 0.99999999995,\n \"num_unique_values\": 2,\n \"samples\": [\n 0.9999999999,\n 0.99999999995\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"context_utilization\",\n \"properties\": 
{\n \"dtype\": \"number\",\n \"std\": 2.8867515847990063e-11,\n \"min\": 0.9999999999,\n \"max\": 0.99999999995,\n \"num_unique_values\": 2,\n \"samples\": [\n 0.9999999999,\n 0.99999999995\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"context_recall\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.0,\n \"min\": 1.0,\n \"max\": 1.0,\n \"num_unique_values\": 1,\n \"samples\": [\n 1.0\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"context_relevancy\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.08135390156762909,\n \"min\": 0.09090909090909091,\n \"max\": 0.25,\n \"num_unique_values\": 3,\n \"samples\": [\n 0.2\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"answer_relevancy\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.038230490911004,\n \"min\": 0.8415890798965432,\n \"max\": 0.9148879952296768,\n \"num_unique_values\": 3,\n \"samples\": [\n 0.8415890798965432\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n },\n {\n \"column\": \"context_entity_recall\",\n \"properties\": {\n \"dtype\": \"number\",\n \"std\": 0.12729376960740932,\n \"min\": 0.4999999975,\n \"max\": 0.7499999981250001,\n \"num_unique_values\": 3,\n \"samples\": [\n 0.4999999975\n ],\n \"semantic_type\": \"\",\n \"description\": \"\"\n }\n }\n ]\n}" + } + }, + "metadata": {}, + "execution_count": 22 + } + ] + } + ] +} \ No newline at end of file diff --git a/examples/Evaluating_RAG_with_RAGAs/README.md b/examples/Evaluating_RAG_with_RAGAs/README.md new file mode 100644 index 00000000..98c70693 --- /dev/null +++ b/examples/Evaluating_RAG_with_RAGAs/README.md @@ -0,0 +1,13 @@ +# Evaluating RAG with RAGAs and GPT-4o +Open In Colab + + +Ragas is a **framework for evaluating Retrieval Augmented Generation (RAG) pipelines**. + +Ragas provides you with the tools/metrics based on the latest research for evaluating LLM-generated text to give you insights about your RAG pipeline. Ragas can be integrated with your CI/CD to provide continuous checks to ensure performance. + +GPT4-o is used as an LLM to generate responses out of semantically close context chunks. + +![flow](../../assets/rag_evaluation_flow.png) + +Try it out on Colab - Open In Colab \ No newline at end of file diff --git a/examples/LlamaIndex-demo/lancedb_cloud/README.md b/examples/LlamaIndex-demo/lancedb_cloud/README.md new file mode 100644 index 00000000..636ab133 --- /dev/null +++ b/examples/LlamaIndex-demo/lancedb_cloud/README.md @@ -0,0 +1,29 @@ +# LlamaIndex and LanceDB Cloud Demo + +In this demo, we are going to show how to use LanceDB Cloud to perform vector searches in LlamaIndex + + +### Set credentials +if you would like to set api key through an environment variable: +``` +export LANCEDB_API_KEY="sk_..." +``` +or +``` +import os +import getpass + +os.environ["LANCEDB_API_KEY"] = getpass.getpass("Enter Your LANCEDB API Key:") +``` + +replace the following lines in main.py with your project slug and api key" +``` +db_url="db://your-project-slug-name" +api_key="sk_..." +region="us-east-1" +``` + +### Run the script +```python +OPENAI_API_KEY=... 
python main.py +``` \ No newline at end of file diff --git a/examples/LlamaIndex-demo/lancedb_cloud/main.ipynb b/examples/LlamaIndex-demo/lancedb_cloud/main.ipynb new file mode 100644 index 00000000..6a36bd18 --- /dev/null +++ b/examples/LlamaIndex-demo/lancedb_cloud/main.ipynb @@ -0,0 +1,391 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "id": "13cb272e", + "metadata": {}, + "source": [ + "# Vector search with LanceDB Cloud and LlamaIndex \n" + ] + }, + { + "cell_type": "markdown", + "id": "9a0e829a", + "metadata": { + "id": "wgPbKbpumkhH" + }, + "source": [ + "### Credentials\n", + "\n", + "Copy and paste the project name and the api key from your project page.\n", + "These will be used later to [connect to LanceDB Cloud](#scroll-to=5q8m6GMD7sGu)" + ] + }, + { + "cell_type": "code", + "execution_count": 1, + "id": "6553603f", + "metadata": { + "id": "rqEXT5-fmofw" + }, + "outputs": [], + "source": [ + "project_slug = \"your-project-slug\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "id": "36ef9c45", + "metadata": { + "id": "5LYmBomPmswi" + }, + "outputs": [], + "source": [ + "api_key = \"sk_...\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "markdown", + "id": "33ba6af1", + "metadata": { + "id": "Xs6tr6CMnBrr" + }, + "source": [ + "You can also set the LANCEDB_API_KEY as an environment variable. More details can be found **here**." + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "Le27BWs2vDbB" + }, + "source": [ + "Since we will be using OPENAI API, let us set the OPENAI API KEY as well." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "-2-fyVPKu9fl" + }, + "outputs": [], + "source": [ + "openai_api_key = \"sk-...\" # @param {type:\"string\"}" + ] + }, + { + "cell_type": "markdown", + "id": "1991331f-4316-417a-b693-e2f27cbe9ea7", + "metadata": {}, + "source": [ + "### Installing dependencies" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "e8a49c31", + "metadata": {}, + "outputs": [], + "source": [ + "! pip install llama-index-vector-stores-lancedb llama-index-readers-file llama-index-embeddings-openai llama-index-llms-openai" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "0QQL4lm8lTzg" + }, + "source": [ + "### Importing libraries" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "vP6d6JUShgqo" + }, + "outputs": [], + "source": [ + "import openai\n", + "import logging\n", + "import sys\n", + "\n", + "# Uncomment to see debug logs\n", + "# logging.basicConfig(stream=sys.stdout, level=logging.DEBUG)\n", + "# logging.getLogger().addHandler(logging.StreamHandler(stream=sys.stdout))\n", + "\n", + "from llama_index.core import SimpleDirectoryReader, Document, StorageContext\n", + "from llama_index.core import VectorStoreIndex\n", + "from llama_index.vector_stores.lancedb import LanceDBVectorStore\n", + "import textwrap\n", + "\n", + "openai.api_key = openai_api_key\n", + "assert openai.models.list() is not None" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "8eKRYd2F7v5n" + }, + "source": [ + "### Download the data\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "l0ezDr7suAf_" + }, + "outputs": [], + "source": [ + "! mkdir -p 'data/paul_graham/'\n", + "! wget 'https://raw.githubusercontent.com/run-llama/llama_index/main/docs/docs/examples/data/paul_graham/paul_graham_essay.txt' -O 'data/paul_graham/paul_graham_essay.txt'\n", + "! 
ls 'data/paul_graham/'" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "HJf8xZmX8VJC" + }, + "source": [ + "Load the documents stored in the data/paul_graham/ using the SimpleDirectoryReader:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "5aljyqpUiViE" + }, + "outputs": [], + "source": [ + "documents = SimpleDirectoryReader(\"data/paul_graham/\").load_data()\n", + "print(\"Document ID:\", documents[0].doc_id, \"Document Hash:\", documents[0].hash)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "IiM4DJvC_2dV" + }, + "source": [ + "### Store data in LanceDB Cloud\n", + "\n", + "Let's connect to LanceDB so we can store our documents, It requires 0 setup !" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "GV77SSi-AK0v" + }, + "outputs": [], + "source": [ + "uri = \"db://\" + project_slug\n", + "table_name = \"llamaindex_vectorstore\" #optional, default table name is \"vectors\" \n", + "\n", + "vector_store = LanceDBVectorStore( \n", + " uri=uri, # your remote DB URI\n", + " api_key=\"sk_..\", # lancedb cloud api key\n", + " region=\"your-region\" # the region you configured\n", + " ...\n", + ")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "sZOUxfqzXr1m" + }, + "source": [ + "### Create an index" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "4nDltKClAhhU" + }, + "outputs": [], + "source": [ + "storage_context = StorageContext.from_defaults(vector_store=vector_store)\n", + "\n", + "index = VectorStoreIndex.from_documents(documents, storage_context=storage_context)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "xoS-WKXMXvvR" + }, + "source": [ + "And thats it! We're all setup. The next step is to run some queries, let's try a few:" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "7SKSlyq2iwpK" + }, + "source": [ + "### Query the index\n", + "We can now ask questions using the created index. Filtering can be enabled via `MetadataFilters` or use native lance `where` clause." + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "5eb6419b", + "metadata": {}, + "outputs": [], + "source": [ + "from datetime import datetime\n", + "from llama_index.core.vector_stores import (\n", + " MetadataFilters,\n", + " FilterOperator,\n", + " FilterCondition,\n", + " MetadataFilter,\n", + ")\n", + "\n", + "date = datetime.today().strftime(\"%Y-%m-%d\")\n", + "query_filters = MetadataFilters(\n", + " filters=[\n", + " MetadataFilter(\n", + " key=\"creation_date\",\n", + " operator=FilterOperator.EQ,\n", + " value=date, # using current date as the latest data is scraped\n", + " ),\n", + " MetadataFilter(key=\"file_size\", value=75040, operator=FilterOperator.GT),\n", + " ],\n", + " condition=FilterCondition.AND,\n", + ")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [ + { + "data": { + "text/plain": [ + "Viaweb charged $100 a month for a small store and $300 a month for a big one.\n", + "metadata - ..." 
+ ] + }, + "execution_count": 15, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "query_engine = index.as_query_engine(\n", + " filters=query_filters,\n", + ")\n", + "\n", + "response = query_engine.query(\"How much did Viaweb charge per month?\")\n", + "print(response)\n", + "print(\"metadata -\", response.metadata)" + ] + }, + { + "cell_type": "markdown", + "id": "0c1c6c73", + "metadata": {}, + "source": [ + "Let's use LanceDB filters(SQL like) directly via the `where` clause :" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "0a2bcc07", + "metadata": {}, + "outputs": [], + "source": [ + "lance_filter = \"metadata.file_name = 'paul_graham_essay.txt' \"\n", + "retriever = index.as_retriever(vector_store_kwargs={\"where\": lance_filter})\n", + "response = retriever.retrieve(\"What did the author do growing up?\")\n", + "print(response[0].get_content())\n", + "print(\"metadata -\", response[0].metadata)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "sZOUxfqzXr1m" + }, + "source": [ + "### Append data to the index \n", + "You can also add data to an existing index" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "069fc099", + "metadata": {}, + "outputs": [], + "source": [ + "del index\n", + "\n", + "index = VectorStoreIndex.from_documents(\n", + " [Document(text=\"The sky is purple in Portland, Maine\")],\n", + " uri=\"/tmp/new_dataset\",\n", + ")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "id": "b5cffcfe", + "metadata": {}, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Portland, Maine\n" + ] + } + ], + "source": [ + "query_engine = index.as_query_engine()\n", + "response = query_engine.query(\"Where is the sky purple?\")\n", + "print(textwrap.fill(str(response), 100))" + ] + } + ], + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.12.1" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/examples/LlamaIndex-demo/lancedb_cloud/main.py b/examples/LlamaIndex-demo/lancedb_cloud/main.py new file mode 100644 index 00000000..ef4e168d --- /dev/null +++ b/examples/LlamaIndex-demo/lancedb_cloud/main.py @@ -0,0 +1,89 @@ +import os +import textwrap +from datetime import datetime + +import openai +import requests +from llama_index.core import ( + Document, + SimpleDirectoryReader, + StorageContext, + VectorStoreIndex, +) +from llama_index.vector_stores.lancedb import LanceDBVectorStore + +if __name__ == "__main__": + if "OPENAI_API_KEY" not in os.environ: + raise ValueError("OPENAI_API_KEY environment variable not set. 
Please set it") + else: + openai.api_key = os.environ["OPENAI_API_KEY"] + + # Download the document + data_path = r"data/paul_graham/" + if not os.path.exists(data_path): + os.makedirs(data_path) + url = "https://raw.githubusercontent.com/run-llama/llama_index/main/docs/docs/examples/data/paul_graham/paul_graham_essay.txt" + r = requests.get(url) + with open(data_path + "/paul_graham_essay.txt", "wb") as f: + f.write(r.content) + + # Load the document + documents = SimpleDirectoryReader(data_path).load_data() + print("Document ID:", documents[0].doc_id, "Document Hash:", documents[0].hash) + + # Create a LanceDBVectorStore and create an index + vector_store = LanceDBVectorStore( + uri="db://your-project-slug", # your remote DB URI + api_key="sk_...", # lancedb cloud api key + region="us-east-1", # the region you configured + ) + + storage_context = StorageContext.from_defaults(vector_store=vector_store) + + index = VectorStoreIndex.from_documents(documents, storage_context=storage_context) + + # Query via MetadataFilters + from llama_index.core.vector_stores import ( + FilterCondition, + FilterOperator, + MetadataFilter, + MetadataFilters, + ) + + date = datetime.today().strftime("%Y-%m-%d") + query_filters = MetadataFilters( + filters=[ + MetadataFilter(key="creation_date", operator=FilterOperator.EQ, value=date), + MetadataFilter(key="file_size", value=75040, operator=FilterOperator.GT), + ], + condition=FilterCondition.AND, + ) + + query_engine = index.as_query_engine( + filters=query_filters, + ) + + response = query_engine.query("How much did Viaweb charge per month?") + print("==== query via MetadataFilters") + print(response) + print("metadata -", response.metadata) + + # Query via LanceDB where clause + lance_filter = "metadata.file_name = 'paul_graham_essay.txt' " + retriever = index.as_retriever(vector_store_kwargs={"where": lance_filter}) + response = retriever.retrieve("What did the author do growing up?") + print("==== query via LanceDB where clause") + print(response[0].get_content()) + print("metadata -", response[0].metadata) + + # add data to an existing index and query with the new data + del index + + index = VectorStoreIndex.from_documents( + [Document(text="The sky is purple in Portland, Maine")], + uri="/tmp/new_dataset", + ) + query_engine = index.as_query_engine() + response = query_engine.query("Where is the sky purple?") + print("==== query with new data") + print(textwrap.fill(str(response), 100)) diff --git a/examples/LlamaIndex-demo/lancedb_cloud/requirements.txt b/examples/LlamaIndex-demo/lancedb_cloud/requirements.txt new file mode 100644 index 00000000..8272f875 --- /dev/null +++ b/examples/LlamaIndex-demo/lancedb_cloud/requirements.txt @@ -0,0 +1,5 @@ +llama-index-vector-stores-lancedb +llama-index-readers-file +llama-index-embeddings-openai +llama-index-llms-openai +lancedb \ No newline at end of file diff --git a/examples/QueryExpansion&Reranker/README.md b/examples/QueryExpansion&Reranker/README.md index b89b35d8..3daf63c6 100644 --- a/examples/QueryExpansion&Reranker/README.md +++ b/examples/QueryExpansion&Reranker/README.md @@ -19,4 +19,4 @@ Our focus is on improving the precision and recall of document retrieval process For a detailed exploration of the concepts and methodologies discussed in this project, visit our blog -[Read the Blog Post](https://blog.lancedb.com/improving-rag-with-query-expansion-reranking-models/) +[Read the Blog Post](https://aksdesai1998.medium.com/improving-rag-with-query-expansion-reranking-models-31d252856580) diff --git 
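The script above hard-codes the LanceDB Cloud credentials for brevity. In practice you may prefer to read them from the environment, the same way it already handles `OPENAI_API_KEY`; a sketch using hypothetical variable names:

```python
import os

from llama_index.vector_stores.lancedb import LanceDBVectorStore

# Hypothetical environment variables; export them before running the script.
vector_store = LanceDBVectorStore(
    uri=os.environ["LANCEDB_URI"],                       # e.g. "db://your-project-slug"
    api_key=os.environ["LANCEDB_API_KEY"],               # LanceDB Cloud API key
    region=os.environ.get("LANCEDB_REGION", "us-east-1"),
)
```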
a/examples/SuperAgent_Autogen/main.ipynb b/examples/SuperAgent_Autogen/main.ipynb index 9278654b..13047b03 100644 --- a/examples/SuperAgent_Autogen/main.ipynb +++ b/examples/SuperAgent_Autogen/main.ipynb @@ -11,55 +11,53 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 1, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "3poVgyh-bZJ-", - "outputId": "ad799a6e-7eec-4e14-dae3-f7e86c9e67cc" + "outputId": "7244e7cf-8eca-481d-fd05-d21a82460d47" }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m88.8/88.8 kB\u001b[0m \u001b[31m2.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m811.8/811.8 kB\u001b[0m \u001b[31m9.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m226.7/226.7 kB\u001b[0m \u001b[31m9.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.8/1.8 MB\u001b[0m \u001b[31m14.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m111.9/111.9 kB\u001b[0m \u001b[31m12.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m284.0/284.0 kB\u001b[0m \u001b[31m13.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m295.2/295.2 kB\u001b[0m \u001b[31m12.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m77.0/77.0 kB\u001b[0m \u001b[31m3.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.6/1.6 MB\u001b[0m \u001b[31m21.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m239.4/239.4 kB\u001b[0m \u001b[31m23.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m55.7/55.7 kB\u001b[0m \u001b[31m5.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m21.6/21.6 MB\u001b[0m \u001b[31m34.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m38.3/38.3 MB\u001b[0m \u001b[31m12.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m49.4/49.4 kB\u001b[0m \u001b[31m3.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m55.4/55.4 kB\u001b[0m \u001b[31m4.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m98.7/98.7 kB\u001b[0m \u001b[31m8.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", - "\u001b[?25h\u001b[31mERROR: pip's dependency resolver does not currently take into account all the packages that are installed. 
This behaviour is the source of the following dependency conflicts.\n", - "llmx 0.0.15a0 requires cohere, which is not installed.\n", - "ibis-framework 7.1.0 requires pyarrow<15,>=2, but you have pyarrow 15.0.0 which is incompatible.\u001b[0m\u001b[31m\n", - "\u001b[0m" + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m88.8/88.8 kB\u001b[0m \u001b[31m1.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m973.5/973.5 kB\u001b[0m \u001b[31m6.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.1/2.1 MB\u001b[0m \u001b[31m12.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m320.6/320.6 kB\u001b[0m \u001b[31m14.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.1/1.1 MB\u001b[0m \u001b[31m19.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m18.9/18.9 MB\u001b[0m \u001b[31m36.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m290.4/290.4 kB\u001b[0m \u001b[31m22.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m45.5/45.5 kB\u001b[0m \u001b[31m3.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m296.7/296.7 kB\u001b[0m \u001b[31m20.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m77.0/77.0 kB\u001b[0m \u001b[31m2.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m308.5/308.5 kB\u001b[0m \u001b[31m24.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m122.8/122.8 kB\u001b[0m \u001b[31m10.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m22.8/22.8 MB\u001b[0m \u001b[31m30.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m49.3/49.3 kB\u001b[0m \u001b[31m3.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m53.0/53.0 kB\u001b[0m \u001b[31m5.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m142.5/142.5 kB\u001b[0m \u001b[31m1.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m98.7/98.7 kB\u001b[0m \u001b[31m9.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25h" ] } ], "source": [ - "%pip install pyautogen~=0.1.0 langchain openai tiktoken lancedb pypdf -q -U" + "%pip install pyautogen~=0.1.0 langchain langchain_community openai tiktoken lancedb pypdf -q -U" ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 2, "metadata": { "id": "0tLTTT9ucFEb" }, "outputs": [], "source": [ - "from langchain.embeddings import 
OpenAIEmbeddings\n", + "from langchain_community.embeddings import OpenAIEmbeddings\n", "from langchain.text_splitter import RecursiveCharacterTextSplitter\n", "from langchain.document_loaders import PyPDFLoader\n", "from langchain.memory import ConversationBufferMemory\n", @@ -80,60 +78,88 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 3, "metadata": { - "id": "6RuVu12whCG0" + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "6RuVu12whCG0", + "outputId": "1300d37d-65ae-4e24-cbfb-d1cc701db28d" }, - "outputs": [], + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Requirement already satisfied: pyautogen in /usr/local/lib/python3.10/dist-packages (0.1.14)\n", + "Requirement already satisfied: diskcache in /usr/local/lib/python3.10/dist-packages (from pyautogen) (5.6.3)\n", + "Requirement already satisfied: flaml in /usr/local/lib/python3.10/dist-packages (from pyautogen) (2.1.2)\n", + "Requirement already satisfied: openai<1 in /usr/local/lib/python3.10/dist-packages (from pyautogen) (0.28.1)\n", + "Requirement already satisfied: python-dotenv in /usr/local/lib/python3.10/dist-packages (from pyautogen) (1.0.1)\n", + "Requirement already satisfied: termcolor in /usr/local/lib/python3.10/dist-packages (from pyautogen) (2.4.0)\n", + "Requirement already satisfied: requests>=2.20 in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (2.31.0)\n", + "Requirement already satisfied: tqdm in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (4.66.4)\n", + "Requirement already satisfied: aiohttp in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (3.9.5)\n", + "Requirement already satisfied: NumPy>=1.17 in /usr/local/lib/python3.10/dist-packages (from flaml->pyautogen) (1.25.2)\n", + "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (3.3.2)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (3.7)\n", + "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (2.0.7)\n", + "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (2024.2.2)\n", + "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (1.3.1)\n", + "Requirement already satisfied: attrs>=17.3.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (23.2.0)\n", + "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (1.4.1)\n", + "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (6.0.5)\n", + "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (1.9.4)\n", + "Requirement already satisfied: async-timeout<5.0,>=4.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->openai<1->pyautogen) (4.0.3)\n" + ] + } + ], "source": [ "!pip install pyautogen" ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 38, "metadata": { "id": "sUFdvTyVh8xF" }, "outputs": [], "source": [ "import lancedb\n", + "import os\n", + "\n", + "# setup OPENAI API 
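The notebook was updated to import embeddings from `langchain_community`. On more recent LangChain releases the OpenAI integrations have moved again, into the separate `langchain-openai` package, so if you hit deprecation warnings the equivalent imports look like this (a sketch, assuming `pip install langchain-openai`):

```python
# Equivalent imports on newer LangChain versions (requires `pip install langchain-openai`).
from langchain_openai import OpenAI, OpenAIEmbeddings

embeddings = OpenAIEmbeddings()   # picks up OPENAI_API_KEY from the environment
llm = OpenAI(temperature=0)       # drop-in replacement for the legacy langchain import
```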
KEY\n", + "os.environ[\"OPENAI_API_KEY\"] = \"sk-....\"\n", "\n", - "embeddings = OpenAIEmbeddings(openai_api_key=\"sk-yourapikey\")" + "embeddings = OpenAIEmbeddings()" ] }, - { - "cell_type": "markdown", - "metadata": { - "id": "kztlyFIXU8m-" - }, - "source": [] - }, { "cell_type": "code", - "execution_count": null, + "execution_count": 5, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "ODKg12trdhX-", - "outputId": "a7041322-f633-496c-a8e8-126a81cbb5d9" + "outputId": "4736d14d-454c-4458-98c4-d2d0d2edceb9" }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ - "--2024-02-11 04:40:16-- https://pdf.usaid.gov/pdf_docs/PA00TBCT.pdf\n", - "Resolving pdf.usaid.gov (pdf.usaid.gov)... 23.7.61.67, 2600:1408:ec00:380::1923, 2600:1408:ec00:38f::1923\n", - "Connecting to pdf.usaid.gov (pdf.usaid.gov)|23.7.61.67|:443... connected.\n", + "--2024-05-28 05:17:50-- https://pdf.usaid.gov/pdf_docs/PA00TBCT.pdf\n", + "Resolving pdf.usaid.gov (pdf.usaid.gov)... 23.4.180.157, 2600:1408:5400:197::1923, 2600:1408:5400:183::1923\n", + "Connecting to pdf.usaid.gov (pdf.usaid.gov)|23.4.180.157|:443... connected.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 6419525 (6.1M) [application/pdf]\n", "Saving to: ‘food.pdf’\n", "\n", - "food.pdf 100%[===================>] 6.12M --.-KB/s in 0.1s \n", + "food.pdf 100%[===================>] 6.12M 27.0MB/s in 0.2s \n", "\n", - "2024-02-11 04:40:16 (42.7 MB/s) - ‘food.pdf’ saved [6419525/6419525]\n", + "2024-05-28 05:17:51 (27.0 MB/s) - ‘food.pdf’ saved [6419525/6419525]\n", "\n" ] } @@ -143,14 +169,40 @@ "!wget -O food.pdf https://pdf.usaid.gov/pdf_docs/PA00TBCT.pdf" ] }, + { + "cell_type": "markdown", + "metadata": { + "id": "yV0pNPiRPy8h" + }, + "source": [ + "# create file name with OAI_CONFIG_LIT.json" + ] + }, + { + "cell_type": "code", + "execution_count": 53, + "metadata": { + "id": "yWDhjTDMcFBi" + }, + "outputs": [], + "source": [ + "# create file name with OAI_CONFIG_LIT.\n", + "import json\n", + "\n", + "config = [{\"model\": \"gpt-4\", \"api_key\": os.environ[\"OPENAI_API_KEY\"]}]\n", + "\n", + "with open(\"OAI_CONFIG_LIT.json\", \"w\") as fp:\n", + " json.dump(config, fp)" + ] + }, { "cell_type": "markdown", "metadata": { "id": "1oC3NAFyd4Kb" }, "source": [ - "create OAI_CONFIG_LIST.json file in pwd & upload\n", - "in it\n", + "**create OAI_CONFIG_LIST.json file in pwd & upload\n", + "in it**\n", "\n", "\n", "[\n", @@ -163,7 +215,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 41, "metadata": { "id": "H1bRXWu-cE_C" }, @@ -179,30 +231,9 @@ ")" ] }, - { - "cell_type": "markdown", - "metadata": { - "id": "yV0pNPiRPy8h" - }, - "source": [ - "# create file name with OAI_CONFIG_LIT.json & put below authentications code" - ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": { - "id": "yWDhjTDMcFBi" - }, - "outputs": [], - "source": [ - "# create file name with OAI_CONFIG_LIT.\n", - "[{\"model\": \"gpt-4\", \"api_key\": \"sk-yourapikey\"}]" - ] - }, { "cell_type": "code", - "execution_count": null, + "execution_count": 42, "metadata": { "id": "5gapqmsscFG-" }, @@ -218,70 +249,30 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 43, "metadata": { - "colab": { - "base_uri": "https://localhost:8080/" - }, - "id": "5dLkCqa0dLXV", - "outputId": "28ab5984-c875-4281-95e5-d48bfdd12e99" + "id": "5dLkCqa0dLXV" }, - "outputs": [ - { - "name": "stderr", - "output_type": "stream", - "text": [ - 
"/usr/local/lib/python3.10/dist-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The class `langchain_community.embeddings.openai.OpenAIEmbeddings` was deprecated in langchain-community 0.1.0 and will be removed in 0.2.0. An updated version of the class exists in the langchain-openai package and should be used instead. To use it run `pip install -U langchain-openai` and import as `from langchain_openai import OpenAIEmbeddings`.\n", - " warn_deprecated(\n" - ] - } - ], + "outputs": [], "source": [ "import lancedb\n", "\n", - "embeddings = OpenAIEmbeddings(openai_api_key=\"sk-yourapikey\")\n", - "\n", - "db = lancedb.connect(\"/tmp/lancedb\")\n", - "table = db.create_table(\n", - " \"my_table\",\n", - " data=[\n", - " {\n", - " \"vector\": embeddings.embed_query(\"Hello food\"),\n", - " \"text\": \"Hello food\",\n", - " \"id\": \"1\",\n", - " }\n", - " ],\n", - " mode=\"overwrite\",\n", - ")\n", + "embeddings = OpenAIEmbeddings()\n", "\n", - "vectorstore = LanceDB.from_documents(docs, embeddings, connection=table)" + "vectorstore = LanceDB.from_documents(documents=docs, embedding=embeddings)" ] }, { "cell_type": "code", - "execution_count": null, + "execution_count": 45, "metadata": { - "colab": { - "base_uri": "https://localhost:8080/" - }, - "id": "YMBoF5kucFMJ", - "outputId": "13d7edab-5f3d-4698-fe6f-40f33dcd865a" + "id": "YMBoF5kucFMJ" }, - "outputs": [ - { - "name": "stderr", - "output_type": "stream", - "text": [ - "/usr/local/lib/python3.10/dist-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The class `langchain_community.llms.openai.OpenAI` was deprecated in langchain-community 0.0.10 and will be removed in 0.2.0. An updated version of the class exists in the langchain-openai package and should be used instead. To use it run `pip install -U langchain-openai` and import as `from langchain_openai import OpenAI`.\n", - " warn_deprecated(\n" - ] - } - ], + "outputs": [], "source": [ "qa = ConversationalRetrievalChain.from_llm(\n", " OpenAI(\n", " temperature=0,\n", - " openai_api_key=\"sk-yourapikey\",\n", " ),\n", " vectorstore.as_retriever(),\n", " memory=ConversationBufferMemory(memory_key=\"chat_history\", return_messages=True),\n", @@ -290,7 +281,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 46, "metadata": { "id": "HjSVygLIcSEX" }, @@ -303,34 +294,26 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 47, "metadata": { "colab": { "base_uri": "https://localhost:8080/", - "height": 160 + "height": 90 }, "id": "XCqxSaQSepsW", - "outputId": "c1fc1bdc-9f2e-467b-cb51-fdde3fc964ae" + "outputId": "309a74fe-05c6-4d78-ad3e-e720b77d6e3c" }, "outputs": [ - { - "name": "stderr", - "output_type": "stream", - "text": [ - "/usr/local/lib/python3.10/dist-packages/langchain_core/_api/deprecation.py:117: LangChainDeprecationWarning: The function `__call__` was deprecated in LangChain 0.1.0 and will be removed in 0.2.0. Use invoke instead.\n", - " warn_deprecated(\n" - ] - }, { "data": { "application/vnd.google.colaboratory.intrinsic+json": { "type": "string" }, "text/plain": [ - "' Good food is food that provides the recommended amounts of nutrients for the body to perform all its physiological activities. It is important to eat the right food, at the right time, in the right amounts, and prepared correctly in order to maintain a balanced diet and promote good nutrition. 
Good food is essential for physical and cognitive development and can improve overall health and quality of life.'" + "' Good food is any type of food that provides the recommended amounts of nutrients for the body to perform its physiological activities. It should be eaten at the right time, in the right amounts, and prepared correctly. Good food is important for physical and cognitive development, and can help prevent health problems. Foods can also be classified according to their functions in the body, such as energy-giving foods, body-building foods, and protective foods.'" ] }, - "execution_count": 18, + "execution_count": 47, "metadata": {}, "output_type": "execute_result" } @@ -342,13 +325,13 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 18, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "BY4Fz-l7cUCA", - "outputId": "3a10b926-3659-46d3-d76b-7e083daf8fca" + "outputId": "e90c9b52-ab8e-4176-e249-fba0f20cae68" }, "outputs": [ { @@ -357,16 +340,16 @@ "text": [ "Requirement already satisfied: pyautogen in /usr/local/lib/python3.10/dist-packages (0.1.14)\n", "Requirement already satisfied: diskcache in /usr/local/lib/python3.10/dist-packages (from pyautogen) (5.6.3)\n", - "Requirement already satisfied: flaml in /usr/local/lib/python3.10/dist-packages (from pyautogen) (2.1.1)\n", + "Requirement already satisfied: flaml in /usr/local/lib/python3.10/dist-packages (from pyautogen) (2.1.2)\n", "Requirement already satisfied: openai<1 in /usr/local/lib/python3.10/dist-packages (from pyautogen) (0.28.1)\n", "Requirement already satisfied: python-dotenv in /usr/local/lib/python3.10/dist-packages (from pyautogen) (1.0.1)\n", "Requirement already satisfied: termcolor in /usr/local/lib/python3.10/dist-packages (from pyautogen) (2.4.0)\n", "Requirement already satisfied: requests>=2.20 in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (2.31.0)\n", - "Requirement already satisfied: tqdm in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (4.66.1)\n", - "Requirement already satisfied: aiohttp in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (3.9.3)\n", - "Requirement already satisfied: NumPy>=1.17.0rc1 in /usr/local/lib/python3.10/dist-packages (from flaml->pyautogen) (1.23.5)\n", + "Requirement already satisfied: tqdm in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (4.66.4)\n", + "Requirement already satisfied: aiohttp in /usr/local/lib/python3.10/dist-packages (from openai<1->pyautogen) (3.9.5)\n", + "Requirement already satisfied: NumPy>=1.17 in /usr/local/lib/python3.10/dist-packages (from flaml->pyautogen) (1.25.2)\n", "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (3.3.2)\n", - "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (3.6)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (3.7)\n", "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (2.0.7)\n", "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests>=2.20->openai<1->pyautogen) (2024.2.2)\n", "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.10/dist-packages (from 
aiohttp->openai<1->pyautogen) (1.3.1)\n", @@ -393,7 +376,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 48, "metadata": { "id": "Vca8Y_khcUID" }, @@ -425,7 +408,7 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 49, "metadata": { "id": "1XHjzIYAcfE7" }, @@ -451,13 +434,13 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 50, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "hOZKxakHchZ4", - "outputId": "7ca36fe0-a211-409a-abb0-57e3cd05e429" + "outputId": "20693c38-773a-47d2-bfc4-8311c7ef4d1d" }, "outputs": [ { @@ -475,7 +458,6 @@ "\n", "***** Suggested function Call: answer_food_question *****\n", "Arguments: \n", - "\n", "{\n", " \"question\": \"what is good food?\"\n", "}\n", @@ -487,13 +469,13 @@ "user_proxy (to assistant):\n", "\n", "***** Response from calling function \"answer_food_question\" *****\n", - " Good food is food that is able to provide the recommended amounts of nutrients for the body to perform all its physiological activities. It is important for our health and well-being because it helps us maintain a balanced diet, promotes physical and cognitive development, and protects us from foodborne illnesses. Good food also ensures that we have enough energy for physical activity and basic body functions, and it helps us maintain a healthy weight. Additionally, good food can improve our overall quality of life and productivity.\n", + " Good food is food that is able to provide the recommended amounts of nutrients for the body to perform all its physiological activities. It is important because it helps with physical and cognitive development, promotes good health, and improves the quality of life. Good food should be eaten at the right time, in the right amounts, and prepared correctly. It can also be classified into different categories based on its function in the body, such as energy-giving foods, body-building foods, and protective foods.\n", "*****************************************************************\n", "\n", "--------------------------------------------------------------------------------\n", "assistant (to user_proxy):\n", "\n", - "Good food is food that is able to provide the recommended amounts of nutrients for the body to perform all its physiological activities. It is important for our health and well-being because it helps us maintain a balanced diet, promotes physical and cognitive development, and protects us from foodborne illnesses. Good food also ensures that we have enough energy for physical activity and basic body functions, and it helps us maintain a healthy weight. Additionally, good food can improve our overall quality of life and productivity.\n", + "Good food is food that provides the recommended amounts of nutrients for the body to perform all its physiological activities. It is important because it helps with physical and cognitive development, promotes good health, and improves the quality of life. Good food should be eaten at the right time, in the right amounts, and prepared correctly. 
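The transcript above shows the assistant emitting an `answer_food_question` function call that the user proxy then executes. The registration cells are elided in this hunk, so here is a hedged sketch of the usual pyautogen 0.1.x pattern; the function name mirrors the transcript, while the schema and agent settings are assumptions:

```python
import autogen

def answer_food_question(question: str) -> str:
    # Route the question through the ConversationalRetrievalChain built earlier.
    return str(qa({"question": question})["answer"])

llm_config = {
    "config_list": config_list,
    "functions": [
        {
            "name": "answer_food_question",
            "description": "Answer questions about the food PDF using the retrieval chain.",
            "parameters": {
                "type": "object",
                "properties": {"question": {"type": "string"}},
                "required": ["question"],
            },
        }
    ],
}

assistant = autogen.AssistantAgent(name="assistant", llm_config=llm_config)
user_proxy = autogen.UserProxyAgent(
    name="user_proxy",
    human_input_mode="NEVER",
    function_map={"answer_food_question": answer_food_question},
)

user_proxy.initiate_chat(assistant, message="what is good food?")
```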
It can also be classified into different categories based on its function in the body, such as energy-giving foods, body-building foods, and protective foods.\n", "\n", "TERMINATE\n", "\n", @@ -518,13 +500,13 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 51, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "UDXo2V06fNjz", - "outputId": "37cf6766-9b68-4a81-e0c6-245d2af28a30" + "outputId": "057425b5-f0c1-4132-cdc6-fd03197e95c3" }, "outputs": [ { @@ -549,26 +531,26 @@ " - Deficiency Symptoms: Osteoporosis, rickets in children, muscle cramps, dental problems.\n", "\n", "2. Iron:\n", - " - Sources: Red meat, poultry, eggs, fruits, green vegetables, fortified bread.\n", - " - Functions: Essential for the production of red blood cells, helps in oxygen transport.\n", - " - Deficiency Symptoms: Anemia, fatigue, weakness, immune system problems.\n", + " - Sources: Red meat, poultry, fish, legumes, fortified cereals.\n", + " - Functions: Essential for the production of red blood cells, carries oxygen in the blood.\n", + " - Deficiency Symptoms: Anemia, fatigue, weakness, pale skin, shortness of breath.\n", "\n", - "3. Magnesium:\n", - " - Sources: Nuts, seeds, whole grains, green leafy vegetables, fish, beans, yogurt.\n", - " - Functions: Helps in over 300 enzyme reactions, including regulation of blood pressure, supports immune system.\n", - " - Deficiency Symptoms: Loss of appetite, nausea, fatigue, weakness, muscle cramps, numbness.\n", + "3. Potassium:\n", + " - Sources: Bananas, oranges, cantaloupes, raisins, nuts, fish, chicken, beef, and pork.\n", + " - Functions: Helps maintain fluid balance, nerve transmission, muscle contractions.\n", + " - Deficiency Symptoms: Weakness, fatigue, muscle cramps, constipation.\n", "\n", - "4. Potassium:\n", - " - Sources: Bananas, oranges, cantaloupe, honeydew, apricots, grapefruit, cooked spinach, cooked broccoli, potatoes, sweet potatoes, mushrooms, peas, cucumbers, zucchini, eggplant, pumpkins, leafy greens.\n", - " - Functions: Maintains fluid balance, helps in nerve transmission and muscle contraction.\n", - " - Deficiency Symptoms: Fatigue, weakness, constipation, muscle cramps.\n", + "4. Magnesium:\n", + " - Sources: Green leafy vegetables, nuts, seeds, whole grains, fish.\n", + " - Functions: Involved in over 300 enzymatic reactions in the body including energy production, protein synthesis, muscle and nerve function.\n", + " - Deficiency Symptoms: Loss of appetite, nausea, fatigue, weakness, muscle cramps, numbness and tingling.\n", "\n", "5. Zinc:\n", - " - Sources: Meat, shellfish, legumes, seeds, nuts, dairy, eggs, whole grains.\n", - " - Functions: Necessary for immune function, protein synthesis, DNA synthesis, cell division, wound healing.\n", - " - Deficiency Symptoms: Growth retardation, loss of appetite, impaired immune function, hair loss, diarrhea, delayed sexual maturation.\n", + " - Sources: Meat, shellfish, legumes, seeds, nuts, dairy, eggs.\n", + " - Functions: Supports immune function, protein synthesis, wound healing, DNA synthesis, and cell division.\n", + " - Deficiency Symptoms: Loss of appetite, impaired immune function, hair loss, diarrhea, delayed sexual maturation.\n", "\n", - "Please note that this is not an exhaustive list and there are other essential minerals as well. Also, the symptoms of deficiency can vary from person to person and can often be symptoms of other conditions as well. 
Always consult with a healthcare provider for accurate information.\n", + "Please note that this is not an exhaustive list and there are many other essential minerals that the body needs. It's also important to remember that while these minerals are essential for health, they should be consumed in moderation as too much can also lead to health problems. Always consult with a healthcare provider or a registered dietitian for personalized advice.\n", "\n", "--------------------------------------------------------------------------------\n", "user_proxy (to assistant):\n", @@ -592,13 +574,13 @@ }, { "cell_type": "code", - "execution_count": null, + "execution_count": 52, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "UrlFGYW0g0sJ", - "outputId": "e82396e5-3dca-402c-889c-394557aeea0d" + "outputId": "0fbf3ed8-0103-4904-9737-2a41e6e1cf0d" }, "outputs": [ { @@ -628,22 +610,13 @@ "user_proxy (to assistant):\n", "\n", "***** Response from calling function \"answer_food_question\" *****\n", - " Foods that are rich in Vitamin A, such as yellow/orange fruits and vegetables, dark green and deep yellow fruits and vegetables, liver, egg yolk, dairy products, and margarine can help maintain healthy eyes.\n", + " Fruits and vegetables, particularly dark green leafy vegetables and yellow fruits, are considered protective and can help keep eyes healthy.\n", "*****************************************************************\n", "\n", "--------------------------------------------------------------------------------\n", "assistant (to user_proxy):\n", "\n", - "Foods that are rich in Vitamin A can help maintain healthy eyes. These include:\n", - "\n", - "1. Yellow/orange fruits and vegetables: These include carrots, sweet potatoes, pumpkins, and apricots.\n", - "2. Dark green and deep yellow fruits and vegetables: These include spinach, kale, and other leafy greens.\n", - "3. Liver: This is a great source of Vitamin A.\n", - "4. Egg yolk: This is another good source of Vitamin A.\n", - "5. Dairy products: These include milk, cheese, and yogurt.\n", - "6. Margarine: This is also a good source of Vitamin A.\n", - "\n", - "Including these foods in your diet can help keep your eyes healthy.\n", + "Fruits and vegetables, particularly dark green leafy vegetables and yellow fruits, are considered protective and can help keep eyes healthy. These foods are rich in vitamins A, C, E, and minerals like Copper and Zinc which are essential for eye health. 
Foods like carrots, sweet potatoes, spinach, kale, and other dark green leafy vegetables; and fish like salmon and tuna are good for eye health.\n", "\n", "TERMINATE\n", "\n", @@ -665,15 +638,6 @@ "\"\"\",\n", ")" ] - }, - { - "cell_type": "code", - "execution_count": null, - "metadata": { - "id": "En6-kvjcjaid" - }, - "outputs": [], - "source": [] } ], "metadata": { diff --git a/tutorials/better-rag-FLAIR/README.md b/examples/better-rag-FLAIR/README.md similarity index 100% rename from tutorials/better-rag-FLAIR/README.md rename to examples/better-rag-FLAIR/README.md diff --git a/tutorials/better-rag-FLAIR/app.py b/examples/better-rag-FLAIR/app.py similarity index 100% rename from tutorials/better-rag-FLAIR/app.py rename to examples/better-rag-FLAIR/app.py diff --git a/tutorials/better-rag-FLAIR/main.ipynb b/examples/better-rag-FLAIR/main.ipynb similarity index 100% rename from tutorials/better-rag-FLAIR/main.ipynb rename to examples/better-rag-FLAIR/main.ipynb diff --git a/tutorials/better-rag-FLAIR/requirements.txt b/examples/better-rag-FLAIR/requirements.txt similarity index 100% rename from tutorials/better-rag-FLAIR/requirements.txt rename to examples/better-rag-FLAIR/requirements.txt diff --git a/examples/databricks_DBRX_website_bot/README.md b/examples/databricks_DBRX_website_bot/README.md index e99db29e..18d3883e 100644 --- a/examples/databricks_DBRX_website_bot/README.md +++ b/examples/databricks_DBRX_website_bot/README.md @@ -15,7 +15,7 @@ export DATABRICKS_TOKEN= DATABRICKS_SERVING_ENDPOINT= ``` -3. Run the application +3. Run the application in CLI mode ``` python main.py ``` @@ -25,3 +25,12 @@ Accepted arguments: - `embed_model`: Huggingface model to use for embeddings. Default is `mixedbread-ai/mxbai-embed-large-v1`. - `uri`: URI of the vector store. Default is `~/tmp/lancedb_hogwarts`. - `force_create_embeddings`: Whether to force create embeddings. Default is `False`. +- `illustrate`: Whether to illustrate the responses. Default is `True`. + +4. Run the application in GUI mode +``` +streamlit run gui.py +``` + +## MLX SDXL +The MLX SDXL implementation is taken from MLX [examples repo](https://github.com/ml-explore/mlx-examples/tree/main/stable_diffusion). The implementation is modified a bit to make it work faster with the current application. \ No newline at end of file diff --git a/examples/databricks_DBRX_website_bot/__init__.py b/examples/databricks_DBRX_website_bot/__init__.py new file mode 100644 index 00000000..e69de29b diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/__init__.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/__init__.py new file mode 100644 index 00000000..f266816f --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/__init__.py @@ -0,0 +1,306 @@ +# Copyright © 2023-2024 Apple Inc. 
+ +import time +from typing import Optional, Tuple + +import mlx.core as mx + +from .model_io import ( + _DEFAULT_MODEL, + load_autoencoder, + load_diffusion_config, + load_text_encoder, + load_tokenizer, + load_unet, +) +from .sampler import SimpleEulerAncestralSampler, SimpleEulerSampler + + +class StableDiffusion: + def __init__(self, model: str = _DEFAULT_MODEL, float16: bool = True): + self.dtype = mx.float16 if float16 else mx.float32 + self.diffusion_config = load_diffusion_config(model) + self.unet = load_unet(model, float16) + self.text_encoder = load_text_encoder(model, float16) + self.autoencoder = load_autoencoder(model, False) + self.sampler = SimpleEulerSampler(self.diffusion_config) + self.tokenizer = load_tokenizer(model) + + def ensure_models_are_loaded(self): + mx.eval(self.unet.parameters()) + mx.eval(self.text_encoder.parameters()) + mx.eval(self.autoencoder.parameters()) + + def _tokenize(self, tokenizer, text: str, negative_text: Optional[str] = None): + # Tokenize the text + tokens = [tokenizer.tokenize(text)] + if negative_text is not None: + tokens += [tokenizer.tokenize(negative_text)] + lengths = [len(t) for t in tokens] + N = max(lengths) + tokens = [t + [0] * (N - len(t)) for t in tokens] + tokens = mx.array(tokens) + + return tokens + + def _get_text_conditioning( + self, + text: str, + n_images: int = 1, + cfg_weight: float = 7.5, + negative_text: str = "", + ): + # Tokenize the text + tokens = self._tokenize( + self.tokenizer, text, (negative_text if cfg_weight > 1 else None) + ) + + # Compute the features + conditioning = self.text_encoder(tokens).last_hidden_state + + # Repeat the conditioning for each of the generated images + if n_images > 1: + conditioning = mx.repeat(conditioning, n_images, axis=0) + + return conditioning + + def _denoising_step( + self, x_t, t, t_prev, conditioning, cfg_weight: float = 7.5, text_time=None + ): + x_t_unet = mx.concatenate([x_t] * 2, axis=0) if cfg_weight > 1 else x_t + t_unet = mx.broadcast_to(t, [len(x_t_unet)]) + eps_pred = self.unet( + x_t_unet, t_unet, encoder_x=conditioning, text_time=text_time + ) + + if cfg_weight > 1: + eps_text, eps_neg = eps_pred.split(2) + eps_pred = eps_neg + cfg_weight * (eps_text - eps_neg) + + x_t_prev = self.sampler.step(eps_pred, x_t, t, t_prev) + + return x_t_prev + + def _denoising_loop( + self, + x_T, + T, + conditioning, + num_steps: int = 50, + cfg_weight: float = 7.5, + text_time=None, + ): + x_t = x_T + for t, t_prev in self.sampler.timesteps( + num_steps, start_time=T, dtype=self.dtype + ): + x_t = self._denoising_step( + x_t, t, t_prev, conditioning, cfg_weight, text_time + ) + yield x_t + + def generate_latents( + self, + text: str, + n_images: int = 1, + num_steps: int = 50, + cfg_weight: float = 7.5, + negative_text: str = "", + latent_size: Tuple[int] = (64, 64), + seed=None, + ): + # Set the PRNG state + seed = int(time.time()) if seed is None else seed + mx.random.seed(seed) + + # Get the text conditioning + conditioning = self._get_text_conditioning( + text, n_images, cfg_weight, negative_text + ) + + # Create the latent variables + x_T = self.sampler.sample_prior( + (n_images, *latent_size, self.autoencoder.latent_channels), dtype=self.dtype + ) + + # Perform the denoising loop + yield from self._denoising_loop( + x_T, self.sampler.max_time, conditioning, num_steps, cfg_weight + ) + + def generate_latents_from_image( + self, + image, + text: str, + n_images: int = 1, + strength: float = 0.8, + num_steps: int = 50, + cfg_weight: float = 7.5, + negative_text: str = "", + 
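`_denoising_step` implements classifier-free guidance: the UNet runs on a doubled batch (prompt plus negative prompt) and the two noise predictions are blended by extrapolating away from the unconditional one. The blend itself is a one-liner, illustrated here with toy values:

```python
import mlx.core as mx

# Classifier-free guidance blend used in _denoising_step (toy values for illustration).
cfg_weight = 7.5
eps_text = mx.array([1.0, 2.0])   # prediction conditioned on the text prompt
eps_neg = mx.array([0.5, 1.0])    # prediction for the negative / empty prompt
eps_pred = eps_neg + cfg_weight * (eps_text - eps_neg)
print(eps_pred)                    # pushed further in the text-conditioned direction
```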
seed=None, + ): + # Set the PRNG state + seed = int(time.time()) if seed is None else seed + mx.random.seed(seed) + + # Define the num steps and start step + start_step = self.sampler.max_time * strength + num_steps = int(num_steps * strength) + + # Get the text conditioning + conditioning = self._get_text_conditioning( + text, n_images, cfg_weight, negative_text + ) + + # Get the latents from the input image and add noise according to the + # start time. + x_0, _ = self.autoencoder.encode(image[None]) + x_0 = mx.broadcast_to(x_0, (n_images,) + x_0.shape[1:]) + x_T = self.sampler.add_noise(x_0, mx.array(start_step)) + + # Perform the denoising loop + yield from self._denoising_loop( + x_T, start_step, conditioning, num_steps, cfg_weight + ) + + def decode(self, x_t): + x = self.autoencoder.decode(x_t) + x = mx.clip(x / 2 + 0.5, 0, 1) + return x + + +class StableDiffusionXL(StableDiffusion): + def __init__(self, model: str = _DEFAULT_MODEL, float16: bool = False): + super().__init__(model, float16) + + self.sampler = SimpleEulerAncestralSampler(self.diffusion_config) + + self.text_encoder_1 = self.text_encoder + self.tokenizer_1 = self.tokenizer + del self.tokenizer, self.text_encoder + + self.text_encoder_2 = load_text_encoder( + model, + float16, + model_key="text_encoder_2", + ) + self.tokenizer_2 = load_tokenizer( + model, + merges_key="tokenizer_2_merges", + vocab_key="tokenizer_2_vocab", + ) + + def ensure_models_are_loaded(self): + mx.eval(self.unet.parameters()) + mx.eval(self.text_encoder_1.parameters()) + mx.eval(self.text_encoder_2.parameters()) + mx.eval(self.autoencoder.parameters()) + + def _get_text_conditioning( + self, + text: str, + n_images: int = 1, + cfg_weight: float = 7.5, + negative_text: str = "", + ): + tokens_1 = self._tokenize( + self.tokenizer_1, + text, + (negative_text if cfg_weight > 1 else None), + ) + tokens_2 = self._tokenize( + self.tokenizer_2, + text, + (negative_text if cfg_weight > 1 else None), + ) + + conditioning_1 = self.text_encoder_1(tokens_1) + conditioning_2 = self.text_encoder_2(tokens_2) + conditioning = mx.concatenate( + [conditioning_1.hidden_states[-2], conditioning_2.hidden_states[-2]], + axis=-1, + ) + pooled_conditioning = conditioning_2.pooled_output + + if n_images > 1: + conditioning = mx.repeat(conditioning, n_images, axis=0) + pooled_conditioning = mx.repeat(pooled_conditioning, n_images, axis=0) + + return conditioning, pooled_conditioning + + def generate_latents( + self, + text: str, + n_images: int = 1, + num_steps: int = 2, + cfg_weight: float = 0.0, + negative_text: str = "", + latent_size: Tuple[int] = (64, 64), + seed=None, + ): + # Set the PRNG state + seed = int(time.time()) if seed is None else seed + mx.random.seed(seed) + + # Get the text conditioning + conditioning, pooled_conditioning = self._get_text_conditioning( + text, n_images, cfg_weight, negative_text + ) + text_time = ( + pooled_conditioning, + mx.array([[512, 512, 0, 0, 512, 512.0]] * len(pooled_conditioning)), + ) + + # Create the latent variables + x_T = self.sampler.sample_prior( + (n_images, *latent_size, self.autoencoder.latent_channels), dtype=self.dtype + ) + + # Perform the denoising loop + yield from self._denoising_loop( + x_T, + self.sampler.max_time, + conditioning, + num_steps, + cfg_weight, + text_time=text_time, + ) + + def generate_latents_from_image( + self, + image, + text: str, + n_images: int = 1, + strength: float = 0.8, + num_steps: int = 2, + cfg_weight: float = 0.0, + negative_text: str = "", + seed=None, + ): + # Set the PRNG state 
+ seed = seed or int(time.time()) + mx.random.seed(seed) + + # Define the num steps and start step + start_step = self.sampler.max_time * strength + num_steps = int(num_steps * strength) + + # Get the text conditioning + conditioning, pooled_conditioning = self._get_text_conditioning( + text, n_images, cfg_weight, negative_text + ) + text_time = ( + pooled_conditioning, + mx.array([[512, 512, 0, 0, 512, 512.0]] * len(pooled_conditioning)), + ) + + # Get the latents from the input image and add noise according to the + # start time. + x_0, _ = self.autoencoder.encode(image[None]) + x_0 = mx.broadcast_to(x_0, (n_images,) + x_0.shape[1:]) + x_T = self.sampler.add_noise(x_0, mx.array(start_step)) + + # Perform the denoising loop + yield from self._denoising_loop( + x_T, start_step, conditioning, num_steps, cfg_weight, text_time=text_time + ) diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/clip.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/clip.py new file mode 100644 index 00000000..b5e11fde --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/clip.py @@ -0,0 +1,116 @@ +# Copyright © 2023-2024 Apple Inc. + +from dataclasses import dataclass +from typing import List, Optional + +import mlx.core as mx +import mlx.nn as nn + +from .config import CLIPTextModelConfig + +_ACTIVATIONS = {"quick_gelu": nn.gelu_fast_approx, "gelu": nn.gelu} + + +@dataclass +class CLIPOutput: + # The last_hidden_state indexed at the EOS token and possibly projected if + # the model has a projection layer + pooled_output: Optional[mx.array] = None + + # The full sequence output of the transformer after the final layernorm + last_hidden_state: Optional[mx.array] = None + + # A list of hidden states corresponding to the outputs of the transformer layers + hidden_states: Optional[List[mx.array]] = None + + +class CLIPEncoderLayer(nn.Module): + """The transformer encoder layer from CLIP.""" + + def __init__(self, model_dims: int, num_heads: int, activation: str): + super().__init__() + + self.layer_norm1 = nn.LayerNorm(model_dims) + self.layer_norm2 = nn.LayerNorm(model_dims) + + self.attention = nn.MultiHeadAttention(model_dims, num_heads) + # Add biases to the attention projections to match CLIP + self.attention.query_proj.bias = mx.zeros(model_dims) + self.attention.key_proj.bias = mx.zeros(model_dims) + self.attention.value_proj.bias = mx.zeros(model_dims) + self.attention.out_proj.bias = mx.zeros(model_dims) + + self.linear1 = nn.Linear(model_dims, 4 * model_dims) + self.linear2 = nn.Linear(4 * model_dims, model_dims) + + self.act = _ACTIVATIONS[activation] + + def __call__(self, x, attn_mask=None): + y = self.layer_norm1(x) + y = self.attention(y, y, y, attn_mask) + x = y + x + + y = self.layer_norm2(x) + y = self.linear1(y) + y = self.act(y) + y = self.linear2(y) + x = y + x + + return x + + +class CLIPTextModel(nn.Module): + """Implements the text encoder transformer from CLIP.""" + + def __init__(self, config: CLIPTextModelConfig): + super().__init__() + + self.token_embedding = nn.Embedding(config.vocab_size, config.model_dims) + self.position_embedding = nn.Embedding(config.max_length, config.model_dims) + self.layers = [ + CLIPEncoderLayer(config.model_dims, config.num_heads, config.hidden_act) + for i in range(config.num_layers) + ] + self.final_layer_norm = nn.LayerNorm(config.model_dims) + + if config.projection_dim is not None: + self.text_projection = nn.Linear( + config.model_dims, config.projection_dim, bias=False + ) + + def _get_mask(self, N, dtype): + indices 
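Putting `StableDiffusionXL` to work is a matter of iterating the latents generator and decoding the final latent. A minimal usage sketch, assuming you run it from `examples/databricks_DBRX_website_bot` so the package import resolves; the prompt and output filename are placeholders, and the two steps with `cfg_weight=0.0` mirror the sdxl-turbo defaults above:

```python
import mlx.core as mx
import numpy as np
from PIL import Image

from diffusion_mlx import StableDiffusionXL

sd = StableDiffusionXL("stabilityai/sdxl-turbo", float16=True)

latents = None
for x_t in sd.generate_latents(
    "a watercolor painting of a castle", n_images=1, num_steps=2, cfg_weight=0.0
):
    mx.eval(x_t)      # force evaluation of each denoising step
    latents = x_t

image = sd.decode(latents)   # (1, H, W, 3), values clipped to [0, 1]
mx.eval(image)
Image.fromarray((np.array(image[0]) * 255).astype("uint8")).save("out.png")
```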
= mx.arange(N) + mask = indices[:, None] < indices[None] + mask = mask.astype(dtype) * (-6e4 if dtype == mx.float16 else -1e9) + return mask + + def __call__(self, x): + # Extract some shapes + B, N = x.shape + eos_tokens = x.argmax(-1) + + # Compute the embeddings + x = self.token_embedding(x) + x = x + self.position_embedding.weight[:N] + + # Compute the features from the transformer + mask = self._get_mask(N, x.dtype) + hidden_states = [] + for l in self.layers: + x = l(x, mask) + hidden_states.append(x) + + # Apply the final layernorm and return + x = self.final_layer_norm(x) + last_hidden_state = x + + # Select the EOS token + pooled_output = x[mx.arange(len(x)), eos_tokens] + if "text_projection" in self: + pooled_output = self.text_projection(pooled_output) + + return CLIPOutput( + pooled_output=pooled_output, + last_hidden_state=last_hidden_state, + hidden_states=hidden_states, + ) diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/config.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/config.py new file mode 100644 index 00000000..6715757a --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/config.py @@ -0,0 +1,65 @@ +# Copyright © 2023-2024 Apple Inc. + +from dataclasses import dataclass +from typing import Optional, Tuple + + +@dataclass +class AutoencoderConfig: + in_channels: int = 3 + out_channels: int = 3 + latent_channels_out: int = 8 + latent_channels_in: int = 4 + block_out_channels: Tuple[int] = (128, 256, 512, 512) + layers_per_block: int = 2 + norm_num_groups: int = 32 + scaling_factor: float = 0.18215 + + +@dataclass +class CLIPTextModelConfig: + num_layers: int = 23 + model_dims: int = 1024 + num_heads: int = 16 + max_length: int = 77 + vocab_size: int = 49408 + projection_dim: Optional[int] = None + hidden_act: str = "quick_gelu" + + +@dataclass +class UNetConfig: + in_channels: int = 4 + out_channels: int = 4 + conv_in_kernel: int = 3 + conv_out_kernel: int = 3 + block_out_channels: Tuple[int] = (320, 640, 1280, 1280) + layers_per_block: Tuple[int] = (2, 2, 2, 2) + mid_block_layers: int = 2 + transformer_layers_per_block: Tuple[int] = (1, 1, 1, 1) + num_attention_heads: Tuple[int] = (5, 10, 20, 20) + cross_attention_dim: Tuple[int] = (1024,) * 4 + norm_num_groups: int = 32 + down_block_types: Tuple[str] = ( + "CrossAttnDownBlock2D", + "CrossAttnDownBlock2D", + "CrossAttnDownBlock2D", + "DownBlock2D", + ) + up_block_types: Tuple[str] = ( + "UpBlock2D", + "CrossAttnUpBlock2D", + "CrossAttnUpBlock2D", + "CrossAttnUpBlock2D", + ) + addition_embed_type: Optional[str] = None + addition_time_embed_dim: Optional[int] = None + projection_class_embeddings_input_dim: Optional[int] = None + + +@dataclass +class DiffusionConfig: + beta_schedule: str = "scaled_linear" + beta_start: float = 0.00085 + beta_end: float = 0.012 + num_train_steps: int = 1000 diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/model_io.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/model_io.py new file mode 100644 index 00000000..2c2227db --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/model_io.py @@ -0,0 +1,330 @@ +# Copyright © 2023-2024 Apple Inc. 
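The `_get_mask` helper in `CLIPTextModel` builds an additive causal mask: each position may attend only to itself and earlier tokens, with future positions pushed to a large negative value before the softmax. A tiny illustration:

```python
import mlx.core as mx

# Additive causal mask as built by CLIPTextModel._get_mask (float32 branch).
N = 4
indices = mx.arange(N)
mask = (indices[:, None] < indices[None]).astype(mx.float32) * -1e9
print(mask)
# Row i has -1e9 in every column j > i, so attention to future tokens is suppressed.
```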
+ +import json +from typing import Optional + +import mlx.core as mx +from huggingface_hub import hf_hub_download +from mlx.utils import tree_unflatten + +from .clip import CLIPTextModel +from .config import AutoencoderConfig, CLIPTextModelConfig, DiffusionConfig, UNetConfig +from .tokenizer import Tokenizer +from .unet import UNetModel +from .vae import Autoencoder + +_DEFAULT_MODEL = "stabilityai/stable-diffusion-2-1-base" +_MODELS = { + # See https://huggingface.co/stabilityai/sdxl-turbo for the model details and license + "stabilityai/sdxl-turbo": { + "unet_config": "unet/config.json", + "unet": "unet/diffusion_pytorch_model.safetensors", + "text_encoder_config": "text_encoder/config.json", + "text_encoder": "text_encoder/model.safetensors", + "text_encoder_2_config": "text_encoder_2/config.json", + "text_encoder_2": "text_encoder_2/model.safetensors", + "vae_config": "vae/config.json", + "vae": "vae/diffusion_pytorch_model.safetensors", + "diffusion_config": "scheduler/scheduler_config.json", + "tokenizer_vocab": "tokenizer/vocab.json", + "tokenizer_merges": "tokenizer/merges.txt", + "tokenizer_2_vocab": "tokenizer_2/vocab.json", + "tokenizer_2_merges": "tokenizer_2/merges.txt", + }, + # See https://huggingface.co/stabilityai/stable-diffusion-2-1-base for the model details and license + "stabilityai/stable-diffusion-2-1-base": { + "unet_config": "unet/config.json", + "unet": "unet/diffusion_pytorch_model.safetensors", + "text_encoder_config": "text_encoder/config.json", + "text_encoder": "text_encoder/model.safetensors", + "vae_config": "vae/config.json", + "vae": "vae/diffusion_pytorch_model.safetensors", + "diffusion_config": "scheduler/scheduler_config.json", + "tokenizer_vocab": "tokenizer/vocab.json", + "tokenizer_merges": "tokenizer/merges.txt", + }, +} + + +def map_unet_weights(key, value): + # Map up/downsampling + if "downsamplers" in key: + key = key.replace("downsamplers.0.conv", "downsample") + if "upsamplers" in key: + key = key.replace("upsamplers.0.conv", "upsample") + + # Map the mid block + if "mid_block.resnets.0" in key: + key = key.replace("mid_block.resnets.0", "mid_blocks.0") + if "mid_block.attentions.0" in key: + key = key.replace("mid_block.attentions.0", "mid_blocks.1") + if "mid_block.resnets.1" in key: + key = key.replace("mid_block.resnets.1", "mid_blocks.2") + + # Map attention layers + if "to_k" in key: + key = key.replace("to_k", "key_proj") + if "to_out.0" in key: + key = key.replace("to_out.0", "out_proj") + if "to_q" in key: + key = key.replace("to_q", "query_proj") + if "to_v" in key: + key = key.replace("to_v", "value_proj") + + # Map transformer ffn + if "ff.net.2" in key: + key = key.replace("ff.net.2", "linear3") + if "ff.net.0" in key: + k1 = key.replace("ff.net.0.proj", "linear1") + k2 = key.replace("ff.net.0.proj", "linear2") + v1, v2 = mx.split(value, 2) + + return [(k1, v1), (k2, v2)] + + if "conv_shortcut.weight" in key: + value = value.squeeze() + + # Transform the weights from 1x1 convs to linear + if len(value.shape) == 4 and ("proj_in" in key or "proj_out" in key): + value = value.squeeze() + + if len(value.shape) == 4: + value = value.transpose(0, 2, 3, 1) + value = value.reshape(-1).reshape(value.shape) + + return [(key, value)] + + +def map_clip_text_encoder_weights(key, value): + # Remove prefixes + if key.startswith("text_model."): + key = key[11:] + if key.startswith("embeddings."): + key = key[11:] + if key.startswith("encoder."): + key = key[8:] + + # Map attention layers + if "self_attn." 
in key: + key = key.replace("self_attn.", "attention.") + if "q_proj." in key: + key = key.replace("q_proj.", "query_proj.") + if "k_proj." in key: + key = key.replace("k_proj.", "key_proj.") + if "v_proj." in key: + key = key.replace("v_proj.", "value_proj.") + + # Map ffn layers + if "mlp.fc1" in key: + key = key.replace("mlp.fc1", "linear1") + if "mlp.fc2" in key: + key = key.replace("mlp.fc2", "linear2") + + return [(key, value)] + + +def map_vae_weights(key, value): + # Map up/downsampling + if "downsamplers" in key: + key = key.replace("downsamplers.0.conv", "downsample") + if "upsamplers" in key: + key = key.replace("upsamplers.0.conv", "upsample") + + # Map attention layers + if "to_k" in key: + key = key.replace("to_k", "key_proj") + if "to_out.0" in key: + key = key.replace("to_out.0", "out_proj") + if "to_q" in key: + key = key.replace("to_q", "query_proj") + if "to_v" in key: + key = key.replace("to_v", "value_proj") + + # Map the mid block + if "mid_block.resnets.0" in key: + key = key.replace("mid_block.resnets.0", "mid_blocks.0") + if "mid_block.attentions.0" in key: + key = key.replace("mid_block.attentions.0", "mid_blocks.1") + if "mid_block.resnets.1" in key: + key = key.replace("mid_block.resnets.1", "mid_blocks.2") + + # Map the quant/post_quant layers + if "quant_conv" in key: + key = key.replace("quant_conv", "quant_proj") + value = value.squeeze() + + # Map the conv_shortcut to linear + if "conv_shortcut.weight" in key: + value = value.squeeze() + + if len(value.shape) == 4: + value = value.transpose(0, 2, 3, 1) + value = value.reshape(-1).reshape(value.shape) + + return [(key, value)] + + +def _flatten(params): + return [(k, v) for p in params for (k, v) in p] + + +def _load_safetensor_weights(mapper, model, weight_file, float16: bool = False): + dtype = mx.float16 if float16 else mx.float32 + weights = mx.load(weight_file) + weights = _flatten([mapper(k, v.astype(dtype)) for k, v in weights.items()]) + model.update(tree_unflatten(weights)) + + +def _check_key(key: str, part: str): + if key not in _MODELS: + raise ValueError( + f"[{part}] '{key}' model not found, choose one of {{{','.join(_MODELS.keys())}}}" + ) + + +def load_unet(key: str = _DEFAULT_MODEL, float16: bool = False): + """Load the stable diffusion UNet from Hugging Face Hub.""" + _check_key(key, "load_unet") + + # Download the config and create the model + unet_config = _MODELS[key]["unet_config"] + with open(hf_hub_download(key, unet_config)) as f: + config = json.load(f) + + n_blocks = len(config["block_out_channels"]) + model = UNetModel( + UNetConfig( + in_channels=config["in_channels"], + out_channels=config["out_channels"], + block_out_channels=config["block_out_channels"], + layers_per_block=[config["layers_per_block"]] * n_blocks, + transformer_layers_per_block=config.get( + "transformer_layers_per_block", (1,) * 4 + ), + num_attention_heads=( + [config["attention_head_dim"]] * n_blocks + if isinstance(config["attention_head_dim"], int) + else config["attention_head_dim"] + ), + cross_attention_dim=[config["cross_attention_dim"]] * n_blocks, + norm_num_groups=config["norm_num_groups"], + down_block_types=config["down_block_types"], + up_block_types=config["up_block_types"][::-1], + addition_embed_type=config.get("addition_embed_type", None), + addition_time_embed_dim=config.get("addition_time_embed_dim", None), + projection_class_embeddings_input_dim=config.get( + "projection_class_embeddings_input_dim", None + ), + ) + ) + + # Download the weights and map them into the model + unet_weights = 
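One detail worth noting in the weight mappers: the Hugging Face checkpoints store conv kernels in PyTorch's channels-first layout `(out, in, kH, kW)`, while the MLX modules here expect channels-last `(out, kH, kW, in)`, which is why `map_unet_weights` and `map_vae_weights` end with a transpose and a contiguous copy. A tiny illustration with a dummy kernel:

```python
import mlx.core as mx

# PyTorch-style conv weight: (out_channels, in_channels, kH, kW)
w_torch = mx.random.normal((320, 4, 3, 3))

# MLX channels-last layout: (out_channels, kH, kW, in_channels)
w_mlx = w_torch.transpose(0, 2, 3, 1)
w_mlx = w_mlx.reshape(-1).reshape(w_mlx.shape)   # force a contiguous copy, as in the mappers
print(w_mlx.shape)                               # (320, 3, 3, 4)
```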
_MODELS[key]["unet"] + weight_file = hf_hub_download(key, unet_weights) + _load_safetensor_weights(map_unet_weights, model, weight_file, float16) + + return model + + +def load_text_encoder( + key: str = _DEFAULT_MODEL, + float16: bool = False, + model_key: str = "text_encoder", + config_key: Optional[str] = None, +): + """Load the stable diffusion text encoder from Hugging Face Hub.""" + _check_key(key, "load_text_encoder") + + config_key = config_key or (model_key + "_config") + + # Download the config and create the model + text_encoder_config = _MODELS[key][config_key] + with open(hf_hub_download(key, text_encoder_config)) as f: + config = json.load(f) + + with_projection = "WithProjection" in config["architectures"][0] + + model = CLIPTextModel( + CLIPTextModelConfig( + num_layers=config["num_hidden_layers"], + model_dims=config["hidden_size"], + num_heads=config["num_attention_heads"], + max_length=config["max_position_embeddings"], + vocab_size=config["vocab_size"], + projection_dim=config["projection_dim"] if with_projection else None, + hidden_act=config.get("hidden_act", "quick_gelu"), + ) + ) + + # Download the weights and map them into the model + text_encoder_weights = _MODELS[key][model_key] + weight_file = hf_hub_download(key, text_encoder_weights) + _load_safetensor_weights(map_clip_text_encoder_weights, model, weight_file, float16) + + return model + + +def load_autoencoder(key: str = _DEFAULT_MODEL, float16: bool = False): + """Load the stable diffusion autoencoder from Hugging Face Hub.""" + _check_key(key, "load_autoencoder") + + # Download the config and create the model + vae_config = _MODELS[key]["vae_config"] + with open(hf_hub_download(key, vae_config)) as f: + config = json.load(f) + + model = Autoencoder( + AutoencoderConfig( + in_channels=config["in_channels"], + out_channels=config["out_channels"], + latent_channels_out=2 * config["latent_channels"], + latent_channels_in=config["latent_channels"], + block_out_channels=config["block_out_channels"], + layers_per_block=config["layers_per_block"], + norm_num_groups=config["norm_num_groups"], + scaling_factor=config.get("scaling_factor", 0.18215), + ) + ) + + # Download the weights and map them into the model + vae_weights = _MODELS[key]["vae"] + weight_file = hf_hub_download(key, vae_weights) + _load_safetensor_weights(map_vae_weights, model, weight_file, float16) + + return model + + +def load_diffusion_config(key: str = _DEFAULT_MODEL): + """Load the stable diffusion config from Hugging Face Hub.""" + _check_key(key, "load_diffusion_config") + + diffusion_config = _MODELS[key]["diffusion_config"] + with open(hf_hub_download(key, diffusion_config)) as f: + config = json.load(f) + + return DiffusionConfig( + beta_start=config["beta_start"], + beta_end=config["beta_end"], + beta_schedule=config["beta_schedule"], + num_train_steps=config["num_train_timesteps"], + ) + + +def load_tokenizer( + key: str = _DEFAULT_MODEL, + vocab_key: str = "tokenizer_vocab", + merges_key: str = "tokenizer_merges", +): + _check_key(key, "load_tokenizer") + + vocab_file = hf_hub_download(key, _MODELS[key][vocab_key]) + with open(vocab_file, encoding="utf-8") as f: + vocab = json.load(f) + + merges_file = hf_hub_download(key, _MODELS[key][merges_key]) + with open(merges_file, encoding="utf-8") as f: + bpe_merges = f.read().strip().split("\n")[1 : 49152 - 256 - 2 + 1] + bpe_merges = [tuple(m.split()) for m in bpe_merges] + bpe_ranks = dict(map(reversed, enumerate(bpe_merges))) + + return Tokenizer(bpe_ranks, vocab) diff --git 
a/examples/databricks_DBRX_website_bot/diffusion_mlx/sampler.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/sampler.py new file mode 100644 index 00000000..ff4433d0 --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/sampler.py @@ -0,0 +1,105 @@ +# Copyright © 2023 Apple Inc. + +import mlx.core as mx + +from .config import DiffusionConfig + + +def _linspace(a, b, num): + x = mx.arange(0, num) / (num - 1) + return (b - a) * x + a + + +def _interp(y, x_new): + """Interpolate the function defined by (arange(0, len(y)), y) at positions x_new.""" + x_low = x_new.astype(mx.int32) + x_high = mx.minimum(x_low + 1, len(y) - 1) + + y_low = y[x_low] + y_high = y[x_high] + delta_x = x_new - x_low + y_new = y_low * (1 - delta_x) + delta_x * y_high + + return y_new + + +class SimpleEulerSampler: + """A simple Euler integrator that can be used to sample from our diffusion models. + + The method ``step()`` performs one Euler step from x_t to x_t_prev. + """ + + def __init__(self, config: DiffusionConfig): + # Compute the noise schedule + if config.beta_schedule == "linear": + betas = _linspace( + config.beta_start, config.beta_end, config.num_train_steps + ) + elif config.beta_schedule == "scaled_linear": + betas = _linspace( + config.beta_start**0.5, config.beta_end**0.5, config.num_train_steps + ).square() + else: + raise NotImplementedError(f"{config.beta_schedule} is not implemented.") + + alphas = 1 - betas + alphas_cumprod = mx.cumprod(alphas) + + self._sigmas = mx.concatenate( + [mx.zeros(1), ((1 - alphas_cumprod) / alphas_cumprod).sqrt()] + ) + + @property + def max_time(self): + return len(self._sigmas) - 1 + + def sample_prior(self, shape, dtype=mx.float32, key=None): + noise = mx.random.normal(shape, key=key) + return ( + noise * self._sigmas[-1] * (self._sigmas[-1].square() + 1).rsqrt() + ).astype(dtype) + + def add_noise(self, x, t, key=None): + noise = mx.random.normal(x.shape, key=key) + s = self.sigmas(t) + return (x + noise * s) * (s.square() + 1).rsqrt() + + def sigmas(self, t): + return _interp(self._sigmas, t) + + def timesteps(self, num_steps: int, start_time=None, dtype=mx.float32): + start_time = start_time or (len(self._sigmas) - 1) + assert 0 < start_time <= (len(self._sigmas) - 1) + steps = _linspace(start_time, 0, num_steps + 1).astype(dtype) + return list(zip(steps, steps[1:])) + + def step(self, eps_pred, x_t, t, t_prev): + sigma = self.sigmas(t).astype(eps_pred.dtype) + sigma_prev = self.sigmas(t_prev).astype(eps_pred.dtype) + + dt = sigma_prev - sigma + x_t_prev = (sigma.square() + 1).sqrt() * x_t + eps_pred * dt + + x_t_prev = x_t_prev * (sigma_prev.square() + 1).rsqrt() + + return x_t_prev + + +class SimpleEulerAncestralSampler(SimpleEulerSampler): + def step(self, eps_pred, x_t, t, t_prev): + sigma = self.sigmas(t).astype(eps_pred.dtype) + sigma_prev = self.sigmas(t_prev).astype(eps_pred.dtype) + + sigma2 = sigma.square() + sigma_prev2 = sigma_prev.square() + sigma_up = (sigma_prev2 * (sigma2 - sigma_prev2) / sigma2).sqrt() + sigma_down = (sigma_prev2 - sigma_up**2).sqrt() + + dt = sigma_down - sigma + x_t_prev = (sigma2 + 1).sqrt() * x_t + eps_pred * dt + noise = mx.random.normal(x_t_prev.shape).astype(x_t_prev.dtype) + x_t_prev = x_t_prev + noise * sigma_up + + x_t_prev = x_t_prev * (sigma_prev2 + 1).rsqrt() + + return x_t_prev diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/tokenizer.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/tokenizer.py new file mode 100644 index 00000000..ae9b967a --- /dev/null +++ 
b/examples/databricks_DBRX_website_bot/diffusion_mlx/tokenizer.py @@ -0,0 +1,100 @@ +# Copyright © 2023 Apple Inc. + +import regex + + +class Tokenizer: + """A simple port of CLIPTokenizer from https://github.com/huggingface/transformers/ .""" + + def __init__(self, bpe_ranks, vocab): + self.bpe_ranks = bpe_ranks + self.vocab = vocab + self.pat = regex.compile( + r"""<\|startoftext\|>|<\|endoftext\|>|'s|'t|'re|'ve|'m|'ll|'d|[\p{L}]+|[\p{N}]|[^\s\p{L}\p{N}]+""", + regex.IGNORECASE, + ) + + self._cache = {self.bos: self.bos, self.eos: self.eos} + + @property + def bos(self): + return "<|startoftext|>" + + @property + def bos_token(self): + return self.vocab[self.bos] + + @property + def eos(self): + return "<|endoftext|>" + + @property + def eos_token(self): + return self.vocab[self.eos] + + def bpe(self, text): + if text in self._cache: + return self._cache[text] + + unigrams = list(text[:-1]) + [text[-1] + ""] + unique_bigrams = set(zip(unigrams, unigrams[1:])) + + if not unique_bigrams: + return unigrams + + # In every iteration try to merge the two most likely bigrams. If none + # was merged we are done. + # + # Ported from https://github.com/huggingface/transformers/blob/main/src/transformers/models/clip/tokenization_clip.py + while unique_bigrams: + bigram = min( + unique_bigrams, key=lambda pair: self.bpe_ranks.get(pair, float("inf")) + ) + if bigram not in self.bpe_ranks: + break + + new_unigrams = [] + skip = False + for a, b in zip(unigrams, unigrams[1:]): + if skip: + skip = False + continue + + if (a, b) == bigram: + new_unigrams.append(a + b) + skip = True + + else: + new_unigrams.append(a) + + if not skip: + new_unigrams.append(b) + + unigrams = new_unigrams + unique_bigrams = set(zip(unigrams, unigrams[1:])) + + self._cache[text] = unigrams + + return unigrams + + def tokenize(self, text, prepend_bos=True, append_eos=True): + if isinstance(text, list): + return [self.tokenize(t, prepend_bos, append_eos) for t in text] + + # Lower case cleanup and split according to self.pat. Hugging Face does + # a much more thorough job here but this should suffice for 95% of + # cases. + clean_text = regex.sub(r"\s+", " ", text.lower()) + tokens = regex.findall(self.pat, clean_text) + + # Split the tokens according to the byte-pair merge file + bpe_tokens = [ti for t in tokens for ti in self.bpe(t)] + + # Map to token ids and return + tokens = [self.vocab[t] for t in bpe_tokens] + if prepend_bos: + tokens = [self.bos_token] + tokens + if append_eos: + tokens.append(self.eos_token) + + return tokens diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/unet.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/unet.py new file mode 100644 index 00000000..ec2915e5 --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/unet.py @@ -0,0 +1,461 @@ +# Copyright © 2023 Apple Inc. 
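+#
+# Illustrative sketch only (toy values, assumed shapes): the `upsample_nearest`
+# helper defined below doubles the spatial resolution of an NHWC tensor by
+# repeating each pixel in a 2x2 block via broadcast-and-reshape, e.g.:
+#
+#     x = mx.arange(4).reshape(1, 2, 2, 1)   # [[0, 1], [2, 3]] as a 2x2 image
+#     upsample_nearest(x).shape              # -> (1, 4, 4, 1)
+#     # output rows 0-1 are [0, 0, 1, 1]; rows 2-3 are [2, 2, 3, 3]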
+ +import math +from typing import Optional + +import mlx.core as mx +import mlx.nn as nn + +from .config import UNetConfig + + +def upsample_nearest(x, scale: int = 2): + B, H, W, C = x.shape + x = mx.broadcast_to(x[:, :, None, :, None, :], (B, H, scale, W, scale, C)) + x = x.reshape(B, H * scale, W * scale, C) + + return x + + +class TimestepEmbedding(nn.Module): + def __init__(self, in_channels: int, time_embed_dim: int): + super().__init__() + + self.linear_1 = nn.Linear(in_channels, time_embed_dim) + self.linear_2 = nn.Linear(time_embed_dim, time_embed_dim) + + def __call__(self, x): + x = self.linear_1(x) + x = nn.silu(x) + x = self.linear_2(x) + + return x + + +class TransformerBlock(nn.Module): + def __init__( + self, + model_dims: int, + num_heads: int, + hidden_dims: Optional[int] = None, + memory_dims: Optional[int] = None, + ): + super().__init__() + + self.norm1 = nn.LayerNorm(model_dims) + self.attn1 = nn.MultiHeadAttention(model_dims, num_heads) + self.attn1.out_proj.bias = mx.zeros(model_dims) + + memory_dims = memory_dims or model_dims + self.norm2 = nn.LayerNorm(model_dims) + self.attn2 = nn.MultiHeadAttention( + model_dims, num_heads, key_input_dims=memory_dims + ) + self.attn2.out_proj.bias = mx.zeros(model_dims) + + hidden_dims = hidden_dims or 4 * model_dims + self.norm3 = nn.LayerNorm(model_dims) + self.linear1 = nn.Linear(model_dims, hidden_dims) + self.linear2 = nn.Linear(model_dims, hidden_dims) + self.linear3 = nn.Linear(hidden_dims, model_dims) + + def __call__(self, x, memory, attn_mask, memory_mask): + # Self attention + y = self.norm1(x) + y = self.attn1(y, y, y, attn_mask) + x = x + y + + # Cross attention + y = self.norm2(x) + y = self.attn2(y, memory, memory, memory_mask) + x = x + y + + # FFN + y = self.norm3(x) + y_a = self.linear1(y) + y_b = self.linear2(y) + y = y_a * nn.gelu(y_b) + y = self.linear3(y) + x = x + y + + return x + + +class Transformer2D(nn.Module): + """A transformer model for inputs with 2 spatial dimensions.""" + + def __init__( + self, + in_channels: int, + model_dims: int, + encoder_dims: int, + num_heads: int, + num_layers: int = 1, + norm_num_groups: int = 32, + ): + super().__init__() + + self.norm = nn.GroupNorm(norm_num_groups, in_channels, pytorch_compatible=True) + self.proj_in = nn.Linear(in_channels, model_dims) + self.transformer_blocks = [ + TransformerBlock(model_dims, num_heads, memory_dims=encoder_dims) + for i in range(num_layers) + ] + self.proj_out = nn.Linear(model_dims, in_channels) + + def __call__(self, x, encoder_x, attn_mask, encoder_attn_mask): + # Save the input to add to the output + input_x = x + dtype = x.dtype + + # Perform the input norm and projection + B, H, W, C = x.shape + x = self.norm(x.astype(mx.float32)).astype(dtype).reshape(B, -1, C) + x = self.proj_in(x) + + # Apply the transformer + for block in self.transformer_blocks: + x = block(x, encoder_x, attn_mask, encoder_attn_mask) + + # Apply the output projection and reshape + x = self.proj_out(x) + x = x.reshape(B, H, W, C) + + return x + input_x + + +class ResnetBlock2D(nn.Module): + def __init__( + self, + in_channels: int, + out_channels: Optional[int] = None, + groups: int = 32, + temb_channels: Optional[int] = None, + ): + super().__init__() + + out_channels = out_channels or in_channels + + self.norm1 = nn.GroupNorm(groups, in_channels, pytorch_compatible=True) + self.conv1 = nn.Conv2d( + in_channels, out_channels, kernel_size=3, stride=1, padding=1 + ) + if temb_channels is not None: + self.time_emb_proj = nn.Linear(temb_channels, 
out_channels) + self.norm2 = nn.GroupNorm(groups, out_channels, pytorch_compatible=True) + self.conv2 = nn.Conv2d( + out_channels, out_channels, kernel_size=3, stride=1, padding=1 + ) + + if in_channels != out_channels: + self.conv_shortcut = nn.Linear(in_channels, out_channels) + + def __call__(self, x, temb=None): + dtype = x.dtype + + if temb is not None: + temb = self.time_emb_proj(nn.silu(temb)) + + y = self.norm1(x.astype(mx.float32)).astype(dtype) + y = nn.silu(y) + y = self.conv1(y) + if temb is not None: + y = y + temb[:, None, None, :] + y = self.norm2(y.astype(mx.float32)).astype(dtype) + y = nn.silu(y) + y = self.conv2(y) + + x = y + (x if "conv_shortcut" not in self else self.conv_shortcut(x)) + + return x + + +class UNetBlock2D(nn.Module): + def __init__( + self, + in_channels: int, + out_channels: int, + temb_channels: int, + prev_out_channels: Optional[int] = None, + num_layers: int = 1, + transformer_layers_per_block: int = 1, + num_attention_heads: int = 8, + cross_attention_dim=1280, + resnet_groups: int = 32, + add_downsample=True, + add_upsample=True, + add_cross_attention=True, + ): + super().__init__() + + # Prepare the in channels list for the resnets + if prev_out_channels is None: + in_channels_list = [in_channels] + [out_channels] * (num_layers - 1) + else: + in_channels_list = [prev_out_channels] + [out_channels] * (num_layers - 1) + res_channels_list = [out_channels] * (num_layers - 1) + [in_channels] + in_channels_list = [ + a + b for a, b in zip(in_channels_list, res_channels_list) + ] + + # Add resnet blocks that also process the time embedding + self.resnets = [ + ResnetBlock2D( + in_channels=ic, + out_channels=out_channels, + temb_channels=temb_channels, + groups=resnet_groups, + ) + for ic in in_channels_list + ] + + # Add optional cross attention layers + if add_cross_attention: + self.attentions = [ + Transformer2D( + in_channels=out_channels, + model_dims=out_channels, + num_heads=num_attention_heads, + num_layers=transformer_layers_per_block, + encoder_dims=cross_attention_dim, + ) + for i in range(num_layers) + ] + + # Add an optional downsampling layer + if add_downsample: + self.downsample = nn.Conv2d( + out_channels, out_channels, kernel_size=3, stride=2, padding=1 + ) + + # or upsampling layer + if add_upsample: + self.upsample = nn.Conv2d( + out_channels, out_channels, kernel_size=3, stride=1, padding=1 + ) + + def __call__( + self, + x, + encoder_x=None, + temb=None, + attn_mask=None, + encoder_attn_mask=None, + residual_hidden_states=None, + ): + output_states = [] + + for i in range(len(self.resnets)): + if residual_hidden_states is not None: + x = mx.concatenate([x, residual_hidden_states.pop()], axis=-1) + + x = self.resnets[i](x, temb) + + if "attentions" in self: + x = self.attentions[i](x, encoder_x, attn_mask, encoder_attn_mask) + + output_states.append(x) + + if "downsample" in self: + x = self.downsample(x) + output_states.append(x) + + if "upsample" in self: + x = self.upsample(upsample_nearest(x)) + output_states.append(x) + + return x, output_states + + +class UNetModel(nn.Module): + """The conditional 2D UNet model that actually performs the denoising.""" + + def __init__(self, config: UNetConfig): + super().__init__() + + self.conv_in = nn.Conv2d( + config.in_channels, + config.block_out_channels[0], + config.conv_in_kernel, + padding=(config.conv_in_kernel - 1) // 2, + ) + + self.timesteps = nn.SinusoidalPositionalEncoding( + config.block_out_channels[0], + max_freq=1, + min_freq=math.exp( + -math.log(10000) + 2 * 
math.log(10000) / config.block_out_channels[0] + ), + scale=1.0, + cos_first=True, + full_turns=False, + ) + self.time_embedding = TimestepEmbedding( + config.block_out_channels[0], + config.block_out_channels[0] * 4, + ) + + if config.addition_embed_type == "text_time": + self.add_time_proj = nn.SinusoidalPositionalEncoding( + config.addition_time_embed_dim, + max_freq=1, + min_freq=math.exp( + -math.log(10000) + + 2 * math.log(10000) / config.addition_time_embed_dim + ), + scale=1.0, + cos_first=True, + full_turns=False, + ) + self.add_embedding = TimestepEmbedding( + config.projection_class_embeddings_input_dim, + config.block_out_channels[0] * 4, + ) + + # Make the downsampling blocks + block_channels = [config.block_out_channels[0]] + list( + config.block_out_channels + ) + self.down_blocks = [ + UNetBlock2D( + in_channels=in_channels, + out_channels=out_channels, + temb_channels=config.block_out_channels[0] * 4, + num_layers=config.layers_per_block[i], + transformer_layers_per_block=config.transformer_layers_per_block[i], + num_attention_heads=config.num_attention_heads[i], + cross_attention_dim=config.cross_attention_dim[i], + resnet_groups=config.norm_num_groups, + add_downsample=(i < len(config.block_out_channels) - 1), + add_upsample=False, + add_cross_attention="CrossAttn" in config.down_block_types[i], + ) + for i, (in_channels, out_channels) in enumerate( + zip(block_channels, block_channels[1:]) + ) + ] + + # Make the middle block + self.mid_blocks = [ + ResnetBlock2D( + in_channels=config.block_out_channels[-1], + out_channels=config.block_out_channels[-1], + temb_channels=config.block_out_channels[0] * 4, + groups=config.norm_num_groups, + ), + Transformer2D( + in_channels=config.block_out_channels[-1], + model_dims=config.block_out_channels[-1], + num_heads=config.num_attention_heads[-1], + num_layers=config.transformer_layers_per_block[-1], + encoder_dims=config.cross_attention_dim[-1], + ), + ResnetBlock2D( + in_channels=config.block_out_channels[-1], + out_channels=config.block_out_channels[-1], + temb_channels=config.block_out_channels[0] * 4, + groups=config.norm_num_groups, + ), + ] + + # Make the upsampling blocks + block_channels = ( + [config.block_out_channels[0]] + + list(config.block_out_channels) + + [config.block_out_channels[-1]] + ) + self.up_blocks = [ + UNetBlock2D( + in_channels=in_channels, + out_channels=out_channels, + temb_channels=config.block_out_channels[0] * 4, + prev_out_channels=prev_out_channels, + num_layers=config.layers_per_block[i] + 1, + transformer_layers_per_block=config.transformer_layers_per_block[i], + num_attention_heads=config.num_attention_heads[i], + cross_attention_dim=config.cross_attention_dim[i], + resnet_groups=config.norm_num_groups, + add_downsample=False, + add_upsample=(i > 0), + add_cross_attention="CrossAttn" in config.up_block_types[i], + ) + for i, (in_channels, out_channels, prev_out_channels) in reversed( + list( + enumerate( + zip(block_channels, block_channels[1:], block_channels[2:]) + ) + ) + ) + ] + + self.conv_norm_out = nn.GroupNorm( + config.norm_num_groups, + config.block_out_channels[0], + pytorch_compatible=True, + ) + self.conv_out = nn.Conv2d( + config.block_out_channels[0], + config.out_channels, + config.conv_out_kernel, + padding=(config.conv_out_kernel - 1) // 2, + ) + + def __call__( + self, + x, + timestep, + encoder_x, + attn_mask=None, + encoder_attn_mask=None, + text_time=None, + ): + # Compute the time embeddings + temb = self.timesteps(timestep).astype(x.dtype) + temb = 
self.time_embedding(temb) + + # Add the extra text_time conditioning + if text_time is not None: + text_emb, time_ids = text_time + emb = self.add_time_proj(time_ids).flatten(1).astype(x.dtype) + emb = mx.concatenate([text_emb, emb], axis=-1) + emb = self.add_embedding(emb) + temb = temb + emb + + # Preprocess the input + x = self.conv_in(x) + + # Run the downsampling part of the unet + residuals = [x] + for block in self.down_blocks: + x, res = block( + x, + encoder_x=encoder_x, + temb=temb, + attn_mask=attn_mask, + encoder_attn_mask=encoder_attn_mask, + ) + residuals.extend(res) + + # Run the middle part of the unet + x = self.mid_blocks[0](x, temb) + x = self.mid_blocks[1](x, encoder_x, attn_mask, encoder_attn_mask) + x = self.mid_blocks[2](x, temb) + + # Run the upsampling part of the unet + for block in self.up_blocks: + x, _ = block( + x, + encoder_x=encoder_x, + temb=temb, + attn_mask=attn_mask, + encoder_attn_mask=encoder_attn_mask, + residual_hidden_states=residuals, + ) + + # Postprocess the output + dtype = x.dtype + x = self.conv_norm_out(x.astype(mx.float32)).astype(dtype) + x = nn.silu(x) + x = self.conv_out(x) + + return x diff --git a/examples/databricks_DBRX_website_bot/diffusion_mlx/vae.py b/examples/databricks_DBRX_website_bot/diffusion_mlx/vae.py new file mode 100644 index 00000000..5fd47f13 --- /dev/null +++ b/examples/databricks_DBRX_website_bot/diffusion_mlx/vae.py @@ -0,0 +1,274 @@ +# Copyright © 2023 Apple Inc. + +import math +from typing import List + +import mlx.core as mx +import mlx.nn as nn + +from .config import AutoencoderConfig +from .unet import ResnetBlock2D, upsample_nearest + + +class Attention(nn.Module): + """A single head unmasked attention for use with the VAE.""" + + def __init__(self, dims: int, norm_groups: int = 32): + super().__init__() + + self.group_norm = nn.GroupNorm(norm_groups, dims, pytorch_compatible=True) + self.query_proj = nn.Linear(dims, dims) + self.key_proj = nn.Linear(dims, dims) + self.value_proj = nn.Linear(dims, dims) + self.out_proj = nn.Linear(dims, dims) + + def __call__(self, x): + B, H, W, C = x.shape + + y = self.group_norm(x) + + queries = self.query_proj(y).reshape(B, H * W, C) + keys = self.key_proj(y).reshape(B, H * W, C) + values = self.value_proj(y).reshape(B, H * W, C) + + scale = 1 / math.sqrt(queries.shape[-1]) + scores = (queries * scale) @ keys.transpose(0, 2, 1) + attn = mx.softmax(scores, axis=-1) + y = (attn @ values).reshape(B, H, W, C) + + y = self.out_proj(y) + x = x + y + + return x + + +class EncoderDecoderBlock2D(nn.Module): + def __init__( + self, + in_channels: int, + out_channels: int, + num_layers: int = 1, + resnet_groups: int = 32, + add_downsample=True, + add_upsample=True, + ): + super().__init__() + + # Add the resnet blocks + self.resnets = [ + ResnetBlock2D( + in_channels=in_channels if i == 0 else out_channels, + out_channels=out_channels, + groups=resnet_groups, + ) + for i in range(num_layers) + ] + + # Add an optional downsampling layer + if add_downsample: + self.downsample = nn.Conv2d( + out_channels, out_channels, kernel_size=3, stride=2, padding=0 + ) + + # or upsampling layer + if add_upsample: + self.upsample = nn.Conv2d( + out_channels, out_channels, kernel_size=3, stride=1, padding=1 + ) + + def __call__(self, x): + for resnet in self.resnets: + x = resnet(x) + + if "downsample" in self: + x = mx.pad(x, [(0, 0), (0, 1), (0, 1), (0, 0)]) + x = self.downsample(x) + + if "upsample" in self: + x = self.upsample(upsample_nearest(x)) + + return x + + +class Encoder(nn.Module): + 
"""Implements the encoder side of the Autoencoder.""" + + def __init__( + self, + in_channels: int, + out_channels: int, + block_out_channels: List[int] = [64], + layers_per_block: int = 2, + resnet_groups: int = 32, + ): + super().__init__() + + self.conv_in = nn.Conv2d( + in_channels, block_out_channels[0], kernel_size=3, stride=1, padding=1 + ) + + channels = [block_out_channels[0]] + list(block_out_channels) + self.down_blocks = [ + EncoderDecoderBlock2D( + in_channels, + out_channels, + num_layers=layers_per_block, + resnet_groups=resnet_groups, + add_downsample=i < len(block_out_channels) - 1, + add_upsample=False, + ) + for i, (in_channels, out_channels) in enumerate(zip(channels, channels[1:])) + ] + + self.mid_blocks = [ + ResnetBlock2D( + in_channels=block_out_channels[-1], + out_channels=block_out_channels[-1], + groups=resnet_groups, + ), + Attention(block_out_channels[-1], resnet_groups), + ResnetBlock2D( + in_channels=block_out_channels[-1], + out_channels=block_out_channels[-1], + groups=resnet_groups, + ), + ] + + self.conv_norm_out = nn.GroupNorm( + resnet_groups, block_out_channels[-1], pytorch_compatible=True + ) + self.conv_out = nn.Conv2d(block_out_channels[-1], out_channels, 3, padding=1) + + def __call__(self, x): + x = self.conv_in(x) + + for l in self.down_blocks: + x = l(x) + + x = self.mid_blocks[0](x) + x = self.mid_blocks[1](x) + x = self.mid_blocks[2](x) + + x = self.conv_norm_out(x) + x = nn.silu(x) + x = self.conv_out(x) + + return x + + +class Decoder(nn.Module): + """Implements the decoder side of the Autoencoder.""" + + def __init__( + self, + in_channels: int, + out_channels: int, + block_out_channels: List[int] = [64], + layers_per_block: int = 2, + resnet_groups: int = 32, + ): + super().__init__() + + self.conv_in = nn.Conv2d( + in_channels, block_out_channels[-1], kernel_size=3, stride=1, padding=1 + ) + + self.mid_blocks = [ + ResnetBlock2D( + in_channels=block_out_channels[-1], + out_channels=block_out_channels[-1], + groups=resnet_groups, + ), + Attention(block_out_channels[-1], resnet_groups), + ResnetBlock2D( + in_channels=block_out_channels[-1], + out_channels=block_out_channels[-1], + groups=resnet_groups, + ), + ] + + channels = list(reversed(block_out_channels)) + channels = [channels[0]] + channels + self.up_blocks = [ + EncoderDecoderBlock2D( + in_channels, + out_channels, + num_layers=layers_per_block, + resnet_groups=resnet_groups, + add_downsample=False, + add_upsample=i < len(block_out_channels) - 1, + ) + for i, (in_channels, out_channels) in enumerate(zip(channels, channels[1:])) + ] + + self.conv_norm_out = nn.GroupNorm( + resnet_groups, block_out_channels[0], pytorch_compatible=True + ) + self.conv_out = nn.Conv2d(block_out_channels[0], out_channels, 3, padding=1) + + def __call__(self, x): + x = self.conv_in(x) + + x = self.mid_blocks[0](x) + x = self.mid_blocks[1](x) + x = self.mid_blocks[2](x) + + for l in self.up_blocks: + x = l(x) + + x = self.conv_norm_out(x) + x = nn.silu(x) + x = self.conv_out(x) + + return x + + +class Autoencoder(nn.Module): + """The autoencoder that allows us to perform diffusion in the latent space.""" + + def __init__(self, config: AutoencoderConfig): + super().__init__() + + self.latent_channels = config.latent_channels_in + self.scaling_factor = config.scaling_factor + self.encoder = Encoder( + config.in_channels, + config.latent_channels_out, + config.block_out_channels, + config.layers_per_block, + resnet_groups=config.norm_num_groups, + ) + self.decoder = Decoder( + config.latent_channels_in, + 
config.out_channels, + config.block_out_channels, + config.layers_per_block + 1, + resnet_groups=config.norm_num_groups, + ) + + self.quant_proj = nn.Linear( + config.latent_channels_out, config.latent_channels_out + ) + self.post_quant_proj = nn.Linear( + config.latent_channels_in, config.latent_channels_in + ) + + def decode(self, z): + z = z / self.scaling_factor + return self.decoder(self.post_quant_proj(z)) + + def encode(self, x): + x = self.encoder(x) + x = self.quant_proj(x) + mean, logvar = x.split(2, axis=-1) + mean = mean * self.scaling_factor + logvar = logvar + 2 * math.log(self.scaling_factor) + + return mean, logvar + + def __call__(self, x, key=None): + mean, logvar = self.encode(x) + z = mx.random.normal(mean.shape, key=key) * mx.exp(0.5 * logvar) + mean + x_hat = self.decode(z) + + return dict(x_hat=x_hat, z=z, mean=mean, logvar=logvar) diff --git a/examples/databricks_DBRX_website_bot/gen_image.py b/examples/databricks_DBRX_website_bot/gen_image.py new file mode 100644 index 00000000..0e80f3af --- /dev/null +++ b/examples/databricks_DBRX_website_bot/gen_image.py @@ -0,0 +1,142 @@ +from diffusers import ( + StableDiffusionPipeline, + StableDiffusionXLPipeline, + AutoPipelineForText2Image, +) +import mlx.core as mx +from diffusion_mlx import StableDiffusion, StableDiffusionXL +import torch +from tqdm import tqdm +from PIL import Image +import numpy as np +import time + +SUPPORTS_NEGATIVE_PROMPT = False +GLOBAL_NEGATIVE_PROMPT = ( + "3d, cartoon, anime, (deformed eyes, nose, ears, nose), bad anatomy, ugly, text" +) +RESPONSE_TO_DIFFUSER_PROMPT = "Get minimal text (no longer than 70 tokesn) describe the response and use it as a prompt for a diffuser: {} | avoid adding text to the image |" + +""" +MODEL_MAP = { + "runway_diffusion_v1": "runwayml/stable-diffusion-v1-5", + "sdxl": "stabilityai/stable-diffusion-xl-base-1.0", +} + +def load_model(model_id="runway_diffusion_v1"): + global MODEL_PIPE, SUPPORTS_NEGATIVE_PROMPT + if model_id == "runway_diffusion_v1": + MODEL_PIPE = StableDiffusionPipeline.from_pretrained(MODEL_MAP[model_id]) + elif model_id == "sdxl": + MODEL_PIPE = StableDiffusionXLPipeline.from_pretrained( + "stabilityai/stable-diffusion-xl-base-1.0", variant="fp16", use_safetensors=True + ) + SUPPORTS_NEGATIVE_PROMPT = True + elif model_id == "sdxl-turbo": + MODEL_PIPE = AutoPipelineForText2Image.from_pretrained("stabilityai/sdxl-turbo", variant="fp16") + + + +def generate_image(prompt, model_id="runway_diffusion_v1"): + prompt += " | avoid adding text to the image |" + image = MODEL_PIPE(prompt).images[0] if not SUPPORTS_NEGATIVE_PROMPT else MODEL_PIPE(prompt, negative_prompt=GLOBAL_NEGATIVE_PROMPT).images[0] + return image +""" + +### MLX version +import mlx.core as mx +import mlx.nn as nn + + +def load_models(model="sdxl", float16=True, quantize=True, preload_models=True): + # Load the models + if model == "sdxl": + model = StableDiffusionXL("stabilityai/sdxl-turbo", float16=float16) + if quantize: + nn.quantize( + model.text_encoder_1, + class_predicate=lambda _, m: isinstance(m, nn.Linear), + ) + nn.quantize( + model.text_encoder_2, + class_predicate=lambda _, m: isinstance(m, nn.Linear), + ) + nn.quantize(model.unet, group_size=32, bits=8) + steps = 2 + else: + model = StableDiffusion( + "stabilityai/stable-diffusion-2-1-base", float16=float16 + ) + if quantize: + nn.quantize( + model.text_encoder, + class_predicate=lambda _, m: isinstance(m, nn.Linear), + ) + nn.quantize(model.unet, group_size=32, bits=8) + steps = 50 + + # Ensure that models are read in 
memory if needed + if preload_models: + model.ensure_models_are_loaded() + + return model, steps + + +def generate_image(model, steps, prompt, verbose=True): + # Generate the latent vectors using diffusion + time1 = time.time() + latents = model.generate_latents( + prompt, + n_images=1, + num_steps=steps, + negative_text=GLOBAL_NEGATIVE_PROMPT, + ) + for x_t in tqdm(latents, total=steps): + mx.eval(x_t) + + # The following is not necessary but it may help in memory + # constrained systems by reusing the memory kept by the unet and the text + # encoders. + + # if model == "sdxl": + # del MODEL_PIPE.text_encoder_1 + # del MODEL_PIPE.text_encoder_2 + # else: + # del MODEL_PIPE.text_encoder + # del sd.unet + # del sd.sampler + peak_mem_unet = mx.metal.get_peak_memory() / 1024**3 + + # Decode them into images + decoded = [] + for i in tqdm(range(0, 1, 1)): + decoded.append(model.decode(x_t[i : i + 1])) + mx.eval(decoded[-1]) + peak_mem_overall = mx.metal.get_peak_memory() / 1024**3 + + # Arrange them on a grid + x = mx.concatenate(decoded, axis=0) + x = mx.pad(x, [(0, 0), (8, 8), (8, 8), (0, 0)]) + B, H, W, C = x.shape + x = x.reshape(1, B, H, W, C).transpose(0, 2, 1, 3, 4) + x = x.reshape(1 * H, B * W, C) + x = (x * 255).astype(mx.uint8) + + time2 = time.time() + if verbose: + print(f"Time taken to generate the image: {time2 - time1:.3f}s") + # Save them to disc + im = Image.fromarray(np.array(x)) + + # Report the peak memory used during generation + if verbose: + print(f"Peak memory used for the unet: {peak_mem_unet:.3f}GB") + print(f"Peak memory used overall: {peak_mem_overall:.3f}GB") + + return im + + +if __name__ == "__main__": + load_models() + generate_image("A cartoon of a cute cat", verbose=True) + generate_image("Hogwartz school of witchcraft and wizardry", verbose=True) diff --git a/examples/databricks_DBRX_website_bot/gui.py b/examples/databricks_DBRX_website_bot/gui.py new file mode 100644 index 00000000..95360a09 --- /dev/null +++ b/examples/databricks_DBRX_website_bot/gui.py @@ -0,0 +1,76 @@ +import streamlit as st +from main import build_RAG +from gen_image import generate_image, RESPONSE_TO_DIFFUSER_PROMPT +from llama_index.core import Settings + + +def add_to_session(key, value): + st.session_state[key] = value + + +def main(): + st.title("Databricks DBRX Website Bot") + if st.session_state.get("query_engine") is None: + context = st.text_area( + "Enter the link to the context", + value="https://harrypotter.fandom.com/wiki/Hogwarts_School_of_Witchcraft_and_Wizardry", + ) + illustrate = st.checkbox("Illustrate") + steps = st.selectbox("Select the number of steps for diffusion", (1, 2)) + build_rag = st.button("Build RAG") + query_engine, model = None, None + if build_rag: + query_engine, model, _ = build_RAG( + context, + "mixedbread-ai/mxbai-embed-large-v1", + "~/tmp/lancedb_hogwarts_12", + False, + illustrate, + "sdxl", + ) + add_to_session("query_engine", query_engine) + add_to_session("model", model) + add_to_session("steps", steps or 1) + add_to_session("illustrate", illustrate) + print("steps", steps) + st._experimental_rerun() + else: + query_engine = st.session_state["query_engine"] + model = st.session_state["model"] + steps = st.session_state["steps"] + illustrate = st.session_state["illustrate"] + col1, col2 = st.columns(2) + with col1: + query = st.text_input( + "Enter a question", + value="What is Hogwarts?", + label_visibility="collapsed", + ) + with col2: + enter = st.button("Enter") + if enter: + response = query_engine.chat(query) + if illustrate: + with col1: 
+ st.write("Response") + st.write(response.response) + with col2: + st.write("Illustration") + with st.spinner("waiting"): + image = generate_image( + model, + steps, + Settings.llm.complete( + RESPONSE_TO_DIFFUSER_PROMPT.format( + str(response.response) + ) + ).text, + ) + st.image(image) + else: + st.write("Response") + st.write(response) + + +if __name__ == "__main__": + main() diff --git a/examples/databricks_DBRX_website_bot/main.py b/examples/databricks_DBRX_website_bot/main.py index cef72f9b..9dad226c 100644 --- a/examples/databricks_DBRX_website_bot/main.py +++ b/examples/databricks_DBRX_website_bot/main.py @@ -5,6 +5,9 @@ from llama_index.vector_stores.lancedb import LanceDBVectorStore from llama_index.llms.databricks import Databricks from llama_index.embeddings.huggingface import HuggingFaceEmbedding +from gen_image import load_models, generate_image, RESPONSE_TO_DIFFUSER_PROMPT + +MODEL, STEPS = None, None def get_doc_from_url(url): @@ -17,22 +20,24 @@ def build_RAG( embed_model="mixedbread-ai/mxbai-embed-large-v1", uri="~/tmp/lancedb_hogwart", force_create_embeddings=False, + illustrate=True, + diffuser_model="sdxl", ): Settings.embed_model = HuggingFaceEmbedding(model_name=embed_model) Settings.llm = Databricks(model="databricks-dbrx-instruct") - + if illustrate: + print("Loading sdxl model") + model, steps = load_models(diffuser_model) + # This is a hack to tradeoff between speed and quality + steps = 1 + print("Model loaded") documents = get_doc_from_url(url) vector_store = LanceDBVectorStore(uri=uri) storage_context = StorageContext.from_defaults(vector_store=vector_store) index = VectorStoreIndex.from_documents(documents, storage_context=storage_context) query_engine = index.as_chat_engine() - print("Ask a question relevant to the given context:") - while True: - query = input() - response = query_engine.chat(query) - print(response) - print("\n") + return query_engine, model, steps if __name__ == "__main__": @@ -61,5 +66,42 @@ def build_RAG( default=False, help="Force create embeddings", ) + parser.add_argument( + "--diffuser_model", + type=str, + default="sdxl", + help="Model ID", + ) + + parser.add_argument( + "--illustrate", + type=bool, + default=True, + help="Annotate", + ) args = parser.parse_args() - build_RAG(args.url, args.embed_model, args.uri, args.force_create_embeddings) + # hardcode model because no one should use sd + args.diffuser_model = "sdxl" + query_engine, model, steps = build_RAG( + args.url, + args.embed_model, + args.uri, + args.force_create_embeddings, + args.illustrate, + args.diffuser_model, + ) + + print("Ask a question relevant to the given context:") + while True: + query = input() + response = query_engine.chat(query) + print(response) + print("\n Illustrating the response...:") + image = generate_image( + model, + steps, + Settings.llm.complete( + RESPONSE_TO_DIFFUSER_PROMPT.format(str(response.response)) + ).text, + ) + image.show() diff --git a/examples/databricks_DBRX_website_bot/requirements.txt b/examples/databricks_DBRX_website_bot/requirements.txt index a25f1f64..f5719f61 100644 --- a/examples/databricks_DBRX_website_bot/requirements.txt +++ b/examples/databricks_DBRX_website_bot/requirements.txt @@ -2,4 +2,12 @@ llama-index llama-index-llms-databricks llama-index-embeddings-huggingface llama-index-readers-web -llama-index-vector-stores-lancedb \ No newline at end of file +llama-index-vector-stores-lancedb +diffusers +mlx>=0.11 +huggingface-hub +regex +numpy +tqdm +Pillow +streamlit \ No newline at end of file diff --git 
a/examples/imagebind_demo/README.md b/examples/imagebind_demo/README.md index 2fec470d..1c925f14 100644 --- a/examples/imagebind_demo/README.md +++ b/examples/imagebind_demo/README.md @@ -2,6 +2,8 @@ A gradio app showcasing multi-modal capabilities of Imagebind supported via lanceDB API +![alt text](<../../assets/imagebind-demo.png>) + ## Usage you can run it locally by cloning the project as mentioned below, or access via Spaces: hf spaces diff --git a/examples/movie-recommendation-with-genres/README.md b/examples/movie-recommendation-with-genres/README.md new file mode 100644 index 00000000..f79e4e4a --- /dev/null +++ b/examples/movie-recommendation-with-genres/README.md @@ -0,0 +1,9 @@ +# Movie Recommendation using Emebeddings and VectorDB + +![alt text](../../assets/movie-recommendation-with-genre.png) + +This example provides a comprehensive guide on creating a movie recommendation system by leveraging the power of Embeddings and VectorDB. We'll explore how combining these two techniques can significantly enhance the recommendation experience, addressing key challenges faced by traditional systems. + +Colab walkthrough - Open In Colab + +[Read the Blog Post](https://blog.lancedb.com/movie-recommendation-system-using-lancedb-and-doc2vec/) \ No newline at end of file diff --git a/examples/movie-recommendation-with-genres/movie_recommendation_with_doc2vec_and_lancedb.ipynb b/examples/movie-recommendation-with-genres/movie_recommendation_with_doc2vec_and_lancedb.ipynb new file mode 100644 index 00000000..2a75b19a --- /dev/null +++ b/examples/movie-recommendation-with-genres/movie_recommendation_with_doc2vec_and_lancedb.ipynb @@ -0,0 +1,614 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": { + "id": "K45xhdPRsZJV" + }, + "source": [ + "# Movie Recommendation System using Doc2vec Embeddings and Vector DB" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "XUj6NXD0sdgf" + }, + "source": [ + "This Colab notebook aims to illustrate the process of creating a recommendation system using embeddings and a Vector DB.\n", + "\n", + "This approach involves combining the various movie genres or characteristics of a movie to form Doc2Vec embeddings, which offer a comprehensive portrayal of the movie content.\n", + "\n", + "These embeddings serve dual purposes: they can either be directly inputted into a classification model for genre classification or stored in a VectorDB. By storing embeddings in a VectorDB, efficient retrieval and query search for recommendations become possible at a later stage.\n" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "qEa74a_Wtpc7" + }, + "source": [ + "### Installing the relevant dependencies\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "hyde90IntuFi" + }, + "outputs": [], + "source": [ + "!pip install torch scikit-learn lancedb nltk gensim lancedb scipy==1.12 kaggle" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "shPjHTZbtxTh" + }, + "source": [ + "## Kaggle Configuration and Data Needs\n", + "\n", + "We are using a movies metadata data which is being uploaded on the Kaggle. To download the dataset and use it for our recommendation system, we will need a `kaggle.json` file containing our creds.\n", + "\n", + "You can download the `kaggle.json` file from your Kaggle account settings. Follow these steps and make your life easy.\n", + "\n", + "1. Go to Kaggle and log in to your account.\n", + "2. 
Navigate to Your Account Settings and click on your profile picture in the top right corner of the page, Now From the dropdown menu, select `Account`.\n", + "3. Scroll down to the `API` section, Click on `Create New API Token`. This will download a file named kaggle.json to your computer.\n", + "\n", + "Once you have the `kaggle.json` file, you need to upload it here on colab data space. After uploading the `kaggle.json` file, run the following code to set up the credentials and download the dataset in `data` directory" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "6Tl2qzgKsWtF" + }, + "outputs": [], + "source": [ + "import json\n", + "import os\n", + "\n", + "# Assuming kaggle.json is uploaded to the current directory\n", + "with open(\"kaggle.json\") as f:\n", + " kaggle_credentials = json.load(f)\n", + "\n", + "os.environ[\"KAGGLE_USERNAME\"] = kaggle_credentials[\"username\"]\n", + "os.environ[\"KAGGLE_KEY\"] = kaggle_credentials[\"key\"]" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "8va-0of3sU0x" + }, + "outputs": [], + "source": [ + "from kaggle.api.kaggle_api_extended import KaggleApi\n", + "\n", + "# Initialize the Kaggle API\n", + "api = KaggleApi()\n", + "api.authenticate()\n", + "\n", + "# Specify the dataset you want to download\n", + "dataset = \"rounakbanik/the-movies-dataset\"\n", + "destination = \"data/\"\n", + "\n", + "# Create the destination directory if it doesn't exist\n", + "if not os.path.exists(destination):\n", + " os.makedirs(destination)\n", + "\n", + "# Download the dataset\n", + "api.dataset_download_files(dataset, path=destination, unzip=True)\n", + "\n", + "print(f\"Dataset {dataset} downloaded to {destination}\")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "hBYzad3lrY4e", + "outputId": "5a8f7983-80be-47e0-aa9c-ae4e10495c1e" + }, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "100%|██████████| 1000/1000 [00:00<00:00, 5050.83it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5161.29it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5006.18it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5222.83it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5216.24it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5171.35it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5109.78it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5222.42it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5133.39it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5024.74it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5117.18it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4963.78it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5405.55it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5369.51it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5349.33it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5374.53it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5194.32it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5296.75it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5204.32it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5309.43it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5333.12it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5289.35it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5317.42it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5322.46it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5378.43it/s]\n", + "100%|██████████| 1000/1000 
[00:00<00:00, 5488.32it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5546.43it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 2502.38it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5369.91it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4354.99it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5193.60it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5536.27it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 3476.56it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4819.07it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4500.37it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5184.11it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5098.14it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5523.73it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4655.12it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5113.63it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5336.63it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5564.83it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5310.91it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 5533.46it/s]\n", + "100%|██████████| 1000/1000 [00:00<00:00, 4255.41it/s]\n", + "100%|██████████| 466/466 [00:00<00:00, 5617.03it/s]\n", + "Building Vocabulary: 100%|██████████| 44506/44506 [00:00<00:00, 104121.48it/s]\n", + "Epoch 1: 100%|██████████| 44506/44506 [00:02<00:00, 20444.80it/s]\n", + "Epoch 2: 100%|██████████| 44506/44506 [00:02<00:00, 20700.43it/s]\n", + "Epoch 3: 100%|██████████| 44506/44506 [00:02<00:00, 20831.06it/s]\n", + "Epoch 4: 100%|██████████| 44506/44506 [00:02<00:00, 20885.78it/s]\n", + "Epoch 5: 100%|██████████| 44506/44506 [00:02<00:00, 19616.38it/s]\n", + "Epoch 6: 100%|██████████| 44506/44506 [00:02<00:00, 19634.24it/s]\n", + "Epoch 7: 100%|██████████| 44506/44506 [00:02<00:00, 20579.08it/s]\n", + "Epoch 8: 100%|██████████| 44506/44506 [00:02<00:00, 20727.00it/s]\n", + "Epoch 9: 100%|██████████| 44506/44506 [00:02<00:00, 21242.19it/s]\n", + "Epoch 10: 100%|██████████| 44506/44506 [00:02<00:00, 18476.39it/s]\n", + "Epoch 11: 100%|██████████| 44506/44506 [00:02<00:00, 21169.07it/s]\n", + "Epoch 12: 100%|██████████| 44506/44506 [00:02<00:00, 20967.64it/s]\n", + "Epoch 13: 100%|██████████| 44506/44506 [00:02<00:00, 20192.34it/s]\n", + "Epoch 14: 100%|██████████| 44506/44506 [00:02<00:00, 18910.62it/s]\n", + "Epoch 15: 100%|██████████| 44506/44506 [00:02<00:00, 20810.41it/s]\n", + "Epoch 16: 100%|██████████| 44506/44506 [00:02<00:00, 21361.88it/s]\n", + "Epoch 17: 100%|██████████| 44506/44506 [00:02<00:00, 18440.51it/s]\n", + "Epoch 18: 100%|██████████| 44506/44506 [00:02<00:00, 21206.01it/s]\n", + "Epoch 19: 100%|██████████| 44506/44506 [00:02<00:00, 20086.00it/s]\n", + "Epoch 20: 100%|██████████| 44506/44506 [00:02<00:00, 20943.08it/s]\n" + ] + } + ], + "source": [ + "import pandas as pd\n", + "import numpy as np\n", + "import torch\n", + "import torch.nn as nn\n", + "import torch.optim as optim\n", + "from torch.utils.data import DataLoader, TensorDataset\n", + "from gensim.models.doc2vec import Doc2Vec, TaggedDocument\n", + "from nltk.tokenize import word_tokenize\n", + "from sklearn.preprocessing import MultiLabelBinarizer\n", + "from sklearn.model_selection import train_test_split\n", + "from tqdm import tqdm\n", + "\n", + "# Read data from CSV file\n", + "movie_data = pd.read_csv(\n", + " \"/Users/vipul/Nova/Projects/genre_spectrum/movies_metadata.csv\", low_memory=False\n", + ")\n", + "device = torch.device(\"cuda\" if torch.cuda.is_available() else \"cpu\")\n", + 
"\n", + "\n", + "def preprocess_data(movie_data_chunk):\n", + " tagged_docs = []\n", + " valid_indices = []\n", + " movie_info = []\n", + "\n", + " # Wrap your loop with tqdm\n", + " for i, row in tqdm(movie_data_chunk.iterrows(), total=len(movie_data_chunk)):\n", + " try:\n", + " # Constructing movie text\n", + " movies_text = \"\"\n", + " movies_text += \"Overview: \" + row[\"overview\"] + \"\\n\"\n", + " genres = \", \".join([genre[\"name\"] for genre in eval(row[\"genres\"])])\n", + " movies_text += \"Overview: \" + row[\"overview\"] + \"\\n\"\n", + " movies_text += \"Genres: \" + genres + \"\\n\"\n", + " movies_text += \"Title: \" + row[\"title\"] + \"\\n\"\n", + " tagged_docs.append(\n", + " TaggedDocument(words=word_tokenize(movies_text.lower()), tags=[str(i)])\n", + " )\n", + " valid_indices.append(i)\n", + " movie_info.append((row[\"title\"], genres))\n", + " except Exception as e:\n", + " continue\n", + "\n", + " return tagged_docs, valid_indices, movie_info\n", + "\n", + "\n", + "def train_doc2vec_model(tagged_data, num_epochs=20):\n", + " # Initialize Doc2Vec model\n", + " doc2vec_model = Doc2Vec(vector_size=100, min_count=2, epochs=num_epochs)\n", + " doc2vec_model.build_vocab(tqdm(tagged_data, desc=\"Building Vocabulary\"))\n", + " for epoch in range(num_epochs):\n", + " doc2vec_model.train(\n", + " tqdm(tagged_data, desc=f\"Epoch {epoch+1}\"),\n", + " total_examples=doc2vec_model.corpus_count,\n", + " epochs=doc2vec_model.epochs,\n", + " )\n", + "\n", + " return doc2vec_model\n", + "\n", + "\n", + "# Preprocess data and extract genres for the first 1000 movies\n", + "chunk_size = 1000\n", + "tagged_data = []\n", + "valid_indices = []\n", + "movie_info = []\n", + "for chunk_start in range(0, len(movie_data), chunk_size):\n", + " movie_data_chunk = movie_data.iloc[chunk_start : chunk_start + chunk_size]\n", + " chunk_tagged_data, chunk_valid_indices, chunk_movie_info = preprocess_data(\n", + " movie_data_chunk\n", + " )\n", + " tagged_data.extend(chunk_tagged_data)\n", + " valid_indices.extend(chunk_valid_indices)\n", + " movie_info.extend(chunk_movie_info)\n", + "\n", + "doc2vec_model = train_doc2vec_model(tagged_data)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "VryHT1zVuEp0" + }, + "source": [ + "### Training a Neural Network for the Genre Classification Task" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "3pVNy2UKt5lu" + }, + "outputs": [], + "source": [ + "# Extract genre labels for the valid indices\n", + "genres_list = []\n", + "for i in valid_indices:\n", + " row = movie_data.loc[i]\n", + " genres = [genre[\"name\"] for genre in eval(row[\"genres\"])]\n", + " genres_list.append(genres)\n", + "\n", + "mlb = MultiLabelBinarizer()\n", + "genre_labels = mlb.fit_transform(genres_list)\n", + "\n", + "embeddings = []\n", + "for i in valid_indices:\n", + " embeddings.append(doc2vec_model.dv[str(i)])\n", + "X_train, X_test, y_train, y_test = train_test_split(\n", + " embeddings, genre_labels, test_size=0.2, random_state=42\n", + ")\n", + "\n", + "X_train_np = np.array(X_train, dtype=np.float32)\n", + "y_train_np = np.array(y_train, dtype=np.float32)\n", + "X_test_np = np.array(X_test, dtype=np.float32)\n", + "y_test_np = np.array(y_test, dtype=np.float32)\n", + "\n", + "X_train_tensor = torch.tensor(X_train_np)\n", + "y_train_tensor = torch.tensor(y_train_np)\n", + "X_test_tensor = torch.tensor(X_test_np)\n", + "y_test_tensor = torch.tensor(y_test_np)\n", + "\n", + "\n", + "class GenreClassifier(nn.Module):\n", + " 
def __init__(self, input_size, output_size):\n", + " super(GenreClassifier, self).__init__()\n", + " self.fc1 = nn.Linear(input_size, 512)\n", + " self.bn1 = nn.BatchNorm1d(512)\n", + " self.fc2 = nn.Linear(512, 256)\n", + " self.bn2 = nn.BatchNorm1d(256)\n", + " self.fc3 = nn.Linear(256, 128)\n", + " self.bn3 = nn.BatchNorm1d(128)\n", + " self.fc4 = nn.Linear(128, output_size)\n", + " self.relu = nn.ReLU()\n", + " self.dropout = nn.Dropout(p=0.2) # Adjust the dropout rate as needed\n", + "\n", + " def forward(self, x):\n", + " x = self.fc1(x)\n", + " x = self.bn1(x)\n", + " x = self.relu(x)\n", + " x = self.dropout(x)\n", + " x = self.fc2(x)\n", + " x = self.bn2(x)\n", + " x = self.relu(x)\n", + " x = self.dropout(x)\n", + " x = self.fc3(x)\n", + " x = self.bn3(x)\n", + " x = self.relu(x)\n", + " x = self.dropout(x)\n", + " x = self.fc4(x)\n", + " return x\n", + "\n", + "\n", + "# Move model to the selected device\n", + "model = GenreClassifier(input_size=100, output_size=len(mlb.classes_)).to(device)\n", + "\n", + "# Define loss function and optimizer\n", + "criterion = nn.BCEWithLogitsLoss()\n", + "optimizer = optim.Adam(model.parameters(), lr=0.001)\n", + "\n", + "# Training loop\n", + "epochs = 50\n", + "batch_size = 64\n", + "\n", + "train_dataset = TensorDataset(X_train_tensor.to(device), y_train_tensor.to(device))\n", + "train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True)\n", + "\n", + "for epoch in range(epochs):\n", + " model.train()\n", + " running_loss = 0.0\n", + " for inputs, labels in train_loader:\n", + " inputs, labels = inputs.to(device), labels.to(device) # Move data to device\n", + " optimizer.zero_grad()\n", + " outputs = model(inputs)\n", + " loss = criterion(outputs, labels)\n", + " loss.backward()\n", + " optimizer.step()\n", + " running_loss += loss.item() * inputs.size(0)\n", + " epoch_loss = running_loss / len(train_loader.dataset)\n", + " print(f\"Epoch [{epoch + 1}/{epochs}], Loss: {epoch_loss:.4f}\")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "yV8lTDYIubEQ" + }, + "source": [ + "### Testing the `model` to see if our model is able to predict the genres for the movies from the test dataset" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "73D3aqdJuct8" + }, + "outputs": [], + "source": [ + "from sklearn.metrics import f1_score\n", + "\n", + "model.eval()\n", + "with torch.no_grad():\n", + " X_test_tensor, y_test_tensor = X_test_tensor.to(device), y_test_tensor.to(\n", + " device\n", + " ) # Move test data to device\n", + " outputs = model(X_test_tensor)\n", + " test_loss = criterion(outputs, y_test_tensor)\n", + " print(f\"Test Loss: {test_loss.item():.4f}\")\n", + "\n", + "\n", + "thresholds = [0.1] * len(mlb.classes_)\n", + "thresholds_tensor = torch.tensor(thresholds, device=device).unsqueeze(0)\n", + "\n", + "# Convert the outputs to binary predictions using varying thresholds\n", + "predicted_labels = (outputs > thresholds_tensor).cpu().numpy()\n", + "\n", + "# Convert binary predictions and actual labels to multi-label format\n", + "predicted_multilabels = mlb.inverse_transform(predicted_labels)\n", + "actual_multilabels = mlb.inverse_transform(y_test_np)\n", + "\n", + "# Print the Predicted and Actual Labels for each movie\n", + "for i, (predicted, actual) in enumerate(zip(predicted_multilabels, actual_multilabels)):\n", + " print(f\"Movie {i+1}:\")\n", + " print(f\" Predicted Labels: {predicted}\")\n", + " print(f\" Actual Labels: {actual}\")\n", + "\n", + "\n", + "# 
Compute F1-score\n", + "f1 = f1_score(y_test_np, predicted_labels, average=\"micro\")\n", + "print(f\"F1-score: {f1:.4f}\")\n", + "\n", + "# Saving the trained model\n", + "torch.save(model.state_dict(), \"trained_model.pth\")" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "kZrHpMm4un0G" + }, + "source": [ + "### Storing the Doc2Vec Embeddings into LanceDB VectorDatabase" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "BTTNb9irrY4h" + }, + "outputs": [], + "source": [ + "import lancedb\n", + "import numpy as np\n", + "import pandas as pd\n", + "\n", + "data = []\n", + "\n", + "for i in valid_indices:\n", + " embedding = doc2vec_model.dv[str(i)]\n", + " title, genres = movie_info[valid_indices.index(i)]\n", + " data.append({\"title\": title, \"genres\": genres, \"vector\": embedding.tolist()})\n", + "\n", + "db = lancedb.connect(\".db\")\n", + "tbl = db.create_table(\"doc2vec_embeddings\", data, mode=\"Overwrite\")\n", + "db[\"doc2vec_embeddings\"].head()" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "ciUFn7uQrY4i" + }, + "outputs": [], + "source": [ + "def get_recommendations(title):\n", + " pd_data = pd.DataFrame(data)\n", + " result = (\n", + " tbl.search(pd_data[pd_data[\"title\"] == title][\"vector\"].values[0])\n", + " .metric(\"cosine\")\n", + " .limit(10)\n", + " .to_pandas()\n", + " )\n", + " return result[[\"title\"]]" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "8Kz-JGsTuwmk" + }, + "source": [ + "### D-Day : Let's generate some recommendations" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "uw_El12JrY4j", + "outputId": "c245bab5-7966-4fd1-ec72-37f708c3b570" + }, + "outputs": [ + { + "data": { + "text/html": [ + "
\n", + "\n", + "\n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + "
title
0Vertical Limit
1Demons of War
2Fear and Desire
3Escape from Sobibor
4Last Girl Standing
5K2: Siren of the Himalayas
6Ghost Ship
7Camp Massacre
8Captain Nemo and the Underwater City
9Seas Beneath
\n", + "
" + ], + "text/plain": [ + " title\n", + "0 Vertical Limit\n", + "1 Demons of War\n", + "2 Fear and Desire\n", + "3 Escape from Sobibor\n", + "4 Last Girl Standing\n", + "5 K2: Siren of the Himalayas\n", + "6 Ghost Ship\n", + "7 Camp Massacre\n", + "8 Captain Nemo and the Underwater City\n", + "9 Seas Beneath" + ] + }, + "execution_count": 20, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "get_recommendations(\"Vertical Limit\")" + ] + } + ], + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "display_name": "env", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.12.3" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} diff --git a/examples/parent_document_retriever/main.ipynb b/examples/parent_document_retriever/main.ipynb index eead31ea..9b7e51a3 100644 --- a/examples/parent_document_retriever/main.ipynb +++ b/examples/parent_document_retriever/main.ipynb @@ -109,9 +109,9 @@ "metadata": {}, "outputs": [], "source": [ - "os.environ[\"OPENAI_API_KEY\"] = (\n", - " \"YOUR_API_KEY_HERE\" # NEEDED if you run LLM Experiment below\n", - ")" + "os.environ[\n", + " \"OPENAI_API_KEY\"\n", + "] = \"YOUR_API_KEY_HERE\" # NEEDED if you run LLM Experiment below" ] }, { diff --git a/tutorials/Advace_RAG_LlamaParser/README.md b/tutorials/Advace_RAG_LlamaParser/README.md new file mode 100644 index 00000000..50a5d258 --- /dev/null +++ b/tutorials/Advace_RAG_LlamaParser/README.md @@ -0,0 +1,18 @@ +## Advanced RAG: Extracting Complex PDFs containing tables & Text Using LlamaParse +[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Advace_RAG_LlamaParser/main.ipynb) + +This example contains code and examples for comparing LangChain, LlamaIndex, and LlamaParse in extracting data from PDFs, especially those with complex tables and text. + +### Overview +In this project, we explore: + +* Q&A on PDF data using LangChain + +* Q&A on PDF data using LlamaIndex + +* Q&A on PDF data using LlamaIndex with LlamaParse + +The results of each method are compared in colab notebook +[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/lancedb/vectordb-recipes/blob/main/tutorials/Advace_RAG_LlamaParser/main.ipynb) + + diff --git a/tutorials/Advace_RAG_LlamaParser/main.ipynb b/tutorials/Advace_RAG_LlamaParser/main.ipynb new file mode 100644 index 00000000..c303d0f9 --- /dev/null +++ b/tutorials/Advace_RAG_LlamaParser/main.ipynb @@ -0,0 +1,3529 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": { + "id": "7wD8dJo-WZH7" + }, + "source": [ + "This notebook compares Langchain & Llamaindex for understand which method is best extraction of table & text from PDF in the following\n", + "\n", + "\n", + "Here we have covered\n", + "\n", + "1. Langchain RAG\n", + "2. Llamaindex RAG\n", + "3. Langchain wiht llamaparser\n", + "4. 
Llamaindex with llamaparser\n", + "\n", + "\n", + "from above this method will get idea about which is best method for table extraction for the following data used\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "HGcMcXLF7zoM", + "outputId": "b4f6876b-0531-4bce-c1a6-1615c77322a2" + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Collecting llama-index\n", + " Downloading llama_index-0.10.37-py3-none-any.whl (6.8 kB)\n", + "Collecting llama-index-core\n", + " Downloading llama_index_core-0.10.37.post1-py3-none-any.whl (15.4 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m15.4/15.4 MB\u001b[0m \u001b[31m40.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting llama-index-embeddings-openai\n", + " Downloading llama_index_embeddings_openai-0.1.9-py3-none-any.whl (6.0 kB)\n", + "Collecting llama-parse\n", + " Downloading llama_parse-0.4.3-py3-none-any.whl (7.7 kB)\n", + "Collecting llama-index-agent-openai<0.3.0,>=0.1.4 (from llama-index)\n", + " Downloading llama_index_agent_openai-0.2.5-py3-none-any.whl (13 kB)\n", + "Collecting llama-index-cli<0.2.0,>=0.1.2 (from llama-index)\n", + " Downloading llama_index_cli-0.1.12-py3-none-any.whl (26 kB)\n", + "Collecting llama-index-indices-managed-llama-cloud<0.2.0,>=0.1.2 (from llama-index)\n", + " Downloading llama_index_indices_managed_llama_cloud-0.1.6-py3-none-any.whl (6.7 kB)\n", + "Collecting llama-index-legacy<0.10.0,>=0.9.48 (from llama-index)\n", + " Downloading llama_index_legacy-0.9.48-py3-none-any.whl (2.0 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.0/2.0 MB\u001b[0m \u001b[31m52.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting llama-index-llms-openai<0.2.0,>=0.1.13 (from llama-index)\n", + " Downloading llama_index_llms_openai-0.1.19-py3-none-any.whl (11 kB)\n", + "Collecting llama-index-multi-modal-llms-openai<0.2.0,>=0.1.3 (from llama-index)\n", + " Downloading llama_index_multi_modal_llms_openai-0.1.6-py3-none-any.whl (5.8 kB)\n", + "Collecting llama-index-program-openai<0.2.0,>=0.1.3 (from llama-index)\n", + " Downloading llama_index_program_openai-0.1.6-py3-none-any.whl (5.2 kB)\n", + "Collecting llama-index-question-gen-openai<0.2.0,>=0.1.2 (from llama-index)\n", + " Downloading llama_index_question_gen_openai-0.1.3-py3-none-any.whl (2.9 kB)\n", + "Collecting llama-index-readers-file<0.2.0,>=0.1.4 (from llama-index)\n", + " Downloading llama_index_readers_file-0.1.22-py3-none-any.whl (36 kB)\n", + "Collecting llama-index-readers-llama-parse<0.2.0,>=0.1.2 (from llama-index)\n", + " Downloading llama_index_readers_llama_parse-0.1.4-py3-none-any.whl (2.5 kB)\n", + "Requirement already satisfied: PyYAML>=6.0.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (6.0.1)\n", + "Requirement already satisfied: SQLAlchemy[asyncio]>=1.4.49 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (2.0.30)\n", + "Requirement already satisfied: aiohttp<4.0.0,>=3.8.6 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (3.9.5)\n", + "Collecting dataclasses-json (from llama-index-core)\n", + " Downloading dataclasses_json-0.6.6-py3-none-any.whl (28 kB)\n", + "Collecting deprecated>=1.2.9.3 (from llama-index-core)\n", + " Downloading Deprecated-1.2.14-py2.py3-none-any.whl (9.6 kB)\n", + "Collecting dirtyjson<2.0.0,>=1.0.8 (from 
llama-index-core)\n", + " Downloading dirtyjson-1.0.8-py3-none-any.whl (25 kB)\n", + "Requirement already satisfied: fsspec>=2023.5.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (2023.6.0)\n", + "Collecting httpx (from llama-index-core)\n", + " Downloading httpx-0.27.0-py3-none-any.whl (75 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m75.6/75.6 kB\u001b[0m \u001b[31m9.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting jsonpath-ng<2.0.0,>=1.6.0 (from llama-index-core)\n", + " Downloading jsonpath_ng-1.6.1-py3-none-any.whl (29 kB)\n", + "Collecting llamaindex-py-client<0.2.0,>=0.1.18 (from llama-index-core)\n", + " Downloading llamaindex_py_client-0.1.19-py3-none-any.whl (141 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m141.9/141.9 kB\u001b[0m \u001b[31m14.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: nest-asyncio<2.0.0,>=1.5.8 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (1.6.0)\n", + "Requirement already satisfied: networkx>=3.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (3.3)\n", + "Requirement already satisfied: nltk<4.0.0,>=3.8.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (3.8.1)\n", + "Requirement already satisfied: numpy in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (1.25.2)\n", + "Collecting openai>=1.1.0 (from llama-index-core)\n", + " Downloading openai-1.30.1-py3-none-any.whl (320 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m320.6/320.6 kB\u001b[0m \u001b[31m26.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: pandas in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (2.0.3)\n", + "Requirement already satisfied: pillow>=9.0.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (9.4.0)\n", + "Requirement already satisfied: requests>=2.31.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (2.31.0)\n", + "Requirement already satisfied: spacy<4.0.0,>=3.7.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (3.7.4)\n", + "Requirement already satisfied: tenacity<9.0.0,>=8.2.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (8.3.0)\n", + "Collecting tiktoken>=0.3.3 (from llama-index-core)\n", + " Downloading tiktoken-0.7.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.1 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.1/1.1 MB\u001b[0m \u001b[31m54.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: tqdm<5.0.0,>=4.66.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (4.66.4)\n", + "Requirement already satisfied: typing-extensions>=4.5.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (4.11.0)\n", + "Collecting typing-inspect>=0.8.0 (from llama-index-core)\n", + " Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB)\n", + "Requirement already satisfied: wrapt in /usr/local/lib/python3.10/dist-packages (from llama-index-core) (1.14.1)\n", + "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core) (1.3.1)\n", + "Requirement already satisfied: attrs>=17.3.0 in /usr/local/lib/python3.10/dist-packages (from 
aiohttp<4.0.0,>=3.8.6->llama-index-core) (23.2.0)\n", + "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core) (1.4.1)\n", + "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core) (6.0.5)\n", + "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core) (1.9.4)\n", + "Requirement already satisfied: async-timeout<5.0,>=4.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core) (4.0.3)\n", + "Collecting ply (from jsonpath-ng<2.0.0,>=1.6.0->llama-index-core)\n", + " Downloading ply-3.11-py2.py3-none-any.whl (49 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m49.6/49.6 kB\u001b[0m \u001b[31m5.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: beautifulsoup4<5.0.0,>=4.12.3 in /usr/local/lib/python3.10/dist-packages (from llama-index-readers-file<0.2.0,>=0.1.4->llama-index) (4.12.3)\n", + "Collecting pypdf<5.0.0,>=4.0.1 (from llama-index-readers-file<0.2.0,>=0.1.4->llama-index)\n", + " Downloading pypdf-4.2.0-py3-none-any.whl (290 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m290.4/290.4 kB\u001b[0m \u001b[31m27.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting striprtf<0.0.27,>=0.0.26 (from llama-index-readers-file<0.2.0,>=0.1.4->llama-index)\n", + " Downloading striprtf-0.0.26-py3-none-any.whl (6.9 kB)\n", + "Requirement already satisfied: pydantic>=1.10 in /usr/local/lib/python3.10/dist-packages (from llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core) (2.7.1)\n", + "Requirement already satisfied: anyio in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core) (3.7.1)\n", + "Requirement already satisfied: certifi in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core) (2024.2.2)\n", + "Collecting httpcore==1.* (from httpx->llama-index-core)\n", + " Downloading httpcore-1.0.5-py3-none-any.whl (77 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m77.9/77.9 kB\u001b[0m \u001b[31m10.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: idna in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core) (3.7)\n", + "Requirement already satisfied: sniffio in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core) (1.3.1)\n", + "Collecting h11<0.15,>=0.13 (from httpcore==1.*->httpx->llama-index-core)\n", + " Downloading h11-0.14.0-py3-none-any.whl (58 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m58.3/58.3 kB\u001b[0m \u001b[31m6.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: click in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core) (8.1.7)\n", + "Requirement already satisfied: joblib in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core) (1.4.2)\n", + "Requirement already satisfied: regex>=2021.8.3 in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core) (2023.12.25)\n", + "Requirement already satisfied: distro<2,>=1.7.0 in /usr/lib/python3/dist-packages (from openai>=1.1.0->llama-index-core) (1.7.0)\n", + "Requirement already 
satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests>=2.31.0->llama-index-core) (3.3.2)\n", + "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests>=2.31.0->llama-index-core) (2.0.7)\n", + "Requirement already satisfied: spacy-legacy<3.1.0,>=3.0.11 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (3.0.12)\n", + "Requirement already satisfied: spacy-loggers<2.0.0,>=1.0.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (1.0.5)\n", + "Requirement already satisfied: murmurhash<1.1.0,>=0.28.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (1.0.10)\n", + "Requirement already satisfied: cymem<2.1.0,>=2.0.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (2.0.8)\n", + "Requirement already satisfied: preshed<3.1.0,>=3.0.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (3.0.9)\n", + "Requirement already satisfied: thinc<8.3.0,>=8.2.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (8.2.3)\n", + "Requirement already satisfied: wasabi<1.2.0,>=0.9.1 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (1.1.2)\n", + "Requirement already satisfied: srsly<3.0.0,>=2.4.3 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (2.4.8)\n", + "Requirement already satisfied: catalogue<2.1.0,>=2.0.6 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (2.0.10)\n", + "Requirement already satisfied: weasel<0.4.0,>=0.1.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (0.3.4)\n", + "Requirement already satisfied: typer<0.10.0,>=0.3.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (0.9.4)\n", + "Requirement already satisfied: smart-open<7.0.0,>=5.2.1 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (6.4.0)\n", + "Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (3.1.4)\n", + "Requirement already satisfied: setuptools in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (67.7.2)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (24.0)\n", + "Requirement already satisfied: langcodes<4.0.0,>=3.2.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core) (3.4.0)\n", + "Requirement already satisfied: greenlet!=0.4.17 in /usr/local/lib/python3.10/dist-packages (from SQLAlchemy[asyncio]>=1.4.49->llama-index-core) (3.0.3)\n", + "Collecting mypy-extensions>=0.3.0 (from typing-inspect>=0.8.0->llama-index-core)\n", + " Downloading mypy_extensions-1.0.0-py3-none-any.whl (4.7 kB)\n", + "Collecting marshmallow<4.0.0,>=3.18.0 (from dataclasses-json->llama-index-core)\n", + " Downloading marshmallow-3.21.2-py3-none-any.whl (49 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m49.3/49.3 kB\u001b[0m \u001b[31m4.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: python-dateutil>=2.8.2 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core) (2.8.2)\n", + 
"Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core) (2023.4)\n", + "Requirement already satisfied: tzdata>=2022.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core) (2024.1)\n", + "Requirement already satisfied: exceptiongroup in /usr/local/lib/python3.10/dist-packages (from anyio->httpx->llama-index-core) (1.2.1)\n", + "Requirement already satisfied: soupsieve>1.2 in /usr/local/lib/python3.10/dist-packages (from beautifulsoup4<5.0.0,>=4.12.3->llama-index-readers-file<0.2.0,>=0.1.4->llama-index) (2.5)\n", + "Requirement already satisfied: language-data>=1.2 in /usr/local/lib/python3.10/dist-packages (from langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core) (1.2.0)\n", + "Requirement already satisfied: annotated-types>=0.4.0 in /usr/local/lib/python3.10/dist-packages (from pydantic>=1.10->llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core) (0.6.0)\n", + "Requirement already satisfied: pydantic-core==2.18.2 in /usr/local/lib/python3.10/dist-packages (from pydantic>=1.10->llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core) (2.18.2)\n", + "Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.10/dist-packages (from python-dateutil>=2.8.2->pandas->llama-index-core) (1.16.0)\n", + "Requirement already satisfied: blis<0.8.0,>=0.7.8 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core) (0.7.11)\n", + "Requirement already satisfied: confection<1.0.0,>=0.0.1 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core) (0.1.4)\n", + "Requirement already satisfied: cloudpathlib<0.17.0,>=0.7.0 in /usr/local/lib/python3.10/dist-packages (from weasel<0.4.0,>=0.1.0->spacy<4.0.0,>=3.7.1->llama-index-core) (0.16.0)\n", + "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/dist-packages (from jinja2->spacy<4.0.0,>=3.7.1->llama-index-core) (2.1.5)\n", + "Requirement already satisfied: marisa-trie>=0.7.7 in /usr/local/lib/python3.10/dist-packages (from language-data>=1.2->langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core) (1.1.1)\n", + "Installing collected packages: striprtf, ply, dirtyjson, pypdf, mypy-extensions, marshmallow, jsonpath-ng, h11, deprecated, typing-inspect, tiktoken, httpcore, httpx, dataclasses-json, openai, llamaindex-py-client, llama-index-legacy, llama-index-core, llama-parse, llama-index-readers-file, llama-index-llms-openai, llama-index-indices-managed-llama-cloud, llama-index-embeddings-openai, llama-index-readers-llama-parse, llama-index-multi-modal-llms-openai, llama-index-cli, llama-index-agent-openai, llama-index-program-openai, llama-index-question-gen-openai, llama-index\n", + "Successfully installed dataclasses-json-0.6.6 deprecated-1.2.14 dirtyjson-1.0.8 h11-0.14.0 httpcore-1.0.5 httpx-0.27.0 jsonpath-ng-1.6.1 llama-index-0.10.37 llama-index-agent-openai-0.2.5 llama-index-cli-0.1.12 llama-index-core-0.10.37.post1 llama-index-embeddings-openai-0.1.9 llama-index-indices-managed-llama-cloud-0.1.6 llama-index-legacy-0.9.48 llama-index-llms-openai-0.1.19 llama-index-multi-modal-llms-openai-0.1.6 llama-index-program-openai-0.1.6 llama-index-question-gen-openai-0.1.3 llama-index-readers-file-0.1.22 llama-index-readers-llama-parse-0.1.4 llama-parse-0.4.3 llamaindex-py-client-0.1.19 marshmallow-3.21.2 mypy-extensions-1.0.0 openai-1.30.1 ply-3.11 pypdf-4.2.0 striprtf-0.0.26 tiktoken-0.7.0 typing-inspect-0.9.0\n", + "Collecting 
llama-index-postprocessor-flag-embedding-reranker\n", + " Downloading llama_index_postprocessor_flag_embedding_reranker-0.1.3-py3-none-any.whl (3.0 kB)\n", + "Requirement already satisfied: llama-index-core<0.11.0,>=0.10.35 in /usr/local/lib/python3.10/dist-packages (from llama-index-postprocessor-flag-embedding-reranker) (0.10.37.post1)\n", + "Requirement already satisfied: PyYAML>=6.0.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (6.0.1)\n", + "Requirement already satisfied: SQLAlchemy[asyncio]>=1.4.49 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.0.30)\n", + "Requirement already satisfied: aiohttp<4.0.0,>=3.8.6 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.9.5)\n", + "Requirement already satisfied: dataclasses-json in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.6.6)\n", + "Requirement already satisfied: deprecated>=1.2.9.3 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.2.14)\n", + "Requirement already satisfied: dirtyjson<2.0.0,>=1.0.8 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.0.8)\n", + "Requirement already satisfied: fsspec>=2023.5.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2023.6.0)\n", + "Requirement already satisfied: httpx in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.27.0)\n", + "Requirement already satisfied: jsonpath-ng<2.0.0,>=1.6.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.6.1)\n", + "Requirement already satisfied: llamaindex-py-client<0.2.0,>=0.1.18 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.1.19)\n", + "Requirement already satisfied: nest-asyncio<2.0.0,>=1.5.8 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.6.0)\n", + "Requirement already satisfied: networkx>=3.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.3)\n", + "Requirement already satisfied: nltk<4.0.0,>=3.8.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.8.1)\n", + "Requirement already satisfied: numpy in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.25.2)\n", + "Requirement already satisfied: openai>=1.1.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.30.1)\n", + "Requirement already satisfied: pandas in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.0.3)\n", + 
"Requirement already satisfied: pillow>=9.0.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (9.4.0)\n", + "Requirement already satisfied: requests>=2.31.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.31.0)\n", + "Requirement already satisfied: spacy<4.0.0,>=3.7.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.7.4)\n", + "Requirement already satisfied: tenacity<9.0.0,>=8.2.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (8.3.0)\n", + "Requirement already satisfied: tiktoken>=0.3.3 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.7.0)\n", + "Requirement already satisfied: tqdm<5.0.0,>=4.66.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (4.66.4)\n", + "Requirement already satisfied: typing-extensions>=4.5.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (4.11.0)\n", + "Requirement already satisfied: typing-inspect>=0.8.0 in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.9.0)\n", + "Requirement already satisfied: wrapt in /usr/local/lib/python3.10/dist-packages (from llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.14.1)\n", + "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.3.1)\n", + "Requirement already satisfied: attrs>=17.3.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (23.2.0)\n", + "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.4.1)\n", + "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (6.0.5)\n", + "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.9.4)\n", + "Requirement already satisfied: async-timeout<5.0,>=4.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp<4.0.0,>=3.8.6->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (4.0.3)\n", + "Requirement already satisfied: ply in /usr/local/lib/python3.10/dist-packages (from jsonpath-ng<2.0.0,>=1.6.0->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.11)\n", + "Requirement already satisfied: pydantic>=1.10 in /usr/local/lib/python3.10/dist-packages (from 
llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.7.1)\n", + "Requirement already satisfied: anyio in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.7.1)\n", + "Requirement already satisfied: certifi in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2024.2.2)\n", + "Requirement already satisfied: httpcore==1.* in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.0.5)\n", + "Requirement already satisfied: idna in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.7)\n", + "Requirement already satisfied: sniffio in /usr/local/lib/python3.10/dist-packages (from httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.3.1)\n", + "Requirement already satisfied: h11<0.15,>=0.13 in /usr/local/lib/python3.10/dist-packages (from httpcore==1.*->httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.14.0)\n", + "Requirement already satisfied: click in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (8.1.7)\n", + "Requirement already satisfied: joblib in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.4.2)\n", + "Requirement already satisfied: regex>=2021.8.3 in /usr/local/lib/python3.10/dist-packages (from nltk<4.0.0,>=3.8.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2023.12.25)\n", + "Requirement already satisfied: distro<2,>=1.7.0 in /usr/lib/python3/dist-packages (from openai>=1.1.0->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.7.0)\n", + "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests>=2.31.0->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.3.2)\n", + "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests>=2.31.0->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.0.7)\n", + "Requirement already satisfied: spacy-legacy<3.1.0,>=3.0.11 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.0.12)\n", + "Requirement already satisfied: spacy-loggers<2.0.0,>=1.0.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.0.5)\n", + "Requirement already satisfied: murmurhash<1.1.0,>=0.28.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.0.10)\n", + "Requirement already satisfied: cymem<2.1.0,>=2.0.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.0.8)\n", + "Requirement already 
satisfied: preshed<3.1.0,>=3.0.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.0.9)\n", + "Requirement already satisfied: thinc<8.3.0,>=8.2.2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (8.2.3)\n", + "Requirement already satisfied: wasabi<1.2.0,>=0.9.1 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.1.2)\n", + "Requirement already satisfied: srsly<3.0.0,>=2.4.3 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.4.8)\n", + "Requirement already satisfied: catalogue<2.1.0,>=2.0.6 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.0.10)\n", + "Requirement already satisfied: weasel<0.4.0,>=0.1.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.3.4)\n", + "Requirement already satisfied: typer<0.10.0,>=0.3.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.9.4)\n", + "Requirement already satisfied: smart-open<7.0.0,>=5.2.1 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (6.4.0)\n", + "Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.1.4)\n", + "Requirement already satisfied: setuptools in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (67.7.2)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (24.0)\n", + "Requirement already satisfied: langcodes<4.0.0,>=3.2.0 in /usr/local/lib/python3.10/dist-packages (from spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.4.0)\n", + "Requirement already satisfied: greenlet!=0.4.17 in /usr/local/lib/python3.10/dist-packages (from SQLAlchemy[asyncio]>=1.4.49->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.0.3)\n", + "Requirement already satisfied: mypy-extensions>=0.3.0 in /usr/local/lib/python3.10/dist-packages (from typing-inspect>=0.8.0->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.0.0)\n", + "Requirement already satisfied: marshmallow<4.0.0,>=3.18.0 in /usr/local/lib/python3.10/dist-packages (from dataclasses-json->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (3.21.2)\n", + "Requirement already satisfied: python-dateutil>=2.8.2 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.8.2)\n", + "Requirement already 
satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2023.4)\n", + "Requirement already satisfied: tzdata>=2022.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2024.1)\n", + "Requirement already satisfied: exceptiongroup in /usr/local/lib/python3.10/dist-packages (from anyio->httpx->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.2.1)\n", + "Requirement already satisfied: language-data>=1.2 in /usr/local/lib/python3.10/dist-packages (from langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.2.0)\n", + "Requirement already satisfied: annotated-types>=0.4.0 in /usr/local/lib/python3.10/dist-packages (from pydantic>=1.10->llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.6.0)\n", + "Requirement already satisfied: pydantic-core==2.18.2 in /usr/local/lib/python3.10/dist-packages (from pydantic>=1.10->llamaindex-py-client<0.2.0,>=0.1.18->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.18.2)\n", + "Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.10/dist-packages (from python-dateutil>=2.8.2->pandas->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.16.0)\n", + "Requirement already satisfied: blis<0.8.0,>=0.7.8 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.7.11)\n", + "Requirement already satisfied: confection<1.0.0,>=0.0.1 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.1.4)\n", + "Requirement already satisfied: cloudpathlib<0.17.0,>=0.7.0 in /usr/local/lib/python3.10/dist-packages (from weasel<0.4.0,>=0.1.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (0.16.0)\n", + "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/dist-packages (from jinja2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (2.1.5)\n", + "Requirement already satisfied: marisa-trie>=0.7.7 in /usr/local/lib/python3.10/dist-packages (from language-data>=1.2->langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.35->llama-index-postprocessor-flag-embedding-reranker) (1.1.1)\n", + "Installing collected packages: llama-index-postprocessor-flag-embedding-reranker\n", + "Successfully installed llama-index-postprocessor-flag-embedding-reranker-0.1.3\n", + "Collecting git+https://github.com/FlagOpen/FlagEmbedding.git\n", + " Cloning https://github.com/FlagOpen/FlagEmbedding.git to /tmp/pip-req-build-wmws0zv2\n", + " Running command git clone --filter=blob:none --quiet https://github.com/FlagOpen/FlagEmbedding.git /tmp/pip-req-build-wmws0zv2\n", + " Resolved https://github.com/FlagOpen/FlagEmbedding.git to commit 95b873d9ac923bca47436efeae39ca4559970210\n", + " Preparing metadata (setup.py) ... 
\u001b[?25l\u001b[?25hdone\n", + "Requirement already satisfied: torch>=1.6.0 in /usr/local/lib/python3.10/dist-packages (from FlagEmbedding==1.2.9) (2.2.1+cu121)\n", + "Requirement already satisfied: transformers>=4.33.0 in /usr/local/lib/python3.10/dist-packages (from FlagEmbedding==1.2.9) (4.40.2)\n", + "Collecting datasets (from FlagEmbedding==1.2.9)\n", + " Downloading datasets-2.19.1-py3-none-any.whl (542 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m542.0/542.0 kB\u001b[0m \u001b[31m6.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting accelerate>=0.20.1 (from FlagEmbedding==1.2.9)\n", + " Downloading accelerate-0.30.1-py3-none-any.whl (302 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m302.6/302.6 kB\u001b[0m \u001b[31m10.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting sentence_transformers (from FlagEmbedding==1.2.9)\n", + " Downloading sentence_transformers-2.7.0-py3-none-any.whl (171 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m171.5/171.5 kB\u001b[0m \u001b[31m9.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: numpy>=1.17 in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (1.25.2)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (24.0)\n", + "Requirement already satisfied: psutil in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (5.9.5)\n", + "Requirement already satisfied: pyyaml in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (6.0.1)\n", + "Requirement already satisfied: huggingface-hub in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (0.20.3)\n", + "Requirement already satisfied: safetensors>=0.3.1 in /usr/local/lib/python3.10/dist-packages (from accelerate>=0.20.1->FlagEmbedding==1.2.9) (0.4.3)\n", + "Requirement already satisfied: filelock in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (3.14.0)\n", + "Requirement already satisfied: typing-extensions>=4.8.0 in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (4.11.0)\n", + "Requirement already satisfied: sympy in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (1.12)\n", + "Requirement already satisfied: networkx in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (3.3)\n", + "Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (3.1.4)\n", + "Requirement already satisfied: fsspec in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (2023.6.0)\n", + "Collecting nvidia-cuda-nvrtc-cu12==12.1.105 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cuda_nvrtc_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (23.7 MB)\n", + "Collecting nvidia-cuda-runtime-cu12==12.1.105 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cuda_runtime_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (823 kB)\n", + "Collecting nvidia-cuda-cupti-cu12==12.1.105 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cuda_cupti_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (14.1 MB)\n", + 
"Collecting nvidia-cudnn-cu12==8.9.2.26 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cudnn_cu12-8.9.2.26-py3-none-manylinux1_x86_64.whl (731.7 MB)\n", + "Collecting nvidia-cublas-cu12==12.1.3.1 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cublas_cu12-12.1.3.1-py3-none-manylinux1_x86_64.whl (410.6 MB)\n", + "Collecting nvidia-cufft-cu12==11.0.2.54 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cufft_cu12-11.0.2.54-py3-none-manylinux1_x86_64.whl (121.6 MB)\n", + "Collecting nvidia-curand-cu12==10.3.2.106 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_curand_cu12-10.3.2.106-py3-none-manylinux1_x86_64.whl (56.5 MB)\n", + "Collecting nvidia-cusolver-cu12==11.4.5.107 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cusolver_cu12-11.4.5.107-py3-none-manylinux1_x86_64.whl (124.2 MB)\n", + "Collecting nvidia-cusparse-cu12==12.1.0.106 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_cusparse_cu12-12.1.0.106-py3-none-manylinux1_x86_64.whl (196.0 MB)\n", + "Collecting nvidia-nccl-cu12==2.19.3 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_nccl_cu12-2.19.3-py3-none-manylinux1_x86_64.whl (166.0 MB)\n", + "Collecting nvidia-nvtx-cu12==12.1.105 (from torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_nvtx_cu12-12.1.105-py3-none-manylinux1_x86_64.whl (99 kB)\n", + "Requirement already satisfied: triton==2.2.0 in /usr/local/lib/python3.10/dist-packages (from torch>=1.6.0->FlagEmbedding==1.2.9) (2.2.0)\n", + "Collecting nvidia-nvjitlink-cu12 (from nvidia-cusolver-cu12==11.4.5.107->torch>=1.6.0->FlagEmbedding==1.2.9)\n", + " Using cached nvidia_nvjitlink_cu12-12.4.127-py3-none-manylinux2014_x86_64.whl (21.1 MB)\n", + "Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.10/dist-packages (from transformers>=4.33.0->FlagEmbedding==1.2.9) (2023.12.25)\n", + "Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from transformers>=4.33.0->FlagEmbedding==1.2.9) (2.31.0)\n", + "Requirement already satisfied: tokenizers<0.20,>=0.19 in /usr/local/lib/python3.10/dist-packages (from transformers>=4.33.0->FlagEmbedding==1.2.9) (0.19.1)\n", + "Requirement already satisfied: tqdm>=4.27 in /usr/local/lib/python3.10/dist-packages (from transformers>=4.33.0->FlagEmbedding==1.2.9) (4.66.4)\n", + "Requirement already satisfied: pyarrow>=12.0.0 in /usr/local/lib/python3.10/dist-packages (from datasets->FlagEmbedding==1.2.9) (14.0.2)\n", + "Requirement already satisfied: pyarrow-hotfix in /usr/local/lib/python3.10/dist-packages (from datasets->FlagEmbedding==1.2.9) (0.6)\n", + "Collecting dill<0.3.9,>=0.3.0 (from datasets->FlagEmbedding==1.2.9)\n", + " Downloading dill-0.3.8-py3-none-any.whl (116 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m116.3/116.3 kB\u001b[0m \u001b[31m15.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: pandas in /usr/local/lib/python3.10/dist-packages (from datasets->FlagEmbedding==1.2.9) (2.0.3)\n", + "Collecting xxhash (from datasets->FlagEmbedding==1.2.9)\n", + " Downloading xxhash-3.4.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (194 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m194.1/194.1 kB\u001b[0m \u001b[31m11.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting multiprocess (from 
datasets->FlagEmbedding==1.2.9)\n", + " Downloading multiprocess-0.70.16-py310-none-any.whl (134 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m134.8/134.8 kB\u001b[0m \u001b[31m16.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: aiohttp in /usr/local/lib/python3.10/dist-packages (from datasets->FlagEmbedding==1.2.9) (3.9.5)\n", + "Collecting huggingface-hub (from accelerate>=0.20.1->FlagEmbedding==1.2.9)\n", + " Downloading huggingface_hub-0.23.0-py3-none-any.whl (401 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m401.2/401.2 kB\u001b[0m \u001b[31m14.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: scikit-learn in /usr/local/lib/python3.10/dist-packages (from sentence_transformers->FlagEmbedding==1.2.9) (1.2.2)\n", + "Requirement already satisfied: scipy in /usr/local/lib/python3.10/dist-packages (from sentence_transformers->FlagEmbedding==1.2.9) (1.11.4)\n", + "Requirement already satisfied: Pillow in /usr/local/lib/python3.10/dist-packages (from sentence_transformers->FlagEmbedding==1.2.9) (9.4.0)\n", + "Requirement already satisfied: aiosignal>=1.1.2 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (1.3.1)\n", + "Requirement already satisfied: attrs>=17.3.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (23.2.0)\n", + "Requirement already satisfied: frozenlist>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (1.4.1)\n", + "Requirement already satisfied: multidict<7.0,>=4.5 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (6.0.5)\n", + "Requirement already satisfied: yarl<2.0,>=1.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (1.9.4)\n", + "Requirement already satisfied: async-timeout<5.0,>=4.0 in /usr/local/lib/python3.10/dist-packages (from aiohttp->datasets->FlagEmbedding==1.2.9) (4.0.3)\n", + "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests->transformers>=4.33.0->FlagEmbedding==1.2.9) (3.3.2)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->transformers>=4.33.0->FlagEmbedding==1.2.9) (3.7)\n", + "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->transformers>=4.33.0->FlagEmbedding==1.2.9) (2.0.7)\n", + "Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->transformers>=4.33.0->FlagEmbedding==1.2.9) (2024.2.2)\n", + "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/dist-packages (from jinja2->torch>=1.6.0->FlagEmbedding==1.2.9) (2.1.5)\n", + "Requirement already satisfied: python-dateutil>=2.8.2 in /usr/local/lib/python3.10/dist-packages (from pandas->datasets->FlagEmbedding==1.2.9) (2.8.2)\n", + "Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-packages (from pandas->datasets->FlagEmbedding==1.2.9) (2023.4)\n", + "Requirement already satisfied: tzdata>=2022.1 in /usr/local/lib/python3.10/dist-packages (from pandas->datasets->FlagEmbedding==1.2.9) (2024.1)\n", + "Requirement already satisfied: joblib>=1.1.1 in /usr/local/lib/python3.10/dist-packages (from 
scikit-learn->sentence_transformers->FlagEmbedding==1.2.9) (1.4.2)\n", + "Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.10/dist-packages (from scikit-learn->sentence_transformers->FlagEmbedding==1.2.9) (3.5.0)\n", + "Requirement already satisfied: mpmath>=0.19 in /usr/local/lib/python3.10/dist-packages (from sympy->torch>=1.6.0->FlagEmbedding==1.2.9) (1.3.0)\n", + "Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.10/dist-packages (from python-dateutil>=2.8.2->pandas->datasets->FlagEmbedding==1.2.9) (1.16.0)\n", + "Building wheels for collected packages: FlagEmbedding\n", + " Building wheel for FlagEmbedding (setup.py) ... \u001b[?25l\u001b[?25hdone\n", + " Created wheel for FlagEmbedding: filename=FlagEmbedding-1.2.9-py3-none-any.whl size=165917 sha256=24688c17b3bc6214be93c7bef77d4b9baacde749336b83f600188a704c6d8cad\n", + " Stored in directory: /tmp/pip-ephem-wheel-cache-45wml86h/wheels/41/cf/a5/5dee96ed64e5aaffe5aa3d583828258fdefed9a305db6e7f48\n", + "Successfully built FlagEmbedding\n", + "Installing collected packages: xxhash, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufft-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, dill, nvidia-cusparse-cu12, nvidia-cudnn-cu12, multiprocess, huggingface-hub, nvidia-cusolver-cu12, datasets, sentence_transformers, accelerate, FlagEmbedding\n", + " Attempting uninstall: huggingface-hub\n", + " Found existing installation: huggingface-hub 0.20.3\n", + " Uninstalling huggingface-hub-0.20.3:\n", + " Successfully uninstalled huggingface-hub-0.20.3\n", + "Successfully installed FlagEmbedding-1.2.9 accelerate-0.30.1 datasets-2.19.1 dill-0.3.8 huggingface-hub-0.23.0 multiprocess-0.70.16 nvidia-cublas-cu12-12.1.3.1 nvidia-cuda-cupti-cu12-12.1.105 nvidia-cuda-nvrtc-cu12-12.1.105 nvidia-cuda-runtime-cu12-12.1.105 nvidia-cudnn-cu12-8.9.2.26 nvidia-cufft-cu12-11.0.2.54 nvidia-curand-cu12-10.3.2.106 nvidia-cusolver-cu12-11.4.5.107 nvidia-cusparse-cu12-12.1.0.106 nvidia-nccl-cu12-2.19.3 nvidia-nvjitlink-cu12-12.4.127 nvidia-nvtx-cu12-12.1.105 sentence_transformers-2.7.0 xxhash-3.4.1\n", + "Collecting llama-index-vector-stores-lancedb\n", + " Downloading llama_index_vector_stores_lancedb-0.1.3-py3-none-any.whl (4.1 kB)\n", + "Collecting lancedb<0.6.0,>=0.5.1 (from llama-index-vector-stores-lancedb)\n", + " Downloading lancedb-0.5.7-py3-none-any.whl (115 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m115.1/115.1 kB\u001b[0m \u001b[31m3.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hRequirement already satisfied: llama-index-core<0.11.0,>=0.10.1 in /usr/local/lib/python3.10/dist-packages (from llama-index-vector-stores-lancedb) (0.10.37.post1)\n", + "Collecting deprecation (from lancedb<0.6.0,>=0.5.1->llama-index-vector-stores-lancedb)\n", + " Downloading deprecation-2.1.0-py2.py3-none-any.whl (11 kB)\n", + "Collecting pylance==0.9.18 (from lancedb<0.6.0,>=0.5.1->llama-index-vector-stores-lancedb)\n", + " Downloading pylance-0.9.18-cp38-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (21.6 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m21.6/21.6 MB\u001b[0m \u001b[31m14.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hCollecting ratelimiter~=1.0 (from lancedb<0.6.0,>=0.5.1->llama-index-vector-stores-lancedb)\n", + " Downloading ratelimiter-1.2.0.post0-py3-none-any.whl (6.6 kB)\n", + "Collecting 
retry>=0.9.2 (from lancedb<0.6.0,>=0.5.1->llama-index-vector-stores-lancedb)\n", + "  ... (pip dependency-resolution log trimmed: the remaining transitive requirements were already satisfied or downloaded; see the 'Successfully installed' summary below) ...\n", + "Requirement already satisfied: python-dateutil>=2.8.2 in /usr/local/lib/python3.10/dist-packages (from 
pandas->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (2.8.2)\n", + "Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (2023.4)\n", + "Requirement already satisfied: tzdata>=2022.1 in /usr/local/lib/python3.10/dist-packages (from pandas->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (2024.1)\n", + "Requirement already satisfied: exceptiongroup in /usr/local/lib/python3.10/dist-packages (from anyio->httpx->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (1.2.1)\n", + "Requirement already satisfied: language-data>=1.2 in /usr/local/lib/python3.10/dist-packages (from langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (1.2.0)\n", + "Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.10/dist-packages (from python-dateutil>=2.8.2->pandas->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (1.16.0)\n", + "Requirement already satisfied: blis<0.8.0,>=0.7.8 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (0.7.11)\n", + "Requirement already satisfied: confection<1.0.0,>=0.0.1 in /usr/local/lib/python3.10/dist-packages (from thinc<8.3.0,>=8.2.2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (0.1.4)\n", + "Requirement already satisfied: cloudpathlib<0.17.0,>=0.7.0 in /usr/local/lib/python3.10/dist-packages (from weasel<0.4.0,>=0.1.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (0.16.0)\n", + "Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/dist-packages (from jinja2->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (2.1.5)\n", + "Requirement already satisfied: marisa-trie>=0.7.7 in /usr/local/lib/python3.10/dist-packages (from language-data>=1.2->langcodes<4.0.0,>=3.2.0->spacy<4.0.0,>=3.7.1->llama-index-core<0.11.0,>=0.10.1->llama-index-vector-stores-lancedb) (1.1.1)\n", + "Installing collected packages: ratelimiter, semver, py, overrides, deprecation, retry, pylance, lancedb, llama-index-vector-stores-lancedb\n", + "Successfully installed deprecation-2.1.0 lancedb-0.5.7 llama-index-vector-stores-lancedb-0.1.3 overrides-7.7.0 py-1.11.0 pylance-0.9.18 ratelimiter-1.2.0.post0 retry-0.9.2 semver-3.0.2\n" + ] + } + ], + "source": [ + "# install dependencies\n", + "%pip install llama-index llama-index-core llama-index-embeddings-openai llama-parse\n", + "%pip install llama-index-postprocessor-flag-embedding-reranker\n", + "%pip install git+https://github.com/FlagOpen/FlagEmbedding.git\n", + "%pip install llama-index-vector-stores-lancedb\n", + "%pip install --upgrade --quiet langchain langchain-community langchainhub langchain-openai langchain-chroma bs4 lancedb\n", + "%pip install unstructured" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "NqK1g8Bg7zlZ" + }, + "outputs": [], + "source": [ + "# llama-parse is async-first, running the async code in a notebook requires the use of nest_asyncio\n", + "import os\n", + "import nest_asyncio\n", + "\n", + "nest_asyncio.apply()\n", + "\n", + "# API access to llama-cloud\n", + "os.environ[\"LLAMA_CLOUD_API_KEY\"] = \"llx-...\"\n", + "# Using OpenAI API for embeddings/llms\n", + 
"os.environ[\"OPENAI_API_KEY\"] = \"sk-proj-...\"" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "4OmWRDtAKONC" + }, + "source": [ + "### Download the PDF (contains both tables & text)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "smCjT2FIj9Fo" + }, + "outputs": [], + "source": [ + "!wget 'https://raw.githubusercontent.com/run-llama/llama_index/main/docs/docs/examples/data/10q/uber_10q_march_2022.pdf' -O './uber_10q_march_2022.pdf'" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "1I--ouSiTGvj" + }, + "source": [ + "# 1. Langchain with Q&A on PDF" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "4ysMDhHiR2bG" + }, + "outputs": [], + "source": [ + "import bs4\n", + "from langchain import hub\n", + "from langchain_community.document_loaders import WebBaseLoader\n", + "from langchain_openai import ChatOpenAI\n", + "from langchain_community.document_loaders import PyPDFLoader\n", + "from langchain.vectorstores import LanceDB\n", + "from langchain_core.output_parsers import StrOutputParser\n", + "from langchain_core.runnables import RunnablePassthrough\n", + "from langchain_openai import OpenAIEmbeddings\n", + "from langchain_text_splitters import RecursiveCharacterTextSplitter" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 54 + }, + "id": "4emkzsCqSMTe", + "outputId": "d122f7f4-9072-4a4f-acf8-3d3b39755328" + }, + "outputs": [ + { + "data": { + "application/vnd.google.colaboratory.intrinsic+json": { + "type": "string" + }, + "text/plain": [ + "'The net loss value attributable to Uber Technologies, Inc. for the period was $5.9 billion, compared to $108 million in the same period the previous year. 
This represents a significant increase in net loss year-over-year.'" + ] + }, + "execution_count": 9, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "llm = ChatOpenAI(model=\"gpt-3.5-turbo-0125\")\n", + "\n", + "loader = PyPDFLoader(\"/content/uber_10q_march_2022.pdf\")\n", + "docs = loader.load()\n", + "text_splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)\n", + "splits = text_splitter.split_documents(docs)\n", + "vectorstore = LanceDB.from_documents(documents=splits, embedding=OpenAIEmbeddings())\n", + "\n", + "# Retrieve and generate using the relevant snippets of the blog.\n", + "retriever = vectorstore.as_retriever()\n", + "prompt = hub.pull(\"rlm/rag-prompt\")\n", + "\n", + "\n", + "def format_docs(docs):\n", + " return \"\\n\\n\".join(doc.page_content for doc in docs)\n", + "\n", + "\n", + "rag_chain = (\n", + " {\"context\": retriever | format_docs, \"question\": RunnablePassthrough()}\n", + " | prompt\n", + " | llm\n", + " | StrOutputParser()\n", + ")\n", + "\n", + "qa_langchain_query1 = (\n", + " \" what is the net loss value attributable to Uber compared to last year?\"\n", + ")\n", + "rag_chain.invoke(qa_langchain_query1)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 36 + }, + "id": "G4qR6GxTSMWO", + "outputId": "38fff0fe-377f-4b10-aa4f-362712e539d9" + }, + "outputs": [ + { + "data": { + "application/vnd.google.colaboratory.intrinsic+json": { + "type": "string" + }, + "text/plain": [ + "\"I don't know.\"" + ] + }, + "execution_count": 10, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "qa_langchain_query2 = \"how is the Cash paid for Income taxes, net of refunds from Supplemental disclosures of cash flow information?\"\n", + "rag_chain.invoke(qa_langchain_query2)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 36 + }, + "id": "b3DM-lCnSMZG", + "outputId": "4c97e6f3-d610-447f-ba4f-62715ae274a1" + }, + "outputs": [ + { + "data": { + "application/vnd.google.colaboratory.intrinsic+json": { + "type": "string" + }, + "text/plain": [ + "\"I don't have detailed charts of intangible assets, net as of December 31, 2021 and March 31, 2022.\"" + ] + }, + "execution_count": 11, + "metadata": {}, + "output_type": "execute_result" + } + ], + "source": [ + "qa_langchain_query3 = \"give me detailed charts of intangible assets, net as of December 31, 2021 and March 31, 2022\"\n", + "rag_chain.invoke(qa_langchain_query3)" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "pwZzEShGwwvT" + }, + "source": [ + "FOR QUERY 2 & QUERY 3 we are not getting the answer\n", + "\n", + "**LETS TRY LLAMAINDEX**" + ] + }, + { + "cell_type": "markdown", + "metadata": { + "id": "GVD2sPBEcRE3" + }, + "source": [ + "# 2 . 
Llamaindex with Q&A on PDF" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "H__qJIWYdmgY" + }, + "outputs": [], + "source": [ + "import textwrap\n", + "from llama_index.vector_stores.lancedb import LanceDBVectorStore\n", + "from llama_index.core import SimpleDirectoryReader, Document, StorageContext\n", + "from llama_index.core import VectorStoreIndex\n", + "from llama_index.llms.openai import OpenAI\n", + "from llama_index.embeddings.openai import OpenAIEmbedding\n", + "from llama_index.core import VectorStoreIndex\n", + "from llama_index.core import SimpleDirectoryReader\n", + "from llama_index.postprocessor.flag_embedding_reranker import (\n", + " FlagEmbeddingReranker,\n", + ")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "BCNNrAw9Dklk" + }, + "outputs": [], + "source": [ + "from llama_index.vector_stores.lancedb import LanceDBVectorStore\n", + "\n", + "vector_store_pdf = LanceDBVectorStore(uri=\"/tmp/lancedb_lamaindex\")" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 393, + "referenced_widgets": [ + "0ab0af38e5c54405a48fd40de7bfe606", + "17300f8bea7644f09907376fab719f92", + "f2bbc601684446b5be521833592a319f", + "326bb35286ba4d23a5e3b8b364c7a44f", + "d19be0dd23344eb2887605d88f80c142", + "2e3f44b38e6842a9bd9db4c5e6cc6bcf", + "738a133fb9204208913431c2a35fef5c", + "d407aa9ec48347d9b5536a69fe9f5dc7", + "d779e07ce42c4f088dea05f8fd22922f", + "1700dc0afbb14e4cb85da9eedfca8605", + "498b64a4f33046c9a65b5a5486376359", + "f7de0a6aa31c41de90f45e862fe4ce15", + "a034635cd3d94f8b89b56e6b972daa63", + "5bec1c5f11434bf481862e67f73403f0", + "bd254eea803c4e44b60783f05c5c1ca1", + "1bbdd18be2e74c7f8916c95a1e5cf511", + "0f2a76e74d74459182402f92fbb946c9", + "d3d91529e30a4d67881a782d7eb54060", + "b69deb8741ad47229438548949e2da42", + "f894d72c0e19494abb732b2db46ced98", + "6fe509c6157e4ddbb1cfcf319cf04fac", + "de859b9a94684d39ac8a7aef81058a3b", + "09f4df92faa844b28fef90c3620b09e1", + "d0a4ce2181c84af8b277058689a35641", + "faaff37bf60e4870a9cbebd20d3afc6e", + "f55e2cf0eb34482da1902adb652fca5a", + "55ee9cd902c147ae97c4deea101f25e8", + "f3f8c3dc8a404bd4af55b105f98267d7", + "e13aaa64b9c848f9a67f61295fb546fe", + "2a6b2dacce79419b9a2469b6362abca8", + "3c63c0a7eac54b0fb72775e3fb6ba891", + "61b79e900bfb4b849cfa740236cf6e44", + "57ae83a64b224117a6d01573d3437540", + "e9394568881e456c99758e52813428f6", + "8779121983404d48ab560c1e0d2300f9", + "48550e72c11c4da7985434d6df76df28", + "62c949dbc1714614b00254b3b3ee80c1", + "1c7cdfc1b5ec488dbb08e75b5c89dc0e", + "12eb26078d544f3abe72e91835cc14fc", + "77e3dff7c22343848b9ad097e47b16df", + "cb5a5f4ddfe14e37a6807a74005b01f3", + "9492f5081dd841099c26140b03ef791a", + "86d744b64d604e388d3aadfbbf84eb1c", + "5eda98429678436daa918640f2193bd9", + "2ff3c80449634437b0d88f9a87932b14", + "69a388720e4346e6a98e77c06acb1089", + "22407e9553af4c00a93aa9085ca10b67", + "edd1f2ad4cca4cb093f3d2ab09ebbe85", + "bfb0b78895c34af1abf9a8d669c24aeb", + "356acddfa87f4c1b808245a7880a2ff4", + "f8502cc7258f4efeb43d9e2b0b5e91b2", + "1d645f1f896140158cc30061c0e67080", + "14467a4613e74d819502803825959e5e", + "cdc69d3551bd4bb183d30361b164d0ac", + "17173ad4ef464e9fb4611c2c9c611736", + "edb9212d6a3540b19de36e895b6179e7", + "edada2634ba54b9290fe904edc9905f0", + "1447b1135b994f14866cd17a34124ff6", + "9b8c980b418146e992ec2e5cfde8bf6b", + "45793d371304439db504cb04d4913911", + "2b190420fb764d06bd468c2914773fa8", + 
"c4829c7eed1541129e4944ad7784b43d", + "74e2c433490840a0a4caa296bd3521f0", + "7d690fb215c64c7a81cdce7156456ca2", + "cd48bd077b9c4a7b822a79fa66b65031", + "b246fdb13dc84c02b266565bb75e621f" + ] + }, + "id": "1BLK8QPhcyMh", + "outputId": "fd0df920-77f7-453f-8b06-3cb35e054467" + }, + "outputs": [ + { + "name": "stderr", + "output_type": "stream", + "text": [ + "/usr/local/lib/python3.10/dist-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.\n", + " warnings.warn(\n", + "/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_token.py:89: UserWarning: \n", + "The secret `HF_TOKEN` does not exist in your Colab secrets.\n", + "To authenticate with the Hugging Face Hub, create a token in your settings tab (https://huggingface.co/settings/tokens), set it as secret in your Google Colab and restart your session.\n", + "You will be able to reuse this secret in all of your notebooks.\n", + "Please note that authentication is recommended but still optional to access public models or datasets.\n", + " warnings.warn(\n" + ] + }, + { + "data": { + "application/vnd.jupyter.widget-view+json": { + "model_id": "0ab0af38e5c54405a48fd40de7bfe606", + "version_major": 2, + "version_minor": 0 + }, + "text/plain": [ + "tokenizer_config.json: 0%| | 0.00/443 [00:00" + ], + "text/plain": [ + "" + ] + }, + "metadata": {}, + "output_type": "display_data" + }, + { + "name": "stdout", + "output_type": "stream", + "text": [ + "Debug output: {'agent': {'messages': [AIMessage(content='The PM Gati Shakti National Master Plan (NMP) is an ambitious initiative launched by the Government of India aimed at improving infrastructure development across the country. Launched by Prime Minister Narendra Modi, the plan seeks to integrate the planning and coordination of various infrastructure projects across different sectors and ministries.\\n\\nThe core objective of the Gati Shakti NMP is to enhance multi-modal connectivity and reduce logistics costs by bringing together rail, road, air, and waterways projects under a single, unified framework. This holistic approach is intended to boost economic growth, create jobs, and promote regional connectivity.\\n\\nThe plan utilizes digital technology to map and synchronize projects, ensuring that all related departments and stakeholders are aligned, which helps in eliminating bottlenecks, improving project execution speed, and enhancing overall efficiency. The Gati Shakti NMP is seen as a transformative step towards making India a global manufacturing hub and improving the ease of doing business.', response_metadata={'finish_reason': 'stop'}, id='run-8bc90fb4-007b-4384-9320-35c3621eb9b8-0')]}}\n", + "Extracted content: The PM Gati Shakti National Master Plan (NMP) is an ambitious initiative launched by the Government of India aimed at improving infrastructure development across the country. Launched by Prime Minister Narendra Modi, the plan seeks to integrate the planning and coordination of various infrastructure projects across different sectors and ministries.\n", + "\n", + "The core objective of the Gati Shakti NMP is to enhance multi-modal connectivity and reduce logistics costs by bringing together rail, road, air, and waterways projects under a single, unified framework. 
This holistic approach is intended to boost economic growth, create jobs, and promote regional connectivity.\n", + "\n", + "The plan utilizes digital technology to map and synchronize projects, ensuring that all related departments and stakeholders are aligned, which helps in eliminating bottlenecks, improving project execution speed, and enhancing overall efficiency. The Gati Shakti NMP is seen as a transformative step towards making India a global manufacturing hub and improving the ease of doing business.\n", + "Debug output: {'agent': {'messages': [AIMessage(content='', additional_kwargs={'tool_calls': [{'index': 0, 'id': 'call_AE3GmC26FUSF63Nb4NhJCSMb', 'function': {'arguments': '{\"query\": \"steps for export import procedure\"}', 'name': 'retrieve_blog_posts'}, 'type': 'function'}, {'index': 1, 'id': 'call_B56lmGd2JIHRDCdxf9Um9Ml7', 'function': {'arguments': '{\"query\": \"customs import export procedure\"}', 'name': 'retrieve_blog_posts'}, 'type': 'function'}]}, response_metadata={'finish_reason': 'tool_calls'}, id='run-2c42f15d-390b-4509-8b23-fbc6d1fb203d-0', tool_calls=[{'name': 'retrieve_blog_posts', 'args': {'query': 'steps for export import procedure'}, 'id': 'call_AE3GmC26FUSF63Nb4NhJCSMb'}, {'name': 'retrieve_blog_posts', 'args': {'query': 'customs import export procedure'}, 'id': 'call_B56lmGd2JIHRDCdxf9Um9Ml7'}])]}}\n", + "Extracted content: \n", + "Debug output: {'retrieve': {'messages': [ToolMessage(content='<Microsoft Word - CUSTOMS IMPORT EXPORT PROCEDURES _final_Admin\\n\\nendobj\\r\\n118 0 obj\\r\\n<>/F 4/A<>/StructParent 32>>\\r\\nendobj\\r\\n119 0 obj\\n\\nEP\\x12t1Ԗ�m�l1�PI�����ٲW$��`�B[C��\\x1e�2�R�ν7ȗ���C�3�a���\\x1fZ��U\\x7f��\\x10�5y<:', name='retrieve_blog_posts', id='8714e8e7-ed08-4452-8d1f-621f4f25af81', tool_call_id='call_AE3GmC26FUSF63Nb4NhJCSMb'), ToolMessage(content='<Microsoft Word - CUSTOMS IMPORT EXPORT PROCEDURES _final_Admin\\n\\nCdTa�x\\t&��A\\x0eb�1����\\x0errMM�q�\\x7f1Q2cҩ0de�˂���>�\\x1b\\x12�\\x19?�\\x0fa�L���Q\\n\\nt\\'�P��h\\x13��u\\u05eex����|^���(I�$�%\\x1f�Q%�.\\x19U�B��jY��\\x16�?\\x19\\x19�\\x0e�$��W�BԸ���\\x1ck��\\x19��l!K�\\x05�!z�\\x1dq\\x12�\\x1d�v��]�P\"\\x11ROC��\\x14', name='retrieve_blog_posts', id='8a56ed96-cd2a-4846-a60d-258e6430aa79', tool_call_id='call_B56lmGd2JIHRDCdxf9Um9Ml7')]}}\n", + "Debug output: {'generate': {'messages': [\"I don't know.\"]}}\n", + "Debug output: {'agent': {'messages': [AIMessage(content='The term \"RCMC\" stands for Registration Cum Membership Certificate. It is a certificate that is provided by the Export Promotion Councils (EPCs) or commodity boards in India. An RCMC is issued to exporters dealing in products registered with these agencies. Holding an RCMC is mandatory for exporters to avail benefits under the Foreign Trade Policy like duty drawback, concessions, and other support.\\n\\nHere are some key points about RCMC:\\n\\n1. **Purpose**: The RCMC is used to certify that an exporter is registered with the respective EPC and is eligible for various benefits under the export-import policy.\\n\\n2. **Validity**: Typically, an RCMC is valid for five years.\\n\\n3. **Application**: Exporters must apply for an RCMC with the relevant EPC that pertains to their main line of business. If the exporter wishes to export items that are not covered by any EPC, they can obtain an RCMC from the Federation of Indian Export Organisations (FIEO).\\n\\n4. 
**Benefits**: With an RCMC, exporters can participate in international trade fairs, get sponsorship for trade delegations, and access market development assistance among other benefits.\\n\\n5. **Renewal and Cancellation**: The certificate needs to be renewed upon expiry. It can also be cancelled or suspended if the holder fails to abide by the regulatory requirements.\\n\\nIf you need detailed information or specific guidance related to obtaining an RCMC, please let me know!', response_metadata={'finish_reason': 'stop'}, id='run-4fa5e544-510a-4140-984c-89dedd855e71-0')]}}\n", + "Extracted content: The term \"RCMC\" stands for Registration Cum Membership Certificate. It is a certificate that is provided by the Export Promotion Councils (EPCs) or commodity boards in India. An RCMC is issued to exporters dealing in products registered with these agencies. Holding an RCMC is mandatory for exporters to avail benefits under the Foreign Trade Policy like duty drawback, concessions, and other support.\n", + "\n", + "Here are some key points about RCMC:\n", + "\n", + "1. **Purpose**: The RCMC is used to certify that an exporter is registered with the respective EPC and is eligible for various benefits under the export-import policy.\n", + "\n", + "2. **Validity**: Typically, an RCMC is valid for five years.\n", + "\n", + "3. **Application**: Exporters must apply for an RCMC with the relevant EPC that pertains to their main line of business. If the exporter wishes to export items that are not covered by any EPC, they can obtain an RCMC from the Federation of Indian Export Organisations (FIEO).\n", + "\n", + "4. **Benefits**: With an RCMC, exporters can participate in international trade fairs, get sponsorship for trade delegations, and access market development assistance among other benefits.\n", + "\n", + "5. **Renewal and Cancellation**: The certificate needs to be renewed upon expiry. 
It can also be cancelled or suspended if the holder fails to abide by the regulatory requirements.\n", + "\n", + "If you need detailed information or specific guidance related to obtaining an RCMC, please let me know!\n" + ] + } + ], + "source": [ + "\n", + "# Function to set environment variables securely\n", + "def _set_env(key: str):\n", + " if key not in os.environ:\n", + " os.environ[key] = getpass.getpass(f\"{key}:\")\n", + "\n", + "_set_env(\"OPENAI_API_KEY\")\n", + "\n", + "# (Optional) For tracing\n", + "os.environ[\"LANGCHAIN_TRACING_V2\"] = \"False\"\n", + "_set_env(\"LANGCHAIN_API_KEY\")\n", + "\n", + "\n", + "# upload the data based on your usecase\n", + "\n", + "urls = [\n", + " 'https://content.dgft.gov.in/Website/CIEP.pdf',\n", + " 'https://content.dgft.gov.in/Website/GAE.pdf',\n", + " 'https://content.dgft.gov.in/Website/HTE.pdf',\n", + "]\n", + "\n", + "\n", + "docs = [WebBaseLoader(url).load() for url in urls]\n", + "docs_list = [item for sublist in docs for item in sublist]\n", + "\n", + "text_splitter = RecursiveCharacterTextSplitter.from_tiktoken_encoder(\n", + " chunk_size=100, chunk_overlap=50\n", + ")\n", + "doc_splits = text_splitter.split_documents(docs_list)\n", + "\n", + "# Add to lancedb as vectordb\n", + "\n", + "vectorstore = LanceDB.from_documents(\n", + " documents=doc_splits,\n", + " embedding=OpenAIEmbeddings(),\n", + ")\n", + "retriever = vectorstore.as_retriever()\n", + "\n", + "\n", + "# create the tools\n", + "retriever_tool = create_retriever_tool(\n", + " retriever,\n", + " \"retrieve_blog_posts\",\n", + " \"Search and return information about customs import export procedure,GST & EXPORTS , How to export\",\n", + ")\n", + "\n", + "tools = [retriever_tool]\n", + "tool_executor = ToolExecutor(tools)\n", + "\n", + "\n", + "\n", + "class AgentState(TypedDict):\n", + " messages: Annotated[Sequence[BaseMessage], add_messages]\n", + "\n", + "def grade_documents(state) -> Literal[\"generate\", \"rewrite\"]:\n", + " class grade(BaseModel):\n", + " binary_score: str = Field(description=\"Relevance score 'yes' or 'no'\")\n", + "\n", + " model = ChatOpenAI(temperature=0, model=\"gpt-4-0125-preview\", streaming=True)\n", + " llm_with_tool = model.with_structured_output(grade)\n", + " prompt = PromptTemplate(\n", + " template=\"\"\"You are a grader assessing relevance of a retrieved document to a user question. \\n\n", + " Here is the retrieved document: \\n\\n {context} \\n\\n\n", + " Here is the user question: {question} \\n\n", + " If the document contains keyword(s) or semantic meaning related to the user question, grade it as relevant. 
\\n\n", + " Give a binary score 'yes' or 'no' score to indicate whether the document is relevant to the question.\"\"\",\n", + " input_variables=[\"context\", \"question\"],\n", + " )\n", + " chain = prompt | llm_with_tool\n", + "\n", + " messages = state[\"messages\"]\n", + " last_message = messages[-1]\n", + " question = messages[0].content\n", + " docs = last_message.content\n", + "\n", + " scored_result = chain.invoke({\"question\": question, \"context\": docs})\n", + " score = scored_result.binary_score\n", + "\n", + " return \"generate\" if score == \"yes\" else \"rewrite\"\n", + "\n", + "def agent(state):\n", + " messages = state[\"messages\"]\n", + " model = ChatOpenAI(temperature=0, streaming=True, model=\"gpt-4-turbo\")\n", + " model = model.bind_tools(tools)\n", + " response = model.invoke(messages)\n", + " return {\"messages\": [response]}\n", + "\n", + "def rewrite(state):\n", + " messages = state[\"messages\"]\n", + " question = messages[0].content\n", + " msg = [\n", + " HumanMessage(\n", + " content=f\"\"\" \\n\n", + " Look at the input and try to reason about the underlying semantic intent / meaning. \\n\n", + " Here is the initial question:\n", + " \\n ------- \\n\n", + " {question}\n", + " \\n ------- \\n\n", + " Formulate an improved question: \"\"\",\n", + " )\n", + " ]\n", + " model = ChatOpenAI(temperature=0, model=\"gpt-4-0125-preview\", streaming=True)\n", + " response = model.invoke(msg)\n", + " return {\"messages\": [response]}\n", + "\n", + "def generate(state):\n", + " messages = state[\"messages\"]\n", + " question = messages[0].content\n", + " last_message = messages[-1]\n", + " docs = last_message.content\n", + "\n", + " prompt = hub.pull(\"rlm/rag-prompt\")\n", + " llm = ChatOpenAI(model_name=\"gpt-3.5-turbo\", temperature=0, streaming=True)\n", + "\n", + " def format_docs(docs):\n", + " return \"\\n\\n\".join(doc.page_content for doc in docs)\n", + "\n", + " rag_chain = prompt | llm | StrOutputParser()\n", + " response = rag_chain.invoke({\"context\": docs, \"question\": question})\n", + " return {\"messages\": [response]}\n", + "\n", + "workflow = StateGraph(AgentState)\n", + "workflow.add_node(\"agent\", agent)\n", + "retrieve = ToolNode([retriever_tool])\n", + "workflow.add_node(\"retrieve\", retrieve)\n", + "workflow.add_node(\"rewrite\", rewrite)\n", + "workflow.add_node(\"generate\", generate)\n", + "workflow.set_entry_point(\"agent\")\n", + "workflow.add_conditional_edges(\"agent\", tools_condition, {\"tools\": \"retrieve\", END: END})\n", + "workflow.add_conditional_edges(\"retrieve\", grade_documents)\n", + "workflow.add_edge(\"generate\", END)\n", + "workflow.add_edge(\"rewrite\", \"agent\")\n", + "graph = workflow.compile()\n", + "\n", + "\n", + "def process_message(user_message):\n", + " inputs = {\n", + " \"messages\": [(\"user\", user_message)]\n", + " }\n", + " content_output = None\n", + " for output in graph.stream(inputs):\n", + " print(f\"Debug output: {output}\") # Debugging line to print the output\n", + " if 'agent' in output and 'messages' in output['agent']:\n", + " messages = output['agent']['messages']\n", + " if messages and hasattr(messages[0], 'content'):\n", + " content_output = messages[0].content # Accessing attribute directly\n", + " print(f\"Extracted content: {content_output}\") # Print extracted content\n", + " return content_output if content_output else \"No relevant output found.\"\n", + "\n", + "\n", + "# Define example questions to guide the user\n", + "example_questions = [\n", + "\"explain me in short what is PM 
Gati Shakti National Master Plan (NMP)?\"\n", + "\n", + "]\n", + "\n", + "# Create a Gradio interface\n", + "iface = gr.Interface(\n", + " fn=process_message,\n", + " inputs=\"text\",\n", + " outputs=\"text\",\n", + " title=\"Agentic RAG \",\n", + " description=\"Enter a message to query related to export import .\",\n", + " examples=example_questions,\n", + ")\n", + "\n", + "# Launch the Gradio app\n", + "iface.launch(debug=True)\n", + "\n", + "\n", + "\n" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "JjmUVVn1TdH0" + }, + "outputs": [], + "source": [] + }, + { + "cell_type": "markdown", + "source": [ + "#some quetions for testing\n", + "explain me in short what is PM Gati Shakti National Master Plan (NMP)?\n", + "\n", + "what is Zero Rating of Exports?\n", + "\n", + "what is Export Inspection Council of India?\n", + "\n", + "please give us some Details of some of the major initiatives /schemes please ?" + ], + "metadata": { + "id": "c921cw61mPdh" + } + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "k6Tq9E1Lqwtj" + }, + "outputs": [], + "source": [] + } + ], + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "display_name": "Python 3", + "name": "python3" + }, + "language_info": { + "name": "python" + } + }, + "nbformat": 4, + "nbformat_minor": 0 +} \ No newline at end of file diff --git a/tutorials/cohere-reranker/README.md b/tutorials/cohere-reranker/README.md index 6f5802bf..c2c8fe75 100644 --- a/tutorials/cohere-reranker/README.md +++ b/tutorials/cohere-reranker/README.md @@ -1,8 +1,8 @@ -Code for "Benchmarking Cohere Rerankers with LanceDB" +# Benchmarking Cohere Rerankers with LanceDB Screenshot-2024-05-06-at-6 06 30-PM -### [Read the blog](blog.lancedb.com) +### [Read the blog](https://blog.lancedb.com/benchmarking-cohere-reranker-with-lancedb/) ## Setup ```