Retrieval-Augmented Generation (RAG)

Retrieval-augmented generation (RAG) models are a smart way to add a context to the LLM (pre-trained large language models). They can help improve the quality of generated text by providing LLMs with context that comes from your own custom data. Using RAG leads to higher accuracy and better robustness in your text generation system.

In this notebook, I am using Covid-19 Clinical Trial data to enhance the relevance and quality of the generated text.

How do RAGs work

Typically RAG consist of two core modules: the retriever and the generator. The retriever searches for relevant information from the provided data. The generator produces the required content.

More specifically the process;

take the input query
using a sentence transformer (small language model) search for relevant information. Transformer models are one of the best performing in NLP.
Take the top-k (top 3 in this notebook) results
Pass the output of the query, together with the original query to the generator.

Installation

python3 -m venv .venv source .venv/bin/activate .venv/bin/pip install -r requirements.txt

Libraries

Pandas: data manipulation
qdrant-client: vector similariy seach engine and vector database
Sentence Transformers: framework to computer dense erctor represetnation for sentences, paragraphs and images. The models are based on transformers like BERT / RoBERTa / XLM-RoBERTa.
Llamafile: Run LLM with a single file
OpenAI: connect to LLM from openAI

Data

I used the publicly available Coronavirus Clinical Trials Dataset from Kaggle. The dataset includes exclusion and inclusion criteria for covid-19 clinical trials that were downloaded from ct.gov.

Large and Small Language Models

The Retriever is using the multi-qa-MiniLM-L6-cos-v1 small language model. This model is great for diverse use-cases and is trained on a dataset of over 1 billion training pairs.

The Generator is using Phi-2 model from HuggingFace. There is a great blog in Microsoft Research about The pusrising power of small language models

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
data		data
images		images
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
Retrieval-Generation-Augmentation.code-workspace		Retrieval-Generation-Augmentation.code-workspace
data-load.ipynb		data-load.ipynb
embeddings.ipynb		embeddings.ipynb
model-download.py		model-download.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Retrieval-Augmented Generation (RAG)

How do RAGs work

Installation

Libraries

Data

Large and Small Language Models

About

Releases

Packages

Languages

License

necibeahat/Retrieval-Augmented-Generation

Folders and files

Latest commit

History

Repository files navigation

Retrieval-Augmented Generation (RAG)

How do RAGs work

Installation

Libraries

Data

Large and Small Language Models

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages