feat(rag): Auto-RAG #2301

gaocegege · 2024-04-10T08:01:29Z

/kind feature

Ref https://arxiv.org/pdf/2404.01037.pdf

Auto-RAG: The idea of automatically optimizing RAG systems, akin to Auto-ML’s approach
in traditional machine learning, presents a significant opportunity for future exploration. Currently, selecting the optimal configuration of RAG components — e.g., chunking strategies,
window sizes, and parameters within rerankers — relies on manual experimentation and intuition. An automated system could systematically explore a vast space of RAG configurations
and select the very best model (Markr.AI, 2024).

RAG involves several hyperparameters, e.g. chunking strategies and window sizes for sentence window retrieval. Tuning them should be done automatically.
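A rough sketch of what such a search space might look like. The parameter names (`chunk_size`, `chunk_overlap`, `window_size`) and their values are illustrative assumptions, not taken from the paper or from any library:

```python
from itertools import product

# Hypothetical search space: names and candidate values are assumptions.
SEARCH_SPACE = {
    "chunk_size": [256, 512, 1024],   # tokens per chunk
    "chunk_overlap": [0, 64],         # tokens shared by neighbouring chunks
    "window_size": [1, 3, 5],         # neighbours kept on each side of a hit
}


def chunk_text(tokens, chunk_size, chunk_overlap):
    """Split a token list into fixed-size chunks that overlap by chunk_overlap."""
    step = chunk_size - chunk_overlap
    return [tokens[i:i + chunk_size] for i in range(0, len(tokens), step)]


def sentence_window(sentences, index, window_size):
    """Return the sentence at index plus up to window_size neighbours per side."""
    start = max(0, index - window_size)
    return sentences[start:index + window_size + 1]


def enumerate_configs(space):
    """Expand the search space into one dict per configuration (plain grid)."""
    keys = list(space)
    return [dict(zip(keys, values)) for values in product(*(space[k] for k in keys))]
```

Even this small grid has 3 x 2 x 3 = 18 configurations, which is why picking one by hand gets impractical quickly.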

Love this feature? Give it a 👍 We prioritize the features with the most 👍

gaocegege · 2024-04-11T02:25:49Z

Maybe we could add an example to showcase how to use Katib and LlamaIndex for Auto-RAG.

Not sure if there is any new feature to be implemented.

gaocegege · 2024-04-11T02:27:05Z

Related: https://github.com/Marker-Inc-Korea/AutoRAG

tariq-hasan · 2024-04-11T05:34:01Z

Are you thinking of adding an example that uses the proposed tuning API for LLMs to demonstrate Auto-RAG?

gaocegege · 2024-04-11T05:48:21Z

@tariq-hasan It should work. But I do not have the bandwidth for it. I'm simply presenting the idea for consideration at this point.

andreyvelich · 2024-04-11T21:15:53Z

Thanks for creating this @gaocegege.
Are there any differences in optimizing these HPs for RAG (e.g. chunking strategies and window sizes) compared to our current optimization flow with Experiment -> Suggestion -> Trials?
I guess Trials can consume a prompt and produce the metrics.
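To make the question concrete, here is a minimal sketch of what a single Trial could run. Everything in it is a stand-in: the corpus, the evaluation pairs, the word-overlap retriever, and the `hit_rate` metric name are hypothetical, and it assumes the metric is reported as a `name=value` line on stdout for a metrics collector to pick up:

```python
# Toy corpus and evaluation set: stand-ins for a real dataset.
CORPUS = (
    "Katib runs hyperparameter tuning experiments on Kubernetes. "
    "Each experiment creates trials from suggestions. "
    "A trial trains or evaluates with one set of parameters. "
    "Metrics from each trial are reported back to the experiment."
)
EVAL_SET = [
    ("what creates trials", "suggestions"),
    ("where are metrics reported", "experiment"),
]


def chunk_words(text, chunk_size):
    """Split text into chunks of chunk_size words (no overlap)."""
    words = text.split()
    return [" ".join(words[i:i + chunk_size]) for i in range(0, len(words), chunk_size)]


def retrieve(query, chunks):
    """Return the chunk sharing the most words with the query (toy retriever)."""
    query_words = set(query.lower().split())
    return max(chunks, key=lambda c: len(query_words & set(c.lower().split())))


def run_trial(chunk_size):
    """Evaluate one hyperparameter setting and return the retrieval hit rate."""
    chunks = chunk_words(CORPUS, chunk_size)
    hits = sum(answer in retrieve(query, chunks).lower() for query, answer in EVAL_SET)
    return hits / len(EVAL_SET)


def format_metric(name, value):
    """Format a metric as a name=value line (assumed reporting format)."""
    return f"{name}={value}"


# In a real Trial, chunk_size would come from the Suggestion, e.g. via a CLI flag.
print(format_metric("hit_rate", run_trial(chunk_size=8)))
```

If this shape holds, the Trial is just a container that takes the suggested parameters as arguments, builds the index, runs the evaluation queries, and prints the metric, so nothing RAG-specific would be needed on the Experiment or Suggestion side.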

gaocegege · 2024-04-12T02:06:57Z

The workflow should be similar, I think. We could make a demo based on LlamaIndex to see if there is anything we are missing.

vkehfdl1 · 2024-07-09T04:22:16Z

Hi!
I'm the developer of AutoRAG.
Are you still interested in implementing AutoRAG or using it, or in making a demo for this?
We are open to any kind of collaboration.

andreyvelich · 2024-07-09T18:35:00Z

Nice to meet you @vkehfdl1!
Sure, that would be great. Maybe you can attend one of our upcoming AutoML and Training WG community calls to give a demo, and we can discuss how to collaborate.
cc @kubeflow/wg-training-leads

vkehfdl1 · 2024-07-10T00:47:40Z

Hi @andreyvelich, nice to meet you.

First, it will be hard for me to attend the community call today because of the timezone: it is at 2:00 a.m. here.
Maybe the other community call two weeks later, at 2:00 UTC, would work, or we can book another call.

Thanks!

andreyvelich · 2024-07-10T10:57:29Z

Sure, that sounds great! I added you to the meeting agenda on July 24th.

andreyvelich · 2024-07-24T13:52:35Z

Hi @vkehfdl1, just a reminder that our community call starts in 10 minutes, if you want to give an AutoRAG demo.

andreyvelich · 2024-08-21T14:16:26Z

/area llm

google-oss-prow bot added the kind/feature label Apr 10, 2024

google-oss-prow bot added the area/llm LLMs related content label Aug 21, 2024
