[FEATURE] Extend support for Bedrock beyond Anthropic to all other models #1495
Comments
Hi @austintlee @ylwu-amzn - I believe we can make this easier. If this looks useful (we're used in production), please let me know how we can help.

Usage

Bedrock request:

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "bedrock/anthropic.claude-instant-v1",
    "messages": [{"role": "user", "content": "Say this is a test!"}],
    "temperature": 0.7
  }'

gpt-3.5-turbo request:

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Say this is a test!"}],
    "temperature": 0.7
  }'

claude-2 request:

curl http://0.0.0.0:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-2",
    "messages": [{"role": "user", "content": "Say this is a test!"}],
    "temperature": 0.7
  }'
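For context on the "consistent input/output" point discussed below: whichever model the request targets, the response comes back in the same OpenAI chat-completions shape. The sketch below is illustrative only; the field values are not actual output from the proxy.

{
  "id": "chatcmpl-123",
  "object": "chat.completion",
  "model": "bedrock/anthropic.claude-instant-v1",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "This is a test!"},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 13, "completion_tokens": 5, "total_tokens": 18}
}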
@ishaan-jaff Yes, conceptually, something like this would be ideal. However, I am not sure about running a proxy. What I had in mind is to have this conversion layer in ml-commons itself. Are you familiar with OpenSearch? This might be a good use case for the OpenSearch Python extension (@dblock @dbwiddis). @ylwu-amzn I think of this as an all-in-one HttpConnector (but implemented in Python). It is certainly an interesting idea, and I think it would be easier for people to adopt if it were an extension (as in an extension of OpenSearch) rather than another thing people have to download, install, and manage. That said, the quickest way to put this in the hands of users is to have the HttpConnector talk to this proxy as-is.
@ishaan-jaff Thanks for sharing this. The consistent input/output feature is definitely helpful.
I agree with @austintlee: this proxy could be a layer between the OpenSearch HttpConnector and the LLM, adapting between the two. The other option is to support consistent input/output in ml-commons itself in a similar way, with some adaptation logic but no proxy process.
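To illustrate where that per-model adaptation logic lives today, here is a rough sketch of a Bedrock connector created through the ml-commons connector API. The model id, region, credentials, and request_body template are placeholders, and the exact blueprint fields may differ between ml-commons versions.

POST /_plugins/_ml/connectors/_create
{
  "name": "Amazon Bedrock connector (Claude Instant) - example",
  "description": "Sketch only: request_body is specific to one model family",
  "version": 1,
  "protocol": "aws_sigv4",
  "parameters": {
    "region": "us-east-1",
    "service_name": "bedrock"
  },
  "credential": {
    "access_key": "<aws_access_key>",
    "secret_key": "<aws_secret_key>"
  },
  "actions": [
    {
      "action_type": "predict",
      "method": "POST",
      "url": "https://bedrock-runtime.us-east-1.amazonaws.com/model/anthropic.claude-instant-v1/invoke",
      "request_body": "{\"prompt\":\"${parameters.inputs}\",\"max_tokens_to_sample\":300}"
    }
  ]
}

Each Bedrock model family needs its own request_body template and response mapping, which is exactly the adaptation logic being discussed; a consistent input/output layer (whether a proxy or code inside ml-commons) would centralize that.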
@austintlee @ylwu-amzn How can we be most helpful here? It sounds like there are two options:
The first option won't work because we don't have a language binding for Python, or for any language other than Java. The second option works out of the box (in theory). Maybe you can try out a few examples? One way to contribute, and to let people know about it, would be to write a blog post.
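A quick sketch of what that second option could look like, assuming the proxy from the earlier comment is running locally on port 8000: an HTTP-protocol connector pointed at the proxy's OpenAI-compatible endpoint. The field names and request_body substitutions here are illustrative and should be checked against the ml-commons connector documentation.

POST /_plugins/_ml/connectors/_create
{
  "name": "OpenAI-compatible proxy connector - example",
  "description": "Sketch only: HttpConnector talking to the proxy's /v1/chat/completions endpoint",
  "version": 1,
  "protocol": "http",
  "parameters": {
    "model": "bedrock/anthropic.claude-instant-v1"
  },
  "actions": [
    {
      "action_type": "predict",
      "method": "POST",
      "url": "http://0.0.0.0:8000/v1/chat/completions",
      "headers": {
        "Content-Type": "application/json"
      },
      "request_body": "{\"model\":\"${parameters.model}\",\"messages\":${parameters.messages},\"temperature\":0.7}"
    }
  ]
}

Because the proxy accepts and returns the same OpenAI format for every backend, switching models would only require changing the "model" parameter rather than the connector's request/response mapping.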
#2826 covers this.
Is your feature request related to a problem?
Although it's quite possible that the current implementation of RAG works with all other models supported by Bedrock, we need to ensure it works with more than just Anthropic Claude.