feat(plugins): ai-proxy-plugin #12323

tysoekong · 2024-01-09T22:32:01Z

Summary

This commit transforms and proxies requests to a number of AI / LLM providers and models.

It adds a plugin that takes requests in one of a few defined and standardised formats, translates them to the configured target format, and then transforms the response back into a standard format.

The request/response formats are based on OpenAI, and conform to the JSON schema in the fixtures

There is a sample API specification to describe the suported formats also

The plugin controls:

The backend URL (unless it's a self-hosted model, in which case set config.model.options.upstream_url)
The API key insertion
The request/response body transformation
Capturing and storing metrics from the upstream responses, into normalised kong.log entries, which will then output via any configured logging platform e.g. file-log, kafka-log...

The reasoning for trying to flatten all provider formats, is that it allows standardised manipulation of the data before and after transmission.

It currently supports v1/chat and v1/completion style requests for all available providers. Each provider implementation class is in its own module under kong.plugins.ai-proxy.drivers.

You can also set mode preserve to pass through, and the plugin will perform a best-effort to conform to the chosen config.model.provider, based on the configured header patterns and values. This may need re-thinking, please provide input on this.

This implementation only supports REST-based full text responses, but streaming support could be added based upon adoption, as with Kong >3.0 we can now process WebSocket frames.

Ideal documentation (docs.konghq.com) is being finalised internally, will be ready soon.

Checklist

The Pull Request has tests
A changelog file has been created under changelog/unreleased/kong or skip-changelog label added on PR if changelog is unnecessary. README.md
There is a user-facing docs PR against https://github.com/Kong/docs.konghq.com - PUT DOCS PR HERE

tysoekong · 2024-01-10T22:54:56Z

@fffonion Anything else for me to do here?

locao

This is great! 🚀

None of my comments are real blockers to merge this PR, just some suggestions.

kong/llm/drivers/anthropic.lua

kong/llm/drivers/azure.lua

locao · 2024-01-17T20:22:16Z

kong/llm/drivers/cohere.lua

+
+  if type(body) == "table" then
+    body_string, err = cjson.encode(body)
+    if err then return nil, nil, "failed to parse body to json: " .. err end


Suggested change

if err then return nil, nil, "failed to parse body to json: " .. err end

if err then

return nil, nil, "failed to parse body to json: " .. err

end

kong/llm/drivers/anthropic.lua

kong/llm/drivers/openai.lua

kong/llm/init.lua

locao · 2024-01-17T21:03:04Z

kong/llm/init.lua

+
+local _M = {}
+
+local auth_schema = {


The schemas could go to their own file to keep this file cleaner.

kong/llm/init.lua

team-gateway-bot · 2024-01-19T19:16:25Z

Successfully created cherry-pick PR for master:

https://github.com/kong/kong-ee/pull/7872

tysoekong requested review from fffonion and flrgh January 9, 2024 22:32

pull-request-size bot added the size/XXL label Jan 9, 2024

github-actions bot assigned tysoekong Jan 9, 2024

github-actions bot added chore Not part of the core functionality of kong, but still needed schema-change-noteworthy labels Jan 9, 2024

tysoekong force-pushed the feat/ai_proxy_plugin branch 2 times, most recently from a074d1c to d9b303e Compare January 9, 2024 22:50

flrgh mentioned this pull request Jan 9, 2024

feat(plugin): ai-proxy plugin #12207

Closed

3 tasks

flrgh approved these changes Jan 9, 2024

View reviewed changes

tysoekong force-pushed the feat/ai_proxy_plugin branch from 2fc22cd to 3b1216c Compare January 10, 2024 22:54

tysoekong force-pushed the feat/ai_proxy_plugin branch 3 times, most recently from c634c5c to 7d6b50a Compare January 16, 2024 21:24

RobSerafini requested a review from a team January 17, 2024 16:28

tysoekong force-pushed the feat/ai_proxy_plugin branch 2 times, most recently from 4b3f822 to 949a161 Compare January 17, 2024 17:36

locao approved these changes Jan 17, 2024

View reviewed changes

tysoekong force-pushed the feat/ai_proxy_plugin branch from 77b23dd to f5be8e6 Compare January 18, 2024 13:41

tysoekong removed the request for review from fffonion January 18, 2024 15:55

feat(plugins): ai-proxy plugin

a25535c

tysoekong force-pushed the feat/ai_proxy_plugin branch from f83b2e9 to a25535c Compare January 18, 2024 20:03

ttyS0e added 2 commits January 19, 2024 18:20

fix(ai-proxy): working azure provider

e612058

fix(ai-proxy): working azure provider

6df1ee2

flrgh added the cherry-pick kong-ee schedule this PR for cherry-picking to kong/kong-ee label Jan 19, 2024

Merge branch 'master' into feat/ai_proxy_plugin

ca51b80

locao merged commit 58fe2dd into master Jan 19, 2024
23 checks passed

locao deleted the feat/ai_proxy_plugin branch January 19, 2024 19:16

chobits mentioned this pull request Mar 4, 2024

ai-proxy buffers streamed responses #12680

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(plugins): ai-proxy-plugin #12323

feat(plugins): ai-proxy-plugin #12323

tysoekong commented Jan 9, 2024

tysoekong commented Jan 10, 2024

locao left a comment

locao Jan 17, 2024

locao Jan 17, 2024 •

edited

Loading

team-gateway-bot commented Jan 19, 2024


		local _M = {}

		local auth_schema = {

feat(plugins): ai-proxy-plugin #12323

feat(plugins): ai-proxy-plugin #12323

Conversation

tysoekong commented Jan 9, 2024

Summary

Checklist

tysoekong commented Jan 10, 2024

locao left a comment

Choose a reason for hiding this comment

locao Jan 17, 2024

Choose a reason for hiding this comment

locao Jan 17, 2024 • edited Loading

Choose a reason for hiding this comment

team-gateway-bot commented Jan 19, 2024

locao Jan 17, 2024 •

edited

Loading