
feat(prometheus): add AI metrics and fix #9320 #13148

Merged
merged 29 commits into from
Jun 26, 2024

Conversation

AntoineJac
Contributor

@AntoineJac AntoineJac commented Jun 3, 2024

Summary

This PR:

  • adds AI metrics support to the Prometheus plugin
  • fixes a metric name that was missing the trailing "s" in "tokens"
  • simplifies the logic

The log format will be:

"ai": {
    "ai-proxy": {
      "cache": {
        "cache_status": "",
        "vector_db": "",
        "embeddings_provider": "",
        "embeddings_model": "",
      },
      "usage": {
        "total_tokens": 40,
        "prompt_tokens": 28,
        "completion_tokens": 12,
        "cost": 0.00104
      },
      "meta": {
        "azure_api_version": "2023-05-15",
        "request_model": "gpt-35-turbo",
        "provider_name": "azure",
        "azure_instance_id": "ai-proxy-regression",
        "response_model": "gpt-35-turbo",
        "plugin_id": "1fdf6955-6243-4298-8754-5142596c9a00",
        "azure_deployment_id": "kong-gpt-3-5"
      }
    }
  },

Prometheus metrics:

kong_ai_requests_total{ai_provider="azure",ai_model="gpt-35-turbo",cache_status="",vector_db="",embeddings_provider="",embeddings_model="",workspace="default"} 1

kong_ai_cost_total{ai_provider="azure",ai_model="gpt-35-turbo",cache_status="",vector_db="",embeddings_provider="",embeddings_model="",workspace="default"} 0.00104

kong_ai_tokens_total{ai_provider="azure",ai_model="gpt-35-turbo",cache_status="",vector_db="",embeddings_provider="",embeddings_model="",token_type="prompt_tokens",workspace="default"} 12
kong_ai_tokens_total{ai_provider="azure",ai_model="gpt-35-turbo",cache_status="",vector_db="",embeddings_provider="",embeddings_model="",token_type="completion_tokens",workspace="default"} 25
kong_ai_tokens_total{ai_provider="azure",ai_model="gpt-35-turbo",cache_status="",vector_db="",embeddings_provider="",embeddings_model="",token_type="total_tokens",workspace="default"} 37
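
As a rough illustration of how counters like these can be produced, here is a minimal Lua sketch assuming the nginx-lua-prometheus style API (prometheus:counter(name, description, labels) and counter:inc(value, label_values)); the metric registration and the log_ai_metrics helper below are hypothetical and only approximate what the exporter does.

-- Minimal sketch, assuming the nginx-lua-prometheus style API;
-- not the exporter code from this PR.
local prometheus = require("prometheus").init("prometheus_metrics")

local ai_labels = { "ai_provider", "ai_model", "cache_status", "vector_db",
                    "embeddings_provider", "embeddings_model", "workspace" }
local token_labels = { "ai_provider", "ai_model", "cache_status", "vector_db",
                       "embeddings_provider", "embeddings_model", "token_type", "workspace" }

local ai_requests = prometheus:counter("kong_ai_requests_total",
  "AI requests proxied, per provider and model", ai_labels)
local ai_cost = prometheus:counter("kong_ai_cost_total",
  "AI request cost, per provider and model", ai_labels)
local ai_tokens = prometheus:counter("kong_ai_tokens_total",
  "AI tokens used, per provider, model and token type", token_labels)

-- Hypothetical helper: called once per logged AI request, with values taken
-- from the "ai" section of the log serializer output shown above.
local function log_ai_metrics(meta, usage, cache, workspace)
  local labels = { meta.provider_name, meta.request_model, cache.cache_status or "",
                   cache.vector_db or "", cache.embeddings_provider or "",
                   cache.embeddings_model or "", workspace }
  ai_requests:inc(1, labels)
  if usage.cost then
    ai_cost:inc(usage.cost, labels)
  end
  for _, token_type in ipairs({ "prompt_tokens", "completion_tokens", "total_tokens" }) do
    if usage[token_type] then
      ai_tokens:inc(usage[token_type], { meta.provider_name, meta.request_model,
        cache.cache_status or "", cache.vector_db or "", cache.embeddings_provider or "",
        cache.embeddings_model or "", token_type, workspace })
    end
  end
end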

Here is the Grafana AI template for these metrics:
https://grafana.com/grafana/dashboards/21162-kong-cx-ai/

[screenshot: preview of the Grafana Kong CX AI dashboard]

Checklist

  • The Pull Request has tests
  • A changelog file has been created under changelog/unreleased/kong, or the skip-changelog label has been added to the PR if a changelog is unnecessary (see README.md)
  • There is a user-facing docs PR against https://github.com/Kong/docs.konghq.com - PUT DOCS PR HERE

Issue reference

Fix AG-56

@AntoineJac AntoineJac requested review from hanshuebner and jschmid1 and removed request for hanshuebner and jschmid1 June 4, 2024 07:36
@fffonion
Contributor

fffonion commented Jun 5, 2024

Should we use llm to better describe the usage (in case we go into the image/video/audio area in the future), something like
kong_ai_llm_tokens_total etc., or maybe use a type="llm" label?

@AntoineJac
Contributor Author

@fffonion agreed, I will use llm in the metric name, so "kong_ai_llm_tokens_total", to avoid increasing the cardinality of the metrics, since every new label addition impacts performance. Thanks
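
To make the cardinality point concrete, here is a hypothetical comparison in the same assumed nginx-lua-prometheus style as the sketch above (the names are illustrative only): putting "llm" in the metric name keeps the existing label set, while adding a type label multiplies the exported time series by the number of distinct type values.

local prometheus = require("prometheus").init("prometheus_metrics")

-- Option A: encode the kind in the metric name; the label set, and therefore
-- the number of time series, is unchanged.
local llm_tokens = prometheus:counter("kong_ai_llm_tokens_total",
  "Tokens used by LLM requests", { "ai_provider", "ai_model", "workspace" })

-- Option B: add a type label; every existing (ai_provider, ai_model, workspace)
-- combination is multiplied by the number of type values ever observed.
local typed_tokens = prometheus:counter("kong_ai_tokens_total",
  "Tokens used by AI requests", { "type", "ai_provider", "ai_model", "workspace" })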

@AntoineJac
Contributor Author

@fffonion, I spent more time on this. The tokens/cost logic is actually the same today even with multi-modal APIs.

Let's keep kong_ai_tokens_total, and maybe we will add a new type in the future if the cost logic evolves.

@fffonion
Contributor

fffonion commented Jun 7, 2024

@AntoineJac Multi-modal or text-only models can all be called "LLMs". I was thinking of image- or audio-only services, say sending some text to a service to generate TTS audio, where there might not be a standard API format yet (like the one current LLM models have). Both LLMs and others can be categorized as "AI", so I would like to keep the naming constraints limited to our current scope. Although "AI" in Kong currently just means LLM, that may change in the future.

@AntoineJac
Contributor Author

AntoineJac commented Jun 7, 2024

@fffonion noted, thanks. I have made the changes and fixed the conflicts with the new init.lua file.

Review threads (outdated, resolved) on:

  • changelog/unreleased/kong/add-ai-data-prometheus.yml
  • kong/plugins/prometheus/exporter.lua (3 threads)
  • kong/llm/drivers/shared.lua
  • kong/plugins/prometheus/handler.lua
  • kong/plugins/prometheus/schema.lua
@AndyZhang0707 AndyZhang0707 requested review from samugi and nowNick June 25, 2024 09:55
@fffonion fffonion changed the title prometheus + fix #9320 feat(prometheus): add AI metrics #9320 Jun 26, 2024
@fffonion fffonion changed the title feat(prometheus): add AI metrics #9320 feat(prometheus): add AI metrics and fix #9320 Jun 26, 2024
@fffonion fffonion merged commit 68925dd into master Jun 26, 2024
31 checks passed
@fffonion fffonion deleted the feat/FTI-5861-AI-Metrics-Prometheus branch June 26, 2024 06:42
@team-gateway-bot
Collaborator

Cherry-pick to master failed because the commit(s) could not be cherry-picked automatically.

Please cherry-pick the changes locally.

# Add the upstream repository and fetch its master branch
git remote add upstream https://github.com/kong/kong-ee
git fetch upstream master
# Create a detached worktree based on upstream/master and work on a new branch there
git worktree add -d .worktree/cherry-pick-13148-to-master-to-upstream upstream/master
cd .worktree/cherry-pick-13148-to-master-to-upstream
git checkout -b cherry-pick-13148-to-master-to-upstream
# Cherry-pick every commit between the merge base and the PR head, recording the origin commit (-x)
ancref=$(git merge-base 99cb0608050cbd14611e7795960aade9fc6360cc 486f6ebeb03811300129d5ca10b65108dfee7633)
git cherry-pick -x $ancref..486f6ebeb03811300129d5ca10b65108dfee7633

@github-actions github-actions bot added the incomplete-cherry-pick A cherry-pick was incomplete and needs manual intervention label Jun 26, 2024
@AntoineJac
Contributor Author

Here is the cherry pick:
https://github.com/Kong/kong-ee/pull/9592

@kikito kikito removed the incomplete-cherry-pick A cherry-pick was incomplete and needs manual intervention label Jul 2, 2024
oowl pushed a commit that referenced this pull request Jul 15, 2024
Also fix a regression from #13148

AG-41

(cherry picked from commit 68925dd)