Commit

Adds LLM performance matrix doc to ESS & serverless (#5286) (#5296)
* Adds LLM performance matrix doc to ESS & serverless

* experimenting with matrix formatting

* uses alternative matrix formatting

* updates matrix format, adds alternative version

* uses updated table format

* updates ESS version of table

* fixes table

* removes outdated table formatting

* Update docs/assistant/llm-performance-matrix.asciidoc

Co-authored-by: Joe Peeples <[email protected]>

* Update docs/serverless/assistant/llm-performance-matrix.mdx

Co-authored-by: Joe Peeples <[email protected]>

* Update docs/serverless/assistant/llm-performance-matrix.mdx

Co-authored-by: Joe Peeples <[email protected]>

* Update docs/assistant/llm-performance-matrix.asciidoc

Co-authored-by: Joe Peeples <[email protected]>

* Update docs/assistant/llm-performance-matrix.asciidoc

Co-authored-by: natasha-moore-elastic <[email protected]>

* Update docs/serverless/assistant/llm-performance-matrix.mdx

Co-authored-by: natasha-moore-elastic <[email protected]>

* Update docs/assistant/llm-performance-matrix.asciidoc

---------

Co-authored-by: Joe Peeples <[email protected]>
Co-authored-by: natasha-moore-elastic <[email protected]>
(cherry picked from commit adb07fa)

# Conflicts:
#	docs/serverless/serverless-security.docnav.json

Co-authored-by: Benjamin Ironside Goldstein <[email protected]>
mergify[bot] and benironside authored Jun 3, 2024
1 parent 26e024a commit 9f2e32e
Showing 4 changed files with 693 additions and 0 deletions.
15 changes: 15 additions & 0 deletions docs/assistant/llm-performance-matrix.asciidoc
@@ -0,0 +1,15 @@
[[llm-performance-matrix]]
= Large language model performance matrix

This table describes the performance of various large language models (LLMs) for different use cases in {elastic-sec}, based on our internal testing. To learn more about these use cases, refer to <<attack-discovery, Attack discovery>> or <<security-assistant, AI Assistant>>.

[cols="1,1,1,1,1,1,1", options="header"]
|===
| *Feature* | *Model* | | | | |
| | *Claude 3: Opus* | *Claude 3: Sonnet* | *Claude 3: Haiku* | *GPT-4o* | *GPT-4 Turbo*| *GPT-4 32K*

| *Assistant - General* | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent
| *Assistant - {esql} Generation*| Great | Great | Poor | Excellent | Poor | Excellent
| *Assistant - Alert Questions* | Excellent | Excellent | Excellent | Excellent | Poor | Good (limited context)
| *Attack discovery* | Excellent | Great | Poor | Poor | Good | Good (limited context)
|===
1 change: 1 addition & 0 deletions docs/assistant/security-assistant.asciidoc
@@ -223,6 +223,7 @@ In addition to practical advice, AI Assistant can offer conceptual advice, tips,


include::ai-alert-triage.asciidoc[leveloffset=+1]
include::llm-performance-matrix.asciidoc[leveloffset=+1]
include::azure-openai-setup.asciidoc[leveloffset=+1]
include::connect-to-openai.asciidoc[leveloffset=+1]
include::connect-to-bedrock.asciidoc[leveloffset=+1]
19 changes: 19 additions & 0 deletions docs/serverless/assistant/llm-performance-matrix.mdx
@@ -0,0 +1,19 @@
---
id: llm-performance-matrix
slug: /serverless/security/llm-performance-matrix
title: Large language model performance matrix
description: Learn how different models perform on different tasks in ((elastic-sec)).
tags: ["security", "overview", "get-started"]
status: in review
---

This table describes the performance of various large language models (LLMs) for different use cases in ((elastic-sec)), based on our internal testing. To learn more about these use cases, refer to <DocLink id="attackDiscovery" text="Attack discovery"/> or <DocLink id="serverlessSecurityAIAssistant" text="AI Assistant"/>.

| **Feature:** | **Model** | | | | | |
|-------------------------------|-----------------------|--------------------|--------------------|------------|-----------------|----------------|
| | **Claude 3: Opus** | **Claude 3: Sonnet** | **Claude 3: Haiku** | **GPT-4o** | **GPT-4 Turbo** | **GPT-4 32K** |
| **Assistant: general** | Excellent | Excellent | Excellent | Excellent | Excellent | Excellent |
| **Assistant: ((esql)) generation** | Great | Great | Poor | Excellent | Poor | Excellent |
| **Assistant: alert questions** | Excellent | Excellent | Excellent | Excellent | Poor | Good (limited context) |
| **Attack discovery** | Excellent | Great | Poor | Poor | Good | Good (limited context) |
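
For readers who want to query the matrix programmatically, the ratings above can be transcribed into a small lookup structure. The following is a minimal sketch, not part of the committed docs: the feature keys, the `models_rated_at_least` helper, and the ordering Poor < Good < Great < Excellent are all assumptions made for illustration, since the table itself defines no numeric scale.

```python
# Ratings transcribed from the table in this commit. The dict layout, the
# feature keys, and the helper name are hypothetical, chosen for illustration.
MODELS = [
    "Claude 3: Opus", "Claude 3: Sonnet", "Claude 3: Haiku",
    "GPT-4o", "GPT-4 Turbo", "GPT-4 32K",
]

MATRIX = {
    "assistant_general": ["Excellent"] * 6,
    "assistant_esql_generation": [
        "Great", "Great", "Poor", "Excellent", "Poor", "Excellent",
    ],
    "assistant_alert_questions": [
        "Excellent", "Excellent", "Excellent", "Excellent", "Poor",
        "Good (limited context)",
    ],
    "attack_discovery": [
        "Excellent", "Great", "Poor", "Poor", "Good",
        "Good (limited context)",
    ],
}

# Assumed ordering of the rating labels; the source table does not state one.
RANK = {"Poor": 0, "Good": 1, "Great": 2, "Excellent": 3}


def models_rated_at_least(feature, minimum):
    """Return the models whose rating for `feature` is `minimum` or better."""
    threshold = RANK[minimum]
    selected = []
    for model, rating in zip(MODELS, MATRIX[feature]):
        # Drop qualifiers such as "(limited context)" before ranking.
        label = rating.split(" (")[0]
        if RANK[label] >= threshold:
            selected.append(model)
    return selected


if __name__ == "__main__":
    print(models_rated_at_least("attack_discovery", "Great"))
```

Note that the helper ignores the "(limited context)" qualifier when ranking, so a caller that cares about context-window limits would need to check the raw rating string as well.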
