[Obs AI Assistant] Evaluation framework #173010

Merged

dgieselaar
Member

@dgieselaar dgieselaar commented Dec 10, 2023

Adds an evaluation framework for the Observability AI Assistant. For more details, see the README.

@apmmachine
Contributor

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

  • /oblt-deploy : Deploy a Kibana instance using the Observability test environments.
  • /oblt-deploy-serverless : Deploy a serverless Kibana instance using the Observability test environments.
  • run elasticsearch-ci/docs : Re-trigger the docs validation. (use unformatted text in the comment!)

@dgieselaar dgieselaar added the release_note:skip, v8.12.0, and v8.12.1 labels Dec 10, 2023
@dgieselaar dgieselaar marked this pull request as ready for review December 10, 2023 10:14
@dgieselaar dgieselaar requested a review from a team as a code owner December 10, 2023 10:14
@dgieselaar
Member Author

@elasticmachine merge upstream

@kibana-ci
Collaborator

💛 Build succeeded, but was flaky

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #10 / discover Discover Saved Queries Manage saved queries updates a saved query

Metrics [docs]

✅ unchanged

History

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

],
};

const sortedEvaluationFunctions = sortBy(evaluationFunctions, 'fileName', 'name');
Contributor

I'm a little hesitant to sort the tests within the file. Why not keep them in their order of declaration?

I haven't seen any other test framework sort test cases

Member Author

We use exports, so no order is guaranteed. Ideally we'd have something similar to `describe` and `it`, where the order of the statements decides the order of execution, but we need to figure out how to use something like Mocha for this.

Contributor

Right, then it makes a lot of sense to sort them until then
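
To illustrate the ordering agreed on in this thread, here is a minimal sketch of sorting collected evaluation functions by file name and then by name so the run order is deterministic. The `fileName` and `name` keys come from the `sortBy` call quoted above; the `EvaluationFunction` shape, the helper names, and the sample data are hypothetical, and a plain comparator stands in for lodash's `sortBy` to keep the example self-contained.

```typescript
// Hypothetical shape: only the `fileName` and `name` keys appear in the PR's sortBy call.
interface EvaluationFunction {
  fileName: string;
  name: string;
}

// Compare by fileName first, then by name, so the resulting order does not
// depend on how the functions happened to be collected.
function compareEvaluationFunctions(a: EvaluationFunction, b: EvaluationFunction): number {
  if (a.fileName !== b.fileName) {
    return a.fileName < b.fileName ? -1 : 1;
  }
  if (a.name !== b.name) {
    return a.name < b.name ? -1 : 1;
  }
  return 0;
}

function sortEvaluationFunctions(fns: EvaluationFunction[]): EvaluationFunction[] {
  // Copy before sorting so the caller's array is left untouched.
  return [...fns].sort(compareEvaluationFunctions);
}

// Made-up sample data, deliberately out of order.
const collected: EvaluationFunction[] = [
  { fileName: 'b.ts', name: 'second' },
  { fileName: 'a.ts', name: 'zeta' },
  { fileName: 'b.ts', name: 'first' },
  { fileName: 'a.ts', name: 'alpha' },
];

const sorted = sortEvaluationFunctions(collected);
console.log(sorted.map((fn) => `${fn.fileName}:${fn.name}`).join(', '));
// prints "a.ts:alpha, a.ts:zeta, b.ts:first, b.ts:second"
```

Sorting trades declaration order for reproducibility, which matches the conclusion above: it is a stopgap until a `describe`/`it`-style runner can define the order explicitly.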


async function getKibanaUrl({ kibana, log }: { kibana: string; log: ToolingLog }) {
  try {
    const isCI = process.env.CI?.toLowerCase() === 'true';
Contributor

Thought this wouldn't run on CI?

Member Author

Copy pasta 😁

@dgieselaar dgieselaar merged commit c36410f into elastic:main Dec 13, 2023
39 checks passed
@dgieselaar dgieselaar deleted the obs-ai-assistant-evaluation-framework branch December 13, 2023 14:44
kibanamachine pushed a commit to kibanamachine/kibana that referenced this pull request Dec 13, 2023
Adds an evaluation framework for the Observability AI Assistant. For
more details, [see the
README](https://github.com/dgieselaar/kibana/blob/obs-ai-assistant-evaluation-framework/x-pack/plugins/observability_ai_assistant/scripts/evaluation/README.md).

---------

Co-authored-by: kibanamachine <[email protected]>
(cherry picked from commit c36410f)
@kibanamachine
Contributor

💚 All backports created successfully

Status Branch Result
8.12

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

kibanamachine added a commit that referenced this pull request Dec 13, 2023
# Backport

This will backport the following commits from `main` to `8.12`:
- [[Obs AI Assistant] Evaluation framework
(#173010)](#173010)

<!--- Backport version: 8.9.7 -->

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sqren/backport)


Co-authored-by: Dario Gieselaar <[email protected]>
Labels
release_note:skip, v8.12.0, v8.12.1, v8.13.0
6 participants