-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add text embedding processor #304
Add text embedding processor #304
Conversation
Signed-off-by: miguel-vila <[email protected]>
Thank you! We just merged a test framework in #299. Want to try to add a test for this API? Check out https://github.com/opensearch-project/opensearch-api-specification/blob/main/tests/index_lifecycle.yaml for an example. You can run it with |
Changes AnalysisCommit SHA: 32f992a API ChangesSummary
ReportThe full API changes report is available at: https://github.com/opensearch-project/opensearch-api-specification/actions/runs/9356073256/artifacts/1564049761 API Coverage
|
Signed-off-by: miguel-vila <[email protected]>
So likely a bug! That's why we need tests ;) |
@dblock @miguel-vila Small discrepancies between the spec and the actual implementation of OS are to be expected right now. Much of spec of the core features that we have is inherited from ElasticSearch. Changes to OS since have not been reflected in the spec. We will need help from OS core team to review the spec of the core features. For now, @miguel-vila, you can remove the |
Signed-off-by: miguel-vila <[email protected]>
3f3c584
to
32f992a
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we're going to have many ingest pipeline tests. How about we organize things in folders that match the schema?
So this test should probably go into something like tests/ingest/ingest_with_text_embedding_processor.yaml?
It would be great if the story included a GET of the pipeline, maybe even used the processor.
skip: false | ||
description: | | ||
This test story checks that we can create an ingest pipeline with a text | ||
embedding processor |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's sweat some small stuff since this is new. Add a period. Maybe shorten, "Create and use an ingest pipeline with a text embedding processor."
method: DELETE | ||
status: [200, 404] | ||
chapters: | ||
- synopsis: Create ingest pipeline for text embedding |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Create an ingest pipeline ... + add a period.
I think this can be merge now. We address those minor wording issues later in the test. |
I could address some of the bigger changes in a separate PR. In particular, I wanted to test a whole flow of creating a model and using search against it (I think the search was failing due to the model not existing, which makes sense because I set some random
I have some work in progress in that direction but these might be big/controversial changes so would be good to know your thoughts, might push my WIP PR later. |
Created #315 to show you the possible changes, it's still in a very rough shape. |
Signed-off-by: miguel-vila [email protected]
Description
Adds a definition for the text embedding processor: https://opensearch.org/docs/latest/ingest-pipelines/processors/text-embedding/
Please confirm whether the
field_map
definition makes senseBy submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.