
Use endpoint instead of model in the inference API #2528

Merged
10 commits merged into main from endpoints-not-models on May 2, 2024

Conversation

davidkyle
Member

@davidkyle davidkyle commented Apr 25, 2024

Update the inference APIs to refer to inference endpoints rather than models. This involves renaming classes and updating comments, but it also includes the following potentially breaking changes:

  • `inference.delete_model` renamed to `inference.delete`
  • `inference.get_model` renamed to `inference.get`
  • `inference.put_model` renamed to `inference.put`

The format of the get response has also changed: the `models` field is renamed to `endpoints`.

Previously:

```json
{
    "models": [...]
}
```

With this change:

```json
{
    "endpoints": [...]
}
```

In `GetResponse.ts` I've used an `@aliases` tag to retain the `models` option.

Corresponding Elasticsearch change: elastic/elasticsearch#107704
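For downstream code that has to read both the old and the new response shape, a small compatibility shim is one option. This is a hypothetical sketch (the function name and the plain-dict response are assumptions for illustration, not part of any client API):

```python
def endpoints_from_get_response(resp: dict) -> list:
    """Return the inference endpoint list from a GET inference response,
    accepting the new "endpoints" key as well as the old "models" key."""
    if "endpoints" in resp:
        return resp["endpoints"]
    # Fall back to the pre-change field name.
    return resp.get("models", [])
```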

Member

@pquentin pquentin left a comment


Thanks! Since this is for 8.15, I suggest waiting for elastic/elasticsearch#107704 so that we 1/ make sure the JSON spec is exactly the same and 2/ allow clients-flight-recorder to keep up with the change and validate it properly.

I left a few comments there too.

@davidkyle davidkyle marked this pull request as ready for review April 29, 2024 13:35
@davidkyle davidkyle requested a review from a team as a code owner April 29, 2024 13:35
@davidkyle
Member Author

> Thanks! Since this is for 8.15, I suggest waiting for elastic/elasticsearch#107704

That PR is green now. I hesitated before pressing the merge button because, after merging, we must have a compatible client ready for the next serverless release, and when it comes to the breaking change I am hoping there is some way to soften the blow.

Taking Python as an example, existing code will use `client.inference.put_model(...)`, and after this change the Python function is `client.inference.put(...)`. Is there any way of having `inference.put_model` resolve to `inference.put` so that we don't break existing code samples? Perhaps by having a spec for both `inference.put` and `inference.put_model`, where the latter is deprecated and not documented?
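One shape the suggestion above could take in a client is a thin deprecated wrapper that resolves the old name to the new one. This is a hypothetical minimal sketch (the class, method bodies, and return value are assumptions; the real elasticsearch-py namespace objects are generated from the spec):

```python
import warnings


class InferenceNamespace:
    """Sketch of an inference client namespace that keeps the old
    method name as a deprecated alias of the new one."""

    def put(self, inference_id: str, body=None) -> dict:
        # New-style method after this change; the return value is a stub.
        return {"inference_id": inference_id}

    def put_model(self, inference_id: str, body=None) -> dict:
        # Old-style method, retained so existing code samples keep working.
        warnings.warn(
            "inference.put_model is deprecated, use inference.put instead",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.put(inference_id, body)
```

Existing calls to `put_model` would then keep working while surfacing a `DeprecationWarning`, giving users a release or two to migrate.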

@pquentin
Member

pquentin commented Apr 30, 2024

> Taking Python as an example, existing code will use `client.inference.put_model(...)`, and after this change the Python function is `client.inference.put(...)`. Is there any way of having `inference.put_model` resolve to `inference.put` so that we don't break existing code samples? Perhaps by having a spec for both `inference.put` and `inference.put_model`, where the latter is deprecated and not documented?

I'm not sure we can deprecate an endpoint today if the endpoint itself is gone from Elasticsearch. (The only existing example is `_knn_search`.) Also, the clients are one thing, but on Serverless right now the usage of Kibana is quite high. And it's not as if deprecating a function has any practical effect: things will break eventually. I suppose that's what tech preview is for.

> That PR is green now. I hesitated before pressing the merge button as after merging we must have a compatible client ready for the next serverless release

We don't have great automation here; most serverless clients don't get frequent updates yet. We can coordinate to get the serverless Python client out when you want, if that helps.

Member

@pquentin pquentin left a comment


Thanks! Requested a few cosmetic fixes and asked about the models alias.

`specification/_json_spec/inference.put.json` (outdated, resolved)

```diff
 export class Response {
   body: {
-    models: Array<ModelConfigContainer>
+    /** @aliases models */
+    endpoints: Array<ModelConfigContainer>
```
Member


Since we decided to break compatibility, should we keep `models` as an alias? For what it's worth, the current Python client does the wrong thing and would send `models` as-is to the server. That could be fixed in time for 8.15, though.

Member Author


I agree and have removed the alias.

The change breaks compatibility; we can do that because the API is neither GA nor heavily used, so it should be a clean break.

`specification/inference/inference/InferenceRequest.ts` (outdated, resolved)
Co-authored-by: Miguel Grinberg <[email protected]>
Member

@pquentin pquentin left a comment


Thanks! LGTM.

Contributor

github-actions bot commented May 2, 2024

Below are the validation results for the APIs you have changed.

| API | Status | Request | Response |
| --- | --- | --- | --- |
| `inference.delete` | | Missing test | Missing test |
| `inference.get` | 🟠 | Missing recording | Missing recording |
| `inference.inference` | | Missing test | Missing test |
| `inference.put` | | Missing test | Missing test |

You can validate these APIs yourself by using the `make validate` target.

@davidkyle davidkyle merged commit e01c305 into main May 2, 2024
6 checks passed
@davidkyle davidkyle deleted the endpoints-not-models branch May 2, 2024 12:26
@davidkyle
Member Author

Thanks for the reviews

3 participants