Skip to content

Commit

Permalink
Update start-trained-model-deployment.asciidoc
Browse files Browse the repository at this point in the history
Updating with changes in #115041
  • Loading branch information
ppf2 authored Dec 17, 2024
1 parent bde485a commit 77c3f04
Showing 1 changed file with 4 additions and 4 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -138,8 +138,8 @@ normal priority deployments.
Controls how many inference requests are allowed in the queue at a time.
Every machine learning node in the cluster where the model can be allocated
has a queue of this size; when the number of requests exceeds the total value,
new requests are rejected with a 429 error. Defaults to 1024. Max allowed value
is 1000000.
new requests are rejected with a 429 error. Defaults to 10000. Max allowed value
is 100000.

`threads_per_allocation`::
(Optional, integer)
Expand Down Expand Up @@ -173,7 +173,7 @@ The API returns the following results:
"model_bytes": 265632637,
"threads_per_allocation" : 1,
"number_of_allocations" : 1,
"queue_capacity" : 1024,
"queue_capacity" : 10000,
"priority": "normal"
},
"routing_table": {
Expand Down Expand Up @@ -229,4 +229,4 @@ POST _ml/trained_models/my_model/deployment/_start?deployment_id=my_model_for_se
}
}
--------------------------------------------------
// TEST[skip:TBD]
// TEST[skip:TBD]

0 comments on commit 77c3f04

Please sign in to comment.