Skip to content

Commit

Permalink
Changes paragraph placement
Browse files Browse the repository at this point in the history
  • Loading branch information
kosabogi committed Oct 25, 2024
1 parent 27b445c commit 8f7a9d2
Showing 1 changed file with 6 additions and 6 deletions.
12 changes: 6 additions & 6 deletions serverless/pages/ml-nlp-auto-scale.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -20,11 +20,6 @@ Trained model autoscaling is available for Search, Observability, and Security p

Security and Observability projects are only charged for data ingestion and retention. They are not charged for processing power (vCU usage), which is used for more complex operations, like running advanced search models. For example, in Search projects, models such as ELSER require significant processing power to provide more accurate search results.

Search projects are given access to more processing resources, while Security and Observability projects have lower limits. This difference is reflected in the UI configuration: Search projects have higher resource limits compared to Security and Observability projects to accommodate their more complex operations.

On Serverless, adaptive allocations are automatically enabled for all project types.
However, the "Adaptive resources" control is not displayed in Kibana for Observability and Security projects.

## Enabling autoscaling through APIs - adaptive allocations

Model allocations are independent units of work for NLP tasks.
Expand Down Expand Up @@ -78,7 +73,12 @@ You can choose from three levels of resource usage for your trained model deploy

Refer to the tables in the auto-scaling-matrix section to find out the setings for the level you selected.

<DocImage size="xxl" url="../images/ml-nlp-deployment.png" alt="ML model deployment with adaptive resources enabled." />
<DocImage size="xxl" url="../images/ml-nlp-deployment.png" alt="ML model deployment with adaptive resources enabled." />

Search projects are given access to more processing resources, while Security and Observability projects have lower limits. This difference is reflected in the UI configuration: Search projects have higher resource limits compared to Security and Observability projects to accommodate their more complex operations.

On Serverless, adaptive allocations are automatically enabled for all project types.
However, the "Adaptive resources" control is not displayed in Kibana for Observability and Security projects.

## Model deployment resource matrix

Expand Down

0 comments on commit 8f7a9d2

Please sign in to comment.