From 682c999fbd35b514cb7ae5f52b1ecb842651365e Mon Sep 17 00:00:00 2001
From: Pavithra Vijayakrishnan <160681768+pvijayakrish@users.noreply.github.com>
Date: Mon, 28 Oct 2024 09:40:44 -0700
Subject: [PATCH] Update README.md

Co-authored-by: Misha Chornyi <99709299+mc-nv@users.noreply.github.com>
---
 README.md | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/README.md b/README.md
index d445ecc8..9ca11b36 100644
--- a/README.md
+++ b/README.md
@@ -18,6 +18,9 @@ limitations under the License.
 
 # Triton Model Analyzer
 
+> [!WARNING]
+> The current release is version [1.45.0](https://github.com/triton-inference-server/model_analyzer/releases/latest) and corresponds to the 24.10 container release on NVIDIA GPU Cloud (NGC).
+
 Triton Model Analyzer is a CLI tool which can help you find a more optimal configuration, on a given piece of hardware, for single, multiple, ensemble, or BLS models running on a [Triton Inference Server](https://github.com/triton-inference-server/server/). Model Analyzer will also generate reports to help you better understand the trade-offs of the different configurations along with their compute and memory requirements.