Fix sample output

triton-inference-server · nv-hwoo · Oct 28, 2023 · Oct 19, 2023 · Oct 19, 2023 · Oct 20, 2023
commit 0d42a15e3fb5e86cb81f97f57a3ecd1ce9817b0f
diff --git a/src/c++/perf_analyzer/docs/llm.md b/src/c++/perf_analyzer/docs/llm.md
@@ -124,7 +124,7 @@ prompts.
 python profile.py -m vllm --prompt-size-range 100 500 200 --max-tokens 256 --ignore-eos
 
 # Sample output
-# [ Benchmark Summary ]
+# [ BENCHMARK SUMMARY ]
 #   Prompt size: 100, Average first-token latency: 0.0388 sec, Average total token-to-token latency: 0.0066 sec
 #   Prompt size: 300, Average first-token latency: 0.0431 sec, Average total token-to-token latency: 0.0071 sec
 #   Prompt size: 500, Average first-token latency: 0.0400 sec, Average total token-to-token latency: 0.0070 sec