Skip to content

Commit

Permalink
Fix sample output
Browse files Browse the repository at this point in the history
  • Loading branch information
nv-hwoo committed Oct 26, 2023
1 parent 723dbad commit c727af4
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/c++/perf_analyzer/docs/llm.md
Original file line number Diff line number Diff line change
Expand Up @@ -124,7 +124,7 @@ prompts.
python profile.py -m vllm --prompt-size-range 100 500 200 --max-tokens 256 --ignore-eos
# Sample output
# [ Benchmark Summary ]
# [ BENCHMARK SUMMARY ]
# Prompt size: 100, Average first-token latency: 0.0388 sec, Average total token-to-token latency: 0.0066 sec
# Prompt size: 300, Average first-token latency: 0.0431 sec, Average total token-to-token latency: 0.0071 sec
# Prompt size: 500, Average first-token latency: 0.0400 sec, Average total token-to-token latency: 0.0070 sec
Expand Down

0 comments on commit c727af4

Please sign in to comment.