Skip to content

Commit

Permalink
Modified doc
Browse files Browse the repository at this point in the history
  • Loading branch information
AndyDai-nv committed Aug 22, 2024
1 parent f3fcdf0 commit dbdf8f6
Showing 1 changed file with 13 additions and 13 deletions.
26 changes: 13 additions & 13 deletions genai-perf/docs/goodput.md
Original file line number Diff line number Diff line change
Expand Up @@ -72,19 +72,19 @@ genai-perf profile \
Example output:

```
LLM Metrics
┏━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┓
┃ Statistic ┃ avg ┃ min ┃ max ┃ p99 ┃ p90 ┃ p75 ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━┩
│ Time to first token (ms) │ 4.443.6323.8513.30 5.154.20
│ Inter token latency (ms) │ 1.03 │ 0.76 │ 1.92 │ 1.63 │ 1.18 │ 1.08
│ Request latency (ms) │ 22.31 7.67 45.83 41.11 25.18 21.97
│ Output sequence length │ 18.54 5.00 │ 24.00 │ 22.41 21.00 │ 20.00 │
│ Input sequence length │ 550.06 │ 550.00 │ 553.00 │ 551.82 │ 550.00 │ 550.00 │
└──────────────────────────┴────────┴────────┴────────┴────────┴────────┴────────┘
Output token throughput (per sec): 827.09
Request throughput (per sec): 44.62
Request goodput (per sec): 30.95
NVIDIA GenAI-Perf | LLM Metrics
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┳━━━━━━━━┓
Statistic ┃ avg ┃ min ┃ max ┃ p99 ┃ p90 ┃ p75 ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━╇━━━━━━━━┩
Time to first token (ms) │ 7.464.1518.4017.8913.265.56
Inter token latency (ms) │ 1.05 │ 0.98 │ 1.16 │ 1.16 │ 1.12 │ 1.06
Request latency (ms) │ 632.78620.90648.19647.81644.38638.67
Output sequence length │ 595.60539.00 │ 638.00 │ 636.80626.00 │ 608.00 │
Input sequence length │ 550.00 │ 550.00 │ 550.00 │ 550.00 │ 550.00 │ 550.00 │
│ Output token throughput (per sec) │ 940.96 │ N/A │ N/A │ N/A │ N/A │ N/A │
│ Request throughput (per sec) │ 1.58 │ N/A │ N/A │ N/A │ N/A │ N/A │
Request goodput (per sec) │ 0.32 │ N/A │ N/A │ N/A │ N/A │ N/A │
└───────────────────────────────────┴────────┴────────┴────────┴────────┴────────┴────────┘
```

### Profile Embeddings Models Goodput<a id="embeddings"></a>
Expand Down

0 comments on commit dbdf8f6

Please sign in to comment.