Skip to content

Commit

Permalink
Add link to LLM docs from PA Readme
Browse files Browse the repository at this point in the history
  • Loading branch information
nealvaidya authored and matthewkotila committed Nov 27, 2023
1 parent 2a125ee commit 6f4b27e
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions src/c++/perf_analyzer/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -73,6 +73,9 @@ changes in performance as you experiment with different optimization strategies.
[TorchServe](docs/benchmarking.md#benchmarking-torchserve) can be used as the
inference server in addition to the default Triton server

- [LLMs](docs/llm.md) can also be measured and charcterized with specific metrics
like token-to-token latency

<br>

# Quick Start
Expand Down

0 comments on commit 6f4b27e

Please sign in to comment.