Update readme with new pip instructions and reorganize (#230)
* Update readme with new pip instructions and reorganize
* Fix spacing and link
* Migrate to use the pypi.org release
* Updated headers and wording around installation
* Update templates for docs
1 parent bf54dfb, commit ea2024d
Showing 9 changed files with 51 additions and 87 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -73,45 +73,34 @@ INSTALLATION | |
|
||
## Installation | ||
|
||
The easiest way to install GenAI-Perf is through | ||
[Triton Server SDK container](https://ngc.nvidia.com/catalog/containers/nvidia:tritonserver). | ||
Install the latest release using the following command: | ||
The easiest way to install GenAI-Perf is through pip. | ||
### Install GenAI-Perf (Ubuntu 24.04, Python 3.10+) | ||
|
||
```bash | ||
export RELEASE="24.10" | ||
|
||
docker run -it --net=host --gpus=all nvcr.io/nvidia/tritonserver:${RELEASE}-py3-sdk | ||
|
||
# Check out genai_perf command inside the container: | ||
genai-perf --help | ||
pip install genai-perf | ||
``` | ||
**NOTE**: you must already have CUDA 12 installed | ||
|
||
<details> | ||
|
||
<summary>Alternatively, to install from source:</summary> | ||
<details> | ||
|
||
Since GenAI-Perf depends on Perf Analyzer, | ||
you'll need to install the Perf Analyzer binary: | ||
<summary>Alternatively, to install the container:</summary> | ||
|
||
### Install Perf Analyzer (Ubuntu, Python 3.10+) | ||
[Triton Server SDK container](https://ngc.nvidia.com/catalog/containers/nvidia:tritonserver) | ||
|
||
**NOTE**: you must already have CUDA 12 installed | ||
(checkout the [CUDA installation guide](https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html)). | ||
Pull the latest release using the following command: | ||
|
||
```bash | ||
pip install tritonclient | ||
export RELEASE="24.12" | ||
|
||
sudo apt update && sudo apt install -y --no-install-recommends libb64-0d | ||
``` | ||
|
||
You can also build Perf Analyzer [from source](../docs/install.md#build-from-source) as well. | ||
|
||
### Install GenAI-Perf from source | ||
docker run -it --net=host --gpus=all nvcr.io/nvidia/tritonserver:${RELEASE}-py3-sdk | ||
|
||
```bash | ||
pip install git+https://github.com/triton-inference-server/perf_analyzer.git#subdirectory=genai-perf | ||
# Validate the genai-perf command works inside the container: | ||
genai-perf --help | ||
``` | ||
|
||
You can also build Perf Analyzer [from source](../docs/install.md#build-from-source) to use alongside GenAI-Perf as well. | ||
|
||
</details> | ||
|
||
</br> | ||
|
@@ -142,7 +131,7 @@ docker run -ti \ | |
--shm-size=1g --ulimit memlock=-1 \ | ||
-v /tmp:/tmp \ | ||
-v ${HOME}/.cache/huggingface:/root/.cache/huggingface \ | ||
nvcr.io/nvidia/tritonserver:24.10-trtllm-python-py3 | ||
nvcr.io/nvidia/tritonserver:24.12-trtllm-python-py3 | ||
|
||
# Install the Triton CLI | ||
pip install git+https://github.com/triton-inference-server/[email protected] | ||
|
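The new pip path in this diff states a Python 3.10+ requirement. A minimal sketch of how that prerequisite could be checked before running `pip install genai-perf`, assuming `python3` is on `PATH`. The helper names `minor_component`, `version_at_least`, and `check_python` are hypothetical and not part of GenAI-Perf or Perf Analyzer; the CUDA 12 requirement is not checked here.

```bash
# Hypothetical helpers (not part of GenAI-Perf), written in POSIX-compatible shell.

# Print the minor component of "MAJOR.MINOR[.PATCH]", or 0 if there is none.
minor_component() {
    case "$1" in
        *.*) mc_rest=${1#*.}; printf '%s\n' "${mc_rest%%.*}" ;;
        *)   printf '0\n' ;;
    esac
}

# Succeed (exit status 0) when version $1 >= version $2.
# Only the major and minor components are compared.
version_at_least() {
    va_have_major=${1%%.*}
    va_want_major=${2%%.*}
    if [ "$va_have_major" -ne "$va_want_major" ]; then
        [ "$va_have_major" -gt "$va_want_major" ]
    else
        [ "$(minor_component "$1")" -ge "$(minor_component "$2")" ]
    fi
}

# Succeed when the python3 on PATH reports version 3.10 or newer.
check_python() {
    cp_version=$(python3 -c 'import platform; print(platform.python_version())') || return 1
    version_at_least "$cp_version" "3.10"
}
```

With these functions sourced, the install could be gated as `check_python && pip install genai-perf`. Comparing components numerically avoids the string-comparison trap in which `3.9` sorts after `3.10`.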