Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarify new arguments in documentation #746

Merged
merged 2 commits into from
Jul 10, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions src/c++/perf_analyzer/docs/cli.md
Original file line number Diff line number Diff line change
Expand Up @@ -157,6 +157,13 @@ will also be reported in the results.
Default is `-1` indicating that the average latency is used to determine
stability.

#### `--request-count=<n>`

Specifies a total number of requests to use for measurement.

Default is `0`, which means that there is no request count and the measurement
will proceed using windows until stabilization is detected.

#### `-r <n>`
#### `--max-trials=<n>`

Expand Down
4 changes: 2 additions & 2 deletions src/c++/perf_analyzer/genai-perf/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -301,8 +301,8 @@ options:

When the dataset is coming from a file, you can specify the following
options:
* `--input-file <path>`: The input file containing the single prompt to
use for benchmarking.
* `--input-file <path>`: The input file containing the prompts to
use for benchmarking as JSON objects.

For any dataset, you can specify the following options:
* `--output-tokens-mean <int>`: The mean number of tokens in each output. Ensure
Expand Down
Loading