docs: Update README.md #63
base: main
Conversation
counter_prompt_tokens
# Number of generation tokens processed.
counter_generation_tokens
# Counter of prefill tokens processed.
Do we need both this section and the one right below it? Why not just use the below section instead?
vLLM stats are reported by the metrics endpoint in fields that are prefixed with vllm:. For example, the metrics reported by Triton will look similar to the following:
# HELP vllm:prompt_tokens_total Number of prefill tokens processed.
# TYPE vllm:prompt_tokens_total counter
vllm:prompt_tokens_total{model="vllm_model",version="1"} 10
# HELP vllm:generation_tokens_total Number of generation tokens processed.
# TYPE vllm:generation_tokens_total counter
vllm:generation_tokens_total{model="vllm_model",version="1"} 16
...
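For illustration only (not part of this PR): a minimal sketch of scraping these vLLM metrics from Triton's Prometheus endpoint in Python. The URL assumes Triton's default metrics port, 8002; adjust it for your deployment.

import urllib.request

# Assumed default Triton metrics endpoint; change host/port as needed.
METRICS_URL = "http://localhost:8002/metrics"

with urllib.request.urlopen(METRICS_URL) as resp:
    body = resp.read().decode("utf-8")

# Keep only the vLLM-specific series, including their # HELP / # TYPE lines.
for line in body.splitlines():
    if "vllm:" in line:
        print(line)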
I'd rather have another section listing all supported metrics. The sample metrics output is just hard to follow IMO.
And for example, changing counter_prompt_tokens to vllm:prompt_tokens_total allows the reader to easily locate the corresponding output in the section below.
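As a sketch, the renamed section might then read like this (assuming only the two counters quoted above; the comment text is taken from the existing diff and HELP strings):

# Number of prefill tokens processed.
vllm:prompt_tokens_total
# Number of generation tokens processed.
vllm:generation_tokens_total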
@oandreeva-nv do you mind driving this PR if you have time?
How about we create a dedicated doc for metrics, so that our front-facing README stays nice and uncluttered?
There will be changes to the vLLM backend metrics in upcoming releases. Converting this PR to draft. @oandreeva-nv @rmccorm4
List metrics as vllm:* names instead of the variable names.