Use ignore_eos when benchmarking Triton + vLLM deterministically #204

dyastremsky · 2024-12-03T20:50:04Z

When using the --output-tokens-mean-deterministic flag, use the arg "ignore_eos" for Triton + vLLM, since it supports it and it could result in more accurate benchmarking.

dyastremsky · 2024-12-03T20:55:36Z

Closing, since it already works without ignore-eos after user testing. No need to send extra data if the current approach accomplishes the expected behavior.

Use ignore_eos when Triton vLLM deterministic

4a47914

dyastremsky requested review from rmccorm4 and nv-hwoo December 3, 2024 20:50

dyastremsky self-assigned this Dec 3, 2024

dyastremsky temporarily deployed to GITLAB December 3, 2024 20:50 — with GitHub Actions Inactive

debermudez approved these changes Dec 3, 2024

View reviewed changes

dyastremsky closed this Dec 3, 2024

dyastremsky deleted the dyas-vllm-deterministic branch December 17, 2024 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use ignore_eos when benchmarking Triton + vLLM deterministically #204

Use ignore_eos when benchmarking Triton + vLLM deterministically #204

dyastremsky commented Dec 3, 2024

dyastremsky commented Dec 3, 2024

Use ignore_eos when benchmarking Triton + vLLM deterministically #204

Use ignore_eos when benchmarking Triton + vLLM deterministically #204

Conversation

dyastremsky commented Dec 3, 2024

dyastremsky commented Dec 3, 2024