
Add continuous batch size benchmark to LLM guide #404

Merged — 3 commits merged into periodic-concurrency-mode from matthewkotila-llm-guide on Oct 6, 2023

Conversation

@matthewkotila (Contributor) commented on Sep 28, 2023:

Adds a third part to the guide covering how to benchmark the effect of continuous batch size on token-to-token latency.
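
For context, the kind of measurement the new section covers can be sketched with Perf Analyzer's periodic concurrency mode, which ramps the number of in-flight requests while the profile export captures per-response timestamps for token-to-token latency. The model name, flag values, and input file below are illustrative assumptions, not the guide's literal command:

```bash
# Illustrative sketch (values are assumptions, not the guide's exact command):
# ramp in-flight requests from 1 to 30, launching one new request every 32 responses,
# and export per-response timestamps for token-to-token latency analysis.
perf_analyzer -m vllm -i grpc --async --streaming \
    --input-data=prompts.json \
    --periodic-concurrency-range=1:30:1 \
    --request-period=32 \
    --profile-export-file=profile_export.json
```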

@matthewkotila (Contributor, Author) commented:

Do not merge until the infinite loop bug has been resolved.

@matthewkotila matthewkotila marked this pull request as ready for review October 3, 2023 23:23
src/c++/perf_analyzer/docs/llm.md — 3 review threads (outdated, resolved)
@matthewkotila (Contributor, Author) commented:

> Do not merge until the infinite loop bug has been resolved.

Infinite loop bug resolved by #410

@matthewkotila matthewkotila merged commit 1b304c5 into periodic-concurrency-mode Oct 6, 2023
@matthewkotila matthewkotila deleted the matthewkotila-llm-guide branch October 6, 2023 23:51
matthewkotila added a commit that referenced this pull request Oct 7, 2023
* Add continuous batch size benchmark to LLM guide

* Update llm.md

* Update llm.md
Labels: None yet
2 participants