[WIP] LLaVA support #720

Merged 19 commits into vision-language on Jul 10, 2024
Conversation

mwawrzos
Collaborator

The goal of this PR is to enable measuring VLM throughput and latency when the input includes images.

@mwawrzos mwawrzos self-assigned this Jun 27, 2024
@mwawrzos mwawrzos force-pushed the mwawrzos/openai-vision branch from e1bfcb4 to 1326edb Compare June 27, 2024 07:52
@mwawrzos mwawrzos force-pushed the mwawrzos/openai-vision branch from 1326edb to 06df643 Compare June 27, 2024 08:30
@mwawrzos mwawrzos force-pushed the mwawrzos/openai-vision branch from a400418 to dfb6b1d Compare June 27, 2024 15:31
@@ -41,6 +87,7 @@ class PromptSource(Enum):
class OutputFormat(Enum):
OPENAI_CHAT_COMPLETIONS = auto()
OPENAI_COMPLETIONS = auto()
OPENAI_VISION = auto()

The response format for chat VLMs is the same as the regular chat completion, since we just have text out; why have a separate entry?

Contributor

The name of the enum is a bit misleading 😅 The OutputFormat enum is actually not about the format of the response; it's about the format of the input JSON file generated by LlmInputs.
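For context, the diff above can be reproduced as a minimal standalone sketch. As the comment notes, despite its name the enum selects the format of the generated input JSON, not of the model's response:

```python
from enum import Enum, auto

class OutputFormat(Enum):
    """Selects the format of the input JSON file produced by LlmInputs
    (not the format of the model's response)."""
    OPENAI_CHAT_COMPLETIONS = auto()
    OPENAI_COMPLETIONS = auto()
    OPENAI_VISION = auto()  # new entry added by this PR
```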

@nv-hwoo nv-hwoo changed the base branch from main to vision-language July 3, 2024 18:28
@mwawrzos mwawrzos force-pushed the mwawrzos/openai-vision branch from 60b658a to 8bf2710 Compare July 4, 2024 11:32
@nv-hwoo nv-hwoo force-pushed the mwawrzos/openai-vision branch from bb5511d to b82bf40 Compare July 10, 2024 04:56
@nv-hwoo nv-hwoo requested a review from dyastremsky July 10, 2024 05:38
Contributor

@dyastremsky dyastremsky left a comment

Fantastic work! Did you mean to delete test_end_to_end.py as part of this PR?

@nv-hwoo
Contributor

nv-hwoo commented Jul 10, 2024

@dyastremsky yes, the script was originally created by Tim at the beginning of genai-perf, when we didn't have CI, but we never used it afterwards (it's not even part of our unit tests). Since we now have CI in place, I don't think we need this script anymore.

@dyastremsky
Contributor

Great job cleaning this up!

@nv-hwoo nv-hwoo merged commit 6259b96 into vision-language Jul 10, 2024
5 checks passed
@nv-hwoo nv-hwoo deleted the mwawrzos/openai-vision branch July 10, 2024 21:30
nv-hwoo added a commit that referenced this pull request Jul 11, 2024
* POC for LLaVA support

* non-streaming request in VLM tests

* image component sent in "image_url" field instead of HTML tag

* generate sample image instead of loading from docs

* add vision to endpoint mapping

* fixes for handling OutputFormat

* refactor - extract image preparation to a separate module

* fixes to the refactor

* replace match-case syntax with if-elif-else

* Update image payload format and fix tests

* Few clean ups and tickets added for follow up tasks

* Fix and add tests for vision format

* Remove output format from profile data parser

* Revert irrelevant code change

* Revert changes

* Remove unused dependency

* Comment test_extra_inputs

---------

Co-authored-by: Hyunjae Woo <[email protected]>
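One commit above notes that the image component is sent in an "image_url" field instead of an HTML tag. A minimal sketch of such a request body, assuming the OpenAI chat completions vision format; the model name, prompt text, and image bytes are placeholders, not values from this PR:

```python
import base64

# Placeholder bytes standing in for a real image file's contents.
image_bytes = b"\x89PNG placeholder"
encoded = base64.b64encode(image_bytes).decode("utf-8")

# The image travels as an "image_url" content part holding a base64
# data URL, alongside an ordinary text part.
payload = {
    "model": "example-vlm-model",  # placeholder model name
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What is in this image?"},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/png;base64,{encoded}"},
                },
            ],
        }
    ],
}
```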
nv-hwoo added a commit that referenced this pull request Jul 18, 2024
* POC LLaVA VLM support (#720)

* POC for LLaVA support

* non-streaming request in VLM tests

* image component sent in "image_url" field instead of HTML tag

* generate sample image instead of loading from docs

* add vision to endpoint mapping

* fixes for handling OutputFormat

* refactor - extract image preparation to a separate module

* fixes to the refactor

* replace match-case syntax with if-elif-else

* Update image payload format and fix tests

* Few clean ups and tickets added for follow up tasks

* Fix and add tests for vision format

* Remove output format from profile data parser

* Revert irrelevant code change

* Revert changes

* Remove unused dependency

* Comment test_extra_inputs

---------

Co-authored-by: Hyunjae Woo <[email protected]>

* Support multi-modal input from file for OpenAI Chat Completions (#749)

* add synthetic image generator (#751)

* synthetic image generator

* format randomization

* images should be base64-encoded arbitrarily

* randomized image format

* randomized image shape

* prepare SyntheticImageGenerator to support different image sources

* read from files

* python 3.10 support fixes

* remove unused imports

* skip sampled image sizes with negative values

* formats type fix

* remove unused variable

* synthetic image generator encodes images to base64

* image format not randomized

* sample each dimension independently

Co-authored-by: Hyunjae Woo <[email protected]>

* apply code-review suggestions

* update class name

* deterministic synthetic image generator

* add typing to SyntheticImageGenerator

* SyntheticImageGenerator doesn't load files

* SyntheticImageGenerator always encodes images to base64

* remove unused imports

* generate gaussian noise instead of blank images

---------

Co-authored-by: Hyunjae Woo <[email protected]>
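The commits above describe a deterministic synthetic image generator that produces Gaussian noise and always encodes images to base64. A stdlib-only sketch of that idea (the real PR presumably uses an imaging library; the function names here are hypothetical):

```python
import base64
import random

def generate_noise_image(width, height, seed=0):
    """Fill a width x height grayscale buffer with Gaussian noise.
    Seeded, so the generator is deterministic, per the
    'deterministic synthetic image generator' commit."""
    rng = random.Random(seed)
    return bytes(
        min(255, max(0, int(rng.gauss(128, 32))))
        for _ in range(width * height)
    )

def encode_image(pixels):
    """Base64-encode raw image bytes, mirroring the
    'always encodes images to base64' commit."""
    return base64.b64encode(pixels).decode("utf-8")

b64 = encode_image(generate_noise_image(4, 4))
```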

* Add command line arguments for synthetic image generation (#753)

* Add CLI options for synthetic image generation

* read image format from file when --input-file is used

* move encode_image method to utils

* Lazy import some modules

* Support synthetic image generation in GenAI-Perf (#754)

* support synthetic image generation for VLM model

* add test

* integrate synthetic image generator into LlmInputs

* add source images for synthetic image data

* use abs to get positive int

---------

Co-authored-by: Marek Wawrzos <[email protected]>
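The "use abs to get positive int" commit above guards against sampled image dimensions coming out negative. A hypothetical helper illustrating that fix, with each dimension drawn independently from a Gaussian (names and parameters are illustrative, not from the PR):

```python
import random

def sample_dimension(mean, stddev, rng):
    """Draw one image dimension from a Gaussian; abs() plus a floor of 1
    guarantees the result is a valid positive size."""
    return max(1, abs(int(rng.gauss(mean, stddev))))

rng = random.Random(0)
width = sample_dimension(100, 50, rng)
height = sample_dimension(100, 50, rng)  # sampled independently of width
```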