Support synthetic image generation in GenAI-Perf #754

nv-hwoo · 2024-07-17T18:33:04Z

Integrated SyntheticImageGenerator into LlmInputs
Enables users to run genai-perf with synthetic images
Use real source images for synthetic image generation, rather than random noise
Add/fix unit tests

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py

src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/synthetic_image_generator.py

src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/llm_inputs.py

debermudez · 2024-07-17T23:42:47Z

src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/synthetic_image_generator.py

        while True:
-            n = int(self.rng.normal(mean, stddev))
+            n = int(random.gauss(mean, stddev))


Can we do this using an offset or abs() instead of a loop?

Good point. Changed to using abs.

The abs corrupts the gaussian distribution. The while loop truncates it at zero. The same solution is used in SyntheticPromptGenerator:

client/src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/synthetic_prompt_generator.py

Lines 120 to 123 in 3e1dbb1

def _sample_random_positive_int(cls, mean: int, stddev: int) -> int:

random_pos_int = -1

while random_pos_int <= 0:

random_pos_int = int(random.gauss(mean, stddev))

But, I'm fine with abs+offset.

i didnt think of that when i suggested it.
We can revert if you prefer.

@mwawrzos I think this is still a valid normal distribution, but a folded one (instead of truncated). It may affect the statistics, but I'm not too sure about the consequences of the slight shift in the statistics. But at the same time, I don't think this should be a huge concern since sampling image resolutions near zero seems like an unlikely case.

@nv-braf do you have any thoughts?

src/c++/perf_analyzer/genai-perf/genai_perf/utils.py

src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/synthetic_image_generator.py

* POC LLaVA VLM support (#720) * POC for LLaVA support * non-streaming request in VLM tests * image component sent in "image_url" field instead of HTML tag * generate sample image instead of loading from docs * add vision to endpoint mapping * fixes for handling OutputFormat * refactor - extract image preparation to a separate module * fixes to the refactor * replace match-case syntax with if-elseif-else * Update image payload format and fix tests * Few clean ups and tickets added for follow up tasks * Fix and add tests for vision format * Remove output format from profile data parser * Revert irrelevant code change * Revert changes * Remove unused dependency * Comment test_extra_inputs --------- Co-authored-by: Hyunjae Woo <[email protected]> * Support multi-modal input from file for OpenAI Chat Completions (#749) * add synthetic image generator (#751) * synthetic image generator * format randomization * images should be base64-encoded arbitrarly * randomized image format * randomized image shape * prepare SyntheticImageGenerator to support different image sources * read from files * python 3.10 support fixes * remove unused imports * skip sampled image sizes with negative values * formats type fix * remove unused variable * synthetic image generator encodes images to base64 * image format not randomized * sample each dimension independently Co-authored-by: Hyunjae Woo <[email protected]> * apply code-review suggestsions * update class name * deterministic synthetic image generator * add typing to SyntheticImageGenerator * SyntheticImageGenerator doesn't load files * SyntheticImageGenerator always encodes images to base64 * remove unused imports * generate gaussian noise instead of blank images --------- Co-authored-by: Hyunjae Woo <[email protected]> * Add command line arguments for synthetic image generation (#753) * Add CLI options for synthetic image generation * read image format from file when --input-file is used * move encode_image method to utils * Lazy import some modules * Support synthetic image generation in GenAI-Perf (#754) * support synthetic image generation for VLM model * add test * integrate sythetic image generator into LlmInputs * add source images for synthetic image data * use abs to get positive int --------- Co-authored-by: Marek Wawrzos <[email protected]>

nv-hwoo added 4 commits July 16, 2024 15:14

support synthetic image generation for VLM model

5138dc9

add test

0d18d25

integrate sythetic image generator into LlmInputs

0a60daa

add source images for synthetic image data

3747472

nv-hwoo requested review from mwawrzos, debermudez and ganeshku1 July 17, 2024 18:33

github-advanced-security bot found potential problems Jul 17, 2024

View reviewed changes

src/c++/perf_analyzer/genai-perf/tests/test_llm_inputs.py Outdated Show resolved Hide resolved

nv-hwoo commented Jul 17, 2024

View reviewed changes

src/c++/perf_analyzer/genai-perf/genai_perf/llm_inputs/synthetic_image_generator.py Show resolved Hide resolved

debermudez reviewed Jul 17, 2024

View reviewed changes

use abs to get positive int

393e924

debermudez approved these changes Jul 18, 2024

View reviewed changes

nv-hwoo merged commit e7925c8 into vision-language Jul 18, 2024
5 checks passed

nv-hwoo deleted the hwoo-vlm-synthetic branch July 18, 2024 18:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support synthetic image generation in GenAI-Perf #754

Support synthetic image generation in GenAI-Perf #754

nv-hwoo commented Jul 17, 2024

debermudez Jul 17, 2024

nv-hwoo Jul 18, 2024

mwawrzos Jul 22, 2024

debermudez Jul 22, 2024

nv-hwoo Jul 23, 2024

	def _sample_random_positive_int(cls, mean: int, stddev: int) -> int:
	random_pos_int = -1
	while random_pos_int <= 0:
	random_pos_int = int(random.gauss(mean, stddev))

Support synthetic image generation in GenAI-Perf #754

Support synthetic image generation in GenAI-Perf #754

Conversation

nv-hwoo commented Jul 17, 2024

debermudez Jul 17, 2024

Choose a reason for hiding this comment

nv-hwoo Jul 18, 2024

Choose a reason for hiding this comment

mwawrzos Jul 22, 2024

Choose a reason for hiding this comment

debermudez Jul 22, 2024

Choose a reason for hiding this comment

nv-hwoo Jul 23, 2024

Choose a reason for hiding this comment