Add pre-generated prompts option for benchmark #1091
Conversation
Force-pushed f8d1bdc to 0b19f52
Force-pushed 0b19f52 to 243e7ea
@@ -232,6 +240,9 @@ def run_benchmark(args, batch_size, prompt_length, generation_length, max_length
         # use random tokens instead of generating a prompt using the model and then tokenizing it
         tokens = np.random.randint(100, size=(batch_size, prompt_length))
         prompt = [tokenizer.decode(tokens[0])] * batch_size
+    elif args.use_prompt_set:
+        prompt = get_prompt_by_length(prompt_length)
+        tokens = tokenizer.encode_batch(prompt)
Different tokenizers can encode prompts into different prompt lengths. Some additional work is needed to get the desired prompt length. You can see an example of how to do this here.
So basically, we will check the encoded token length against the requested prompt_length and pad or trim as needed. Is that correct?
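For illustration, a minimal sketch of that check (the helper name and the pad_token_id default are placeholders, not part of the PR; the tokenizer encode/decode calls follow the usage already in the diff):

```python
import numpy as np

def adjust_to_prompt_length(tokenizer, prompt, prompt_length, batch_size, pad_token_id=0):
    """Trim or pad the encoded prompt so it has exactly prompt_length tokens.

    pad_token_id is an assumption for illustration; the real script would take
    it from the model configuration.
    """
    tokens = np.asarray(tokenizer.encode(prompt))      # token ids for one prompt
    if len(tokens) > prompt_length:
        tokens = tokens[:prompt_length]                # trim extra tokens
    elif len(tokens) < prompt_length:
        pad = np.full(prompt_length - len(tokens), pad_token_id, dtype=tokens.dtype)
        tokens = np.concatenate([tokens, pad])         # pad up to the requested length
    tokens = np.tile(tokens, (batch_size, 1))          # replicate across the batch
    prompt = [tokenizer.decode(tokens[0])] * batch_size
    return prompt, tokens
```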
@@ -83,6 +83,14 @@ def generate_prompt(model, tokenizer, prompt_length, use_graph_capture) -> str:
         generator.generate_next_token()
     return tokenizer.decode(generator.get_sequence(0))

+# Use prompt length to get pre-defined prompt
+def get_prompt_by_length(prompt_length):
+    json_path = "prompts.json"
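The hunk is truncated after json_path; a rough sketch of how the rest of get_prompt_by_length could look, assuming prompts.json maps prompt lengths (as string keys) to prompt text, which this diff does not confirm:

```python
import json

# Use prompt length to get pre-defined prompt
def get_prompt_by_length(prompt_length):
    # Assumed schema: {"16": "...", "64": "...", ...} keyed by prompt length.
    json_path = "prompts.json"
    with open(json_path, encoding="utf-8") as f:
        prompts = json.load(f)
    return prompts[str(prompt_length)]
```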
Instead of uploading another copy of prompts.json, can we download it from here and save it to disk using requests or urllib? That way, only one location has to be updated when adding other prompts.
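A sketch of that download-and-cache idea using urllib; the URL here is a placeholder, since the actual location of prompts.json is only given by the link referenced above:

```python
import os
import urllib.request

# Placeholder URL; the real link is the one referenced in the comment above.
PROMPTS_URL = "https://example.com/onnxruntime-genai/prompts.json"

def ensure_prompts_file(json_path="prompts.json"):
    """Download prompts.json once and reuse the cached local copy afterwards."""
    if not os.path.exists(json_path):
        urllib.request.urlretrieve(PROMPTS_URL, json_path)
    return json_path
```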
Would it make sense to rely on an external file? If that file changes, we may have a problem here. Besides, in a benchmark environment it seems logical to use local files instead of relying on internet connectivity. What do you think?
During benchmarking, we wanted to have pre-generated prompts that have been prepared for better benchmark results, so they come in handy here. In our tests, we wanted to focus only on token generation and sampling on the SLM.
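For context, the new option would presumably be exposed as a boolean flag next to the existing random-token switch; a hedged sketch, with the flag names inferred from the args attributes in the diff and the help text assumed:

```python
import argparse

parser = argparse.ArgumentParser(description="onnxruntime-genai end-to-end benchmark")
parser.add_argument(
    "--use_random_tokens",
    action="store_true",
    help="Use random token ids instead of generating a prompt with the model",
)
parser.add_argument(
    "--use_prompt_set",
    action="store_true",
    help="Use a pre-generated prompt from prompts.json selected by prompt length",
)
args = parser.parse_args()
```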