LlmInputs - Add ability to synthetically generate inputs #506

Merged (5 commits) on Mar 11, 2024

@nv-braf nv-braf commented Mar 8, 2024

Adds a new class to synthetically generate an input JSON. Unit tests cover the new library, and end-to-end tests cover synthetic -> vllm, trtllm, and both OpenAI endpoints.
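
For readers without the diff at hand, the general idea of synthetic prompt generation can be sketched as below. This is an illustration only, not the class added in this PR: the function name, its parameters, and the assumption that one whitespace-separated word equals one token are all hypothetical.

```python
import random
from typing import List, Optional


def generate_synthetic_prompt(
    corpus_words: List[str],
    mean_tokens: int,
    stddev_tokens: int,
    rng: Optional[random.Random] = None,
) -> str:
    """Build a prompt of roughly `mean_tokens` tokens from random corpus words.

    Assumption: one whitespace-separated word counts as one token. A real
    tokenizer would report different counts.
    """
    if not corpus_words:
        raise ValueError("corpus_words must not be empty")
    rng = rng or random.Random()
    # Sample the target length from a normal distribution, clamped to >= 1.
    target = max(1, round(rng.gauss(mean_tokens, stddev_tokens)))
    # Draw words with replacement until the target length is reached.
    words = [rng.choice(corpus_words) for _ in range(target)]
    return " ".join(words)


# Hypothetical corpus; any source text split into words would do.
corpus = "the quick brown fox jumps over the lazy dog".split()
prompt = generate_synthetic_prompt(
    corpus, mean_tokens=12, stddev_tokens=0, rng=random.Random(0)
)
```

Passing a seeded `random.Random` keeps the generated inputs reproducible across runs, which is convenient for unit tests.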

@nv-braf nv-braf requested review from debermudez and tgerdesnv March 8, 2024 21:22
@nv-braf nv-braf requested a review from tgerdesnv March 8, 2024 22:35
@@ -0,0 +1,104 @@
The period for a new election of a citizen to

High level, this all looks great, thanks @nv-braf.

import random
from typing import List, Tuple

from transformers import LlamaTokenizerFast

Eventually we will add the ability to specify a tokenizer from the CLI, but for now the metrics use gpt2.
Can we update this one to use gpt2 as well?


I am also okay with this being a TODO for now
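
One way to keep the tokenizer swap discussed in this thread cheap is to pass the token counter in as a parameter. The sketch below is a hypothetical illustration, not code from this PR: `TokenCounter`, `whitespace_token_count`, and `fill_to_token_budget` are invented names, and the whitespace counter is only a stand-in for a real tokenizer.

```python
from typing import Callable, List

# Hypothetical type alias: any function that maps text to a token count.
TokenCounter = Callable[[str], int]


def whitespace_token_count(text: str) -> int:
    """Stand-in counter that treats each whitespace-separated word as a token."""
    return len(text.split())


def fill_to_token_budget(
    sentences: List[str],
    min_tokens: int,
    count_tokens: TokenCounter = whitespace_token_count,
) -> str:
    """Concatenate sentences in order until the counter reports >= min_tokens."""
    prompt = ""
    for sentence in sentences:
        if count_tokens(prompt) >= min_tokens:
            break
        prompt = f"{prompt} {sentence}".strip()
    return prompt


sentences = ["one two three", "four five", "six seven eight nine"]
short_prompt = fill_to_token_budget(sentences, min_tokens=4)

# With the transformers package installed, a gpt2-based counter could be
# passed in instead (not executed here because it downloads model files):
#   from transformers import AutoTokenizer
#   tok = AutoTokenizer.from_pretrained("gpt2")
#   fill_to_token_budget(sentences, 4, lambda s: len(tok.encode(s)))
```

With this shape, moving from one tokenizer to another, or later to a CLI-selected one, changes only the argument passed as `count_tokens`.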

@debermudez debermudez merged commit a615c6e into feature-genai-pa Mar 11, 2024
3 checks passed
@debermudez debermudez deleted the llm-inputs-synthetic branch March 11, 2024 18:21
debermudez pushed a commit that referenced this pull request Mar 12, 2024
* First cut of synthetic prompt generation class

* Enabled synthetic prompt generation w/ unit testing to all endpoints.

* Creating a new text input for synthetic generation

* Fixes based on Tim's comments

* Adding GPT2 todo
debermudez pushed a commit that referenced this pull request Mar 13, 2024
mc-nv pushed a commit that referenced this pull request Mar 13, 2024