Update on the development branch #2364
kaiyux
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Oct 22, 2024.
This update includes:
Qwen2ForSequenceClassification
model architecture.BuildConfig
class so that they are aligned with thetrtllm-build
command.gpt_variant
argument to the model conversion when provided by the user ofexamples/gpt/convert_checkpoint.py
, thanks to the contribution from @tonylek in Passing gpt_variant to model conversion #2352.Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions