Update on the development branch #2274
DanBlanaru
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Sep 30, 2024.
This update includes:
examples/nemotron_nas/README.md
.examples/phi/README.md
.finish_reason
andstop_reason
for theLLM
API.kv_cache_type
issue in the Python benchmark, thanks to the contribution from @qingquansong in Fix kv_cache_type issue #2219.Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions