Skip to content

Issues: NVIDIA/TensorRT-LLM

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

TRT-LLM fails on GH200 node bug Something isn't working
#2571 opened Dec 12, 2024 by ttim
4 tasks
Support for LLaMa3.3
#2567 opened Dec 11, 2024 by FernandoDorado
InternVL deploy
#2565 opened Dec 11, 2024 by ChenJian7578
What does "weights_scaling_factor_2" mean in safetensor results of awq_w4a8 Investigating Low Precision Issue about lower bit quantization, including int8, int4, fp8 triaged Issue has been triaged by maintainers
#2561 opened Dec 11, 2024 by gujiewen
nccl hung
#2560 opened Dec 11, 2024 by akhoroshev
int8 slower than bf16 on A100 bug Something isn't working Investigating Low Precision Issue about lower bit quantization, including int8, int4, fp8 triaged Issue has been triaged by maintainers
#2553 opened Dec 9, 2024 by ShuaiShao93
4 tasks
Performance issue with long context bug Something isn't working
#2548 opened Dec 6, 2024 by ShuaiShao93
4 tasks
Qwen2-VL FP8/INT8 Quantization
#2546 opened Dec 6, 2024 by MrD005
trtllm-bench faild bug Something isn't working
#2545 opened Dec 6, 2024 by dingjingzhen
2 of 4 tasks
Encoding error in stream response from Triton server bug Something isn't working
#2544 opened Dec 6, 2024 by Wonder-donbury
3 of 4 tasks
lora doesn't work when kv_cache is disabled Investigating Lora/P-tuning triaged Issue has been triaged by maintainers
#2543 opened Dec 5, 2024 by ShuaiShao93
4 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.