Decrease accuracy when running llava model #2541

nghoaithuong · 2024-12-05T11:30:57Z

I successfully converted the model to TensorRT-LLM by following the tutorials. However, when I evaluated the accuracy of this model on both the A100 device and Jetson Orin AGX, the accuracy decreased significantly (from 97% to 93%) when using FP16/FP8 precision and (from 97% to 89%) when using INT4.
Do you have any suggestions for resolving this issue?

TriDefender · 2024-12-05T19:17:23Z

Use higher precison

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decrease accuracy when running llava model #2541

Decrease accuracy when running llava model #2541

nghoaithuong commented Dec 5, 2024

TriDefender commented Dec 5, 2024

Decrease accuracy when running llava model #2541

Decrease accuracy when running llava model #2541

Comments

nghoaithuong commented Dec 5, 2024

TriDefender commented Dec 5, 2024