High inference lantency #320

Altimis · 2024-11-11T12:38:26Z

I fine-tuned Donut on custom data, but inference takes 15 seconds on CPU (8 cores). I realized the generate() function is the one that takes too long. Are there any improvements that could be made to reduce this latency please ? Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

High inference lantency #320

High inference lantency #320

Altimis commented Nov 11, 2024

High inference lantency #320

High inference lantency #320

Comments

Altimis commented Nov 11, 2024