Significant Speed Regression on P40 compared to United #60

wereretot · 2023-08-09T22:15:06Z

LatestGPTQ Branch:

United Branch:

Both tests were run with the same model and the same token context.
Generation speed seems unaffected, but the United implementation takes much longer to "process" the prompt tokens.
Until this issue is fixed, I will stay on latestGPTQ.
