LatestGPTQ Branch:
United Branch:
Both tests were run with the same model and the same token context.
Generation speed seems unaffected, but the united implementation seems to take a lot longer to "process" the tokens.
Until this issue is fixed, I will stay on latestGPTQ.