You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I measure time to generate each token for microsoft/Phi-3-mini-4k-instruct model.
I use prompt size=512 and generation size=128, running on XPU. model precision: fp16.
Ipex v2.3.1: average time per token ~94 ms
Ipex v2.5.1: average time per token ~90 ms
Should I expect higher improvement in token time generation in ipex v2.5.1?
The text was updated successfully, but these errors were encountered:
Describe the issue
Hi,
I measure time to generate each token for
microsoft/Phi-3-mini-4k-instruct
model.I use prompt size=512 and generation size=128, running on XPU. model precision: fp16.
Should I expect higher improvement in token time generation in ipex v2.5.1?
The text was updated successfully, but these errors were encountered: