
Does optimum-benchmark support deepspeed_mii? #288

Open
neozhang307 opened this issue Oct 10, 2024 · 2 comments

@neozhang307

Thank you very much for your efforts in this work.

I see that you already support DeepSpeed. However, for inference workloads, DeepSpeed-MII claims better performance, as stated in its repo: https://github.com/microsoft/DeepSpeed-MII.

I am curious about its performance. How hard would it be to support this backend?

@IlyasMoutawwakil
Member

Nope, we don't, but you can submit a PR following the example in #250.

@neozhang307
Author

Let me see if I can implement it myself, but I don't know where to start.
The basic API for MII is:

import mii

# load the model into a local (non-persistent) inference pipeline
pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")

# generate up to 128 new tokens for each prompt in the batch
response = pipe(["DeepSpeed is", "Seattle is"], max_new_tokens=128)
print(response)

Can you point out some backend code that I can start from?
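
For anyone who picks this up, here is a rough sketch of the shape such a backend could take. The MIIBackend and MIIBackendConfig names, the config fields, and the load/generate split are hypothetical, chosen only to mirror how benchmark backends are commonly organized; the real base-class interface lives under optimum_benchmark/backends/ and should be checked against the PR linked above. Only the mii calls come from the snippet earlier in this comment, and the .generated_text attribute reflects how recent MII versions expose the output text, so verify it against the installed version.

# Hypothetical sketch only: class and method names below are assumptions,
# not the actual optimum-benchmark backend interface. Check the base class
# under optimum_benchmark/backends/ before implementing for real.
from dataclasses import dataclass, field
from typing import Any, Dict, List

import mii


@dataclass
class MIIBackendConfig:
    # model id is taken from the example above; generate_kwargs is a
    # hypothetical field for per-call generation options
    model: str = "mistralai/Mistral-7B-v0.1"
    generate_kwargs: Dict[str, Any] = field(default_factory=dict)


class MIIBackend:
    def __init__(self, config: MIIBackendConfig) -> None:
        self.config = config
        self.pipeline = None

    def load(self) -> None:
        # mii.pipeline loads the model and returns a callable pipeline
        self.pipeline = mii.pipeline(self.config.model)

    def generate(self, prompts: List[str]) -> List[str]:
        # the pipeline returns one response object per prompt; each one
        # carries the generated continuation as text (generated_text in
        # recent MII releases)
        responses = self.pipeline(prompts, **self.config.generate_kwargs)
        return [r.generated_text for r in responses]


if __name__ == "__main__":
    backend = MIIBackend(MIIBackendConfig(generate_kwargs={"max_new_tokens": 128}))
    backend.load()
    print(backend.generate(["DeepSpeed is", "Seattle is"]))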
