Using Mirage for different models #27

ramyaprabhu-alt · 2024-06-25T11:22:02Z

Hi, I just found this repository and I really like the idea. I wanted to try it out for a different model that what's on the readme for this repo, like say LLama 3 8B TP2.
But I'm a novice and am struggling to understand how the inputs in the given example must be modified

1.) From all the looking around I assume this is a prefill kernel for cl 4096 with input size of 256. am I correct?
2.) for each of the Q K and V tensors, why is the first of the input dims tuple 2 * batch size?
3.) what is 64 in the input dims tuple supposed to be? because the readme says this is a kernel for Llama 70B tp4...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using Mirage for different models #27

Using Mirage for different models #27

ramyaprabhu-alt commented Jun 25, 2024

Using Mirage for different models #27

Using Mirage for different models #27

Comments

ramyaprabhu-alt commented Jun 25, 2024