Thanks for creating this implementation. I've tried running it in a Google Colab Pro notebook, but the session keeps crashing due to maxing out the RAM. Do you have any sense of how much RAM is needed to run the model? Thanks!
~40 GB of VRAM to load the model, and ~45 GB to run inference at the full 2048-token context length. You also need roughly twice that in CPU RAM (~81 GB), because the model is currently instantiated on the CPU in fp32, then converted to fp16 and uploaded to VRAM. Not sure if that's intended behaviour; it looks like it's supposed to create meta tensors instead, but that's not actually working.
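For reference, here's a minimal sketch of the meta-tensor approach being described, using plain PyTorch. This is not the repo's actual loading code, and the module here is just a stand-in; the point is that parameters created on the `meta` device carry only shape/dtype metadata, so the fp32 copy never occupies CPU RAM:

```python
import torch

# Construct the model skeleton on the meta device: no real storage is
# allocated, so CPU RAM usage stays flat regardless of model size.
with torch.device("meta"):
    model = torch.nn.Linear(4096, 4096)  # stand-in for the real model

# Materialize uninitialized storage directly on the GPU and cast to fp16.
# Real weights would then be loaded straight into these tensors from a
# checkpoint (e.g. load_state_dict(..., assign=True) in recent PyTorch),
# skipping the intermediate fp32 copy on the CPU entirely.
model = model.to_empty(device="cuda").half()
```

If the loader instead builds the model eagerly on the CPU and only then calls `.half().cuda()`, you get exactly the ~2x CPU RAM spike described above.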