I noticed a curious phenomenon: while Marco-o1 is running inference, memory usage on the last GPU keeps growing until it is about to OOM, at which point all of the intermediate computation tensors are released and only the initial model remains in memory. Could you tell me how this is done?
Thanks in advance.
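For reference, one common PyTorch pattern that produces this kind of behavior is to watch the caching allocator and release cached blocks once usage nears the device limit. The sketch below is only an illustration of that general pattern, not a claim about how Marco-o1 actually implements it; the function name `maybe_free_gpu_memory` and the `threshold` value are assumptions, while the `torch.cuda` calls themselves are standard API.

```python
import gc
import torch

def maybe_free_gpu_memory(device: int = 0, threshold: float = 0.9) -> None:
    """Release cached GPU memory once reserved memory nears the device limit.

    Hypothetical helper: the 0.9 threshold is an assumed value, not taken
    from the Marco-o1 code base.
    """
    total = torch.cuda.get_device_properties(device).total_memory
    reserved = torch.cuda.memory_reserved(device)
    if reserved / total >= threshold:
        # Drop Python references to intermediate tensors, then return
        # cached blocks to the driver; persistent allocations such as the
        # model weights stay resident.
        gc.collect()
        torch.cuda.empty_cache()
```

Called between generation steps, a helper like this would keep only the model parameters resident while transient activation and KV-cache tensors are reclaimed, which would match the behavior described above.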
Excellent work! Marco-o1 really supports long-term self-thinking!