When cuDF spilling is enabled, we may eventually hit allocation errors once 100% of the memory is unspillable. The information about cuDF spilling memory usage is very useful, but failures may be delayed to a point where that information no longer helps us understand how we got there.
With that said, a dashboard showing both cuDF's device memory consumption and spilled (host) memory usage in real time would help us understand where we have pressure. Being able to see the history of cuDF memory usage would be a definite bonus, as it would let us see what happened even if workers die abruptly.