-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nexus segfault when there are offline CPUs #31
Comments
BTW, the code in |
Hi, Yilong. The approach suggested in this issue would be nice to have in eRPC. Machines with offline CPUs are uncommon IMO, so this is a low-priority task for us. We would welcome a patch. As a temporary workaround, you might hard-code the core for the session management thread. Or, you might delete the core pinning for this thread altogether. The session management thread has near-zero CPU use when sessions aren't being actively created or destroyed, so my hope is that disabling core pinning won't affect performance. |
I had hyperthreading turned off so half of the CPUs were offline. I agree that machines with offline CPUs are rare in production but it's quite convenient for doing experiment. Anyway, I might submit a patch if this becomes more problematic for me. Thanks. |
Ah - I didn't think of the HT-disabled case. That's a scenario that we would like to support. |
I got a segfault when running the
create_session_test
at the following line:eRPC/src/nexus_impl/nexus.cc
Line 63 in dff4589
The problem is that
sm_thread_lcore_index
is assigned to be the last lcore at line 61 without considering its status whileget_lcores_for_numa_node
returns only online lcores.The text was updated successfully, but these errors were encountered: