-
Notifications
You must be signed in to change notification settings - Fork 160
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running fence_scsi_check_hardreboot consumes CPU. #313
Comments
You could try setting |
Hi Oyvind, Our environment is RHEL8.0, and fence_scsi seems to support verbose=yes. I tried to set verbose=yes in the fence_scsi parameter, but it seems that information is not output especially to pacemaker.log. Best Regards, |
It might also be in corosync.log or /var/log/messages. If you try to run it manually though it should be shown on your screen immediately. |
Hi Oyvind, Thanks for your comment. Best Regards, |
Hi Oyvind, Since the specification of the verbose option cannot be performed well, I forcibly changed the code of fence_scsi and enabled and executed the verbose, but it did not seem to get much useful information.
Also, it seems that the same high CPU load occurs when using the watchdog service with fence_mpath. I will investigate the cause a little more. Best Regards, |
Maybe there's some watchdog setting for tuning priority of the process? |
Hi Oyvind,
Yes. In the environment in question, the default settings in /etc/watchdog.conf are as follows:
Many thanks, |
I would try changing the priority to see if that helps. |
Hi Oyvind,
I'll give it a try.... But... I changed the priority to 50 or 99 and restarted the watchdog service, but it seems that the CPU usage of fence_scsi_check_hardreboot does not change. It seems that you can confirm that the CPU usage rises simply by the following command line.
I think this improvement seems to be difficult for python import processing. Best Regards, |
Yeah. I dont know how we can improve that. |
Hi Oyvind, I think a little more about improvement. It may be the right conclusion that this improvement is difficult in Python. Best Regards, |
Hi All,
Configure a cluster using fence_scsi in a virtual environment to which only one CPU core is allocated.
When fence_scsi_check_hardreboot is used together with the watchdog service to configure the pacemaker cluster, fence_scsi_check_hardreboot uses 20% of the CPU every second.
When this happens, pacemaker frequently outputs the following log.
Some improvement can be achieved by increasing the number of CPU cores or increasing the monitoring interval of the watchdog service.
However, some users may not be able to change core assignments. Increasing the monitoring interval also affects the failover time when a failure occurs.
Is there any way to improve the fence_scsi_check_hardreboot script to solve the problem?
(Can make the processing of fence_scsi_check_hardreboot a little lighter?)
Best Regards,
Hideo Yamauchi.
The text was updated successfully, but these errors were encountered: