-
Notifications
You must be signed in to change notification settings - Fork 413
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Block RHCOS gcp-routes service on both masters and workers #3619
Block RHCOS gcp-routes service on both masters and workers #3619
Conversation
WDYT about moving gcp-routes into something managed by https://github.com/openshift/cluster-network-operator ? |
Oh wow, I hadn't realized we had duplicated this. I think we can just nuke the script from RHCOS right? |
From what I can tell, it was left behind on purpose because we believed that it was necessary to run it on the bootstrap host. But yeah, probably inheriting this from RHCOS and then disabling it everywhere except the bootstrap host is not the best idea... How could we test this? There is no simple way to build an OCP image which will use a patched version of RHCOS at bootstrap time is there?
OK, so a few parts to that:
|
It's all very doable with https://github.com/coreos/coreos-assembler then use the override env var for the installer. That said, I think if we want to just test this being disabled on the bootstrap, it's a lot easier to patch the Ignition generated by openshift-install for the bootstrap node. |
OK, done. It fails. So it is needed on bootstrap. So... should we leave it in RHCOS and merge this PR to disable it in OCP, or would it be better to remove it from RHCOS and get it onto the bootstrap via some other means (something in the installer I guess?) |
Thanks for testing that. Here's my thoughts:
So from my PoV, |
/retest-required |
/retest-required (I've confirmed with cluster-bot that the PR works. Tests are just being flaky...) |
/retest |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: cgwalters, danwinship The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/retest-required |
@danwinship: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/skip |
/retest-required |
(After digging into this some more...) It turns out that So, if they aren't needed on the bootstrap host, then it must be the case that there are no clients running on the bootstrap host that use the apiserver LB? |
RHCOS includes a
gcp-routes.service
that is (allegedly?) needed on the bootstrap host, but which we replace with a better version on the masters. However, we were still leaving it running on workers, which should be a no-op, but is actually theoretically bad because of bugs / unwanted design features in the script. (https://issues.redhat.com/browse/OCPBUGSM-45189)