-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Removing a machine that failed to add properly leaves internal tokens behind #256
Comments
Looking specifically in microovn and microceph, the machine was also present in the |
Removing nodes is not fully supported at the moment. The For each app, you can use sql like you did for microcloud, except with LXD it's got the command |
Thank you, this seems to have done the trick. I am still unable to successfully add the node, and with each try I have to go through the procedure you provided, remove and purge the snaps on the new node, reboot, and try again. The error I am currently getting is a context deadline exceeded, which is not super useful. It would be great that if the |
At what point are you receiving that error? What's the prior output? Did the node fail to join any cluster? Does it appear in |
I do not have the output anymore, but it is currently showing in the cluster list for everything except lxd. |
OK then it looks like LXD is not cleaning up properly after a failed join. I bet it's connected to canonical/lxd#12624. It's probably salvageable if you manually add some dummy entries into the LXD database corresponding to the failed node with the sql command and then call There's a couple tables that are important here, notably |
I tried to add a new machine to my microcloud cluster, and it failed because of the networking configuration. However, microcloud still saw the new machine as a cluster member, so I could not try to add it back again.
I ran
sudo microcloud cluster remove <name> --force
and tried to add the machine again, and got the error:I poked around a bit and tried to manually remove the token:
sudo microcloud sql "DELETE FROM internal_token_records WHERE name='<name>'"
Running a
SELECT
afterwards shows that the token is gone. However, trying to add the machine again fails with the same error, and a new token is present in the table.Running the add command with debug and verbose does not give more information. I can also see the machine's IP in
/var/snap/microcloud/common/state/database/cluster.yaml
.The state that is left behind prevents me from trying again to add that machine.
The text was updated successfully, but these errors were encountered: