-
Notifications
You must be signed in to change notification settings - Fork 554
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Partitions (backed by network storage) disappear if network is unavailable for more than 5 seconds #9706
Comments
Please provide some logs to understand how does the partition disappear in your case. |
So this is quite expected, it has nothing to do with mounting (at least until there's enough logs to prove the opposite). The partition is mounted, but as it's a network disk, any operation would be broken if the network is unreliable. Talos works without issues e.g. on AWS/EBS volumes, so the network volume should be made reliable enough first. |
but why did the partition not show up again after the network connectivity was re-established? |
I don't know. There are zero logs on partitions being unmounted (it shouldn't be). You can grab kernel logs with |
I'll try to see if I can recreate it in a lab environment and then get the logs. |
Bug Report
Given Talos OS disk is provided from network storage
when network storage is unavailable for more than 5 seconds
then partitions disappear.
For example the /var partition disappeared at the node. The partition was available again after reboot.
Description
It could be related to the hardcoded timeout of 5 seconds in the mount.go :
It is not clear if other timeouts can cause the partition to disappear as well.
If the mount function runs in a reconciliation loop then it is probably the right place to fix the issue.
Alternative could be looking into the general configuration the XFS filesystem to handle errors using the
max_retries
andretry_timeout_seconds
andaction
XFS mount options.Logs
Disk I/O timeouts are seen in logs.
Environment
The text was updated successfully, but these errors were encountered: