We just observed this behavior, and in the logs I found this error: `error running SLAVE OF command: dial tcp 10.138.59.180:9999: i/o timeout`, so I assume one of the following happened:
network issue
dragonfly main/networking thread blocked
dragonfly crashed without killing the process
Because of this, I would like to suggest the following changes:
Check via redis client that the operator can talk to the new master before promoting it
Check via redis client that the operator can talk to each of the (now) replicas before making it a replica of the new master
Kill the pod if the operator can't talk to it after X tries (configurable? with 0 meaning "do not kill it"?)
Regarding this:
dragonfly-operator/internal/controller/dragonfly_instance.go
Line 116 in 64cfcba
and this:
dragonfly-operator/internal/controller/dragonfly_instance.go
Line 117 in 64cfcba