-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dataplane not getting information from the ControlPlane in Hybrid mode #13076
Comments
@lays147 Thanks for your report. Seems the file of LMDB is corrupted. Now Kong is using LMDB to store entities on the data_plane. Do you have some deployment that might share/change the |
And do you have steps to help us to reproduce this issue locally? |
No, there's no configuration changes between deployments.
So, the scenario is like I told on the bug report. I have one control plane, and two data planes in my production environment running on ECS. The autoscaling configuration(based on cpu and memory consumption) is very sensitive, so it scales out very drastically(3-to-5). And when this happens, some or all of the new containers have one or more errors like the ones that I added, and does not load the configuration from the control plane. The containers appear to be in a deadlock. I don't know if this can be reproduced locally. I might throw a hand, and maybe you can try using some Kubernetes stuff like k6, adding very little resources to the dataplane node, and forcing it to scale in the cluster and see if these issues happen. The problem happens on trying to add new data plane nodes to an existent environment. One thing that might stabilize my services, is changing the health check from the |
The first 2 errors looks like a network glitch and if it does not persists it should be fine to ignore them; |
@StarlightIbuki I don't have access to the underlying infrastructure to check that file. And apparently, after changing the health check, it fixed this issue for me on the events of autoscaling. |
Good to know it works for you. |
Is there an existing issue for this?
Kong version (
$ kong version
)3.6.1
Current Behavior
I have a Kong Cluster running in AWS ECS in Hybrid Mode.
The connection between the DP and CP is made with AWS Service Discovery/Cloudmap.
When the cluster scales, the new dataplane node appears to be unable to retrieve the information from the Control Plane.
I have a handful of different type of errors, which I don't know where to investigate further, since Kong is not my expertise.
Within these cases, the Dataplane appears to not be able to retry the connections with de Control Plane, so I need to force the restart of the CP, so the new nodes can update themselves.
I require your guidance to help to investigate this problem, since this is causing disruptions in our production environment.
Expected Behavior
The new nodes of Data Plane should get the info from the Control Plane without issues.
The text was updated successfully, but these errors were encountered: