Connection reset by peer when reading slugbuilder logs #225
Comments
@gabrtv any more debug info available? . events or
@gabrtv, you need to increase the idle timeout on the ELB that k8s created for you. I wish we could configure that in the router svc, but it doesn't seem we can. 1200 seconds should be enough.
@krancour I have the ELB timeout set to 600, which usually works fine.
@gabrtv ah ok then... bad guess on my part. The default is 60 seconds, and I know that's bitten a few people before.
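For anyone who lands here with the 60-second default still in place, here is a minimal sketch of raising the idle timeout on a Classic ELB from Python. It assumes boto3 is installed and AWS credentials are configured; the helper name `set_elb_idle_timeout` and the load balancer name in the usage comment are made up for illustration, and this is not something Deis does for you.

```python
def set_elb_idle_timeout(elb_client, load_balancer_name, seconds=1200):
    """Set the idle timeout (in seconds) on a Classic ELB.

    elb_client is expected to be a boto3 Classic ELB client, i.e. the
    object returned by boto3.client("elb"). It is passed in rather than
    created here so the helper can be exercised without AWS access.
    """
    if seconds <= 0:
        raise ValueError("idle timeout must be a positive number of seconds")
    return elb_client.modify_load_balancer_attributes(
        LoadBalancerName=load_balancer_name,
        LoadBalancerAttributes={
            "ConnectionSettings": {"IdleTimeout": seconds},
        },
    )


# Usage (hypothetical ELB name, requires boto3 and credentials):
#   import boto3
#   set_elb_idle_timeout(boto3.client("elb"), "my-router-elb", 1200)
```

The ELB name has to be looked up first, since Kubernetes generates it when it creates the load balancer for the router service.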
I'm almost positive this is
If you can grab the
@slack full kubelet logs for that node are here: https://gist.github.com/gabrtv/70ae044394f3491ea6cb
No smoking gun that I can see. However, there are a few unexplained reboots, with a decent amount of time where the kubelet was restarting. That could easily explain it. Unless someone else finds something relevant in the logs, I'm inclined to close the issue and chalk it up to flakiness of the underlying cluster, in which case the error is exactly what I'd expect.
Adding this to RC1
There is a series of node reboots in the logs, which would explain "connection reset by peer" if those overlapped with the log fetches.
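If the resets really are transient (a node or kubelet restarting mid-fetch), a client-side retry is the usual way to ride them out. The sketch below is only an illustration of that idea, not how the builder actually fetches logs: `fetch_with_retry` and its parameters are hypothetical, and a real HTTP client library may wrap the reset in its own exception type rather than raising `ConnectionResetError` directly.

```python
import time


def fetch_with_retry(fetch, attempts=3, delay_seconds=2.0, sleep=time.sleep):
    """Call fetch() and retry when the peer resets the connection.

    fetch is any zero-argument callable that returns the log data.
    The last ConnectionResetError is re-raised once attempts run out.
    sleep is injectable so the wait can be skipped in tests.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return fetch()
        except ConnectionResetError:
            if attempt == attempts:
                raise  # out of retries, surface the original error
            sleep(delay_seconds)
```

A fixed delay keeps the sketch short; if the kubelet is down for a while, as the logs suggest, a longer or increasing delay would be a reasonable variation.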
I don't think anyone has reported this since beta4.
First git push on a fresh kube-aws cluster gave me this:

Looks like an internal network issue talking to the Kube API server, but hard to say. Subsequent pushes seem to work fine. Filing an issue for posterity.