[EKS] [Managed Workers]: Send kubelet logs to CloudWatch #903
Comments
Hi folks, we now have the ability to solve the above ask. Check out this blog to learn more - https://aws.amazon.com/blogs/containers/fluent-bit-integration-in-cloudwatch-container-insights-for-eks/
I don't see how that addresses the above.
Is this already doable?
We had an EKS node unexpectedly change its status.
The same situation happened to us recently.
Did you solve this, @joeynaor @michaelswierszcz? I'm thinking it's related to aws/amazon-vpc-cni-k8s#2808.
We ended up scraping kubelet logs with our existing logging infrastructure (fluent-bit -> loki).
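For anyone wanting the same approach but shipping straight to CloudWatch (the original ask here), a minimal Fluent Bit sketch would tail the kubelet systemd unit and forward it with the `cloudwatch_logs` output plugin. The region and log group/stream names below are placeholders, and the node's instance role needs `logs:CreateLogGroup`, `logs:CreateLogStream`, and `logs:PutLogEvents`:

```ini
# Sketch only: read kubelet's journald entries and ship them to CloudWatch Logs.
[INPUT]
    Name            systemd
    Tag             kubelet.*
    Systemd_Filter  _SYSTEMD_UNIT=kubelet.service
    Read_From_Tail  On

[OUTPUT]
    Name               cloudwatch_logs
    Match              kubelet.*
    region             us-east-1
    log_group_name     /eks/my-cluster/kubelet
    log_stream_prefix  node-
    auto_create_group  On
```

Running Fluent Bit as a DaemonSet with the host journal mounted means logs keep flowing even when nodes are short-lived, which sidesteps the autoscaler problem mentioned below.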
@tooptoop4 Our workaround was to disable "delete on terminate" for the EKS nodes' disks. After an incident, we mounted the disk of the faulty node to a regular EC2 instance and inspected the logs. In our case, the only incident since was caused by a hardware failure on AWS's side, confirmed by AWS support.
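For reference, disabling delete-on-termination is a launch-template setting. A sketch of the relevant launch template data fragment is below; the device name is AMI-dependent (`/dev/xvda` is typical for Amazon Linux EKS AMIs, so treat it as a placeholder):

```json
{
  "BlockDeviceMappings": [
    {
      "DeviceName": "/dev/xvda",
      "Ebs": { "DeleteOnTermination": false }
    }
  ]
}
```

The trade-off is that every terminated node leaves an orphaned EBS volume behind, so you'd want a cleanup process if nodes churn frequently.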
Community Note
Tell us about your request
Would be great for kubelet / other managed worker node logs to be sent to CloudWatch.
Which service(s) is this request for?
EKS (Managed worker nodes)
Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
Developers are already using the control plane logs (specifically the audit log) to assist in debugging, but occasionally the platform team has to step in and SSH into worker nodes to pull kubelet logs.
It would be super helpful if these were sent to CloudWatch like the control plane logs.
Are you currently working around this issue?
The workaround is to SSH into the worker node, but obviously this has some limitations; for example, when using cluster-autoscaler the node might not live for very long.
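A sketch of that workaround, assuming the node has the SSM agent and an instance profile with `AmazonSSMManagedInstanceCore` (so no SSH keys or open ports are needed); `<instance-id>` is a placeholder:

```shell
# Open a shell on the node via Session Manager instead of SSH.
aws ssm start-session --target <instance-id>

# On the node: EKS AMIs run kubelet as a systemd unit, so pull its logs
# from the journal for the window around the incident.
journalctl -u kubelet --since "1 hour ago" --no-pager
```

This still only works while the node exists, which is exactly why streaming kubelet logs to CloudWatch, as requested above, would be preferable.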