Auto cache when new node added to cluster #213

maacarbo · 2023-04-19T14:23:58Z

In AWS EKS, we intensively use auto scaling clusters. It would be handy if the controller knows when a new node is spin up and directly starts to cache the images.

leonidkhelemes · 2023-04-21T15:48:06Z

+1

elocke · 2023-05-03T18:50:17Z

+1 I kindof expected this already happened.

jaihwan104 · 2023-05-08T06:55:13Z

+1

djmcgreal-cc · 2023-05-11T09:32:16Z

My exact question, the top issue in the list!

This is likely a major use case in Machine Learning where a) GPUs are more expensive so typically scale often and b) images are large.

In this auto-scale-up case, Pods are waiting to be scheduled immediately so will probably not be able to take advantage of the kube-fledged cache refresh to load images into the new node (which I assume at least works?). Perhaps kube-fledged could be configured to manage a taint on newly provisioned nodes that's removed when images have been loaded from the cache. In cluster-autoscaler, taints can be prefixed with ignore-taint.cluster-autoscaler.kubernetes.io/ so they do not effect auto scaling groups selection.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Auto cache when new node added to cluster #213

Auto cache when new node added to cluster #213

maacarbo commented Apr 19, 2023 •

edited

Loading

leonidkhelemes commented Apr 21, 2023

elocke commented May 3, 2023

jaihwan104 commented May 8, 2023

djmcgreal-cc commented May 11, 2023

Auto cache when new node added to cluster #213

Auto cache when new node added to cluster #213

Comments

maacarbo commented Apr 19, 2023 • edited Loading

leonidkhelemes commented Apr 21, 2023

elocke commented May 3, 2023

jaihwan104 commented May 8, 2023

djmcgreal-cc commented May 11, 2023

maacarbo commented Apr 19, 2023 •

edited

Loading