-
Notifications
You must be signed in to change notification settings - Fork 121
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature: React to node changes #182
Comments
Jotting down some of my thoughts around implementing this feature:-
|
This feature is hugely important to our project. Our new nodes need to be able to start caching images immediately on launch. |
Thinking about my use case, any delay to caching images would be undesirable, so I'd be keen to see a job created as a one off on the immediate detection of a new node. Once this job has completed, either success or failure, maybe add a label to the node which would allow the controller to use it regularly going forward. I wouldn't worry about the node being unschedulable as a special case here, as I that could happen to any node at any time, so should be something you cater for at all times anyway. As for the status message, a quick one off message should be fine. If the users are doing any logging then hopefully they are tracking and logging all messages on the resource, and not just the latest. |
The other approach, which would be a big change in strategy, would be for each node to have a separate timer, rather than all thr jobs be created at once. Arguably good from a network bandwith perspective, but you'd need something like a daemonset to have the independence to time each node. It wouldn't answer your question of status message on the cache resource though unless you deliberately listed each node's status independently as a different line in the message. |
@ChevronTango We implemented a simple prototype for this: |
Ideally I do not need to periodically refresh images at all since we tag our images, however what I would need is the images to be cached to new nodes as they join the cluster. This does not happen if the image refresh time is set to 0s to disable periodic refreshing.
The text was updated successfully, but these errors were encountered: