Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[proposal] cpu burst should also burst some pods when the node util nears threshold #2164

Open
zwzhang0107 opened this issue Aug 8, 2024 · 0 comments
Labels
area/koordlet help wanted Extra attention is needed kind/proposal Create a report to help us improve
Milestone

Comments

@zwzhang0107
Copy link
Contributor

zwzhang0107 commented Aug 8, 2024

What is your proposal:
when the utilization around cpuBurst.sharePoolThresholdPercent, koordlet should not reset all but try to select some pods and burst them.

Why is this needed:

cpu burst strategy is proposed to solve problems of cpu resource throttling for pods like java app, which needs much resource than its resource.limit during start time.

When there are multi pods start at the same time, the strategy may not handle wll. Since now if node utilization > cpuBurst.sharePoolThresholdPercent, all bursted cfs_quota will be reset, which may make the node utilizaiton << cpuBurst.sharePoolThresholdPercent. koordlet will try burst again util the cooling duration finished.

This leads to none pods could get burst in long time and stuck at starting stage.

here are some related issues: kubernetes/kubernetes#3312

Is there a suggested solution, if so, please add it:
Only reset PART of over-utilized pods when the node.util > cpuBurst.sharePoolThresholdPercent to avoid none pod could get burst.
Only burst SOME throttled pods when node.util < cpuBurst.sharePoolThresholdPercent to avoid node util too high.

@zwzhang0107 zwzhang0107 added area/koordlet kind/proposal Create a report to help us improve labels Aug 8, 2024
@zwzhang0107 zwzhang0107 added this to the v1.6 milestone Aug 8, 2024
@zwzhang0107 zwzhang0107 added the help wanted Extra attention is needed label Aug 8, 2024
@zwzhang0107 zwzhang0107 changed the title [proposal] cpu burst should also burst some pods when the node util nears cpuBurst.sharePoolThresholdPercent [proposal] cpu burst should also burst some pods when the node util nears threshold Aug 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/koordlet help wanted Extra attention is needed kind/proposal Create a report to help us improve
Projects
None yet
Development

No branches or pull requests

1 participant