
Make throttling nmagent fetches for nodesubnet more dynamic #3023

Merged

Conversation

santhoshmprabhu (Contributor)
Reason for Change:
We previously added a fetch of secondary IPs to the NMAgent client in CNS. This PR makes that fetch asynchronous, with the fetch frequency adjusted dynamically. Specifically:

  1. The minimum and maximum intervals between fetches are configurable.
  2. The interval is doubled (frequency halved) each time no diff is seen, up to the maximum interval.
  3. The interval resets to the minimum as soon as a diff is seen.

This is implemented with a time.Ticker, wrapped in an interface so it can be mocked in tests. The PR also adds checks for empty responses from nmagent (raised in an earlier review comment), along with unit tests.

Issue Fixed:
NA

Requirements:

Notes:

@santhoshmprabhu santhoshmprabhu added cns Related to CNS. go Pull requests that update Go code labels Sep 19, 2024
@santhoshmprabhu santhoshmprabhu self-assigned this Sep 19, 2024
@santhoshmprabhu santhoshmprabhu requested a review from a team as a code owner September 19, 2024 21:28
@santhoshmprabhu santhoshmprabhu changed the title Sanprabhu/cilium node subnet intelligent refresh Make throttling nmagent fetches for nodesubnet more dynamic Sep 19, 2024
@santhoshmprabhu (Contributor, Author)

/azp run Azure Container Networking PR

Azure Pipelines successfully started running 1 pipeline(s).


@timraymond (Member) left a comment

I think I'm going to recant some of what I said in the last PR, now that I understand why we're doing this debouncing. As I mentioned during the design review, rate limits and wireserver are a fact of life, so it now feels weird to have arbitrary logic treating the nmagent client carefully because it might blow our rate limit. The design intention of the NMAgent client was always that users can call nmagent.DoSomethingForMe() and it will block while it deals with all of the weirdness that happens with NMAgent and Wireserver. This includes async response handling... and I'm thinking it should also include respecting rate limits. If a request would blow the rate limit, the WireserverTransport in the NMAgent client needs to block that request until it can succeed. Otherwise, we're just setting the user up for failure.

Doing this with tickers is definitely the right way (in particular the whole ticker resetting thing is how I've done respecting rate limits in the past). However, it should be done in https://github.com/Azure/azure-container-networking/blob/master/nmagent/internal/wireserver.go so that it's shared across the client.

Files with resolved review comments:
- cns/nodesubnet/ip_fetcher_test.go
- cns/nodesubnet/ip_fetcher.go (two comments)
- cns/nodesubnet/refreshticker.go
@santhoshmprabhu santhoshmprabhu requested a review from a team as a code owner September 24, 2024 18:50

timraymond previously approved these changes Oct 9, 2024
@santhoshmprabhu santhoshmprabhu added this pull request to the merge queue Oct 9, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 9, 2024
@santhoshmprabhu santhoshmprabhu added this pull request to the merge queue Oct 14, 2024
@santhoshmprabhu santhoshmprabhu removed this pull request from the merge queue due to a manual request Oct 14, 2024

@santhoshmprabhu santhoshmprabhu added this pull request to the merge queue Oct 14, 2024
Merged via the queue into master with commit b5046a0 Oct 14, 2024
14 checks passed
@santhoshmprabhu santhoshmprabhu deleted the sanprabhu/cilium-node-subnet-intelligent-refresh branch October 14, 2024 23:45
Labels
cns Related to CNS. go Pull requests that update Go code
4 participants