[wip] #2411

kyrtapz · 2025-01-08T13:24:12Z

No description provided.

Handle host-network pods as default network. Don't return per-pod errors on startup. Remove nadController from UDNHostIsolationManager as we don't use it anymore to find pod's UDN based on NADs that exist in the namespace. Signed-off-by: Nadia Pinaeva <[email protected]>

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

udn host isolation: fix initialSync.

…face Signed-off-by: Martin Kennelly <[email protected]>

This code isnt being used anymore. We dont expect users to upgrade directly from code which contained the legacy LRPs, therefore its safe to remove. Signed-off-by: Martin Kennelly <[email protected]>

Signed-off-by: Martin Kennelly <[email protected]>

In an unlikely scenario where the service doesn't exist and there was an issue getting the current active network the code should not use the service object for the returned error. Signed-off-by: Patryk Diak <[email protected]>

Previously, if the services controller failed to start it would not be retried. Signed-off-by: Patryk Diak <[email protected]>

L2 UDN: EgressIP hosted by primary interface (`breth0`)

If EncapIP is configured, it means it is different from the node's primary address. Do not update EncapIP when node's primary address changes. Signed-off-by: Yun Zhou <[email protected]>

Assign network ID from network manager running in cluster manager. The network ID is included in NetInfo and annotated on the NAD along with the network name. Network managers running in zone & node controllers will read the network ID from the annotation to set it on NetInfo. On startup, network manager running in cluster manager will read the network IDs annotated on the nodes to cover for the upgrade scenario. Network IDs will still be annotated on the nodes because this PR does not transition all the code to use the network ID from the NetInfo instead of the node annotation. That will have to be done progressively. This have several benefits, among them: - NetworkID is available sooner overall since we dont have to wait for all the nodes to be annotated - No need to unmarshall the node annotation to get the network IDs, they are available in NetInfo - No need to unmashall the NAD to get the network name, can be accessed directly from the annotation. If a network is replaced with a different one with the same name, the network ID is reused as the respective network controller will not start as the previous one is stopped and cleaned up so it shouldn't be a problem. Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Instead of considering managed VRFs those that follow the mp<id>-udn-vrf naming template, use the table number: those vrfs associated to a table within our reserved block of table numbers are managed by us. The block right now is anything higher than RoutingTableIDStart (1000). This allows to manage VRFs with any name which is desirable if the name is going to be exposed through BGP. Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Anticipating that these VRF names are going to be exposed through BGP, we should to use friendlier names for our VRFs. The most natural name to use is the network name. Thus giving a cluster UDN a name below 15 characters that matches an already existing VRF not managed by ovn-k will fail. This is considered an admin problem and not an ovn-k problem for now. Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Was causing deadlocks in unit tests Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

…heir subcontrollers Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Assuming that there is three types of controllers, being: network agnostic, network aware and network specific; we were already notifying network specific controllers of network changes. But network aware controllers, controllers for which we have a single instance capable of managing multiple networks, had no code path to be informed of netwokr changes. This commit adds a code path for that and makes the RouteAdvertisments controller aware of network changes. Changed ClusterManager to be the controller manager for cluster manager instead of secondaryNetworkClusterManager. It just makes more sense that way sice ClusterManager is the top level manager. Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

…twork exist test Signed-off-by: Or Mergi <[email protected]>

On CUDN cleanup is inconsistent as we see some flaky tests due to CUDN "already exist" errors, implying object are not actually deleted. Wait for CUDN object be gone when deleted Signed-off-by: Or Mergi <[email protected]>

CUDN is cluster-scoped object, in case tests running in parallel, having random names avoids conflicting with other tests. Use random metadata.name for CUDN objects. The "isolates overlapping CIDRs" tests create objects based on the 'red' and 'blue' variables, including CUDN objects. Change the tests CUDN creation use random names and update the given 'networkAttachmentConfigParams' with the new generated name. Update 'red' & 'blue' vaiables with the generated name, carried by 'networkAttachmentConfigParams' (netConfig.name). The pod2Egress tests asserts on the CUDN object name given by 'userDefinedNetworkName'. In practice the tests netConfigParam.name is userDefinedNetworkName. Change the assertion to check the given netConfigParam. Signed-off-by: Or Mergi <[email protected]>

Signed-off-by: nithyar <[email protected]>

e2e, CUDN: Improve stability

Reconcile RouteAdvertisements in cluster manager

Add missing enum validation for RouteAdvertisements

Compare annotations directly if possible. For network specific map entries only compare raw json entries without parsing the map in full. Co-authored-by: Tim Rozet <[email protected]> Signed-off-by: Patryk Diak <[email protected]>

Instead of always parsing all node/join subnets parse the raw json map and only compute the results for the affected network. Signed-off-by: Patryk Diak <[email protected]>

Increases async performance of informer cache being able to always queue events and not blocking while performing ADD/UPDATE/DELETE operation. Signed-off-by: Tim Rozet <[email protected]>

Add a pool of Event handlers instead of a single (federated) event handler per informer. Ensure a controller always gets registers with the same event handler. Set the pool size to 201 (200 for secondary controllers and one, index 0, reserved for the default network). Always use pool entry with index 0 for the default network controller. Signed-off-by: Dumitru Ceara <[email protected]>

Keep the initial add/sync queue small enough though. This is needed to avoid contention on handler addition initial processing. Signed-off-by: Dumitru Ceara <[email protected]>

k8s.ovn.org/user-defined-network is now required to be labeled on a namespace at namespace creation time in order to use a primary UDN. The following conditions are true: 1. If namespace is missing the label, and a pod is created, it attaches to default network. 2. If the namespace is missing the label, and a primary UDN or CUDN is created that matches that namespace, the UDN/CUDN will report error status and the NAD will not be generated. 3. If the namespace is missing the label, and a primary UDN/CUDN exists, a pod in the namespace will be created and attached to default network. 4. If the namespace has the label, and a primary UDN/CUDN does not exist a pod in the namespace will fail creation until the UDN/CUDN is created. Also includes some fixes to unit tests that were brought to light by this PR. For example, the layer 2 multi-network tests were adding invalid annotations for node-subnets, etc. Signed-off-by: Tim Rozet <[email protected]>

Signed-off-by: Patryk Diak <[email protected]>

Signed-off-by: Tim Rozet <[email protected]>

Was using ipv6 on ipv4 cluster. Signed-off-by: Tim Rozet <[email protected]>

The unit tests run with race detection enabled and on constrained environments (e.g., default GitHub runners) and run out of resources when using such large event queues. This change doesn't affect e2e tests in any way. Those will use default event queue sizes (1K) in order to test what gets deployed on actual clusters. Signed-off-by: Dumitru Ceara <[email protected]>

Signed-off-by: Dumitru Ceara <[email protected]>

…w table. Signed-off-by: Dumitru Ceara <[email protected]>

Secondary network controllers should ingore resources that do not belong to the current network. Signed-off-by: Patryk Diak <[email protected]>

…rovements_ds

openshift-ci · 2025-01-08T13:24:26Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

openshift-ci · 2025-01-08T13:25:07Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: kyrtapz
Once this PR has been reviewed and has the lgtm label, please assign knobunc for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

kyrtapz · 2025-01-08T13:25:46Z

/test images

openshift-ci · 2025-01-08T14:34:34Z

@kyrtapz: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

When multinetwork policies support is disabled WatchNamespaces should run to completion for primary UDNs. This was not happening because a primary UDN is also a secondary network. Signed-off-by: Patryk Diak <[email protected]>

Signed-off-by: Patryk Diak <[email protected]>

Signed-off-by: Dumitru Ceara <[email protected]>

Otherwise there's a data race because subnets can be allocated while we read the current usage/count. Signed-off-by: Dumitru Ceara <[email protected]>

openshift-merge-robot · 2025-01-26T22:30:16Z

PR needs rebase.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

npinaeva and others added 30 commits December 18, 2024 20:23

Add missing enum validation for RouteAdvertisements

06c26bc

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Merge pull request #4930 from npinaeva/udn-isolation-hostnet

d456afd

udn host isolation: fix initialSync.

Add UDN Layer2 support for an Egress IP assigned to the primary inter…

6c4e021

…face Signed-off-by: Martin Kennelly <[email protected]>

EIP/ESVC: remove code to remove legacy no reroutes for nodes

1aedb9a

This code isnt being used anymore. We dont expect users to upgrade directly from code which contained the legacy LRPs, therefore its safe to remove. Signed-off-by: Martin Kennelly <[email protected]>

EIP E2Es: create const for httpd image name and add image tag

33f957d

Signed-off-by: Martin Kennelly <[email protected]>

Fix a potential nil dereference

52e47a7

In an unlikely scenario where the service doesn't exist and there was an issue getting the current active network the code should not use the service object for the returned error. Signed-off-by: Patryk Diak <[email protected]>

Services controller: Handle startup failures

33b5099

Previously, if the services controller failed to start it would not be retried. Signed-off-by: Patryk Diak <[email protected]>

Merge pull request #4833 from martinkennelly/eip-l2

d8d42f1

L2 UDN: EgressIP hosted by primary interface (`breth0`)

Do not update EncapIP if it is configured

6c366bb

If EncapIP is configured, it means it is different from the node's primary address. Do not update EncapIP when node's primary address changes. Signed-off-by: Yun Zhou <[email protected]>

Add FRRConfiguration to factory

90136b5

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Reconcile RouteAdvertisements from cluster manager

3d1ec7a

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Fix receiver names of nad controller

4c0eb71

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Fix deadlock when comparing mutableNetInfo with self

7c03f3f

Was causing deadlocks in unit tests Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Fix network controllers constructors not using the same NetInfo for t…

a13297e

…heir subcontrollers Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

Fix egress IP tests

81449dd

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

do not assign an ID to network still used by a controller being stopped

e2b9f0a

Signed-off-by: Jaime Caamaño Ruiz <[email protected]>

test,networksegmentation: Wait for CUDN status populate in primary ne…

f401ff5

…twork exist test Signed-off-by: Or Mergi <[email protected]>

test,networksegmentation: Wait for CUDN object be gone

30f1e6c

On CUDN cleanup is inconsistent as we see some flaky tests due to CUDN "already exist" errors, implying object are not actually deleted. Wait for CUDN object be gone when deleted Signed-off-by: Or Mergi <[email protected]>

Add nftables binaries to ubuntu arm image

4e344ec

Signed-off-by: nithyar <[email protected]>

Fix issues in DPU host initialization

97d0504

Signed-off-by: nithyar <[email protected]>

Merge pull request #4842 from ormergi/e2e-cudn-fix

e4b585c

e2e, CUDN: Improve stability

Merge pull request #4691 from jcaamano/cm-routeadvertisements

ff34493

Reconcile RouteAdvertisements in cluster manager

Merge pull request #4934 from jcaamano/advertisements-enum-validation

326f9db

Add missing enum validation for RouteAdvertisements

Improve annotation changes detection

166621b

Compare annotations directly if possible. For network specific map entries only compare raw json entries without parsing the map in full. Co-authored-by: Tim Rozet <[email protected]> Signed-off-by: Patryk Diak <[email protected]>

kyrtapz and others added 18 commits January 7, 2025 10:00

Use partial parsing for network sepcific CIDRs

5f4ba73

Instead of always parsing all node/join subnets parse the raw json map and only compute the results for the affected network. Signed-off-by: Patryk Diak <[email protected]>

Always use queued informer

c31a2b7

Increases async performance of informer cache being able to always queue events and not blocking while performing ADD/UPDATE/DELETE operation. Signed-off-by: Tim Rozet <[email protected]>

factory: Bump the event queue size to 1K.

c23b286

Keep the initial add/sync queue small enough though. This is needed to avoid contention on handler addition initial processing. Signed-off-by: Dumitru Ceara <[email protected]>

Disable adding/removing the UDN namespace label

8028e81

Signed-off-by: Patryk Diak <[email protected]>

Update E2Es to for required UDN label

983dad6

Signed-off-by: Tim Rozet <[email protected]>

Fix multicast net seg tests ip family

e0e7e09

Signed-off-by: Tim Rozet <[email protected]>

Fix egress IP tests

bfab275

Was using ipv6 on ipv4 cluster. Signed-off-by: Tim Rozet <[email protected]>

ovn_test: Properly shutdown the watch factory.

8791a60

Signed-off-by: Dumitru Ceara <[email protected]>

gateway_localnet_linux_test: Serialize access to openflow manager flo…

e52b7de

…w table. Signed-off-by: Dumitru Ceara <[email protected]>

UserDefinedNetworks: Ignore pods and namespaces from different networks

4a11dc8

Secondary network controllers should ingore resources that do not belong to the current network. Signed-off-by: Patryk Diak <[email protected]>

Merge remote-tracking branch 'upstream/master' into perf_improvements_ds

667551e

Merge remote-tracking branch 'origin/perf_improvements' into perf_imp…

e72504a

…rovements_ds

Merge branch 'pr/4912' into perf_improvements_ds

9728674

Merge branch 'pr/4939' into perf_improvements_ds

bc36acb

Merge branch 'svc_udn_fix' into perf_improvements_ds

0b6d9ce

openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 8, 2025

kyrtapz and others added 4 commits January 8, 2025 16:56

Fix WatchNamespaces for UDNs

c2bf969

When multinetwork policies support is disabled WatchNamespaces should run to completion for primary UDNs. This was not happening because a primary UDN is also a secondary network. Signed-off-by: Patryk Diak <[email protected]>

wip

ea0c924

Signed-off-by: Patryk Diak <[email protected]>

HACK: Bump event queue initial queue size

a8558be

Signed-off-by: Dumitru Ceara <[email protected]>

subnet_allocator: Take lock when reading subnetAllocatorRanges.

e3b7547

Otherwise there's a data race because subnets can be allocated while we read the current usage/count. Signed-off-by: Dumitru Ceara <[email protected]>

openshift-merge-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[wip] #2411

[wip] #2411

kyrtapz commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

kyrtapz commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

openshift-merge-robot commented Jan 26, 2025

[wip] #2411

Are you sure you want to change the base?

[wip] #2411

Conversation

kyrtapz commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

kyrtapz commented Jan 8, 2025

openshift-ci bot commented Jan 8, 2025

openshift-merge-robot commented Jan 26, 2025