Multiple active gateways #2891
Comments
With the default IPsec tunnels, this happens because the IPsec protocol doesn't support splitting tunnels, and the encryption and packet ordering must happen on a single core. In the past we have considered enabling multiple parallel tunnels, either on one gateway or across multiple gateways (in the latter case, with added HA benefits), but that requires deciding how to split traffic across the available tunnels. In parallel, protocol extensions are being discussed to allow IPsec tunnels to be split in order to avoid these bottlenecks; see the current draft for details. It seems preferable for Submariner to support that once it becomes available, instead of coming up with its own solution. For performance-critical scenarios, especially where a dedicated (private) network is available between gateways, Submariner supports VXLAN tunnels instead of IPsec.
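To make the "how to split traffic" question above more concrete, here is a minimal sketch (not Submariner code; the tunnel names and flow fields are hypothetical) of one common approach: hash the flow's 5-tuple so that every packet of a given flow uses the same tunnel, preserving per-flow packet ordering while spreading flows across the parallel tunnels or gateways:

```python
# Illustrative only: per-flow hashing across several hypothetical tunnels.
# All packets of one flow map to the same tunnel, so ordering within a
# flow is preserved while aggregate traffic is spread across tunnels.
import zlib

TUNNELS = ["gw-0", "gw-1", "gw-2"]  # hypothetical active gateway endpoints


def pick_tunnel(src_ip: str, dst_ip: str, src_port: int, dst_port: int, proto: str) -> str:
    """Hash the 5-tuple and pick a tunnel deterministically."""
    key = f"{src_ip}:{src_port}-{dst_ip}:{dst_port}/{proto}".encode()
    return TUNNELS[zlib.crc32(key) % len(TUNNELS)]


# Example: all packets of this TCP flow land on the same tunnel.
print(pick_tunnel("10.1.0.5", "10.2.0.9", 43512, 5432, "tcp"))
```

A per-flow scheme like this avoids reordering but cannot speed up a single large flow; that is one of the trade-offs any split across tunnels (or the IPsec protocol extensions mentioned above) has to address.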
The 56% performance drop in question is exactly that of the inter-cluster VXLAN tunnel. Intuitively, it seems multiple active VXLAN gateways would be easier to implement.
If we go down this path, we should likely raise a more specific issue.
Need more investigation before prioritizing. |
Decided to push this to the following releases.
What would you like to be added:
Multiple active gateways for higher inter-cluster data transfer performance.
Why is this needed:
Currently there is only one gateway per cluster. As per the benchmark results in #2890, there is a significant performance drop (about 56%) for Pods running on non-gateway nodes. Suppose the gateway node has a 10 Gbit/s NIC: DBMS servers running on non-gateway nodes then share only about 560 MByte/s of inter-cluster bandwidth. This means the whole cluster can theoretically transfer at most about 46 TB of data per day, which is unacceptable for production clusters (JFYI, a small to medium-sized production ClickHouse cluster can receive more than 40 TB of data per day).
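As a rough back-of-envelope check of those figures (assuming decimal units and ignoring protocol overhead, so the result is slightly different from, but in the same ballpark as, the 560 MByte/s and 46 TB quoted above):

```python
# Back-of-envelope check of the bandwidth figures above (decimal units assumed).
nic_gbit_per_s = 10   # gateway NIC line rate
drop = 0.56           # performance drop observed in #2890

effective_gbit_per_s = nic_gbit_per_s * (1 - drop)        # ~4.4 Gbit/s
effective_mbyte_per_s = effective_gbit_per_s * 1000 / 8   # ~550 MByte/s
tb_per_day = effective_mbyte_per_s * 86_400 / 1_000_000   # ~47.5 TB/day

print(f"{effective_mbyte_per_s:.0f} MByte/s, about {tb_per_day:.1f} TB/day")
```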