-
Notifications
You must be signed in to change notification settings - Fork 103
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Container to container networking performance degradation #293
Comments
I'm having a hard time replicatingMy throughput is fine; Just to check: you're using the CF default Silk backend, right? Not the NSX-T Container Plug-in, right? If you don't know the answer, you're using the Silk backend. Technical Detailsfor app in dora-{0,1}; do
echo -n "$app: "
cf ssh $app -c 'echo $CF_INSTANCE_ADDR / $CF_INSTANCE_INTERNAL_IP'
done Gives me:
cf add-network-policy dora-0 dora-1 --protocol tcp --port 8080
cf ssh dora-0
curl http://10.255.249.140:8080 # Hi, I'm Dora!
time for i in $(seq 1 1000); do curl http://10.255.249.140:8080 > /dev/null 2>&1 ; done
|
Correct, we are using the Silk backend not the NSX-T plug-in. |
We noticed this as well but didn't get to the solution, just kept an isolation segment of bionic cells for the one team using container-to-container networking. They deploy an nginx server and use the internal app route to talk to their application (essentially to add authentication). |
Moving this issue out of cloudfoundry/cf-networking-release#213 as we have indications that it is related to the Stemcell.
Issue
We currently observe timeouts in C2C when moving CF from Bionic to Jammy.
Please note that this issue can only be observed when the Diego cells are migrated from Bionic to Jammy and cannot be reproduced on Bionic stemcells.
As additional information, we've also tested in different environments with and without dynamic ASGs.
Steps to Reproduce - See additional information below
cf add-network-policy app-a app-b --protocol tcp --port 8080
Expected result
Successful connections from app-a to app-b.
Current result
Sporadic timeouts and slow connections from app-a to app-b.
Workaround
In every CloudFoundry diego cell you should disable a configuration parameter in the networking interface:
ethtool -K eth0 tx-udp_tnl-segmentation off && ethtool -K eth0 tx-udp_tnl-csum-segmentation off
This is currently disabled (
off
) by default on Bionic compared to Jammy.Further information
Infrastructure: ESXI prepared with NSX-T / NSX-V (tested on both) - not sure if it can be reproduced in other cloud environments.
The text was updated successfully, but these errors were encountered: