
Don't unwrap when removing invalid TCP state from fastpath. #621

Merged · FelixMcFelix merged 3 commits into master from fix-618 on Dec 3, 2024

Conversation

FelixMcFelix (Collaborator) commented Nov 28, 2024

Now that the fastpath temporarily drops (and reacquires) the port lock when TCP state needs to be invalidated, we opened ourselves up to the possibility that another packet could have removed this state ahead of us. Equally, a packet could insert new TCP state which we might accidentally remove.

This PR removes the unwrap on removal to account for the race, and only removes a TCP flow if it is pointer-equal to the entry we intended to invalidate.

Closes #618, closes #624.
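
For illustration, a minimal sketch of the race being fixed. The guarded removal mirrors the diff reviewed below; the "before" line is an assumption based on the PR title, not a verbatim quote of the old code.

// Before (assumed): removal unconditionally unwrapped, so if another packet
// removed the same flow while the port lock was dropped, we would panic.
//     local_lock.tcp_flows.remove(ufid_out).unwrap();
//
// After: only remove the flow if it is still the exact Arc this packet
// decided to invalidate; a flow re-inserted under the same UFID by another
// packet is left untouched.
if let Some(found_entry) = local_lock.tcp_flows.get(ufid_out) {
    if Arc::ptr_eq(found_entry, &entry) {
        _ = local_lock.tcp_flows.remove(ufid_out);
    }
}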

FelixMcFelix (Collaborator, Author) commented Nov 28, 2024

I sadly haven't been able to locally reproduce the crash itself, using either zone-to-zone traffic or between local VMs via standalone omicron. Both this PR and master stand up against while true; do iperf -c 10.0.0.1 -P128 -t 2; iperf -c 10.0.0.1 -P128 -t 2 -R; done for around 10 minutes. This should slam the TCP flow state table pretty hard, and exposes us to a lot of leftover entries when data segments arrive later than expected -- I would have thought this would get us a trigger, but alas.

@FelixMcFelix FelixMcFelix self-assigned this Nov 28, 2024
rcgoodfellow (Contributor) replied, quoting the above:

I sadly haven't been able to locally reproduce the crash itself, using either zone-to-zone traffic or between local VMs via standalone omicron. Both this PR and master stand up against while true; do iperf -c 10.0.0.1 -P128 -t 2; iperf -c 10.0.0.1 -P128 -t 2 -R; done for around 10 minutes.

Maybe try this for many flows simultaneously with distinct address pairs? This would get a bit closer to rack traffic conditions.

FelixMcFelix (Collaborator, Author) commented Nov 29, 2024, quoting the suggestion above:

Maybe try this for many flows simultaneously with distinct address pairs? This would get a bit closer to rack traffic conditions.

I've bumped up the topology a bit (all instances running omicron's builtin alpine linux):

┌──────────────┐    ┌──────────────────────────────────────────────────────┐
│   Macbook    │    │Farme                                                 │
│(10.0.23.168) │    │(10.0.147.187)                                        │
└──────────────┘    │  ┌ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─   │
        ▲           │   VPC                                             │  │
        └─────SSH───┼──┼─────────┬──────────────┬──────────────┐           │
                    │            │              │              │        │  │
                    │  │         ▼              ▼              ▼           │
                    │    ┌──────────────┬──────────────┬──────────────┐ │  │
                    │  │ │     cafe     │  restaurant  │     bar      │    │
                    │    │I: 172.30.0.5 │I: 192.168.0.5│I: 172.30.0.7 │ │  │
                    │  │ │E: 10.1.222.12│E: 10.1.222.13│E: 10.1.222.14│    │
                    │    └──────────────┴──────────────┴──────────────┘ │  │
                    │  │         ▲              │              │           │
                    │            │              │              │        │  │
                    │  │         └──────────────┴──────────────┘           │
                    │                  iperf3 client->server            │  │
                    │  └ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─   │
                    └──────────────────────────────────────────────────────┘

No crashes as yet on master under the above -P128 (×4 servers on cafe) rapid-fire iperf gauntlet, but I did come across #624. I've been attempting concurrent API operations (e.g., firewall rule add/delete) in case epoch bumps tell part of the story (as per oxide-dogfood on the most recent coredump). I'll leave it running, but we're missing something to make the original fault dead reproducible.

luqmana (Contributor) left a review comment:

LGTM. Just left one suggestion for coalescing the flow state lookup+removal.

Comment on lines +1405 to +1410
if let Some(found_entry) = local_lock.tcp_flows.get(ufid_out) {
    if Arc::ptr_eq(found_entry, &entry) {
        self.uft_tcp_closed(&mut local_lock, ufid_out, ufid_in);
        _ = local_lock.tcp_flows.remove(ufid_out);
    }
}
luqmana (Contributor) commented:
If it's ok to swap the order here, you could make use of the Entry API to look up and remove the TCP flow in one pass, saving a second search through the table:

Suggested change (replacing the block commented on above):

if let Entry::Occupied(found_entry) = local_lock.tcp_flows.entry(*ufid_out) {
    if Arc::ptr_eq(found_entry.get(), &entry) {
        _ = found_entry.remove_entry();
        self.uft_tcp_closed(&mut local_lock, ufid_out, ufid_in);
    }
}

FelixMcFelix (Collaborator, Author) commented Dec 3, 2024:

Thanks a lot for the review, Luqman.

I think this would be a good idea (the swap is valid as I read it), except that local_lock.tcp_flows here is a FlowTable and not a BTreeMap. To enable it, it'd be nice for FlowTable to forward the Entry API while still respecting the capacity constraints in FlowTable::add etc. I'll open a ticket.

EDIT: #627.
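
For illustration only, a hypothetical sketch of what such a forwarded, capacity-respecting entry helper on FlowTable might look like (FlowTable's fields, the limit name, and occupied_entry are assumed here, not the actual OPTE implementation). Removal can never exceed the capacity limit, so exposing only the occupied case keeps the constraint enforced solely by the insertion path.

use std::collections::{btree_map, BTreeMap};
use std::sync::Arc;

// Assumed shape: a capacity-limited map from flow keys to shared flow state.
struct FlowTable<K: Ord, V> {
    limit: usize,
    map: BTreeMap<K, Arc<V>>,
}

impl<K: Ord, V> FlowTable<K, V> {
    // Forward only the occupied case, so callers can check-and-remove with a
    // single lookup. No vacant entry is handed out, so an insertion path that
    // enforces `limit` (e.g. a FlowTable::add) cannot be bypassed.
    fn occupied_entry(&mut self, key: K) -> Option<btree_map::OccupiedEntry<'_, K, Arc<V>>> {
        match self.map.entry(key) {
            btree_map::Entry::Occupied(e) => Some(e),
            btree_map::Entry::Vacant(_) => None,
        }
    }
}

A caller could then write (assuming the flow key is Copy): if let Some(e) = table.occupied_entry(*ufid_out) { if Arc::ptr_eq(e.get(), &entry) { _ = e.remove_entry(); } } -- one search instead of two.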

@FelixMcFelix FelixMcFelix merged commit b56afee into master Dec 3, 2024
10 checks passed
@FelixMcFelix FelixMcFelix deleted the fix-618 branch December 3, 2024 10:42