Task: Duplicate event prevention/cleanup #234

WPprodigy · 2021-12-17T20:33:30Z

Problem: There are apparently a variety of situations where you can end up w/ duplicate events, sometimes hundreds to thousands of them. This can pretty much perpetually bog down a site until cleaned up manually.

1) Prevent duplicate recurring events

I actually think this is something WP core overlooked, and should probably revisit in core trac. Single-events are prevented from being registered if there is a similar action/args event already scheduled within +/-10mins. Docs actually say the same about recurring events, but it's not true.

Part of this is already implemented in an upcoming release, seen here: bbb53ae#diff-467459221a72437e5497c04a9ac91579c53ef140b9da6bb0b67c043dc1fb0490R41-R61. If an event has the same action/args/schedule, then we're just gonna deviate from core a little and prevent registration. If multiple of the same event on the same schedule is needed for some reason, then you'd just need to add some varying args when registering them.

Perhaps could take this a step further and either re-query with SRTM (send reads to master), or have a stricter unique constraint on the table (though could be tricky w/ varying event statuses).

2) Cleanup existing duplicate recurring events

There are quite a lot of sites in the wild now with duplicate recurring events, some to extreme degrees with hundreds of each. On a large MS, this means cron is going to struggle unnecessarily, and just have an overall heavy load that we can prevent.

So w/ the new contract in place where we prevent duplicate recurring events, we can then go through and cleanup any existing duplicates, as well as ones that may continue to slip through the cracks due to other race conditions. I'm thinking the daily a8c_cron_control_clean_legacy_data internal job would be great for this.

3) Handle DB unique constraint errors.

Mentioned in #147, we do need to handle the cases where events run into the unique constraint. Notably, a rare but possible situation is a single event sharing the same action as a recurring event, if the timestamps collide we need to make sure to handle it gracefully.

We'll need to document this "core deviance" in the readme.

The text was updated successfully, but these errors were encountered:

WPprodigy · 2022-01-26T22:53:40Z

Cleaning up duplicate recurring events now on the daily cronjob, so that's good.

Next steps I'm thinking:

When we determine an event doesn't exist yet (performantly), do another query that locks the table while it does an additional check and then inserts it's event. This way if there are two truly parallel requests, only one will win and get the write: https://dev.mysql.com/doc/refman/8.0/en/innodb-locking-reads.html
The unique constraint needs to drop the status column, as I'm planning to move events to a running status and then back to pending if they are recurring (and we wouldnt want an event being added in between those). It's possible the unique constraint won't be needed anymore if we truly lock down the potential for duplicates (which would simplify moving events to other statuses like complete/failed/cancelled so we don't have to worry about the unique constraint), but if it is needed still then the index should be swapped to action/instance/timestamp in that order so more queries can take advantage of the index.

This was referenced Dec 17, 2021

Events: Detect duplicate events created with timestamps within two or three seconds #49

Closed

Events Store: improve handling of DB errors when race condition creates duplicates #147

Closed

WPprodigy mentioned this issue Jan 20, 2022

Prune duplicate recurring events #251

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Task: Duplicate event prevention/cleanup #234

Task: Duplicate event prevention/cleanup #234

WPprodigy commented Dec 17, 2021 •

edited

Loading

WPprodigy commented Jan 26, 2022 •

edited

Loading

Task: Duplicate event prevention/cleanup #234

Task: Duplicate event prevention/cleanup #234

Comments

WPprodigy commented Dec 17, 2021 • edited Loading

1) Prevent duplicate recurring events

2) Cleanup existing duplicate recurring events

3) Handle DB unique constraint errors.

WPprodigy commented Jan 26, 2022 • edited Loading

WPprodigy commented Dec 17, 2021 •

edited

Loading

WPprodigy commented Jan 26, 2022 •

edited

Loading