
Sliding Sync: Lazy-loading room members on incremental sync (remember memberships) #17809

Open: wants to merge 31 commits into develop
Conversation

@MadLittleMods (Contributor) commented Oct 9, 2024

Lazy-loading room members on incremental sync, remembering which memberships we've sent down the connection before (up to 100).

Fix #17804

Follow-up/alternative to #17806

Depends on #17785

Pull Request Checklist

  • Pull request is based on the develop branch
  • Pull request includes a changelog file. The entry should:
    • Be a short description of your change which makes sense to users. "Fixed a bug that prevented receiving messages from other servers." instead of "Moved X method from EventStore to EventWorkerStore.".
    • Use markdown where necessary, mostly for code blocks.
    • End with either a period (.) or an exclamation mark (!).
    • Start with a capital letter.
    • Feel free to credit yourself, by adding a sentence "Contributed by @github_username." or "Contributed by [Your Name]." to the end of the entry.
  • Code style is correct
    (run the linters)

# sent it before and send the new state. (If we were tracking
# that we sent any other state, we should still keep track
# of that.)
{},
@MadLittleMods (Contributor, Author) commented:

It's possible that we could remember the specific state_keys that we have sent down before, but this currently just acts the same as if a whole type was removed (same as simple_remove_type).

This is just a performance optimization though, and the result would look like the following:

Suggested change:
-{},
+{
+    EventTypes.Member: {
+        "@user3:test",
+    }
+},

Perhaps it's good that we "garbage collect" and forget what we've sent before for a given type when the client stops caring about a certain type 🤷.

@erikjohnston (Member) left a comment:

I think this makes sense! I am slightly worried about the increase in the size of the tables if we keep adding more and more members in there, but I'll have a little think about how we might make that more efficient.

Review thread on synapse/handlers/sliding_sync/__init__.py (resolved)
…c-lazy-load-members-on-incrental-sync3

Conflicts:
	synapse/handlers/sliding_sync/__init__.py
Base automatically changed from erikj/ss_required_state to develop October 14, 2024 12:31
…ers-on-incrental-sync3

Conflicts:
	synapse/handlers/sliding_sync/__init__.py
	tests/handlers/test_sliding_sync.py
@@ -0,0 +1 @@
Fix bug with sliding sync where `$LAZY`-loading room members would not return `required_state` membership in incremental syncs.
@erikjohnston (Member) commented:

I am slightly worried about the increase in the size of the tables if we keep adding more and more members in there, but I'll have a little think about how we might make that more efficient.

More concretely, there are a few concerns I have:

  1. Over the lifetime of a connection, the size of the stored required state could grow very large (think Matrix HQ).
  2. Currently, we pull out everything when we get a request, so the size of the data matters.
  3. Every time we change the state we end up copying all the rows from the previous connection position to the new position, so the amount of data again matters.

On point 3, I think we can change the DB tables up a bit so that we have one table that holds the base values that apply to all live positions in the connection (there will be at most two), and then another table that holds the deltas from the base to each position. This means that we'd only need to copy the deltas into the base and insert new deltas whenever we persist a new position (which should be a lot less data).
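
As a rough illustration of that base-plus-deltas idea (purely a sketch; the names, shapes, and in-memory representation here are made up rather than Synapse's actual tables), the merge and advance steps could look something like:

from typing import Dict, Set

# event_type -> state_keys we have sent down (e.g. member user IDs)
RequiredStateMap = Dict[str, Set[str]]

def state_for_position(base: RequiredStateMap, delta: RequiredStateMap) -> RequiredStateMap:
    # Reconstruct the full required-state map for one live position by
    # overlaying its delta on the shared base.
    merged = {event_type: set(keys) for event_type, keys in base.items()}
    for event_type, keys in delta.items():
        merged.setdefault(event_type, set()).update(keys)
    return merged

def advance_position(base: RequiredStateMap, old_delta: RequiredStateMap, new_entries: RequiredStateMap) -> RequiredStateMap:
    # Fold the previous position's delta into the shared base, then return the
    # new (small) delta, which is the only thing that would need to be written
    # for the new position.
    for event_type, keys in old_delta.items():
        base.setdefault(event_type, set()).update(keys)
    return {event_type: set(keys) for event_type, keys in new_entries.items()}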

I don't really know what we do to avoid pulling lots of data from the DB. We could only pull out that data for the rooms we're sending down? Or maybe we don't pull it out at all and instead query the DB for "which of these users have we previously sent membership down for?", though that feels like a bigger change.
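
For the "which of these users have we previously sent membership down for?" idea, the core bookkeeping is just a set difference; a hypothetical helper (names invented for illustration, not an existing Synapse function) might look like:

from typing import Set

def membership_still_needed(timeline_senders: Set[str], previously_sent_members: Set[str]) -> Set[str]:
    # Membership state we still have to include for a room: senders who appear
    # in the timeline but whose membership we have not yet sent down this
    # connection.
    return timeline_senders - previously_sent_members

# Example: only @user3:test's membership needs to be sent again.
assert membership_still_needed(
    {"@user1:test", "@user3:test"},
    {"@user1:test", "@user2:test"},
) == {"@user3:test"}

Whether that difference is computed in Python from rows we pulled out, or pushed down into a DB query, is the open question above.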

@MadLittleMods (Contributor, Author) commented:

We could only pull out that data for the rooms we're sending down?

This would probably be good to do in any case.

But it seems we're also concerned with how big an individual (room_id, user_id) entry could get.

I don't really know what we do to avoid pulling lots of data from the DB. We could only pull out that data for the rooms we're sending down? Or maybe we don't pull it out at all and instead query the DB for "which of these users have we previously sent membership down for?", though that feels like a bigger change.

Are we set on wanting to track it? The dumb, simple solution is to not track it at all, which means returning a membership event for every timeline event.

Another alternative is that we could throw away a type when its state_keys grow too large, or just throw away the whole required_state_map when it grows too large. When I say throw away, I just mean reset to whatever the requested required_state_map has. This is a decent middle ground between tracking nothing and tracking everything. Ideally, we could kick out entries by recency, but the complexity is probably not needed. It also doesn't require us to change our database schema.

@erikjohnston (Member) commented:

I think we do want to track the sent memberships, otherwise you are often just doubling the number of events you have to pull out and send to the clients.

Another alternative is that we could throw away a type when its state_keys grow too large, or just throw away the whole required_state_map when it grows too large. When I say throw away, I just mean reset to whatever the requested required_state_map has. This is a decent middle ground between tracking nothing and tracking everything. Ideally, we could kick out entries by recency, but the complexity is probably not needed. It also doesn't require us to change our database schema.

This is probably a decent idea that should be fairly easy to do? Even something like "if we have $LAZY and the set of members we've sent down is bigger than 100, reset", or something. It's not pretty, but a) it means for most rooms we'll never reset, and b) most of the time we won't send down redundant memberships.

@MadLittleMods (Contributor, Author) commented:

Updated to limit the state_keys we remember from previous requests to 100 for any type (also added tests).
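
A rough sketch of what such a cap could look like (illustrative only; the constant name, function name, and exact reset behaviour here are assumptions rather than necessarily what the PR implements):

from typing import Dict, Set

MAX_REMEMBERED_STATE_KEYS = 100  # illustrative limit matching the "100" above

def combine_remembered_state_keys(previous: Dict[str, Set[str]], current: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    # Merge the state_keys sent for this request into those remembered from
    # previous requests, but stop remembering a type once the set gets too big.
    combined: Dict[str, Set[str]] = {}
    for event_type in previous.keys() | current.keys():
        merged = previous.get(event_type, set()) | current.get(event_type, set())
        if len(merged) > MAX_REMEMBERED_STATE_KEYS:
            # Too many to track: fall back to just this request's state_keys,
            # at worst re-sending some memberships the client already has.
            merged = set(current.get(event_type, set()))
        combined[event_type] = merged
    return combined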

Successfully merging this pull request may close the following issue:

Sliding sync does not correctly do lazy loaded members for incremental sync
2 participants