feat: New ingestion consumer #27668

benjackwhite · 2025-01-18T14:14:59Z

Problem

This PR is the first of several steps towards refactoring the plugin server to be much cleaner and more minimal.

Changes

New consumer based on the same class-based consumer used for CDP
Essentially does the same thing as the old each-batch-ingestion process, but is cleaner and removes a lot of complex code paths
Changes to be much more config driven - this allows us to run multiple versions of exactly the same consumer, just with different source / destination topics and consumer group IDs
Adds a new mode for local running (not default) so that we can test all ingestion consumers this way - this will become the default once we release and remove the older options
Tests to cover most of this
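
As a rough illustration of the config-driven approach in the list above, the sketch below runs the same consumer class several times with different topics and group IDs. The `ConsumerConfig` shape, the topic and group names, and the `SketchConsumer` class are hypothetical and are not taken from this PR.

```typescript
// Hypothetical config shape: the field names are assumptions, not the PR's actual types.
interface ConsumerConfig {
    groupId: string
    sourceTopic: string
    destinationTopic: string
}

// Stand-in for the real consumer: it only reports where it would read from and write to.
class SketchConsumer {
    private readonly config: ConsumerConfig

    constructor(config: ConsumerConfig) {
        this.config = config
    }

    describe(): string {
        const { groupId, sourceTopic, destinationTopic } = this.config
        return `${groupId}: ${sourceTopic} -> ${destinationTopic}`
    }
}

// Same class, different configuration (all names below are made up for illustration).
const configs: ConsumerConfig[] = [
    { groupId: 'ingestion-main', sourceTopic: 'events_in', destinationTopic: 'events_out' },
    { groupId: 'ingestion-overflow', sourceTopic: 'events_in_overflow', destinationTopic: 'events_out' },
]

const consumers = configs.map((config) => new SketchConsumer(config))
for (const consumer of consumers) {
    console.log(consumer.describe())
}
```

The point of the design is that adding another deployment of the consumer only needs a new config entry, not a new code path.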

Does this work well for both Cloud and self-hosted?

How did you test this code?

pl

This is a really good step forward, the new IngestionConsumer is much cleaner than what I've seen in Mr Blobby.

I haven't done a super-detailed review of the ingestion code, as it would take me a bit more time to properly understand the old code and compare it with the new implementation. Let me know if you want me to do that.

I left some comments in a couple of places, they're not super critical, so I'll approve the PR and leave it to you what to do with them.

plugin-server/src/capabilities.ts

plugin-server/src/types.ts

plugin-server/src/main/pluginsServer.ts

pl · 2025-01-21T13:10:28Z

plugin-server/src/ingestion/ingestion-consumer.test.ts

+
+    beforeEach(async () => {
+        const now = new Date(2025, 1, 1).getTime()
+        jest.spyOn(Date, 'now').mockReturnValue(now)


issue: This looks like it's gonna break when you run it in a different timezone – in the snapshot it's already a different date. I assume we don't want to mock the timers using jest because it's an integration test and it might be running timers somewhere in the middle of the stack. Could we do the same as we do with UUIDs or completely mask out the timestamps in the snapshots?

Yeah I tried a few things and this seemed to be the best option. I'll circle back and see if I can make it nicer
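
One possible way to address the timezone concern, sketched here as an assumption rather than the fix adopted in the PR: `new Date(2025, 1, 1)` is interpreted in local time (and month `1` is February), whereas `Date.UTC` yields the same instant on every machine. The `maskTimestamps` helper is hypothetical and illustrates the masking alternative raised in the comment.

```typescript
// Fixed instant, independent of the machine's timezone: 2025-01-01T00:00:00.000Z.
const FIXED_NOW: number = Date.UTC(2025, 0, 1)

// In a Jest test this value could replace the local-time one:
//   jest.spyOn(Date, 'now').mockReturnValue(FIXED_NOW)

// Matches ISO-8601 timestamps in UTC, e.g. 2025-01-01T00:00:00.000Z.
const ISO_TIMESTAMP = /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?Z$/

// Hypothetical helper: recursively replaces timestamp strings with a placeholder
// so that snapshots do not depend on when or where the test ran.
function maskTimestamps(value: unknown): unknown {
    if (typeof value === 'string') {
        return ISO_TIMESTAMP.test(value) ? '<TIMESTAMP>' : value
    }
    if (Array.isArray(value)) {
        return value.map((item) => maskTimestamps(item))
    }
    if (value !== null && typeof value === 'object') {
        const masked: Record<string, unknown> = {}
        for (const [key, inner] of Object.entries(value as Record<string, unknown>)) {
            masked[key] = maskTimestamps(inner)
        }
        return masked
    }
    return value
}

const maskedExample = maskTimestamps({
    event: 'pageview',
    created_at: new Date(FIXED_NOW).toISOString(),
}) as Record<string, unknown>

console.log(new Date(FIXED_NOW).toISOString()) // 2025-01-01T00:00:00.000Z
console.log(maskedExample)
```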

plugin-server/src/ingestion/ingestion-consumer.test.ts

pl · 2025-01-21T13:39:57Z

plugin-server/src/ingestion/ingestion-consumer.test.ts

+    })
+
+    describe('dropping events', () => {
+        describe.each(['headers', 'payload'] as const)('via %s', (kind) => {


note: For me it takes quite a bit of effort to understand those tests – there's quite a lot of back-and-forth:

to understand why the payload tests pass, I had to scroll up quite a bit to find that there's a default token in the payload

in the header tests the payloads still have the tokens – the name of the describe block suggests that we're testing events with tokens only in the headers

The tests below are short enough (and very readable on their own) that they could be duplicated, and the createEvent function could be explicit about where the token lands, instead of relying on addMessageHeaders filtering by the kind of test.

Cool, point taken. I think it's too early for this to be reviewed tbh, as I'm mostly trying to understand the existing logic myself...
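
The suggestion in this thread could look roughly like the sketch below, where the test helper states explicitly where the token goes instead of relying on a filter keyed by the kind of test. `createMessage`, `TokenLocation`, and the message shape are hypothetical and do not mirror the PR's actual `createEvent` or `addMessageHeaders`.

```typescript
type TokenLocation = 'headers' | 'payload'

// Hypothetical, simplified message shape for illustration only.
interface SketchMessage {
    headers: Record<string, string>
    payload: Record<string, unknown>
}

// The caller decides where the token lands; there is no default token hidden in the payload.
function createMessage(token: string, location: TokenLocation): SketchMessage {
    const message: SketchMessage = { headers: {}, payload: { event: '$pageview' } }
    if (location === 'headers') {
        message.headers.token = token
    } else {
        message.payload.token = token
    }
    return message
}

const viaHeaders = createMessage('phc_test', 'headers')
const viaPayload = createMessage('phc_test', 'payload')
console.log(viaHeaders, viaPayload)
```

With a helper like this, a test named "via headers" would build a message whose payload carries no token at all, so the describe block name matches what is actually exercised.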

posthog-bot · 2025-01-21T18:20:42Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 1)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

benjackwhite · 2025-01-21T18:40:38Z

plugin-server/src/utils/token-bucket.ts

Found some bugs here - will pull out to a separate PR

benjackwhite added 18 commits January 18, 2025 12:51

Fixes

e3bab2c

Fixes

5330f51

More changes

c3e8070

More checks

38dfb3d

Fix

9f94ca5

Fixes

b87a8e9

Fixess

410e36d

Fixes

c128b81

Refactor producer

17621a9

Fixes

46837c1

Fix

a4327cb

Fixes

73f623c

fix

3cef6f7

Fixes

6f0d9b6

Fixes

02753d8

Fixes

2ac0021

fix

8228d91

Merge branch 'feat/producer-refactor' into feat/new-ingestion-consumer

d5ff970

# Conflicts: # plugin-server/src/main/pluginsServer.ts

benjackwhite changed the base branch from master to feat/producer-refactor January 19, 2025 10:46

Fix for different types

e9a3ce7

benjackwhite changed the title ~~wip: New ingestion consumer~~ feat: New ingestion consumer Jan 19, 2025

benjackwhite added 9 commits January 19, 2025 11:57

Fix

4a6575c

Fixes

e8f6a41

Fixes

704831e

Fixes

a76761f

Fixes

7609504

Fixes

db07b59

Fixes

39f3277

Started adding tests

18bb3dd

Fix

63d909d

benjackwhite added 11 commits January 21, 2025 11:39

Merge branch 'fix/tests' into feat/new-ingestion-consumer

a3d5857

Fix skip logic

0353b86

Added test

d97f65f

fix

00ceb09

fix

1c7439e

Fix

d9fc72f

Merge branch 'fix/tests' into feat/new-ingestion-consumer

0ce4169

Merge branch 'feat/improve-overflow' into feat/new-ingestion-consumer

d3751c1

Merge branch 'fix/tests' into feat/new-ingestion-consumer

f99f7d8

Fix

03b5078

Fixes

7057835

pl approved these changes Jan 21, 2025

benjackwhite and others added 8 commits January 21, 2025 15:07

Fixes

4e80a92

Fix

59b0380

Merge branch 'fix/tests' into feat/new-ingestion-consumer

0f68b84

# Conflicts: # plugin-server/package.json

Fix tests

e619928

Fixes

6b21bd5

Fix

1618c17

Added happy path tests

7d1b544

Update UI snapshots for chromium (1)

e9a10b0

Fixes

1002c84

benjackwhite marked this pull request as ready for review January 21, 2025 18:33

benjackwhite requested review from meikelmosby and pl and removed request for pl and meikelmosby January 21, 2025 18:33

Merge branch 'fix/tests' into feat/new-ingestion-consumer

a0ca3ce

# Conflicts: # frontend/__snapshots__/replay-player-success--second-recording-in-list--dark.png

fix

4ad069e
