feat(insights): launch funnels as a Clickhouse UDF behind a feature flag #23587

aspicer · 2024-07-09T23:14:50Z

Problem

Funnels stop working beyond about 12 steps: query time grows exponentially with each additional step.

Changes

This is an experiment to rewrite funnels to run in a UDF. This PR launches that feature behind a feature flag.

What's a UDF? It's a user-defined function. You can define a function and implement it in any language of your choosing. Clickhouse launches your function and has it sit idle, preloaded and hot, waiting for input on stdin. When you call the function from Clickhouse, it pipes the data to the function and listens for its output on stdout.
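To make the stdin/stdout contract concrete, here is a minimal sketch of the process side of an executable UDF. It assumes one input row per line and one output line per row; the real wire format depends on how the function is configured in Clickhouse. The names `serve` and `handle_row` are hypothetical and are not taken from this PR.

```python
import sys
from typing import Callable, TextIO


def serve(
    handle_row: Callable[[str], str],
    stdin: TextIO = sys.stdin,
    stdout: TextIO = sys.stdout,
) -> None:
    """Read rows until stdin closes, writing exactly one result line per row.

    Assumption: rows arrive one per line. `handle_row` is a hypothetical
    callback that turns one input row into one output row.
    """
    for line in stdin:
        result = handle_row(line.rstrip("\n"))
        stdout.write(result + "\n")
        # Flush after every row so the caller is not left waiting on a buffer.
        stdout.flush()
```

Because the process stays alive between calls, any expensive setup (imports, lookup tables) happens once, before the loop starts.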

Why a UDF? It makes the funnels code a lot simpler and easier to reason about. The core of the functionality is in a small Python function.

What are the downsides of this approach? The biggest one is that UDFs are slower than native Clickhouse. You shouldn't use a UDF for anything Clickhouse does well natively. A lot of this speed difference can be mitigated by optimizing your UDF (writing it in C / C++, for example).

How does it work

We use Clickhouse to turn events into mostly the same structure we have now - a set of matching steps and exclusions.

The core of the code is in aggregate_funnel.py. For each aggregation_target, it iterates through all the matching events in time order, keeps track of funnel progress, and returns timings and results.
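As an illustration of that loop, here is a simplified sketch of ordered-funnel evaluation for a single aggregation target. It is not the code in aggregate_funnel.py: exclusions and breakdowns are left out, each event is assumed to match exactly one step, and the function name, input shape and `window` parameter are all assumptions made for the example.

```python
from typing import List, Optional, Sequence, Tuple


def ordered_funnel(
    events: Sequence[Tuple[float, int]],
    num_steps: int,
    window: float,
) -> List[float]:
    """Return the step timestamps of the furthest-reaching funnel attempt.

    events: (timestamp, step) pairs for one actor, sorted by timestamp;
            step is the 1-based funnel step the event matches.
    num_steps: number of funnel steps (assumed to be at least 1).
    window: conversion window, in the same unit as the timestamps.
    """
    # reached[k] holds the timestamps of the latest-starting attempt
    # that got as far as step k + 1, or None if no attempt got there.
    reached: List[Optional[List[float]]] = [None] * num_steps
    for ts, step in events:
        if step == 1:
            # A new first-step event starts a fresh attempt.
            reached[0] = [ts]
        else:
            prev = reached[step - 2]
            # Advance only if the previous step was reached and the
            # attempt is still inside the conversion window.
            if prev is not None and ts - prev[0] <= window:
                reached[step - 1] = prev + [ts]
        if reached[-1] is not None:
            break  # Funnel completed; later events cannot improve it.
    for attempt in reversed(reached):
        if attempt is not None:
            return attempt
    return []
```

The length of the returned list is the number of steps completed, and the differences between consecutive timestamps are the step-to-step conversion times. Each event is handled once, so the work per actor grows linearly with the number of events.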

At the end, we use Clickhouse to do a couple of aggregations on breakdowns and to calculate averages.
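In the PR this final step happens in Clickhouse SQL. The Python sketch below only illustrates the shape of the aggregation: per-actor results are grouped by breakdown value, counted per step, and averaged per step transition. The input shape and the name `summarize` are assumptions made for the example.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, Iterable, List, Sequence, Tuple


def summarize(
    results: Iterable[Tuple[str, Sequence[float]]],
    num_steps: int,
) -> Dict[str, dict]:
    """Aggregate per-actor funnel results by breakdown value.

    results: (breakdown, step_timestamps) pairs, one per actor, where
             step_timestamps has one entry per step the actor completed.
    """
    counts: Dict[str, List[int]] = defaultdict(lambda: [0] * num_steps)
    gaps: Dict[str, List[List[float]]] = defaultdict(
        lambda: [[] for _ in range(num_steps - 1)]
    )
    for breakdown, stamps in results:
        for i in range(len(stamps)):
            counts[breakdown][i] += 1  # actor reached step i + 1
        for i in range(1, len(stamps)):
            gaps[breakdown][i - 1].append(stamps[i] - stamps[i - 1])
    return {
        b: {
            "counts": counts[b],
            # None marks a transition that nobody in this breakdown completed.
            "avg_time": [mean(g) if g else None for g in gaps[b]],
        }
        for b in counts
    }
```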

At least locally, it runs much faster than the existing queries.

Product Questions (about strict and unordered funnels)

While working on this, I dug a bit into usage of strict and unordered funnels. Both have shockingly low usage, at 0.3% and 1.2% respectively.
select
    coalesce(di.filters->>'funnel_order_type', 'ordered') as fot,
    count(*)
from posthog_dashboarditem as di
where di.filters->>'insight' = 'FUNNELS'
group by fot

select
    coalesce(di.filters->>'funnel_order_type', 'ordered') as fot,
    coalesce(di.filters->>'funnel_viz_type', 'steps') as viz,
    count(*)
from posthog_dashboarditem as di
where di.filters->>'insight' = 'FUNNELS'
group by (fot, viz)
order by count(*) desc

Strict mode is supposed to disallow any events between steps in the funnel. In practice, this doesn't make sense: if you start tracking a new event, it could break all your funnel data. Strict mode launched in July 2021, so it's not exactly new, and usage this low isn't a great sign.
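To illustrate the fragility, here is a toy model of strict matching. It is a sketch of the rule as described above, not the PostHog implementation; it ignores conversion windows, and the function and event names are made up for the example.

```python
from typing import Sequence


def strict_steps_reached(event_names: Sequence[str], steps: Sequence[str]) -> int:
    """Longest prefix of `steps` that appears as a contiguous run of events."""
    best = 0
    for start in range(len(event_names)):
        matched = 0
        while (
            matched < len(steps)
            and start + matched < len(event_names)
            and event_names[start + matched] == steps[matched]
        ):
            matched += 1
        best = max(best, matched)
    return best


steps = ["view", "add_to_cart", "pay"]
before = ["view", "add_to_cart", "pay"]
# The same user journey after a new, unrelated event starts being tracked.
after = ["view", "add_to_cart", "tooltip_shown", "pay"]
```

With these inputs, `strict_steps_reached(before, steps)` is 3 while `strict_steps_reached(after, steps)` drops to 2, even though the user did exactly the same thing.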

Unordered mode launched around the same time. It may have a plausible use case, but I think it needs some product focus and changes if we're going to support it. One improvement that would make it more useful is allowing sets of events to be unordered, rather than having unordered be a global toggle for the whole funnel. That would let people track out-of-order user behavior, potentially gated on things like "checkout", instead of a somewhat aimless unordered funnel that only tells you how many steps they completed.

I think we should consider deprecating both modes. The next step would be to check whether any of these strict or unordered funnels get frequent traffic. Thoughts?

Further Questions

How does this work at scale? It runs quickly (much quicker than the old code) locally, but how does it scale for the largest funnels we have?

Follow up

Assuming that it works at scale, the follow-up items are:

Add support for (or remove) unordered mode
Port actors to the new queries (should we always calculate actors so that we can return them faster?)
Performance improvements: optimize the tight loop or rewrite it in a faster language (C, Rust)
Clean up legacy code

Does this work well for both Cloud and self-hosted?

It might take some work to figure out how to deploy this for Cloud.

How did you test this code?

Unit testing. Local testing comparing the old funnel to the new funnel on dev.

posthog-bot · 2024-08-29T22:30:49Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

posthog-bot · 2024-08-29T22:50:07Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

aspicer and others added 30 commits July 4, 2024 07:14

udf

f351508

working

e4a0979

merge

5fd74bf

Merge remote-tracking branch 'origin/master' into aspicer/udf

2322db9

funnels

2cb95db

sorted group array

69674bc

working query

e6a4e6d

3 level funnels

1401ffa

loading conversion window limit

a70ffc2

working with array

abbb018

timestamps

d8b319b

timestamps

fdd66e7

working

9bf5dee

passing breakdown through

2af6b78

passing

e497417

breakdowns passing through

c41cd48

string breakdown

2bd965b

breakdowns and stesting

1c45b7b

time tracking

17ac128

need to make timestamps

6288035

returning values

cb1962f

timings working

f27e4b7

median

4e69b14

nan

89dceb0

working array breakdowns

3e43a8f

hm

dc9a0f4

test case

91d8c4e

step

9e5ce4c

36 tests failing

d13e5c8

order by

65de38a

github-actions bot and others added 16 commits August 28, 2024 05:56

Update query snapshots

38ce36e

add the ability to toggle a udf

fbfd7b1

remove test function

fb5ee7c

Merge branch 'aspicer/udf' of github.com:PostHog/posthog into aspicer…

e65617c

…/udf

mypy

062b660

udf

839274d

fix typo

2e2521d

Update query snapshots

18c9ccf

Merge remote-tracking branch 'origin/master' into aspicer/udf

fbb1407

Merge branch 'aspicer/udf' of github.com:PostHog/posthog into aspicer…

6ec5de8

…/udf

tmate

aad3b15

timeout

a981ea9

tet

62d07b7

bump temporalio

0fd6128

always wait for temporal

ded3751

Update UI snapshots for chromium (2)

209a722

Update UI snapshots for chromium (2)

cd25fa1

aspicer and others added 8 commits September 3, 2024 12:41

Merge remote-tracking branch 'origin/master' into aspicer/udf

5b8313d

Merge branch 'aspicer/udf' of github.com:PostHog/posthog into aspicer…

ebf6c56

…/udf

clean up actions and rename

f245dfc

staged

a6f0c21

Update query snapshots

afacb91

Merge remote-tracking branch 'origin/master' into aspicer/udf

329cd4a

Merge branch 'aspicer/udf' of github.com:PostHog/posthog into aspicer…

f6fa7ab

…/udf

Update query snapshots

32fd423

aspicer merged commit 7fa73a8 into master Sep 5, 2024
93 checks passed

aspicer deleted the aspicer/udf branch September 5, 2024 18:40

timgl pushed a commit that referenced this pull request Sep 10, 2024

feat(insights): launch funnels as a Clickhouse UDF behind a feature f…

380f683

…lag (#23587) Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>
