feat(data-warehouse): integrating data warehouse with trends insight #20320

EDsCODE · 2024-02-13T19:10:13Z

Problem

data warehouse data can't be used outside of SQL right now

Changes

attempt to connect data to insight trends
adds a data warehouse trends query builder
breakdowns and property filters work
unique aggregations and actor queries not supported yet

Design Decision

@Gilbert09 and I discussed this and figured that for the purposes of getting this functionality in trends, splitting at the trends query builder level would be the clearest because:
- Query builders are responsible for converting filter object/json into an AST. AST is passed into the hogql parser/printer which provides the query output
- The current query builders assemble the ast with "events" table everywhere and events table has a specific schema that isn't "generalized". For example, properties is a giant object that's parsed whereas in data warehouse tables, properties are often flattened and not in a giant object
- Drawbacks:
  - will need to do this logic splitting in many places (everywhere where "events" is explicitly parsed)
@timgl suggests we find a place to fit data warehouse table acceptance where we wouldn't need to repeat/split out logic more such as within the hogql parser itself
- I don't think the parser is the appropriate place to handle this because we would be passing in "events" and have unintuitive logic that infers when it actually spit out a data warehouse table which would make the parser abstraction pretty muddy.

Gilbert09

Looking good, I'm guessing DataWarehouseTrendsQueryBuilder was mostly copied over with a few changes so far?

Will the types of aggregations we support be reduced? e.g. does "Weekly Active Users" on a data warehouse series make sense? Hopefully from this, we can reduce some the complexity around aggregations and breakdowns.

posthog/hogql_queries/insights/trends/data_warehouse_trends_query_builder.py

posthog/hogql_queries/insights/trends/test/test_data_warehouse_query_builder.py

Gilbert09 · 2024-02-15T09:26:40Z

posthog/hogql_queries/insights/trends/test/test_data_warehouse_query_builder.py

+            timings=timings,
+        )
+
+    def create_parquet_file(self):


We'll likely need to add some higher-level helper funcs somewhere in our testing framework for doing this kinda stuff. I imagine I'm gonna need the same when doing the table linking too

EDsCODE · 2024-02-15T10:27:49Z

Yep, trying to share as many code paths as possible. I need to figure out if it's realistic to also have an "actor" id mapping which would allow for all the unique user math aggregations

…hog into dw-test-insight-integration

posthog-bot · 2024-02-20T19:32:21Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

…hog into dw-test-insight-integration

posthog-bot · 2024-02-20T19:41:36Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

posthog-bot · 2024-02-20T19:51:41Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

…hog into dw-test-insight-integration

posthog-bot · 2024-02-22T18:15:28Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

posthog-bot · 2024-02-22T18:25:22Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

…hog into dw-test-insight-integration

Gilbert09

Looking gooood, 🥳

posthog/hogql_queries/insights/trends/trends_query_runner.py

…hog into dw-test-insight-integration

posthog-bot · 2024-02-28T17:51:35Z

📸 UI snapshots have been updated

1 snapshot changes in total. 0 added, 1 modified, 0 deleted:

chromium: 0 added, 1 modified, 0 deleted (diff for shard 2)
webkit: 0 added, 0 modified, 0 deleted

Triggered by this commit.

👉 Review this PR's diff of snapshots.

…hog into dw-test-insight-integration

mariusandra

Had a look through the code. A lot of it is a much needed change. However I think we can have an easier time and get rid of some of the duplicated trends code, if we instead make a virtual events table (events_from_stripe) during query time, and then just query against that. This table would be set up with HogQL's custom fields that'll do a bit of proxying. Mapping person_id, distinct_id and timestamp would already enable matching all person properties. The properties field could be a special object that gives access to all other fields on the table, now with a custom data picker in the frontend. Passing the field mapping down to the HogQL layer should make all existing insights work nicely with minimal modifications and no special runners. At least in theory 😅 Slack thread

EDsCODE added 6 commits February 12, 2024 16:30

wip with new datawarehousenode type in trendsquerybuilder

ec4f4e8

split out query builder

1b3dc17

add test

74b8056

add property support

ed55bd9

add test for entity property and non entity property filter

637d82d

basic breakdown working

4cf27a2

EDsCODE requested a review from Gilbert09 February 15, 2024 04:06

Gilbert09 reviewed Feb 15, 2024

View reviewed changes

EDsCODE and others added 13 commits February 20, 2024 10:10

merge master

ff669d0

typing

388a81f

typing

6c778a4

more typing

4017a43

use default args

7395667

more typing

b28ac36

resolved mypy

df6d297

Merge branch 'master' into dw-test-insight-integration

e118dba

Update query snapshots

99dc8f4

remove config

7b649ae

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

620a98a

…hog into dw-test-insight-integration

generate schema properly

3a0ee07

Update UI snapshots for chromium (2)

4c72ca0

EDsCODE and others added 3 commits February 20, 2024 14:35

add breakdown type

b9bb5c9

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

28ffe84

…hog into dw-test-insight-integration

Update UI snapshots for chromium (2)

bc1e18f

EDsCODE and others added 2 commits February 20, 2024 14:42

trim data warehouse query builder

cd12eb3

Update UI snapshots for chromium (2)

a60ce92

github-actions bot and others added 3 commits February 22, 2024 18:05

Update query snapshots

71d7c48

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

735dd55

…hog into dw-test-insight-integration

Update UI snapshots for chromium (2)

03be49e

Update UI snapshots for chromium (2)

a693548

EDsCODE and others added 5 commits February 22, 2024 14:22

match signature

67b0148

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

4b831f0

…hog into dw-test-insight-integration

match signature

6e2165a

update test

e19bc4a

Update query snapshots

9345626

EDsCODE requested a review from Gilbert09 February 23, 2024 05:09

Gilbert09 approved these changes Feb 23, 2024

View reviewed changes

posthog/hogql_queries/insights/trends/trends_query_runner.py Outdated Show resolved Hide resolved

EDsCODE and others added 6 commits February 27, 2024 15:50

add missing fields

f5d20ea

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

46b2a0e

…hog into dw-test-insight-integration

add schema

fac83eb

update typing

6277d39

Merge branch 'master' into dw-test-insight-integration

37df9fd

Update UI snapshots for chromium (2)

8154be1

EDsCODE and others added 6 commits February 28, 2024 13:37

merge master

e1d8672

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

dfb6933

…hog into dw-test-insight-integration

Update query snapshots

45806c2

Merge branch 'master' into dw-test-insight-integration

66c4cd1

Merge branch 'master' into dw-test-insight-integration

4b35e03

Merge branch 'dw-test-insight-integration' of github.com:PostHog/post…

11f7671

…hog into dw-test-insight-integration

EDsCODE requested a review from mariusandra February 29, 2024 17:59

mariusandra reviewed Feb 29, 2024

View reviewed changes

EDsCODE merged commit a1c21f9 into master Feb 29, 2024
89 checks passed

EDsCODE deleted the dw-test-insight-integration branch February 29, 2024 21:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(data-warehouse): integrating data warehouse with trends insight #20320

feat(data-warehouse): integrating data warehouse with trends insight #20320

EDsCODE commented Feb 13, 2024 •

edited

Loading

Gilbert09 left a comment

Gilbert09 Feb 15, 2024

EDsCODE commented Feb 15, 2024

posthog-bot commented Feb 20, 2024

posthog-bot commented Feb 20, 2024

posthog-bot commented Feb 20, 2024

posthog-bot commented Feb 22, 2024

posthog-bot commented Feb 22, 2024

Gilbert09 left a comment

posthog-bot commented Feb 28, 2024

mariusandra left a comment

feat(data-warehouse): integrating data warehouse with trends insight #20320

feat(data-warehouse): integrating data warehouse with trends insight #20320

Conversation

EDsCODE commented Feb 13, 2024 • edited Loading

Problem

Changes

Design Decision

Gilbert09 left a comment

Choose a reason for hiding this comment

Gilbert09 Feb 15, 2024

Choose a reason for hiding this comment

EDsCODE commented Feb 15, 2024

posthog-bot commented Feb 20, 2024

📸 UI snapshots have been updated

posthog-bot commented Feb 20, 2024

📸 UI snapshots have been updated

posthog-bot commented Feb 20, 2024

📸 UI snapshots have been updated

posthog-bot commented Feb 22, 2024

📸 UI snapshots have been updated

posthog-bot commented Feb 22, 2024

📸 UI snapshots have been updated

Gilbert09 left a comment

Choose a reason for hiding this comment

posthog-bot commented Feb 28, 2024

📸 UI snapshots have been updated

mariusandra left a comment

Choose a reason for hiding this comment

EDsCODE commented Feb 13, 2024 •

edited

Loading