chore: add app_metrics2 table #22859

bretthoerner · 2024-06-10T19:11:39Z

Problem

Apps shouldn't be confined to things like integer plugin_config_ids, and app_metrics is sorted on that column. So we need a new table.

Changes

Add app_metrics2, which is sorted more like [log_entries], but keeps the app_metrics metric aggregations.

Also swap successes_after_retry which was unused for skipped which is a metric we wanted.

Note that we may want to add more metrics columns, like latency ones, but those should be easy to add later.

Also note, this PR doesn't change existing app_metrics queries. We'll have to roll this out and then migrate queries and insertion over for existing plugins.

👉 Stay up-to-date with PostHog coding conventions for a smoother review.

Does this work well for both Cloud and self-hosted?

Yes

How did you test this code?

Existing

posthog/models/app_metrics2/sql.py

posthog/clickhouse/test/__snapshots__/test_schema.ambr

posthog/models/app_metrics2/sql.py

posthog-bot · 2024-06-20T07:31:03Z

This PR hasn't seen activity in a week! Should it be merged, closed, or further worked on? If you want to keep it open, post a comment or remove the stale label – otherwise this will be closed in another week.

fuziontech

This is great! Nice job!

posthog/models/app_metrics2/sql.py

bretthoerner · 2024-06-24T20:44:52Z

@benjackwhite No rush from our end, but I think you were doing some kind of experiment to see if this schema would work for Hog apps? Just wanted to let you know this is ready whenever, or we can tweak the schema depending.

benjackwhite · 2024-06-25T08:23:45Z

posthog/models/app_metrics2/sql.py

+    successes SimpleAggregateFunction(sum, Int64),
+    skipped SimpleAggregateFunction(sum, Int64),
+    failures SimpleAggregateFunction(sum, Int64),
+    error_type String


I'm wondering if there is any strong reason why not to make this more generic?

From the work I'm doing so far there is a bunch of metrics that would be really useful to store here
success | failure | requestSuccess | requestFailure | timeout | filtered | skipped_due_to_disabled etc.

If we aggregate on the metric name it should still condense nicely I would imagine?

Suggested change

successes SimpleAggregateFunction(sum, Int64),

skipped SimpleAggregateFunction(sum, Int64),

failures SimpleAggregateFunction(sum, Int64),

error_type String

metric_name LowCardinality(String),

count SimpleAggregateFunction(sum, Int64)

Interesting, I can't think of a strong reason.

My best guess at the historical reasons for why it was this way in app_metrics (1) is that this table was probably created to support this page: https://us.posthog.com/project/2/pipeline/destinations/457/metrics

Your metric_name / count would be able to support the same page with minor query changes, with the relatively minor tradeoff that we'd be going a little "schema-less." It's hard to know if an arbitrary string is a success or a failure or what. But maybe the future of that page is we just break out each unique type + count and we're done? That gets a little weird for showing the total success / failure at the top, and only showing failures broken out at the bottom -- but there's nothing saying we have to do it the exact way we are today.

Maybe @tiina303 has opinions on whether it would cover existing use cases (maybe I'm missing something other than the page linked above)?

This seems like a fine idea. The only worry I have is about us always needing to update the UI and alerting - specifically if some sort of a new failure category is added, e.g. spark graph for example only has success and failure, we can't display 7 lines there it would be too noisy.

A potential way to get the best of both worlds is to have top level success / failure - which can be used by alerting and sparkgraph and then subcategories, which can be used in metrics page to break down by (or in alerts to ignore 'skip' category for example or have different thresholds there).

something like this:

successes SimpleAggregateFunction(sum, Int64), failures SimpleAggregateFunction(sum, Int64), category LowCardinality(String),

I'm 👍 to @benjackwhite's suggestion here.

metric_name LowCardinality(String), count SimpleAggregateFunction(sum, Int64)

Is ideal and can represent almost everything that you want now and in the future ✔️

I pushed up one like @tiina303 suggested. Rather than a single count column, we force you to at least pick one of successes, skipped or failures -- which gives us some way to aggregate / understand counts without having to know what every metric_name "means."

metric_name LowCardinality(String), successes SimpleAggregateFunction(sum, Int64), skipped SimpleAggregateFunction(sum, Int64), failures SimpleAggregateFunction(sum, Int64),

Does that seem reasonable @benjackwhite ? It seems like your success | failure | requestSuccess | requestFailure | timeout | filtered | skipped_due_to_disabled would all fit into there.

And I dropped error_type since a high level error would go in metric_name -- non-metric logs / debug info should go to the log table, and not in there, as discussed above. (That was an artifact from app_metrics (1) that we should remove.)

Talked with Ben, went with metric_kind and count so this can be totally flexible.

i.e. kind could be error and name could be timeout or sql_error or whatever

Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>

bretthoerner marked this pull request as ready for review June 10, 2024 20:07

bretthoerner requested a review from fuziontech as a code owner June 10, 2024 20:07

bretthoerner requested a review from a team June 10, 2024 20:07

bretthoerner commented Jun 10, 2024

View reviewed changes

posthog/models/app_metrics2/sql.py Show resolved Hide resolved

xvello reviewed Jun 11, 2024

View reviewed changes

posthog/clickhouse/test/__snapshots__/test_schema.ambr Outdated Show resolved Hide resolved

posthog/clickhouse/test/__snapshots__/test_schema.ambr Show resolved Hide resolved

tiina303 approved these changes Jun 11, 2024

View reviewed changes

posthog/models/app_metrics2/sql.py Outdated Show resolved Hide resolved

posthog/models/app_metrics2/sql.py Outdated Show resolved Hide resolved

bretthoerner force-pushed the brett/app-metrics2 branch 4 times, most recently from d6a782a to 86b5398 Compare June 11, 2024 17:33

posthog-bot added the stale label Jun 20, 2024

bretthoerner removed the stale label Jun 20, 2024

fuziontech approved these changes Jun 23, 2024

View reviewed changes

posthog/models/app_metrics2/sql.py Show resolved Hide resolved

posthog/models/app_metrics2/sql.py Outdated Show resolved Hide resolved

benjackwhite reviewed Jun 25, 2024

View reviewed changes

bretthoerner force-pushed the brett/app-metrics2 branch from 76d31e4 to 3d45c7b Compare July 1, 2024 14:20

bretthoerner force-pushed the brett/app-metrics2 branch from 61fedd9 to 5b7b4d1 Compare July 15, 2024 17:09

bretthoerner requested a review from a team as a code owner July 15, 2024 17:09

bretthoerner force-pushed the brett/app-metrics2 branch from 5b7b4d1 to 163133f Compare July 15, 2024 17:11

bretthoerner and others added 5 commits July 16, 2024 10:38

chore: add app_metrics2 table

61d55a0

Update query snapshots

e662285

Update query snapshots

bf3d320

metric_kind

334ecd2

rename migration

04bef53

bretthoerner force-pushed the brett/app-metrics2 branch from e82e5fa to 04bef53 Compare July 16, 2024 16:38

Update query snapshots

90cad22

bretthoerner merged commit f787067 into master Jul 16, 2024
84 checks passed

bretthoerner deleted the brett/app-metrics2 branch July 16, 2024 17:21

silentninja pushed a commit to silentninja/posthog that referenced this pull request Aug 8, 2024

chore: add app_metrics2 table (PostHog#22859)

5197d63

Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: add app_metrics2 table #22859

chore: add app_metrics2 table #22859

bretthoerner commented Jun 10, 2024

posthog-bot commented Jun 20, 2024

fuziontech left a comment

bretthoerner commented Jun 24, 2024

benjackwhite Jun 25, 2024

bretthoerner Jun 25, 2024 •

edited

Loading

tiina303 Jul 8, 2024 •

edited

Loading

fuziontech Jul 8, 2024

bretthoerner Jul 15, 2024 •

edited

Loading

bretthoerner Jul 16, 2024

chore: add app_metrics2 table #22859

chore: add app_metrics2 table #22859

Conversation

bretthoerner commented Jun 10, 2024

Problem

Changes

Does this work well for both Cloud and self-hosted?

How did you test this code?

posthog-bot commented Jun 20, 2024

fuziontech left a comment

Choose a reason for hiding this comment

bretthoerner commented Jun 24, 2024

benjackwhite Jun 25, 2024

Choose a reason for hiding this comment

bretthoerner Jun 25, 2024 • edited Loading

Choose a reason for hiding this comment

tiina303 Jul 8, 2024 • edited Loading

Choose a reason for hiding this comment

fuziontech Jul 8, 2024

Choose a reason for hiding this comment

bretthoerner Jul 15, 2024 • edited Loading

Choose a reason for hiding this comment

bretthoerner Jul 16, 2024

Choose a reason for hiding this comment

bretthoerner Jun 25, 2024 •

edited

Loading

tiina303 Jul 8, 2024 •

edited

Loading

bretthoerner Jul 15, 2024 •

edited

Loading