
feat(experiments): Apply new Trends continuous stats methods #26835

Merged
40 commits merged into master from experiments/trends-continuous on Dec 20, 2024

Conversation

@danielbachhuber (Contributor) commented Dec 11, 2024

See #26713

Changes

Introduces a new set of methods for calculating Trends continuous values:

  • calculate_probabilities_v2_continuous
  • are_results_significant_v2_continuous
  • calculate_credible_intervals_v2_continuous
  • calculate_expected_loss_v2_continuous

Uses the Trends continuous methods when stats_version=2.

Also introduces expected loss calculation for Trends count.
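To illustrate the idea behind an expected loss calculation, here is a generic Monte Carlo sketch under assumed lognormal posteriors — the function name, priors, and parameters are hypothetical and this is not the PR's actual implementation:

```python
import numpy as np

# Given posterior samples of each variant's mean, the expected loss of
# shipping a variant is the average amount by which the best variant
# would have beaten it. (Illustrative sketch, not PostHog's code.)
def expected_loss_mc(chosen, alternative):
    best = np.maximum(chosen, alternative)
    return float(np.mean(best - chosen))

rng = np.random.default_rng(42)
# Hypothetical lognormal posteriors over the two variants' means.
control = rng.lognormal(mean=np.log(10), sigma=0.05, size=100_000)
test = rng.lognormal(mean=np.log(11), sigma=0.05, size=100_000)

loss_test = expected_loss_mc(test, control)      # small: test is probably best
loss_control = expected_loss_mc(control, test)   # roughly the gap in means
```

A small expected loss means there is little downside to shipping that variant even if the other one turns out to be better.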

How did you test this code?

Tests should pass.

@danielbachhuber (Contributor, Author) commented:

@andehen It would be really helpful if you could produce some sample values with your notebook. I can then implement those in tests and we can see where we're at for the log_variance value.

@danielbachhuber (Contributor, Author) commented:

> It would be really helpful if you could produce some sample values with your notebook. I can then implement those in tests and we can see where we're at for the log_variance value.

@andehen log_variance = 2 seems to work well for these test cases. I'd be curious to hear your opinion on that, and what you think the delta value should be for these tests.
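For context on how a fixed log_variance can enter such a calculation, here is a hedged sketch of a normal-normal conjugate update on the mean of log-transformed values with a known, fixed variance — the prior parameters and function name are hypothetical, and this may differ from the PR's actual math:

```python
import numpy as np

# Conjugate normal update for the log-mean when the variance of the
# log values is treated as *fixed and known*; log_variance=2 mirrors
# the value discussed above, the prior is hypothetical.
def posterior_log_mean(log_values, log_variance=2.0, prior_mean=0.0, prior_var=100.0):
    n = len(log_values)
    precision = 1.0 / prior_var + n / log_variance
    post_mean = (prior_mean / prior_var + np.sum(log_values) / log_variance) / precision
    return post_mean, 1.0 / precision  # posterior mean and variance of the log-mean

logs = np.log([10.0, 12.0, 9.0, 11.0, 10.5])
mu, var = posterior_log_mean(logs)  # more data shrinks `var` further
```

Because the variance is assumed rather than estimated, the posterior can be overconfident whenever the real data is noisier than the assumption.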

@danielbachhuber danielbachhuber requested a review from a team December 13, 2024 22:01
@danielbachhuber danielbachhuber marked this pull request as ready for review December 13, 2024 22:01
@jurajmajerik (Contributor) left a comment:


Looks great! Why not already plug it into the query runner though?

```python
if self.stats_version == 2:
    probabilities = calculate_probabilities_v2_count(control_variant, test_variants)
```

@danielbachhuber (Contributor, Author) commented:

> Why not already plug it into the query runner though?

My thought on Friday was to ship what I have incrementally.

@danielbachhuber danielbachhuber requested a review from a team December 19, 2024 23:56
@andehen (Contributor) left a comment:


Looks good! 👍 Didn't spot any errors in the code itself.

As we have talked about, the key issue to address further is the assumption of a fixed variance. If one runs a test now where the actual variance in the data is higher, we will claim the results are significant when they are not. We can use a higher value to protect against this, at the cost of saying "not significant" when we likely could (in the event the variance is lower than we assume). It's really hard to say what the value should be without doing an analysis on actual data 🤷
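The risk described above can be illustrated with a small A/A simulation — this is a generic sketch with hypothetical names and parameters, not the PR's code: when significance is computed under an assumed variance smaller than the data's true variance, a no-difference test is flagged "significant" far more often than the nominal 5%.

```python
import math
import numpy as np

# A/A test: both arms are drawn from the same distribution, so any
# "significant" result is a false positive. The standard error is
# computed from the *assumed* variance, not from the data.
def aa_false_positive_rate(true_sigma, assumed_sigma, n=200, trials=2000, seed=0):
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(trials):
        a = rng.normal(0.0, true_sigma, n)
        b = rng.normal(0.0, true_sigma, n)
        se = assumed_sigma * math.sqrt(2.0 / n)
        z = (b.mean() - a.mean()) / se
        hits += abs(z) > 1.96
    return hits / trials

well_specified = aa_false_positive_rate(true_sigma=2.0, assumed_sigma=2.0)  # ≈ 0.05
underestimated = aa_false_positive_rate(true_sigma=2.0, assumed_sigma=1.0)  # well above 0.05
```

The converse also holds: assuming a variance larger than the true one makes the test conservative, which is the trade-off described in the comment above.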

@jurajmajerik (Contributor) left a comment:


Looks great, thanks for doing this!

@andehen, could you please create an issue with a detailed description to track the concern mentioned in your comment? It's important to keep track of this, though I consider it non-blocking for now. This calculation is already a big improvement over the current state of things :)

@danielbachhuber danielbachhuber merged commit 1979d74 into master Dec 20, 2024
92 checks passed
@danielbachhuber danielbachhuber deleted the experiments/trends-continuous branch December 20, 2024 12:45
@jurajmajerik (Contributor) commented:

@danielbachhuber there's an issue here I've only spotted now:

https://github.com/PostHog/posthog/blob/master/posthog/hogql_queries/experiments/experiment_trends_query_runner.py#L134-L138

We're still overwriting the math property from AVG to SUM. So the new probability method will never receive AVG. This went unnoticed because we're only unit-testing the probability method, and not including it in a more integrated query runner test. Would be nice to have a small test within test_experiment_trends_query_runner.py to ensure the correct AVG value is received.

@danielbachhuber (Contributor, Author) commented:

> there's an issue here I've only spotted now:

Resolved with #27133

3 participants