
chore(experiments): Stats cleanup #27151

Open

wants to merge 12 commits into master
Conversation

danielbachhuber
Contributor

@danielbachhuber danielbachhuber commented Dec 24, 2024

See #26713

Changes

Some cleanup while working on PostHog/posthog.com#10217:

  • Uses `places` instead of `delta` for credible intervals (see the sketch after this list).
  • Improves function descriptions across the board.
  • Allows a small amount of flakiness in all of the tests.
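
For reviewers who haven't used both, here's a minimal sketch of the difference between `places` and `delta` in `unittest.TestCase.assertAlmostEqual` (the values are illustrative, not taken from this PR's tests):

```python
import unittest


class ToleranceStyles(unittest.TestCase):
    def test_places_vs_delta(self):
        # places=2 rounds the difference to 2 decimal places before comparing,
        # so anything within roughly 0.005 of the expected value passes.
        self.assertAlmostEqual(0.0512, 0.05, places=2)

        # delta=0.05 accepts any value in [0.00, 0.10] when the expected
        # value is 0.05 -- a much looser check for small probabilities.
        self.assertAlmostEqual(0.09, 0.05, delta=0.05)


if __name__ == "__main__":
    unittest.main()
```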

How did you test this code?

Tests should pass.

@danielbachhuber danielbachhuber marked this pull request as ready for review December 24, 2024 22:48
@danielbachhuber danielbachhuber requested a review from a team December 24, 2024 22:48
@danielbachhuber danielbachhuber enabled auto-merge (squash) December 24, 2024 23:44
-        conversion rate) compared to all other variants, including the control. It uses samples
-        drawn from the posterior Beta distributions of each variant's conversion rate.
+        conversion rate) compared to all other variants, including the control. It uses Beta
+        distributions to model conversion rates, which is the conjugate prior for binomial data.
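
For context, the docstring above describes estimating win probability by sampling each variant's posterior. A minimal, self-contained sketch of that idea, with hypothetical counts and a uniform Beta(1, 1) prior (illustrative only, not the PostHog implementation):

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical (conversions, exposures) per variant.
variants = {"control": (100, 2000), "test": (120, 2000)}

# Posterior for each conversion rate: Beta(1 + conversions, 1 + failures).
samples = {
    name: rng.beta(1 + conv, 1 + (total - conv), size=100_000)
    for name, (conv, total) in variants.items()
}

# Win probability = fraction of draws where a variant's sampled conversion
# rate beats every other variant's.
stacked = np.vstack(list(samples.values()))
wins = np.bincount(stacked.argmax(axis=0), minlength=len(samples))
print(dict(zip(samples, wins / stacked.shape[1])))
```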
Contributor

What's a "conjugate prior for binomial data"? In the code comments I'd aim for the same level of simplicity as in our docs :)

Contributor Author

Explained with aeff274
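
For anyone else reading this thread, a short sketch of what "Beta is the conjugate prior for binomial data" means in practice: with a Beta prior and binomial (converted / didn't convert) observations, the posterior is again a Beta distribution, so credible intervals come straight from its quantiles. Illustrative only, not the wording or code added in aeff274:

```python
from scipy import stats

prior_alpha, prior_beta = 1, 1     # uniform Beta(1, 1) prior
conversions, exposures = 50, 1000  # hypothetical variant data

# Conjugacy: add the observed counts to the prior parameters.
posterior = stats.beta(prior_alpha + conversions, prior_beta + (exposures - conversions))

# A 95% credible interval for the conversion rate is just the posterior's
# 2.5th and 97.5th percentiles.
lower, upper = posterior.ppf([0.025, 0.975])
print(f"95% credible interval: [{lower:.3f}, {upper:.3f}]")
```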

self.assertEqual(significance, ExperimentSignificanceCode.LOW_WIN_PROBABILITY)
self.assertEqual(p_value, 1)

# Check credible intervals
self.assertAlmostEqual(intervals["control"][0], 0.05, delta=0.05)
Contributor

Uh oh, the tolerated delta here is definitely too wide 👍

@@ -58,16 +60,16 @@ def run_test(stats_version, calculate_probabilities, are_results_significant, ca

     self.assertEqual(len(probabilities), 2)
     if stats_version == 2:
-        self.assertAlmostEqual(probabilities[0], 0.15, delta=0.1)
-        self.assertAlmostEqual(probabilities[1], 0.85, delta=0.1)
+        self.assertAlmostEqual(probabilities[0], 0.149, delta=0.1)
Contributor

What's the reason for keeping delta=0.1 here? The range seems too wide (0.049 -> 0.249, i.e. 4.9% to 24.9%)

Contributor Author

Not sure, must've skipped over it. Updated with e37b745
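
As a purely illustrative aside (example tolerances, not the values used in e37b745), this is the kind of range a given `delta` accepts:

```python
import unittest


class ToleranceWidth(unittest.TestCase):
    def test_delta_widths(self):
        expected = 0.149

        # delta=0.1 accepts anything in [0.049, 0.249], the 4.9%..24.9%
        # range flagged in the review above.
        for observed in (0.05, 0.149, 0.24):
            self.assertAlmostEqual(observed, expected, delta=0.1)

        # A hypothetical tighter delta narrows the accepted range to
        # [0.139, 0.159].
        self.assertAlmostEqual(0.151, expected, delta=0.01)


if __name__ == "__main__":
    unittest.main()
```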

@danielbachhuber danielbachhuber requested review from a team and neilkakkar January 2, 2025 14:15