Skip to content

Commit

Permalink
Update stat_specs.Rmd
Browse files Browse the repository at this point in the history
Updated discussion on "Standard errors of the treatment effect".

Signed-off-by: wolbersm <[email protected]>
  • Loading branch information
wolbersm authored Jan 17, 2024
1 parent c66fbca commit d96ea9b
Showing 1 changed file with 26 additions and 7 deletions.
33 changes: 26 additions & 7 deletions vignettes/stat_specs.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -439,13 +439,32 @@ Treatment effects based on conditional mean imputation are deterministic. All ot

### Standard errors of the treatment effect

All approaches provide frequentist consistent estimates of the standard error for imputation under a MAR assumption. For reference-based imputation methods, methods based on conditional mean imputation or bootstrapped MI provide frequentist consistent estimates of the standard error whereas Rubin's rules applied to conventional MI methods provides so-called information anchored inference (@Bartlett2021, @CroEtAl2019, @vonHippelBartlett2021, @Wolbers2021). Frequentist consistent estimates of the standard error lead to confidence intervals and tests which have (asymptotically) correct coverage and type I error control under the assumption that the reference-based assumption reflects the true data-generating mechanism. For finite samples, simulations for a sample size of $n=100$ per group reported in @Wolbers2021 demonstrated that conditional mean imputation combined with the jackknife provided exact protection of the type one error rate whereas the bootstrap was associated with a small type I error inflation (between 5.1\% to 5.3\% for a nominal level of 5\%).

It is well known that Rubin's rules do not provide frequentist consistent estimates of the standard error for reference-based imputation methods (@Seaman2014, @LiuPang2016, @Tang2017, @CroEtAl2019, @Bartlett2021). Standard errors from Rubin's rule are typically larger than frequentist standard error estimates leading to conservative inference and a corresponding loss of statistical power, see e.g. the simulations reported in @Wolbers2021.
Intuitively, this occurs because reference-based imputation methods borrow information from the reference group for imputations in the intervention group leading to a reduction in the frequentist variance of the resulting treatment effect contrast which is not captured by Rubin’s variance estimator. Formally, this occurs because the imputation and analysis models are uncongenial for reference-based imputation methods (@Meng1994, @Bartlett2021).
@CroEtAl2019 argued that Rubin’s rule is nevertheless valid for reference-based imputation methods because it is approximately information-anchored, i.e. that the proportion of information lost due to missing data under MAR is approximately preserved in reference-based analyses. In contrast, frequentist standard errors for reference based imputation are not information anchored for reference-based imputation and standard errors under reference-based assumptions are typically smaller than those for MAR imputation.

Information anchoring is a sensible concept for sensitivity analyses, whereas for a primary analyses, it may be more important to adhere to the principles of frequentist inference. Analyses of data with missing observations generally rely on unverifiable missing data assumptions and the assumptions for reference-based imputation methods are relatively strong. Therefore, these assumptions need to be clinically justified as appropriate or at least conservative for the considered disease area and the anticipated mechanism of action of the intervention.
All approaches provide frequentist consistent estimates of the standard error for imputation under a MAR assumption.

For reference-based imputation methods, the situation is more complicated and two different types of variance estimators have been proposed in the statistical literature (@Bartlett2021).
The first is the frequentist variance which describes the actual repeated sampling variability of the estimator.
If the reference-based missing data assumption is correctly specified, then the resulting inference based on this variance is correct in the frequentist sense, i.e. hypothesis tests have asymptotically correct type I error control and confidence intervals have correct coverage
probabilities under repeated sampling (@Bartlett2021, @Wolbers2021).
However, as a consequence of the strong reference-based missingness assumption which borrows information from the control arm for estimation in the active arm, the size of the frequentist standard error of the treatment effects may decrease with increasing amounts of missing data.
The second proposal is the so-called "information-anchored" variance which was originally proposed in the context of sensitivity analyses (@CroEtAl2019). This variance estimator is based on disentangling point estimation and variance estimation altogether.
The information-anchoring principle described in @CroEtAl2019 states that the relative increase in the variance of the treatment effect estimator under MAR imputation with increasing amounts of missingness should be preserved for reference-based imputation methods.
The resulting information-anchored variance is typically very similar to the variance under MAR imputation and increases with increasing amounts of missing data.
However, the information-anchored variance does not reflect the actual variability of the reference-based estimator under repeated sampling and the resulting inference is highly conservative resulting in a substantial power loss (@Wolbers2021).
Moreover, to date, no Bayesian or frequentist framework has been developed under which the information-anchored variance provides correct inference for reference-based missingness assumptions, nor is it clear whether such a framework can even be developed.

Reference-based conditional mean imputation (`method_condmean()`) and bootstrapped likelihood-based multiple methods (`method = method_bmlmi()`) obtain standard errors via resampling and hence target the frequentist variance (@Wolbers2021, @vonHippelBartlett2021).
For finite samples, simulations for a sample size of $n=100$ per group reported in @Wolbers2021 demonstrated that conditional mean imputation combined with the jackknife (`method_condmean(type = "jackknife")`) provided exact protection of the type one error rate whereas the bootstrap was associated with a small type I error inflation (between 5.1\% to 5.3\% for a nominal level of 5\%).
For reference-based conditional mean imputation, an alternative information-anchored variance can be obtained by following a proposal by @Lu2021.
The basic idea of @Lu2021 is to obtain the information-anchored variance via a MAR imputation combined with a delta-adjustment where delta is selected in a data-driven way to match the reference-based estimator.
For conditional mean imputation, the proposal by @Lu2021 can be implemented by choosing the delta-adjustment as the difference between the conditional mean imputation under the chosen reference-based assumption and MAR on the original dataset.
An illustration of how the different variances can be obtained for conditional mean imputation in `rbmi` is provided in the vignette "Frequentist and information-anchored inference for reference-based conditional mean imputation" (`vignette(topic = "CondMean_InfoAnchored", package = "rbmi")`).

Reference-based Bayesian (or approximate Bayesian) multiple imputation methods combined with Rubin's rules (`method_bayes()` and `method_approxbayes()`) target the information-anchored variance (@CroEtA2019).
A frequentist variance for these methods could in principle be obtained via bootstrap or jackknife re-sampling their point estimators but this would be very computationally intensive and is not directly supported by `rbmi`.

Our view is that for primary analyses, accurate type I error control (which can be obtained by using the frequntist variance) is more important than adherence to the information anchoring principle which, to us, is
not fully compatible with the strong reference-based missing data assumptions. In any case, if reference-based imputation is used for the primary analysis, it is critical that the chosen
reference-based assumption can be clinically justified, and that suitable sensitivity analyses are conducted to stress-test these assumptions.

Conditional mean imputation combined with the jackknife is the only method which leads to deterministic standard error estimates and, consequently, confidence intervals and $p$-values are also deterministic. This is particularly important in a regulatory setting where it is important to ascertain whether a calculated $p$-value which is close to the critical boundary of 5% is truly below or above that threshold rather than being uncertain about this because of Monte Carlo error.

Expand Down

0 comments on commit d96ea9b

Please sign in to comment.