StatQuest |
Khan Academy style videos by Josh Starmer on distributions, parameters, linear models, machine learning, bioinformatics, and R |
Statistical Tests Are Linear Models |
Common statistical tests such as the t-test and ANOVA are just specialized forms of linear models |
Flexible Imputation For Missing Data |
Statistical aspects of missing data. Covers MCAR/MAR/MNAR assumptions, complete-case analysis, and MICE imputation methods |
Modern Statistics For Modern Biology |
Intro statistics for bioinformatics applications. Covers distributions, PCA, hypothesis testing, RNASeq analysis, machine learning, experimental design |
Biostatistics For Biomedical Research |
Intro biostatistics, regression models, randomized controlled trials (RCTs), observational studies, diagnosis, statistical pitfalls, high dimensional modeling |
Statistical Problems to Avoid |
Common statistical issues to think about in the design, analysis, and publication of research |
Common Statistical Misconceptions |
List of statistical misconceptions with sources to disprove each misconception |
Regression Modeling Strategies |
Comprehensive dive into regression modeling covers linear, logistic, and ordinal regression as well as survival analysis with lots of case studies |
Communicating Frequentist Results |
Proper way to describe treatment effects according to frequentist theory |
Bayesian Re-analysis of Toss Up Clinical Trial |
How to interpret clinical trials that slightly miss p < .05 cutoff and bayesian methods to re-analyze trials |
Statistics Glossary |
Glossary of Statistical Terms |
Observed Power should be avoided |
Issues with observed power overestimating the study's true power, misleading researchers |
Observed Power Simulation Study |
Simulation study showing problems with observed power |
Criticisms of ROC curves for medical decision-making |
Problems with using ROC curves to evaluate the quality of medical decision-making models |
Table 1 P-values in RCTs |
Baseline differences should not be tested in RCTs as differences are already due to chance from randomization |
Pseudo R^2 Explanation |
Intuitive explanation behind OLS R^2 and generalizes this intuition to pseduo R^2 measures for binary outcomes |
GLMs Explained |
Introduction to Generalized Linear Models (GLMs) |
Multicollinearity and Omitted Variable Bias |
Distinction between collinearity and omitted variable bias |
Cox PH Diagnostics |
Guide to checking Cox PH regression assumptions |
Dealing with Unbalanced Datasets |
Why resampling techniques should be avoided when analyzing datasets with unbalanced outcomes |
Statistical Rethinking with brms |
Uses brms and tidyverse code instead of base R |
Interpretting P-Value Histogram |
How to interpret results from running several statistical tests before adjusting for multiple comparisons |
Correct Confidence Intervals For GLMs |
Gavin Simpson's blog shows to how compute confidence intervals for GLM models that obey the constraints of the response variable |
Beyond Multiple Linear Regression |
Statistics textbook explaining GLMs and multi-level models |
Mixed Models with R |
Explanation of mixed effects models with R code |
Using Mixture Models for Clustering |
Tutorial using mixtools R package |
Applied Statistics for Experimental Biology |
Textbook with an in depth look at linear modeling for biological data |