From 9c4937bae037e8db4aa46c0c96dc99adf55b55b1 Mon Sep 17 00:00:00 2001
From: Stephanie Zimmer
Date: Sun, 3 Mar 2024 14:54:40 -0500
Subject: [PATCH] Incorporate reviewers feedback on testing chapter (#101)

* Fix table references

* incorporate testing feedback 1

* incorporate testing feedback 2

* Apply suggestions from code review

---------

Co-authored-by: Rebecca Powell
---
 06-statistical-testing.Rmd | 97 +++++++++++++++++++++++++-------------
 1 file changed, 64 insertions(+), 33 deletions(-)

diff --git a/06-statistical-testing.Rmd b/06-statistical-testing.Rmd
index 164c1228..3e2162fb 100644
--- a/06-statistical-testing.Rmd
+++ b/06-statistical-testing.Rmd
@@ -1,5 +1,12 @@
 # Statistical testing {#c06-statistical-testing}

+```{r}
+#| label: stattest-styler
+#| include: false
+knitr::opts_chunk$set(tidy = 'styler')
+```
+
+
 ::: {.prereqbox-header}
 `r if (knitr:::is_html_output()) '### Prerequisites {- #prereq6}'`
 :::
@@ -63,7 +70,16 @@ When analyzing results from a survey, the point estimates described in Chapter \

 The general idea of statistical testing is the same for data obtained through surveys and data obtained through other methods, where we compare the point estimates and variance estimates of each statistic to see if statistically significant differences exist. However, statistical testing for complex surveys involves additional considerations due to the need to account for the sampling design in order to obtain accurate variance estimates.

-The functions in the {survey} packages allow for the correct estimation of the variances. This chapter will cover the following statistical tests with survey data and functions:
+Statistical testing, also called hypothesis testing, involves declaring a null and an alternative hypothesis. The null hypothesis is denoted as $H_0$ and the alternative hypothesis is denoted as $H_A$. The null hypothesis is the default assumption, namely that there are no differences in the data, or that the data are operating under "standard" behaviors. The alternative hypothesis, on the other hand, is the break from the "standard" and is what we are trying to determine whether the data support.
+
+Let's review an example outside of survey data. If we are flipping a coin, a null hypothesis would be that the coin is fair and that each side has an equal chance of being flipped. In other words, the probability of the coin landing on each side is 1/2. An alternative hypothesis, in contrast, could be that the coin is unfair and that one side has a higher probability of being flipped (e.g., a probability of 1/4 to get heads, but a probability of 3/4 to get tails). We write this set of hypotheses as:
+
+- $H_0: \rho_{heads} = \rho_{tails}$, where $\rho_{x}$ is the probability of flipping the coin and having it land on heads ($\rho_{heads}$) or tails ($\rho_{tails}$)
+- $H_A: \rho_{heads} \neq \rho_{tails}$
+
+When we conduct hypothesis testing, the statistical models calculate a p-value, which shows how likely we are to observe the data if the null hypothesis is true. If the p-value (a probability between 0 and 1) is small, we have strong evidence to reject the null hypothesis, as it is unlikely that we would see the data we are observing if the null hypothesis were true. However, if the p-value is large, we say we do not have evidence to reject the null hypothesis. The cutoff for what counts as a small p-value is determined by the type 1 error rate, known as $\alpha$.
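+To make these ideas concrete before turning to survey data, here is a minimal sketch (not part of the original chapter) that uses base R's `binom.test()` with hypothetical counts, 62 heads out of 100 flips, to compute the p-value for the null hypothesis that the coin is fair.
+
+```r
+# Hypothetical data: 100 coin flips, 62 of which landed on heads
+flips_heads <- 62
+flips_total <- 100
+
+# Exact binomial test of H0: probability of heads = 1/2
+# A small p-value is evidence against the null hypothesis that the coin is fair
+binom.test(x = flips_heads, n = flips_total, p = 0.5)
+```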
A common type 1 error value for statistical testing is $\alpha = 0.05$.^[For more information on statistical testing, we recommend reviewing introductory statistics textbooks.] It is common for explanations of statistical testing to refer to the confidence level. The confidence level is the complement of the type 1 error ($1 - \alpha$). Thus, if $\alpha = 0.05$, the confidence level would be 95%.
+
+The functions in the {survey} package allow for the correct estimation of the variances. This chapter will cover the following statistical tests with survey data and functions:

 * Comparison of proportions `svyttest()`
 * Comparison of means `svyttest()`
@@ -71,9 +87,9 @@ The functions in the {survey} packages allow for the correct estimation of the v
 * Tests of independence `svychisq()`
 * Tests of homogeneity `svychisq()`

-## Dot Notation
+## Dot Notation {#dot-notation}

-Up to this point, we have shown functions that use wrappers from the {srvyr} package. This means that the functions work with tidyverse syntax. However, the functions in this chapter do not have wrappers in the {srvyr} package and are instead used directly from the {survey} package. Therefore, the design object is *not* the first argument, and to use these functions with the magrittr pipe `%>%` and tidyverse syntax, we will need to use dot (`.`) notation.
+Up to this point, we have shown functions that use wrappers from the {srvyr} package. This means that the functions work with tidyverse syntax. However, the functions in this chapter do not have wrappers in the {srvyr} package and are instead used directly from the {survey} package. Therefore, the design object is *not* the first argument, and to use these functions with the magrittr pipe (`%>%`) and tidyverse syntax, we will need to use dot (`.`) notation.^[This could change in the future if another package is built or {srvyr} is expanded to work with **tidymodels**, but no such plans are known at this time.]

 Functions that work with the magrittr pipe (`%>%`) have the data as the first argument. When we run a function with the pipe, it automatically places anything to the left of the pipe into the first argument of the function to the right of the pipe. For example, if we wanted to take the `mtcars` data and filter to cars with six cylinders, we can write the code in at least four different ways:

@@ -93,13 +109,20 @@ svydata_des %>%
 By default, the pipe places the left-hand object in the first argument spot. Placing the dot (`.`) in the second argument spot indicates that the survey design object `svydata_des` should be used in the second argument and not the first.

-Alternatively, named arguments could be used to place the dot first, as in the following:
+Alternatively, named arguments could be used to place the dot first, since named arguments can appear at any location, as in the following:

 ```r
 svydata_des %>%
   svyttest(design = ., x ~ y)
 ```

+However, the following code will not work, as the `svyttest()` function expects the formula as the first argument when arguments are not named:
+
+```r
+svydata_des %>%
+  svyttest(., x ~ y)
+```
+
 ## Comparison of Proportions and Means {#stattest-ttest}

 We use t-tests to compare two proportions or means. T-tests allow us to determine if one proportion or mean is statistically different from another. They are commonly used to determine if a single estimate differs from a known value (e.g., 0 or 50%) or to compare two group means (e.g., North versus South).
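+For readers who want a non-survey point of reference (an aside, not part of the original chapter), a two-group comparison with base R's `t.test()` on the built-in `mtcars` data from the dot notation examples might look like the sketch below; the rest of this section covers the survey-aware `svyttest()` equivalent.
+
+```r
+# Non-survey illustration: compare mean miles per gallon (mpg) between
+# automatic (am == 0) and manual (am == 1) transmissions in mtcars
+t.test(mpg ~ am, data = mtcars)
+```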
Comparing a single estimate to a known value is called a *one sample t-test*, and we can set up the hypothesis test as follows: @@ -122,7 +145,7 @@ When we do not have survey data, we can use the `t.test()` function from the {st - We need to use the survey design object instead of the original data frame - We can only use a formula and not separate x and y data - - The confidence level cannot be specified and will always be set to 95%. However, we will show examples of how the confidence level can be changed after running using the `confint()` function. + - The confidence level cannot be specified and will always be set to 95%. However, we will show examples of how the confidence level can be changed after running the `svyttest()` function by using the `confint()` function. Here is the syntax for the `svyttest()` function: @@ -138,18 +161,18 @@ The arguments are: * `design`: survey design object * `...`: This passes options on for one-sided tests only, and thus, we can specify `na.rm=TRUE` -Notice that the first argument here is the `formula` and not the `design`. This means we must use the dot `(.)` if we pipe in the survey design object (as described at the beginning of this chapter). +Notice that the first argument here is the `formula` and not the `design`. This means we must use the dot `(.)` if we pipe in the survey design object (as described in Section \@ref(dot-notation)). The `formula` argument can take several different forms depending on what we are measuring. Here are a few common scenarios: 1. **One-sample t-test:** - a. **Comparison to 0:** `var ~ 0`, where `var` is the measure of interest, and we compare it to the value `0`. For example, we could test if the population mean of household debt is different from `0`. - b. **Comparison to a different value:** `var - value ~ 0`, where `var` is the measure of interest and `value` is what we are comparing to. For example, we could test if the proportion of the population that has blue eyes is different from `25%` by using `var - 0.25 ~ 0`. Note that specifying the formula as `var ~ 0.25` is not equivalent. + a. **Comparison to 0:** `var ~ 0`, where `var` is the measure of interest, and we compare it to the value `0`. For example, we could test if the population mean of household debt is different from `0` given the sample data collected. + b. **Comparison to a different value:** `var - value ~ 0`, where `var` is the measure of interest and `value` is what we are comparing to. For example, we could test if the proportion of the population that has blue eyes is different from `25%` by using `var - 0.25 ~ 0`. Note that specifying the formula as `var ~ 0.25` is not equivalent and will result in a syntax error. 2. **Two-sample t-test:** a. **Unpaired:** - **2 level grouping variable:** `var ~ groupVar`, where `var` is the measure of interest and `groupVar` is a variable with two categories. For example, we could test if the average age of the population who voted for president in 2020 differed from the age of people who did not vote. In this case, age would be used for `var`, and a binary variable indicating voting activity would be the `groupVar`. - - **3+ level grouping variable:** `var ~ groupVar == level`, where `var` is the measure of interest, `groupVar` is the categorical variable, and `level` is the category level to isolate. For example, we could test if the test scores in one classroom differed from all other classrooms. - b. 
**Paired:** `var_1 - var_2 ~ 0`, where `var_1` is the first variable of interest and `var_2` is the second variable of interest. For example, we could test if test scores on a subject differed between the start and the end of a course. + - **3+ level grouping variable:** `var ~ groupVar == level`, where `var` is the measure of interest, `groupVar` is the categorical variable, and `level` is the category level to isolate. For example, we could test if the test scores in one classroom differed from all other classrooms where `groupVar` would be the variable holding the values for classroom IDs and `level` is the classroom ID we want to compare to the others. + b. **Paired:** `var_1 - var_2 ~ 0`, where `var_1` is the first variable of interest and `var_2` is the second variable of interest. For example, we could test if test scores on a subject differed between the start and the end of a course so `var_1` would be the test score at the beginning of the course and `var_2` would be the score at the end of the course. The `na.rm` argument defaults to `FALSE`, which means if any data is missing, the t-test will not compute. Throughout this chapter, we will always set `na.rm = TRUE`, but before analyzing the survey data, review the notes provided in Chapter \@ref(c03-understanding-survey-data-documentation) to better understand how to handle missing data. @@ -159,12 +182,12 @@ Let's walk through a few examples using the ANES and RECS data. #### Example 1: One-sample t-test for Mean {.unnumbered #stattest-ttest-ex1} -RECS asks respondents to indicate what temperature they set their house to during the summer at night.^[During the summer, what is your home’s typical indoor temperature inside your home at night?] In our data, we have called this variable `SummerTempNight`. If we want to see if the average U.S. household sets its temperature at a value different from 68$^\circ$F, we could set up the hypothesis as follows: +RECS asks respondents to indicate what temperature they set their house to during the summer at night.^[During the summer, what is your home’s typical indoor temperature inside your home at night?] In our data, we have called this variable `SummerTempNight`. If we want to see if the average U.S. household sets its temperature at a value different from 68$^\circ$F^[This is the temperature that Stephanie prefers at night during the summer, and she wanted to see if she was different from the population.], we could set up the hypothesis as follows: - $H_0: \mu = 68$ where $\mu$ is the average temperature U.S. households set their thermostat to in the summer at night - $H_A: \mu \neq 68$ -To conduct this in R, we use `svyttest()` with and subtract the temperature on the right-hand side of the formula: +To conduct this in R, we use `svyttest()` and subtract the temperature on the left-hand side of the formula: ```{r} #| label: stattest-ttest-ex1 @@ -205,6 +228,7 @@ confint(ttest_ex1, level = 0.95) confint(ttest_ex1, level = 0.8) ``` +In this case, neither confidence interval contains 0, and we draw the same conclusion from either that the average temperature households set their thermostat in the summer at night is significantly higher than 68$^\circ$F. #### Example 2: One-sample t-test for Proportion {.unnumbered #stattest-ttest-ex2} @@ -306,7 +330,7 @@ U.S. households set their thermostat on average `r signif(ttest_ex4$estimate,2)` Chi-square tests ($\chi^2$) allow us to examine multiple proportions using a goodness-of-fit test, a test of independence, or a test of homogeneity. 
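+For a non-survey point of reference (an aside, not part of the original chapter), a chi-square test of independence with base R's `chisq.test()` on the built-in `Titanic` data might look like the sketch below; the remainder of this section covers the survey-aware versions of these three tests.
+
+```r
+# Non-survey illustration: collapse the 4-way Titanic table to a
+# Class x Survived table and test whether survival is independent of class
+titanic_tab <- apply(Titanic, c(1, 4), sum)
+titanic_tab
+chisq.test(titanic_tab)
+```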
These three tests have the same $\chi^2$ distributions but with slightly different underlying assumptions. -First, **goodness-of-fit** tests are used when comparing *observed* data to *expected* data. For example, this could be used to determine if respondent demographics (the observed data) match known population information (the expected data). In this case, we can set up the hypothesis test as follows: +First, **goodness-of-fit** tests are used when comparing *observed* data to *expected* data. For example, this could be used to determine if respondent demographics (the observed data in the sample) match known population information (the expected data). In this case, we can set up the hypothesis test as follows: - $H_0: p_1 = \pi_1, ~ p_2 = \pi_2, ~ ..., ~ p_k = \pi_k$ where $p_i$ is the observed proportion for category $i$, $\pi_i$ is expected proportion for category $i$, and $k$ is the number of categories - $H_A:$ at least one level of $p_i$ does not match $\pi_i$ @@ -349,7 +373,7 @@ The arguments are: * `design`: Survey design object * ...: Other arguments to pass on, such as `na.rm` -Based on the order of the arguments, we again must use the dot `(.)` notation if we pipe in the survey design object or explicitly name the arguments (as described at the beginning of this chapter). For the goodness of fit tests, the formula will be a single variable `formula = ~var` as we compare the observed data from this variable to the expected data. The expected probabilities are then entered in the `p` argument and need to be a vector of the same length as the number of categories in the variable. For example, if we want to know if the proportion of males and females matches a distribution of 30/70, then the sex variable (with two categories) would be used `formula = ~SEX`, and the proportions would be included as `p = c(.3, .7)`. It is important to note that the variable entered into the formula should be formatted as either a factor or a character. The examples below provide more detail and tips on how to make sure the levels match up correctly. +Based on the order of the arguments, we again must use the dot `(.)` notation if we pipe in the survey design object or explicitly name the arguments as described in Section \@ref(dot-notation). For the goodness of fit tests, the formula will be a single variable `formula = ~var` as we compare the observed data from this variable to the expected data. The expected probabilities are then entered in the `p` argument and need to be a vector of the same length as the number of categories in the variable. For example, if we want to know if the proportion of males and females matches a distribution of 30/70, then the sex variable (with two categories) would be used `formula = ~SEX`, and the proportions would be included as `p = c(.3, .7)`. It is important to note that the variable entered into the formula should be formatted as either a factor or a character. The examples below provide more detail and tips on how to make sure the levels match up correctly. For tests of homogeneity and independence, the `svychisq()` function should be used. The syntax is as follows: @@ -384,7 +408,7 @@ Let's walk through a few examples using the ANES data. #### Example 1: Goodness of Fit Test {.unnumbered #stattest-chi-ex1} -ANES asked respondents about their highest education level.^[What is the highest level of school you have completed or the highest degree you have received?] 
Based on the data from the 2020 American Community Survey (ACS) 5-year estimates^[Data was pulled from data.census.gov using the S1501 Education Attainment 2020: ACS 5-Year Estimates Subject Tables], the education distribution of those 18+ in the U.S. is as follows: +ANES asked respondents about their highest education level.^[What is the highest level of school you have completed or the highest degree you have received?] Based on the data from the 2020 American Community Survey (ACS) 5-year estimates^[Data was pulled from data.census.gov using the S1501 Education Attainment 2020: ACS 5-Year Estimates Subject Tables], the education distribution of those aged 18+ in the United States (among the 50 states and District of Columbia) is as follows: - 11% had less than High School degree - 27% had a High School degree @@ -406,7 +430,7 @@ anes_des %>% summarize(p = survey_mean()) ``` -Based on this output, we can see that we have different levels than the ACS data provides. Specifically, the education data from ANES has two levels for Bachelor's Degree or Higher (Bachelor's and Graduate), so these two categories need to be collapsed into a single category to match the ACS data. For this, we can use the {forcats} package from the tidyverse. The package's `fct_collapse()` function helps us create a new variable by collapsing categories into a single one. Then, we will use the `svygofchisq()` function to compare the ANES data to the ACS data: +Based on this output, we can see that we have different levels than the ACS data provides. Specifically, the education data from ANES has two levels for Bachelor's Degree or Higher (Bachelor's and Graduate), so these two categories need to be collapsed into a single category to match the ACS data. For this, among other methods, we can use the {forcats} package from the tidyverse. The package's `fct_collapse()` function helps us create a new variable by collapsing categories into a single one. Then, we will use the `svygofchisq()` function to compare the ANES data to the ACS data where we specify the updated design object, the formula using the collapsed education variable, the ACS estimates for education levels as p, and removing NA values. ```{r} #| label: stattest-chi-ex1 @@ -432,7 +456,7 @@ chi_ex1 <- anes_des_educ %>% chi_ex1 ``` -The output from the `svygofchisq()` indicates that at least one proportion from ANES does not match the ACS data ($\chi^2 =$ `r chi_ex1$statistic`; $p-value =$ `r pretty_p_value(chi_ex1[["p.value"]])`). To get a better idea of the differences, we can use the `expected` output along with `survey_mean()` to create a comparison table: +The output from the `svygofchisq()` indicates that at least one proportion from ANES does not match the ACS data ($\chi^2 =$ `r prettyNum(chi_ex1$statistic, big.mark=",")`; p-value `r pretty_p_value(chi_ex1[["p.value"]])`). To get a better idea of the differences, we can use the `expected` output along with `survey_mean()` to create a comparison table: ```{r} #| label: stattest-chi-ex1-table @@ -447,12 +471,13 @@ ex1_table <- anes_des_educ %>% ex1_table ``` -This output includes our expected proportions from the ACS that we provided the `svygofchisq()` function along with the output of the observed proportions and their confidence intervals. This table shows that the "High school" and "Post HS" categories have nearly identical proportions but that the other two categories are slightly different. 
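+One way to make those differences explicit is the small sketch below (not part of the original chapter, and assuming `ex1_table` contains the `Education`, `Expected`, and `Observed` columns used by the plotting code that follows), which computes the gap between the observed and expected proportions:
+
+```r
+# Difference between the observed (ANES) and expected (ACS) proportions,
+# sorted so the largest absolute gaps appear first
+ex1_table %>%
+  mutate(Difference = Observed - Expected) %>%
+  arrange(desc(abs(Difference)))
+```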
Looking at the confidence intervals, we can see that the ANES data skews to include fewer people in the "Less than HS" category and more people in the "Bachelor or Higher" category. This may be easier to see if we plot this (see Figure \@ref(fig:stattest-chi-ex1-graph)). +This output includes our expected proportions from the ACS that we provided the `svygofchisq()` function along with the output of the observed proportions and their confidence intervals. This table shows that the "High school" and "Post HS" categories have nearly identical proportions but that the other two categories are slightly different. Looking at the confidence intervals, we can see that the ANES data skews to include fewer people in the "Less than HS" category and more people in the "Bachelor or Higher" category. This may be easier to see if we plot this. The code below uses the tabular output to create Figure \@ref(fig:stattest-chi-ex1-graph). ```{r} #| label: stattest-chi-ex1-graph #| fig.cap: Expected and observed proportions of education, showing the confidence intervals for the expected proportions and whether the observed proportions lie within them. #| fig.alt: Expected and observed proportions of education, showing the confidence intervals for the expected proportions and whether the observed proportions lie within them. The x-axis has labels 'Less than HS', 'High school', 'Post HS', and 'Bachelor or Higher'. The only ones where expected proportion is outside of the intervals is 'Less than HS' and 'Bachelor or Higher'. + ex1_table %>% pivot_longer( cols = c("Expected", "Observed"), @@ -461,14 +486,15 @@ ex1_table %>% ) %>% mutate( Observed_low = if_else(Names == "Observed", Observed_low, NA_real_), - Observed_upp = if_else(Names == "Observed", Observed_upp, NA_real_) + Observed_upp = if_else(Names == "Observed", Observed_upp, NA_real_), + Names = if_else(Names == "Observed", "ANES (observed)", "ACS (expected)") ) %>% ggplot(aes(x = Education, y = Proportion, color = Names)) + geom_point(alpha = 0.75, size = 2) + geom_errorbar(aes(ymin = Observed_low, ymax = Observed_upp), width = 0.25) + theme_bw() + scale_color_manual(name = "Type", values = book_colors[c(4, 1)]) + - theme(legend.position = "none") + theme(legend.position = "bottom", legend.title=element_blank()) ``` #### Example 2: Test of Independence {.unnumbered #stattest-chi-ex2} @@ -505,7 +531,7 @@ The output from `svychisq()` indicates that the distribution of people's trust i chi_ex2$observed ``` -However, as researchers, we often want to know about the proportions and not just the respondent counts from the survey. There are a couple of different ways that we can do this. The first is using the counts from `chi_ex2$observed` to calculate the proportion. We can then pivot the table to create a cross-tabulation similar to the counts table above. Adding `group_by()` to the code means that we are obtaining the proportions within each level of that variable. In this case, we are looking at the distribution of `TrustGovernment` for each level of `TrustPeople`. The resulting table is shown in Table \@ref(tab:stattest-chi-ex2-prop1-tab). +However, as researchers, we often want to know about the proportions and not just the respondent counts from the survey. There are a couple of different ways that we can do this. The first is using the counts from `chi_ex2$observed` to calculate the proportion. We can then pivot the table to create a cross-tabulation similar to the counts table above. 
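+As a quick check (an aside, not part of the original chapter, assuming the rows of `chi_ex2$observed` are `TrustGovernment` and the columns are `TrustPeople`, as in the counts table above), base R's `prop.table()` computes these within-column proportions directly from the weighted counts:
+
+```r
+# Proportions of TrustGovernment within each level of TrustPeople,
+# computed from the weighted observed counts returned by svychisq()
+prop.table(chi_ex2$observed, margin = 2)
+```
+
+The tidyverse approach shown next keeps the result in a tibble, which is easier to reshape for presentation.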
Adding `group_by()` to the code means that we are obtaining the proportions within each level of that variable. In this case, we are looking at the distribution of `TrustGovernment` for each level of `TrustPeople`. The resulting table is shown in Table \@ref(tab:stattest-chi-ex2-prop1-tab) and in Chapter \@ref(c08-communicating-results), we will discuss more on how to make publication-quality tables like this. ```{r} #| label: stattest-chi-ex2-prop1 @@ -531,7 +557,7 @@ chi_ex2_table<-chi_ex2$observed %>% chi_ex2_table ``` -(ref:stattest-chi-ex2-prop1-tab) Estimates of proportion of people by levels of trust in people and government, ANES 2020 +(ref:stattest-chi-ex2-prop1-tab) Proportion of adults in the U.S. by levels of trust in people and government, ANES 2020 ```{r} #| label: stattest-chi-ex2-prop1-tab @@ -542,6 +568,8 @@ chi_ex2_table %>% print_gt_book(knitr::opts_current$get()[["label"]]) ``` +In Table \@ref(tab:stattest-chi-ex2-prop1-tab), each column sums to 1. For example, we can say that it is estimated that of people who always trust in people, `r round(chi_ex2$observed[1,1]/sum(chi_ex2$observed[,1])*100, 1)`% also always trust in government based on the top-left cell but `r round(chi_ex2$observed[5,1]/sum(chi_ex2$observed[,1])*100, 1)`% never trust in government. + The second option is to use `group_by()` and `survey_mean()` functions to calculate the proportions from the ANES design object. A reminder that with more than one variable listed in the `group_by()` statement, the proportions are within the first variable listed. As mentioned above, we are looking at the distribution of `TrustGovernment` for each level of `TrustPeople`. @@ -571,7 +599,7 @@ chi_ex2_obs_table<-chi_ex2_obs %>% chi_ex2_obs_table ``` -(ref:stattest-chi-ex2-prop2-tab) Estimates of proportion of people by levels of trust in people and government with confidence intervals, ANES 2020 +(ref:stattest-chi-ex2-prop2-tab) Proportion of adults in the U.S. by levels of trust in people and government with confidence intervals, ANES 2020 ```{r} #| label: stattest-chi-ex2-prop2-tab @@ -582,17 +610,20 @@ chi_ex2_obs_table %>% print_gt_book(knitr::opts_current$get()[["label"]]) ``` -Both methods produce the same output as the `svychisq()` function does account for the survey design. However, calculating the proportions directly from the design object means we can also obtain the variance information. In this case, the table output displays the survey estimate followed by the confidence intervals. Based on the output, we can see that of those who never trust people, 50.3% also never trust the government, while the proportions of never trusting the government are much lower for each of the other levels of trusting people. +Both methods produce the same output as the `svychisq()` function does account for the survey design. However, calculating the proportions directly from the design object means we can also obtain the variance information. In this case, the table output displays the survey estimate followed by the confidence intervals. Based on the output, we can see that of those who never trust people, `r round(chi_ex2$observed[5,5]/sum(chi_ex2$observed[,5])*100, 1)`% also never trust the government, while the proportions of never trusting the government are much lower for each of the other levels of trusting people. -We may find it easier to look at these proportions graphically. We can use `ggplot()` and facets to provide an overview: +We may find it easier to look at these proportions graphically. 
We can use `ggplot()` and facets to provide an overview as shown below to create Figure \@ref(fig:stattest-chi-ex2-graph): ```{r} #| label: stattest-chi-ex2-graph -#| fig.cap: Estimates of proportion of people by levels of trust in people and government with confidence intervals, ANES 2020 -#| fig.alt: Estimates of proportion of people by levels of trust in people and government with confidence intervals, ANES 2020. This presents the same information as previous table in graphical form. +#| fig.cap: Proportion of adults in the U.S. by levels of trust in people and government with confidence intervals, ANES 2020 +#| fig.alt: Proportion of adults in the U.S. by levels of trust in people and government with confidence intervals, ANES 2020. This presents the same information as the previous table in graphical form. + chi_ex2_obs %>% - mutate(TrustPeople=str_c("Trust in People:\n", TrustPeople)) %>% + mutate(TrustPeople= + fct_reorder(str_c("Trust in People:\n", TrustPeople), + order(TrustPeople))) %>% ggplot(aes(x = TrustGovernment, y = Observed, color = TrustGovernment)) + facet_wrap( ~ TrustPeople, ncol = 5) + geom_point() + @@ -609,7 +640,7 @@ chi_ex2_obs %>% #### Example 3: Test of Homogeneity {.unnumbered #stattest-chi-ex3} -Researchers and politicians often look at specific demographics each election cycle to understand how each group is leaning or voting toward candidates. The ANES data is post-election, but we can still see if there are differences in how specific demographic groups voted. +Researchers and politicians often look at specific demographics each election cycle to understand how each group is leaning or voting toward candidates. The ANES data are collected post-election, but we can still see if there are differences in how specific demographic groups voted. If we want to see if there is a difference in how each age group voted for the 2020 candidates, this would be a test of homogeneity, and we can set up the hypothesis as follows: @@ -621,7 +652,7 @@ H_0: p_{1_{Biden}} &= p_{1_{Trump}} = p_{1_{Other}},\\ p_{5_{Biden}} &= p_{5_{Trump}} = p_{5_{Other}},\\ p_{6_{Biden}} &= p_{6_{Trump}} = p_{6_{Other}} \end{align*} - where $p_{i_{Biden}}$ is the observed proportion of each age group ($i$) that voted for Biden, $p_{i_{Trump}}$ is the observed proportion of each age group ($i$) that voted for Trump, and $p_{i_{Other}}$ is the observed proportion of each age group ($i$) that voted for another candidate + where $p_{i_{Biden}}$ is the observed proportion of each age group ($i$) that voted for Joseph Biden, $p_{i_{Trump}}$ is the observed proportion of each age group ($i$) that voted for Donald Trump, and $p_{i_{Other}}$ is the observed proportion of each age group ($i$) that voted for another candidate - $H_A:$ at least one category of $p_{i_{Biden}}$ does not match $p_{i_{Trump}}$ or $p_{i_{Other}}$ @@ -678,11 +709,11 @@ chi_ex3_obs_table %>% print_gt_book(knitr::opts_current$get()[["label"]]) ``` -We can see that the age group distribution was younger for Biden and other candidates and older for Trump. For example, of those who voted for Biden, 20.4% were in the 18-29 age group, compared to only 11.4% of those who voted for Trump were in that age group. On the other side, 23.4% of those who voted for Trump were in the 50-59 age group compared to only 15.4% of those who voted for Biden. +We can see that the age group distribution that voted for Biden and other candidates was younger than those that voted for Trump. 
For example, of those who voted for Biden, 20.4% were in the 18-29 age group, while only 11.4% of those who voted for Trump were in that age group. On the other side, 23.4% of those who voted for Trump were in the 50-59 age group compared to only 15.4% of those who voted for Biden.

 ## Exercises {#stattest-exercises}

-The exercises use the design objects `anes_des` and `recs_des` as provided in the Prerequisites box in the beginning of the chapter. Here are some exercises for practicing conducting t-tests using `svyttest()`:
+The exercises use the design objects `anes_des` and `recs_des` as provided in the Prerequisites box in the [beginning of the chapter](#c06-statistical-testing). Here are some exercises for practicing conducting t-tests using `svyttest()`:

 1. Using the RECS data, do more than 50% of U.S. households use AC (`ACUsed`)?

@@ -710,7 +741,7 @@ ttest_solution2 <- recs_des %>%
 ttest_solution2
 ```

-3. Using the ANES data, does the average age (`Age`) of those who voted for Biden in 2020 (`VotedPres2020_selection`) differ from those who voted for another candidate?
+3. Using the ANES data, does the average age (`Age`) of those who voted for Joseph Biden in 2020 (`VotedPres2020_selection`) differ from that of those who voted for another candidate?

 ```{r}
 #| label: stattest-ttest-solution3