to go with Introduction to Modern Statistics by Çentinkaya-Rundel & Hardin. Math 58B - Introduction to Biostatistics.
p-value = probability of the observed data or more extreme given the null hypothesis is true.
See Canvas front page for anonymous survey / feedback for the class. Also, if you are looking for people to work with, you could contact me directly (non-anonymously!) so that I can connect you to people.
From StatKey applet: https://www.lock5stat.com/StatKey/
95% confident that the interval includes the sample proportion who believe that the global poverty rate has doubled.
If researchers were to select a new sample of 1005 adult Americans, then we’re 95% confident that between 56% and 62% of those people would answer “doubled” to the question.
What does “of the time” mean???
It means in repeated samples. That is, in 3% of all datasets we’d take from that exact same population, we would mistakenly reject the actually true hypothesis that p=0.47.
What does “of the time” mean???
It means in repeated samples. That is, in 97% of all datasets we’d take from that exact same population, we would capture the true population proportion of 0.47.
From the NYT, March 21, 2023, https://www.nytimes.com/2023/03/21/sports/basketball/tall-basketball-march-madness.html
The average W.N.B.A. player, at a shade taller than 6 feet, towers over the average American woman (5 feet 3.5 inches). American men who are between 6 feet and 6-2 — significantly taller than the 5-9 average — have about a five in a million chance of making the N.B.A., according to “The Sports Gene,” a 2013 book by David Epstein about the science of athletic performance. But if you hit the genetic lottery and happen to be 7 feet tall, your chances of landing in the N.B.A. are roughly one in six. (There are 38 players on active rosters who are 7 feet or taller, according to N.B.A. Advanced Stats; the average height of an N.B.A. player is 6 feet 6.5 inches.)
https://davidepstein.com/david-epstein-the-sports-gene/
Q: what is the most confusing part of understanding the difference between the variability of the weights and the variability of the average of the weights?
# A tibble: 3 × 5
term estimate std.error statistic p.value
<chr> <dbl> <dbl> <dbl> <dbl>
1 (Intercept) 198. 59.2 3.34 0.00584
2 volume 0.718 0.0615 11.7 0.0000000660
3 coverpb -184. 40.5 -4.55 0.000672
\(X_1\) = mortgage rate in %
\(X_2\) = 1 if SF, 0 if LA
\(Y\) = demand in $100 per capita
p-value = probability of observed data (\(b_1\)) or more extreme if \(H_0\) is true (\(\beta_1 = 0).\)
You must connect the variable to both the explanatory and response variable. For me, that is easiest to do with c. academic ability of the student.
we usually use b. \(\sum_{i=1}^n(Y_i - \overline{Y})^2\) (for calculus and historical reasons), but c. and e. are also totally reasonably answers.
we usually use b. \(\sum_{i=1}^n(Y_i - \hat{Y}_i)^2\) (for calculus and historical reasons), but c. and e. are also totally reasonably answers.
The worksheet solutions and clicker questions are on the main course website. The HW & Lab solutions are on Canvas under Files.
random samples
Describing random samples (of size n) from the population, the sampling distribution of the sample mean is normal if the sample size (n) is large enough.
I don’t know. 10 or 20 both seem like reasonable values. I don’t think 1, 50, or 100 are reasonable.
FALSE, we never think that the statistic will be the same as the parameter, regardless of the value of the parameter.
FALSE, we never think that the statistic will be the same as the parameter, regardless of the value of the parameter.
No. Because statistics vary from sample to sample, always.