STAT 120
If \(n \geq 30^*\), then
\[\bar{X} \approx N\left(\mu, \frac{\sigma}{\sqrt{n}}\right)\]
*Smaller sample sizes may be sufficient for symmetric distributions, and 30 may not be sufficient for very skewed distributions or distributions with high outliers
The standard deviation of the population is
\(\sigma\)
\(s\)
\(\frac{\sigma}{\sqrt{n}}\)
The standard deviation of the sample is
The standard deviation of the sample mean is
Check: Check the one sample size conditions for the CLT
Tests: Use t-ratios of the form \[t=\frac{\text { stat }-\text { null value }}{S E}\] P-values computed from a t-distribution with appropriate \(\mathrm{df}\) - pt(t, df= )
gives the area to the left of \(t\)
Confidence intervals: \(\mathrm{CI}\) of the form \[\text { stat } \pm t^{*} S E\] The \(t^{*}\) multiplier comes from a t-distribution with appropriate \(d f\) - qt(0.975, df= )
gives \(t^{*}\) for \(95 \%\) confidence
53 lakes were sampled, pH recorded
Can we use a t-test?
\[H_0: \mu = 7 \qquad H_A: \mu \neq 7\] Data: The average pH was \(\bar{x}=6.591\) with a standard deviation of \(s=1.288\).
\[H_0: \mu = 7 \qquad\qquad H_A: \mu \neq 7\] - p-value \(2 \times P(t < -2.31)\), or double left tail area below -2.31 - use t-distribution with \(df = 53-1= 52\)
Interpret: The p-value is 0.025. If the mean pH of all lakes is 7, then we would see a sample mean that is at least 2.31 SEs away from 7 about 2.5% of the time in samples of 53 lakes.
Conclusion: There is a statistically discernible difference between the observed mean pH of 6.591 and the hypothesized mean of 7 (t=-2.31, df=52, p=0.025).
t.test
in RHow different is the population mean from 7?
\[6.591 \pm 2.0066 \dfrac{1.288}{\sqrt{53}} = 6.591 \pm 0.355 = (6.236, 6.946)\] where \(t^*\) corresponds to 95% confidence (97.5th percentile):
We are 95% confident that the mean pH of all lakes is between 6.236 and 6.946 (slightly acidic)
Gribbles are small marine worms that bore through wood, and the enzyme they secrete may allow us to turn inedible wood and plant waste into biofuel
A sample of 50 gribbles finds an average length of \(3.1 \mathrm{~mm}\) with a standard deviation of \(0.72 \mathrm{~mm}\).
Give a \(90 \%\) confidence interval for the average length of gribbles.
A sample of 50 gribbles finds an average length of \(3.1 \mathrm{~mm}\) with a standard deviation of \(0.72 \mathrm{~mm}\). For a \(90 \%\) confidence interval for the average length of gribbles, what is \(t^*\) ?
a). 1.645 b). 1.677 c). 1.960 d). 1.690
A sample of 50 gribbles finds an average length of \(3.1 \mathrm{~mm}\) with a standard deviation of \(0.72 \mathrm{~mm}\). For a \(90 \%\) confidence interval for the average length of gribbles, what is the standard error?
a). 0.171 b). 0.720 c). 1.960 d). 0.102
A sample of 50 gribbles finds an average length of \(3.1 \mathrm{~mm}\) with a standard deviation of \(0.72 \mathrm{~mm}\). For a \(90 \%\) confidence interval for the average length of gribbles, what is the margin of error?
a). 0.171 b). 0.720 c). 1.960 d). 0.102
\(\begin{gathered}\text { statistic } \pm t^* \cdot S E \\ \bar{x} \pm t^* \cdot \frac{s}{\sqrt{n}} \\ 3.1 \pm 1.677 \cdot \frac{0.72}{\sqrt{50}} \\ 3.1 \pm 0.17 \\ (2.93,3.27)\end{gathered}\)
We are \(90 \%\) confident that the average length of gribbles is between 2.93 and \(3.27 \mathrm{~mm}\).
\[\mathrm{CI}: \bar{x} \pm t^* \cdot \frac{s}{\sqrt{n}}\]
For a single mean, what is the margin of error?