STA258 Lecture 10

Completed Notes Status

Central objective: Build and interpret Confidence Intervals for differences (two means, paired means, two proportions, and two variances) while tracking assumptions and practical meaning.
Key concepts:
- Two-Sample t Confidence Interval (Equal Variances):
  - Use pooled variance $S_{p}^{2}$ when assuming $σ_{1}^{2} = σ_{2}^{2}$ and independent normal samples.
  - Degrees of freedom: $ν = n_{1} + n_{2} - 2$ ; interpret sign of CI for direction of mean difference.
- Welch's t-test:
  - Use unpooled standard error when $σ_{1}^{2} \neq σ_{2}^{2}$ (or equality is doubtful).
  - Use Welch–Satterthwaite degrees of freedom; conservative alternative is $ν = min (n_{1} - 1, n_{2} - 1)$ .
- Two-Proportion Z Confidence Interval:
  - Estimate $p_{1} - p_{2}$ with ${\hat{p}}_{1} - {\hat{p}}_{2}$ and use a $z^{⋆}$ critical value when large-sample conditions hold.
  - Always align the algebraic sign with the verbal claim (which group is "1" vs "2").
Connections:
- "CI excludes 0" is a statistical signal of non-zero difference, while distance from 0 (and units) drives Practical vs Statistical Significance.
- Assumption checks (e.g., Normal Q-Q Plot) protect inference validity for small-sample mean procedures.

#tk: "How sure can we be that direct is better than broker?" (and "95/100 times we will have a better return directly.")
- Answer:
  - A $95 %$ CI does not mean there is a $95 %$ chance that $μ_{d} > μ_{b}$ ; the CI is a procedure that would cover the fixed parameter $μ_{d} - μ_{b}$ in $95 %$ of repeated samples.
  - To quantify evidence for "direct is better," use a one-sided framing: test $H_{0} : μ_{d} - μ_{b} \leq 0$ vs $H_{A} : μ_{d} - μ_{b} > 0$ (or report a one-sided CI / p-value), and also assess whether the lower CI bound is meaningfully above 0 in domain units.
  - The phrase "95/100 investors" is about individual outcomes (a probability for a random investor beating another), which is not what a CI for mean difference answers; that would need a distributional model for individual returns, not only mean parameters.

Remember/Understand:
- What assumptions justify using the pooled-variance two-sample t CI (and what is the df)?
- In a CI for $μ_{1} - μ_{2}$ , what does it mean if 0 is inside the interval vs outside?
- Why do difference-of-proportions CIs typically use $z^{⋆}$ instead of $t^{⋆}$ ?
Apply/Analyze:
- Compute a $95 %$ CI for $μ_{1} - μ_{2}$ using pooled variance, given $n_{1}, n_{2}, {\bar{x}}_{1}, {\bar{x}}_{2}, s_{1}^{2}, s_{2}^{2}$ .
- For a two-proportion dataset, build a CI for $p_{1} - p_{2}$ and write a one-sentence interpretation that correctly matches the sign.
- For paired data, compute differences $d_{i}$ and build a CI for $μ_{d}$ ; explain why pairing changes the analysis.
Evaluate/Create:
- Given two groups with unequal variances and unequal sample sizes, decide between pooled t, Welch, or a conservative df approach, and justify your choice using assumptions and consequences.

Confidence Intervals:
- Why it's challenging: A CI is often misread as a probability statement about the parameter, or as a statement about individual outcomes.
- Study strategy: Write the "repeated sampling" interpretation verbatim, then practise translating CI endpoints into domain-specific meaning and units.
Welch's t-test:
- Why it's challenging: The Welch df formula is algebra-heavy and easy to misapply (variance vs standard deviation, $n$ placement).
- Study strategy: Memorize the structure (variance-over- $n$ terms), then verify with R output on a small synthetic example.

Immediate review actions:
- Review all [COMPLETED] insertions and verify with course materials
- Address all #tk items (if any)
- Re-read Lecture Summary and confirm key links
Practice and application:
- Answer Remember/Understand questions
- Answer Apply/Analyze questions
- Attempt Evaluate/Create questions
Deep dive study:
- Focus on Challenging Concepts (above)
- Extend atomic notes only where the new lecture adds content
Verification and integration:
- Cross-reference notes with textbook/slides
- Identify remaining gaps
- Link this lecture to prior/future lecture notes (lecture notes only; no MOCs)