Finish last time’s slides
Two sample inference
Two sample setting
Setting: two independent samples
i.i.d from population with c.d.f , and
i.i.d from population with c.d.f
Parameter: now focus on some comparison between the two populations and
Alternative view
Setting: two independent samples
where is a binary grouping variable which indicates which population the observation came from:
Two views are equivalent
Depending on sampling scheme one view may seem more natural:
I sample 40 OSU graduate students and 20 OSU undergraduate students:
- = graduate student time to complete 1 mile run,
- = undergraduate student time to complete 1 mile run,
I sample 60 OSU students and record:
- = time to complete 1 mile run,
- = student’s level (0 = graduate, 1 = undergraduate),
In second view, if we condition on the counts in each group, inference is the same as first view.
Two sample inference for difference in population means
To compare population means: , , we might look at their difference:
(In alternative view: equivalent to )
- Estimate for
- Test for
- Confidence interval for
Difference in sample means
It seems reasonable to use:
as a good starting point for inference on .
Complete worksheet (Charlotte will provide)
Leads to two sample Z-test and intervals
Assume known population variances: .
Reference Distribution: If null hypothesis is true, then
Rejection Regions:
- , reject for
- , reject for
- , reject for
Leads to two sample Z-test and intervals
% Confidence interval for
Next time…
What if population variances aren’t known?