Two-Sample Z-Test: This inferential procedure evaluates whether the difference between two observed sample proportions () provides enough evidence to reject the claim that the underlying population proportions ( and ) are equal.
Null Hypothesis (): The baseline assumption is that there is no difference between the populations, expressed as or .
Alternative Hypothesis (): This represents the research claim, which can be one-tailed ( or ) or two-tailed ().
Sampling Distribution: When samples are independent and large enough, the distribution of is approximately normal with a mean of and a standard deviation derived from the population parameters.
The Pooling Logic: Because the null hypothesis assumes , we combine the data from both samples to create a single, more precise estimate of the common population proportion, known as the pooled proportion ().
Standard Error (Pooled): The standard error for the test uses this pooled estimate to reflect the variability of the difference under the assumption that both samples come from populations with the same proportion.
| Feature | Hypothesis Test (Difference) | Confidence Interval (Difference) |
|---|---|---|
| Assumption | Assumes | Makes no assumption about equality |
| Standard Error | Uses Pooled Proportion () | Uses Individual Proportions () |
| Purpose | To decide if a difference exists | To estimate the size of the difference |
| Normality Check | Uses and | Uses and |
Define Parameters: Always start by explicitly defining and in the context of the problem (e.g., 'where is the true proportion of adults who...').
Check the Tail: For a two-tailed test (), remember to double the area found in one tail of the normal distribution to get the final p-value.
Pooled Proportion Check: Ensure you use the pooled proportion for both the standard error calculation and the Large Counts condition check.
Conclusion Phrasing: If , 'reject ' and state there is 'sufficient evidence' for . If , 'fail to reject ' and state there is 'insufficient evidence' for .
Mixing up X and n: A common error is confusing the number of successes () with the sample size () when calculating .
Incorrect Standard Error: Students often use the standard error formula for confidence intervals (using and separately) instead of the pooled formula required for the test.
Independence Violation: Forgetting to check the 10% rule when sampling without replacement can lead to an overestimation of precision.