A Confidence Interval (CI) for the difference in proportions is a symmetric range centered at the observed difference between two sample proportions ().
The primary goal is to capture the true population difference () with a specified level of confidence, such as 95% or 99%.
The interval is constructed by taking the point estimate and adding or subtracting a Margin of Error (ME), which represents the maximum expected distance between the estimate and the true parameter.
The method relies on the Sampling Distribution of the Difference in Proportions, which describes how the difference between two sample proportions behaves across many repeated samples.
According to the Central Limit Theorem, if sample sizes are sufficiently large, this sampling distribution will be approximately normal with a mean equal to the true difference ().
The spread of this distribution is measured by the Standard Error (SE), which estimates the standard deviation of the sampling distribution using sample data: .
Step 1: Identify Parameters: Define and as the true proportions of the two populations being compared.
Step 2: Verify Conditions: Ensure the data meets the requirements for Randomness, Independence (10% rule), and Normality (Large Counts).
Step 3: Calculate the Point Estimate: Find the difference between the two sample proportions: .
Step 4: Determine the Critical Value (): Select the z-score corresponding to the desired confidence level (e.g., for 95% confidence).
Step 5: Construct the Interval: Apply the formula: .
| Feature | Confidence Interval | Hypothesis Test |
|---|---|---|
| Proportion Used | Individual sample proportions () | Combined/Pooled proportion () |
| Assumption | No prior assumption about the difference | Assumes the null hypothesis () is true |
| Purpose | To estimate the size of the difference | To determine if a difference exists at all |
Check for Zero: If the interval contains zero (e.g., to ), you cannot conclude there is a statistically significant difference between the populations at that confidence level.
Directionality: Always specify the direction of the subtraction (e.g., Group A minus Group B) to ensure the interpretation of positive or negative values is clear.
Condition Verification: When checking the Large Counts condition, you must verify and for both samples individually; failing to check all four values is a common way to lose marks.
Width Relationships: Remember that increasing the confidence level makes the interval wider, while increasing the sample sizes () makes the interval narrower.