The mean of the sampling distribution of is exactly equal to the population mean . This indicates that the sample mean is an unbiased estimator of the population parameter.
The variability of the sampling distribution is measured by the standard error, calculated as . This formula demonstrates that as the sample size increases, the spread of the sample means decreases, leading to more precise estimates.
The CLT explains why the normal distribution is so prevalent in nature and social sciences. Since many observed variables are the sum or average of many independent factors, they naturally tend toward a normal distribution.
To calculate probabilities for a sample mean using the CLT, you must first verify the normality condition. If the population is not normal, you must ensure ; if the population is already normal, the sampling distribution is normal regardless of .
Standardize the sample mean by calculating the z-score using the formula: . This z-score represents how many standard errors the sample mean is away from the population mean.
Once the z-score is obtained, use standard normal distribution tables or technology to find the area under the curve. This area represents the probability of obtaining a sample mean as extreme as, or more extreme than, the observed value.
It is critical to distinguish between the population distribution and the sampling distribution. The population distribution describes individual data points, while the sampling distribution describes the behavior of a statistic (like the mean) across many samples.
| Feature | Population Distribution | Sampling Distribution of |
|---|---|---|
| Shape | Any (Skewed, Uniform, etc.) | Approximately Normal (if ) |
| Center | ||
| Spread |
The CLT is only required when the population distribution is unknown or non-normal. If the population is normally distributed, the sampling distribution of the mean is perfectly normal for any sample size, even .
Always explicitly state and check the condition before performing calculations. Examiners often award marks for identifying that the Central Limit Theorem allows the use of normal approximation methods.
Be careful not to confuse the population standard deviation with the standard error . A common mistake is forgetting to divide by the square root of the sample size when calculating z-scores for means.
When a problem asks for the probability of a single individual value, use the population standard deviation . When it asks for the probability of a sample mean, you must use the standard error .