How does the Central Limit Theorem differ from the Law of Large Numbers?

The Law of Large Numbers states that the sample mean will converge to the population mean as $n$ grows. The Central Limit Theorem describes the *shape* of the distribution of those sample means, specifically that it becomes normal.

When should you use the standard deviation $\sigma$ versus the standard error $\sigma/\sqrt{n}$?

Use $\sigma$ when calculating probabilities for a single individual observation from a population. Use the standard error $\sigma/\sqrt{n}$ when calculating probabilities for the average (mean) of a sample of size $n$.

What is the difference in requirements for the CLT if the population is already normally distributed?

If the population is normally distributed, the sampling distribution of the mean is normal for *any* sample size. The CLT is not needed to 'create' normality in this case, as the normality is inherited directly from the population.

What is the most common error when calculating a z-score for a sample mean?

The most common error is forgetting to divide the population standard deviation by $\sqrt{n}$. This leads to an overestimation of the spread and incorrect probability results.

Can the CLT be applied to a sample size of $n=15$ from a heavily skewed population?

No, the CLT generally requires $n \ge 30$ for the normal approximation to be valid for non-normal populations. With $n=15$, the sampling distribution may still reflect the skewness of the original population.

What happens if the independence assumption is violated in a CLT application?

If observations are dependent, the standard error formula $\sigma/\sqrt{n}$ becomes inaccurate. This usually results in a sampling distribution that is either more or less spread out than the theorem predicts.

Define the Central Limit Theorem.

It is a theorem stating that the sampling distribution of the sample mean approximates a normal distribution as the sample size becomes large ($n \ge 30$), regardless of the population's distribution shape.

What is the formula for the Standard Error of the Mean?

The formula is $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$, where $\sigma$ is the population standard deviation and $n$ is the sample size. It represents the standard deviation of the sampling distribution.

What is the z-score formula used for sample means under the CLT?

The formula is $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$. It measures how many standard errors the observed sample mean $\bar{x}$ is from the population mean $\mu$.

Why does the standard error decrease as the sample size $n$ increases?

As $n$ increases, individual variations and outliers have less impact on the overall average. This causes the sample means to cluster more closely around the true population mean, reducing the spread.

Library Podcasts

Courses

Referral & Rewards

Revision Notes

College Board

Statistics

Unit 5: Sampling Distributions

The Central Limit Theorem

Summary

The Central Limit Theorem (CLT) is a fundamental principle in statistics that describes the behavior of sample means. It establishes that, regardless of the shape of the underlying population distribution, the sampling distribution of the sample mean will approach a normal distribution as the sample size increases, provided the samples are independent and the sample size is sufficiently large.

1. Definition & Core Concepts

The Central Limit Theorem states that for a population with a mean $\mu$ and a standard deviation $\sigma$ , the sampling distribution of the sample mean $\bar{x}$ will be approximately normal if the sample size $n$ is large enough. This approximation holds true even if the original population distribution is skewed, uniform, or multi-peaked.

A sample size is generally considered large enough when $n \ge 30$ . As $n$ increases, the shape of the sampling distribution becomes increasingly symmetric and bell-shaped, centering more tightly around the population mean.

The theorem relies on the assumption that the individual observations in the sample are independent of one another. This is typically achieved through random sampling from a large population.

Diagram showing a skewed population distribution transforming into a normal sampling distribution for the mean as sample size increases.

2. Underlying Principles

3. Methods & Techniques

4. Key Distinctions

5. Exam Strategy & Tips

The Central Limit Theorem

Summary

1. Definition & Core Concepts

The theorem relies on the assumption that the individual observations in the sample are independent of one another. This is typically achieved through random sampling from a large population.

Diagram showing a skewed population distribution transforming into a normal sampling distribution for the mean as sample size increases.

2. Underlying Principles

The mean of the sampling distribution of $\bar{x}$ is exactly equal to the population mean $\mu$ . This indicates that the sample mean is an unbiased estimator of the population parameter.

The variability of the sampling distribution is measured by the standard error, calculated as $\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}$ . This formula demonstrates that as the sample size increases, the spread of the sample means decreases, leading to more precise estimates.

The CLT explains why the normal distribution is so prevalent in nature and social sciences. Since many observed variables are the sum or average of many independent factors, they naturally tend toward a normal distribution.

3. Methods & Techniques

To calculate probabilities for a sample mean using the CLT, you must first verify the normality condition. If the population is not normal, you must ensure $n \ge 30$ ; if the population is already normal, the sampling distribution is normal regardless of $n$ .

Standardize the sample mean by calculating the z-score using the formula: $z = \frac{\bar{x} - \mu}{\sigma / \sqrt{n}}$ . This z-score represents how many standard errors the sample mean is away from the population mean.

Once the z-score is obtained, use standard normal distribution tables or technology to find the area under the curve. This area represents the probability of obtaining a sample mean as extreme as, or more extreme than, the observed value.

4. Key Distinctions

It is critical to distinguish between the population distribution and the sampling distribution. The population distribution describes individual data points, while the sampling distribution describes the behavior of a statistic (like the mean) across many samples.

Feature	Population Distribution	Sampling Distribution of $\bar{x}$
Shape	Any (Skewed, Uniform, etc.)	Approximately Normal (if $n \ge 30$ )
Center	$\mu$	$\mu$
Spread	$\sigma$	$\frac{\sigma}{\sqrt{n}}$

The CLT is only required when the population distribution is unknown or non-normal. If the population is normally distributed, the sampling distribution of the mean is perfectly normal for any sample size, even $n=2$ .

5. Exam Strategy & Tips

Always explicitly state and check the $n \ge 30$ condition before performing calculations. Examiners often award marks for identifying that the Central Limit Theorem allows the use of normal approximation methods.

Be careful not to confuse the population standard deviation $\sigma$ with the standard error $\sigma/\sqrt{n}$ . A common mistake is forgetting to divide by the square root of the sample size when calculating z-scores for means.

When a problem asks for the probability of a single individual value, use the population standard deviation $\sigma$ . When it asks for the probability of a sample mean, you must use the standard error $\sigma/\sqrt{n}$ .

The mean of the sampling distribution of $\bar{x}$ is exactly equal to the population mean $\mu$ . This indicates that the sample mean is an unbiased estimator of the population parameter.

Feature	Population Distribution	Sampling Distribution of $\bar{x}$
Shape	Any (Skewed, Uniform, etc.)	Approximately Normal (if $n \ge 30$ )
Center	$\mu$	$\mu$
Spread	$\sigma$	$\frac{\sigma}{\sqrt{n}}$