The Range is the simplest measure of spread, calculated as the difference between the maximum and minimum values in a dataset: . While easy to compute, it is highly sensitive to outliers and does not describe the distribution of the middle values.
The Interquartile Range (IQR) measures the spread of the middle 50% of the data. It is calculated by subtracting the first quartile () from the third quartile (): .
Unlike the range, the IQR is a resistant measure, meaning it is not influenced by extreme outliers. This makes it the preferred measure of spread for skewed distributions or datasets with known extreme values.
Standard Deviation measures the average distance of each data point from the mean. A small standard deviation indicates that the data points are generally close to the mean, while a large one indicates they are far from it.
The Variance is the average of the squared differences from the mean. It is the square of the standard deviation ( or ). While variance is useful in higher-level statistical proofs, standard deviation is more commonly used for description because it is expressed in the same units as the original data.
For a sample, the standard deviation () uses in the denominator to provide an unbiased estimate: For a population, the standard deviation () uses :
| Measure | Calculation | Resistance to Outliers | Best Use Case |
|---|---|---|---|
| Range | Low (Non-resistant) | Quick, rough estimate | |
| IQR | High (Resistant) | Skewed data / Outliers | |
| Std Dev | Avg distance from mean | Low (Non-resistant) | Symmetric data |
| Variance | Low (Non-resistant) | Mathematical modeling |
Identify the Distribution Shape: If a distribution is skewed or contains outliers, always prioritize the IQR over the standard deviation. Standard deviation is heavily pulled by extreme values, which can provide a misleading sense of the 'typical' spread.
Check the Units: Ensure that your final answer for standard deviation includes the correct units. If the question asks for variance, remember to square the standard deviation and adjust the units accordingly.
Outlier Detection: Use the rule to mathematically justify outliers. A value is an outlier if it is less than or greater than .
Sample vs. Population: Pay close attention to whether the data represents a whole population or just a sample. Using instead of for a sample standard deviation is a common error that results in lost marks.