Measures of Dispersion: Unlike averages (mean, median) which describe the 'typical' value, range measures describe the 'spread' or 'variability' of the data. They indicate how much the data values deviate from the central tendency.
Sensitivity to Extremes: The standard range is highly sensitive to outliers because it only considers the two most extreme points. If a single data point is significantly larger or smaller than the rest, the range will expand drastically, potentially misrepresenting the overall data behavior.
Robustness of Internal Ranges: By focusing on internal boundaries like quartiles or percentiles, measures like the IQR provide a more stable view of the data's core. These measures effectively 'trim' the influence of outliers by excluding the top and bottom percentages of the data set.
Calculating the Range: Identify the maximum and minimum values in the data set and subtract the minimum from the maximum. The formula is simply .
Calculating the IQR: First, determine the lower quartile () and upper quartile (). The IQR is then found using the formula . This represents the width of the middle 50% of the data.
Calculating Interpercentile Ranges: To find the range between the and percentiles, calculate the values at those specific positions and subtract the smaller from the larger. For example, the 10th to 90th interpercentile range is .
Step-by-Step Process: Always begin by ordering the data from smallest to largest. Identify the required boundary values (extremes, quartiles, or percentiles) and perform the subtraction to find the specific range.
| Feature | Range | Interquartile Range (IQR) | Interpercentile Range |
|---|---|---|---|
| Data Used | All data (via extremes) | Middle 50% | Custom middle portion |
| Outlier Impact | High | Low (Robust) | Low (Robust) |
| Primary Use | Quick spread check | Box plots, identifying outliers | Detailed distribution analysis |
| Calculation |
Check the Units: Range measures always carry the same units as the original data. If the data is in 'seconds', the range must be reported in 'seconds'. Forgetting units is a common way to lose marks.
Identify Outliers: If a question asks why the IQR is better than the Range, look for extreme values in the data set. Explain that the Range is 'distorted' by these extremes, while the IQR remains 'representative' of the central data.
Verify Data Order: Before calculating any range, ensure the data is sorted. Calculating quartiles or percentiles on unsorted data is a fundamental error that leads to incorrect results.
Sanity Check: The IQR must always be less than or equal to the total Range. If your calculated IQR is larger than the Range, you have likely swapped your boundary values or made a calculation error.
Confusing Deciles and Percentiles: Remember that the decile corresponds to the percentile. For example, the 3rd decile () is exactly the same as the 30th percentile ().
Subtracting Positions instead of Values: A common mistake is subtracting the index or position of the data point (e.g., 'the 10th value minus the 2nd value') rather than the actual numerical values at those positions.
Ignoring the Median: While the median is not used in the range formulas themselves, it is the central point around which these ranges describe the spread. Understanding the median helps contextualize whether the spread is symmetrical or skewed.