Comparing data sets means judging both a typical value and the amount of variation in each set, then linking those numerical comparisons to the real-world context. A good comparison uses an appropriate measure of average, an appropriate measure of spread, and a cautious conclusion that recognizes limits such as outliers, small samples, or bias. The key skill is not just calculating statistics, but choosing the right ones and explaining what they imply.
Range: but it depends only on the two extreme values. Because of that, it can change dramatically due to a single outlier and may exaggerate the true variability of most of the data.
IQR: where is the lower quartile and is the upper quartile. Since it ignores the most extreme 25 percent at each end, it is usually a more robust measure of spread when outliers are present.
Average statement: "Data set A has a higher than data set B, so A has a higher typical value." Spread statement: "Data set A has a smaller than data set B, so A is more consistent." This structure is effective because it separates the two statistical ideas clearly and shows that you understand what each one means.
The most important distinction is between center and spread. Center tells you what is typical, while spread tells you how variable the data is. A complete comparison usually needs one measure from each category.
Another key distinction is between sensitive and resistant statistics. The mean and range are sensitive to extreme values, whereas the median and IQR are more resistant. This matters because outliers can make a data set seem more extreme or more variable than most of its values actually are.
| Feature | Mean | Median | Mode | | --- | --- | --- | --- | | What it describes | Arithmetic average of all values | Middle value in order | Most frequent value | | Effect of outliers | Strongly affected | Usually not affected much | Often unaffected unless frequency changes | | Best use | Symmetric data without extreme values | Skewed data or data with outliers | Most common category or repeated value | | Limitation | Can be distorted by unusual values | Ignores exact size of most values | May be unclear or not unique |
| Feature | Range | IQR | | --- | --- | --- | | Formula | | | | Uses extremes? | Yes | No, focuses on middle 50 percent | | Outlier resistance | Low | High | | Best use | Quick overall spread | Typical spread when outliers may exist |
A smaller spread does not mean the values are smaller overall; it means they are less spread out. Likewise, a larger average does not automatically mean a better data set unless the context defines “better” that way. Good statistical reasoning depends on using language that matches what the numbers actually measure.