How does the mean differ from the median when interpreting a data set?

The mean uses every value, so it acts as the balance point of the data, while the median depends only on the middle value (or the two middle values) once the data is ordered. As a result, the mean is more affected by extreme values, whereas the median is usually the more reliable measure when outliers are present.
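A quick way to see this is to add one extreme value to a small data set and recompute both measures. The sketch below uses made-up numbers (they are not from any exam question) and Python's standard `statistics` module.

```python
from statistics import mean, median

# Hypothetical data: five typical values, then the same set plus one extreme value.
values = [12, 14, 15, 16, 18]
with_outlier = values + [95]

mean_before = mean(values)             # 75 / 5
median_before = median(values)         # middle of the ordered list
mean_after = mean(with_outlier)        # 170 / 6
median_after = median(with_outlier)    # average of the two middle values, 15 and 16

print(f"mean:   {mean_before} -> {mean_after:.2f}")
print(f"median: {median_before} -> {median_after}")
```

In this illustration the mean moves from 15 to about 28.3, while the median only moves from 15 to 15.5.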

How does the interquartile range differ from the range as a measure of spread?

The range uses only the smallest and largest values, so it is very sensitive to extreme observations. The interquartile range uses the middle 50 percent of the data, which makes it a more stable measure when the data contains outliers.
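The difference can be illustrated by changing only the largest value in a data set. This is a minimal sketch with invented data. The helper `quartiles` is hypothetical and assumes one common convention, where $Q_1$ and $Q_3$ are the medians of the lower and upper halves and the middle value is excluded when the number of observations is odd. Other quartile conventions can give slightly different values.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, Q3) as the medians of the lower and upper halves.

    Assumed convention: when n is odd, the middle value belongs to neither half.
    """
    ordered = sorted(data)
    half = len(ordered) // 2
    return median(ordered[:half]), median(ordered[-half:])

def spread(data):
    """Return (range, interquartile range) for a list of numbers."""
    q1, q3 = quartiles(data)
    return max(data) - min(data), q3 - q1

typical = [3, 5, 6, 7, 8, 9, 11]
extreme = [3, 5, 6, 7, 8, 9, 40]   # same data, but the largest value is an outlier

print(spread(typical))   # range and IQR for the typical set
print(spread(extreme))   # range and IQR once the outlier is present
```

Here the range jumps from 8 to 37, but the interquartile range stays at 4 because the middle half of the data has not changed.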

When should you compare data sets using mean and standard deviation rather than median and interquartile range?

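Use mean and standard deviation when the data is roughly symmetrical and does not contain strong outliers. They work well together because both are calculated from every value in the data set, and the standard deviation measures spread about the mean itself. When the data is skewed or contains outliers, the median and interquartile range are the safer pair.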

What error occurs if you compare two data sets using only their means or medians?

You ignore how spread out the data is, so you may miss whether one set is much less consistent than the other. Two data sets can have similar centers but very different variability, which can change the overall interpretation.
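As an illustration, the two invented data sets below share the same mean but differ greatly in consistency. The sketch assumes the divide-by-$n$ form of standard deviation, which is what `statistics.pstdev` computes.

```python
from statistics import mean, pstdev

# Hypothetical data sets with the same centre but different variability.
set_a = [48, 49, 50, 51, 52]
set_b = [30, 40, 50, 60, 70]

mean_a, mean_b = mean(set_a), mean(set_b)
sd_a, sd_b = pstdev(set_a), pstdev(set_b)   # population standard deviation (divide by n)

print(f"A: mean = {mean_a}, sd = {sd_a:.2f}")
print(f"B: mean = {mean_b}, sd = {sd_b:.2f}")
```

Both means are 50, yet the standard deviation of B is ten times that of A, so a comparison based on the means alone would wrongly suggest the sets are alike.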

Why is it a mistake to assume the mean always gives the best description of the center?

The mean can be pulled upward or downward by extreme values because every observation contributes directly to it. If the data contains outliers, the mean may no longer represent a typical value well, so the median may be more appropriate.

Why is it risky to guess how the median changes after adding a new value?

The median depends on the ordered positions in the data, not just on whether the new value is large or small. Adding one observation can leave the middle unchanged in some cases or shift it in others, so the ordered list must be checked.
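The sketch below adds the same new value, 20, to two invented data sets that start with the same median. It is only an illustration of why the ordered list has to be checked each time.

```python
from statistics import median

# Two hypothetical data sets, both with median 8.
first = [3, 5, 8, 8, 12]
second = [3, 5, 8, 10, 12]

new_value = 20

first_after = median(sorted(first + [new_value]))    # middle pair is 8 and 8
second_after = median(sorted(second + [new_value]))  # middle pair is 8 and 10

print(median(first), "->", first_after)
print(median(second), "->", second_after)
```

The median of the first set stays at 8, while the median of the second set rises to 9, even though the value added was identical.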

What does a measure of location tell you about a data set?

A measure of location tells you where the data is centered or what a typical value looks like. It helps summarize the general level of the observations, but it does not show how variable those observations are.

What does the interquartile range measure?

The interquartile range measures the spread of the middle half of the data, calculated as $Q_3 - Q_1$. It is useful because it reduces the influence of extreme values and gives a robust sense of variability.

What is the formula for the mean, and what do its symbols represent?

The mean is given by $\bar{x} = \frac{\sum x}{n}$. Here $\sum x$ is the total of all observations and $n$ is the number of observations, so the formula divides the total evenly across the data set.
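As a short worked example with made-up values, take the observations 4, 7, 9, 10 and 15, so that $n = 5$:

$$\bar{x} = \frac{\sum x}{n} = \frac{4 + 7 + 9 + 10 + 15}{5} = \frac{45}{5} = 9$$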

How are variance and standard deviation related?

Variance is the mean of the squared deviations from the mean, and standard deviation is its square root. Because standard deviation is in the same units as the original data, it is usually easier to interpret in practice.
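The relationship can be checked step by step on a small invented data set. This sketch assumes the divide-by-$n$ definition of variance. The variable names are illustrative only.

```python
from math import sqrt

# Hypothetical data chosen so the arithmetic is easy to follow by hand.
data = [2, 4, 4, 4, 5, 5, 7, 9]

n = len(data)
x_bar = sum(data) / n                                 # mean: 40 / 8
variance = sum((x - x_bar) ** 2 for x in data) / n    # mean squared deviation: 32 / 8
std_dev = sqrt(variance)                              # square root of the variance

print(f"mean = {x_bar}, variance = {variance}, standard deviation = {std_dev}")
```

If the data were measured in centimetres, the variance would be in square centimetres, while the standard deviation would be back in centimetres.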

Interpreting Data

Summary

Interpreting data means using summary statistics to make justified statements about where data is centered, how variable it is, how reliable those summaries are, and how two data sets compare. Good interpretation is not just calculation: it requires choosing suitable measures, understanding how outliers and added or removed values affect them, and expressing conclusions in context.

1. Definition & Core Concepts

Interpreting data is the process of turning numerical summaries into meaningful conclusions about a distribution. Instead of stopping after calculating values such as the mean, median, range, or standard deviation, you explain what those values reveal about the typical size of observations and the spread or consistency of the data.
A measure of location describes where the data is centered. Common measures are the mean and median, and they answer questions such as what a typical value looks like or where the middle of the data lies.
A measure of spread describes how much the data varies around its center. Measures such as the range, interquartile range, variance, and standard deviation help distinguish between a tightly clustered data set and one with large variability.
Interpretation is strongest when you discuss both location and spread together. A data set with a larger average is not automatically preferable, because it may also be much less consistent than another set.
The meaning of "larger" or "smaller" depends on the real-world context. For example, a smaller central value may be better when measuring completion time, while a larger central value may be better when measuring scores or output.

Diagram showing that interpreting data requires combining a measure of location with a measure of spread.

2. Underlying Principles

3. Methods & Techniques

Flowchart showing how to choose mean and standard deviation for symmetrical data, or median and interquartile range when outliers are present, then write the conclusion in context.

4. Key Distinctions

5. Exam Strategy & Tips

6. Common Pitfalls & Misconceptions

7. Connections & Extensions