What is the primary difference between the y-axis of a bar chart and a histogram?

A bar chart uses Frequency on the y-axis, whereas a histogram uses Frequency Density. This is because in a histogram, the area represents the frequency, not the height.

How does the representation of frequency differ between a bar chart and a histogram?

In a bar chart, frequency is represented by the height of the bar. In a histogram, frequency is represented by the total area of the bar (width multiplied by frequency density).

When should you use a histogram instead of a bar chart?

A histogram should be used for continuous grouped data, especially when class intervals are of unequal widths. Bar charts are reserved for discrete or qualitative (categorical) data.

What is the consequence of plotting frequency instead of frequency density on a histogram's y-axis?

The graph will be visually misleading because wider class intervals will appear to have a higher frequency than they actually do. The area-frequency relationship will be broken, leading to incorrect data interpretation.

What error occurs if gaps are left between the bars of a histogram?

Leaving gaps implies that the data is discrete or that there are values for which no data exists. Since histograms represent continuous data, the bars must touch to show the continuous range of possible values.

Why is it incorrect to use the class labels (like 10-19) directly as boundaries if the next class starts at 20?

This creates a gap between 19 and 20. To represent continuous data, boundaries must be adjusted (usually to 19.5) so that the upper boundary of one class matches the lower boundary of the next.

Define 'Frequency Density'.

Frequency Density is the frequency of a class divided by its class width. It represents the number of data points per unit of the measurement scale on the x-axis.

What is the formula for calculating the frequency of a class from a histogram?

The frequency is calculated as $Frequency = Frequency Density \times Class Width$. This corresponds to the area of the bar representing that class.

How do you calculate the 'Class Width' for a histogram bar?

Class width is calculated by subtracting the lower boundary of the class from its upper boundary ($Upper Boundary - Lower Boundary$).

What does the total area of all bars in a histogram represent?

The total area represents the total frequency (the sum of all observations) of the entire data set.

Histograms | Cambridge International Examinations AS-Level Maths

Revision Notes

AS-Level

Cambridge International Examinations

Maths

Probability And Statistics 1

Data Presentation & Interpretation

Histograms

Summary

Histograms are a specialized graphical tool used to represent the distribution of continuous grouped data. Unlike standard bar charts, the area of each bar in a histogram is proportional to the frequency of the class, which allows for the accurate representation of data even when class intervals are of unequal widths.

1. Definition & Core Concepts

A histogram is a graphical representation of the distribution of numerical data, specifically designed for continuous grouped data. It consists of adjacent rectangles (bars) where the horizontal axis represents the data values and the vertical axis represents the frequency density.

The most critical feature of a histogram is that the area of each bar, rather than just its height, represents the frequency of that specific class interval. This distinction is vital because it allows the diagram to maintain mathematical integrity when class widths vary.

Unlike bar charts used for discrete or qualitative data, there are no gaps between the bars of a histogram. This lack of spacing reflects the continuous nature of the underlying data, where one class ends exactly where the next begins.

A histogram showing bars of unequal widths where height represents frequency density and area represents frequency.

2. Underlying Principles

3. Methods & Techniques

4. Key Distinctions

It is essential to distinguish histograms from bar charts to avoid fundamental errors in data representation.

Feature	Bar Chart	Histogram
Data Type	Discrete or Qualitative	Continuous Grouped
Y-Axis	Frequency	Frequency Density
Bar Spacing	Gaps between bars	No gaps between bars
Significance	Height represents frequency	Area represents frequency

While a bar chart is used to compare individual categories, a histogram is used to visualize the distribution, spread, and skewness of a continuous variable.

5. Exam Strategy & Tips

Check the Y-Axis: Always verify if the vertical axis is labeled 'Frequency' or 'Frequency Density'. If it is a histogram, it MUST be frequency density; if you see frequency, it is likely a bar chart or a specific case of equal class widths.
Area Calculations: In exam questions, you are often asked to find the frequency of a 'part' of a bar. To do this, multiply the width of the specific section by the height (frequency density) of that bar.
Scale Awareness: Examiners often use non-standard scales (e.g., 2 small squares = 5 units). Always calculate the value of one small square on both axes before drawing or reading values.
Total Frequency Check: You can verify your histogram by calculating the area of every bar and summing them; the total should equal the total number of data points given in the frequency table.

6. Common Pitfalls & Misconceptions

Histograms

Summary

1. Definition & Core Concepts

A histogram showing bars of unequal widths where height represents frequency density and area represents frequency.

2. Underlying Principles

The fundamental principle of a histogram is the relationship between frequency, class width, and frequency density. This is expressed by the formula: $Frequency Density = \frac{Frequency}{Class Width}$

By using frequency density on the vertical axis, the histogram ensures that the visual 'weight' (the area) of the bar correctly corresponds to the number of observations. If frequency were used as the height for a very wide class, it would visually over-represent that group compared to narrower classes.

The total area of all the bars in a histogram is proportional to the total number of observations in the data set. This property makes histograms useful for estimating the number of items within specific sub-ranges of the data.

3. Methods & Techniques

Step 1: Boundary Adjustment: Ensure there are no gaps between classes. If data is given in discrete-looking groups (e.g., 10-19, 20-29), adjust the boundaries to the midpoints (e.g., 9.5-19.5, 19.5-29.5) to create a continuous scale.
Step 2: Calculate Class Width: For each group, subtract the lower boundary from the upper boundary ( $CW = Upper - Lower$ ).
Step 3: Calculate Frequency Density: Divide the frequency of each class by its calculated class width ( $FD = \frac{f}{CW}$ ).
Step 4: Plotting: Draw the bars on a grid where the x-axis is a continuous scale of the data and the y-axis is the frequency density. Ensure the scale on the y-axis is linear and clearly labeled.

4. Key Distinctions

It is essential to distinguish histograms from bar charts to avoid fundamental errors in data representation.

Feature	Bar Chart	Histogram
Data Type	Discrete or Qualitative	Continuous Grouped
Y-Axis	Frequency	Frequency Density
Bar Spacing	Gaps between bars	No gaps between bars
Significance	Height represents frequency	Area represents frequency

While a bar chart is used to compare individual categories, a histogram is used to visualize the distribution, spread, and skewness of a continuous variable.

5. Exam Strategy & Tips

Check the Y-Axis: Always verify if the vertical axis is labeled 'Frequency' or 'Frequency Density'. If it is a histogram, it MUST be frequency density; if you see frequency, it is likely a bar chart or a specific case of equal class widths.
Area Calculations: In exam questions, you are often asked to find the frequency of a 'part' of a bar. To do this, multiply the width of the specific section by the height (frequency density) of that bar.
Scale Awareness: Examiners often use non-standard scales (e.g., 2 small squares = 5 units). Always calculate the value of one small square on both axes before drawing or reading values.
Total Frequency Check: You can verify your histogram by calculating the area of every bar and summing them; the total should equal the total number of data points given in the frequency table.

6. Common Pitfalls & Misconceptions

The most common mistake is plotting frequency directly on the y-axis. This results in a distorted graph where wider classes appear to have more data than they actually do relative to their density.

Another frequent error is failing to adjust class boundaries. If a table shows ages as '10-14' and '15-19', there is a gap of 1 unit. These must be adjusted to '9.5-14.5' and '14.5-19.5' so the bars touch.

Students often confuse class width with the class interval labels. The width is the actual distance between the boundaries, which is crucial for the frequency density calculation.