The fundamental principle of a histogram is the Area-Frequency Relationship, expressed as . In most standard applications, the constant is equal to 1.
Because , and the width corresponds to the class width, the height must represent frequency density.
The formula for frequency density is:
This approach ensures that wider classes do not appear artificially 'larger' just because they cover more ground on the x-axis; their height is adjusted downward to maintain the correct area.
It is vital to distinguish histograms from bar charts to avoid fundamental errors in data interpretation.
| Feature | Bar Chart | Histogram |
|---|---|---|
| Data Type | Discrete or Qualitative | Grouped Continuous |
| Gaps | Gaps between bars | No gaps between bars |
| Y-Axis | Frequency | Frequency Density |
| Bar Width | Equal (usually) | Can be unequal |
| Meaning of Area | No specific meaning | Represents Frequency |
A frequency polygon is a line graph used to visualize the shape of a distribution, often overlaid on a histogram or used to compare multiple datasets.
To construct one, plot points at the midpoint of the top of each histogram bar. The midpoint is calculated as .
Connect these points with straight lines. Note that the lines should not be joined to the x-axis unless the frequency of the adjacent 'imaginary' class is zero; otherwise, it is an 'open' polygon.
Check the Y-Axis: Always verify if the vertical axis is labeled 'Frequency' or 'Frequency Density'. If it is a histogram with unequal widths, it MUST be density.
The Constant: In some exam problems, the area is proportional but not equal to frequency. Use a known bar to find in before calculating others.
Partial Bars: If asked for the frequency of a specific range within a bar (e.g., values between 12 and 15 in a 10-20 bar), calculate the area of that specific sub-section using the bar's height.
Boundary Adjustments: Look for 'hidden' gaps in table data. If ages are given as '10-14' and '15-19', the boundary is 14.5.