Calculation Logic: The mean is found by summing the products of each value and its frequency, then dividing by the total number of observations. This accounts for the fact that some values appear more often than others.
Discrete Formula: The mean is calculated using the formula:
Grouped Data (Estimated Mean): For grouped data, the exact values are unknown, so the midpoint of each class interval is used as the representative value (). The resulting mean is an estimate because it assumes data is evenly distributed within each group.
The Mode: In a discrete table, the mode is the data value with the highest frequency. In grouped data, we identify the Modal Class, which is the interval with the highest frequency.
The Median Position: To find the median, calculate the position using , where is the total frequency. If is large, is often used.
Locating the Median: Use cumulative frequency to track the running total of frequencies row by row. The median is the value (or class) where the cumulative frequency first reaches or exceeds the median position.
| Feature | Discrete Table | Grouped Table |
|---|---|---|
| Data Value | Exact values are known | Only ranges (classes) are known |
| Mean Type | Exact Mean | Estimated Mean |
| Representative | The value itself () | The Midpoint of the class |
| Mode | Modal Value | Modal Class |
Frequency as the Mode: Students often mistake the highest frequency number for the mode. The mode is the data value associated with that frequency, not the frequency itself.
Range Errors: The range is the difference between the largest and smallest data values, not the difference between the largest and smallest frequencies.
Midpoint Accuracy: When calculating the midpoint for grouped data, ensure you use the lower and upper boundaries of the class interval correctly, especially if the classes have gaps.