When comparing two datasets, you must comment on both a measure of location (to compare averages) and a measure of spread (to compare consistency).
Comparisons must always be made in context; for example, instead of saying 'Group A has a higher mean,' say 'Group A was faster on average than Group B.'
If comparing data represented by box plots, use the median and IQR for the comparison as these are the values explicitly shown by the box and whisker structure.
Ensure that the scales of the datasets are comparable before drawing conclusions about their relative spread.
Positive Skew: The distribution has a long tail to the right. Numerically, this is often indicated when or when .
Negative Skew: The distribution has a long tail to the left. This occurs when or when .
Symmetrical Distribution: The data is balanced around the center, where and the mean, median, and mode are approximately equal.
Context is Mandatory: Marks are frequently lost for providing purely numerical comparisons without relating them back to the real-world scenario described in the question.
Check for Outliers: Before choosing between mean and median, check if the question defines outliers. If outliers exist and are not cleaned, the median is almost always the better choice for interpretation.
Units: Always include units in your interpretation (e.g., seconds, kg, dollars) to ensure the comparison is physically meaningful.
Avoid Vague Terms: Use precise statistical language like 'more consistent' or 'greater variability' rather than 'better' or 'worse' unless the context defines what 'better' means.