In a Positively Skewed distribution: . The mean is the largest value because it is pulled to the right by high-value outliers.
In a Negatively Skewed distribution: . The mean is the smallest value because it is pulled to the left by low-value outliers.
Skewness can be determined by comparing the distances between the quartiles ().
Positive Skew: The median () is closer to the lower quartile (). Mathematically: .
Negative Skew: The median () is closer to the upper quartile (). Mathematically: .
| Feature | Positive Skew | Negative Skew |
|---|---|---|
| Tail Direction | Right (towards positive infinity) | Left (towards negative infinity) |
| Mean vs Median | ||
| Box Plot Visual | Median is on the left side of the box | Median is on the right side of the box |
| Data Concentration | Most data is at the lower end | Most data is at the higher end |
Visual Check: Always look for the 'tail' of the distribution. Students often mistake the 'hump' for the skew direction; remember that skewness is where the data is sparse, not where it is clustered.
Quartile Calculation: If provided with numerical values for and , calculate the differences and explicitly to justify your conclusion about skewness.
Contextual Analysis: When comparing two datasets, use skewness to describe the 'nature' of the data. For example, a positive skew in exam scores suggests the test was difficult, as most students scored low.
Consistency Check: Ensure your findings from the mean/median relationship match your findings from the box plot; they should always point to the same type of skew.