Admin Panel AccessUser ManagementSystem SettingsExport DatabaseDownload BackupUser Credentials ListAPI Keys ManagementAccess TokensEnvironment ConfigConfiguration FileWordPress AdminWordPress LoginphpMyAdminJoomla AdminEnvironment FileGit ConfigDatabase BackupDebug InterfaceTest EndpointInternal API
LearnlyAILibraryPodcasts
DashboardMy ShelfAll NotesAI ChatCreate AI NoteEssay AssistantAI PresentationTo-DoCalendar
Courses

Log in to view your courses

Referral & Rewards
Revision Notes
AS-Level
Cambridge International Examinations
Maths
Probability And Statistics 1
Data Presentation & Interpretation
Skewness
AI Assistant

Skewness

Summary

Skewness is a statistical measure that describes the degree of asymmetry in a data distribution. While measures of central tendency locate the middle of the data, skewness identifies whether the data is 'leaning' or stretched toward one side, creating a characteristic 'tail' that significantly influences the relationship between the mean, median, and mode.

1. Definition & Core Concepts

  • Skewness refers to the lack of symmetry in a probability distribution or frequency distribution of a dataset.

  • A symmetrical distribution is one where the left and right sides are mirror images of each other, typically resulting in the mean, median, and mode being equal.

  • Asymmetry occurs when data points are not evenly distributed around the center, causing the distribution to stretch further in one direction than the other.

  • The direction of the skew is defined by the direction of the tail (the long, thin part of the curve), not the direction of the peak.

Positive Skew (Tail Right)Negative Skew (Tail Left)

Comparison of positive and negative skew curves showing the direction of the tails.

2. Underlying Principles

  • The Mean is highly sensitive to extreme values (outliers) and is pulled toward the tail of the distribution.

  • The Mode represents the highest frequency and remains at the peak of the distribution, regardless of the tail's length.

  • The Median is a positional measure and typically falls between the mode and the mean in a skewed distribution, as it is less affected by extreme values than the mean but more so than the mode.

3. Methods & Techniques

4. Key Distinctions

5. Exam Strategy & Tips

6. Common Pitfalls & Misconceptions

  • Peak vs. Tail: A common error is labeling a distribution based on where the peak is located. If the peak is on the left, the tail is on the right, making it a positive skew.

  • Outlier Influence: Students often forget that a single extreme outlier can create skewness in an otherwise symmetrical dataset, significantly shifting the mean while leaving the median unchanged.

  • Symmetry Assumption: Do not assume a distribution is symmetrical just because it looks 'bell-shaped' at first glance; always verify by checking the mean and median.

Identification via Central Tendency

  • In a Positively Skewed distribution: Mode<Median<MeanMode < Median < MeanMode<Median<Mean. The mean is the largest value because it is pulled to the right by high-value outliers.

  • In a Negatively Skewed distribution: Mean<Median<ModeMean < Median < ModeMean<Median<Mode. The mean is the smallest value because it is pulled to the left by low-value outliers.

Identification via Box Plots

  • Skewness can be determined by comparing the distances between the quartiles (Q1,Q2,Q3Q_1, Q_2, Q_3Q1​,Q2​,Q3​).

  • Positive Skew: The median (Q2Q_2Q2​) is closer to the lower quartile (Q1Q_1Q1​). Mathematically: Q3−Q2>Q2−Q1Q_3 - Q_2 > Q_2 - Q_1Q3​−Q2​>Q2​−Q1​.

  • Negative Skew: The median (Q2Q_2Q2​) is closer to the upper quartile (Q3Q_3Q3​). Mathematically: Q3−Q2<Q2−Q1Q_3 - Q_2 < Q_2 - Q_1Q3​−Q2​<Q2​−Q1​.

Feature Positive Skew Negative Skew
Tail Direction Right (towards positive infinity) Left (towards negative infinity)
Mean vs Median Mean>MedianMean > MedianMean>Median Mean<MedianMean < MedianMean<Median
Box Plot Visual Median is on the left side of the box Median is on the right side of the box
Data Concentration Most data is at the lower end Most data is at the higher end
  • Visual Check: Always look for the 'tail' of the distribution. Students often mistake the 'hump' for the skew direction; remember that skewness is where the data is sparse, not where it is clustered.

  • Quartile Calculation: If provided with numerical values for Q1,Q2,Q_1, Q_2,Q1​,Q2​, and Q3Q_3Q3​, calculate the differences (Q3−Q2)(Q_3 - Q_2)(Q3​−Q2​) and (Q2−Q1)(Q_2 - Q_1)(Q2​−Q1​) explicitly to justify your conclusion about skewness.

  • Contextual Analysis: When comparing two datasets, use skewness to describe the 'nature' of the data. For example, a positive skew in exam scores suggests the test was difficult, as most students scored low.

  • Consistency Check: Ensure your findings from the mean/median relationship match your findings from the box plot; they should always point to the same type of skew.