Random Sampling is the foundational method used to ensure that every individual in a population has an equal chance of being measured, thereby reducing selection bias.
Sample Size Selection is critical; larger samples provide more reliable data and reduce the impact of anomalous individuals on the final statistical results.
Standard Deviation (SD) is a measure of the spread of data around the mean, calculated using the formula: where is an individual value, is the mean, and is the sample size.
Data Interpretation: A small standard deviation indicates that the data points are clustered closely around the mean, suggesting low variation, while a large SD indicates high variation.
| Feature | Continuous Variation | Discontinuous Variation |
|---|---|---|
| Data Type | Quantitative (measurable) | Qualitative (categorical) |
| Genetic Basis | Polygenic (many genes) | Monogenic (one or few genes) |
| Environment | Significant influence | Little to no influence |
| Examples | Height, leaf length, mass | Blood group, seed shape |
| Graph Shape | Normal distribution curve | Bar chart (discrete bars) |
Check for Overlap: When comparing two means, always look at the standard deviation bars; if they overlap significantly, the difference between the means may not be statistically significant.
Precision in Units: Ensure that all measurements are taken using the same units and to the same level of decimal precision to avoid errors in calculating the mean.
Sample Size Justification: In exam questions, always justify the use of a large sample size by stating it 'increases reliability' and 'allows for a statistical test to be performed'.
Sanity Check: After calculating a mean, verify that it falls within the range of your raw data; if the mean is higher than your highest value or lower than your lowest, a calculation error has occurred.
Confusing SD with Range: Students often mistake the range (highest minus lowest) for standard deviation. While range only considers two points, SD considers every data point in the set.
Assuming Genetic Only: A common error is ignoring the role of the environment in continuous variation; remember that identical genotypes can produce different phenotypes in different environments.
Biased Sampling: Using a 'convenience sample' (e.g., only measuring plants near a path) leads to results that do not accurately represent the whole population.