A Binomial Distribution is a discrete probability distribution used to model the number of 'successes' in a fixed number of trials. It is denoted as , where is the discrete random variable representing the count of successes.
The parameter represents the fixed number of trials, while represents the constant probability of success in any single trial. The probability of failure is denoted as , where .
The random variable can take any integer value from to . It is impossible for to be negative or to exceed the total number of trials .
The probability of achieving exactly successes is calculated using the Probability Mass Function (PMF):
The term is the binomial coefficient, calculated as . it represents the number of different ways (paths) to arrange successes and failures.
The Expected Value (Mean) of the distribution is given by , representing the average number of successes expected over many repetitions.
The Variance is calculated as , which measures the spread of the distribution. The standard deviation is simply the square root of the variance: .
The shape of a binomial distribution is determined primarily by the value of . When , the distribution is perfectly symmetrical around the mean.
If , the distribution is positively skewed (skewed to the right), meaning the 'tail' of the graph extends toward the higher values of .
If , the distribution is negatively skewed (skewed to the left), with the 'tail' extending toward the lower values of .
As the number of trials increases, the distribution tends to become more bell-shaped and symmetrical, regardless of the initial value of .
| Feature | Binomial Distribution | Simple Probability (Bernoulli) |
|---|---|---|
| Number of Trials | Multiple fixed trials () | Single trial () |
| Outcome Variable | Count of successes | Occurrence of success/failure |
| Complexity | Requires binomial coefficients | Simple or |
Check the Inequality: Always identify if the question asks for 'at most' (), 'fewer than' (), 'at least' (), or 'more than' (). For discrete distributions, is .
The Complement Rule: If asked to find , it is much faster to calculate than to sum all probabilities from to .
Parameter Identification: Before calculating, explicitly write down , , and . This prevents simple substitution errors in the formula.
Sanity Check: Ensure your calculated mean sits near the peak of your distribution and that all individual probabilities are between and .