The Binomial Distribution is a discrete probability distribution that describes the number of successes, , in a sequence of independent experiments. It is denoted by the notation , where represents the total number of trials and represents the constant probability of success in a single trial.
A discrete random variable is used because the outcomes are countable integers (e.g., 0, 1, 2 successes), rather than continuous values. The distribution allows us to calculate the exact probability of achieving a specific number of successes within the defined set of trials.
The complement of the success probability is the probability of failure, often denoted as , where . Together, and account for all possible outcomes of a single trial, ensuring the total probability space sums to 1.
The probability of obtaining exactly successes in trials is calculated using the Probability Mass Function (PMF). The formula is given by:
The term , known as the binomial coefficient, represents the number of different ways to arrange successes and failures. It is calculated as , which accounts for the fact that the order of successes within the sequence does not matter.
The term represents the probability of successes occurring, while represents the probability of the remaining trials being failures. Multiplying these components together yields the total probability for a specific number of successes.
It is vital to distinguish between Probability Density (PD) and Cumulative Distribution (CD). PD calculates the probability of an exact value, , whereas CD calculates the probability of a range of values, such as , by summing the individual probabilities from 0 up to .
The shape of the distribution is determined entirely by the value of . When , the distribution is perfectly symmetric; when , the distribution is positively skewed (tail to the right); and when , it is negatively skewed (tail to the left).
| Feature | |||
|---|---|---|---|
| Symmetry | Right-skewed | Perfectly Symmetric | Left-skewed |
| Peak (Mode) | Near the start (low ) | In the center | Near the end (high ) |
| Mean vs Median | Mean > Median | Mean = Median | Mean < Median |
Check the BINS: Before performing any calculations, explicitly state how the scenario meets the four binomial conditions. This is often a required step in multi-part exam questions and ensures you haven't misidentified the distribution type.
Identify n and p correctly: Always define what constitutes a 'success' in the context of the problem. Sometimes a question might give you the probability of failure, and you must subtract it from 1 to find before using the formula.
Use Cumulative Probabilities for Inequalities: For questions asking for 'at least' or 'more than' a certain number of successes, use the complement rule. For example, . This is much faster than calculating individual probabilities for every value from 3 to .
Sanity Check the Mean: Calculate the expected value . If your calculated probability for a value far from the mean is extremely high, you may have swapped and or used the wrong .