Sample Space (): The set of all possible outcomes of a random experiment. For example, in a coin toss, the sample space is .
Event (): A subset of the sample space representing a specific outcome or group of outcomes. The probability of an event is calculated as , where is the number of favorable outcomes and is the total number of outcomes in the sample space.
Probability Scale: All probabilities exist on a closed interval . A probability of indicates an impossible event, while a probability of indicates a certain event.
Complementary Events: The probability that an event does NOT occur is denoted as . It is calculated using the identity .
Mean (): The arithmetic average of a dataset, calculated by summing all values and dividing by the count (). It is highly sensitive to outliers, which can pull the mean toward extreme values.
Median: The middle value of a dataset when ordered from least to greatest. It is a robust measure of center because it remains unaffected by extreme outliers.
Interquartile Range (IQR): A measure of statistical dispersion representing the range of the middle of the data. It is calculated as , where is the 75th percentile and is the 25th percentile.
Standard Deviation (): A measure of how much the data values deviate from the mean. A low standard deviation indicates that the data points tend to be close to the mean, while a high standard deviation indicates that the data points are spread out over a wider range.
Addition Rule (OR): For any two events and , the probability that at least one occurs is . If the events are mutually exclusive, .
Multiplication Rule (AND): For independent events, the probability of both occurring is . Independence means the occurrence of one event does not affect the probability of the other.
Conditional Probability: The probability of event occurring given that event has already occurred is denoted . This is essential for analyzing dependent events.
| Feature | Theoretical Probability | Experimental Probability |
|---|---|---|
| Basis | Mathematical logic and sample space | Observed results from trials |
| Formula | ||
| Accuracy | Exact value | Approaches theoretical value as trials increase (Law of Large Numbers) |
The 'Sum to One' Check: Always verify that the sum of probabilities for all mutually exclusive outcomes in a sample space equals exactly . If it doesn't, an outcome has been missed or a calculation is wrong.
Identifying Outliers: Use the rule. A value is typically considered an outlier if it is less than or greater than .
Contextual Reasoning: When asked to choose between mean and median, look at the data distribution. If the data is skewed or contains outliers, the median is usually the better representation of the 'typical' value.
Rounding Precision: In probability, avoid rounding intermediate steps. Keep values as fractions where possible to maintain exactness until the final answer.