IB Mathematics AA – Topic 4: Statistics & Probability
Comprehensive Guide to Probability Distributions
Introduction to Probability Distributions
Probability distributions provide mathematical models for random phenomena, describing all possible values a random variable can take and the probabilities associated with each value. From manufacturing quality control and medical testing to financial modeling and weather forecasting, probability distributions enable quantitative predictions about uncertain outcomes.
Key concepts: A random variable assigns numerical values to outcomes of random experiments. Discrete random variables take specific separated values (like number of heads in coin flips), while continuous random variables can take any value in an interval (like heights or temperatures). The binomial distribution models fixed-trial success/failure experiments, while the normal distribution describes many naturally occurring phenomena with its characteristic bell-shaped curve.
Why distributions matter: Rather than treating every problem uniquely, probability distributions provide standard models that apply across countless situations. Understanding that exam scores often follow normal distributions, manufacturing defects follow binomial patterns, and customer arrivals follow Poisson distributions enables professionals to make accurate predictions, set appropriate standards, and quantify risks systematically.
In this guide: We'll master random variables and their probability distributions, understand expected value and variance as measures of central tendency and spread, develop complete proficiency with the binomial distribution for fixed-trial experiments, explore the normal distribution with standardization and z-scores, use tables and GDC to find probabilities, and solve real-world applications—all essential skills for IB exam success.
1. Random Variables and Probability Distributions
What is a Random Variable?
Random Variable Definition:
A random variable (usually denoted by capital letters like \(X\), \(Y\), \(Z\)) is a variable whose value depends on the outcome of a random experiment.
- Discrete random variable: Takes specific, separate values (countable)
- Continuous random variable: Can take any value in an interval (measurable)
Examples: \(X\) = number of heads in 5 coin flips (discrete), \(Y\) = height of randomly selected person (continuous)
Probability Distribution
Discrete Probability Distribution
A probability distribution shows all possible values and their probabilities
Requirements:
- \(0 \leq P(X = x) \leq 1\) for all values
- \(\sum P(X = x) = 1\) (sum of all probabilities equals 1)
Expected Value and Variance
Expected Value and Variance
Expected Value (Mean): \(E(X)\) or \(\mu\)
\(E(X) = \sum x \cdot P(X = x)\)
Weighted average of all possible values
Variance: \(\text{Var}(X)\) or \(\sigma^2\)
\(\text{Var}(X) = E(X^2) - [E(X)]^2\)
or \(\text{Var}(X) = \sum (x - \mu)^2 \cdot P(X = x)\)
Measures spread of distribution
Standard Deviation: \(\sigma\)
\(\sigma = \sqrt{\text{Var}(X)}\)
Square root of variance; in same units as \(X\)
⚠ Common Pitfalls:
- Sum check: Always verify probabilities sum to 1
- E(X) vs E(X²): These are different! \(E(X^2) \neq [E(X)]^2\)
- Variance formula: Use \(E(X^2) - [E(X)]^2\) to avoid calculation errors
- Units: Variance has squared units; standard deviation has original units
2. Binomial Distribution
When to Use Binomial Distribution
Binomial Conditions (BINS):
- Binary outcomes: Each trial has only two outcomes (success/failure)
- Independent trials: Trials don't affect each other
- Number of trials fixed: Know exactly how many trials (\(n\))
- Same probability: Probability of success (\(p\)) constant for each trial
Examples: Coin flips, manufacturing defects (with replacement), multiple choice guessing
Binomial Distribution Notation and Formulas
Binomial Distribution \(X \sim B(n, p)\)
Notation: \(X \sim B(n, p)\) means \(X\) follows binomial distribution
- \(n\): number of trials
- \(p\): probability of success on each trial
- \(X\): number of successes in \(n\) trials
Probability Formula:
\(P(X = r) = \binom{n}{r} p^r (1-p)^{n-r}\)
where \(\binom{n}{r} = \frac{n!}{r!(n-r)!}\) is the binomial coefficient
Expected Value and Variance:
\(E(X) = np\)
\(\text{Var}(X) = np(1-p)\)
\(\sigma = \sqrt{np(1-p)}\)
Using GDC for Binomial
Calculator Functions:
- binompdf(\(n\), \(p\), \(r\)): Finds \(P(X = r)\) exactly
- binomcdf(\(n\), \(p\), \(r\)): Finds \(P(X \leq r)\) cumulative
- \(P(X \geq r)\): Calculate as \(1 - P(X \leq r-1)\)
- \(P(a \leq X \leq b)\): Calculate as \(P(X \leq b) - P(X \leq a-1)\)
💡 Binomial Tips:
- Always use GDC for calculations—faster and more accurate
- Check BINS conditions before using binomial distribution
- For "at least" questions: use complement or cumulative probability
- Mean always equals \(np\)—quick way to check reasonableness
Example 1: Binomial Distribution (IB-Style)
Problem: A fair coin is flipped 10 times. Let \(X\) be the number of heads.
(a) Write down the distribution of \(X\)
(b) Find \(P(X = 6)\)
(c) Find \(P(X \geq 8)\)
(d) Find the expected number of heads and standard deviation
Solution:
(a) Distribution:
Check BINS conditions:
✓ Binary: heads or tails
✓ Independent: coin flips don't affect each other
✓ Number fixed: \(n = 10\)
✓ Same probability: \(p = 0.5\) each flip
\(X \sim B(10, 0.5)\)
(b) \(P(X = 6)\):
Using GDC: binompdf(10, 0.5, 6)
\(P(X = 6) = 0.205\)
(c) \(P(X \geq 8)\):
\(P(X \geq 8) = P(X = 8) + P(X = 9) + P(X = 10)\)
Method 1: Direct calculation
Using GDC: binompdf(10, 0.5, 8) + binompdf(10, 0.5, 9) + binompdf(10, 0.5, 10)
\(= 0.0439 + 0.0098 + 0.00098 = 0.0547\)
Method 2: Using complement
\(P(X \geq 8) = 1 - P(X \leq 7)\)
Using GDC: 1 - binomcdf(10, 0.5, 7)
\(= 1 - 0.9453 = 0.0547\)
\(P(X \geq 8) = 0.0547\) or 5.47%
(d) Expected value and standard deviation:
\(E(X) = np = 10 \times 0.5 = 5\)
\(\sigma = \sqrt{np(1-p)} = \sqrt{10 \times 0.5 \times 0.5} = \sqrt{2.5} = 1.58\)
Expected heads: 5, Standard deviation: 1.58
3. Normal Distribution
The Normal Distribution
Normal Distribution \(X \sim N(\mu, \sigma^2)\)
Notation: \(X \sim N(\mu, \sigma^2)\)
- \(\mu\): mean (center of distribution)
- \(\sigma^2\): variance
- \(\sigma\): standard deviation (spread)
Key Properties:
- Bell-shaped, symmetric curve centered at \(\mu\)
- Mean = Median = Mode = \(\mu\)
- Total area under curve = 1
- Approximately 68% of data within 1 standard deviation of mean
- Approximately 95% within 2 standard deviations
- Approximately 99.7% within 3 standard deviations
Standardization and Z-scores
Standard Normal Distribution
Standard Normal: \(Z \sim N(0, 1)\)
Mean = 0, Standard deviation = 1
Z-score Formula:
\(z = \frac{x - \mu}{\sigma}\)
Converts any normal value to standard normal
Interpretation: Z-score tells how many standard deviations \(x\) is from the mean
Finding Normal Probabilities
Using GDC (Recommended):
- normalcdf(lower, upper, \(\mu\), \(\sigma\)): Finds \(P(a \leq X \leq b)\)
- For \(P(X < a)\): use normalcdf(-∞, a, \(\mu\), \(\sigma\))
- For \(P(X > a)\): use normalcdf(a, ∞, \(\mu\), \(\sigma\))
- invNorm(\(p\), \(\mu\), \(\sigma\)): Finds value \(x\) where \(P(X \leq x) = p\)
Note: On calculator, use large negative/positive numbers for -∞ and ∞ (e.g., -9999, 9999)
⚠ Normal Distribution Pitfalls:
- Variance vs standard deviation: Notation uses \(\sigma^2\), but GDC uses \(\sigma\)
- Parameter order: Check if your GDC wants mean first or last
- Sketch first: Draw normal curve to visualize what area you need
- Units matter: Z-score is unitless, but original \(x\) has units
Example 2: Normal Distribution (IB-Style)
Problem: Heights of adult males are normally distributed with mean 175 cm and standard deviation 8 cm. Let \(X\) represent height.
(a) Write the distribution of \(X\)
(b) Find the probability a randomly selected male is taller than 180 cm
(c) Find the probability a male's height is between 170 cm and 185 cm
(d) Find the height that 90% of males are shorter than
Solution:
(a) Distribution:
Mean \(\mu = 175\) cm, Standard deviation \(\sigma = 8\) cm
\(X \sim N(175, 8^2)\) or \(X \sim N(175, 64)\)
(b) \(P(X > 180)\):
Using GDC: normalcdf(180, 9999, 175, 8)
\(P(X > 180) = 0.266\) or 26.6%
Interpretation: About 26.6% of adult males are taller than 180 cm
(c) \(P(170 \leq X \leq 185)\):
Using GDC: normalcdf(170, 185, 175, 8)
\(P(170 \leq X \leq 185) = 0.630\) or 63.0%
(d) Height where 90% are shorter (90th percentile):
We want to find \(x\) where \(P(X \leq x) = 0.90\)
Using GDC: invNorm(0.90, 175, 8)
\(x = 185.25\) cm
Height = 185 cm (or 185.3 cm to 1 d.p.)
Interpretation: 90% of males are shorter than 185.3 cm; only 10% are taller
📋 Probability Distributions Quick Reference
| Distribution | Notation | Mean | Variance |
|---|---|---|---|
| General Discrete | \(X\) | \(E(X) = \sum x \cdot P(X=x)\) | \(E(X^2) - [E(X)]^2\) |
| Binomial | \(X \sim B(n, p)\) | \(np\) | \(np(1-p)\) |
| Normal | \(X \sim N(\mu, \sigma^2)\) | \(\mu\) | \(\sigma^2\) |
🎯 IB Exam Strategy
Common Question Types:
- "Find E(X)": Use \(\sum x \cdot P(X=x)\) or formulas for specific distributions
- "X follows binomial...": Check BINS, use GDC binompdf/binomcdf
- "Normally distributed...": Use normalcdf for probabilities, invNorm for values
- "At least/at most": Use complement or cumulative probabilities
- "Find k where P(X ≤ k) = ...": Use invNorm for normal, trial and error for binomial
Key Reminders:
- Always use GDC for binomial and normal calculations
- Check all probabilities sum to 1 for discrete distributions
- Binomial: check BINS conditions first
- Normal: sketch curve to visualize required area
- Expected value formulas: \(np\) for binomial, \(\mu\) for normal
- Variance: \(np(1-p)\) for binomial, \(\sigma^2\) for normal
🎉 Master Probability Distributions!
Probability distributions provide powerful mathematical models for random phenomena. The binomial distribution handles fixed-trial experiments, while the normal distribution models countless natural phenomena from heights to test scores. Master these distributions, their formulas, and GDC techniques to excel in IB exams and prepare for advanced statistics and data science!
Key Success Factors:
- ✓ Discrete: \(E(X) = \sum x \cdot P(X=x)\), Var = \(E(X^2) - [E(X)]^2\)
- ✓ Binomial: Check BINS conditions, \(X \sim B(n,p)\)
- ✓ Binomial formulas: \(E(X) = np\), \(\text{Var}(X) = np(1-p)\)
- ✓ Normal: \(X \sim N(\mu, \sigma^2)\), bell curve, symmetric
- ✓ Z-score: \(z = \frac{x-\mu}{\sigma}\) standardizes any normal value
- ✓ Use GDC: binompdf/cdf for binomial, normalcdf/invNorm for normal
Check Conditions • Use GDC • Interpret Results
Master probability distributions and excel in IB Mathematics! 🚀