🎯 May 2026 AP Statistics FRQ Forecast & Deep Trend Analysis
Data-Driven Predictions Based on 222 Released FRQs (1997–2025)
📊 Executive Summary: What's Changing
📈 Total FRQs Analyzed
222 Questions
28 exam years (1997–2025, no 2020)
🎲 Parse Quality
100% Good
Zero unparsed items; all data verified
📉 Key Finding
Exploring Data
Dominates 40% of recent exams (2019–2024)
🔮 Most Probable May 2026 FRQ Lineup
| Question | Primary Topic (Unit) | Likely Focus | Confidence |
|---|---|---|---|
| Q1 | Exploring Data (Unit 1/2) | Comparing distributions via boxplots/histograms | 75% |
| Q2 | Design & Sampling (Unit 3) | Experimental design, blocking, or bias | 60% |
| Q3 | Probability (Unit 4) | Conditional probability, binomial, expected value | 80% |
| Q4 | Inference-Proportions (Unit 6) | 1-proportion z-test or z-interval | 50% |
| Q5 | Chi-Square (Unit 8) | Test for independence (two-way table) | 55% |
| Q6 | Investigative Task (Multi-Unit) | Sampling distributions or simulation | 35% |
🎯 Coverage: This forecast covers Units 1, 2, 3, 4, 6, 8 with Unit 5 (Sampling Distributions) in Q6 and Unit 7 (Inference for Means) as Q4/Q6 alternative.
📊 Historical Topic Trends: 1997–2025
Unit Frequency Distribution (All 222 FRQs)
| Unit | Description | Count | % of Total |
|---|---|---|---|
| Unit 7 | Inference for Quantitative Data: Means | 38 | 17.1% |
| Unit 4 | Probability, Random Variables, Distributions | 36 | 16.2% |
| Unit 6 | Inference for Categorical Data: Proportions | 36 | 16.2% |
| Unit 3 | Collecting Data: Design & Sampling | 30 | 13.5% |
| Unit 1 | Exploring One-Variable Data | 21 | 9.5% |
| Unit 2 | Exploring Two-Variable Data | 20 | 9.0% |
| Unit 8 | Inference for Categorical Data: Chi-Square | 18 | 8.1% |
| Unit 5 | Sampling Distributions | 12 | 5.4% |
| Unit 9 | Inference for Quantitative Data: Slopes | 10 | 4.5% |
⚠️ Critical Shift: 1997–2018 vs. 2019–2024
Unit 3 (Design)
12% → 23% ↑ Surging
Unit 7 (Inference–Means)
17.8% → 13.3% ↓ Declining
Interpretation: College Board is emphasizing experimental design, sampling methods, and bias identification. Meanwhile, formal t-test procedures have declined significantly. This represents a shift from procedure-heavy to conceptual and exploratory questions.
🔬 Inference Procedure Trends (1997–2025)
Critical Finding: Only 59 of 222 FRQs (26%) involve formal inference procedures. In recent years (2019–2024), this drops to just 5 of 30 (17%). Several historically common procedures have vanished entirely.
| Procedure | All-Time Count | Recent (2019–2024) | Gap |
|---|---|---|---|
| Paired t-test | 11 | 1 (2023) | Current ✓ |
| 2-sample t-test | 9 | 0 | 6+ years |
| 2-proportion z-test | 8 | 1 (2024) | Current ✓ |
| Chi-square independence | 8 | 1 (2024) | Current ✓ |
| 1-proportion z-test | 5 | 0 | 12+ years |
| Slope t-test (Unit 9) | 4 | 1 (2023) | Current ✓ |
💡 Implication: The 2-sample t-test (historically #2 procedure, 9 appearances) has not appeared since 2018. This represents either a deliberate rotation or a permanent shift in CB's emphasis. Conservative students should still know this procedure (defensive prep), but recent trend suggests low probability for May 2026.
🎯 Top 5 Preparation Priorities for May 2026
1️⃣ Master Distribution Comparisons (Unit 1)
Use CSSO framework: Center, Shape, Spread, Outliers. Practice describing parallel boxplots, back-to-back stemplots, and histograms. Understand how transformations affect statistics: adding/subtracting constant shifts center only; multiplying scales both. Q1 appears as exploring data 78% of the time.
2️⃣ Experimental Design Fluency (Unit 3)
Know completely randomized design (CRD), randomized block design, and matched pairs. Understand three sampling methods: SRS, stratified, cluster. Distinguish random assignment (causation) from random sampling (generalization). Unit 3 surged to 23% in 2019–2024; Q2 shows 3 of 5 recent exams.
3️⃣ Probability Mechanics (Unit 4)
Drill conditional probability using two-way tables and tree diagrams: $$P(A|B) = rac{P(A \cap B)}{P(B)}$$. Master binomial probability: $$P(X=k) = inom{n}{k} p^k (1-p)^{n-k}$$ and expected value: $$E(X) = \sum x \cdot P(x)$$. Q3 is probability 80% of the time.
4️⃣ One-Proportion Inference Mastery (Unit 6)
Focus on 1-proportion z-test and z-interval. Memorize conditions: (1) random sample/assignment, (2) large counts ($$np \geq 10, n(1-p) \geq 10$$), (3) independence ($$n < 10\%$$ of population). Practice writing hypotheses, interpreting p-values in context, and linking conclusions to the scenario. Q4 has 50% probability of 1-prop inference.
5️⃣ Chi-Square Independence Testing (Unit 8)
Understand chi-square test for independence (appeared 2024 Q5). Calculate expected counts: $$E = rac{( ext{row total}) imes ( ext{col total})}{ ext{grand total}}$$. Verify condition: all expected counts $$\geq 5$$. Interpret conclusions: "Reject H₀ → convincing evidence of association between variables." Q5 shows 55% probability for chi-square.
✅ Critical Conditions Checklist
Recent scoring guidelines penalize incomplete condition checks more heavily. For every inference procedure, explicitly verify all three:
1. Random Condition
✓ Random sample: "The problem states this is a random sample of 200 students."
✓ Random assignment: "Treatments were randomly assigned to experimental units."
2. Independence Condition
✓ 10% rule: "Sample size 200 < 10% of population."
✓ Pairs independent: "Matched pairs are independent of each other."
3. Normality/Large Counts
✓ Large counts (proportions): "$$np = 200(0.5) = 100 \geq 10$$ ✓"
✓ Large sample (means): "$$n = 40 \geq 30$$, so CLT applies." or "Histogram shows approximately normal."
❌ Common Error: "The conditions are met" (vague, no calculations). ✓ Correct: "Random sample ✓, $$np_0 = 200(0.65) = 130 \geq 10$$ ✓, $$n(1-p_0) = 70 \geq 10$$ ✓, $$n = 200 < 10\%$$ of all users ✓"
📍 Question Slot Patterns (2014–2024)
Historical analysis reveals strong slot tendencies. Each question position gravitates toward specific topics:
Q1: Exploring Data
6 of 11 (55% in 2014–2024). Historical: 29 of 37 (78%). Trend: Consistent dominance. Nearly always boxplots, histograms, or distribution comparison.
Q2: Design or Exploring
Recent split: 4 design, 4 exploring (2014–2024). Design surging. Q2 is where experimental design, blocking, and sampling methods dominate.
Q3: Probability
7 of 11 (64% in 2014–2024). Historical: 14 of 37 (38%). Most predictable non-Q1 slot. Conditional probability, binomial, expected value dominate.
Q4: Most Unpredictable
Recent: 5 exploring, 2 inference-means. Historical: diverse. Q4 is the most volatile slot; 50% forecast confidence is deserved.
Q5: Chi-Square Home
2 of 11 chi-square in recent years; 8 of 37 historically (22%). Q5 is traditional "chi-square slot," though competing with regression/means inference.
Q6: Investigative
Multi-skill synthesis. Recent: sampling distributions (2), exploring data (2), design (1). Q6 requires transfer learning and conceptual breadth.
📐 Key Inference Formulas & Conditions
1-Proportion Z-Test (Unit 6)
Test Statistic: $$z = rac{\hat{p} - p_0}{\sqrt{rac{p_0(1-p_0)}{n}}}$$
Conditions: (1) Random sample, (2) $$np_0 \geq 10$$ AND $$n(1-p_0) \geq 10$$, (3) $$n < 10\%$$ of population
1-Proportion Z-Interval (Unit 6)
Confidence Interval: $$\hat{p} \pm z^* \sqrt{rac{\hat{p}(1-\hat{p})}{n}}$$
Interpretation: "We are [C]% confident the true proportion is between [lower] and [upper]."
Paired t-Test (Unit 7)
Test Statistic: $$t = rac{ar{d}}{s_d/\sqrt{n}}$$ with df = $$n-1$$
When to Use: Two measurements on same individual OR matched pairs. Conditions: Random pairs, Independence, Normality (or $$n \geq 30$$)
Chi-Square Test for Independence (Unit 8)
Test Statistic: $$\chi^2 = \sum rac{( ext{Observed} - ext{Expected})^2}{ ext{Expected}}$$ with df = $$(r-1)(c-1)$$
Expected Count: $$E = rac{( ext{row total}) imes ( ext{column total})}{ ext{grand total}}$$
Conditions: (1) Random sample, (2) All expected counts $$\geq 5$$
Sampling Distribution of Sample Mean (Unit 5)
Center: $$\mu_{ar{x}} = \mu$$ Spread: $$\sigma_{ar{x}} = rac{\sigma}{\sqrt{n}}$$
Shape (Central Limit Theorem): For large $$n$$ (typically $$n \geq 30$$), $$ar{x}$$ is approximately normal even if population is non-normal.
🛡️ Defensive Prep: Low-Frequency But Possible Topics
Allocate 10–15% of prep time to these topics with long gaps. They haven't appeared recently but remain historically significant and could resurface.
🔴 HIGH RISK: Two-Sample t-Test (Difference of Means)
Last Appeared: 2018 Q4 (6-year gap) Historical Frequency: 9 times (2nd-most-common procedure) Risk Level: MEDIUM
Key Concept: Comparing means of two independent groups. Test statistic: $$t = rac{ar{x}_1 - ar{x}_2}{SE(ar{x}_1 - ar{x}_2)}$$. Conditions: Random samples, Independence, Normality (or large samples). Use calculator (don't pool unless specified).
🟡 MEDIUM RISK: One-Sample t-Test/Interval
Last Appeared: 2024 Q6 (t-interval); 2013 Q1 (t-test) Historical Frequency: 4 times Risk Level: LOW-MEDIUM
Key Concept: Single sample, quantitative data, unknown population SD (almost always in AP Stats). Test statistic: $$t = rac{ar{x} - \mu_0}{s/\sqrt{n}}$$. df = $$n-1$$. Conditions: Random sample, Independence, Normality (or large n).
🟡 MEDIUM RISK: Slope Inference (Unit 9)
Last Appeared: 2023 Q5 (recent) Will be Removed: 2027 framework (but valid for 2026) Risk Level: LOW
Key Concept: Test if slope $$
eq 0$$: $$t = rac{b - 0}{SE(b)}$$. Conditions: Linear relationship, Independence, Normality of residuals, Equal SD. May appear as "test for slope ≠ 1" or compare slopes across subgroups.
🟢 VERY LOW RISK: Chi-Square Goodness of Fit
Last Appeared: 2008 Q5 (17-year gap) Historical Frequency: 1 time (essentially extinct) Risk Level: VERY LOW
Key Concept: Test if observed distribution matches expected theoretical proportions. Conditions: Random sample, All expected counts $$\geq 5$$. Formula: $$\chi^2 = \sum rac{( ext{Obs} - ext{Exp})^2}{ ext{Exp}}$$. df = #categories - 1.
📅 Recommended Study Allocation (8-Week Prep)
| Topic Area | % of Time | Key Focus |
|---|---|---|
| Exploring Data (Units 1–2) | 20% | CSSO, boxplots, transformations, Q1 dominance |
| Design & Sampling (Unit 3) | 18% | CRD, blocking, bias, Q2 surge |
| Probability (Unit 4) | 17% | Conditional prob, binomial, expected value |
| Inference-Proportions (Unit 6) | 15% | 1-prop z-test/interval, conditions, p-values |
| Chi-Square (Unit 8) | 12% | Independence test, expected counts, Q5 |
| Sampling Distributions (Unit 5) | 10% | CLT, sampling dist of $$ar{x}$$, Q6 |
| Defensive Prep (Rare Topics) | 8% | 2-sample t, 1-sample t, slope inference |
💡 Pro Tip: Allocate 20% of your prep to "synthesis problems" (full FRQs covering multiple units, like Q6 investigative tasks). This builds conceptual breadth and transfer learning—critical for scoring well on novel problems.
❌ Common Scoring Errors (Avoid These!)
❌ Error: "H₀: There is no difference in proportions"
✓ Correct: "H₀: p₁ = p₂ vs. Hₐ: p₁ ≠ p₂" or "H₀: μ₁ = μ₂ vs. Hₐ: μ₁ > μ₂"
❌ Error: "Conditions are met" (no verification)
✓ Correct: "np₀ = 100(0.6) = 60 ≥ 10 ✓; n(1–p₀) = 40 ≥ 10 ✓; n = 100 < 10% of population ✓"
❌ Error: "p-value is 0.03, so reject H₀"
✓ Correct: "If H₀ is true, P(data this extreme) = 0.03. Since 0.03 < 0.05, reject H₀. Convincing evidence..."
❌ Error: "95% probability the true proportion is in this interval"
✓ Correct: "We are 95% confident the true proportion is between X and Y"
❌ Error: "The data show the groups are different"
✓ Correct: "Median group A (75 hours) is 10 hours higher than median group B (65 hours)"
🎓 Final Recommendations: Key Takeaways
1. Trust the Data, But Prepare Broadly
This forecast is based on 28 years of evidence. However, AP Statistics remains a broad-spectrum assessment. Allocate 80% to high-probability topics, 20% to defensive coverage. Don't neglect any unit entirely.
2. Master Conditions & Context (Recent Emphasis)
Recent scoring guidelines penalize vague condition checks and generic conclusions. Always show explicit calculations, link statistical findings to the original context (e.g., "convincing evidence that the true proportion of students who..."), and avoid mechanical responses.
3. Practice Full 4-Point FRQs, Not Isolated Concepts
Work through complete questions with parts (a)–(d). This mimics exam pacing and helps you integrate multiple skills. Q6 investigative tasks are particularly important: they synthesize multiple units and reward holistic understanding.
4. The Exam is About Statistical Thinking, Not Memorization
Understanding *why* you check conditions, *how* p-values connect to hypotheses, and *when* to use each procedure matters far more than memorizing formulas. This forecast highlights those conceptual shifts—exploring data and design now dominate over rote inference procedures.
📊 Forecast Based On: 222 Released AP Statistics FRQs (1997–2025)
📅 Updated: November 11, 2025
⚠️ Disclaimer: This forecast is probabilistic, not deterministic. May 2026 exam content remains uncertain. Use this guide as a complement to official College Board resources, not a replacement.