🎯 May 2026 AP Statistics FRQ Forecast & Deep Trend Analysis

Data-Driven Predictions Based on 222 Released FRQs (1997–2025)

📊 Executive Summary: What's Changing

📈 Total FRQs Analyzed

222 Questions

28 exam years (1997–2025, no 2020)

🎲 Parse Quality

100% Good

Zero unparsed items; all data verified

📉 Key Finding

Exploring Data

Dominates 40% of recent exams (2019–2024)

AP Statistics FRQs Explore 230+ questions & past papers
230+ Total FRQ Questions
29 Years of Exams
9 Content Units
100% Official College Board
Explore AP Statistics FRQs

🔮 Most Probable May 2026 FRQ Lineup

Question Primary Topic (Unit) Likely Focus Confidence
Q1 Exploring Data (Unit 1/2) Comparing distributions via boxplots/histograms 75%
Q2 Design & Sampling (Unit 3) Experimental design, blocking, or bias 60%
Q3 Probability (Unit 4) Conditional probability, binomial, expected value 80%
Q4 Inference-Proportions (Unit 6) 1-proportion z-test or z-interval 50%
Q5 Chi-Square (Unit 8) Test for independence (two-way table) 55%
Q6 Investigative Task (Multi-Unit) Sampling distributions or simulation 35%

🎯 Coverage: This forecast covers Units 1, 2, 3, 4, 6, 8 with Unit 5 (Sampling Distributions) in Q6 and Unit 7 (Inference for Means) as Q4/Q6 alternative.

📊 Historical Topic Trends: 1997–2025

Unit Frequency Distribution (All 222 FRQs)

Unit Description Count % of Total
Unit 7 Inference for Quantitative Data: Means 38 17.1%
Unit 4 Probability, Random Variables, Distributions 36 16.2%
Unit 6 Inference for Categorical Data: Proportions 36 16.2%
Unit 3 Collecting Data: Design & Sampling 30 13.5%
Unit 1 Exploring One-Variable Data 21 9.5%
Unit 2 Exploring Two-Variable Data 20 9.0%
Unit 8 Inference for Categorical Data: Chi-Square 18 8.1%
Unit 5 Sampling Distributions 12 5.4%
Unit 9 Inference for Quantitative Data: Slopes 10 4.5%

⚠️ Critical Shift: 1997–2018 vs. 2019–2024

Unit 3 (Design)

12% → 23% ↑ Surging

Unit 7 (Inference–Means)

17.8% → 13.3% ↓ Declining

Interpretation: College Board is emphasizing experimental design, sampling methods, and bias identification. Meanwhile, formal t-test procedures have declined significantly. This represents a shift from procedure-heavy to conceptual and exploratory questions.

🔬 Inference Procedure Trends (1997–2025)

Critical Finding: Only 59 of 222 FRQs (26%) involve formal inference procedures. In recent years (2019–2024), this drops to just 5 of 30 (17%). Several historically common procedures have vanished entirely.

Procedure All-Time Count Recent (2019–2024) Gap
Paired t-test 11 1 (2023) Current ✓
2-sample t-test 9 0 6+ years
2-proportion z-test 8 1 (2024) Current ✓
Chi-square independence 8 1 (2024) Current ✓
1-proportion z-test 5 0 12+ years
Slope t-test (Unit 9) 4 1 (2023) Current ✓

💡 Implication: The 2-sample t-test (historically #2 procedure, 9 appearances) has not appeared since 2018. This represents either a deliberate rotation or a permanent shift in CB's emphasis. Conservative students should still know this procedure (defensive prep), but recent trend suggests low probability for May 2026.

🎯 Top 5 Preparation Priorities for May 2026

1️⃣ Master Distribution Comparisons (Unit 1)

Use CSSO framework: Center, Shape, Spread, Outliers. Practice describing parallel boxplots, back-to-back stemplots, and histograms. Understand how transformations affect statistics: adding/subtracting constant shifts center only; multiplying scales both. Q1 appears as exploring data 78% of the time.

2️⃣ Experimental Design Fluency (Unit 3)

Know completely randomized design (CRD), randomized block design, and matched pairs. Understand three sampling methods: SRS, stratified, cluster. Distinguish random assignment (causation) from random sampling (generalization). Unit 3 surged to 23% in 2019–2024; Q2 shows 3 of 5 recent exams.

3️⃣ Probability Mechanics (Unit 4)

Drill conditional probability using two-way tables and tree diagrams: $$P(A|B) = rac{P(A \cap B)}{P(B)}$$. Master binomial probability: $$P(X=k) = inom{n}{k} p^k (1-p)^{n-k}$$ and expected value: $$E(X) = \sum x \cdot P(x)$$. Q3 is probability 80% of the time.

4️⃣ One-Proportion Inference Mastery (Unit 6)

Focus on 1-proportion z-test and z-interval. Memorize conditions: (1) random sample/assignment, (2) large counts ($$np \geq 10, n(1-p) \geq 10$$), (3) independence ($$n < 10\%$$ of population). Practice writing hypotheses, interpreting p-values in context, and linking conclusions to the scenario. Q4 has 50% probability of 1-prop inference.

5️⃣ Chi-Square Independence Testing (Unit 8)

Understand chi-square test for independence (appeared 2024 Q5). Calculate expected counts: $$E = rac{( ext{row total}) imes ( ext{col total})}{ ext{grand total}}$$. Verify condition: all expected counts $$\geq 5$$. Interpret conclusions: "Reject H₀ → convincing evidence of association between variables." Q5 shows 55% probability for chi-square.

✅ Critical Conditions Checklist

Recent scoring guidelines penalize incomplete condition checks more heavily. For every inference procedure, explicitly verify all three:

1. Random Condition

Random sample: "The problem states this is a random sample of 200 students."
Random assignment: "Treatments were randomly assigned to experimental units."

2. Independence Condition

10% rule: "Sample size 200 < 10% of population."
Pairs independent: "Matched pairs are independent of each other."

3. Normality/Large Counts

Large counts (proportions): "$$np = 200(0.5) = 100 \geq 10$$ ✓"
Large sample (means): "$$n = 40 \geq 30$$, so CLT applies." or "Histogram shows approximately normal."

❌ Common Error: "The conditions are met" (vague, no calculations). ✓ Correct: "Random sample ✓, $$np_0 = 200(0.65) = 130 \geq 10$$ ✓, $$n(1-p_0) = 70 \geq 10$$ ✓, $$n = 200 < 10\%$$ of all users ✓"

📍 Question Slot Patterns (2014–2024)

Historical analysis reveals strong slot tendencies. Each question position gravitates toward specific topics:

Q1: Exploring Data

6 of 11 (55% in 2014–2024). Historical: 29 of 37 (78%). Trend: Consistent dominance. Nearly always boxplots, histograms, or distribution comparison.

Q2: Design or Exploring

Recent split: 4 design, 4 exploring (2014–2024). Design surging. Q2 is where experimental design, blocking, and sampling methods dominate.

Q3: Probability

7 of 11 (64% in 2014–2024). Historical: 14 of 37 (38%). Most predictable non-Q1 slot. Conditional probability, binomial, expected value dominate.

Q4: Most Unpredictable

Recent: 5 exploring, 2 inference-means. Historical: diverse. Q4 is the most volatile slot; 50% forecast confidence is deserved.

Q5: Chi-Square Home

2 of 11 chi-square in recent years; 8 of 37 historically (22%). Q5 is traditional "chi-square slot," though competing with regression/means inference.

Q6: Investigative

Multi-skill synthesis. Recent: sampling distributions (2), exploring data (2), design (1). Q6 requires transfer learning and conceptual breadth.

📐 Key Inference Formulas & Conditions

1-Proportion Z-Test (Unit 6)

Test Statistic: $$z = rac{\hat{p} - p_0}{\sqrt{ rac{p_0(1-p_0)}{n}}}$$

Conditions: (1) Random sample, (2) $$np_0 \geq 10$$ AND $$n(1-p_0) \geq 10$$, (3) $$n < 10\%$$ of population

1-Proportion Z-Interval (Unit 6)

Confidence Interval: $$\hat{p} \pm z^* \sqrt{ rac{\hat{p}(1-\hat{p})}{n}}$$

Interpretation: "We are [C]% confident the true proportion is between [lower] and [upper]."

Paired t-Test (Unit 7)

Test Statistic: $$t = rac{ar{d}}{s_d/\sqrt{n}}$$ with df = $$n-1$$

When to Use: Two measurements on same individual OR matched pairs. Conditions: Random pairs, Independence, Normality (or $$n \geq 30$$)

Chi-Square Test for Independence (Unit 8)

Test Statistic: $$\chi^2 = \sum rac{( ext{Observed} - ext{Expected})^2}{ ext{Expected}}$$ with df = $$(r-1)(c-1)$$

Expected Count: $$E = rac{( ext{row total}) imes ( ext{column total})}{ ext{grand total}}$$

Conditions: (1) Random sample, (2) All expected counts $$\geq 5$$

Sampling Distribution of Sample Mean (Unit 5)

Center: $$\mu_{ar{x}} = \mu$$   Spread: $$\sigma_{ar{x}} = rac{\sigma}{\sqrt{n}}$$

Shape (Central Limit Theorem): For large $$n$$ (typically $$n \geq 30$$), $$ar{x}$$ is approximately normal even if population is non-normal.

🛡️ Defensive Prep: Low-Frequency But Possible Topics

Allocate 10–15% of prep time to these topics with long gaps. They haven't appeared recently but remain historically significant and could resurface.

🔴 HIGH RISK: Two-Sample t-Test (Difference of Means)

Last Appeared: 2018 Q4 (6-year gap)   Historical Frequency: 9 times (2nd-most-common procedure)   Risk Level: MEDIUM
Key Concept: Comparing means of two independent groups. Test statistic: $$t = rac{ar{x}_1 - ar{x}_2}{SE(ar{x}_1 - ar{x}_2)}$$. Conditions: Random samples, Independence, Normality (or large samples). Use calculator (don't pool unless specified).

🟡 MEDIUM RISK: One-Sample t-Test/Interval

Last Appeared: 2024 Q6 (t-interval); 2013 Q1 (t-test)   Historical Frequency: 4 times   Risk Level: LOW-MEDIUM
Key Concept: Single sample, quantitative data, unknown population SD (almost always in AP Stats). Test statistic: $$t = rac{ar{x} - \mu_0}{s/\sqrt{n}}$$. df = $$n-1$$. Conditions: Random sample, Independence, Normality (or large n).

🟡 MEDIUM RISK: Slope Inference (Unit 9)

Last Appeared: 2023 Q5 (recent)   Will be Removed: 2027 framework (but valid for 2026)   Risk Level: LOW
Key Concept: Test if slope $$ eq 0$$: $$t = rac{b - 0}{SE(b)}$$. Conditions: Linear relationship, Independence, Normality of residuals, Equal SD. May appear as "test for slope ≠ 1" or compare slopes across subgroups.

🟢 VERY LOW RISK: Chi-Square Goodness of Fit

Last Appeared: 2008 Q5 (17-year gap)   Historical Frequency: 1 time (essentially extinct)   Risk Level: VERY LOW
Key Concept: Test if observed distribution matches expected theoretical proportions. Conditions: Random sample, All expected counts $$\geq 5$$. Formula: $$\chi^2 = \sum rac{( ext{Obs} - ext{Exp})^2}{ ext{Exp}}$$. df = #categories - 1.

📅 Recommended Study Allocation (8-Week Prep)

Topic Area % of Time Key Focus
Exploring Data (Units 1–2) 20% CSSO, boxplots, transformations, Q1 dominance
Design & Sampling (Unit 3) 18% CRD, blocking, bias, Q2 surge
Probability (Unit 4) 17% Conditional prob, binomial, expected value
Inference-Proportions (Unit 6) 15% 1-prop z-test/interval, conditions, p-values
Chi-Square (Unit 8) 12% Independence test, expected counts, Q5
Sampling Distributions (Unit 5) 10% CLT, sampling dist of $$ar{x}$$, Q6
Defensive Prep (Rare Topics) 8% 2-sample t, 1-sample t, slope inference

💡 Pro Tip: Allocate 20% of your prep to "synthesis problems" (full FRQs covering multiple units, like Q6 investigative tasks). This builds conceptual breadth and transfer learning—critical for scoring well on novel problems.

❌ Common Scoring Errors (Avoid These!)

❌ Error: "H₀: There is no difference in proportions"
✓ Correct: "H₀: p₁ = p₂ vs. Hₐ: p₁ ≠ p₂" or "H₀: μ₁ = μ₂ vs. Hₐ: μ₁ > μ₂"

❌ Error: "Conditions are met" (no verification)
✓ Correct: "np₀ = 100(0.6) = 60 ≥ 10 ✓; n(1–p₀) = 40 ≥ 10 ✓; n = 100 < 10% of population ✓"

❌ Error: "p-value is 0.03, so reject H₀"
✓ Correct: "If H₀ is true, P(data this extreme) = 0.03. Since 0.03 < 0.05, reject H₀. Convincing evidence..."

❌ Error: "95% probability the true proportion is in this interval"
✓ Correct: "We are 95% confident the true proportion is between X and Y"

❌ Error: "The data show the groups are different"
✓ Correct: "Median group A (75 hours) is 10 hours higher than median group B (65 hours)"

🎓 Final Recommendations: Key Takeaways

1. Trust the Data, But Prepare Broadly

This forecast is based on 28 years of evidence. However, AP Statistics remains a broad-spectrum assessment. Allocate 80% to high-probability topics, 20% to defensive coverage. Don't neglect any unit entirely.

2. Master Conditions & Context (Recent Emphasis)

Recent scoring guidelines penalize vague condition checks and generic conclusions. Always show explicit calculations, link statistical findings to the original context (e.g., "convincing evidence that the true proportion of students who..."), and avoid mechanical responses.

3. Practice Full 4-Point FRQs, Not Isolated Concepts

Work through complete questions with parts (a)–(d). This mimics exam pacing and helps you integrate multiple skills. Q6 investigative tasks are particularly important: they synthesize multiple units and reward holistic understanding.

4. The Exam is About Statistical Thinking, Not Memorization

Understanding *why* you check conditions, *how* p-values connect to hypotheses, and *when* to use each procedure matters far more than memorizing formulas. This forecast highlights those conceptual shifts—exploring data and design now dominate over rote inference procedures.

📊 Forecast Based On: 222 Released AP Statistics FRQs (1997–2025)
📅 Updated: November 11, 2025
⚠️ Disclaimer: This forecast is probabilistic, not deterministic. May 2026 exam content remains uncertain. Use this guide as a complement to official College Board resources, not a replacement.