Unit 1.9 – Comparing Distributions of a Quantitative Variable

Comparative statistics: How do groups compare? Use statistics and visuals to uncover differences in shape, center, spread, and outliers!

⚖️ When & Why Compare Distributions?

  • When analyzing data for two or more groups (e.g., test scores for Class A vs. Class B)
  • To identify differences/similarities in shape, center, spread, and outliers
  • For context in AP Stats FRQs and real-world data investigations

🔬 How to Compare — The SOCS Approach

  • Shape: Are both symmetric? Skewed? Bimodal? Uniform?
  • Outliers: Any unusual/extreme values in either group?
  • Center: Which group has greater median/mean?
  • Spread: Compare ranges, IQRs, or standard deviations
  • Always add context (e.g., “Class A’s median test score is 82, while Class B’s is 74.”)
Comparison Table Example
Feature Group 1 Group 2
Center (Median) 75 82
Mean 73.1 81.5
Spread (IQR) 19 15
Outliers? 1 high score none
Shape Skewed left Roughly symmetric

📦 Best Graphs for Comparing Distributions

  • Parallel boxplots: Quickly show center, spread, outliers for multiple groups together.
  • Back-to-back stem-and-leaf plots: Great for smaller data sets, all values visible.
  • Multiple dotplots: Put on same axis or stack for fine-grained comparison.
  • Avoid using just one histogram—hard to compare unless overlaid or side-by-side.

🔢 Key Stats & Formulas for Comparison

Mean
\[ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i \]
Median
Order data, select middle value (average two if even count)
IQR (Interquartile Range)
\[ IQR = Q_3 - Q_1 \]
Standard Deviation
\[ s = \sqrt{ \frac{1}{n-1} \sum_{i=1}^n (x_i - \bar{x})^2 } \]
How to Compare (Example Response)
Shape:
The distribution of Group A’s scores is right-skewed, while Group B’s is symmetric.
Center:
Group A’s median is lower than Group B’s median.
Spread:
Group A has a larger IQR and range than Group B.
Outliers:
Group A has one low outlier; Group B has none.

💡 Tips & Tricks for AP Stats

  • Always describe ALL four of SOCS when comparing groups.
  • Give actual statistics (values) in your comparison, not just "higher/lower."
  • Name the context—meanings and units (e.g., “heights in cm”).
  • Use “greater than”, “about the same as”, “more spread out” for clarity and credit.
  • Sketch parallel boxplots for instant visual clarity.
  • If possible, explain real-world reasons for any differences.

❌ Common Mistakes

  • Describing only center—always compare all SOCS aspects.
  • Giving vague statements (“Group A is higher”). Quantify and compare specifically!
  • Omitting context or units.
  • Ignoring outliers in one or both groups.
  • Comparing means when distributions are highly skewed or outlier-heavy (use median/IQR instead).
  • Failing to use or describe appropriate comparative visuals (boxplots etc.).
Summary:
Unit 1.9 is about comparing quantitative distributions using SOCS: clearly state and support all comparisons of shape, outliers, center, and spread, include statistics and context, and use graphs for effective comparisons. This is essential to AP Stats FRQ and real analysis!