What is Simpson's Paradox?

Simpson's Paradox is a mental model used for better thinking and decision-making.

How do you apply Simpson's Paradox?

To apply Simpson's Paradox, identify situations where this framework is relevant, then use it as a lens to evaluate your options and decisions. The model is most useful when combined with other complementary mental models.

What category does Simpson's Paradox fall under?

Simpson's Paradox falls under the Mathematics & Probability category of mental models. Other models in this category can be found on the Mathematics & Probability hub page.

Why is Simpson's Paradox important?

Simpson's Paradox is important because it provides a structured way to think about problems that would otherwise be approached with intuition alone. Understanding this model helps you avoid common reasoning errors and make better decisions.

Where does Simpson's Paradox come from?

Simpson's Paradox is discussed in the tradition of Edward Simpson / Yule.

Simpson's Paradox Mental Model…

Simpson's Paradox Mental Model… | Faster Than Normal

Section 2

How to See It

Simpson's paradox appears when an aggregate trend reverses within subgroups. Look for "overall X is higher for A than B, but within every segment B is higher than A" — or the reverse. The diagnostic: when you slice by a plausible confound (segment, channel, cohort), does the direction of the comparison flip?

Business

You're seeing Simpson's Paradox when overall conversion falls month-over-month even though conversion improved in every segment (e.g. by device, region, source). The mix of traffic shifted toward segments with lower conversion — e.g. more mobile traffic. The aggregate hides the within-segment improvement. Slice by segment to see the real trend.

Technology

You're seeing Simpson's Paradox when a new algorithm shows worse overall engagement than the old one, but better engagement in every user cohort (new users, power users, etc.). The confound is that the new algorithm attracts or retains a different mix of users — e.g. more light users — so the aggregate comparison is misleading. Evaluate within cohort.

Investing

You're seeing Simpson's Paradox when a fund's overall return is lower than the index even though the fund outperforms in every sector it holds. The fund may be overweight sectors that did poorly; the sector mix, not stock selection, drives the aggregate underperformance. Segment by sector to judge selection.

Markets

You're seeing Simpson's Paradox when a policy or treatment appears to reduce an outcome overall but increase it in every demographic or region. The confound is usually composition: the treated group has a different mix. Policy evaluation requires within-group comparison or explicit adjustment for the confound.

Section 3

How to Use It

Decision filter

"When you see a trend or comparison (A vs B), ask: could a confound — a variable that differs across groups and affects the outcome — reverse the result within subgroups? Slice by segment, channel, or cohort. If the direction flips, report by segment and name the confound. Don't act on the aggregate alone."

As a founder

Slice metrics by segment before drawing conclusions. If overall conversion is down, check conversion by channel, cohort, and product. If it's up in every slice, the issue is mix shift — you're adding users or traffic from lower-converting segments. Fix the mix or fix the segment economics; don't optimise the wrong lever. When presenting to the board, show both aggregate and segment view so Simpson's paradox doesn't mislead.

As an investor

When a company reports a metric that moved the wrong way, ask for a segment breakdown. Is the trend consistent within segments, or does mix explain it? Portfolio-level returns can hide Simpson's paradox: the portfolio might underperform while every holding outperforms its peer group if the allocation is tilted toward underperforming segments.

As a decision-maker

Before acting on a comparison (this group vs that, this period vs that), check for confounds. Slice by the obvious candidates: segment, region, product, cohort. If the comparison reverses within slices, the aggregate is misleading. Decide based on the level that matches your question — segment-level fairness vs aggregate outcome — and state the confound explicitly.

Common misapplication: Treating the aggregate as the truth. When Simpson's paradox holds, the aggregate and the within-group results conflict. The right answer depends on the question. "Are we biased by department?" → look within department. "What is overall admission rate by gender?" → aggregate is correct but confounded by application mix. Specify the question, then choose the level of analysis.

Second misapplication: Ignoring mix shift. Many "our metric went down" stories are mix shift: you're adding volume in a segment with lower conversion, LTV, or margin. That is Simpson's paradox in growth form. Segment so you see whether you're improving within segment or just diluting with different mix.

Section 4

The Mechanism

Section 5

Founders & Leaders in Action

Jeff BezosFounder & CEO, Amazon, 1994–2021

Bezos emphasised "disaggregated metrics" — looking at segments rather than only totals. Amazon's culture of slicing by product, geography, and cohort reduces the risk of Simpson's paradox: a drop in aggregate conversion triggers a segment-level check. Mix shift (e.g. international growth with lower conversion) is understood as composition, not universal decline.

Reed HastingsCo-founder & CEO, Netflix

Netflix evaluates content and product by cohort and region. Overall engagement can move because of mix (more users in lower-engagement regions) rather than because every segment changed. Hastings has pushed for segment-level accountability so that teams don't hide behind aggregate numbers that are confounded by mix.

Section 6

Visual Explanation

Simpson's paradox: aggregate trend (e.g. A > B) reverses within subgroups (B > A in every segment). Cause: confound that differs across groups. Slice by segment to see the real relationship.

Section 7

Connected Models

Simpson's paradox sits at the intersection of confounding, correlation vs causation, and segmentation. These models either explain the paradox or help avoid it.

Reinforces

Confounding Factor

A confounding factor is a variable that influences both the explanatory variable and the outcome. Simpson's paradox is the dramatic case: the confound differs across groups and reverses the aggregate comparison. Controlling for the confound (slicing by it) reveals the real relationship. The two are the same idea at different levels of formality.

Reinforces

Segmentation

Segmentation is splitting the data into meaningful groups. Simpson's paradox is detected and resolved by segmenting: when you slice by the right variable, the paradox appears or disappears. Good segmentation is the antidote to aggregate illusion.

Tension

Correlation vs Causation

Correlation can reverse when you control for a confound — that is Simpson's paradox. The tension: the aggregate correlation may not reflect causation in any segment. Establishing causation requires controlling for confounds; Simpson's paradox is a warning that aggregate correlation can be misleading.

Tension

Selection Bias

Selection bias is when the sample is not representative. Simpson's paradox can look like selection: the "groups" (e.g. treated vs control) have different composition. The tension: sometimes the paradox is due to a confound you can measure and slice by; sometimes it's selection into the sample. Both require careful interpretation of aggregate vs within-group results.

Section 8

One Key Quote

"It is possible for a set of data to show a trend in a given direction when separated into groups, and the opposite trend when combined."
— Edward Simpson, 1951

The definition is the paradox. One dataset, two valid readings — and they point opposite ways. The takeaway: always ask whether the trend holds within subgroups. If it doesn't, the aggregate is confounded. Report both and name the confound.

Section 9

Analyst's Take

Faster Than Normal — Editorial View

Slice before you conclude. Any time a key metric moves, slice by segment, channel, and cohort. If the trend is positive in every slice but negative overall, you have mix shift — Simpson's paradox. Fix the mix or fix the segment; don't optimise the wrong thing.

Name the confound. When you present segment-level results that differ from the aggregate, state what's driving the reversal. "Overall conversion is down because mobile share increased and mobile converts lower." That sentence tells the reader you're not hiding behind the aggregate.

Match the level to the question. "Are we biased in admissions?" → look within department. "What's our overall gender admission rate?" → aggregate, but then explain mix. The paradox doesn't tell you which level is "right"; it tells you they can conflict. Choose the level that answers the question you care about.

Watch for it in A/B tests. If the overall treatment effect is zero or negative but positive in every segment, check for a confound (e.g. segment mix differed between arms). Pre-stratify or analyse within segment so the paradox doesn't hide a real effect.

Section 10

Test Yourself

Is this mental model at work here?

Scenario 1

Overall conversion rate falls from 4% to 3.5%. When the team slices by device, conversion is up on both mobile (2% to 2.2%) and desktop (6% to 6.2%). Traffic mix shifted from 50% mobile to 70% mobile.

Scenario 2

A university's overall admission rate is lower for women than men. Within every department, women's admission rate is equal or higher than men's. Women apply more to the most selective departments.

Scenario 3

A company reports that ARPU is down 5% year-over-year. They do not break out ARPU by segment or cohort.

Scenario 4

A team runs an A/B test. Overall treatment has no significant effect. Within each of three user segments, treatment is positive and significant. The team concludes the treatment works and ships it.

Section 11

Simpson's Paradox

Popular Mental Models

Continue exploring

The Core Idea

How to See It

How to Use It

The Mechanism

Founders & Leaders in Action

Visual Explanation

Connected Models

One Key Quote

Analyst's Take

Test Yourself

Is this mental model at work here?

Further Reading

This connects to...

Popular Mental Models

Continue exploring

More like this, in your inbox

The Core Idea

How to See It

How to Use It

The Mechanism

Founders & Leaders in Action

Visual Explanation

Connected Models

One Key Quote

Analyst's Take

Test Yourself

Is this mental model at work here?

Further Reading

This connects to...