Concept

Simpson's Paradox

Definition

Simpson's paradox is the statistical phenomenon in which a trend present in several sub-groups of a population reverses or disappears when the sub-groups are aggregated. A treatment can appear better than a control within every patient sub-group, yet appear worse when the data is pooled — and vice versa.

Named after the statistician Edward Simpson (1951), the paradox arises whenever the relative sizes of the sub-groups are correlated with both the treatment and the outcome — i.e. whenever there is a lurking confounder behind the aggregation.

Why it matters

How it works

Consider two treatments A and B applied across two patient sub-groups. Treatment A might cure 80% of sub-group 1 and 30% of sub-group 2; treatment B 90% and 40%. B wins in both sub-groups. But if A is mostly given to sub-group 1 (easy cases) and B mostly to sub-group 2 (hard cases), the pooled cure rates can show A winning overall.

The resolution depends on what the patient-mix represents. If sub-group membership is a confounder you should adjust for (severity of disease), the within-group comparison is right. If sub-group membership is itself caused by treatment (a mediator), the pooled comparison may be right. Without causal context, the data alone cannot tell you which.

Where it goes next

Probabilitylinked concept
Statistical Inferencelinked concept
Conditional Probabilitylinked concept
Base Rateshares tag: probability
Causationshares tag: causation
Clinical Trialshares tag: probability
Confidence Intervalshares tag: probability
Correlation vs Causationshares tag: causation
Discrete Datashares tag: probability
Distributionsshares tag: probability
Epidemiologyshares tag: probability
Experimental Designshares tag: causation
Failure Rateshares tag: probability
Frequentist Probabilityshares tag: probability
Mutually Exclusiveshares tag: probability
Pollingshares tag: probability
Randomisationshares tag: probability
Regression to the Meanshares tag: probability
Sampling Distributionshares tag: probability
Spurious Correlationshares tag: causation
80/20 Ruleshares tag: statistics
Actuarial Scienceshares tag: probability
Attributionshares tag: causation
Bar Chartshares tag: statistics
Base Rate Fallacyshares tag: probability
Bayes Theoremshares tag: probability
Bayesian-Frequentist Debateshares tag: probability
Bayesian Probabilityshares tag: probability
Bayesian Updateshares tag: probability
Binomial Distributionshares tag: probability
Birthday Paradoxshares tag: probability
Central Limit Theoremshares tag: probability
Central Tendencyshares tag: statistics
Classical Probabilityshares tag: probability
Conditional Value-at-Riskshares tag: statistics
Conjunction Fallacyshares tag: probability
Correlationshares tag: statistics
Correlation Coefficientshares tag: statistics
Cost-Effectivenessshares tag: statistics
Courtroom Probabilityshares tag: probability
Data Literacyshares tag: statistics
Decision Theoryshares tag: probability
Decision Under Uncertaintyshares tag: statistics
Descriptive Statisticsshares tag: statistics
Distribution (Market Phase)shares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Expected Utilityshares tag: probability
Expected Valueshares tag: probability
Fat-Tailed Distributionsshares tag: probability
Frightening vs Dangerousshares tag: statistics
Gambler's Fallacyshares tag: probability
Great-Man Theoryshares tag: causation
Histogramshares tag: statistics
History of Probabilityshares tag: probability
House Edgeshares tag: probability
Hypothesis Testingshares tag: statistics
Income Levelsshares tag: statistics
Independenceshares tag: probability
Information Coefficientshares tag: statistics
Information Theoryshares tag: probability
Kolmogorov Axiomsshares tag: probability
Laplaceshares tag: probability
Law of Large Numbersshares tag: probability
Least Squaresshares tag: statistics
Level vs Directionshares tag: statistics
Linear Regressionshares tag: statistics
Lonely Numbershares tag: statistics
Lotteryshares tag: probability
Majority Trapshares tag: statistics
Markov Chainshares tag: probability
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Monty Hall Problemshares tag: probability
Necessity and Sufficiencyshares tag: causation
Newcomb Problemshares tag: probability
Normal Distributionshares tag: probability
Null Hypothesisshares tag: statistics
Option Pricingshares tag: probability
Overfittingshares tag: statistics
P-Valueshares tag: statistics
Pascal-Fermat Correspondenceshares tag: probability
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Poisson Distributionshares tag: probability
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Principal Component Analysisshares tag: statistics
Principle Of Indifferenceshares tag: probability
Probability Axiomsshares tag: probability
Probability Distributionshares tag: probability
Prosecutor's Fallacyshares tag: probability
Questionnaire Designshares tag: statistics
Queueing Theoryshares tag: probability
Random Sampleshares tag: statistics
Random Variableshares tag: probability
Randomnessshares tag: probability
Rank Correlationshares tag: statistics
Reference Class Problemshares tag: probability
Reliabilityshares tag: probability
Returnsshares tag: statistics
Risk Aversionshares tag: probability
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sample Sizeshares tag: statistics
Sample Spaceshares tag: probability
Samplingshares tag: statistics
Sampling Biasshares tag: statistics
Significance Levelshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Standard Deviationshares tag: statistics
Statistical Significanceshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Varianceshares tag: probability
Weather Forecastingshares tag: probability
Z-Scoreshares tag: statistics

Simpson's Paradox

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

Simpson's Paradox

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags