Concept

Statistical Significance

Definition

A result is statistically significant when the data is sufficiently inconsistent with the null hypothesis that the analyst is willing to reject it. Operationally, the test computes a p-value — the probability of observing a result at least as extreme as the one in hand, under the assumption that the null hypothesis is true — and compares it to a pre-chosen significance level α. If the p-value is smaller than α, the result is declared significant.

The phrase is widely used and widely abused. Statistical significance is a narrow technical claim about the compatibility of data with a specific null. It is not a claim about effect size, practical importance, or the probability that the alternative hypothesis is true. Each of those is a separate question requiring separate evidence.

Why it matters

How it works

The test machinery starts from the null hypothesis, a default position usually stating that there is no effect, no difference, or no association. The analyst computes a test statistic from the data and asks: if the null were true, how often would I see a test statistic this extreme or more? That probability is the p-value. If it falls below α (conventionally 0.05), the analyst rejects the null and reports the result as statistically significant. The threshold is not magic: it is the false-positive rate the analyst has decided to tolerate, that is, how often a true null would be rejected by chance alone.
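The procedure can be sketched on a toy problem. The example below is an illustration under stated assumptions, not a prescribed method: the null hypothesis is that a coin is fair, the test statistic is the number of heads in 100 flips, and the two-sided p-value sums the probability of every outcome that is no more likely under the null than the one observed. The function names and the coin-flip scenario are hypothetical; only the Python standard library is used.

```python
from math import comb


def binomial_pmf(k, n, p0):
    """Probability of exactly k successes in n trials if the null rate p0 is true."""
    return comb(n, k) * p0**k * (1 - p0) ** (n - k)


def two_sided_p_value(k_obs, n, p0=0.5):
    """Exact two-sided p-value: total null probability of all outcomes
    that are at most as likely as the observed count."""
    observed = binomial_pmf(k_obs, n, p0)
    # Small relative tolerance so that ties (e.g. 40 vs 60 heads) are not
    # lost to floating-point rounding.
    cutoff = observed * (1 + 1e-9)
    total = sum(
        pmf
        for pmf in (binomial_pmf(k, n, p0) for k in range(n + 1))
        if pmf <= cutoff
    )
    return min(1.0, total)


def is_significant(p_value, alpha=0.05):
    """Reject the null when the p-value falls below the chosen level alpha."""
    return p_value < alpha


if __name__ == "__main__":
    for heads in (60, 61):
        p = two_sided_p_value(heads, 100)
        print(f"{heads} heads in 100 flips: p = {p:.4f}, "
              f"significant at 0.05: {is_significant(p)}")
```

In this sketch, 60 heads lands just above the conventional 0.05 threshold and 61 heads lands below it. A single extra head flips the verdict, which illustrates why the threshold is a tolerance the analyst chose rather than a natural boundary.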

A common misreading treats the p-value as the probability that the null is true given the data. It is the reverse conditional: the probability of data at least this extreme given the null. The two quantities are related by Bayes' theorem, and they need not be close. A significant p-value tells you that the data is unusual under the null, which lets you reject the null; it does not directly tell you how likely any particular alternative is.

The other persistent confusion is between statistical and practical significance. A medication that lowers blood pressure by half a point may show a statistically significant effect in a trial of fifty thousand patients and still be clinically meaningless. The discipline is to report effect sizes and confidence intervals alongside the significance verdict, so the reader can judge magnitude as well as detectability.
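The gap between statistical and practical significance can be made concrete with a rough calculation. The sketch below assumes a two-arm trial with equal arm sizes, a known common standard deviation, and a normal approximation (a two-sample z-test). The standard deviation of 15 points is an assumed value chosen purely for illustration, and the function name is hypothetical; the half-point difference and the fifty thousand patients come from the example above.

```python
from math import erfc, sqrt


def two_sample_z_test(mean_diff, sd, n_per_arm, z_crit=1.96):
    """Two-sided z-test for a difference in means between two equal-sized arms.

    Assumes a known, common standard deviation `sd` in both arms.
    Returns (z statistic, p-value, approximate 95% confidence interval).
    """
    se = sd * sqrt(2.0 / n_per_arm)        # standard error of the difference
    z = mean_diff / se
    p_value = erfc(abs(z) / sqrt(2.0))     # two-sided normal tail probability
    ci = (mean_diff - z_crit * se, mean_diff + z_crit * se)
    return z, p_value, ci


if __name__ == "__main__":
    ASSUMED_SD = 15.0   # illustrative assumption, not taken from any real trial
    EFFECT = 0.5        # half a point, as in the text

    # 50,000 patients split evenly across two arms, versus a 500-patient trial.
    for n_per_arm in (25_000, 250):
        z, p, (lo, hi) = two_sample_z_test(EFFECT, ASSUMED_SD, n_per_arm)
        print(f"n per arm = {n_per_arm}: z = {z:.2f}, p = {p:.4f}, "
              f"95% CI = ({lo:.2f}, {hi:.2f}), "
              f"standardised effect = {EFFECT / ASSUMED_SD:.3f}")
```

Under these assumptions the same half-point effect is clearly significant in the large trial and nowhere near significant in the small one, while the standardised effect size is identical and tiny in both. The p-value tracks detectability; the effect size and interval carry the information about magnitude.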

Where it goes next

Significance Level (shares tag: hypothesis-testing)
Hypothesis Testing (shares tag: inference)
Null Hypothesis (shares tag: hypothesis-testing)
P-Value (shares tag: hypothesis-testing)
Base Rate (shares tag: inference)
Causation (shares tag: inference)
Confidence Interval (shares tag: inference)
Correlation (shares tag: inference)
Correlation Coefficient (shares tag: inference)
Correlation vs Causation (shares tag: inference)
Experimental Design (shares tag: inference)
Random Sample (shares tag: inference)
Sample Size (shares tag: inference)
Sampling (shares tag: inference)
Sampling Distribution (shares tag: inference)
Statistical Inference (shares tag: inference)
80/20 Rule (shares tag: statistics)
Argument (shares tag: inference)
Bar Chart (shares tag: statistics)
Base Rate Fallacy (shares tag: inference)
Bayes Theorem (shares tag: inference)
Central Tendency (shares tag: statistics)
Clinical Trial (shares tag: statistics)
Conditional Probability (shares tag: inference)
Conditional Value-at-Risk (shares tag: statistics)
Cost-Effectiveness (shares tag: statistics)
Data Literacy (shares tag: statistics)
Decision Under Uncertainty (shares tag: statistics)
Deduction (shares tag: inference)
Descriptive Statistics (shares tag: statistics)
Discrete Data (shares tag: statistics)
Distribution (Market Phase) (shares tag: statistics)
Distributions (shares tag: statistics)
Dollar Street (shares tag: statistics)
Doubling Line (shares tag: statistics)
Epidemiology (shares tag: statistics)
Failure Rate (shares tag: statistics)
Frequentist Probability (shares tag: statistics)
Frightening vs Dangerous (shares tag: statistics)
Histogram (shares tag: statistics)
Income Levels (shares tag: statistics)
Inference (shares tag: inference)
Information Coefficient (shares tag: statistics)
Least Squares (shares tag: statistics)
Level vs Direction (shares tag: statistics)
Linear Regression (shares tag: statistics)
Lonely Number (shares tag: statistics)
Majority Trap (shares tag: statistics)
Mean (shares tag: statistics)
Mean Reversion (shares tag: statistics)
Measurement Error (shares tag: statistics)
Median (shares tag: statistics)
Misleading Statistics (shares tag: statistics)
Modus Ponens (shares tag: inference)
Mutually Exclusive (shares tag: statistics)
Overfitting (shares tag: statistics)
Peak Child (shares tag: statistics)
Per Capita Ratio (shares tag: statistics)
Percentage (shares tag: statistics)
Performance Rank (shares tag: statistics)
Pie Chart (shares tag: statistics)
Placebo Effect (shares tag: statistics)
Polling (shares tag: statistics)
Population Projection (shares tag: statistics)
Precision vs. Accuracy (shares tag: statistics)
Premise (shares tag: inference)
Principal Component Analysis (shares tag: statistics)
Probability (shares tag: statistics)
Questionnaire Design (shares tag: statistics)
Randomisation (shares tag: statistics)
Rank Correlation (shares tag: statistics)
Reasoning (shares tag: inference)
Regression to the Mean (shares tag: statistics)
Returns (shares tag: statistics)
Risk Calculation (shares tag: statistics)
Rolling Metrics (shares tag: statistics)
S-Curve (shares tag: statistics)
Sampling Bias (shares tag: statistics)
Simpson's Paradox (shares tag: statistics)
Size Instinct (shares tag: statistics)
Slow Change (shares tag: statistics)
Small Steps (shares tag: statistics)
Spurious Correlation (shares tag: statistics)
Standard Deviation (shares tag: statistics)
Straight Line Instinct (shares tag: statistics)
Time-Series Data (shares tag: statistics)
Validity (shares tag: inference)
Z-Score (shares tag: statistics)
