Concept

P-Value

Definition

A p-value is the probability, computed under the assumption that the null hypothesis is true, of obtaining a test statistic at least as extreme as the one actually observed. It is a measure of compatibility between the data and the null model: small p-values indicate that the observed sample would be surprising if the null were true, large ones indicate that it would not.

Conventionally, when the p-value falls below a pre-set significance level (often 0.05), the result is called statistically significant and the null is rejected. The threshold is a convention, not a law of nature — disciplines have moved toward stricter cutoffs (0.01, 0.001) as awareness of replication failures has grown.

Why it matters

How it works

Computing a p-value follows a fixed recipe. The analyst formulates the null and alternative hypotheses, chooses an appropriate test statistic (a t for means, a chi-square for categorical counts, an F for variance ratios), and computes its value from the sample. Each test statistic has a known distribution under the null — the sampling distribution — derived from probability theory. The p-value is then the tail area of that distribution beyond the observed statistic. Modern software returns the number directly; historically, analysts looked it up in tables.

The mechanism makes clear what the p-value is and is not. It is a conditional probability: how likely is data this extreme, given the null. It is not the probability that the null is true — that would require flipping the conditioning, which needs a prior probability and Bayes' rule. A p-value of 0.03 does NOT mean the null has a 3% chance of being true; it means that if the null were true, you would see data this extreme only 3% of the time. The shift in conditioning is subtle, but mishandling it is how researchers end up with claims that do not survive replication. The other practical danger is p-hacking — running many tests until one falls under 0.05 by chance alone. A p-value only carries the advertised meaning when the test was specified in advance and a single hypothesis was tested.

Where it goes next

Null Hypothesisshares tag: hypothesis-testing
Hypothesis Testingshares tag: inference
Significance Levelshares tag: hypothesis-testing
Statistical Significanceshares tag: hypothesis-testing
Base Rateshares tag: inference
Causationshares tag: inference
Confidence Intervalshares tag: inference
Correlationshares tag: inference
Correlation Coefficientshares tag: inference
Correlation vs Causationshares tag: inference
Experimental Designshares tag: inference
Random Sampleshares tag: inference
Sample Sizeshares tag: inference
Samplingshares tag: inference
Sampling Distributionshares tag: inference
Statistical Inferenceshares tag: inference
80/20 Ruleshares tag: statistics
Argumentshares tag: inference
Bar Chartshares tag: statistics
Base Rate Fallacyshares tag: inference
Bayes Theoremshares tag: inference
Central Tendencyshares tag: statistics
Clinical Trialshares tag: statistics
Conditional Probabilityshares tag: inference
Conditional Value-at-Riskshares tag: statistics
Cost-Effectivenessshares tag: statistics
Data Literacyshares tag: statistics
Decision Under Uncertaintyshares tag: statistics
Deductionshares tag: inference
Descriptive Statisticsshares tag: statistics
Discrete Datashares tag: statistics
Distribution (Market Phase)shares tag: statistics
Distributionsshares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Epidemiologyshares tag: statistics
Failure Rateshares tag: statistics
Frequentist Probabilityshares tag: statistics
Frightening vs Dangerousshares tag: statistics
Histogramshares tag: statistics
Income Levelsshares tag: statistics
Inferenceshares tag: inference
Information Coefficientshares tag: statistics
Least Squaresshares tag: statistics
Level vs Directionshares tag: statistics
Linear Regressionshares tag: statistics
Lonely Numbershares tag: statistics
Majority Trapshares tag: statistics
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Modus Ponensshares tag: inference
Mutually Exclusiveshares tag: statistics
Overfittingshares tag: statistics
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Pollingshares tag: statistics
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Premiseshares tag: inference
Principal Component Analysisshares tag: statistics
Probabilityshares tag: statistics
Questionnaire Designshares tag: statistics
Randomisationshares tag: statistics
Rank Correlationshares tag: statistics
Reasoningshares tag: inference
Regression to the Meanshares tag: statistics
Returnsshares tag: statistics
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sampling Biasshares tag: statistics
Simpson's Paradoxshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Spurious Correlationshares tag: statistics
Standard Deviationshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Validityshares tag: inference
Z-Scoreshares tag: statistics

P-Value

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

P-Value

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags