Concept

Hypothesis Testing

Definition

Hypothesis testing is a structured procedure for using sample data to decide between two competing claims about a population. The default claim — the null hypothesis, written H0 — typically asserts no effect, no difference, or no relationship. The rival claim — the alternative hypothesis, H1 — asserts that some effect exists. The test asks whether the observed data would be surprising if the null were true; if the data is sufficiently unlikely under the null, the analyst rejects it in favour of the alternative.

The procedure is the foundation of statistical inference and the standard reporting format across empirical science. Its central output is a p-value: the probability of seeing data at least as extreme as what was observed, assuming the null is correct.

Why it matters

How it works

A standard test proceeds in five steps. First, state the null and alternative hypotheses in measurable terms — for example, the population mean equals a specific value, or two group means are equal. Second, choose a significance level (commonly 0.05) that fixes how much risk of a false-positive error you will tolerate. Third, compute a test statistic from the sample — a t-statistic, a z-score, or a chi-squared value — that measures how far the observed data sits from what the null predicts. Fourth, convert that statistic into a p-value using the relevant theoretical distribution. Fifth, compare the p-value to the significance level: if smaller, reject the null; if larger, fail to reject it.

Two error types frame the trade-off. A Type I error rejects a null that is actually true (false positive); a Type II error fails to reject a null that is actually false (false negative). Tightening the significance level reduces Type I errors but raises Type II errors, and vice versa. Sample size is the lever that improves both — more data lets the test detect smaller true effects without inflating false positives.

Where it goes next

Null Hypothesisshares tag: inference
P-Valueshares tag: inference
Significance Levelshares tag: inference
Statistical Significanceshares tag: inference
Base Rateshares tag: inference
Causationshares tag: inference
Confidence Intervalshares tag: inference
Correlationshares tag: inference
Correlation Coefficientshares tag: inference
Correlation vs Causationshares tag: inference
Experimental Designshares tag: inference
Random Sampleshares tag: inference
Sample Sizeshares tag: inference
Samplingshares tag: inference
Sampling Distributionshares tag: inference
Statistical Inferenceshares tag: inference
80/20 Ruleshares tag: statistics
Argumentshares tag: inference
Bar Chartshares tag: statistics
Base Rate Fallacyshares tag: inference
Bayes Theoremshares tag: inference
Central Tendencyshares tag: statistics
Clinical Trialshares tag: statistics
Conditional Probabilityshares tag: inference
Conditional Value-at-Riskshares tag: statistics
Cost-Effectivenessshares tag: statistics
Data Literacyshares tag: statistics
Decision Under Uncertaintyshares tag: statistics
Deductionshares tag: inference
Descriptive Statisticsshares tag: statistics
Discrete Datashares tag: statistics
Distribution (Market Phase)shares tag: statistics
Distributionsshares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Epidemiologyshares tag: statistics
Failure Rateshares tag: statistics
Frequentist Probabilityshares tag: statistics
Frightening vs Dangerousshares tag: statistics
Histogramshares tag: statistics
Income Levelsshares tag: statistics
Inferenceshares tag: inference
Information Coefficientshares tag: statistics
Least Squaresshares tag: statistics
Level vs Directionshares tag: statistics
Linear Regressionshares tag: statistics
Lonely Numbershares tag: statistics
Majority Trapshares tag: statistics
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Modus Ponensshares tag: inference
Mutually Exclusiveshares tag: statistics
Overfittingshares tag: statistics
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Pollingshares tag: statistics
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Premiseshares tag: inference
Principal Component Analysisshares tag: statistics
Probabilityshares tag: statistics
Questionnaire Designshares tag: statistics
Randomisationshares tag: statistics
Rank Correlationshares tag: statistics
Reasoningshares tag: inference
Regression to the Meanshares tag: statistics
Returnsshares tag: statistics
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sampling Biasshares tag: statistics
Simpson's Paradoxshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Spurious Correlationshares tag: statistics
Standard Deviationshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Validityshares tag: inference
Z-Scoreshares tag: statistics

Hypothesis Testing

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

Hypothesis Testing

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags