Concept

Experimental Design

Definition

Experimental design is the discipline of planning a study so that its results can support a causal conclusion. The designer specifies which variable will be manipulated (the treatment or independent variable), which outcome will be measured (the dependent variable), how participants or units will be assigned to conditions, what other variables will be held constant or balanced across groups, and how many observations will be collected. Each of these choices is a defence against a specific threat to the inference.

The classical building blocks come from R. A. Fisher's work on agricultural trials: randomisation, replication, blocking, and controlled comparison. The same logic now underwrites randomised controlled trials in medicine, A/B testing in software, controlled field experiments in development economics, and laboratory studies across every empirical discipline. Where a question can be settled by deliberate intervention, a well-designed experiment is the most efficient way to settle it.

Why it matters

How it works

A standard randomised experiment runs in five stages. The researcher first defines the question and chooses an outcome that can be measured cleanly. They specify a treatment whose effect on that outcome is being tested, and an appropriate control condition. They calculate the sample size needed to detect a plausible effect with adequate statistical power. They then randomly assign units to the treatment and control groups, ideally with blinding in both directions. Finally, they collect data, analyse the difference between groups using a method specified in advance, and report effect sizes alongside any test of significance.

Randomisation is the engine that makes the design work. By assigning units to conditions independently of any of their other characteristics, it ensures that the treatment and control groups differ systematically only in the treatment itself. Any subsequent difference in outcomes can therefore be attributed to the treatment with quantified uncertainty. Replication — running the same experiment multiple times, ideally by independent teams — guards against the chance that a single significant result was a fluke. Blocking, stratification, and factorial designs let a single study answer more than one question without losing the protections that randomisation provides.

Where it goes next

Causationshares tag: causation
Correlation vs Causationshares tag: causation
Base Rateshares tag: inference
Confidence Intervalshares tag: inference
Correlationshares tag: inference
Correlation Coefficientshares tag: inference
Hypothesis Testingshares tag: inference
Null Hypothesisshares tag: inference
P-Valueshares tag: inference
Random Sampleshares tag: inference
Randomisationshares tag: experimental-design
Sample Sizeshares tag: inference
Samplingshares tag: inference
Sampling Distributionshares tag: inference
Significance Levelshares tag: inference
Simpson's Paradoxshares tag: causation
Spurious Correlationshares tag: causation
Statistical Inferenceshares tag: inference
Statistical Significanceshares tag: inference
80/20 Ruleshares tag: statistics
Argumentshares tag: inference
Attributionshares tag: causation
Bar Chartshares tag: statistics
Base Rate Fallacyshares tag: inference
Bayes Theoremshares tag: inference
Central Tendencyshares tag: statistics
Clinical Trialshares tag: statistics
Conditional Probabilityshares tag: inference
Conditional Value-at-Riskshares tag: statistics
Cost-Effectivenessshares tag: statistics
Data Literacyshares tag: statistics
Decision Under Uncertaintyshares tag: statistics
Deductionshares tag: inference
Descriptive Statisticsshares tag: statistics
Discrete Datashares tag: statistics
Distribution (Market Phase)shares tag: statistics
Distributionsshares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Epidemiologyshares tag: statistics
Failure Rateshares tag: statistics
Frequentist Probabilityshares tag: statistics
Frightening vs Dangerousshares tag: statistics
Great-Man Theoryshares tag: causation
Histogramshares tag: statistics
Income Levelsshares tag: statistics
Inferenceshares tag: inference
Information Coefficientshares tag: statistics
Least Squaresshares tag: statistics
Level vs Directionshares tag: statistics
Linear Regressionshares tag: statistics
Lonely Numbershares tag: statistics
Majority Trapshares tag: statistics
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Modus Ponensshares tag: inference
Mutually Exclusiveshares tag: statistics
Necessity and Sufficiencyshares tag: causation
Overfittingshares tag: statistics
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Pollingshares tag: statistics
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Premiseshares tag: inference
Principal Component Analysisshares tag: statistics
Probabilityshares tag: statistics
Questionnaire Designshares tag: statistics
Rank Correlationshares tag: statistics
Reasoningshares tag: inference
Regression to the Meanshares tag: statistics
Returnsshares tag: statistics
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sampling Biasshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Standard Deviationshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Validityshares tag: inference
Z-Scoreshares tag: statistics

Experimental Design

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

Experimental Design

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags