Concept

Sampling

Definition

Sampling is the procedure by which a subset of individuals — the sample — is drawn from a larger population in order to learn something about that population without examining every member of it. The sample is the empirical handle we get; the population is the thing we actually want to know about. The validity of any statistical conclusion depends on how the bridge between the two was built.

There are many specific designs — simple random sampling, stratified sampling, cluster sampling, systematic sampling, convenience sampling — and each trades cost against representativeness in a different way. The choice of design is rarely neutral; it shapes which questions the resulting data can credibly answer.

Why it matters

How it works

Probability-based sampling gives every member of the population a known, non-zero chance of being included. Simple random sampling assigns equal probability to everyone; stratified sampling first divides the population into strata (age bands, regions, severity categories) and samples within each; cluster sampling picks intact groups (schools, postcode districts) rather than individuals, which is cheaper to administer but adds dependence between observations. The defining feature of these designs is that the selection mechanism is documented and the resulting estimator's behaviour can be derived mathematically.
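
The three designs described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a survey-grade implementation: the function names, the `fraction` and `n_clusters` parameters, and the toy population of pupils, schools and regions are all hypothetical and chosen only to make the selection mechanisms concrete.

```python
import random
from collections import defaultdict


def simple_random_sample(population, n, rng):
    """Every unit has the same inclusion probability, n / len(population)."""
    return rng.sample(population, n)


def stratified_sample(population, stratum_of, fraction, rng):
    """Group units by stratum, then draw a simple random sample inside each."""
    strata = defaultdict(list)
    for unit in population:
        strata[stratum_of(unit)].append(unit)
    sample = []
    for members in strata.values():
        # Proportional allocation; take at least one unit per stratum.
        k = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, k))
    return sample


def cluster_sample(population, cluster_of, n_clusters, rng):
    """Pick whole clusters at random and keep every unit in the chosen ones."""
    clusters = defaultdict(list)
    for unit in population:
        clusters[cluster_of(unit)].append(unit)
    chosen = rng.sample(sorted(clusters), n_clusters)
    return [unit for c in chosen for unit in clusters[c]]


# Hypothetical population: 1,000 pupils spread over 20 schools and 4 regions.
rng = random.Random(0)  # fixed seed so the draw is reproducible
population = [
    {"id": i, "school": i % 20, "region": i % 4}
    for i in range(1000)
]

srs = simple_random_sample(population, 100, rng)
strat = stratified_sample(population, lambda u: u["region"], 0.1, rng)
clus = cluster_sample(population, lambda u: u["school"], 2, rng)
```

All three calls return 100 pupils here, but the samples differ in structure: the stratified sample is guaranteed to contain 25 pupils from each region, while the cluster sample contains every pupil from just two schools, which is the source of the dependence between observations noted above.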

Non-probability sampling — recruiting from whoever happens to be available, posting an online survey, asking interview subjects to nominate further subjects — sacrifices that mathematical machinery for speed and access. Useful inferences are still possible, but they require assumptions about how the recruited sample relates to the broader population, and those assumptions are usually unverifiable from the data alone. The honest move with a non-probability sample is to describe what was actually collected and to be explicit about the populations the data does and does not represent.
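
A small simulation can make the cost of unverifiable assumptions visible. The sketch below is illustrative only: the population (weekly hours online, uniform between 0 and 40) and the response mechanism (chance of answering an online survey proportional to hours online) are invented assumptions, not data from any real study.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed so the run is reproducible

# Hypothetical population: weekly hours online, between 0 and 40.
population = [rng.uniform(0, 40) for _ in range(10_000)]
true_mean = statistics.mean(population)

# Probability sample: every person has the same chance of selection.
random_sample = rng.sample(population, 500)

# Convenience sample: an online survey where, by assumption, the chance of
# responding is proportional to hours spent online.
volunteers = [h for h in population if rng.random() < h / 40]
convenience_sample = volunteers[:500]

print(f"population mean:         {true_mean:.1f}")
print(f"random sample mean:      {statistics.mean(random_sample):.1f}")
print(f"convenience sample mean: {statistics.mean(convenience_sample):.1f}")
```

Under these assumptions the random sample's mean should sit close to the population mean, while the convenience sample's mean should land several hours higher, because heavy users are over-represented. Nothing in the convenience sample itself reveals that distortion; it is visible here only because the simulation also has access to the full population.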

Where it goes next

Random Sample (shares tag: inference)
Sample Size (shares tag: inference)
Sampling Bias (shares tag: sampling)
Sampling Distribution (shares tag: inference)
Base Rate (shares tag: inference)
Causation (shares tag: inference)
Confidence Interval (shares tag: inference)
Correlation (shares tag: inference)
Correlation Coefficient (shares tag: inference)
Correlation vs Causation (shares tag: inference)
Experimental Design (shares tag: inference)
Hypothesis Testing (shares tag: inference)
Null Hypothesis (shares tag: inference)
P-Value (shares tag: inference)
Polling (shares tag: sampling)
Significance Level (shares tag: inference)
Statistical Inference (shares tag: inference)
Statistical Significance (shares tag: inference)
80/20 Rule (shares tag: statistics)
Argument (shares tag: inference)
Bar Chart (shares tag: statistics)
Base Rate Fallacy (shares tag: inference)
Bayes Theorem (shares tag: inference)
Central Tendency (shares tag: statistics)
Clinical Trial (shares tag: statistics)
Conditional Probability (shares tag: inference)
Conditional Value-at-Risk (shares tag: statistics)
Cost-Effectiveness (shares tag: statistics)
Data Literacy (shares tag: statistics)
Decision Under Uncertainty (shares tag: statistics)
Deduction (shares tag: inference)
Descriptive Statistics (shares tag: statistics)
Discrete Data (shares tag: statistics)
Distribution (Market Phase) (shares tag: statistics)
Distributions (shares tag: statistics)
Dollar Street (shares tag: statistics)
Doubling Line (shares tag: statistics)
Epidemiology (shares tag: statistics)
Failure Rate (shares tag: statistics)
Frequentist Probability (shares tag: statistics)
Frightening vs Dangerous (shares tag: statistics)
Histogram (shares tag: statistics)
Income Levels (shares tag: statistics)
Inference (shares tag: inference)
Information Coefficient (shares tag: statistics)
Least Squares (shares tag: statistics)
Level vs Direction (shares tag: statistics)
Linear Regression (shares tag: statistics)
Lonely Number (shares tag: statistics)
Majority Trap (shares tag: statistics)
Mean (shares tag: statistics)
Mean Reversion (shares tag: statistics)
Measurement Error (shares tag: statistics)
Median (shares tag: statistics)
Misleading Statistics (shares tag: statistics)
Modus Ponens (shares tag: inference)
Mutually Exclusive (shares tag: statistics)
Overfitting (shares tag: statistics)
Peak Child (shares tag: statistics)
Per Capita Ratio (shares tag: statistics)
Percentage (shares tag: statistics)
Performance Rank (shares tag: statistics)
Pie Chart (shares tag: statistics)
Placebo Effect (shares tag: statistics)
Population Projection (shares tag: statistics)
Precision vs. Accuracy (shares tag: statistics)
Premise (shares tag: inference)
Principal Component Analysis (shares tag: statistics)
Probability (shares tag: statistics)
Questionnaire Design (shares tag: statistics)
Randomisation (shares tag: statistics)
Rank Correlation (shares tag: statistics)
Reasoning (shares tag: inference)
Regression to the Mean (shares tag: statistics)
Returns (shares tag: statistics)
Risk Calculation (shares tag: statistics)
Rolling Metrics (shares tag: statistics)
S-Curve (shares tag: statistics)
Simpson's Paradox (shares tag: statistics)
Size Instinct (shares tag: statistics)
Slow Change (shares tag: statistics)
Small Steps (shares tag: statistics)
Spurious Correlation (shares tag: statistics)
Standard Deviation (shares tag: statistics)
Straight Line Instinct (shares tag: statistics)
Time-Series Data (shares tag: statistics)
Validity (shares tag: inference)
Z-Score (shares tag: statistics)
