Concept

Correlation Coefficient

Definition

The correlation coefficient — most often Pearson's r — is a single number that captures how tightly two quantitative variables move together along a straight line. Its value runs from negative one through zero to positive one. A coefficient of positive one means the two variables rise together along a perfect straight line; negative one means one rises exactly as the other falls; zero means there is no linear relationship at all. Intermediate values describe partial alignment, with magnitudes near one indicating tight linear relationships and magnitudes near zero indicating loose ones.

The coefficient is dimensionless: it does not depend on the units in which the variables are measured. Converting height from inches to centimetres or income from dollars to euros leaves r unchanged. This makes it a useful common currency for comparing the strength of relationships across different studies and different scales.

Why it matters

How it works

Pearson's correlation coefficient is computed by standardising each variable to zero mean and unit variance, multiplying the standardised pairs, and averaging the products across the data set. The mathematics is exactly the covariance between the two variables divided by the product of their standard deviations. The standardisation strips out scale, which is why the resulting number is bounded between negative one and positive one regardless of the original units.

Interpretation requires care. The coefficient summarises linear association only; two variables linked by a strong but curved relationship — for example, an inverted-U shape — can have a correlation close to zero. A few extreme values can either inflate or suppress the coefficient relative to the bulk of the data. And the magnitude of correlation that counts as meaningful depends on context: r equal to zero point three is unremarkable in psychology, where many influences are at play, but striking in physics, where confounders are usually controlled. Always pair the coefficient with a scatter plot and an honest discussion of sample size and outliers.

Where it goes next

Correlationshares tag: inference
Correlation vs Causationshares tag: correlation
Base Rateshares tag: inference
Causationshares tag: inference
Confidence Intervalshares tag: inference
Experimental Designshares tag: inference
Hypothesis Testingshares tag: inference
Least Squaresshares tag: regression
Linear Regressionshares tag: regression
Null Hypothesisshares tag: inference
P-Valueshares tag: inference
Random Sampleshares tag: inference
Rank Correlationshares tag: correlation
Sample Sizeshares tag: inference
Samplingshares tag: inference
Sampling Distributionshares tag: inference
Significance Levelshares tag: inference
Spurious Correlationshares tag: correlation
Statistical Inferenceshares tag: inference
Statistical Significanceshares tag: inference
80/20 Ruleshares tag: statistics
Argumentshares tag: inference
Bar Chartshares tag: statistics
Base Rate Fallacyshares tag: inference
Bayes Theoremshares tag: inference
Central Tendencyshares tag: statistics
Clinical Trialshares tag: statistics
Conditional Probabilityshares tag: inference
Conditional Value-at-Riskshares tag: statistics
Cost-Effectivenessshares tag: statistics
Data Literacyshares tag: statistics
Decision Under Uncertaintyshares tag: statistics
Deductionshares tag: inference
Descriptive Statisticsshares tag: statistics
Discrete Datashares tag: statistics
Distribution (Market Phase)shares tag: statistics
Distributionsshares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Epidemiologyshares tag: statistics
Failure Rateshares tag: statistics
Frequentist Probabilityshares tag: statistics
Frightening vs Dangerousshares tag: statistics
Histogramshares tag: statistics
Income Levelsshares tag: statistics
Inferenceshares tag: inference
Information Coefficientshares tag: statistics
Level vs Directionshares tag: statistics
Lonely Numbershares tag: statistics
Majority Trapshares tag: statistics
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Modus Ponensshares tag: inference
Mutually Exclusiveshares tag: statistics
Overfittingshares tag: statistics
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Pollingshares tag: statistics
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Premiseshares tag: inference
Principal Component Analysisshares tag: statistics
Probabilityshares tag: statistics
Questionnaire Designshares tag: statistics
Randomisationshares tag: statistics
Reasoningshares tag: inference
Regression to the Meanshares tag: statistics
Returnsshares tag: statistics
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sampling Biasshares tag: statistics
Simpson's Paradoxshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Standard Deviationshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Validityshares tag: inference
Z-Scoreshares tag: statistics

Correlation Coefficient

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

Correlation Coefficient

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags