Concept

Least Squares

Definition

Least squares is a rule for choosing the best-fitting line, curve, or model through a set of data points. For each observation, the model's predicted value differs from the actual value by a residual; least squares picks the parameters that make the sum of the squared residuals as small as possible. Squaring serves two purposes: it prevents positive and negative deviations from cancelling, and it penalizes large errors disproportionately, so a single far-off point pulls the fit harder than several small misses combined.

Developed by Legendre and Gauss in the early nineteenth century to reconcile astronomical observations, least squares is now the default fitting method behind linear regression and many of its extensions.

Why it matters

How it works

For a simple linear model that predicts a response from a single predictor, the goal is to pick values for the intercept and slope so that the sum of squared vertical distances from each data point to the line is minimized. Calculus gives a tidy result: the optimal slope equals the covariance of the predictor and response divided by the variance of the predictor, and the optimal intercept makes the line pass through the point at the mean of both variables. No iterative search is needed; the formulas drop out directly.

The same principle generalizes. With many predictors, the closed-form solution uses matrix algebra (the normal equations) to give the coefficient vector in one step. The geometric interpretation is that the fitted values are the projection of the response vector onto the column space of the predictor matrix — the closest reachable point under the squared-distance metric. The squared-distance criterion is so analytically convenient that statisticians lived with its sensitivity to outliers for centuries before robust alternatives became practical.

Where it goes next

Correlationshares tag: regression
Correlation Coefficientshares tag: regression
Linear Regressionshares tag: regression
80/20 Ruleshares tag: statistics
Bar Chartshares tag: statistics
Base Rateshares tag: statistics
Bottlenecksshares tag: optimization
Causationshares tag: statistics
Central Tendencyshares tag: statistics
Clinical Trialshares tag: statistics
Conditional Value-at-Riskshares tag: statistics
Confidence Intervalshares tag: statistics
Correlation vs Causationshares tag: statistics
Cost-Effectivenessshares tag: statistics
Data Literacyshares tag: statistics
Decision Under Uncertaintyshares tag: statistics
Descriptive Statisticsshares tag: statistics
Discrete Datashares tag: statistics
Distribution (Market Phase)shares tag: statistics
Distributionsshares tag: statistics
Dollar Streetshares tag: statistics
Doubling Lineshares tag: statistics
Epidemiologyshares tag: statistics
Experimental Designshares tag: statistics
Failure Rateshares tag: statistics
Frequentist Probabilityshares tag: statistics
Frightening vs Dangerousshares tag: statistics
Histogramshares tag: statistics
Hypothesis Testingshares tag: statistics
Income Levelsshares tag: statistics
Information Coefficientshares tag: statistics
Level vs Directionshares tag: statistics
Lonely Numbershares tag: statistics
Majority Trapshares tag: statistics
Meanshares tag: statistics
Mean Reversionshares tag: statistics
Measurement Errorshares tag: statistics
Medianshares tag: statistics
Misleading Statisticsshares tag: statistics
Mutually Exclusiveshares tag: statistics
Null Hypothesisshares tag: statistics
Overfittingshares tag: statistics
P-Valueshares tag: statistics
Peak Childshares tag: statistics
Per Capita Ratioshares tag: statistics
Percentageshares tag: statistics
Performance Rankshares tag: statistics
Pie Chartshares tag: statistics
Placebo Effectshares tag: statistics
Pollingshares tag: statistics
Population Projectionshares tag: statistics
Precision vs. Accuracyshares tag: statistics
Principal Component Analysisshares tag: statistics
Probabilityshares tag: statistics
Questionnaire Designshares tag: statistics
Random Sampleshares tag: statistics
Randomisationshares tag: statistics
Rank Correlationshares tag: statistics
Regression to the Meanshares tag: statistics
Returnsshares tag: statistics
Risk Calculationshares tag: statistics
Rolling Metricsshares tag: statistics
S-Curveshares tag: statistics
Sample Sizeshares tag: statistics
Samplingshares tag: statistics
Sampling Biasshares tag: statistics
Sampling Distributionshares tag: statistics
Significance Levelshares tag: statistics
Simpson's Paradoxshares tag: statistics
Size Instinctshares tag: statistics
Slow Changeshares tag: statistics
Small Stepsshares tag: statistics
Spurious Correlationshares tag: statistics
Standard Deviationshares tag: statistics
Statistical Inferenceshares tag: statistics
Statistical Significanceshares tag: statistics
Straight Line Instinctshares tag: statistics
Time-Series Datashares tag: statistics
Z-Scoreshares tag: statistics

Least Squares

Definition

Why it matters

How it works

Where it goes next

Continue exploring

Tags

Least Squares

Definition

Why it matters

How it works

Where it goes next

Related concepts

Continue exploring

Tags