A residual (or fitting deviation), on the other hand, is an observable estimate of the unobservable statistical error.
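The distinction can be illustrated with a small simulation. The sketch below is only an illustration under stated assumptions: the population mean of 1.75 m, the spread of 0.07 m, and the sample size are hypothetical values chosen for the example. In real data the population mean is unknown, which is exactly why the errors cannot be computed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical population of heights (metres). The mean is fixed here only
# so that the normally unobservable errors can be computed for comparison.
population_mean = 1.75
heights = rng.normal(loc=population_mean, scale=0.07, size=10)

# Statistical errors: deviations from the (normally unknown) population mean.
errors = heights - population_mean

# Residuals: deviations from the sample mean, computable from the data alone.
residuals = heights - heights.mean()

print("sum of errors:   ", errors.sum())     # generally not zero
print("sum of residuals:", residuals.sum())  # zero up to rounding
```

Each residual differs from the corresponding error by the same constant, namely the gap between the sample mean and the population mean.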

In a linear regression fitted with an intercept, the residuals necessarily sum to zero, and so they cannot be independent of one another. In regression analysis, the distinction between errors and residuals is subtle and important, and leads to the concept of studentized residuals.
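As a rough sketch of both points, the following fits a straight line by least squares, checks that the residuals sum to zero, and computes internally studentized residuals. The data, the coefficients 2.0 and 0.5, and the variable names are assumptions made for illustration. The studentization shown divides each residual by its estimated standard deviation, which depends on the leverage of the corresponding input.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical data: a straight line plus noise.
n = 50
x = rng.uniform(0.0, 10.0, size=n)
y = 2.0 + 0.5 * x + rng.normal(scale=1.0, size=n)

# Design matrix with an intercept column, then the least-squares fit.
X = np.column_stack([np.ones(n), x])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
residuals = y - X @ beta_hat

# With an intercept in the model, the residuals sum to zero (up to rounding).
print("sum of residuals:", residuals.sum())

# Leverage h_ii: diagonal of the hat matrix H = X (X'X)^{-1} X'.
leverage = np.einsum("ij,ij->i", X @ np.linalg.inv(X.T @ X), X)

# Internally studentized residuals: each residual divided by its
# estimated standard deviation sqrt(sigma2_hat * (1 - h_ii)).
p = X.shape[1]
sigma2_hat = residuals @ residuals / (n - p)
studentized = residuals / np.sqrt(sigma2_hat * (1.0 - leverage))
```

Because the leverage varies with the input, raw residuals at different inputs have different variances even when the errors all share the same variance; dividing by the estimated standard deviation puts them on a comparable scale.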

Understanding the error term is useful in ordinary least squares (OLS) because many of the method's important assumptions are statements about it.

Thus, to compare residuals at different inputs, one needs to adjust each residual by its expected variability, a procedure called studentizing. "Error" is a misnomer; an error is the amount by which an observation differs from its expected value, the latter being based on the whole population from which the statistical unit

Then we have: the difference between the height of each man in the sample and the unobservable population mean is a statistical error, whereas the difference between the height of each man in the sample and the observable sample mean is a residual. In statistics and optimization, errors and residuals are two closely related and easily confused measures of the deviation of an observed value of an element of a statistical sample from its

Likewise, the sum of absolute errors (SAE) refers to the sum of the absolute values of the residuals, which is minimized in the least absolute deviations approach to regression. It is as if the measurement of the man's height were an attempt to measure the population average, so that any difference between the man's height and the average would be


Some think the two are the same thing, which is not surprising given how often textbooks use the words interchangeably. Assumption (1): We assume that the unobserved factors are normally distributed around the population regression function. This assumption is critical in OLS.
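A simulation makes this assumption concrete. The sketch below is a minimal illustration, assuming a hypothetical population regression function with intercept 1.0 and slope 3.0 and normally distributed unobserved factors `u`. Only in a simulation can the errors be inspected directly; with real data only the residuals are available.

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical population regression function: E[y | x] = beta0 + beta1 * x.
n = 200
beta0, beta1 = 1.0, 3.0
x = rng.uniform(-1.0, 1.0, size=n)

# Unobserved factors, normally distributed around the regression function.
u = rng.normal(loc=0.0, scale=0.5, size=n)
y = beta0 + beta1 * x + u

# OLS fit and residuals.
X = np.column_stack([np.ones(n), x])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
residuals = y - X @ beta_hat

# The residuals estimate the errors but do not equal them.
sse_errors = u @ u             # sum of squared errors (unobservable in practice)
ssr = residuals @ residuals    # sum of squared residuals (observable)
print("estimated coefficients:", beta_hat)
print("sum of squared errors:   ", sse_errors)
print("sum of squared residuals:", ssr)
```

Since OLS chooses the coefficients that minimize the sum of squared deviations, the sum of squared residuals can never exceed the sum of squared errors computed with the true coefficients.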

The sum of squares of the residuals, on the other hand, is observable. A statistical error (or disturbance) is the amount by which an observation differs from its expected value, the latter being based on the whole population from which the statistical unit was

