Normal equations
We consider the problem
, where A is an (m,n) matrix with
, rank(A) = n, b is an (m,1) vector, and x is the (n,1) vector to be determined.
The sign
stands for the least squares approximation, i.e. a minimization of the norm of the residual r = Ax - b
or the square
i.e. a differentiable function of x. The necessary condition for a minimum is:
These equations are called the normal equations , which become in our case:
The solution
is usually computed with the following algorithm:
First (the lower triangular portion of) the symmetric matrix
is computed, then its Cholesky decomposition
. Thereafter one solves
for y and finally x is computed from
.
Unfortunately
is often ill-conditioned and strongly influenced by roundoff errors (see Golub89). Other methods which do not compute ATA and solve
directly are QR decomposition and singular value decomposition.



