Numerical linear algebra

From HandWiki
Revision as of 20:10, 6 February 2024 by Steve Marsio (talk | contribs) (fix)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Field of mathematics

Numerical linear algebra, sometimes called applied linear algebra, is the study of how matrix operations can be used to create computer algorithms which efficiently and accurately provide approximate answers to questions in continuous mathematics. It is a subfield of numerical analysis, and a type of linear algebra. Computers use floating-point arithmetic and cannot exactly represent irrational data, so when a computer algorithm is applied to a matrix of data, it can sometimes increase the difference between a number stored in the computer and the true number that it is an approximation of. Numerical linear algebra uses properties of vectors and matrices to develop computer algorithms that minimize the error introduced by the computer, and is also concerned with ensuring that the algorithm is as efficient as possible.

Numerical linear algebra aims to solve problems of continuous mathematics using finite precision computers, so its applications to the natural and social sciences are as vast as the applications of continuous mathematics. It is often a fundamental part of engineering and computational science problems, such as image and signal processing, telecommunication, computational finance, materials science simulations, structural biology, data mining, bioinformatics, and fluid dynamics. Matrix methods are particularly used in finite difference methods, finite element methods, and the modeling of differential equations. Noting the broad applications of numerical linear algebra, Lloyd N. Trefethen and David Bau, III argue that it is "as fundamental to the mathematical sciences as calculus and differential equations",[1]:{{{1}}} even though it is a comparatively small field.[2] Because many properties of matrices and vectors also apply to functions and operators, numerical linear algebra can also be viewed as a type of functional analysis which has a particular emphasis on practical algorithms.[1]:{{{1}}}

Common problems in numerical linear algebra include obtaining matrix decompositions like the singular value decomposition, the QR factorization, the LU factorization, or the eigendecomposition, which can then be used to answer common linear algebraic problems like solving linear systems of equations, locating eigenvalues, or least squares optimisation. Numerical linear algebra's central concern with developing algorithms that do not introduce errors when applied to real data on a finite precision computer is often achieved by iterative methods rather than direct ones.

History

Numerical linear algebra was developed by computer pioneers like John von Neumann, Alan Turing, James H. Wilkinson, Alston Scott Householder, George Forsythe, and Heinz Rutishauser, in order to apply the earliest computers to problems in continuous mathematics, such as ballistics problems and the solutions to systems of partial differential equations.[2] The first serious attempt to minimize computer error in the application of algorithms to real data is John von Neumann and Herman Goldstine's work in 1947.[3] The field has grown as technology has increasingly enabled researchers to solve complex problems on extremely large high-precision matrices, and some numerical algorithms have grown in prominence as technologies like parallel computing have made them practical approaches to scientific problems.[2]

Matrix decompositions

Partitioned matrices

Main page: Block matrix

For many problems in applied linear algebra, it is useful to adopt the perspective of a matrix as being a concatenation of column vectors. For example, when solving the linear system [math]\displaystyle{ x = A^{-1}b }[/math], rather than understanding x as the product of [math]\displaystyle{ A^{-1} }[/math] with b, it is helpful to think of x as the vector of coefficients in the linear expansion of b in the basis formed by the columns of A.[1]:{{{1}}} Thinking of matrices as a concatenation of columns is also a practical approach for the purposes of matrix algorithms. This is because matrix algorithms frequently contain two nested loops: one over the columns of a matrix A, and another over the rows of A. For example, for matrices [math]\displaystyle{ A^{m \times n} }[/math] and vectors [math]\displaystyle{ x^{n \times 1} }[/math] and [math]\displaystyle{ y^{m \times 1} }[/math], we could use the column partitioning perspective to compute y := Ax + y as

for q = 1:n
  for p = 1:m
    y(p) = A(p,q)*x(q) + y(p)
  end
end

Singular value decomposition

Main page: Singular value decomposition

The singular value decomposition of a matrix [math]\displaystyle{ A^{m \times n} }[/math] is [math]\displaystyle{ A = U \Sigma V^\ast }[/math] where U and V are unitary, and [math]\displaystyle{ \Sigma }[/math] is diagonal. The diagonal entries of [math]\displaystyle{ \Sigma }[/math] are called the singular values of A. Because singular values are the square roots of the eigenvalues of [math]\displaystyle{ AA^\ast }[/math], there is a tight connection between the singular value decomposition and eigenvalue decompositions. This means that most methods for computing the singular value decomposition are similar to eigenvalue methods;[1]:{{{1}}} perhaps the most common method involves Householder procedures.[1]:{{{1}}}

QR factorization

Main page: QR decomposition

The QR factorization of a matrix [math]\displaystyle{ A^{m \times n} }[/math] is a matrix [math]\displaystyle{ Q^{m \times m} }[/math] and a matrix [math]\displaystyle{ R^{m \times n} }[/math] so that A = QR, where Q is orthogonal and R is upper triangular.[1]:{{{1}}}[4]:{{{1}}} The two main algorithms for computing QR factorizations are the Gram–Schmidt process and the Householder transformation. The QR factorization is often used to solve linear least-squares problems, and eigenvalue problems (by way of the iterative QR algorithm).

LU factorization

Main page: LU decomposition

An LU factorization of a matrix A consists of a lower triangular matrix L and an upper triangular matrix U so that A = LU. The matrix U is found by an upper triangularization procedure which involves left-multiplying A by a series of matrices [math]\displaystyle{ M_1,\ldots,M_{n-1} }[/math] to form the product [math]\displaystyle{ M_{n-1} \cdots M_1 A = U }[/math], so that equivalently [math]\displaystyle{ L = M_1^{-1} \cdots M_{n-1}^{-1} }[/math].[1]:{{{1}}}[4]:{{{1}}}

Eigenvalue decomposition

Main page: Eigendecomposition of a matrix

The eigenvalue decomposition of a matrix [math]\displaystyle{ A^{m \times m} }[/math] is [math]\displaystyle{ A = X \Lambda X^{-1} }[/math], where the columns of X are the eigenvectors of A, and [math]\displaystyle{ \Lambda }[/math] is a diagonal matrix the diagonal entries of which are the corresponding eigenvalues of A.[1]:{{{1}}} There is no direct method for finding the eigenvalue decomposition of an arbitrary matrix. Because it is not possible to write a program that finds the exact roots of an arbitrary polynomial in finite time, any general eigenvalue solver must necessarily be iterative.[1]:{{{1}}}

Algorithms

Gaussian elimination

Main page: Gaussian elimination

From the numerical linear algebra perspective, Gaussian elimination is a procedure for factoring a matrix A into its LU factorization, which Gaussian elimination accomplishes by left-multiplying A by a succession of matrices [math]\displaystyle{ L_{m-1} \cdots L_2 L_1 A = U }[/math] until U is upper triangular and L is lower triangular, where [math]\displaystyle{ L \equiv L_1^{-1}L_2^{-1} \cdots L_{m-1}^{-1} }[/math].[1]:{{{1}}} Naive programs for Gaussian elimination are notoriously highly unstable, and produce huge errors when applied to matrices with many significant digits.[2] The simplest solution is to introduce pivoting, which produces a modified Gaussian elimination algorithm that is stable.[1]:{{{1}}}

Solutions of linear systems

Numerical linear algebra characteristically approaches matrices as a concatenation of columns vectors. In order to solve the linear system [math]\displaystyle{ x = A^{-1}b }[/math], the traditional algebraic approach is to understand x as the product of [math]\displaystyle{ A^{-1} }[/math] with b. Numerical linear algebra instead interprets x as the vector of coefficients of the linear expansion of b in the basis formed by the columns of A.[1]:{{{1}}}

Many different decompositions can be used to solve the linear problem, depending on the characteristics of the matrix A and the vectors x and b, which may make one factorization much easier to obtain than others. If A = QR is a QR factorization of A, then equivalently [math]\displaystyle{ Rx = Q^\ast b }[/math]. This is as easy to compute as a matrix factorization.[1]:{{{1}}} If [math]\displaystyle{ A = X \Lambda X^{-1} }[/math] is an eigendecomposition A, and we seek to find b so that b = Ax, with [math]\displaystyle{ b' = X^{-1}b }[/math] and [math]\displaystyle{ x' = X^{-1}x }[/math], then we have [math]\displaystyle{ b' = \Lambda x' }[/math].[1]:{{{1}}} This is closely related to the solution to the linear system using the singular value decomposition, because singular values of a matrix are the absolute values of its eigenvalues, which are also equivalent to the square roots of the absolute values of the eigenvalues of the Gram matrix [math]\displaystyle{ X^{*} X }[/math]. And if A = LU is an LU factorization of A, then Ax = b can be solved using the triangular matrices Ly = b and Ux = y.[1]:{{{1}}}[4]:{{{1}}}

Least squares optimisation

Main page: Numerical methods for linear least squares

Matrix decompositions suggest a number of ways to solve the linear system r = bAx where we seek to minimize r, as in the regression problem. The QR algorithm solves this problem by computing the reduced QR factorization of A and rearranging to obtain [math]\displaystyle{ \widehat{R}x = \widehat{Q}^\ast b }[/math]. This upper triangular system can then be solved for x. The SVD also suggests an algorithm for obtaining linear least squares. By computing the reduced SVD decomposition [math]\displaystyle{ A = \widehat{U}\widehat{\Sigma}V^\ast }[/math] and then computing the vector [math]\displaystyle{ \widehat{U}^\ast b }[/math], we reduce the least squares problem to a simple diagonal system.[1]:{{{1}}} The fact that least squares solutions can be produced by the QR and SVD factorizations means that, in addition to the classical normal equations method for solving least squares problems, these problems can also be solved by methods that include the Gram-Schmidt algorithm and Householder methods.

Conditioning and stability

Allow that a problem is a function [math]\displaystyle{ f: X \to Y }[/math], where X is a normed vector space of data and Y is a normed vector space of solutions. For some data point [math]\displaystyle{ x \in X }[/math], the problem is said to be ill-conditioned if a small perturbation in x produces a large change in the value of f(x). We can quantify this by defining a condition number which represents how well-conditioned a problem is, defined as [math]\displaystyle{ \widehat{\kappa} = \lim_{\delta \to 0} \sup_{\| \delta x \| \leq \delta} \frac{\| \delta f \|}{\| \delta x \|}. }[/math]

Instability is the tendency of computer algorithms, which depend on floating-point arithmetic, to produce results that differ dramatically from the exact mathematical solution to a problem. When a matrix contains real data with many significant digits, many algorithms for solving problems like linear systems of equation or least squares optimisation may produce highly inaccurate results. Creating stable algorithms for ill-conditioned problems is a central concern in numerical linear algebra. One example is that the stability of householder triangularization makes it a particularly robust solution method for linear systems, whereas the instability of the normal equations method for solving least squares problems is a reason to favour matrix decomposition methods like using the singular value decomposition. Some matrix decomposition methods may be unstable, but have straightforward modifications that make them stable; one example is the unstable Gram–Schmidt, which can easily be changed to produce the stable modified Gram–Schmidt.[1]:{{{1}}} Another classical problem in numerical linear algebra is the finding that Gaussian elimination is unstable, but becomes stable with the introduction of pivoting.

Iterative methods

There are two reasons that iterative algorithms are an important part of numerical linear algebra. First, many important numerical problems have no direct solution; in order to find the eigenvalues and eigenvectors of an arbitrary matrix, we can only adopt an iterative approach. Second, noniterative algorithms for an arbitrary [math]\displaystyle{ m \times m }[/math] matrix require [math]\displaystyle{ O(m^3) }[/math] time, which is a surprisingly high floor given that matrices contain only [math]\displaystyle{ m^2 }[/math] numbers. Iterative approaches can take advantage of several features of some matrices to reduce this time. For example, when a matrix is sparse, an iterative algorithm can skip many of the steps that a direct approach would necessarily follow, even if they are redundant steps given a highly structured matrix.

The core of many iterative methods in numerical linear algebra is the projection of a matrix onto a lower dimensional Krylov subspace, which allows features of a high-dimensional matrix to be approximated by iteratively computing the equivalent features of similar matrices starting in a low dimension space and moving to successively higher dimensions. When A is symmetric and we wish to solve the linear problem Ax = b, the classical iterative approach is the conjugate gradient method. If A is not symmetric, then examples of iterative solutions to the linear problem are the generalized minimal residual method and CGN. If A is symmetric, then to solve the eigenvalue and eigenvector problem we can use the Lanczos algorithm, and if A is non-symmetric, then we can use Arnoldi iteration.

Software

Main page: Software:List of numerical analysis software

Several programming languages use numerical linear algebra optimisation techniques and are designed to implement numerical linear algebra algorithms. These languages include MATLAB, Analytica, Maple, and Mathematica. Other programming languages which are not explicitly designed for numerical linear algebra have libraries that provide numerical linear algebra routines and optimisation; C and Fortran have packages like Basic Linear Algebra Subprograms and LAPACK, python has the library NumPy, and Perl has the Perl Data Language. Many numerical linear algebra commands in R rely on these more fundamental libraries like LAPACK.[5] More libraries can be found on the List of numerical libraries.

References

  1. 1.00 1.01 1.02 1.03 1.04 1.05 1.06 1.07 1.08 1.09 1.10 1.11 1.12 1.13 1.14 1.15 1.16 Trefethen, Lloyd; Bau III, David (1997). Numerical Linear Algebra (1st ed.). Philadelphia: SIAM. ISBN 978-0-89871-361-9. 
  2. 2.0 2.1 2.2 2.3 Golub, Gene H.. "A History of Modern Numerical Linear Algebra". https://www.stat.uchicago.edu/~lekheng/courses/302/slides0.pdf. 
  3. von Neumann, John; Goldstine, Herman H. (1947). "Numerical inverting of matrices of high order". Bulletin of the American Mathematical Society 53 (11): 1021–1099. doi:10.1090/s0002-9904-1947-08909-6. 
  4. 4.0 4.1 4.2 Golub, Gene H.; Van Loan, Charles F. (1996). Matrix Computations (3rd ed.). Baltimore: The Johns Hopkins University Press. ISBN 0-8018-5413-X. 
  5. Rickert, Joseph (August 29, 2013). "R and Linear Algebra". https://www.r-bloggers.com/r-and-linear-algebra/. 

Further reading

  • Dongarra, Jack; Hammarling, Sven (1990). "Evolution of Numerical Software for Dense Linear Algebra". in Cox, M. G.; Hammarling, S.. Reliable Numerical Computation. Oxford: Clarendon Press. pp. 297–327. ISBN 0-19-853564-3. 

External links