Laplace expansion

From HandWiki
Short description: Expression of a determinant in terms of minors


In linear algebra, the Laplace expansion, named after Pierre-Simon Laplace, also called cofactor expansion, is an expression of the determinant of an n × n-matrix B as a weighted sum of minors, which are the determinants of some (n − 1) × (n − 1)-submatrices of B. Specifically, for every i, the Laplace expansion along the ith row is the equality [math]\displaystyle{ \begin{align} \det(B)&= \sum_{j=1}^{n} (-1)^{i+j} b_{i,j} m_{i, j}, \end{align} }[/math] where [math]\displaystyle{ b_{i,j} }[/math] is the entry of the ith row and jth column of B, and [math]\displaystyle{ m_{i,j} }[/math] is the determinant of the submatrix obtained by removing the ith row and the jth column of B. Similarly, the Laplace expansion along the jth column is the equality [math]\displaystyle{ \begin{align} \det(B)&= \sum_{i=1}^{n} (-1)^{i+j} b_{i,j} m_{i, j}. \end{align} }[/math] (Each identity implies the other, since the determinants of a matrix and its transpose are the same.)

The coefficient [math]\displaystyle{ (-1)^{i+j} m_{i, j} }[/math] of [math]\displaystyle{ b_{i,j} }[/math] in the above sum is called the cofactor of [math]\displaystyle{ b_{i,j} }[/math] in B.

The Laplace expansion is often useful in proofs, as in, for example, allowing recursion on the size of matrices. It is also of didactic interest for its simplicity and as one of several ways to view and compute the determinant. For large matrices, it quickly becomes inefficient to compute when compared to Gaussian elimination.

Examples

Consider the matrix

[math]\displaystyle{ B = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{bmatrix}. }[/math]

The determinant of this matrix can be computed by using the Laplace expansion along any one of its rows or columns. For instance, an expansion along the first row yields:

[math]\displaystyle{ \begin{align} |B| & = 1 \cdot \begin{vmatrix} 5 & 6 \\ 8 & 9 \end{vmatrix} - 2 \cdot \begin{vmatrix} 4 & 6 \\ 7 & 9 \end{vmatrix} + 3 \cdot \begin{vmatrix} 4 & 5 \\ 7 & 8 \end{vmatrix} \\[5pt] & = 1 \cdot (-3) - 2 \cdot (-6) + 3 \cdot (-3) = 0. \end{align} }[/math]

Laplace expansion along the second column yields the same result:

[math]\displaystyle{ \begin{align} |B| & = -2 \cdot \begin{vmatrix} 4 & 6 \\ 7 & 9 \end{vmatrix} + 5 \cdot \begin{vmatrix} 1 & 3 \\ 7 & 9 \end{vmatrix} - 8 \cdot \begin{vmatrix} 1 & 3 \\ 4 & 6 \end{vmatrix} \\[5pt] & = -2 \cdot (-6) + 5 \cdot (-12) - 8 \cdot (-6) = 0. \end{align} }[/math]

It is easy to verify that the result is correct: the matrix is singular because the sum of its first and third column is twice the second column, and hence its determinant is zero.

Proof

Suppose [math]\displaystyle{ B }[/math] is an n × n matrix and [math]\displaystyle{ i,j\in\{1,2,\dots,n\}. }[/math] For clarity we also label the entries of [math]\displaystyle{ B }[/math] that compose its [math]\displaystyle{ i,j }[/math] minor matrix [math]\displaystyle{ M_{ij} }[/math] as

[math]\displaystyle{ (a_{st}) }[/math] for [math]\displaystyle{ 1 \le s,t \le n-1. }[/math]

Consider the terms in the expansion of [math]\displaystyle{ |B| }[/math] that have [math]\displaystyle{ b_{ij} }[/math] as a factor. Each has the form

[math]\displaystyle{ \sgn \tau\,b_{1,\tau(1)} \cdots b_{i,j} \cdots b_{n,\tau(n)} = \sgn \tau\,b_{ij} a_{1,\sigma(1)} \cdots a_{n-1,\sigma(n-1)} }[/math]

for some permutation τSn with [math]\displaystyle{ \tau(i)=j }[/math], and a unique and evidently related permutation [math]\displaystyle{ \sigma\in S_{n-1} }[/math] which selects the same minor entries as τ. Similarly each choice of σ determines a corresponding τ i.e. the correspondence [math]\displaystyle{ \sigma\leftrightarrow\tau }[/math] is a bijection between [math]\displaystyle{ S_{n-1} }[/math] and [math]\displaystyle{ \{\tau\in S_n\colon\tau(i)=j\}. }[/math] Using Cauchy's two-line notation, the explicit relation between [math]\displaystyle{ \tau }[/math] and [math]\displaystyle{ \sigma }[/math] can be written as

[math]\displaystyle{ \sigma = \begin{pmatrix} 1 & 2 & \cdots & i & \cdots & n-1 \\ (\leftarrow)_j ( \tau(1) ) & (\leftarrow)_j ( \tau(2) ) & \cdots & (\leftarrow)_j ( \tau(i+1) ) & \cdots & (\leftarrow)_j ( \tau(n) ) \end{pmatrix} }[/math]

where [math]\displaystyle{ (\leftarrow)_j }[/math] is a temporary shorthand notation for a cycle [math]\displaystyle{ (n,n-1,\cdots,j+1,j) }[/math]. This operation decrements all indices larger than j so that every index fits in the set {1,2,...,n-1}

The permutation τ can be derived from σ as follows. Define [math]\displaystyle{ \sigma'\in S_n }[/math] by [math]\displaystyle{ \sigma'(k) = \sigma(k) }[/math] for [math]\displaystyle{ 1 \le k \le n-1 }[/math] and [math]\displaystyle{ \sigma'(n) = n }[/math]. Then [math]\displaystyle{ \sigma' }[/math] is expressed as

[math]\displaystyle{ \sigma' = \begin{pmatrix} 1 & 2 & \cdots & i & \cdots & n-1 & n \\ (\leftarrow)_j ( \tau(1) ) & (\leftarrow)_j ( \tau(2) ) & \cdots & (\leftarrow)_j ( \tau(i+1) ) & \cdots & (\leftarrow)_j ( \tau(n) ) & n\end{pmatrix} }[/math]

Now, the operation which apply [math]\displaystyle{ (\leftarrow)_i }[/math] first and then apply [math]\displaystyle{ \sigma' }[/math] is (Notice applying A before B is equivalent to applying inverse of A to the upper row of B in two-line notation)

[math]\displaystyle{ \sigma' (\leftarrow)_i = \begin{pmatrix} 1 & 2 & \cdots & i+1 & \cdots & n & i \\ (\leftarrow)_j ( \tau(1) ) & (\leftarrow)_j ( \tau(2) ) & \cdots & (\leftarrow)_j ( \tau(i+1) ) & \cdots & (\leftarrow)_j ( \tau(n) ) & n \end{pmatrix} }[/math]

where [math]\displaystyle{ (\leftarrow)_i }[/math] is temporary shorthand notation for [math]\displaystyle{ (n,n-1,\cdots,i+1,i) }[/math].

the operation which applies [math]\displaystyle{ \tau }[/math] first and then applies [math]\displaystyle{ (\leftarrow)_j }[/math] is

[math]\displaystyle{ (\leftarrow)_j \tau = \begin{pmatrix} 1 & 2 & \cdots & i & \cdots & n-1 & n \\ (\leftarrow)_j ( \tau(1) ) & (\leftarrow)_j ( \tau(2) ) & \cdots & n & \cdots & (\leftarrow)_j ( \tau(n-1) ) & (\leftarrow)_j ( \tau(n) ) \end{pmatrix} }[/math]

above two are equal thus,

[math]\displaystyle{ (\leftarrow)_j \tau = \sigma' (\leftarrow)_i }[/math]
[math]\displaystyle{ \tau = (\rightarrow)_j \sigma' (\leftarrow)_i }[/math]

where [math]\displaystyle{ (\rightarrow)_j }[/math] is the inverse of [math]\displaystyle{ (\leftarrow)_j }[/math] which is [math]\displaystyle{ (j,j+1,\cdots,n) }[/math].

Thus

[math]\displaystyle{ \tau\,=(j,j+1,\ldots,n)\sigma'(n,n-1,\ldots,i) }[/math]

Since the two cycles can be written respectively as [math]\displaystyle{ n-i }[/math] and [math]\displaystyle{ n-j }[/math] transpositions,

[math]\displaystyle{ \sgn\tau\,= (-1)^{2n-(i+j)} \sgn\sigma'\,= (-1)^{i+j} \sgn\sigma. }[/math]

And since the map [math]\displaystyle{ \sigma\leftrightarrow\tau }[/math] is bijective,

[math]\displaystyle{ \begin{align} \sum_{i=1}^n\sum_{\tau \in S_n:\tau(i)=j} \sgn \tau\,b_{1,\tau(1)} \cdots b_{n,\tau(n)} &= \sum_{i=1}^{n}\sum_{\sigma \in S_{n-1}} (-1)^{i+j}\sgn\sigma\, b_{ij} a_{1,\sigma(1)} \cdots a_{n-1,\sigma(n-1)}\\ &= \sum_{i=1}^{n}b_{ij}(-1)^{i+j} \sum_{\sigma \in S_{n-1}} \sgn\sigma\, a_{1,\sigma(1)} \cdots a_{n-1,\sigma(n-1)}\\ &= \sum_{i=1}^{n} b_{ij} (-1)^{i+j} M_{ij} \end{align} }[/math]

from which the result follows. Similarly, the result holds if the index of the outer summation was replaced with [math]\displaystyle{ j }[/math].

Laplace expansion of a determinant by complementary minors

Laplace's cofactor expansion can be generalised as follows.

Example

Consider the matrix

[math]\displaystyle{ A = \begin{bmatrix} 1 & 2 & 3 & 4 \\ 5 & 6 & 7 & 8 \\ 9 & 10 & 11 & 12 \\ 13 & 14 & 15 & 16 \end{bmatrix}. }[/math]

The determinant of this matrix can be computed by using the Laplace's cofactor expansion along the first two rows as follows. Firstly note that there are 6 sets of two distinct numbers in {1, 2, 3, 4}, namely let [math]\displaystyle{ S=\left\{\{1,2\},\{1,3\},\{1,4\},\{2,3\},\{2,4\},\{3,4\}\right\} }[/math] be the aforementioned set.

By defining the complementary cofactors to be

[math]\displaystyle{ b_{\{j,k\}}=\begin{vmatrix} a_{1j} & a_{1k} \\ a_{2j} & a_{2k} \end{vmatrix}, }[/math]
[math]\displaystyle{ c_{\{p,q\}}=\begin{vmatrix} a_{3p} & a_{3q} \\ a_{4p} & a_{4q} \end{vmatrix}, }[/math]

and the sign of their permutation to be

[math]\displaystyle{ \varepsilon^{\{j,k\},\{p,q\}} = \sgn\begin{bmatrix} 1 & 2 & 3 & 4 \\ j & k & p & q \end{bmatrix}, \text{ where } p\neq j,q\neq k. }[/math]

The determinant of A can be written out as

[math]\displaystyle{ |A| = \sum_{H \in S} \varepsilon^{H,H^\prime}b_{H}c_{H^\prime}, }[/math]

where [math]\displaystyle{ H^{\prime} }[/math] is the complementary set to [math]\displaystyle{ H }[/math].

In our explicit example this gives us

[math]\displaystyle{ \begin{align} |A| &= b_{\{1,2\}}c_{\{3,4\}} -b_{\{1,3\}}c_{\{2,4\}} +b_{\{1,4\}}c_{\{2,3\}}+b_{\{2,3\}}c_{\{1,4\}} -b_{\{2,4\}}c_{\{1,3\}} +b_{\{3,4\}}c_{\{1,2\}} \\[5pt] &= \begin{vmatrix} 1 & 2 \\ 5 & 6 \end{vmatrix} \cdot \begin{vmatrix} 11 & 12 \\ 15 & 16 \end{vmatrix} - \begin{vmatrix} 1 & 3 \\ 5 & 7 \end{vmatrix} \cdot \begin{vmatrix} 10 & 12 \\ 14 & 16 \end{vmatrix} + \begin{vmatrix} 1 & 4 \\ 5 & 8 \end{vmatrix} \cdot \begin{vmatrix} 10 & 11 \\ 14 & 15 \end{vmatrix} + \begin{vmatrix} 2 & 3 \\ 6 & 7 \end{vmatrix} \cdot \begin{vmatrix} 9 & 12 \\ 13 & 16 \end{vmatrix} - \begin{vmatrix} 2 & 4 \\ 6 & 8 \end{vmatrix} \cdot \begin{vmatrix} 9 & 11 \\ 13 & 15 \end{vmatrix} + \begin{vmatrix} 3 & 4 \\ 7 & 8 \end{vmatrix} \cdot \begin{vmatrix} 9 & 10 \\ 13 & 14 \end{vmatrix}\\[5pt] &= -4 \cdot (-4) -(-8) \cdot (-8) +(-12) \cdot (-4) +(-4) \cdot (-12) -(-8) \cdot (-8) +(-4) \cdot (-4)\\[5pt] &= 16 - 64 + 48 + 48 - 64 + 16 = 0. \end{align} }[/math]

As above, it is easy to verify that the result is correct: the matrix is singular because the sum of its first and third column is twice the second column, and hence its determinant is zero.

General statement

Let [math]\displaystyle{ B=[b_{ij}] }[/math] be an n × n matrix and [math]\displaystyle{ S }[/math] the set of k-element subsets of {1, 2, ... , n}, [math]\displaystyle{ H }[/math] an element in it. Then the determinant of [math]\displaystyle{ B }[/math] can be expanded along the k rows identified by [math]\displaystyle{ H }[/math] as follows:

[math]\displaystyle{ |B| = \sum_{L\in S} \varepsilon^{H,L} b_{H,L} c_{H,L} }[/math]

where [math]\displaystyle{ \varepsilon^{H,L} }[/math] is the sign of the permutation determined by [math]\displaystyle{ H }[/math] and [math]\displaystyle{ L }[/math], equal to [math]\displaystyle{ (-1)^{\left(\sum_{h\in H}h\right) + \left(\sum_{\ell\in L}\ell\right)} }[/math], [math]\displaystyle{ b_{H,L} }[/math] the square minor of [math]\displaystyle{ B }[/math] obtained by deleting from [math]\displaystyle{ B }[/math] rows and columns with indices in [math]\displaystyle{ H }[/math] and [math]\displaystyle{ L }[/math] respectively, and [math]\displaystyle{ c_{H,L} }[/math] (called the complement of [math]\displaystyle{ b_{H,L} }[/math]) defined to be [math]\displaystyle{ b_{H',L'} }[/math] , [math]\displaystyle{ H' }[/math] and [math]\displaystyle{ L' }[/math] being the complement of [math]\displaystyle{ H }[/math] and [math]\displaystyle{ L }[/math] respectively.

This coincides with the theorem above when [math]\displaystyle{ k=1 }[/math]. The same thing holds for any fixed k columns.

Computational expense

The Laplace expansion is computationally inefficient for high-dimension matrices, with a time complexity in big O notation of O(n!). Alternatively, using a decomposition into triangular matrices as in the LU decomposition can yield determinants with a time complexity of O(n3).[1] The following Python code implements Laplace expansion recursively[citation needed]:

def determinant(M):
    # Base case of recursive function: 1x1 matrix
    if len(M) == 1: 
        return M[0][0]

    total = 0
    for column, element in enumerate(M[0]):
        # Exclude first row and current column.
        K = [x[:column] + x[column + 1 :] for x in M[1:]]
        s = 1 if column % 2 == 0 else -1 
        total += s * element * determinant(K)
    return total

See also

References

  1. Stoer Bulirsch: Introduction to Numerical Mathematics

de:Determinante#Laplacescher Entwicklungssatz