Augmented matrix

From HandWiki
Short description: Matrix formed by appending columns of two other matrices

In linear algebra, an augmented matrix [math]\displaystyle{ (A \vert B) }[/math] is a [math]\displaystyle{ k \times (n+1) }[/math] matrix obtained by appending a [math]\displaystyle{ k }[/math]-dimensional row vector [math]\displaystyle{ B }[/math], on the right, as a further column to a [math]\displaystyle{ k \times n }[/math]-dimensional matrix [math]\displaystyle{ A }[/math]. This is usually done for the purpose of performing the same elementary row operations on the augmented matrix [math]\displaystyle{ (A \vert B) }[/math] as is done on the original one [math]\displaystyle{ A }[/math] when solving a system of linear equations by Gaussian elimination.

For example, given the matrices [math]\displaystyle{ A }[/math] and column vector [math]\displaystyle{ B }[/math], where [math]\displaystyle{ A = \begin{bmatrix} 1 & 3 & 2 \\ 2 & 0 & 1 \\ 5 & 2 & 2 \end{bmatrix} , \quad B = \begin{bmatrix} 4 \\ 3 \\ 1 \end{bmatrix}, }[/math] the augmented matrix [math]\displaystyle{ (A \vert B) }[/math] is [math]\displaystyle{ (A|B) = \left[\begin{array}{ccc|c} 1 & 3 & 2 & 4 \\ 2 & 0 & 1 & 3 \\ 5 & 2 & 2 & 1 \end{array}\right]. }[/math]

For a given number [math]\displaystyle{ n }[/math] of unknowns, the number of solutions to a system of [math]\displaystyle{ k }[/math] linear equations depends only on the rank of the matrix of coefficients [math]\displaystyle{ A }[/math] representing the system and the rank of the corresponding augmented matrix [math]\displaystyle{ (A \vert B) }[/math] where the components of [math]\displaystyle{ B }[/math] consist of the right hand sides of the [math]\displaystyle{ k }[/math] successive linear equations. According to the Rouché–Capelli theorem, any system of linear equations

[math]\displaystyle{ A X = B }[/math]

where [math]\displaystyle{ X=(x_1, \dots, x_n)^T }[/math] is the [math]\displaystyle{ n }[/math]-component column vector whose entries are the unknowns of the system is inconsistent (has no solutions) if the rank of the augmented matrix [math]\displaystyle{ (A \vert B) }[/math] is greater than the rank of the coefficient matrix [math]\displaystyle{ A }[/math] . If, on the other hand, the ranks of these two matrices are equal, the system must have at least one solution. The solution is unique if and only if the rank equals the number of variables [math]\displaystyle{ n }[/math]. Otherwise the general solution has [math]\displaystyle{ j }[/math] free parameters where [math]\displaystyle{ j }[/math] is the difference between the number of variables [math]\displaystyle{ n }[/math] and the rank. In such a case there as an affine space of solutions of dimension equal to this difference.

The inverse of a nonsingular square matrix [math]\displaystyle{ A }[/math] of dimension [math]\displaystyle{ n\times n }[/math] may be found by appending the [math]\displaystyle{ n\times n }[/math] identity matrix [math]\displaystyle{ \mathbf{I} }[/math] to the right of [math]\displaystyle{ A }[/math] to form the [math]\displaystyle{ n \times 2n }[/math] dimensional augmented matrix [math]\displaystyle{ (A \vert \mathbf{I}) }[/math]. Applying elementary row operations to transform the left-hand [math]\displaystyle{ n\times n }[/math] block to the identity matrix [math]\displaystyle{ \mathbf{I} }[/math], the right-hand [math]\displaystyle{ n\times n }[/math] block is then the inverse matrix [math]\displaystyle{ A^{-1} }[/math]

Example of finding the inverse of a matrix

Let [math]\displaystyle{ A }[/math] be the square 2×2 matrix [math]\displaystyle{ A = \begin{bmatrix} 1 & 3 \\ -5 & 0 \end{bmatrix}. }[/math]

To find the inverse of [math]\displaystyle{ A }[/math] we form the augmented matrix [math]\displaystyle{ (A \vert \mathbf{I}_2) }[/math] where [math]\displaystyle{ \mathbf{I}_2 }[/math] is the [math]\displaystyle{ 2\times 2 }[/math] identity matrix. We then reduce the part of [math]\displaystyle{ (A \vert \mathbf{I}_2 }[/math] corresponding to [math]\displaystyle{ A }[/math] to the identity matrix using elementary row operations on [math]\displaystyle{ (A \vert \mathbf{I}_2) }[/math] . [math]\displaystyle{ (A \vert \mathbf{I}_2) = \left[\begin{array}{cc|cc} 1 & 3 & 1 & 0\\ -5 & 0 & 0 & 1 \end{array}\right] }[/math] [math]\displaystyle{ (I|A^{-1}) = \left[\begin{array}{cc|cc} 1 & 0 & 0 & -\frac{1}{5} \\ 0 & 1 & \frac{1}{3} & \frac{1}{15} \end{array}\right], }[/math] the right part of which is the inverse [math]\displaystyle{ A^{-1} }[/math].

Existence and number of solutions

Consider the system of equations [math]\displaystyle{ \begin{align} x + y + 2z &= 2 \\ x + y + z &= 3 \\ 2x + 2y + 2z &= 6. \end{align} }[/math]

The coefficient matrix is [math]\displaystyle{ A = \begin{bmatrix} 1 & 1 & 2 \\ 1 & 1 & 1 \\ 2 & 2 & 2 \\ \end{bmatrix}, }[/math] and the augmented matrix is [math]\displaystyle{ (A|B) = \left[\begin{array}{ccc|c} 1 & 1 & 2 & 2\\ 1 & 1 & 1 & 3 \\ 2 & 2 & 2 & 6 \end{array}\right]. }[/math]

Since both of these have the same rank, namely 2, there exists at least one solution; and since their rank is less than the number of unknowns, the latter being 3, there are an infinite number of solutions.

In contrast, consider the system [math]\displaystyle{ \begin{align} x + y + 2z &= 3 \\ x + y + z &= 1 \\ 2x + 2y + 2z &= 5. \end{align} }[/math]

The coefficient matrix is [math]\displaystyle{ A = \begin{bmatrix} 1 & 1 & 2 \\ 1 & 1 & 1 \\ 2 & 2 & 2 \\ \end{bmatrix}, }[/math] and the augmented matrix is [math]\displaystyle{ (A|B) = \left[\begin{array}{ccc|c} 1 & 1 & 2 & 3 \\ 1 & 1 & 1 & 1 \\ 2 & 2 & 2 & 5 \end{array}\right]. }[/math]

In this example the coefficient matrix has rank 2 while the augmented matrix has rank 3; so this system of equations has no solution. Indeed, an increase in the number of linearly independent rows has made the system of equations inconsistent.

Solution of a linear system

As used in linear algebra, an augmented matrix is used to represent the coefficients and the solution vector of each equation set. For the set of equations [math]\displaystyle{ \begin{align} x + 2y + 3z &= 0 \\ 3x + 4y + 7z &= 2 \\ 6x + 5y + 9z &= 11 \end{align} }[/math] the coefficients and constant terms give the matrices [math]\displaystyle{ A = \begin{bmatrix} 1 & 2 & 3 \\ 3 & 4 & 7 \\ 6 & 5 & 9 \end{bmatrix} , \quad B = \begin{bmatrix} 0 \\ 2 \\ 11 \end{bmatrix}, }[/math] and hence give the augmented matrix [math]\displaystyle{ (A|B) = \left[\begin{array}{ccc|c} 1 & 2 & 3 & 0 \\ 3 & 4 & 7 & 2 \\ 6 & 5 & 9 & 11 \end{array}\right]. }[/math]

Note that the rank of the coefficient matrix, which is 3, equals the rank of the augmented matrix, so at least one solution exists; and since this rank equals the number of unknowns, there is exactly one solution.

To obtain the solution, row operations can be performed on the augmented matrix to obtain the identity matrix on the left side, yielding [math]\displaystyle{ \left[\begin{array}{ccc|r} 1 & 0 & 0 & 4 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & -2 \\ \end{array}\right], }[/math] so the solution of the system is (x, y, z) = (4, 1, −2).

References

  • Marvin Marcus and Henryk Minc, A survey of matrix theory and matrix inequalities, Dover Publications, 1992, ISBN:0-486-67102-X. Page 31.