Partial inverse of a matrix

From HandWiki

Definition

In linear algebra and statistics, the partial inverse of a matrix is an operation related to Gaussian elimination which has applications in numerical analysis, statistics and physics. It is also known by various authors as the principal pivot transform, or as the sweep, gyration, or exchange operator, represented by invk if restricted to blocks along the main diagonal, or by Y^k if considering the general case of any arbitrary block from the matrix.[1]

Given an n×n matrix A over a vector space V partitioned into blocks:

A=(A11A12A21A22)

If A11 is invertible, then the partial inverse of A around the pivot block A11 is created by inverting A11, putting the Schur complement A/A11 in place of A22, and adjusting the off-diagonal elements accordingly:[2]

inv1A=((A11)1(A11)1A12A21(A11)1A22A21(A11)1A12)

Conceptually, partial inversion corresponds to a rotation[3] of the graph of the matrix (X,AX)V×V, such that, for conformally-partitioned column matrices (x1,x2)T and (y1,y2)T:[2]

A(x1x2)=(y1y2)inv1(A)(y1x2)=(x1y2)

As defined this way, this operator is its own inverse: invk(invk(A))=A, and if the pivot block A11 is chosen to be the entire matrix, then the transform simply gives the matrix inverse A1. Note that some authors define a related operation (under one of the other names) which is not an inverse per se; particularly, one common definition instead has (invk)2(A)=A.

The transform is often presented as a pivot around a single non-zero element akk, in which case one has

[invk(A)]ij={1akki=j=kakjakki=k,jkaikakkik,j=kaijaikakjakkik,jk

Properties

Restricted Partial Inversion

Restricted partial inverses obey a number of nice properties:[4]

  • inversions around different blocks commute, so larger pivots may be built up from sequences of smaller ones.
  • partial inversion preserves the space of symmetric matrices.
  • partial inversion over a block of size 1 along the main diagonal of a matrix can be expressed in terms of a composition of an identity operator, a commutator operator and a special commutator that resembles the Yang-Baxter equation.[1]

General Partial Inversion

The general partial inversion group Z(n) of generators Y^k that act on objects of size n is a non-Abelian group, since it is a composition of the symmetric group S(n), which is non-Abelian, with the restricted partial inversion group V(n) (which is Abelian) .[1] Z(n) admits representations of the dihedral group and the unitary group.

Z(n)=V(n)×S(n)

Commutators

It was suggested in [1] that a composition Y^ikY^j only commutes if their row indices are different and their column indices are also different:

[Y^ik,Y^j]=0iffijandk.

The Yang-Baxter equation ABA=BAB occurs as a constraint[1] of generators of Z(n), by introducing the substitutions

A=Y^i1,i,B=Y^i+1,i.

Dihedral Group Representation

Compositions of matrix inverse along the main diagonal, y^1 and a generalized inverse along the secondary diagonal y^2 form a representation of the dihedral group, D(n):

y^1=k=1ninvk,y^2=k=1nY^k,nk+1,y^3:=y^1y^2=J^,[y^k,y^]=0,withk,{1,2,3},

where J^ is a similarity transformation made of the reverse permutation matrix.

In three dimensions, the dihedral group D(3) over a matrix M is represented in terms of the identity I^ operator and the generators y^1,y^2,y^3 as

I^M=M,y^1M=inv3inv2inv1M=Y^33Y^22Y^11M,y^2M=Y^31Y^22Y^13M,y^3M=(001010100)M(001010100).

Additional Properties

  • A sequence of partial inversions and matrix powers defines the graph operation of Renormalized Growth,[1] where the dimension of the matrix is kept constant, while the graph that represents it is interpreted to grow.

Applications

Physics

In S-matrix theory and pseudo-unitary quantum mechanics, partial inversion is used to map S-matrices to T-matrices, and, consequently, unitary matrices to Z-pseudo-unitary matrices.[1]

Numerical Analysis

Use of the partial inverse in numerical analysis is due to the fact that there is some flexibility in the choices of pivots, allowing for non-invertible elements to be avoided, and because the operation of rotation (of the graph of the pivoted matrix) has better numerical stability than the shearing operation which is implicitly performed by Gaussian elimination.[3] Use in statistics is due to the fact that the resulting matrix nicely decomposes into blocks which have useful meanings in the context of linear regression.[4]

References

  1. 1.0 1.1 1.2 1.3 1.4 1.5 1.6 Lima, Dennis; Al-Kuwari, Saif (2024). "Unitarization of pseudo-unitary quantum circuits in the S-matrix framework". Physica Scripta 99 (4): 045202. doi:10.1088/1402-4896/ad28f7. 
  2. 2.0 2.1 Tsatsomeros, M. J. (2000). Principal pivot transforms: properties and applications. Linear Algebra and its Applications, 307 (1-3), 151–165.
  3. 3.0 3.1 Sweeping a matrix rotates its graph,
  4. 4.0 4.1 Exceedingly Simple Principal Pivot Transforms