Semi-orthogonal matrix

From HandWiki
Short description: Linear algebra concept

In linear algebra, a semi-orthogonal matrix is a non-square matrix with real entries where: if the number of columns exceeds the number of rows, then the rows are orthonormal vectors; but if the number of rows exceeds the number of columns, then the columns are orthonormal vectors.


Properties

Let A be an m×n semi-orthogonal matrix.

  • Either ATA=I or AAT=I.[1][2][3]
  • A semi-orthogonal matrix is an isometry. This means that it preserves the norm either in row space, or column space.
  • A semi-orthogonal matrix always has full rank.
  • A square matrix is semi-orthogonal if and only if it is an orthogonal matrix.
  • A real matrix is semi-orthogonal if and only if its non-zero singular values are all equal to 1.
  • A semi-orthogonal matrix A is semi-unitary (either AA = I or AA = I) and either left-invertible or right-invertible (left-invertible if it has more rows than columns, otherwise right invertible).


Examples

Tall matrix (sub-isometry)

Consider the 3×2 matrix whose columns are orthonormal: A=(100100) Here, its columns are orthonormal. Therefore, it is semi-orthogonal, which is confirmed by: ATA=(100010)(100100)=(1001)=I2

Short matrix

Consider the 2×3 matrix whose rows are orthonormal: B=(100010) Here, its rows are orthonormal. Therefore, it is semi-orthogonal, which is confirmed by: BBT=(100010)(100100)=(1001)=I2

Non-example

The following 3×2 matrix has orthogonal, but not orthonormal, columns and is therefore not semi-orthogonal: C=(200100) The calculation confirms this: CTC=(200010)(200100)=(4001)I2

Proofs

Preservation of Norm

If a matrix A is tall or square (mn), its semi-orthogonality implies ATA=In. For any vector xn, A preserves its norm: Ax22=(Ax)T(Ax)=xTATAx=xTInx=x22 If a matrix A is short (m<n), it preserves the norm of vectors in its row space.

Justification for Full Rank

If ATA=In, then the columns of A are linearly independent, so the rank of A must be n. If AAT=Im, then the rows of A are linearly independent, so the rank of A must be m. In both cases, the matrix has full rank.

Singular Value Property

The statement is that a real matrix A is semi-orthogonal if and only if all of its non-zero singular values are 1.

This follows directly from the SVD, A=UΣVT.
() Assume A is semi-orthogonal. Then either ATA=I or AAT=I. The non-zero singular values of A are the square roots of the non-zero eigenvalues of both ATA and AAT. Since one of these "Gramian" matrices is an identity matrix, its eigenvalues are all 1. Thus, the non-zero singular values of A must be 1.
() Assume all non-zero singular values of A are 1. This forces the block of Σ containing the non-zero values to be an identity matrix. This structure ensures that either ΣTΣ=In (if A has full column rank) or ΣΣT=Im (if A has full row rank). Substituting this into the expressions for ATA=V(ΣTΣ)VT or AAT=U(ΣΣT)UT respectively shows that one of them must simplify to an identity matrix, satisfying the definition of a semi-orthogonal matrix.

References

  1. Abadir, K.M., Magnus, J.R. (2005). Matrix Algebra. Cambridge University Press.
  2. Zhang, Xian-Da. (2017). Matrix analysis and applications. Cambridge University Press.
  3. Povey, Daniel, et al. (2018). "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks." Interspeech.