Gram matrix

Short description: Matrix of inner products of vectors

In linear algebra, the Gram matrix (or Gramian matrix, Gramian) of vectors $v_{1}, \dots, v_{n}$ in an inner product space is the Hermitian matrix of inner products, whose entries are given by the inner product $G_{i j} = ⟨ v_{i}, v_{j} ⟩$ .^[1] If the vectors $v_{1}, \dots, v_{n}$ are the columns of matrix $X$ then the Gram matrix is $X^{†} X$ in the general case that the vector coordinates are complex numbers, which simplifies to $X^{⊤} X$ for the case that the vector coordinates are real numbers.

An important application is to compute linear independence: a set of vectors are linearly independent if and only if the Gram determinant (the determinant of the Gram matrix) is non-zero.

It is named after Jørgen Pedersen Gram.

Examples

For finite-dimensional real vectors in $ℝ^{n}$ with the usual Euclidean dot product, the Gram matrix is $G = V^{⊤} V$ , where $V$ is a matrix whose columns are the vectors $v_{k}$ and $V^{⊤}$ is its transpose whose rows are the vectors $v_{k}^{⊤}$ . For complex vectors in $ℂ^{n}$ , $G = V^{†} V$ , where $V^{†}$ is the conjugate transpose of $V$ .

Given square-integrable functions ${ℓ_{i} (\cdot), i = 1, \dots, n}$ on the interval $[t_{0}, t_{f}]$ , the Gram matrix $G = [G_{i j}]$ is:

G_{i j} = \int_{t_{0}}^{t_{f}} ℓ_{i}^{*} (τ) ℓ_{j} (τ) d τ .

where $ℓ_{i}^{*} (τ)$ is the complex conjugate of $ℓ_{i} (τ)$ .

For any bilinear form $B$ on a finite-dimensional vector space over any field we can define a Gram matrix $G$ attached to a set of vectors $v_{1}, \dots, v_{n}$ by $G_{i j} = B (v_{i}, v_{j})$ . The matrix will be symmetric if the bilinear form $B$ is symmetric.

Applications

In Riemannian geometry, given an embedded $k$ -dimensional Riemannian manifold $M \subset ℝ^{n}$ and a parametrization $ϕ : U \to M$ for $(x_{1}, \dots, x_{k}) \in U \subset ℝ^{k}$ , the volume form $ω$ on $M$ induced by the embedding may be computed using the Gramian of the coordinate tangent vectors: $ω = \sqrt{\det G} d x_{1} \dots d x_{k}, G = [⟨ \frac{\partial ϕ}{\partial x_{i}}, \frac{\partial ϕ}{\partial x_{j}} ⟩] .$ This generalizes the classical surface integral of a parametrized surface $ϕ : U \to S \subset ℝ^{3}$ for $(x, y) \in U \subset ℝ^{2}$ : $\int_{S} f d A = \iint_{U} f (ϕ (x, y)) | \frac{\partial ϕ}{\partial x} \times \frac{\partial ϕ}{\partial y} | d x d y .$
If the vectors are centered random variables, the Gramian is approximately proportional to the covariance matrix, with the scaling determined by the number of elements in the vector.
In quantum chemistry, the Gram matrix of a set of basis vectors is the overlap matrix.
In control theory (or more generally systems theory), the controllability Gramian and observability Gramian determine properties of a linear system.
Gramian matrices arise in covariance structure model fitting (see e.g., Jamshidian and Bentler, 1993, Applied Psychological Measurement, Volume 18, pp. 79–94).
In the finite element method, the Gram matrix arises from approximating a function from a finite dimensional space; the Gram matrix entries are then the inner products of the basis functions of the finite dimensional subspace.
In machine learning, kernel functions are often represented as Gram matrices.^[2] (Also see kernel PCA)
Since the Gram matrix over the reals is a symmetric matrix, it is diagonalizable and its eigenvalues are non-negative. The diagonalization of the Gram matrix is the singular value decomposition.

Properties

Positive-semidefiniteness

The Gram matrix is symmetric in the case the inner product is real-valued; it is Hermitian in the general, complex case by definition of an inner product.

The Gram matrix is positive semidefinite, and every positive semidefinite matrix is the Gramian matrix for some set of vectors. The fact that the Gramian matrix is positive-semidefinite can be seen from the following simple derivation:

x^{†} 𝐆 x = \sum_{i, j} x_{i}^{*} x_{j} ⟨ v_{i}, v_{j} ⟩ = \sum_{i, j} ⟨ x_{i} v_{i}, x_{j} v_{j} ⟩ = ⟨ \sum_{i} x_{i} v_{i}, \sum_{j} x_{j} v_{j} ⟩ = ‖ \sum_{i} x_{i} v_{i} ‖^{2} \geq 0 .

The first equality follows from the definition of matrix multiplication, the second and third from the bi-linearity of the inner-product, and the last from the positive definiteness of the inner product. Note that this also shows that the Gramian matrix is positive definite if and only if the vectors $v_{i}$ are linearly independent (that is, $\sum_{i} x_{i} v_{i} \neq 0$ for all $x$ ).^[1]

Finding a vector realization

Given any positive semidefinite matrix $M$ , one can decompose it as:

M = B^{†} B

,

where $B^{†}$ is the conjugate transpose of $B$ (or $M = B^{T} B$ in the real case).

Here $B$ is a $k \times n$ matrix, where $k$ is the rank of $M$ . Various ways to obtain such a decomposition include computing the Cholesky decomposition or taking the non-negative square root of $M$ .

The columns $b^{(1)}, \dots, b^{(n)}$ of $B$ can be seen as n vectors in $ℂ^{k}$ (or k-dimensional Euclidean space $ℝ^{k}$ , in the real case). Then

M_{i j} = b^{(i)} \cdot b^{(j)}

where the dot product $a \cdot b = \sum_{ℓ = 1}^{k} a_{ℓ}^{*} b_{ℓ}$ is the usual inner product on $ℂ^{k}$ .

Thus a Hermitian matrix $M$ is positive semidefinite if and only if it is the Gram matrix of some vectors $b^{(1)}, \dots, b^{(n)}$ . Such vectors are called a vector realization of $M$ . The infinite-dimensional analog of this statement is Mercer's theorem.

Uniqueness of vector realizations

If $M$ is the Gram matrix of vectors $v_{1}, \dots, v_{n}$ in $ℝ^{k}$ then applying any rotation or reflection of $ℝ^{k}$ (any orthogonal transformation, that is, any Euclidean isometry preserving 0) to the sequence of vectors results in the same Gram matrix. That is, for any $k \times k$ orthogonal matrix $Q$ , the Gram matrix of $Q v_{1}, \dots, Q v_{n}$ is also $M$ .

This is the only way in which two real vector realizations of $M$ can differ: the vectors $v_{1}, \dots, v_{n}$ are unique up to orthogonal transformations. In other words, the dot products $v_{i} \cdot v_{j}$ and $w_{i} \cdot w_{j}$ are equal if and only if some rigid transformation of $ℝ^{k}$ transforms the vectors $v_{1}, \dots, v_{n}$ to $w_{1}, \dots, w_{n}$ and 0 to 0.

The same holds in the complex case, with unitary transformations in place of orthogonal ones. That is, if the Gram matrix of vectors $v_{1}, \dots, v_{n}$ is equal to the Gram matrix of vectors $w_{1}, \dots, w_{n}$ in $ℂ^{k}$ then there is a unitary $k \times k$ matrix $U$ (meaning $U^{†} U = I$ ) such that $v_{i} = U w_{i}$ for $i = 1, \dots, n$ .^[3]

Other properties

Because $G = G^{†}$ , it is necessarily the case that $G$ and $G^{†}$ commute. That is, a real or complex Gram matrix $G$ is also a normal matrix.
The Gram matrix of any orthonormal basis is the identity matrix. Equivalently, the Gram matrix of the rows or the columns of a real rotation matrix is the identity matrix. Likewise, the Gram matrix of the rows or columns of a unitary matrix is the identity matrix.
The rank of the Gram matrix of vectors in $ℝ^{k}$ or $ℂ^{k}$ equals the dimension of the space spanned by these vectors.^[1]

Gram determinant

The Gram determinant or Gramian is the determinant of the Gram matrix: $| G (v_{1}, \dots, v_{n}) | = | \begin{matrix} ⟨ v_{1}, v_{1} ⟩ & ⟨ v_{1}, v_{2} ⟩ & \dots & ⟨ v_{1}, v_{n} ⟩ \\ ⟨ v_{2}, v_{1} ⟩ & ⟨ v_{2}, v_{2} ⟩ & \dots & ⟨ v_{2}, v_{n} ⟩ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ⟨ v_{n}, v_{1} ⟩ & ⟨ v_{n}, v_{2} ⟩ & \dots & ⟨ v_{n}, v_{n} ⟩ \end{matrix} | .$

If $v_{1}, \dots, v_{n}$ are vectors in $ℝ^{m}$ then it is the square of the n-dimensional volume of the parallelotope formed by the vectors. In particular, the vectors are linearly independent if and only if the parallelotope has nonzero n-dimensional volume, if and only if Gram determinant is nonzero, if and only if the Gram matrix is nonsingular. When n > m the determinant and volume are zero. When n = m, this reduces to the standard theorem that the absolute value of the determinant of n n-dimensional vectors is the n-dimensional volume. The volume of the simplex formed by the vectors is $Volume(parallelotope) / n!$ .

When $v_{1}, \dots, v_{n}$ are linearly independent, the distance between a point $x$ and the linear span of $v_{1}, \dots, v_{n}$ is $\sqrt{\frac{| G (x, v_{1}, \dots, v_{n}) |}{| G (v_{1}, \dots, v_{n}) |}}$ .

Consider the moment problem: given $c_{1}, \dots, c_{n} \in ℂ$ , find a vector $v$ such that $⟨ v, v_{i} ⟩ = c_{i}$ , for all $1 ⩽ i ⩽ n$ . There exists a unique solution with minimal norm:^[4]Template:Pg $v = - \frac{1}{G (v_{1}, v_{2}, \dots, v_{n})} \det [\begin{matrix} 0 & c_{1} & c_{2} & \dots & c_{n} \\ v_{1} & ⟨ v_{1}, v_{1} ⟩ & ⟨ v_{1}, v_{2} ⟩ & \dots & ⟨ v_{1}, v_{n} ⟩ \\ v_{2} & ⟨ v_{2}, v_{1} ⟩ & ⟨ v_{2}, v_{2} ⟩ & \dots & ⟨ v_{2}, v_{n} ⟩ \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ v_{n} & ⟨ v_{n}, v_{1} ⟩ & ⟨ v_{n}, v_{2} ⟩ & \dots & ⟨ v_{n}, v_{n} ⟩ \end{matrix}]$ The Gram determinant can also be expressed in terms of the exterior product of vectors by

| G (v_{1}, \dots, v_{n}) | = ‖ v_{1} \land \dots \land v_{n} ‖^{2} .

The Gram determinant therefore supplies an inner product for the space $⋀^{n} (V)$ . If an orthonormal basis e_i, i = 1, 2, ..., n on $V$ is given, the vectors

e_{i_{1}} \land \dots \land e_{i_{n}}, i_{1} < \dots < i_{n},

will constitute an orthonormal basis of n-dimensional volumes on the space $⋀^{n} (V)$ . Then the Gram determinant $| G (v_{1}, \dots, v_{n}) |$ amounts to an n-dimensional Pythagorean Theorem for the volume of the parallelotope formed by the vectors $v_{1} \land \dots \land v_{n}$ in terms of its projections onto the basis volumes $e_{i_{1}} \land \dots \land e_{i_{n}}$ .

When the vectors $v_{1}, \dots, v_{n} \in ℝ^{m}$ are defined from the positions of points $p_{1}, \dots, p_{n}$ relative to some reference point $p_{n + 1}$ ,

(v_{1}, v_{2}, \dots, v_{n}) = (p_{1} - p_{n + 1}, p_{2} - p_{n + 1}, \dots, p_{n} - p_{n + 1}),

then the Gram determinant can be written as the difference of two Gram determinants,

| G (v_{1}, \dots, v_{n}) | = | G ((p_{1}, 1), \dots, (p_{n + 1}, 1)) | - | G (p_{1}, \dots, p_{n + 1}) |,

where each $(p_{j}, 1)$ is the corresponding point $p_{j}$ supplemented with the coordinate value of 1 for an $(m + 1)$ -st dimension. Note that in the common case that $n = m$ , the second term on the right-hand side will be zero.

Constructing an orthonormal basis

Given a set of linearly independent vectors ${v_{i}}$ with Gram matrix $G$ defined by $G_{i j} : = ⟨ v_{i}, v_{j} ⟩$ , one can construct an orthonormal basis

u_{i} : = \sum_{j} (G^{- 1 / 2})_{j i} v_{j} .

In matrix notation, $U = V G^{- 1 / 2}$ , where $U$ has orthonormal basis vectors ${u_{i}}$ and the matrix $V$ is composed of the given column vectors ${v_{i}}$ .

The matrix $G^{- 1 / 2}$ is guaranteed to exist. Indeed, $G$ is Hermitian, and so can be decomposed as $G = U D U^{†}$ with $U$ a unitary matrix and $D$ a real diagonal matrix. Additionally, the $v_{i}$ are linearly independent if and only if $G$ is positive definite, which implies that the diagonal entries of $D$ are positive. $G^{- 1 / 2}$ is therefore uniquely defined by $G^{- 1 / 2} : = U D^{- 1 / 2} U^{†}$ . One can check that these new vectors are orthonormal:

\begin{aligned} ⟨ u_{i}, u_{j} ⟩ & = \sum_{i^{'}} \sum_{j^{'}} ⟨ (G^{- 1 / 2})_{i^{'} i} v_{i^{'}}, (G^{- 1 / 2})_{j^{'} j} v_{j^{'}} ⟩ \\ = \sum_{i^{'}} \sum_{j^{'}} (G^{- 1 / 2})_{i i^{'}} G_{i^{'} j^{'}} (G^{- 1 / 2})_{j^{'} j} \\ = (G^{- 1 / 2} G G^{- 1 / 2})_{i j} = δ_{i j} \end{aligned}

where we used $(G^{- 1 / 2})^{†} = G^{- 1 / 2}$ .

References

↑ ^1.0 ^1.1 ^1.2 Horn & Johnson 2013, p. 441, p.441, Theorem 7.2.10
↑ Lanckriet, G. R. G.; Cristianini, N.; Bartlett, P.; Ghaoui, L. E.; Jordan, M. I. (2004). "Learning the kernel matrix with semidefinite programming". Journal of Machine Learning Research 5: 27–72 [p. 29]. https://dl.acm.org/citation.cfm?id=894170.
↑ (Horn Johnson), p. 452, Theorem 7.3.11
↑ Ramon, Garcia, Stephan; Javad, Mashreghi; T., Ross, William (2023-01-30). "Operator Theory by Example" (in en). OUP Academic. doi:10.1093/oso/9780192863867.001.0001. https://academic.oup.com/book/45766.

Horn, Roger A.; Johnson, Charles R. (2013). Matrix Analysis (2nd ed.). Cambridge University Press. ISBN 978-0-521-54823-6.

External links

Hazewinkel, Michiel, ed. (2001), "Gram matrix", Encyclopedia of Mathematics, Springer Science+Business Media B.V. / Kluwer Academic Publishers, ISBN 978-1-55608-010-4, https://www.encyclopediaofmath.org/index.php?title=p/g044750
Volumes of parallelograms by Frank Jones

fr:Matrice de Gram

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Gram matrix. Read more

[HJ-7.2.10-1] 1.0 ^1.1 ^1.2 Horn & Johnson 2013, p. 441, p.441, Theorem 7.2.10

[2] Lanckriet, G. R. G.; Cristianini, N.; Bartlett, P.; Ghaoui, L. E.; Jordan, M. I. (2004). "Learning the kernel matrix with semidefinite programming". Journal of Machine Learning Research 5: 27–72 [p. 29]. https://dl.acm.org/citation.cfm?id=894170.

[3] (Horn Johnson), p. 452, Theorem 7.3.11

[4] Ramon, Garcia, Stephan; Javad, Mashreghi; T., Ross, William (2023-01-30). "Operator Theory by Example" (in en). OUP Academic. doi:10.1093/oso/9780192863867.001.0001. https://academic.oup.com/book/45766.

[1]

[2]

[3]

[4]

v t e Matrix classes
Explicitly constrained entries	(0,1) Alternant Anti-diagonal Anti-Hermitian Anti-symmetric Arrowhead Band Bidiagonal Binary Bisymmetric Block-diagonal Block Block tridiagonal Boolean Cauchy Centrosymmetric Conference Complex Hadamard Copositive Diagonally dominant Diagonal Discrete Fourier Transform Elementary Equivalent Frobenius Generalized permutation Hadamard Hankel Hermitian Hessenberg Hollow Integer Logical Markov Metzler Monomial Moore Nonnegative Partitioned Parisi Pentadiagonal Permutation Persymmetric Polynomial Positive Quaternionic Sign Signature Skew-Hermitian Skew-symmetric Skyline Sparse Sylvester Symmetric Toeplitz Triangular Tridiagonal Unitary Vandermonde Walsh Z
Constant	Exchange Hilbert Identity Lehmer Of ones Pascal Pauli Redheffer Shift Zero
Conditions on eigenvalues or eigenvectors	Companion Convergent Defective Diagonalizable Hurwitz Positive-definite Stability Stieltjes
Satisfying conditions on products or inverses	Congruent Idempotent or Projection Invertible Involutory Nilpotent Normal Orthogonal Orthonormal Singular Unimodular Unipotent Totally unimodular Weighing
With specific applications	Adjugate Alternating sign Augmented Bézout Carleman Cartan Circulant Cofactor Commutation Confusion Coxeter Derogatory Distance Duplication Elimination Euclidean distance Fundamental (linear differential equation) Generator Gramian Hessian Householder Jacobian Moment Payoff Pick Random Rotation Seifert Shear Similarity Symplectic Totally positive Transformation Wedderburn X–Y–Z
Used in statistics	Bernoulli Centering Correlation Covariance Design Dispersion Doubly stochastic Fisher information Hat Precision Stochastic Transition
Used in graph theory	Adjacency Biadjacency Degree Edmonds Incidence Laplacian Seidel adjacency Skew-adjacency Tutte
Used in science and engineering	Cabibbo–Kobayashi–Maskawa Density Fundamental (computer vision) Fuzzy associative Gamma Gell-Mann Hamiltonian Irregular Overlap S State transition Substitution Z (chemistry)
Related terms	Jordan canonical form Linear independence Matrix exponential Matrix representation of conic sections Perfect matrix Pseudoinverse Quaternionic matrix Row echelon form Wronskian
List of matrices Category:Matrices

Anonymous

Search

Gram matrix

Namespaces

More

Page actions

Contents

Examples

Applications

Properties

Positive-semidefiniteness

Finding a vector realization

Uniqueness of vector realizations

Other properties

Gram determinant

Constructing an orthonormal basis

See also

References

External links

Navigation

Navigation

Resources

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Gram matrix

Examples

Applications

Properties

Positive-semidefiniteness

Finding a vector realization

Uniqueness of vector realizations

Other properties

Gram determinant

Constructing an orthonormal basis

See also

References

External links

Navigation

Wiki tools

Page tools

Other projects

Categories