Min-max theorem

Short description: Variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces

In linear algebra and functional analysis, the min-max theorem, or variational theorem, or Courant–Fischer–Weyl min-max principle, is a result that gives a variational characterization of eigenvalues of compact Hermitian operators on Hilbert spaces. It can be viewed as the starting point of many results of similar nature.

This article first discusses the finite-dimensional case and its applications before considering compact operators on infinite-dimensional Hilbert spaces. For compact operators, the proof of the main theorem uses essentially the same idea as the finite-dimensional argument.

In the case that the operator is non-Hermitian, the theorem provides an equivalent characterization of the associated singular values. The min-max theorem can be extended to self-adjoint operators that are bounded below.

Matrices

Let $A$ be an $n \times n$ Hermitian matrix. As with many other variational results on eigenvalues, one considers the Rayleigh–Ritz quotient $R_A : \mathbb{C}^n \setminus \{0\} \to \mathbb{R}$ defined by

[math]\displaystyle{ R_A(x) = \frac{(Ax, x)}{(x,x)} }[/math]

where $(\cdot, \cdot)$ denotes the Euclidean inner product on $\mathbb{C}^n$. Clearly, the Rayleigh quotient of an eigenvector is its associated eigenvalue. Equivalently, the Rayleigh–Ritz quotient can be replaced by

[math]\displaystyle{ f(x) = (Ax, x), \; \|x\| = 1. }[/math]

For Hermitian matrices A, the range of the continuous function R_A(x), or f(x), is a compact interval [a, b] of the real line. The maximum b and the minimum a are the largest and smallest eigenvalue of A, respectively. The min-max theorem is a refinement of this fact.
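The claim about the range of the Rayleigh quotient can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names `rayleigh_quotient` and `random_hermitian` are illustrative and not part of the original article.

```python
import numpy as np

def rayleigh_quotient(A, x):
    """Return (Ax, x) / (x, x) for a Hermitian matrix A and a nonzero vector x."""
    # np.vdot conjugates its first argument, so vdot(x, A @ x) equals x^* A x.
    return (np.vdot(x, A @ x) / np.vdot(x, x)).real

def random_hermitian(n, rng):
    """Build a random n x n Hermitian matrix (illustrative helper)."""
    B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return (B + B.conj().T) / 2

rng = np.random.default_rng(0)
A = random_hermitian(5, rng)
eigvals, eigvecs = np.linalg.eigh(A)  # ascending eigenvalues, orthonormal eigenvectors

# Rayleigh quotients of random vectors should stay inside [smallest, largest eigenvalue].
samples = [
    rayleigh_quotient(A, rng.standard_normal(5) + 1j * rng.standard_normal(5))
    for _ in range(1000)
]
inside_interval = eigvals[0] <= min(samples) and max(samples) <= eigvals[-1]

# The Rayleigh quotient of an eigenvector should reproduce its eigenvalue.
matches_eigenvalue = np.isclose(rayleigh_quotient(A, eigvecs[:, 2]), eigvals[2])

print(inside_interval, matches_eigenvalue)
```

Random sampling only probes the interior of the interval; the endpoints are reached at eigenvectors for the extreme eigenvalues.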

Min-max theorem

Let [math]\displaystyle{ A }[/math] be Hermitian on an inner product space [math]\displaystyle{ V }[/math] with dimension [math]\displaystyle{ n }[/math], with spectrum ordered in descending order [math]\displaystyle{ \lambda_1 \geq ... \geq \lambda_n }[/math].

Let [math]\displaystyle{ v_1, ..., v_n }[/math] be the corresponding unit-length orthogonal eigenvectors.

Reverse the spectrum ordering, so that [math]\displaystyle{ \xi_1 = \lambda_n, ..., \xi_n = \lambda_1 }[/math].

Theorem (Poincaré's inequality). Let [math]\displaystyle{ M }[/math] be a subspace of [math]\displaystyle{ V }[/math] with dimension [math]\displaystyle{ k }[/math]. Then there exist unit vectors [math]\displaystyle{ x, y\in M }[/math] such that

[math]\displaystyle{ \langle x, Ax\rangle\leq \lambda_k }[/math], and [math]\displaystyle{ \langle y, Ay\rangle \geq \xi_k }[/math].

Proof

Part 2 is a corollary, using [math]\displaystyle{ -A }[/math].

[math]\displaystyle{ M }[/math] is a [math]\displaystyle{ k }[/math]-dimensional subspace, and [math]\displaystyle{ N := \operatorname{span}(v_k, \ldots, v_n) }[/math] has dimension [math]\displaystyle{ n-k+1 }[/math]. Since these dimensions add up to [math]\displaystyle{ n+1 \gt n }[/math], the subspaces [math]\displaystyle{ M }[/math] and [math]\displaystyle{ N }[/math] must intersect in at least a line.

Take a unit vector [math]\displaystyle{ x \in M\cap N }[/math]; this is the vector we need.

[math]\displaystyle{ x = \sum_{i=k}^n a_i v_i }[/math], since [math]\displaystyle{ x\in N }[/math].

Since [math]\displaystyle{ \sum_{i=k}^n |a_i|^2 = 1 }[/math], we find [math]\displaystyle{ \langle x,Ax \rangle = \sum_{i=k}^n |a_i|^2\lambda_i \leq \lambda_k }[/math].
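Poincaré's inequality can be spot-checked on random subspaces. The sketch below assumes NumPy and represents a subspace by a matrix `Q` with orthonormal columns, so that the extreme values of ⟨x, Ax⟩ over unit vectors of the subspace are the extreme eigenvalues of `Q^* A Q`. The helper names are hypothetical and chosen only for this illustration.

```python
import numpy as np

def extreme_quadratic_values(A, Q):
    """Min and max of <x, Ax> over unit x in the column span of Q.

    Q must have orthonormal columns; the extremes are then the smallest and
    largest eigenvalues of the k x k matrix Q^* A Q.
    """
    mu = np.linalg.eigvalsh(Q.conj().T @ A @ Q)
    return mu[0], mu[-1]

def random_subspace(n, k, rng):
    """Orthonormal basis (n x k) of a random k-dimensional subspace."""
    Q, _ = np.linalg.qr(rng.standard_normal((n, k)))
    return Q

rng = np.random.default_rng(1)
n, k = 6, 3
B = rng.standard_normal((n, n))
A = (B + B.T) / 2                   # real symmetric, hence Hermitian
lam = np.linalg.eigvalsh(A)[::-1]   # lambda_1 >= ... >= lambda_n
xi = lam[::-1]                      # xi_1 <= ... <= xi_n

tol = 1e-12  # slack for floating-point rounding
poincare_holds = True
for _ in range(200):
    lo, hi = extreme_quadratic_values(A, random_subspace(n, k, rng))
    # Some unit x in M has <x, Ax> <= lambda_k, and some unit y has <y, Ay> >= xi_k.
    poincare_holds = poincare_holds and (lo <= lam[k - 1] + tol) and (hi >= xi[k - 1] - tol)

print(poincare_holds)
```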

min-max theorem — [math]\displaystyle{ \begin{aligned} \lambda_k &=\max _{\begin{array}{c} \mathcal{M} \subset V \\ \operatorname{dim}(\mathcal{M})=k \end{array}} \min _{\begin{array}{c} x \in \mathcal{M} \\ \|x\|=1 \end{array}}\langle x, A x\rangle\\ &=\min _{\begin{array}{c} \mathcal{M} \subset V \\ \operatorname{dim}(\mathcal{M})=n-k+1 \end{array}} \max _{\begin{array}{c} x \in \mathcal{M} \\ \|x\|=1 \end{array}}\langle x, A x\rangle \text{. } \end{aligned} }[/math]

Proof

Part 2 is a corollary of part 1, by using [math]\displaystyle{ -A }[/math].

By Poincaré's inequality, [math]\displaystyle{ \lambda_k }[/math] is an upper bound for the inner minimum over every [math]\displaystyle{ k }[/math]-dimensional subspace, and hence for the right side.

By setting [math]\displaystyle{ \mathcal M = span(v_1, ... v_k) }[/math], the upper bound is achieved.
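The attainment step of the proof can also be illustrated numerically. This is a sketch under the assumption that NumPy's `eigh` supplies the orthonormal eigenvectors; `min_on` and `max_on` are illustrative helpers, not notation from the article. It checks that the minimum of ⟨x, Ax⟩ on span(v_1, ..., v_k) and the maximum on span(v_k, ..., v_n) both equal λ_k.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6
B = rng.standard_normal((n, n))
A = (B + B.T) / 2

w, V = np.linalg.eigh(A)        # ascending order
lam, V = w[::-1], V[:, ::-1]    # reorder so that lam[0] >= ... >= lam[n-1]

def min_on(Q):
    """Minimum of <x, Ax> over unit x in the span of the orthonormal columns of Q."""
    return np.linalg.eigvalsh(Q.T @ A @ Q)[0]

def max_on(Q):
    """Maximum of <x, Ax> over unit x in the span of the orthonormal columns of Q."""
    return np.linalg.eigvalsh(Q.T @ A @ Q)[-1]

attained = all(
    # span(v_1..v_k) has dimension k; span(v_k..v_n) has dimension n-k+1.
    np.isclose(min_on(V[:, :k]), lam[k - 1]) and np.isclose(max_on(V[:, k - 1:]), lam[k - 1])
    for k in range(1, n + 1)
)
print(attained)
```

Combined with Poincaré's inequality, which bounds every other subspace of the same dimension, this is what makes the outer maximum and minimum equal to λ_k.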

Counterexample in the non-Hermitian case

Let N be the nilpotent matrix

[math]\displaystyle{ \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}. }[/math]

Define the Rayleigh quotient [math]\displaystyle{ R_N(x) }[/math] exactly as above in the Hermitian case. Then it is easy to see that the only eigenvalue of N is zero, while the maximum value of the Rayleigh quotient is $1 / 2$ . That is, the maximum value of the Rayleigh quotient is larger than the maximum eigenvalue.
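A short numerical check of this counterexample, assuming NumPy: for a general matrix the Rayleigh quotient can be complex, so the sketch compares real parts, using the fact that Re (Nx, x) = (Hx, x) for the Hermitian part H = (N + N*)/2. The variable names are illustrative.

```python
import numpy as np

N = np.array([[0.0, 1.0],
              [0.0, 0.0]])

# Both eigenvalues of the nilpotent matrix are zero.
eigenvalues = np.linalg.eigvals(N)

# The largest real part of the Rayleigh quotient of N is the largest
# eigenvalue of the Hermitian part H.
H = (N + N.conj().T) / 2
max_real_part = np.linalg.eigvalsh(H)[-1]

# The unit vector x = (1, 1)/sqrt(2) attains this value.
x = np.array([1.0, 1.0]) / np.sqrt(2)
value_at_x = x @ N @ x

print(eigenvalues, max_real_part, value_at_x)
```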

Applications

Min-max principle for singular values

The singular values {σ_k} of a square matrix M are the square roots of the eigenvalues of M*M (equivalently MM*). An immediate consequence of the min-max theorem, applied to M*M with its eigenvalues listed in increasing order, is:

[math]\displaystyle{ \sigma_k^{\uparrow} = \min_{S:\dim(S)=k} \max_{x \in S, \|x\| = 1} (M^* Mx, x)^{\frac{1}{2}}=\min_{S:\dim(S)=k} \max_{x \in S, \|x\| = 1} \| Mx \|. }[/math]

Similarly,

[math]\displaystyle{ \sigma_k^{\uparrow} = \max_{S:\dim(S)=n-k+1} \min_{x \in S, \|x\| = 1} \| Mx \|. }[/math]

Here [math]\displaystyle{ \sigma_k=\sigma_k^\uparrow }[/math] denotes the k-th entry in the increasing sequence of singular values, so that [math]\displaystyle{ \sigma_1\leq\sigma_2\leq\cdots }[/math].
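The singular-value characterization can be illustrated as follows. This is a sketch assuming NumPy; `max_norm_on` is a hypothetical helper that computes the maximum of ‖Mx‖ over unit vectors in a subspace as the spectral norm of `M @ Q`, where `Q` has orthonormal columns spanning that subspace.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5
M = rng.standard_normal((n, n))

# Singular values from the SVD (NumPy returns them in decreasing order).
U, s_desc, Vh = np.linalg.svd(M)
sigma_up = s_desc[::-1]  # increasing order, matching sigma_k^(up)

# Square roots of the eigenvalues of M^* M (eigvalsh returns ascending order).
gram_eigs = np.linalg.eigvalsh(M.conj().T @ M)
sigma_from_gram = np.sqrt(np.clip(gram_eigs, 0.0, None))

# Right singular vectors as columns, reordered to match increasing sigma.
V_up = Vh.conj().T[:, ::-1]

def max_norm_on(Q):
    """Max of ||Mx|| over unit x in the span of the orthonormal columns of Q."""
    return np.linalg.norm(M @ Q, 2)

# The span of the k right singular vectors with smallest sigma attains sigma_k.
attained = all(
    np.isclose(max_norm_on(V_up[:, :k]), sigma_up[k - 1]) for k in range(1, n + 1)
)

# Any other k-dimensional subspace gives a value that is at least sigma_k.
k = 2
Q_rand, _ = np.linalg.qr(rng.standard_normal((n, k)))
bounded_below = max_norm_on(Q_rand) >= sigma_up[k - 1] - 1e-12

print(np.allclose(sigma_up, sigma_from_gram), attained, bounded_below)
```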

Cauchy interlacing theorem

Main page: Poincaré separation theorem

Let $A$ be a symmetric n × n matrix. The m × m matrix B, where m ≤ n, is called a compression of $A$ if there exists an orthogonal projection P onto a subspace of dimension m such that PAP* = B. The Cauchy interlacing theorem states:

Theorem. If the eigenvalues of $A$ are $\alpha_1 \leq \cdots \leq \alpha_n$, and those of $B$ are $\beta_1 \leq \cdots \leq \beta_j \leq \cdots \leq \beta_m$, then for all $j \leq m$,

[math]\displaystyle{ \alpha_j \leq \beta_j \leq \alpha_{n-m+j}. }[/math]

This can be proven using the min-max principle. Let $\beta_i$ have corresponding eigenvector $b_i$ and let $S_j$ be the $j$-dimensional subspace $S_j = \operatorname{span}\{b_1, \ldots, b_j\}$; then

[math]\displaystyle{ \beta_j = \max_{x \in S_j, \|x\| = 1} (Bx, x) = \max_{x \in S_j, \|x\| = 1} (PAP^*x, x) \geq \min_{S_j} \max_{x \in S_j, \|x\| = 1} (A(P^*x), P^*x) = \alpha_j. }[/math]

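The last equality is the min-max characterization of $\alpha_j$ for eigenvalues listed in increasing order, so $\alpha_j \leq \beta_j$. On the other hand, if we define $S_{m-j+1} = \operatorname{span}\{b_j, \ldots, b_m\}$, then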

[math]\displaystyle{ \beta_j = \min_{x \in S_{m-j+1}, \|x\| = 1} (Bx, x) = \min_{x \in S_{m-j+1}, \|x\| = 1} (PAP^*x, x)= \min_{x \in S_{m-j+1}, \|x\| = 1} (A(P^*x), P^*x) \leq \alpha_{n-m+j}, }[/math]

where the last inequality follows from the max-min characterization of $\alpha_{n-m+j}$.

When $n - m = 1$, we have $\alpha_j \leq \beta_j \leq \alpha_{j+1}$, hence the name interlacing theorem.
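The interlacing inequalities can be verified on a random example. In this sketch, which assumes NumPy, the compression is formed as `Q.T @ A @ Q`, where the columns of `Q` are an orthonormal basis of an m-dimensional subspace. This is one concrete way to realize a compression and is an assumption of the example rather than the article's notation.

```python
import numpy as np

rng = np.random.default_rng(4)
n, m = 7, 4
C = rng.standard_normal((n, n))
A = (C + C.T) / 2                                  # symmetric n x n matrix

Q, _ = np.linalg.qr(rng.standard_normal((n, m)))   # orthonormal basis of an m-dim subspace
B = Q.T @ A @ Q                                    # m x m compression of A

alpha = np.linalg.eigvalsh(A)   # alpha_1 <= ... <= alpha_n
beta = np.linalg.eigvalsh(B)    # beta_1 <= ... <= beta_m

tol = 1e-12  # slack for floating-point rounding
# 0-based index j here corresponds to the 1-based index j+1 in the theorem.
interlaces = all(
    alpha[j] - tol <= beta[j] <= alpha[n - m + j] + tol for j in range(m)
)
print(interlaces)
```

Choosing `Q` as a subset of the columns of the identity matrix recovers the familiar special case in which B is a principal submatrix of A.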

Compact operators

Let $A$ be a compact, Hermitian operator on a Hilbert space H. Recall that the spectrum of such an operator (the set of eigenvalues) is a set of real numbers whose only possible cluster point is zero. It is thus convenient to list the positive eigenvalues of $A$ as

[math]\displaystyle{ \cdots \le \lambda_k \le \cdots \le \lambda_1, }[/math]

where entries are repeated with multiplicity, as in the matrix case. (To emphasize that the sequence is decreasing, we may write [math]\displaystyle{ \lambda_k = \lambda_k^\downarrow }[/math].) When H is infinite-dimensional, the above sequence of eigenvalues may be infinite. We now apply the same reasoning as in the matrix case. Letting S_k ⊂ H be a k-dimensional subspace, we can obtain the following theorem.

Theorem (Min-Max). Let $A$ be a compact, self-adjoint operator on a Hilbert space $H$, whose positive eigenvalues are listed in decreasing order $\cdots \leq \lambda_k \leq \cdots \leq \lambda_1$. Then:

[math]\displaystyle{ \begin{align} \max_{S_k} \min_{x \in S_k, \|x\| = 1} (Ax,x) &= \lambda_k ^{\downarrow}, \\ \min_{S_{k-1}} \max_{x \in S_{k-1}^{\perp}, \|x\|=1} (Ax, x) &= \lambda_k^{\downarrow}. \end{align} }[/math]

A similar pair of equalities holds for negative eigenvalues.
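As a worked illustration that is not taken from the source, consider the diagonal operator A on ℓ² with standard orthonormal basis e_1, e_2, ..., defined by A e_i = e_i / i. It is compact and self-adjoint, with positive eigenvalues 1, 1/2, 1/3, .... Taking S_k = span{e_1, ..., e_k} in the first equality and S_{k-1} = span{e_1, ..., e_{k-1}} in the second, both inner extrema can be computed by hand:

```latex
A e_i = \tfrac{1}{i}\, e_i, \qquad \lambda_k^{\downarrow} = \tfrac{1}{k}.

% On S_k = span{e_1, ..., e_k}: x = \sum_{i \le k} a_i e_i with \sum_{i \le k} |a_i|^2 = 1
(Ax, x) = \sum_{i=1}^{k} \frac{|a_i|^2}{i} \;\ge\; \frac{1}{k},
\qquad \text{with equality at } x = e_k.

% On S_{k-1}^{\perp} = closed span{e_k, e_{k+1}, ...}: x = \sum_{i \ge k} a_i e_i with \sum_{i \ge k} |a_i|^2 = 1
(Ax, x) = \sum_{i=k}^{\infty} \frac{|a_i|^2}{i} \;\le\; \frac{1}{k},
\qquad \text{with equality at } x = e_k.
```

For these particular subspaces the inner minimum and the inner maximum both equal 1/k, which is the value λ_k that the theorem assigns to the outer maximum and minimum.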

Proof

Let $u_1, u_2, \ldots$ be orthonormal eigenvectors of $A$ corresponding to $\lambda_1, \lambda_2, \ldots$, and let S' be the closure of the linear span [math]\displaystyle{ S' =\operatorname{span}\{u_k,u_{k+1},\ldots\} }[/math]. The subspace S' has codimension k − 1. By the same dimension count argument as in the matrix case, S' ∩ S_k has positive dimension. So there exists x ∈ S' ∩ S_k with [math]\displaystyle{ \|x\|=1 }[/math]. Since it is an element of S', such an x necessarily satisfies

[math]\displaystyle{ (Ax, x) \le \lambda_k. }[/math]

Therefore, for all S_k

[math]\displaystyle{ \inf_{x \in S_k, \|x\| = 1}(Ax,x) \le \lambda_k }[/math]

But $A$ is compact, therefore the function f(x) = (Ax, x) is weakly continuous. Furthermore, any bounded set in H is weakly compact. This lets us replace the infimum by minimum:

[math]\displaystyle{ \min_{x \in S_k, \|x\| = 1}(Ax,x) \le \lambda_k. }[/math]

So

[math]\displaystyle{ \sup_{S_k} \min_{x \in S_k, \|x\| = 1}(Ax,x) \le \lambda_k. }[/math]

Because equality is achieved when [math]\displaystyle{ S_k=\operatorname{span}\{u_1,\ldots,u_k\} }[/math],

[math]\displaystyle{ \max_{S_k} \min_{x \in S_k, \|x\| = 1}(Ax,x) = \lambda_k. }[/math]

This is the first part of min-max theorem for compact self-adjoint operators.

Analogously, consider now a $(k - 1)$-dimensional subspace $S_{k-1}$, whose orthogonal complement is denoted by $S_{k-1}^{\perp}$. If $S' = \operatorname{span}\{u_1, \ldots, u_k\}$, then

[math]\displaystyle{ S' \cap S_{k-1}^{\perp} \ne \{0\}. }[/math]

So

[math]\displaystyle{ \exists x \in S_{k-1}^{\perp} \, \|x\| = 1, (Ax, x) \ge \lambda_k. }[/math]

This implies

[math]\displaystyle{ \max_{x \in S_{k-1}^{\perp}, \|x\| = 1} (Ax, x) \ge \lambda_k }[/math]

where the compactness of A was applied. Taking the infimum over all (k − 1)-dimensional subspaces S_k−1 gives

[math]\displaystyle{ \inf_{S_{k-1}} \max_{x \in S_{k-1}^{\perp}, \|x\|=1} (Ax, x) \ge \lambda_k. }[/math]

Pick S_k−1 = span{u₁, ..., u_k−1} and we deduce

[math]\displaystyle{ \min_{S_{k-1}} \max_{x \in S_{k-1}^{\perp}, \|x\|=1} (Ax, x) = \lambda_k. }[/math]

Self-adjoint operators

The min-max theorem also applies to (possibly unbounded) self-adjoint operators.^[1]^[2] Recall the essential spectrum is the spectrum without isolated eigenvalues of finite multiplicity. Sometimes we have some eigenvalues below the essential spectrum, and we would like to approximate the eigenvalues and eigenfunctions.

Theorem (Min-Max). Let A be self-adjoint, and let [math]\displaystyle{ E_1\le E_2\le E_3\le\cdots }[/math] be the eigenvalues of A below the essential spectrum. Then

[math]\displaystyle{ E_n=\min_{\psi_1,\ldots,\psi_{n}}\max\{\langle\psi,A\psi\rangle:\psi\in\operatorname{span}(\psi_1,\ldots,\psi_{n}), \, \| \psi \| = 1\} }[/math].

If we only have N eigenvalues and hence run out of eigenvalues, then we let [math]\displaystyle{ E_n:=\inf\sigma_{ess}(A) }[/math] (the bottom of the essential spectrum) for n>N, and the above statement holds after replacing min-max with inf-sup.

Theorem (Max-Min). Let A be self-adjoint, and let [math]\displaystyle{ E_1\le E_2\le E_3\le\cdots }[/math] be the eigenvalues of A below the essential spectrum. Then

[math]\displaystyle{ E_n=\max_{\psi_1,\ldots,\psi_{n-1}}\min\{\langle\psi,A\psi\rangle:\psi\perp\psi_1,\ldots,\psi_{n-1}, \, \| \psi \| = 1\} }[/math].

If we only have N eigenvalues and hence run out of eigenvalues, then we let [math]\displaystyle{ E_n:=\inf\sigma_{ess}(A) }[/math] (the bottom of the essential spectrum) for n > N, and the above statement holds after replacing max-min with sup-inf.

The proofs^[1]^[2] use the following results about self-adjoint operators:

Theorem. Let A be self-adjoint. Then [math]\displaystyle{ (A-E)\ge0 }[/math] for [math]\displaystyle{ E\in\mathbb{R} }[/math] if and only if [math]\displaystyle{ \sigma(A)\subseteq[E,\infty) }[/math].^[1]^:77

Theorem. If A is self-adjoint, then

[math]\displaystyle{ \inf\sigma(A)=\inf_{\psi\in\mathfrak{D}(A),\|\psi\|=1}\langle\psi,A\psi\rangle }[/math]

and

[math]\displaystyle{ \sup\sigma(A)=\sup_{\psi\in\mathfrak{D}(A),\|\psi\|=1}\langle\psi,A\psi\rangle }[/math].^[1]^:77

References

1. G. Teschl, Mathematical Methods in Quantum Mechanics (GSM 99). https://www.mat.univie.ac.at/~gerald/ftp/book-schroe/schroe.pdf
2. Lieb; Loss (2001). Analysis. GSM. 14 (2nd ed.). Providence: American Mathematical Society. ISBN 0-8218-2783-9.

External links and citations to related work

Fisk, Steve (2005). A very short proof of Cauchy's interlace theorem for eigenvalues of Hermitian matrices.
Hwang, Suk-Geun (2004). "Cauchy's Interlace Theorem for Eigenvalues of Hermitian Matrices". The American Mathematical Monthly 111 (2): 157–159. doi:10.2307/4145217. https://www.jstor.org/stable/4145217.
Kline, Jeffery (2020). "Bordered Hermitian matrices and sums of the Möbius function". Linear Algebra and Its Applications 588: 224–237. doi:10.1016/j.laa.2019.12.004.
Reed, Michael; Simon, Barry (1978). Methods of Modern Mathematical Physics IV: Analysis of Operators. Academic Press. ISBN 978-0-08-057045-7. https://www.elsevier.com/books/iv-analysis-of-operators/reed/978-0-08-057045-7.

Original source: https://en.wikipedia.org/wiki/Min-max theorem.
