Hilbert projection theorem

From HandWiki
Short description: On closed convex subsets in Hilbert space

In mathematics, the Hilbert projection theorem is a famous result of convex analysis that says that for every vector x in a Hilbert space H and every nonempty closed convex CH, there exists a unique vector mC for which cx is minimized over the vectors cC; that is, such that mxcx for every cC.

Finite dimensional case

Some intuition for the theorem can be obtained by considering the first order condition of the optimization problem.

Consider a finite dimensional real Hilbert space H with a subspace C and a point x. If mC is a minimizer or minimum point of the function N:C defined by N(c):=cx (which is the same as the minimum point of ccx2), then derivative must be zero at m.

In matrix derivative notation:[1] xc2=cx,cx=2cx,c Since c is a vector in C that represents an arbitrary tangent direction, it follows that mx must be orthogonal to every vector in C.

Statement

Hilbert projection theorem — For every vector x in a Hilbert space H and every nonempty closed convex CH, there exists a unique vector mC for which xm is equal to δ:=infcCxc.

If the closed subset C is also a vector subspace of H then this minimizer m is the unique element in C such that xm is orthogonal to C.

Detailed elementary proof

Proof by reduction to a special case

It suffices to prove the theorem in the case of x=0 because the general case follows from the statement below by replacing C with Cx.

Hilbert projection theorem (case x=0)[2] — For every nonempty closed convex subset CH of a Hilbert space H, there exists a unique vector mC such that infcCc=m.

Furthermore, letting d:=infcCc, if (cn)n=1 is any sequence in C such that limncn=d in [note 1] then limncn=m in H.

Consequences

Proposition — If C is a closed vector subspace of a Hilbert space H then[note 3] H=CC.

Properties

Expression as a global minimum

The statement and conclusion of the Hilbert projection theorem can be expressed in terms of global minimums of the following functions. Their notation will also be used to simplify certain statements.

Given a non-empty subset CH and some xH, define a function dC,x:C[0,) by cxc. A global minimum point of dC,x, if one exists, is any point m in domaindC,x=C such that dC,x(m)dC,x(c) for all cC, in which case dC,x(m)=mx is equal to the global minimum value of the function dC,x, which is: infcCdC,x(c)=infcCxc.

Effects of translations and scalings

When this global minimum point m exists and is unique then denote it by min(C,x); explicitly, the defining properties of min(C,x) (if it exists) are: min(C,x)C and xmin(C,x)xc for all cC. The Hilbert projection theorem guarantees that this unique minimum point exists whenever C is a non-empty closed and convex subset of a Hilbert space. However, such a minimum point can also exist in non-convex or non-closed subsets as well; for instance, just as long is C is non-empty, if xC then min(C,x)=x.

If CH is a non-empty subset, s is any scalar, and x,x0H are any vectors then min(sC+x0,sx+x0)=smin(C,x)+x0 which implies: min(sC,sx)=smin(C,x)min(C,x)=min(C,x) min(C+x0,x+x0)=min(C,x)+x0min(Cx0,xx0)=min(C,x)x0 min(C,x)=min(C+x,0)xmin(C,0)+x=min(C+x,x)min(Cx,0)=min(C,x)x

Examples

The following counter-example demonstrates a continuous linear isomorphism A:HH for which min(A(C),A(x))A(min(C,x)). Endow H:=2 with the dot product, let x0:=(0,1), and for every real s, let Ls:={(x,sx):x} be the line of slope s through the origin, where it is readily verified that min(Ls,x0)=s1+s2(1,s). Pick a real number r0 and define A:22 by A(x,y):=(rx,y) (so this map scales the xcoordinate by r while leaving the ycoordinate unchanged). Then A:22 is an invertible continuous linear operator that satisfies A(Ls)=Ls/r and A(x0)=x0, so that min(A(Ls),A(x0))=sr2+s2(1,s) and A(min(Ls,x0))=s1+s2(r,s). Consequently, if C:=Ls with s0 and if (r,s)(±1,1) then min(A(C),A(x0))A(min(C,x0)).

Iterated projections

For any closed convex nonempty subset CH, let PC:HC be the projection function.

If there are multiple closed convex subsets C1,C2,,Cn, then one can approximate the projection operator PC1Cn by applying PC1,PC2,,PCn in sequence, then do it again and again. That is, one can approximate (PCnPC2PC1)kPC1Cn as k. The Kaczmarz method is a commonly used special case. Such methods can be computationally effective. For example, if C is a complicated shape, then projecting directly to C may be difficult. However, C can be approximated as an intersection of simple objects like half-spaces, hyperplanes, finite-dimensional subspaces, or cones.[4]

If C is a closed subspace, then it is convex. In this case, the projection function P:HC is an orthogonal projection (a continuous linear operator that is self-adjoint). A classic theorem states that, if C1,,Cn are closed subspaces, then[5] limk(PC1PCn)kxPC1Cnx=0,xH

See also

Notes

  1. Because the norm :H is continuous, if limnxn converges in H then necessarily limnxn converges in . But in general, the converse is not guaranteed. However, under this theorem's hypotheses, knowing that limncn=d in is sufficient to conclude that limncn converges in H.
  2. Explicitly, this means that given any ϵ>0 there exists some integer N>0 such that "the quantity" is ϵ whenever m,nN. Here, "the quantity" refers to the inequality's right hand side 2cm2+2cn24d2 and later in the proof, "the quantity" will also refer to cmcn2 and then cmcn. By definition of "Cauchy sequence," (cn)n=1 is Cauchy in H if and only if "the quantity" cmcn satisfies this aforementioned condition.
  3. Technically, H=KK means that the addition map K×KH defined by (k,p)k+p is a surjective linear isomorphism and homeomorphism. See the article on complemented subspaces for more details.

References

  1. Petersen, Kaare. "The Matrix Cookbook". https://www.math.uwaterloo.ca/~hwolkowi/matrixcookbook.pdf. 
  2. Rudin 1991, pp. 306–309.
  3. Rudin 1991, pp. 307−309.
  4. Deutsch, Frank (2001), "The Method of Alternating Projections", Best Approximation in Inner Product Spaces (New York, NY: Springer New York): pp. 193–235, doi:10.1007/978-1-4684-9298-9_9, ISBN 978-1-4419-2890-0, http://link.springer.com/10.1007/978-1-4684-9298-9_9 
  5. Netyanun, Anupan; Solmon, Donald C. (August 2006). "Iterated Products of Projections in Hilbert Space" (in en). The American Mathematical Monthly 113 (7): 644–648. doi:10.1080/00029890.2006.11920347. ISSN 0002-9890. https://www.tandfonline.com/doi/full/10.1080/00029890.2006.11920347. 

Bibliography

Template:Hilbert space