Hermitian adjoint

From HandWiki
Short description: Conjugate transpose of an operator in infinite dimensions

In mathematics, specifically in operator theory, each linear operator [math]\displaystyle{ A }[/math] on an inner product space defines a Hermitian adjoint (or adjoint) operator [math]\displaystyle{ A^* }[/math] on that space according to the rule

[math]\displaystyle{ \langle Ax,y \rangle = \langle x,A^*y \rangle, }[/math]

where [math]\displaystyle{ \langle \cdot,\cdot \rangle }[/math] is the inner product on the vector space.

The adjoint may also be called the Hermitian conjugate or simply the Hermitian[1] after Charles Hermite. It is often denoted by A† in fields like physics, especially when used in conjunction with bra–ket notation in quantum mechanics. In finite dimensions, where operators can be represented by matrices, the Hermitian adjoint is given by the conjugate transpose (also known as the Hermitian transpose).
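The finite-dimensional statement can be checked numerically. The following is a minimal sketch, assuming NumPy and the standard inner product on C^n taken linear in the first argument; the helper name `inner` and the random test data are illustrative choices, not part of the article.

```python
import numpy as np

def inner(u, v):
    """Standard inner product on C^n, linear in the first argument."""
    # np.vdot conjugates its first argument, so vdot(v, u) = sum(u_i * conj(v_i)).
    return np.vdot(v, u)

rng = np.random.default_rng(0)
n = 4
A = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
x = rng.standard_normal(n) + 1j * rng.standard_normal(n)
y = rng.standard_normal(n) + 1j * rng.standard_normal(n)

# Candidate adjoint: the conjugate transpose of the matrix of A.
A_star = A.conj().T

lhs = inner(A @ x, y)        # <Ax, y>
rhs = inner(x, A_star @ y)   # <x, A* y>
print(np.isclose(lhs, rhs))
```

Replacing `A.conj().T` with the plain transpose `A.T` generally breaks the identity for complex matrices, which is why the conjugation is part of the definition.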

The above definition of an adjoint operator extends verbatim to bounded linear operators on Hilbert spaces [math]\displaystyle{ H }[/math]. The definition has been further extended to include unbounded densely defined operators, whose domain is topologically dense in, but not necessarily equal to, [math]\displaystyle{ H. }[/math]

Informal definition

Consider a linear map [math]\displaystyle{ A: H_1\to H_2 }[/math] between Hilbert spaces. Without taking care of any details, the adjoint operator is the (in most cases uniquely defined) linear operator [math]\displaystyle{ A^* : H_2 \to H_1 }[/math] fulfilling

[math]\displaystyle{ \left\langle A h_1, h_2 \right\rangle_{H_2} = \left\langle h_1, A^* h_2 \right\rangle_{H_1}, }[/math]

where [math]\displaystyle{ \langle\cdot, \cdot \rangle_{H_i} }[/math] is the inner product in the Hilbert space [math]\displaystyle{ H_i }[/math], which is linear in the first coordinate and conjugate linear in the second coordinate. Note the special case where both Hilbert spaces are identical and [math]\displaystyle{ A }[/math] is an operator on that Hilbert space.

When one trades the inner product for the dual pairing, one can define the adjoint, also called the transpose, of an operator [math]\displaystyle{ A: E \to F }[/math], where [math]\displaystyle{ E, F }[/math] are Banach spaces with corresponding norms [math]\displaystyle{ \|\cdot\|_E, \|\cdot\|_F }[/math]. Here (again not considering any technicalities), its adjoint operator is defined as [math]\displaystyle{ A^*: F^* \to E^* }[/math] with

[math]\displaystyle{ A^*f = f \circ A : u \mapsto f(Au), }[/math]

i.e., [math]\displaystyle{ \left(A^*f\right)(u) = f(Au) }[/math] for [math]\displaystyle{ f \in F^*, u \in E }[/math].
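In finite dimensions this Banach-space adjoint is just precomposition with the matrix, and in coordinates it becomes the ordinary transpose. A small sketch, assuming real spaces E = R^2 and F = R^3 and functionals represented by coefficient vectors (the names `adjoint_apply`, `c` and `coeffs` are hypothetical helpers for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 2))   # A : E = R^2 -> F = R^3
c = rng.standard_normal(3)        # coefficient vector of a functional f on F

def f(v):
    """The linear functional f(v) = c . v on F."""
    return c @ v

def adjoint_apply(functional, operator):
    """Return the functional u -> functional(operator @ u), i.e. functional composed with operator."""
    return lambda u: functional(operator @ u)

A_star_f = adjoint_apply(f, A)    # a functional on E
u = rng.standard_normal(2)

# In coordinates, A* f has coefficient vector A^T c.
coeffs = A.T @ c
print(np.isclose(A_star_f(u), coeffs @ u))
```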

The above definition in the Hilbert space setting is really just an application of the Banach space case when one identifies a Hilbert space with its dual. It is then natural that one can also obtain the adjoint of an operator [math]\displaystyle{ A: H \to E }[/math], where [math]\displaystyle{ H }[/math] is a Hilbert space and [math]\displaystyle{ E }[/math] is a Banach space. The adjoint is then defined as [math]\displaystyle{ A^*: E^* \to H }[/math] with [math]\displaystyle{ A^*f = h_f }[/math] such that

[math]\displaystyle{ \langle h_f, h\rangle_H = f(Ah). }[/math]

Definition for unbounded operators between Banach spaces

Let [math]\displaystyle{ \left(E, \|\cdot\|_E\right), \left(F, \|\cdot\|_F\right) }[/math] be Banach spaces. Suppose [math]\displaystyle{ A: D(A) \to F }[/math] and [math]\displaystyle{ D(A) \subset E }[/math], and suppose that [math]\displaystyle{ A }[/math] is a (possibly unbounded) linear operator which is densely defined (i.e., [math]\displaystyle{ D(A) }[/math] is dense in [math]\displaystyle{ E }[/math]). Then its adjoint operator [math]\displaystyle{ A^* }[/math] is defined as follows. The domain is

[math]\displaystyle{ D\left(A^*\right) := \left\{g \in F^*:~ \exists c \geq 0:~ \mbox{ for all } u \in D(A):~ |g(Au)| \leq c \cdot \|u\|_E\right\}. }[/math]

Now for arbitrary but fixed [math]\displaystyle{ g \in D(A^*) }[/math] we set [math]\displaystyle{ f: D(A) \to \R }[/math] with [math]\displaystyle{ f(u) = g(Au) }[/math]. By choice of [math]\displaystyle{ g }[/math] and definition of [math]\displaystyle{ D(A^*) }[/math], [math]\displaystyle{ f }[/math] is (uniformly) continuous on [math]\displaystyle{ D(A) }[/math] as [math]\displaystyle{ |f(u)| = |g(Au)| \leq c\cdot \|u\|_E }[/math]. Then by the Hahn–Banach theorem, or alternatively through extension by continuity, this yields an extension of [math]\displaystyle{ f }[/math], called [math]\displaystyle{ \hat{f} }[/math], defined on all of [math]\displaystyle{ E }[/math]. Because [math]\displaystyle{ D(A) }[/math] is dense in [math]\displaystyle{ E }[/math], this continuous extension is unique. This technicality is necessary to later obtain [math]\displaystyle{ A^* }[/math] as an operator [math]\displaystyle{ D\left(A^*\right) \to E^* }[/math] instead of [math]\displaystyle{ D\left(A^*\right) \to (D(A))^*. }[/math] Note also that this does not mean that [math]\displaystyle{ A }[/math] can be extended to all of [math]\displaystyle{ E }[/math]; the extension only works for the specific elements [math]\displaystyle{ g \in D\left(A^*\right) }[/math].

Now, we can define the adjoint of [math]\displaystyle{ A }[/math] as

[math]\displaystyle{ \begin{align} A^*: F^* \supset D(A^*) &\to E^* \\ g &\mapsto A^*g = \hat f. \end{align} }[/math]

The fundamental defining identity is thus

[math]\displaystyle{ g(Au) = \left(A^* g\right)(u) }[/math] for [math]\displaystyle{ u \in D(A). }[/math]

Definition for bounded operators between Hilbert spaces

Suppose H is a complex Hilbert space, with inner product [math]\displaystyle{ \langle\cdot,\cdot\rangle }[/math]. Consider a continuous linear operator A : H → H (for linear operators, continuity is equivalent to being a bounded operator). Then the adjoint of A is the continuous linear operator A∗ : H → H satisfying

[math]\displaystyle{ \langle Ax , y \rangle = \left\langle x , A^* y\right\rangle \quad \mbox{for all } x, y \in H. }[/math]

Existence and uniqueness of this operator follow from the Riesz representation theorem.[2]
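The way the Riesz representation theorem enters can be sketched as follows. This is an outline of the standard argument rather than a quotation from the cited sources; the symbol z_y is introduced here only for illustration.

```latex
% Fix y in H. The map x -> <Ax, y> is a bounded linear functional on H, since
\[
  |\langle Ax, y\rangle| \le \|Ax\|\,\|y\| \le \|A\|_{\mathrm{op}}\,\|y\|\,\|x\| .
\]
% By the Riesz representation theorem there is exactly one z_y in H with
\[
  \langle Ax, y\rangle = \langle x, z_y\rangle \quad \text{for all } x \in H ,
\]
% and one sets A^* y := z_y. Both sides are conjugate linear in y, so y -> A^* y is linear.
% Boundedness follows by taking x = A^* y in the defining identity:
\[
  \|A^* y\|^2 = \langle A A^* y, y\rangle \le \|A\|_{\mathrm{op}}\,\|A^* y\|\,\|y\|
  \quad\Longrightarrow\quad \|A^* y\| \le \|A\|_{\mathrm{op}}\,\|y\| .
\]
```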

This can be seen as a generalization of the adjoint matrix of a square matrix which has a similar property involving the standard complex inner product.

Properties

The following properties of the Hermitian adjoint of bounded operators are immediate:[2]

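  1. Involutivity: A∗∗ = A
  2. If A is invertible, then so is A∗, with [math]\displaystyle{ \left(A^*\right)^{-1} = \left(A^{-1}\right)^* }[/math]
  3. Conjugate linearity: [math]\displaystyle{ (A + B)^* = A^* + B^* }[/math] and [math]\displaystyle{ (\lambda A)^* = \bar\lambda A^* }[/math], where [math]\displaystyle{ \bar\lambda }[/math] denotes the complex conjugate of the complex number [math]\displaystyle{ \lambda }[/math]
  4. "Anti-distributivity": (AB)∗ = B∗A∗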
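For matrices, where the adjoint is the conjugate transpose, these algebraic properties (involutivity, compatibility with inverses, additivity together with conjugate homogeneity, and reversal of products) can be spot-checked numerically. A minimal sketch, assuming NumPy; `adj` and `rand_c` are illustrative helper names, and a random complex matrix is assumed to be invertible (which holds with probability one).

```python
import numpy as np

rng = np.random.default_rng(2)

def rand_c(n):
    """Random complex n x n matrix."""
    return rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

def adj(M):
    """Hermitian adjoint of a matrix: the conjugate transpose."""
    return M.conj().T

A, B = rand_c(3), rand_c(3)
lam = 2.0 - 3.0j

checks = {
    "involutivity": np.allclose(adj(adj(A)), A),
    "inverse": np.allclose(np.linalg.inv(adj(A)), adj(np.linalg.inv(A))),
    "additivity": np.allclose(adj(A + B), adj(A) + adj(B)),
    "conjugate homogeneity": np.allclose(adj(lam * A), np.conj(lam) * adj(A)),
    "anti-distributivity": np.allclose(adj(A @ B), adj(B) @ adj(A)),
}
for name, ok in checks.items():
    print(name, ok)
```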

If we define the operator norm of A by

[math]\displaystyle{ \| A \|_\text{op} := \sup \left\{\|Ax\| : \|x\| \le 1\right\} }[/math]

then

[math]\displaystyle{ \left\|A^* \right\|_\text{op} = \|A\|_\text{op}. }[/math][2]

Moreover,

[math]\displaystyle{ \left\|A^* A \right\|_\text{op} = \|A\|_\text{op}^2. }[/math][2]

One says that a norm that satisfies this condition behaves like a "largest value", extrapolating from the case of self-adjoint operators.
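Both norm identities can be illustrated for matrices, where the operator norm induced by the Euclidean norm is the largest singular value. A short sketch, assuming NumPy's `np.linalg.norm(M, 2)` as the operator norm; the helper name `op_norm` is illustrative.

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
A_star = A.conj().T

def op_norm(M):
    """Operator (spectral) norm: the largest singular value of M."""
    return np.linalg.norm(M, 2)

same_norm = np.isclose(op_norm(A_star), op_norm(A))              # ||A*|| = ||A||
c_star_identity = np.isclose(op_norm(A_star @ A), op_norm(A) ** 2)  # ||A* A|| = ||A||^2
print(same_norm, c_star_identity)
```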

The set of bounded linear operators on a complex Hilbert space H, together with the adjoint operation and the operator norm, forms the prototype of a C*-algebra.

Adjoint of densely defined unbounded operators between Hilbert spaces

Definition

Let the inner product [math]\displaystyle{ \langle \cdot, \cdot \rangle }[/math] be linear in the first argument. A densely defined operator A from a complex Hilbert space H to itself is a linear operator whose domain D(A) is a dense linear subspace of H and whose values lie in H.[3] By definition, the domain D(A∗) of its adjoint A∗ is the set of all y ∈ H for which there is a z ∈ H satisfying

[math]\displaystyle{ \langle Ax , y \rangle = \langle x , z \rangle \quad \mbox{for all } x \in D(A). }[/math]

Owing to the density of [math]\displaystyle{ D(A) }[/math] and the Riesz representation theorem, [math]\displaystyle{ z }[/math] is uniquely defined, and, by definition, [math]\displaystyle{ A^*y=z. }[/math][4]

The properties listed above for bounded operators hold with appropriate clauses about domains and codomains. For instance, the last property now states that (AB)∗ is an extension of B∗A∗ if A, B and AB are densely defined operators.[5]

ker A* = (im A)⊥

For every [math]\displaystyle{ y \in \ker A^*, }[/math] the linear functional [math]\displaystyle{ x \mapsto \langle Ax,y \rangle = \langle x,A^*y\rangle }[/math] is identically zero, and hence [math]\displaystyle{ y \in (\operatorname{im} A)^\perp. }[/math]

Conversely, the assumption that [math]\displaystyle{ y \in (\operatorname{im} A)^\perp }[/math] causes the functional [math]\displaystyle{ x \mapsto \langle Ax,y \rangle }[/math] to be identically zero. Since the functional is obviously bounded, the definition of [math]\displaystyle{ A^* }[/math] assures that [math]\displaystyle{ y \in D(A^*). }[/math] The fact that, for every [math]\displaystyle{ x \in D(A), }[/math] [math]\displaystyle{ \langle Ax,y \rangle = \langle x,A^*y\rangle = 0 }[/math] shows that [math]\displaystyle{ A^* y \in D(A)^\perp =\overline{D(A)}^\perp = \{0\}, }[/math] given that [math]\displaystyle{ D(A) }[/math] is dense.

This property shows that [math]\displaystyle{ \operatorname{ker}A^* }[/math] is a topologically closed subspace even when [math]\displaystyle{ D(A^*) }[/math] is not.
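In finite dimensions every operator is bounded and everywhere defined, so the identity between the kernel of the adjoint and the orthogonal complement of the image can be checked directly. The sketch below assumes NumPy, builds a rank-2 matrix on C^4 so that the complement is nontrivial, and extracts a basis of the kernel of the adjoint from a singular value decomposition; the variable names and tolerance are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(4)

def rand_c(shape):
    """Random complex array of the given shape."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

def inner(u, v):
    """Inner product on C^n, linear in the first argument."""
    return np.vdot(v, u)

# A rank-2 operator on C^4 (product of a 4x2 and a 2x4 matrix).
A = rand_c((4, 2)) @ rand_c((2, 4))
A_star = A.conj().T

# Right-singular vectors of A* with zero singular value span ker A*.
_, s, Vh = np.linalg.svd(A_star)
rank = int(np.sum(s > 1e-10 * s.max()))
kernel_basis = Vh[rank:].conj().T          # columns span ker A*

# Each kernel vector y should satisfy <Ax, y> = 0 for every x.
x = rand_c(4)
orthogonal_to_image = all(
    np.isclose(inner(A @ x, y), 0) for y in kernel_basis.T
)
# Dimension count: dim ker A* = n - rank A = dim (im A)^perp.
dims_match = kernel_basis.shape[1] == 4 - np.linalg.matrix_rank(A)
print(orthogonal_to_image, dims_match)
```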

Geometric interpretation

If [math]\displaystyle{ H_1 }[/math] and [math]\displaystyle{ H_2 }[/math] are Hilbert spaces, then [math]\displaystyle{ H_1 \oplus H_2 }[/math] is a Hilbert space with the inner product

[math]\displaystyle{ \bigl \langle (a,b),(c,d) \bigr \rangle_{H_1 \oplus H_2} \stackrel{\text{def}}{=} \langle a,c \rangle_{H_1} + \langle b,d \rangle_{H_2}, }[/math]

where [math]\displaystyle{ a,c \in H_1 }[/math] and [math]\displaystyle{ b,d \in H_2. }[/math]

Let [math]\displaystyle{ J\colon H\oplus H \to H \oplus H }[/math] be the symplectic mapping, i.e. [math]\displaystyle{ J(\xi, \eta) = (-\eta, \xi). }[/math] Then the graph

[math]\displaystyle{ G(A^*) =\{(x,y) \mid x\in D(A^*),\ y=A^*x\} \subseteq H \oplus H }[/math]

of [math]\displaystyle{ A^* }[/math] is the orthogonal complement of [math]\displaystyle{ JG(A): }[/math]

[math]\displaystyle{ G(A^*) = (JG(A))^\perp = \{ (x, y) \in H \oplus H : \bigl \langle (x, y) , (-A\xi, \xi) \bigr \rangle_{H \oplus H} = 0\;\;\forall \xi \in D(A)\}. }[/math]

The assertion follows from the equivalences

[math]\displaystyle{ \bigl \langle (x, y) , (-A\xi, \xi) \bigr \rangle = 0 \quad \Leftrightarrow \quad \langle A\xi, x \rangle = \langle \xi, y \rangle, }[/math]

and

[math]\displaystyle{ \Bigl[ \forall \xi \in D(A)\ \ \langle A\xi, x \rangle = \langle \xi, y \rangle \Bigr] \quad \Leftrightarrow \quad x \in D(A^*)\ \&\ y = A^*x. }[/math]
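For a matrix A on C^n the graph picture can be verified concretely: the graph of the adjoint and the image of the graph of A under J are orthogonal n-dimensional subspaces of C^n ⊕ C^n, so each is the full orthogonal complement of the other. A minimal sketch, assuming NumPy and representing each subspace by a matrix whose columns form a basis (the names `G_A_star` and `JG_A` are illustrative):

```python
import numpy as np

rng = np.random.default_rng(5)
n = 3
A = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A_star = A.conj().T
I = np.eye(n)

# Columns are basis vectors, written as stacked pairs in the direct sum C^n + C^n.
G_A_star = np.vstack([I, A_star])   # pairs (x, A* x)
JG_A = np.vstack([-A, I])           # pairs J(xi, A xi) = (-A xi, xi)

# All pairwise inner products between the two bases vanish.
gram = G_A_star.conj().T @ JG_A
orthogonal = np.allclose(gram, 0)

# Together the two n-dimensional subspaces span the whole 2n-dimensional space.
total_rank = np.linalg.matrix_rank(np.hstack([G_A_star, JG_A]))
print(orthogonal, total_rank == 2 * n)
```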

Corollaries

A* is closed

An operator [math]\displaystyle{ A }[/math] is closed if the graph [math]\displaystyle{ G(A) }[/math] is topologically closed in [math]\displaystyle{ H \oplus H. }[/math] The graph [math]\displaystyle{ G(A^*) }[/math] of the adjoint operator [math]\displaystyle{ A^* }[/math] is the orthogonal complement of a subspace, and therefore is closed.

A* is densely defined ⇔ A is closable

An operator [math]\displaystyle{ A }[/math] is closable if the topological closure [math]\displaystyle{ G^\text{cl}(A) \subseteq H \oplus H }[/math] of the graph [math]\displaystyle{ G(A) }[/math] is the graph of a function. Since [math]\displaystyle{ G^\text{cl}(A) }[/math] is a (closed) linear subspace, the word "function" may be replaced with "linear operator". For the same reason, [math]\displaystyle{ A }[/math] is closable if and only if [math]\displaystyle{ (0,v) \notin G^\text{cl}(A) }[/math] unless [math]\displaystyle{ v=0. }[/math]

The adjoint [math]\displaystyle{ A^* }[/math] is densely defined if and only if [math]\displaystyle{ A }[/math] is closable. This follows from the fact that, for every [math]\displaystyle{ v \in H, }[/math]

[math]\displaystyle{ v \in D(A^*)^\perp\ \Leftrightarrow\ (0,v) \in G^\text{cl}(A), }[/math]

which, in turn, is proven through the following chain of equivalencies:

[math]\displaystyle{ \begin{align} v \in D(A^*)^\perp &\Longleftrightarrow (v,0) \in G(A^*)^\perp \Longleftrightarrow (v,0) \in (JG(A))^\text{cl} = JG^\text{cl}(A) \\ &\Longleftrightarrow (0,-v) = J^{-1}(v,0) \in G^\text{cl}(A) \\ &\Longleftrightarrow (0,v) \in G^\text{cl}(A). \end{align} }[/math]

A** = A^cl

The closure [math]\displaystyle{ A^\text{cl} }[/math] of an operator [math]\displaystyle{ A }[/math] is the operator whose graph is [math]\displaystyle{ G^\text{cl}(A) }[/math] if this graph represents a function. As above, the word "function" may be replaced with "operator". Furthermore, [math]\displaystyle{ A^{**} = A^{\text{cl}}, }[/math] meaning that [math]\displaystyle{ G(A^{**}) = G^{\text{cl}}(A). }[/math]

To prove this, observe that [math]\displaystyle{ J^* = -J, }[/math] i.e. [math]\displaystyle{ \langle Jx,y\rangle_{H \oplus H} = -\langle x,Jy\rangle_{H \oplus H}, }[/math] for every [math]\displaystyle{ x,y \in H \oplus H. }[/math] Indeed,

[math]\displaystyle{ \begin{align} \langle J(x_1,x_2),(y_1,y_2)\rangle_{H \oplus H} &= \langle (-x_2,x_1),(y_1,y_2)\rangle_{H \oplus H} = \langle -x_2,y_1\rangle_H + \langle x_1,y_2 \rangle_H \\ &= \langle x_1,y_2 \rangle_H + \langle x_2,-y_1 \rangle_H = \langle (x_1,x_2),-J(y_1,y_2)\rangle_{H \oplus H}. \end{align} }[/math]

In particular, for every [math]\displaystyle{ y \in H \oplus H }[/math] and every subspace [math]\displaystyle{ V \subseteq H \oplus H, }[/math] [math]\displaystyle{ y \in (JV)^\perp }[/math] if and only if [math]\displaystyle{ Jy \in V^\perp. }[/math] Thus, [math]\displaystyle{ J[(JV)^\perp] = V^\perp }[/math] and [math]\displaystyle{ [J[(JV)^\perp]]^\perp = V^\text{cl}. }[/math] Substituting [math]\displaystyle{ V = G(A), }[/math] we obtain [math]\displaystyle{ G^\text{cl}(A) = G(A^{**}). }[/math]

A* = (A^cl)*

For a closable operator [math]\displaystyle{ A, }[/math] [math]\displaystyle{ A^* = \left(A^\text{cl}\right)^*, }[/math] meaning that [math]\displaystyle{ G(A^*) = G\left(\left(A^\text{cl}\right)^*\right). }[/math] Indeed,

[math]\displaystyle{ G\left(\left(A^\text{cl}\right)^*\right) = \left(JG^\text{cl}(A)\right)^\perp = \left(\left(JG(A)\right)^\text{cl}\right)^\perp = (JG(A))^\perp = G(A^*). }[/math]

Counterexample where the adjoint is not densely defined

Let [math]\displaystyle{ H=L^2(\mathbb{R},l), }[/math] where [math]\displaystyle{ l }[/math] is the linear measure. Select a measurable, bounded, non-identically zero function [math]\displaystyle{ f \notin L^2, }[/math] and pick [math]\displaystyle{ \varphi_0 \in L^2 \setminus \{0\}. }[/math] Define

[math]\displaystyle{ A \varphi = \langle f,\varphi\rangle \varphi_0. }[/math]

It follows that [math]\displaystyle{ D(A) = \{\varphi \in L^2 \mid \langle f,\varphi\rangle \neq \infty\}. }[/math] The subspace [math]\displaystyle{ D(A) }[/math] contains all the [math]\displaystyle{ L^2 }[/math] functions with compact support. Since [math]\displaystyle{ \mathbf{1}_{[-n,n]} \cdot \varphi\ \stackrel{L^2}{\to}\ \varphi, }[/math] [math]\displaystyle{ A }[/math] is densely defined. For every [math]\displaystyle{ \varphi \in D(A) }[/math] and [math]\displaystyle{ \psi \in D(A^*), }[/math]

[math]\displaystyle{ \langle \varphi, A^*\psi \rangle = \langle A\varphi, \psi \rangle = \langle \langle f,\varphi \rangle\varphi_0, \psi \rangle = \langle f,\varphi \rangle\cdot \langle \varphi_0, \psi \rangle = \langle \varphi, \langle \varphi_0, \psi \rangle f\rangle. }[/math]

Thus, [math]\displaystyle{ A^* \psi = \langle \varphi_0, \psi \rangle f. }[/math] The definition of adjoint operator requires that [math]\displaystyle{ \mathop{\text{Im}}A^* \subseteq H=L^2. }[/math] Since [math]\displaystyle{ f \notin L^2, }[/math] this is only possible if [math]\displaystyle{ \langle \varphi_0, \psi \rangle= 0. }[/math] For this reason, [math]\displaystyle{ D(A^*) = \{\varphi_0\}^\perp. }[/math] Hence, [math]\displaystyle{ A^* }[/math] is not densely defined and is identically zero on [math]\displaystyle{ D(A^*). }[/math] As a result, [math]\displaystyle{ A }[/math] is not closable and has no second adjoint [math]\displaystyle{ A^{**}. }[/math]

Hermitian operators

A bounded operator A : H → H is called Hermitian or self-adjoint if

[math]\displaystyle{ A = A^* }[/math]

which is equivalent to

[math]\displaystyle{ \langle Ax , y \rangle = \langle x , A y \rangle \mbox{ for all } x, y \in H. }[/math][6]

In some sense, these operators play the role of the real numbers (being equal to their own "complex conjugate") and form a real vector space. They serve as the model of real-valued observables in quantum mechanics. See the article on self-adjoint operators for a full treatment.
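The analogy with real numbers is visible for matrices: a Hermitian matrix has real eigenvalues and real quadratic forms ⟨Ax, x⟩. The following sketch assumes NumPy and constructs a Hermitian matrix as B + B∗ from a random complex B; the names `B`, `eigs` and `quad` are illustrative.

```python
import numpy as np

rng = np.random.default_rng(6)

def inner(u, v):
    """Inner product on C^n, linear in the first argument."""
    return np.vdot(v, u)

B = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
A = B + B.conj().T                      # Hermitian by construction
x = rng.standard_normal(4) + 1j * rng.standard_normal(4)
y = rng.standard_normal(4) + 1j * rng.standard_normal(4)

is_self_adjoint = np.allclose(A, A.conj().T)                     # A = A*
symmetric_form = np.isclose(inner(A @ x, y), inner(x, A @ y))    # <Ax, y> = <x, Ay>

# General eigenvalue solver (no Hermitian assumption): imaginary parts vanish up to rounding.
eigs = np.linalg.eigvals(A)
real_spectrum = np.allclose(eigs.imag, 0)

# The quadratic form <Ax, x> is real as well.
quad = inner(A @ x, x)
real_quadratic_form = np.isclose(quad.imag, 0)
print(is_self_adjoint, symmetric_form, real_spectrum, real_quadratic_form)
```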

Adjoints of conjugate-linear operators

For a conjugate-linear operator the definition of adjoint needs to be adjusted in order to compensate for the complex conjugation. An adjoint operator of the conjugate-linear operator A on a complex Hilbert space H is a conjugate-linear operator A∗ : H → H with the property:

[math]\displaystyle{ \langle Ax , y \rangle = \overline{\left\langle x , A^* y \right\rangle} \quad \text{for all } x, y \in H. }[/math]
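A concrete instance on C^n: the map x ↦ M x̄, for a fixed complex matrix M, is conjugate linear, and with the inner product linear in the first argument its adjoint in the above sense works out to y ↦ Mᵀ ȳ. This example is an illustration chosen here, not one taken from the article; the sketch assumes NumPy and checks the adjusted identity numerically.

```python
import numpy as np

rng = np.random.default_rng(7)
n = 3
M = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

def inner(u, v):
    """Inner product on C^n, linear in the first argument."""
    return np.vdot(v, u)

def A(x):
    """Conjugate-linear operator x -> M conj(x)."""
    return M @ np.conj(x)

def A_star(y):
    """Candidate adjoint y -> M^T conj(y), also conjugate linear."""
    return M.T @ np.conj(y)

x = rng.standard_normal(n) + 1j * rng.standard_normal(n)
y = rng.standard_normal(n) + 1j * rng.standard_normal(n)
lam = 1.5 + 0.5j

conjugate_linear = np.allclose(A(lam * x), np.conj(lam) * A(x))
lhs = inner(A(x), y)                    # <Ax, y>
rhs = np.conj(inner(x, A_star(y)))      # conjugate of <x, A* y>
print(conjugate_linear, np.isclose(lhs, rhs))
```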

Other adjoints

The equation

[math]\displaystyle{ \langle Ax , y \rangle = \left\langle x, A^* y \right\rangle }[/math]

is formally similar to the defining properties of pairs of adjoint functors in category theory, which is where adjoint functors got their name.

References

  1. Miller, David A. B. (2008). Quantum Mechanics for Scientists and Engineers. Cambridge University Press. pp. 262, 280. 
  2. Reed & Simon 2003, pp. 186–187; Rudin 1991, §12.9
  3. See unbounded operator for details.
  4. Reed & Simon 2003, p. 252; Rudin 1991, §13.1
  5. Rudin 1991, Thm 13.2
  6. Reed & Simon 2003, p. 187; Rudin 1991, §12.11