Quantum depolarizing channel

From HandWiki
Revision as of 07:08, 24 October 2022 by Rjetedi (talk | contribs) (correction)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Model for quantum noise in quantum systems

A quantum depolarizing channel is a model for quantum noise in quantum systems. The [math]\displaystyle{ d }[/math]-dimensional depolarizing channel can be viewed as a completely positive trace-preserving map [math]\displaystyle{ \Delta_\lambda }[/math], depending on one parameter [math]\displaystyle{ \lambda }[/math], which maps a state [math]\displaystyle{ \rho }[/math] onto a linear combination of itself and the maximally mixed state,

[math]\displaystyle{ \Delta_\lambda(\rho)=(1-\lambda)\rho+\frac{\lambda}{d}I }[/math].

The condition of complete positivity requires [math]\displaystyle{ \lambda }[/math] to satisfy the bounds

[math]\displaystyle{ 0\le\lambda\le 1+\frac{1}{d^2-1} }[/math].

Qubit Channel

The single qubit depolarizing channel has operator-sum representation[1] on a density matrix [math]\displaystyle{ \rho }[/math] given by

[math]\displaystyle{ \Delta_\lambda(\rho) = \sum_{i=0}^{3} K_i \rho K_i^\dagger, }[/math]

where [math]\displaystyle{ K_i }[/math] are the Kraus operators given by

[math]\displaystyle{ K_0 = \sqrt{1-\frac{3\lambda}{4}} I, K_1 = \sqrt{\frac{\lambda}{4}} X, K_2 = \sqrt{\frac{\lambda}{4}} Y, K_3 = \sqrt{\frac{\lambda}{4}} Z }[/math]

and [math]\displaystyle{ \{I,X,Y,Z\} }[/math] are the Pauli matrices. The trace preserving condition is satisfied by the fact that [math]\displaystyle{ \sum_{i}K_i ^\dagger K_i = I. }[/math]

Geometrically the depolarizing channel [math]\displaystyle{ \Delta_\lambda }[/math] can be interpreted as a uniform contraction of the Bloch sphere, parameterized by [math]\displaystyle{ \lambda }[/math]. In the case where [math]\displaystyle{ \lambda=1 }[/math] the channel returns the maximally-mixed state for any input state [math]\displaystyle{ \rho }[/math], which corresponds of the complete contraction of the Bloch-sphere down to the single-point [math]\displaystyle{ \frac{I}{2} }[/math] given by the origin.

Classical capacity

The HSW theorem states that the classical capacity of a quantum channel [math]\displaystyle{ \Psi }[/math] can be characterized as its regularized Holevo information:

[math]\displaystyle{ \lim_{n\to\infty}\frac{1}{n}\chi\left(\Psi^{\otimes n}\right) }[/math]

This quantity is difficult to compute and this reflects our ignorance on quantum channels. However, if the Holevo information is additive for a channel [math]\displaystyle{ \Psi }[/math], i.e.,

[math]\displaystyle{ \chi\left(\Psi\otimes\Psi\right)=\chi\left(\Psi\right)+\chi\left(\Psi\right) }[/math]

Then we can get its classical capacity by computing the Holevo information of the channel.

The additivity of Holevo information for all channels was a famous open conjecture in quantum information theory, but it is now known that this conjecture doesn't hold in general. This was proved by showing that the additivity of minimum output entropy for all channels doesn't hold,[2] which is an equivalent conjecture.

Nonetheless, the additivity of the Holevo information is shown to hold for the quantum depolarizing channel,[3] and an outline of the proof is given below. As a consequence, entanglement across multiple uses of the channel cannot increase the classical capacity. In this sense, the channel behaves like a classical channel. To achieve the optimal rate of communication, it suffices to choose an orthonormal basis to encode the message, and perform measurements that project onto to the same basis at the receiving end.

Outline of the proof of the additivity of Holevo information

The additivity of Holevo information for the depolarizing channel was proved by Christopher King.[3] He showed that the maximum output p-norm of the depolarizing channel is multiplicative, which implied the additivity of the minimum output entropy, which is equivalent to the additivity of the Holevo information.

A stronger version of the additivity of the Holevo information is shown for the depolarizing channel [math]\displaystyle{ \Delta_\lambda }[/math]. For any channel [math]\displaystyle{ \Psi }[/math]:

[math]\displaystyle{ \chi\left(\Delta_\lambda\otimes\Psi\right)=\chi\left(\Delta_\lambda\right)+\chi\left(\Psi\right) }[/math]

This is implied by the following multiplicativity of maximum output p-norm (denoted as [math]\displaystyle{ v_p }[/math]):

[math]\displaystyle{ v_p\left(\Delta_\lambda\otimes\Psi\right)=v_p\left(\Delta_\lambda\right)v_p\left(\Psi\right) }[/math]

The greater than or equal to direction of the above is trivial, it suffices to take the tensor product the states that achieve the maximum p-norm for [math]\displaystyle{ \Delta_\lambda }[/math] and [math]\displaystyle{ \Psi }[/math] respectively, and input the product state into the product channel to get the output p-norm [math]\displaystyle{ v_p(\Delta_\lambda)v_p(\Psi) }[/math]. The proof for the other direction is more involved

The main idea of the proof is to rewrite the depolarizing channel as a convex combination of simpler channels, and use properties of those simpler channels to get the multiplicativity of the maximum output p-norm for the depolarizing channel.

It turns out that we can write the depolarizing channel as follows:

[math]\displaystyle{ \Delta_\lambda(\rho)=\sum_{n=1}^{2d^2(d+1)}c_nU_n^*\Phi_\lambda^{(n)}(\rho)Un }[/math]

where [math]\displaystyle{ c_n }[/math]'s are positive numbers, [math]\displaystyle{ U_n }[/math]'s are unitary matrices, [math]\displaystyle{ \Phi^{(n)}_\lambda }[/math]'s are some dephasing channels and [math]\displaystyle{ \rho }[/math] is an arbitrary input state.

Therefore, the product channel can be written as:

[math]\displaystyle{ \left(\Delta_\lambda\otimes\Psi\right)(\rho)=\sum_{n=1}^{2d^2(d+1)}c_n\left(U_n^*\otimes I\right)\left(\Phi_\lambda^{(n)}\otimes\Psi\right)(\rho)\left(U_n\otimes I\right) }[/math]

By the convexity and the unitary invariance of the p-norm, it suffices to show the simpler bound:

[math]\displaystyle{ \|\left(\Phi^{(n)}_\lambda\otimes\Psi\right)(\rho)\|_p\le v_p(\Delta_\lambda)v_p(\Psi) }[/math]

One important mathematical tool used in the proof of this bound is the Lieb–Thirring inequality, which provides a bound for p-norm of a product of positive matrices. The details and the calculations of the proof are skipped, interested readers are referred to the paper of C. King mentioned above.

Discussion

The main technique used in this proof, namely rewriting the channel of interest as a convex combination of other simpler channels, is a generalization of the method used earlier to prove similar results for unital qubit channels.[4]

The fact that the classical capacity of the depolarizing channel is equal to the Holevo information of the channel means that we can't really use quantum effects such as entanglement to improve the transmission rate of classical information. In this sense, the depolarizing channel can be treated as a classical channel.

However the fact that the additivity of Holevo information doesn't hold in general proposes some areas of future work, namely finding channels that violates the additivity, in other words, channels that can exploit quantum effects to improve the classical capacity beyond its Holevo information.

Notes

  1. Michael A. Nielsen and Isaac L. Chuang (2000). Quantum Computation and Quantum Information. Cambridge University Press. 
  2. Hastings 2009.
  3. 3.0 3.1 King 2003.
  4. C. King, Additivity for unital qubit channels

References