Physics:Von Neumann entropy

From HandWiki
Short description: Type of entropy in quantum theory
John von Neumann, whom the concept is named after

In physics, the von Neumann entropy, named after John von Neumann, is a measure of the statistical uncertainty within a description of a quantum system. It extends the concept of Gibbs entropy from classical statistical mechanics to quantum statistical mechanics, and it is the quantum counterpart of the Shannon entropy from classical information theory. For a quantum-mechanical system described by a density matrix ρ, the von Neumann entropy is({{{1}}}, {{{2}}}) S=tr(ρlnρ), where tr denotes the trace and ln denotes the matrix version of the natural logarithm. If the density matrix ρ is written in a basis of its eigenvectors |1,|2,|3, as ρ=jηj|jj|, then the von Neumann entropy is merely S=jηjlnηj. In this form, S can be seen as the Shannon entropy of the eigenvalues, reinterpreted as probabilities.({{{1}}}, {{{2}}})

The von Neumann entropy and quantities based upon it are widely used in the study of quantum entanglement.[1]

Fundamentals

In quantum mechanics, probabilities for the outcomes of experiments made upon a system are calculated from the quantum state describing that system. Each physical system is associated with a vector space, or more specifically a Hilbert space. The dimension of the Hilbert space may be infinite, as it is for the space of square-integrable functions on a line, which is used to define the quantum physics of a continuous degree of freedom. Alternatively, the Hilbert space may be finite-dimensional, as occurs for spin degrees of freedom. A density operator, the mathematical representation of a quantum state, is a positive semi-definite, self-adjoint operator of trace one acting on the Hilbert space of the system.[2][3][4] A density operator that is a rank-1 projection is known as a pure quantum state, and all quantum states that are not pure are designated mixed. Pure states are also known as wavefunctions. Assigning a pure state to a quantum system implies certainty about the outcome of some measurement on that system (i.e., P(x)=1 for some outcome x). The state space of a quantum system is the set of all states, pure and mixed, that can be assigned to it. For any system, the state space is a convex set: Any mixed state can be written as a convex combination of pure states, though not in a unique way.[5] The von Neumann entropy quantifies the extent to which a state is mixed.[6]

The prototypical example of a finite-dimensional Hilbert space is a qubit, a quantum system whose Hilbert space is 2-dimensional. An arbitrary state for a qubit can be written as a linear combination of the Pauli matrices, which provide a basis for 2×2 self-adjoint matrices:({{{1}}}, {{{2}}}) ρ=12(I+rxσx+ryσy+rzσz), where the real numbers (rx,ry,rz) are the coordinates of a point within the unit ball and σx=(0110),σy=(0ii0),σz=(1001). The von Neumann entropy vanishes when ρ is a pure state, i.e., when the point (rx,ry,rz) lies on the surface of the unit ball, and it attains its maximum value when ρ is the maximally mixed state, which is given by rx=ry=rz=0.({{{1}}}, {{{2}}})

Properties

Some properties of the von Neumann entropy:

  • S(ρ) is zero if and only if ρ represents a pure state.({{{1}}}, {{{2}}})
  • S(ρ) is maximal and equal to lnN for a maximally mixed state, N being the dimension of the Hilbert space.({{{1}}}, {{{2}}})
  • S(ρ) is invariant under changes in the basis of ρ, that is, S(ρ) = S(UρU), with U a unitary transformation.({{{1}}}, {{{2}}})
  • S(ρ) is concave, that is, given a collection of positive numbers λi which sum to unity (Σiλi=1) and density operators ρi, we have({{{1}}}, {{{2}}})

S(i=1kλiρi)i=1kλiS(ρi).

  • S(ρ) is additive for independent systems. Given two density matrices ρA , ρB describing independent systems A and B, we have({{{1}}}, {{{2}}})

S(ρAρB)=S(ρA)+S(ρB).

  • S(ρ) is strongly subadditive. That is, for any three systems A, B, and C:[7]

S(ρABC)+S(ρB)S(ρAB)+S(ρBC).

This automatically means that S(ρ) is subadditive:

S(ρAC)S(ρA)+S(ρC).

Below, the concept of subadditivity is discussed, followed by its generalization to strong subadditivity.

Subadditivity

If ρA, ρB are the reduced density matrices of the general state ρAB, then |S(ρA)S(ρB)|S(ρAB)S(ρA)+S(ρB).

The right hand inequality is known as subadditivity, and the left is sometimes known as the triangle inequality.[8] While in Shannon's theory the entropy of a composite system can never be lower than the entropy of any of its parts, in quantum theory this is not the case; i.e., it is possible that S(ρAB) = 0, while S(ρA) = S(ρB) > 0. This is expressed by saying that the Shannon entropy is monotonic but the von Neumann entropy is not.[9] For example, take the Bell state of two spin-1/2 particles: |ψ=|+|. This is a pure state with zero entropy, but each spin has maximum entropy when considered individually, because its reduced density matrix is the maximally mixed state. This indicates that it is an entangled state;({{{1}}}, {{{2}}}) the use of entropy as an entanglement measure is discussed further below.

Strong subadditivity

The von Neumann entropy is also strongly subadditive.[10] Given three Hilbert spaces, A, B, C, S(ρABC)+S(ρB)S(ρAB)+S(ρBC). By using the proof technique that establishes the left side of the triangle inequality above, one can show that the strong subadditivity inequality is equivalent to the following inequality: S(ρA)+S(ρC)S(ρAB)+S(ρBC) where ρAB, etc. are the reduced density matrices of a density matrix ρABC.[11] If we apply ordinary subadditivity to the left side of this inequality, we then find S(ρAC)S(ρAB)+S(ρBC). By symmetry, for any tripartite state ρABC, each of the three numbers S(ρAB), S(ρBC), S(ρAC) is less than or equal to the sum of the other two.[12]

Minimum Shannon entropy

Given a quantum state and a specification of a quantum measurement, we can calculate the probabilities for the different possible results of that measurement, and thus we can find the Shannon entropy of that probability distribution. A quantum measurement can be specified mathematically as a positive operator valued measure, or POVM.[13] In the simplest case, a system with a finite-dimensional Hilbert space and measurement with a finite number of outcomes, a POVM is a set of positive semi-definite matrices {Fi} on the Hilbert space that sum to the identity matrix,[14] i=1nFi=I. The POVM element Fi is associated with the measurement outcome i, such that the probability of obtaining it when making a measurement on the quantum state ρ is given by Prob(i)=tr(ρFi). A POVM is rank-1 if all of the elements are proportional to rank-1 projection operators. The von Neumann entropy is the minimum achievable Shannon entropy, where the minimization is taken over all rank-1 POVMs.[15]

Holevo χ quantity

If ρi are density operators and λi is a collection of positive numbers which sum to unity (Σiλi=1), then ρ=i=1kλiρi is a valid density operator, and the difference between its von Neumann entropy and the weighted average of the entropies of the ρi is bounded by the Shannon entropy of the λi: S(i=1kλiρi)i=1kλiS(ρi)i=1kλilogλi. Equality is attained when the supports of the ρi – the spaces spanned by their eigenvectors corresponding to nonzero eigenvalues – are orthogonal. The difference on the left-hand side of this inequality is known as the Holevo χ quantity and also appears in Holevo's theorem, an important result in quantum information theory.[16]

Change under time evolution

Unitary

The time evolution of an isolated system is described by a unitary operator: ρUρU. Unitary evolution takes pure states into pure states,({{{1}}}, {{{2}}}) and it leaves the von Neumann entropy unchanged. This follows from the fact that the entropy of ρ is a function of the eigenvalues of ρ.({{{1}}}, {{{2}}})

Measurement

A measurement upon a quantum system will generally bring about a change of the quantum state of that system. Writing a POVM does not provide the complete information necessary to describe this state-change process.[17] To remedy this, further information is specified by decomposing each POVM element into a product: Ei=AiAi. The Kraus operators Ai, named for Karl Kraus, provide a specification of the state-change process. They are not necessarily self-adjoint, but the products AiAi are. If upon performing the measurement the outcome Ei is obtained, then the initial state ρ is updated to ρρ=AiρAiProb(i)=AiρAitr(ρEi). An important special case is the Lüders rule, named for Gerhart Lüders.[18][19] If the POVM elements are projection operators, then the Kraus operators can be taken to be the projectors themselves: ρρ=ΠiρΠitr(ρΠi). If the initial state ρ is pure, and the projectors Πi have rank 1, they can be written as projectors onto the vectors |ψ and |i, respectively. The formula simplifies thus to ρ=|ψψ|ρ=|ii|ψψ|ii||i|ψ|2=|ii|. We can define a linear, trace-preserving, completely positive map, by summing over all the possible post-measurement states of a POVM without the normalisation: ρiAiρAi. It is an example of a quantum channel,[20] and can be interpreted as expressing how a quantum state changes if a measurement is performed but the result of that measurement is lost.[21] Channels defined by projective measurements can never decrease the von Neumann entropy; they leave the entropy unchanged only if they do not change the density matrix.[22] A quantum channel will increase or leave constant the von Neumann entropy of every input state if and only if the channel is unital, i.e., if it leaves fixed the maximally mixed state. An example of a channel that decreases the von Neumann entropy is the amplitude damping channel for a qubit, which sends all mixed states towards a pure state.[23]

Thermodynamic meaning

The quantum version of the canonical distribution, the Gibbs states, are found by maximizing the von Neumann entropy under the constraint that the expected value of the Hamiltonian is fixed. A Gibbs state is a density operator with the same eigenvectors as the Hamiltonian, and its eigenvalues are λi=1Zexp(EikBT), where T is the temperature, kB is the Boltzmann constant, and Z is the partition function.({{{1}}}, {{{2}}})[24] The von Neumann entropy of a Gibbs state is, up to a factor kB, the thermodynamic entropy.({{{1}}}, {{{2}}})

Generalizations and derived quantities

Conditional entropy

Let ρAB be a joint state for the bipartite quantum system AB. Then the conditional von Neumann entropy S(A|B) is the difference between the entropy of ρAB and the entropy of the marginal state for subsystem B alone: S(A|B)=S(ρAB)S(ρB). This is bounded above by S(ρA). In other words, conditioning the description of subsystem A upon subsystem B cannot increase the entropy associated with A.[25]

Quantum mutual information can be defined as the difference between the entropy of the joint state and the total entropy of the marginals: S(A:B)=S(ρA)+S(ρB)S(ρAB), which can also be expressed in terms of conditional entropy:[26] S(A:B)=S(A)S(A|B)=S(B)S(B|A).

Relative entropy

Let ρ and σ be two density operators in the same state space. The relative entropy is defined to be S(σ|ρ)=tr[ρ(logρlogσ)]. The relative entropy is always greater than or equal to zero; it equals zero if and only if ρ=σ.({{{1}}}, {{{2}}}) Unlike the von Neumann entropy itself, the relative entropy is monotonic, in that it decreases (or remains constant) when part of a system is traced over:[27] S(σA|ρA)S(σAB|ρAB).

Entanglement measures

Just as energy is a resource that facilitates mechanical operations, entanglement is a resource that facilitates performing tasks that involve communication and computation.({{{1}}}, {{{2}}}) The mathematical definition of entanglement can be paraphrased as saying that maximal knowledge about the whole of a system does not imply maximal knowledge about the individual parts of that system.[28] If the quantum state that describes a pair of particles is entangled, then the results of measurements upon one half of the pair can be strongly correlated with the results of measurements upon the other. However, entanglement is not the same as "correlation" as understood in classical probability theory and in daily life. Instead, entanglement can be thought of as potential correlation that can be used to generate actual correlation in an appropriate experiment.[29] The state of a composite system is always expressible as a sum, or superposition, of products of states of local constituents; it is entangled if this sum cannot be written as a single product term.[30] Entropy provides one tool that can be used to quantify entanglement.[31][32] If the overall system is described by a pure state, the entropy of one subsystem can be used to measure its degree of entanglement with the other subsystems. For bipartite pure states, the von Neumann entropy of reduced states is the unique measure of entanglement in the sense that it is the only function on the family of states that satisfies certain axioms required of an entanglement measure.[33][34] It is thus known as the entanglement entropy.[35]

It is a classical result that the Shannon entropy achieves its maximum at, and only at, the uniform probability distribution {1/n, ..., 1/n}.[36] Therefore, a bipartite pure state ρHAHB is said to be a maximally entangled state if the reduced state of each subsystem of ρ is the diagonal matrix[37] (1n1n).

For mixed states, the reduced von Neumann entropy is not the only reasonable entanglement measure.({{{1}}}, {{{2}}}) Some of the other measures are also entropic in character. For example, the relative entropy of entanglement is given by minimizing the relative entropy between a given state ρ and the set of nonentangled, or separable, states.[38] The entanglement of formation is defined by minimizing, over all possible ways of writing of ρ as a convex combination of pure states, the average entanglement entropy of those pure states.[39] The squashed entanglement is based on the idea of extending a bipartite state ρAB to a state describing a larger system, ρABE, such that the partial trace of ρABE over E yields ρAB. One then finds the infimum of the quantity 12[S(ρAE)+S(ρBE)S(ρE)S(ρABE)], over all possible choices of ρABE.[40]

Quantum Rényi entropies

Just as the Shannon entropy function is one member of the broader family of classical Rényi entropies, so too can the von Neumann entropy be generalized to the quantum Rényi entropies: Sα(ρ)=11αln[trρα]=11αlni=1Nλiα. In the limit that α1, this recovers the von Neumann entropy. The quantum Rényi entropies are all additive for product states, and for any α, the Rényi entropy Sα vanishes for pure states and is maximized by the maximally mixed state. For any given state ρ, Sα(ρ) is a continuous, nonincreasing function of the parameter α. A weak version of subadditivity can be proven: Sα(ρA)S0(ρB)Sα(ρAB)Sα(ρA)+S0(ρB). Here, S0 is the quantum version of the Hartley entropy, i.e., the logarithm of the rank of the density matrix.[41]

History

The density matrix was introduced, with different motivations, by von Neumann and by Lev Landau. The motivation that inspired Landau was the impossibility of describing a subsystem of a composite quantum system by a state vector.[42] On the other hand, von Neumann introduced the density matrix in order to develop both quantum statistical mechanics and a theory of quantum measurements.[43] He introduced the expression now known as von Neumann entropy by arguing that a probabilistic combination of pure states is analogous to a mixture of ideal gases.[44][45] Von Neumann first published on the topic in 1927.[46] His argument was built upon earlier work by Albert Einstein and Leo Szilard.[47][48][49]

Max Delbrück and Gert Molière proved the concavity and subadditivity properties of the von Neumann entropy in 1936. Quantum relative entropy was introduced by Hisaharu Umegaki in 1962.[50][51] The subadditivity and triangle inequalities were proved in 1970 by Huzihiro Araki and Elliott H. Lieb.[52][53] using a matrix inequality proved earlier by Lieb.[54][55]

References

  1. Nielsen & Chuang 2010, p. 700.
  2. Fano, U. (1957). "Description of States in Quantum Mechanics by Density Matrix and Operator Techniques". Reviews of Modern Physics 29 (1): 74–93. doi:10.1103/RevModPhys.29.74. Bibcode1957RvMP...29...74F. 
  3. Holevo 2001, pp. 1, 15.
  4. Hall, Brian C. (2013). "Systems and Subsystems, Multiple Particles". Quantum Theory for Mathematicians. Graduate Texts in Mathematics. 267. Springer. pp. 419–440. doi:10.1007/978-1-4614-7116-5_19. ISBN 978-1-4614-7115-8. 
  5. Kirkpatrick, K. A. (February 2006). "The Schrödinger-HJW Theorem". Foundations of Physics Letters 19 (1): 95–102. doi:10.1007/s10702-006-1852-1. ISSN 0894-9875. Bibcode2006FoPhL..19...95K. 
  6. Holevo 2001, p. 15.
  7. Nielsen & Chuang 2010, p. 519.
  8. Nielsen & Chuang 2010, pp. 515–516.
  9. Bengtsson & Życzkowski 2017, p. 361.
  10. Bengtsson & Życzkowski 2017, p. 364.
  11. Ruskai, Mary Beth (2002). "Inequalities for Quantum Entropy: A Review with Conditions for Equality". Journal of Mathematical Physics 43: 4358–4375. doi:10.1063/1.1497701. 
  12. Lieb, Elliott H. (January 1975). "Some convexity and subadditivity properties of entropy". Bulletin of the American Mathematical Society 81 (1). doi:10.1090/S0002-9904-1975-13621-4. 
  13. Peres, Asher; Terno, Daniel R. (2004). "Quantum information and relativity theory". Reviews of Modern Physics 76 (1): 93–123. doi:10.1103/RevModPhys.76.93. Bibcode2004RvMP...76...93P. 
  14. Nielsen & Chuang 2010, p. 90.
  15. Wilde 2017, §11.1.2.
  16. Nielsen & Chuang 2010, pp. 531–534.
  17. Wilde 2017, §4.2.1.
  18. Lüders, Gerhart (1950). "Über die Zustandsänderung durch den Messprozeß". Annalen der Physik 443 (5–8): 322. doi:10.1002/andp.19504430510. Bibcode1950AnP...443..322L.  Translated by K. A. Kirkpatrick as Lüders, Gerhart (3 April 2006). "Concerning the state-change due to the measurement process". Annalen der Physik 15 (9): 663–670. doi:10.1002/andp.200610207. Bibcode2006AnP...518..663L. 
  19. Busch, Paul; Lahti, Pekka (2009), Greenberger, Daniel; Hentschel, Klaus; Weinert, Friedel, eds., "Lüders Rule" (in en), Compendium of Quantum Physics (Springer Berlin Heidelberg): pp. 356–358, doi:10.1007/978-3-540-70626-7_110, ISBN 978-3-540-70622-9 
  20. Wilde 2017, §4.4.1.
  21. Wilde 2017, §4.5.1.
  22. Nielsen & Chuang 2010, p. 515.
  23. Bengtsson & Życzkowski 2017, pp. 380–381.
  24. Yunger Halpern, Nicole (2016-07-07). "Microcanonical and resource-theoretic derivations of the thermal state of a quantum system with noncommuting charges". Nature Communications 7. doi:10.1038/ncomms12051. https://authors.library.caltech.edu/69093/1/ncomms12051.pdf. 
  25. Wilde 2017, §11.4.
  26. Nielsen & Chuang 2010, p. 514.
  27. Nielsen & Chuang 2010, p. 524.
  28. Rau 2021, p. 131.
  29. Fuchs, Christopher A. (6 January 2011). Coming of Age with Quantum Information. Cambridge University Press. p. 130. ISBN 978-0-521-19926-1. 
  30. Rieffel & Polak 2011, p. 39.
  31. Plenio, Martin B.; Virmani, Shashank (2007). "An introduction to entanglement measures". Quantum Information and Computation 1: 1–51. Bibcode2005quant.ph..4163P. 
  32. Vedral, Vlatko (2002). "The role of relative entropy in quantum information theory". Reviews of Modern Physics 74 (1): 197–234. doi:10.1103/RevModPhys.74.197. Bibcode2002RvMP...74..197V. 
  33. Holevo 2001, p. 31.
  34. Hill, S.; Wootters, W. K. (1997). "Entanglement of a Pair of Quantum Bits". Physical Review Letters 78 (26): 5022–5025. doi:10.1103/PhysRevLett.78.5022. Bibcode1997PhRvL..78.5022H. 
  35. Bengtsson & Życzkowski 2017, p. 447.
  36. Nielsen & Chuang 2010, p. 505.
  37. Enríquez, M.; Wintrowicz, I.; Życzkowski, K. (March 2016). "Maximally Entangled Multipartite States: A Brief Survey". Journal of Physics: Conference Series 698 (1). doi:10.1088/1742-6596/698/1/012003. Bibcode2016JPhCS.698a2003E. https://iopscience.iop.org/article/10.1088/1742-6596/698/1/012003. 
  38. Bengtsson & Życzkowski 2017, p. 474.
  39. Bengtsson & Życzkowski 2017, p. 475.
  40. Bengtsson & Życzkowski 2017, p. 477.
  41. Bengtsson & Życzkowski 2017, pp. 369–370.
  42. Landau, L. (1927). "Das Daempfungsproblem in der Wellenmechanik". Zeitschrift für Physik 45 (5–6): 430–464. doi:10.1007/BF01343064. Bibcode1927ZPhy...45..430L. 
  43. Von Neumann, John (1932). Mathematische Grundlagen der Quantenmechanik. Berlin: Springer. ISBN 3-540-59207-5. ; Von Neumann, John (1955). Mathematical Foundations of Quantum Mechanics. Princeton University Press. ISBN 978-0-691-02893-4. 
  44. Peres 1993, p. 271.
  45. Henderson, Leah (2003). "The Von Neumann Entropy: A reply to Shenker". British Journal for the Philosophy of Science 54: 291–296. doi:10.1093/bjps/54.2.291. 
  46. Petz, D. (2001). "Entropy, von Neumann and the von Neumann entropy". John von Neumann and the Foundations of Quantum Physics. Kluwer. pp. 83–96. doi:10.1007/978-94-017-2012-0_7. ISBN 978-0-7923-6812-0. 
  47. Einstein, Albert (1914). "Beiträge zur Quantentheorie". Deutsche Physikalische Gesellschaft. Verhandlungen 16: 820.  Translated in The Collected Papers of Albert Einstein, Volume 6. Princeton University Press. 1997. pp. 20–26. ISBN 0-691-01734-4. 
  48. Szilard, Leo (December 1925). "Über die Ausdehnung der phänomenologischen Thermodynamik auf die Schwankungserscheinungen". Zeitschrift für Physik 32: 753–788. doi:10.1007/BF01331713. 
  49. Wehrl, Alfred (April 1978). "General properties of entropy". Reviews of Modern Physics 50 (2): 221–260. doi:10.1103/RevModPhys.50.221. 
  50. Bengtsson & Życzkowski 2017, pp. 361, 365.
  51. Delbrück, Max; Harding, Carolyn. "Max Delbrück Oral History Interview". https://resolver.caltech.edu/CaltechOH:OH_Delbruck_M. 
  52. "Proof of the Strong Subadditivity of Quantum-Mechanical Entropy". Journal of Mathematical Physics 14 (12): 1938–1941. 1973. doi:10.1063/1.1666274. Bibcode1973JMP....14.1938L. http://www.numdam.org/item/RCP25_1973__19__A5_0/. 
  53. Ruskai, Mary Beth (10 January 2014). "Evolution of a Fundamental Theorem on Quantum Entropy". World Scientific. https://www.youtube.com/watch?v=P3-xI1u1Y2s.  Invited talk at the Conference in Honour of the 90th Birthday of Freeman Dyson, Institute of Advanced Studies, Nanyang Technological University, Singapore, 26–29 August 2013.
  54. "Convex Trace Functions and the Wigner–Yanase–Dyson Conjecture". Advances in Mathematics 11 (3): 267–288. 1973. doi:10.1016/0001-8708(73)90011-X. 
  55. Nielsen & Chuang 2010, p. 527.
  • Bengtsson, Ingemar; Życzkowski, Karol (2017). Geometry of Quantum States: An Introduction to Quantum Entanglement (2nd ed.). Cambridge University Press. ISBN 978-1-107-02625-4. 
  • Holevo, Alexander S. (2001). Statistical Structure of Quantum Theory. Lecture Notes in Physics. Monographs. Springer. ISBN 3-540-42082-7. 
  • Nielsen, Michael A.; Chuang, Isaac L. (2010). Quantum Computation and Quantum Information (10th anniversary ed.). Cambridge: Cambridge Univ. Press. ISBN 978-0-521-63503-5. 
  • Peres, Asher (1993). Quantum Theory: Concepts and Methods. Kluwer. ISBN 0-7923-2549-4. 
  • Rau, Jochen (2017). Statistical Physics and Thermodynamics. Oxford University Press. ISBN 978-0-19-959506-8. 
  • Rau, Jochen (2021). Quantum Theory: An Information Processing Approach. Oxford University Press. ISBN 978-0-19-289630-8. 
  • Rieffel, Eleanor; Polak, Wolfgang (2011). Quantum Computing: A Gentle Introduction. Scientific and engineering computation. Cambridge, Mass: MIT Press. ISBN 978-0-262-01506-6. 
  • Wilde, Mark M. (2017). Quantum Information Theory (2nd ed.). Cambridge University Press. doi:10.1017/9781316809976. ISBN 978-1-316-80997-6. 
  • Zwiebach, Barton (2022). Mastering Quantum Mechanics: Essentials, Theory, and Applications. MIT Press. ISBN 978-0-262-04613-8.