Physics:Thomas precession

From HandWiki
Short description: Relativistic correction
Llewellyn Thomas (1903 – 1992)

In physics, the Thomas precession, named after Llewellyn Thomas, is a relativistic correction that applies to the spin of an elementary particle or the rotation of a macroscopic gyroscope and relates the angular velocity of the spin of a particle following a curvilinear orbit to the angular velocity of the orbital motion.

For a given inertial frame, if a second frame is Lorentz-boosted relative to it, and a third boosted relative to the second, but non-colinear with the first boost, then the Lorentz transformation between the first and third frames involves a combined boost and rotation, known as the "Wigner rotation" or "Thomas rotation". For accelerated motion, the accelerated frame has an inertial frame at every instant. Two boosts a small time interval (as measured in the lab frame) apart leads to a Wigner rotation after the second boost. In the limit the time interval tends to zero, the accelerated frame will rotate at every instant, so the accelerated frame rotates with an angular velocity.

The precession can be understood geometrically as a consequence of the fact that the space of velocities in relativity is hyperbolic, and so parallel transport of a vector (the gyroscope's angular velocity) around a circle (its linear velocity) leaves it pointing in a different direction, or understood algebraically as being a result of the non-commutativity of Lorentz transformations. Thomas precession gives a correction to the spin–orbit interaction in quantum mechanics, which takes into account the relativistic time dilation between the electron and the nucleus of an atom.

Thomas precession is a kinematic effect in the flat spacetime of special relativity. In the curved spacetime of general relativity, Thomas precession combines with a geometric effect to produce de Sitter precession. Although Thomas precession (net rotation after a trajectory that returns to its initial velocity) is a purely kinematic effect, it only occurs in curvilinear motion and therefore cannot be observed independently of some external force causing the curvilinear motion such as that caused by an electromagnetic field, a gravitational field or a mechanical force, so Thomas precession is usually accompanied by dynamical effects.[1]

If the system experiences no external torque, e.g., in external scalar fields, its spin dynamics is determined only by the Thomas precession. A single discrete Thomas rotation (as opposed to the series of infinitesimal rotations that add up to the Thomas precession) is present in situations anytime there are three or more inertial frames in non-collinear motion, as can be seen using Lorentz transformations.


Thomas precession in relativity was already known to Ludwik Silberstein,[2] in 1914. But the only knowledge Thomas had of relativistic precession came from de Sitter's paper on the relativistic precession of the moon, first published in a book by Eddington.[3]

In 1925 Thomas relativistically recomputed the precessional frequency of the doublet separation in the fine structure of the atom. He thus found the missing factor 1/2, which came to be known as the Thomas half.

This discovery of the relativistic precession of the electron spin led to the understanding of the significance of the relativistic effect. The effect was consequently named "Thomas precession".



Consider a physical system moving through Minkowski spacetime. Assume that there is at any moment an inertial system such that in it, the system is at rest. This assumption is sometimes called the third postulate of relativity.[4] This means that at any instant, the coordinates and state of the system can be Lorentz transformed to the lab system through some Lorentz transformation.

Let the system be subject to external forces that produce no torque with respect to its center of mass in its (instantaneous) rest frame. The condition of "no torque" is necessary to isolate the phenomenon of Thomas precession. As a simplifying assumption one assumes that the external forces bring the system back to its initial velocity after some finite time. Fix a Lorentz frame O such that the initial and final velocities are zero.

The Pauli–Lubanski spin vector Sμ is defined to be (0, Si) in the system's rest frame, with Si the angular-momentum three-vector about the center of mass. In the motion from initial to final position, Sμ undergoes a rotation, as recorded in O, from its initial to its final value. This continuous change is the Thomas precession.[5]


Value of γ2/(γ + 1) as β = v/c increases, with v the instantaneous magnitude of the particle's velocity. The Thomas rotation is negligible for β < 0.5, increases steadily for 0.5 < β < 0.8, then rapidly shoots to infinity as β tends to 1. The "Thomas half" is evident in the low-speed limit, and the rotation is only very clear for speeds approaching that of light.

Consider the motion of a particle. Introduce a lab frame Σ in which an observer can measure the relative motion of the particle. At each instant of time the particle has an inertial frame in which it is at rest. Relative to this lab frame, the instantaneous velocity of the particle is v(t) with magnitude |v| = v bounded by the speed of light c, so that 0 ≤ v < c. Here the time t is the coordinate time as measured in the lab frame, not the proper time of the particle.

Apart from the upper limit on magnitude, the velocity of the particle is arbitrary and not necessarily constant, its corresponding vector of acceleration is a = dv(t)/dt. As a result of the Wigner rotation at every instant, the particle's frame precesses with an angular velocity given by the[6][7][8][9]

Thomas precession

[math]\displaystyle{ \boldsymbol{\omega}_\text{T} = \frac{1}{c^2}\left(\frac{\gamma^2}{\gamma+1}\right)\mathbf{a}\times\mathbf{v} }[/math]

where × is the cross product and

[math]\displaystyle{ \gamma = \dfrac{1}{\sqrt{1-\dfrac{|\mathbf{v}(t)|^2}{c^2}}} }[/math]

is the instantaneous Lorentz factor, a function of the particle's instantaneous velocity. Like any angular velocity, ωT is a pseudovector; its magnitude is the angular speed the particle's frame precesses (in radians per second), and the direction points along the rotation axis. As is usual, the right-hand convention of the cross product is used (see right-hand rule).

The precession depends on accelerated motion, and the non-collinearity of the particle's instantaneous velocity and acceleration. No precession occurs if the particle moves with uniform velocity (constant v so a = 0), or accelerates in a straight line (in which case v and a are parallel or antiparallel so their cross product is zero). The particle has to move in a curve, say an arc, spiral, helix, or a circular orbit or elliptical orbit, for its frame to precess. The angular velocity of the precession is a maximum if the velocity and acceleration vectors are perpendicular throughout the motion (a circular orbit), and is large if their magnitudes are large (the magnitude of v is almost c).

In the non-relativistic limit, v0 so γ → 1, and the angular velocity is approximately

[math]\displaystyle{ \boldsymbol{\omega}_\text{T} \approx \frac{1}{2c^2}\mathbf{a}\times\mathbf{v} }[/math]

The factor of 1/2 turns out to be the critical factor to agree with experimental results. It is informally known as the "Thomas half".

Mathematical explanation

Lorentz transformations

The description of relative motion involves Lorentz transformations, and it is convenient to use them in matrix form; symbolic matrix expressions summarize the transformations and are easy to manipulate, and when required the full matrices can be written explicitly. Also, to prevent extra factors of c cluttering the equations, it is convenient to use the definition β(t) = v(t)/c with magnitude |β| = β such that 0 ≤ β < 1.

The spacetime coordinates of the lab frame are collected into a 4×1 column vector, and the boost is represented as a 4×4 symmetric matrix, respectively

[math]\displaystyle{ X = \begin{bmatrix} c t \\ x \\ y \\ z \end{bmatrix}\,,\quad B(\boldsymbol{\beta}) = \begin{bmatrix} \gamma&-\gamma\beta_x &-\gamma\beta_y&-\gamma\beta_z\\ -\gamma\beta_x &1+(\gamma-1)\dfrac{\beta_x^2}{\beta^2} &(\gamma-1)\dfrac{\beta_x \beta_y}{\beta^2} &(\gamma-1)\dfrac{\beta_x \beta_z}{\beta^2} \\ -\gamma\beta_y &(\gamma-1)\dfrac{\beta_y \beta_x}{\beta^2} &1+(\gamma-1)\dfrac{\beta_y^2}{\beta^2} &(\gamma-1)\dfrac{\beta_y \beta_z}{\beta^2} \\ -\gamma\beta_z&(\gamma-1)\dfrac{\beta_z \beta_x}{\beta^2} &(\gamma-1)\dfrac{\beta_z \beta_y}{\beta^2} & 1+(\gamma-1)\dfrac{\beta_z^2}{\beta^2} \\ \end{bmatrix} }[/math]

and turn

[math]\displaystyle{ \gamma = \frac{1}{\sqrt{1-|\boldsymbol{\beta}|^2}} }[/math]

is the Lorentz factor of β. In other frames, the corresponding coordinates are also arranged into column vectors. The inverse matrix of the boost corresponds to a boost in the opposite direction, and is given by B(β)−1 = B(−β).

At an instant of lab-recorded time t measured in the lab frame, the transformation of spacetime coordinates from the lab frame Σ to the particle's frame Σ is

[math]\displaystyle{ X' = B(\boldsymbol{\beta})X }[/math]






and at later lab-recorded time t + Δt we can define a new frame Σ′′ for the particle, which moves with velocity β + Δβ relative to Σ, and the corresponding boost is

[math]\displaystyle{ X'' = B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta})X }[/math]






The vectors β and Δβ are two separate vectors. The latter is a small increment, and can be conveniently split into components parallel (‖) and perpendicular (⊥) to β[nb 1]

[math]\displaystyle{ \Delta\boldsymbol{\beta} = \Delta\boldsymbol{\beta}_\parallel + \Delta\boldsymbol{\beta}_\perp }[/math]

Combining (1) and (2) obtains the Lorentz transformation between Σ′ and Σ′′,

[math]\displaystyle{ X'' = B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta})B(-\boldsymbol{\beta})X' \,, }[/math]






and this composition contains all the required information about the motion between these two lab times. Notice B(β + Δβ)B(−β) and B(β + Δβ) are infinitesimal transformations because they involve a small increment in the relative velocity, while B(−β) is not.

The composition of two boosts equates to a single boost combined with a Wigner rotation about an axis perpendicular to the relative velocities;

[math]\displaystyle{ \Lambda = B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta})B(-\boldsymbol{\beta}) = R(\Delta\boldsymbol{\theta})B(\Delta\mathbf{b}) }[/math]






The rotation is given by is a 4×4 rotation matrix R in the axis–angle representation, and coordinate systems are taken to be right-handed. This matrix rotates 3d vectors anticlockwise about an axis (active transformation), or equivalently rotates coordinate frames clockwise about the same axis (passive transformation). The axis-angle vector Δθ parametrizes the rotation, its magnitude Δθ is the angle Σ′′ has rotated, and direction is parallel to the rotation axis, in this case the axis is parallel to the cross product (−β)×(β + Δβ) = −β×Δβ. If the angles are negative, then the sense of rotation is reversed. The inverse matrix is given by Rθ)−1 = R(−Δθ).

Corresponding to the boost is the (small change in the) boost vector Δb, with magnitude and direction of the relative velocity of the boost (divided by c). The boost Bb) and rotation Rθ) here are infinitesimal transformations because Δb and rotation Δθ are small.

The rotation gives rise to the Thomas precession, but there is a subtlety. To interpret the particle's frame as a co-moving inertial frame relative to the lab frame, and agree with the non-relativistic limit, we expect the transformation between the particle's instantaneous frames at times t and t + Δt to be related by a boost without rotation. Combining (3) and (4) and rearranging gives

[math]\displaystyle{ B(\Delta\mathbf{b})X' = R(-\Delta\boldsymbol{\theta}) X'' = X''' \,, }[/math]






where another instantaneous frame Σ′′′ is introduced with coordinates X′′′, to prevent conflation with Σ′′. To summarize the frames of reference: in the lab frame Σ an observer measures the motion of the particle, and three instantaneous inertial frames in which the particle is at rest are Σ′ (at time t), Σ′′ (at time t + Δt), and Σ′′′ (at time t + Δt). The frames Σ′′ and Σ′′′ are at the same location and time, they differ only by a rotation. By contrast Σ′ and Σ′′′ differ by a boost and lab time interval Δt.

Relating the coordinates X′′′ to the lab coordinates X via (5) and (2);

[math]\displaystyle{ X''' = R(-\Delta\boldsymbol{\theta}) X'' = R(-\Delta\boldsymbol{\theta}) B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta}) X \,, }[/math]






the frame Σ′′′ is rotated in the negative sense.

The rotation is between two instants of lab time. As Δt → 0, the particle's frame rotates at every instant, and the continuous motion of the particle amounts to a continuous rotation with an angular velocity at every instant. Dividing −Δθ by Δt, and taking the limit Δt → 0, the angular velocity is by definition

[math]\displaystyle{ \boldsymbol{\omega}_T = -\lim_{\Delta t \rightarrow 0 }\frac{\Delta\boldsymbol{\theta}}{\Delta t} }[/math]






It remains to find what Δθ precisely is.

Extracting the formula

The composition can be obtained by explicitly calculating the matrix product. The boost matrix of β + Δβ will require the magnitude and Lorentz factor of this vector. Since Δβ is small, terms of "second order" |Δβ|2, βx)2, βy)2, ΔβxΔβy and higher are negligible. Taking advantage of this fact, the magnitude squared of the vector is

[math]\displaystyle{ |\boldsymbol{\beta}+\Delta\boldsymbol{\beta}|^2=|\boldsymbol{\beta}|^2+2\boldsymbol{\beta}\cdot\Delta\boldsymbol{\beta} }[/math]

and expanding the Lorentz factor of β + Δβ as a power series gives to first order in Δβ,

[math]\displaystyle{ \begin{align}\frac{1}{\sqrt{1-|\boldsymbol{\beta}+\Delta\boldsymbol{\beta}|^2}} & =1+\frac{1}{2}|\boldsymbol{\beta}+\Delta\boldsymbol{\beta}|^2+\frac{3}{8}|\boldsymbol{\beta}+\Delta\boldsymbol{\beta}|^4+\cdots \\ & = \left(1+\frac{|\boldsymbol{\beta}|^2}2+\frac{3}{8}|\boldsymbol{\beta}|^4+\cdots\right)+\left(1+\frac{3}2|\boldsymbol{\beta}|^2+\cdots\right)\boldsymbol{\beta}\cdot\Delta\boldsymbol{\beta} \\ & \approx \gamma + \gamma^3\boldsymbol{\beta}\cdot\Delta\boldsymbol{\beta} \end{align} }[/math]

using the Lorentz factor γ of β as above.

Composition of boosts in the xy plane

To simplify the calculation without loss of generality, take the direction of β to be entirely in the x direction, and Δβ in the xy plane, so the parallel component is along the x direction while the perpendicular component is along the y direction. The axis of the Wigner rotation is along the z direction. In the Cartesian basis ex, ey, ez, a set of mutually perpendicular unit vectors in their indicated directions, we have

[math]\displaystyle{ \boldsymbol{\beta} = \beta\mathbf{e}_x \,, \quad \Delta\boldsymbol{\beta}_\parallel = \Delta\beta_x\mathbf{e}_x \,, \quad \Delta\boldsymbol{\beta}_\perp = \Delta\beta_y\mathbf{e}_y \,,\quad \boldsymbol{\beta}\times\Delta\boldsymbol{\beta}= \beta\Delta\beta_y\mathbf{e}_z }[/math]

This simplified setup allows the boost matrices to be given explicitly with the minimum number of matrix entries. In general, of course, β and Δβ can be in any plane, the final result given later will not be different.

Explicitly, at time t the boost is in the negative x direction

[math]\displaystyle{ B(-\boldsymbol{\beta}) = \begin{bmatrix} \gamma& \gamma\beta & 0 & 0 \\ \gamma\beta & \gamma & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} }[/math]

and the boost at the time t + Δt is

[math]\displaystyle{ B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta})= \begin{bmatrix} \gamma + \gamma^3 \beta \Delta \beta_x & -(\gamma\beta + \gamma^3 \Delta \beta_x) & -\gamma\Delta\beta_y & 0 \\ -(\gamma\beta + \gamma^3 \Delta \beta_x) & \gamma + \gamma^3 \beta \Delta \beta_x & \left(\dfrac{\gamma-1}{\beta}\right)\Delta\beta_y & 0 \\ -\gamma\Delta\beta_y & \left(\dfrac{\gamma-1}{\beta}\right)\Delta\beta_y & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} }[/math]

where γ is the Lorentz factor of β, not β + Δβ. The composite transformation is then the matrix product

[math]\displaystyle{ \Lambda = B(\boldsymbol{\beta} + \Delta\boldsymbol{\beta})B(-\boldsymbol{\beta})= \begin{bmatrix} 1 & -\gamma^2 \Delta \beta_x & -\gamma\Delta\beta_y & 0 \\ -\gamma^2 \Delta \beta_x & 1 & \left(\dfrac{\gamma-1}{\beta}\right)\Delta\beta_y & 0 \\ -\gamma\Delta\beta_y & - \left(\dfrac{\gamma-1}{\beta}\right)\Delta\beta_y & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} }[/math]

Introducing the boost generators

[math]\displaystyle{ K_x = \begin{bmatrix} 0 &1 & 0 & 0 \\ 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ \end{bmatrix}\,,\quad K_y = \begin{bmatrix}0 & 0 & 1 & 0\\ 0 & 0 & 0 & 0\\ 1 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 \end{bmatrix}\,,\quad K_z = \begin{bmatrix}0 & 0 & 0 & 1\\ 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 0\\ 1 & 0 & 0 & 0 \end{bmatrix} }[/math]

and rotation generators

[math]\displaystyle{ J_x = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 0 \\ \end{bmatrix}\,,\quad J_y = \begin{bmatrix} 0 & 0 & 0 & 0\\ 0 & 0 & 0 & 1\\ 0 & 0 & 0 & 0\\ 0 & -1 & 0 & 0 \end{bmatrix}\,,\quad J_z = \begin{bmatrix} 0 & 0 & 0 & 0\\ 0 & 0 & -1 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 0 & 0 \end{bmatrix} }[/math]

along with the dot product · facilitates the coordinate independent expression

[math]\displaystyle{ \Lambda = I - \left(\frac{\gamma-1}{\beta^2}\right)(\boldsymbol{\beta}\times\Delta\boldsymbol{\beta})\cdot\mathbf{J} - \gamma(\gamma\Delta\boldsymbol{\beta}_\parallel+\Delta\boldsymbol{\beta}_\perp)\cdot\mathbf{K} }[/math]

which holds if β and Δβ lie in any plane. This is an infinitesimal Lorentz transformation in the form of a combined boost and rotation[nb 2]

[math]\displaystyle{ \Lambda = I - \Delta\boldsymbol{\theta}\cdot\mathbf{J} - \Delta\mathbf{b}\cdot\mathbf{K} }[/math]


[math]\displaystyle{ \Delta\boldsymbol{\theta} = \left(\frac{\gamma-1}{\beta^2}\right)\boldsymbol{\beta}\times\Delta\boldsymbol{\beta} = \frac{1}{c^2}\left(\frac{\gamma^2}{\gamma+1}\right)\mathbf{v}\times\Delta\mathbf{v} }[/math]
[math]\displaystyle{ \Delta\mathbf{b} = \gamma(\gamma\Delta\boldsymbol{\beta}_\parallel+\Delta\boldsymbol{\beta}_\perp) }[/math]

After dividing Δθ by Δt and taking the limit as in (7), one obtains the instantaneous angular velocity

[math]\displaystyle{ \boldsymbol{\omega}_T = \frac{1}{c^2}\left(\frac{\gamma^2}{\gamma+1}\right)\mathbf{a}\times\mathbf{v} }[/math]

where a is the acceleration of the particle as observed in the lab frame. No forces were specified or used in the derivation so the precession is a kinematical effect - it arises from the geometric aspects of motion. However, forces cause accelerations, so the Thomas precession is observed if the particle is subject to forces.

Thomas precession can also be derived using the Fermi-Walker transport equation.[10] One assumes uniform circular motion in flat Minkowski spacetime. The spin 4-vector is orthogonal to the velocity 4-vector. Fermi-Walker transport preserves this relation. One finds that the dot product of the acceleration 4-vector with the spin 4-vector varies sinusoidally with time with an angular frequency Ύ ω, where ω is the angular frequency of the circular motion and Ύ=1/√⟨1-v^2/c^2). This is easily shown by taking the second time derivative of that dot product. Because this angular frequency exceeds ω, the spin precesses in the retrograde direction. The difference (γ-1)ω is the Thomas precession angular frequency already given, as is simply shown by realizing that the magnitude of the 3-acceleration is ω v.


In electron orbitals

In quantum mechanics Thomas precession is a correction to the spin-orbit interaction, which takes into account the relativistic time dilation between the electron and the nucleus in hydrogenic atoms.

Basically, it states that spinning objects precess when they accelerate in special relativity because Lorentz boosts do not commute with each other.

To calculate the spin of a particle in a magnetic field, one must also take into account Larmor precession.

In a Foucault pendulum

The rotation of the swing plane of Foucault pendulum can be treated as a result of parallel transport of the pendulum in a 2-dimensional sphere of Euclidean space. The hyperbolic space of velocities in Minkowski spacetime represents a 3-dimensional (pseudo-) sphere with imaginary radius and imaginary timelike coordinate. Parallel transport of a spinning particle in relativistic velocity space leads to Thomas precession, which is similar to the rotation of the swing plane of a Foucault pendulum.[11] The angle of rotation in both cases is determined by the area integral of curvature in agreement with the Gauss–Bonnet theorem.

Thomas precession gives a correction to the precession of a Foucault pendulum. For a Foucault pendulum located in the city of Nijmegen in the Netherlands the correction is:

[math]\displaystyle{ \omega \approx 9.5 \cdot 10^{-7}\, \mathrm{arcseconds} / \mathrm{day}. }[/math]

Note that it is more than two orders of magnitude smaller than the precession due to the general-relativistic correction arising from frame-dragging, the Lense–Thirring precession.

See also


  1. Explicitly, using vector projection and rejection relative to the direction of β gives
    [math]\displaystyle{ \Delta\boldsymbol{\beta}_\parallel = \frac{\Delta\boldsymbol{\beta}\cdot\boldsymbol{\beta}}{\beta^2}\boldsymbol{\beta}\,,\quad \Delta\boldsymbol{\beta}_\perp = \Delta\boldsymbol{\beta} - \frac{\Delta\boldsymbol{\beta}\cdot\boldsymbol{\beta}}{\beta^2}\boldsymbol{\beta} }[/math]
    but it is easier to simply use the parallel-perpendicular components.
  2. The rotation and boost matrices (each infinitesimal) are given by
    [math]\displaystyle{ R(\Delta\boldsymbol{\theta}) = I - \Delta\boldsymbol{\theta}\cdot\mathbf{J} \,,\quad B(\Delta\mathbf{b}) = I - \Delta\mathbf{b}\cdot\mathbf{K} \,, }[/math]
    At the infinitesimal level, they commute with each other
    [math]\displaystyle{ \Lambda = B(\Delta\mathbf{b})R(\Delta\boldsymbol{\theta}) = R(\Delta\boldsymbol{\theta})B(\Delta\mathbf{b}) }[/math]
    because the products θ·J)(Δb·K) and b·K)(Δθ·J) are negligible. The full boost and rotations do not commute in general.


  1. Malykin 2006
  2. Silberstein 1914, p. 169
  3. Eddington 1924
  4. Goldstein 1980
  5. Ben-Menahem 1986
  6. Jackson 1975, p. 543–546
  7. Goldstein 1980, p. 288
  8. Sard 1970, p. 280
  9. Sexl & Urbantke 1992, p. 42
  10. Misner, Thorne, and Wheeler, Gravitation, p 165, pp 175-176
  11. Krivoruchenko 2009



External links