Physics:Linearized gravity

From HandWiki
Short description: Linear perturbations to solutions of nonlinear Einstein field equations

In the theory of general relativity, linearized gravity is the application of perturbation theory to the metric tensor that describes the geometry of spacetime. As a consequence, linearized gravity is an effective method for modeling the effects of gravity when the gravitational field is weak. It is integral to the study of gravitational waves and weak-field gravitational lensing.

Weak-field approximation

The Einstein field equation (EFE) describing the geometry of spacetime is given as (using natural units)

[math]\displaystyle{ R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = 8\pi GT_{\mu\nu} }[/math]

where [math]\displaystyle{ R_{\mu\nu} }[/math] is the Ricci tensor, [math]\displaystyle{ R }[/math] is the Ricci scalar, [math]\displaystyle{ T_{\mu\nu} }[/math] is the energy–momentum tensor, and [math]\displaystyle{ g_{\mu\nu} }[/math] is the spacetime metric tensor that represents the solution of the equation.

Although succinct when written in Einstein notation, the Ricci tensor and Ricci scalar hide highly nonlinear dependencies on the metric, which make exact solutions impractical to find for most systems. However, for systems in which the curvature of spacetime is small (meaning that terms in the EFE that are quadratic in the deviation of [math]\displaystyle{ g_{\mu\nu} }[/math] from the flat metric do not contribute significantly to the equations of motion), one can model the solution of the field equations as the Minkowski metric[note 1] [math]\displaystyle{ \eta_{\mu\nu} }[/math] plus a small perturbation term [math]\displaystyle{ h_{\mu\nu} }[/math]. In other words:

[math]\displaystyle{ g_{\mu\nu} = \eta_{\mu\nu} + h_{\mu\nu},\qquad |h_{\mu\nu}| \ll 1. }[/math]

In this regime, replacing the general metric [math]\displaystyle{ g_{\mu\nu} }[/math] with this perturbative approximation and keeping only terms linear in [math]\displaystyle{ h_{\mu\nu} }[/math] results in a simplified expression for the Ricci tensor:

[math]\displaystyle{ R_{\mu\nu} = \frac{1}{2}(\partial_\sigma\partial_\mu h^\sigma_\nu + \partial_\sigma\partial_\nu h^\sigma_\mu - \partial_\mu\partial_\nu h - \square h_{\mu\nu}), }[/math]

where [math]\displaystyle{ h = \eta^{\mu\nu}h_{\mu\nu} }[/math] is the trace of the perturbation, [math]\displaystyle{ \partial_\mu }[/math] denotes the partial derivative with respect to the [math]\displaystyle{ x^\mu }[/math] coordinate of spacetime, and [math]\displaystyle{ \square = \eta^{\mu\nu}\partial_\mu\partial_\nu }[/math] is the d'Alembert operator.
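
As a sketch of the intermediate step (following the standard textbook route, with indices raised and lowered by [math]\displaystyle{ \eta_{\mu\nu} }[/math], which is consistent at this order), the Christoffel symbols to first order in the perturbation are

[math]\displaystyle{ \Gamma^\rho_{\mu\nu} = \frac{1}{2}\eta^{\rho\lambda}(\partial_\mu h_{\nu\lambda} + \partial_\nu h_{\lambda\mu} - \partial_\lambda h_{\mu\nu}). }[/math]

Products of two Christoffel symbols are second order in [math]\displaystyle{ h_{\mu\nu} }[/math], so the Ricci tensor keeps only the derivative terms:

[math]\displaystyle{ R_{\mu\nu} = \partial_\rho\Gamma^\rho_{\mu\nu} - \partial_\nu\Gamma^\rho_{\rho\mu} + O(h^2), \qquad \Gamma^\rho_{\rho\mu} = \frac{1}{2}\partial_\mu h. }[/math]

Substituting the first expression into the second reproduces the four terms of the linearized Ricci tensor given above.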

Together with the Ricci scalar,

[math]\displaystyle{ R = \eta_{\mu\nu}R^{\mu\nu} = \partial_\mu\partial_\nu h^{\mu\nu} - \square h, }[/math]

the left side of the field equation reduces to

[math]\displaystyle{ R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = \frac{1}{2}(\partial_\sigma\partial_\mu h^\sigma_\nu + \partial_\sigma\partial_\nu h^\sigma_\mu - \partial_\mu\partial_\nu h - \square h_{\mu\nu} - \eta_{\mu\nu}\partial_\rho\partial_\lambda h^{\rho\lambda} + \eta_{\mu\nu}\square h). }[/math]

Thus the EFE reduces to a linear, second-order partial differential equation in terms of [math]\displaystyle{ h_{\mu\nu} }[/math].
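
One way to sanity-check the expression above is to evaluate it on a plane wave, [math]\displaystyle{ h_{\mu\nu} = A_{\mu\nu}e^{ik_\sigma x^\sigma} }[/math], for which every derivative [math]\displaystyle{ \partial_\mu }[/math] becomes a factor proportional to [math]\displaystyle{ k_\mu }[/math] and the bracket becomes purely algebraic. The following is a minimal sketch under that assumption, using the signature (−,+,+,+); the names `einstein_symbol` and `divergence` are hypothetical helpers introduced only for this illustration. It checks that contracting the result with [math]\displaystyle{ k^\mu }[/math] gives zero, which is the linearized counterpart of the statement that the left side of the field equation is divergence-free.

```python
# Plane-wave check of the linearized Einstein tensor (illustrative sketch).
# For h_{mu nu} = A_{mu nu} exp(i k.x) each derivative d_mu is replaced by
# k_mu; the overall constant factor is dropped because only vanishing of the
# divergence is tested.

ETA = [-1, 1, 1, 1]  # diagonal of the Minkowski metric, signature (-,+,+,+)


def eta(m, n):
    """Component eta_{mu nu} of the Minkowski metric."""
    return ETA[m] if m == n else 0


def einstein_symbol(k, A):
    """Algebraic form of the bracket in the linearized Einstein tensor.

    k holds covariant components k_mu; A holds the amplitudes A_{mu nu}.
    """
    # k_sigma A^sigma_nu, with the index raised by the diagonal metric
    kA = [sum(ETA[s] * k[s] * A[s][n] for s in range(4)) for n in range(4)]
    k2 = sum(ETA[m] * k[m] * k[m] for m in range(4))    # k_mu k^mu
    trA = sum(ETA[m] * A[m][m] for m in range(4))       # trace of A
    kAk = sum(ETA[n] * k[n] * kA[n] for n in range(4))  # k_rho k_lam A^{rho lam}
    return [[k[m] * kA[n] + k[n] * kA[m] - k[m] * k[n] * trA - k2 * A[m][n]
             - eta(m, n) * kAk + eta(m, n) * k2 * trA
             for n in range(4)] for m in range(4)]


def divergence(k, E):
    """Contract k^mu with E_{mu nu}."""
    return [sum(ETA[m] * k[m] * E[m][n] for m in range(4)) for n in range(4)]


k = [3, 1, -2, 5]  # arbitrary integer wave vector k_mu
A = [[2, -1, 0, 4],
     [-1, 3, 7, 1],
     [0, 7, -5, 2],
     [4, 1, 2, 6]]  # arbitrary symmetric amplitude A_{mu nu}
print(divergence(k, einstein_symbol(k, A)))  # [0, 0, 0, 0]
```

Integer inputs keep the arithmetic exact, so the zeros are not a rounding artifact.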

Gauge invariance

The decomposition of the general spacetime metric [math]\displaystyle{ g_{\mu\nu} }[/math] into the Minkowski metric plus a perturbation term is not unique, because different choices of coordinates may give different forms for [math]\displaystyle{ h_{\mu\nu} }[/math]. This freedom is captured by introducing a gauge symmetry.

Gauge symmetries are a mathematical device for describing a system that does not change when the underlying coordinate system is "shifted" by an infinitesimal amount. So although the perturbation metric [math]\displaystyle{ h_{\mu\nu} }[/math] is not consistently defined between different coordinate systems, the overall system which it describes is.

To capture this formally, the non-uniqueness of the perturbation [math]\displaystyle{ h_{\mu\nu} }[/math] is represented as a consequence of the diverse collection of diffeomorphisms on spacetime that leave [math]\displaystyle{ h_{\mu\nu} }[/math] sufficiently small. To proceed, [math]\displaystyle{ h_{\mu\nu} }[/math] must therefore be defined in terms of a general set of diffeomorphisms, from which one then selects the subset that preserves the smallness required by the weak-field approximation. One may thus define [math]\displaystyle{ \phi }[/math] to denote an arbitrary diffeomorphism that maps the flat Minkowski spacetime to the more general spacetime represented by the metric [math]\displaystyle{ g_{\mu\nu} }[/math]. With this, the perturbation metric may be defined as the difference between the pullback of [math]\displaystyle{ g_{\mu\nu} }[/math] and the Minkowski metric:

[math]\displaystyle{ h_{\mu\nu} = (\phi^*g)_{\mu\nu} - \eta_{\mu\nu}. }[/math]

The diffeomorphisms [math]\displaystyle{ \phi }[/math] may thus be chosen such that [math]\displaystyle{ |h_{\mu\nu}| \ll 1 }[/math].

Given then a vector field [math]\displaystyle{ \xi^\mu }[/math] defined on the flat, background spacetime, an additional family of diffeomorphisms [math]\displaystyle{ \psi_\epsilon }[/math] may be defined as those generated by [math]\displaystyle{ \xi^\mu }[/math] and parameterized by [math]\displaystyle{ \epsilon \gt 0 }[/math]. These new diffeomorphisms will be used to represent the coordinate transformations for "infinitesimal shifts" as discussed above. Together with [math]\displaystyle{ \phi }[/math], a family of perturbations is given by

[math]\displaystyle{ \begin{align} h^{(\epsilon)}_{\mu\nu} &= [(\phi\circ\psi_\epsilon)^*g]_{\mu\nu} - \eta_{\mu\nu} \\ &= [\psi^*_\epsilon(\phi^*g)]_{\mu\nu} - \eta_{\mu\nu} \\ &= \psi^*_\epsilon(h + \eta)_{\mu\nu} - \eta_{\mu\nu} \\ &= (\psi^*_\epsilon h)_{\mu\nu} + \epsilon\left[\frac{(\psi^*_\epsilon\eta)_{\mu\nu} - \eta_{\mu\nu}}{\epsilon}\right]. \end{align} }[/math]

Therefore, in the limit [math]\displaystyle{ \epsilon\rightarrow 0 }[/math],

[math]\displaystyle{ h^{(\epsilon)}_{\mu\nu} = h_{\mu\nu} + \epsilon\mathcal{L}_\xi\eta_{\mu\nu} }[/math]

where [math]\displaystyle{ \mathcal{L}_\xi }[/math] is the Lie derivative along the vector field [math]\displaystyle{ \xi^\mu }[/math].
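
For reference, a sketch of the coordinate expression, assuming Cartesian coordinates on the flat background: the Lie derivative of a rank-2 covariant tensor along [math]\displaystyle{ \xi^\mu }[/math] is

[math]\displaystyle{ (\mathcal{L}_\xi\eta)_{\mu\nu} = \xi^\sigma\partial_\sigma\eta_{\mu\nu} + \eta_{\sigma\nu}\partial_\mu\xi^\sigma + \eta_{\mu\sigma}\partial_\nu\xi^\sigma. }[/math]

In these coordinates the components [math]\displaystyle{ \eta_{\mu\nu} }[/math] are constant, so the first term vanishes, and lowering the index with [math]\displaystyle{ \xi_\mu = \eta_{\mu\sigma}\xi^\sigma }[/math] leaves [math]\displaystyle{ \partial_\mu\xi_\nu + \partial_\nu\xi_\mu }[/math].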

The Lie derivative works out to yield the final gauge transformation of the perturbation metric [math]\displaystyle{ h_{\mu\nu} }[/math]:

[math]\displaystyle{ h^{(\epsilon)}_{\mu\nu} = h_{\mu\nu} + \epsilon(\partial_\mu\xi_\nu + \partial_\nu\xi_\mu), }[/math]

which precisely defines the set of perturbation metrics that describe the same physical system. In other words, it characterizes the gauge symmetry of the linearized field equations.
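
The claim that these perturbations describe the same physics can be illustrated by checking that a pure-gauge perturbation, [math]\displaystyle{ \partial_\mu\xi_\nu + \partial_\nu\xi_\mu }[/math], contributes nothing to the linearized Ricci tensor of the previous section. The sketch below assumes plane-wave dependence [math]\displaystyle{ e^{ik_\sigma x^\sigma} }[/math] for both the perturbation and [math]\displaystyle{ \xi_\mu }[/math], so that derivatives become factors of [math]\displaystyle{ k_\mu }[/math], and uses the signature (−,+,+,+). The helper names `ricci_symbol` and `pure_gauge` are hypothetical and exist only for this illustration.

```python
# Plane-wave check that pure-gauge perturbations carry no linearized curvature
# (illustrative sketch; helper names are hypothetical).

ETA = [-1, 1, 1, 1]  # diagonal of the Minkowski metric, signature (-,+,+,+)


def ricci_symbol(k, A):
    """Algebraic form of the bracket in the linearized Ricci tensor.

    Each derivative d_mu is replaced by k_mu; the overall constant factor
    is dropped because only vanishing is tested here.
    """
    # k_sigma A^sigma_nu, with the index raised by the diagonal metric
    kA = [sum(ETA[s] * k[s] * A[s][n] for s in range(4)) for n in range(4)]
    k2 = sum(ETA[m] * k[m] * k[m] for m in range(4))  # k_mu k^mu
    trA = sum(ETA[m] * A[m][m] for m in range(4))     # trace of A
    return [[k[m] * kA[n] + k[n] * kA[m] - k[m] * k[n] * trA - k2 * A[m][n]
             for n in range(4)] for m in range(4)]


def pure_gauge(k, xi):
    """Amplitude of d_mu xi_nu + d_nu xi_mu for a plane-wave xi_mu."""
    return [[k[m] * xi[n] + k[n] * xi[m] for n in range(4)] for m in range(4)]


k = [3, 1, -2, 5]   # arbitrary integer wave vector k_mu
xi = [2, -7, 4, 1]  # arbitrary integer gauge amplitude xi_mu
R = ricci_symbol(k, pure_gauge(k, xi))
print(all(entry == 0 for row in R for entry in row))  # True
```

Because the linearized Ricci tensor is linear in the perturbation, the vanishing result for the pure-gauge part means that adding it to any [math]\displaystyle{ h_{\mu\nu} }[/math] leaves the curvature unchanged.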

Choice of gauge

By exploiting gauge invariance, certain properties of the perturbation metric can be guaranteed by choosing a suitable vector field [math]\displaystyle{ \xi^\mu }[/math].

Transverse gauge

To study how the perturbation [math]\displaystyle{ h_{\mu\nu} }[/math] distorts measurements of length, it is useful to define the following spatial tensor:

[math]\displaystyle{ s_{ij} = h_{ij} - \frac{1}{3}\delta^{kl}h_{kl}\delta_{ij} }[/math]

(Note that the indices span only spatial components: [math]\displaystyle{ i,j\in\{1,2,3\} }[/math]). Thus, by using [math]\displaystyle{ s_{ij} }[/math], the spatial components of the perturbation can be decomposed as

[math]\displaystyle{ h_{ij} = s_{ij} - \Psi\delta_{ij} }[/math]

where [math]\displaystyle{ \Psi = -\frac{1}{3}\delta^{kl}h_{kl} }[/math].
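
As a small numerical illustration of the definition of [math]\displaystyle{ s_{ij} }[/math], the sketch below splits an arbitrary symmetric 3×3 array (made-up numbers, not physical data) into its traceless part and its spatial trace [math]\displaystyle{ \delta^{kl}h_{kl} }[/math], the isotropic piece that [math]\displaystyle{ \Psi }[/math] encodes. The function name `decompose_spatial` is hypothetical.

```python
from fractions import Fraction


def decompose_spatial(h):
    """Split a 3x3 spatial perturbation h_ij into (s_ij, delta^{kl} h_kl)."""
    tr = sum(Fraction(h[i][i]) for i in range(3))  # spatial trace of h
    # s_ij = h_ij - (1/3) * trace * delta_ij
    s = [[Fraction(h[i][j]) - (tr / 3 if i == j else 0) for j in range(3)]
         for i in range(3)]
    return s, tr


h = [[4, 1, 0],
     [1, -2, 3],
     [0, 3, 1]]  # arbitrary symmetric example values
s, tr = decompose_spatial(h)
print(tr)                              # 3
print(sum(s[i][i] for i in range(3)))  # 0
```

Exact fractions are used so that the vanishing trace of [math]\displaystyle{ s_{ij} }[/math] is not obscured by floating-point rounding.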

The tensor [math]\displaystyle{ s_{ij} }[/math] is, by construction, traceless and is referred to as the strain since it represents the amount by which the perturbation stretches and contracts measurements of space. In the context of studying gravitational radiation, the strain is particularly useful when utilized with the transverse gauge. This gauge is defined by choosing the spatial components of [math]\displaystyle{ \xi^\mu }[/math] to satisfy the relation

[math]\displaystyle{ \nabla^2\xi^j + \frac{1}{3}\partial_j\partial_i\xi^i = -\partial_i s^{ij}, }[/math]

then choosing the time component [math]\displaystyle{ \xi^0 }[/math] to satisfy

[math]\displaystyle{ \nabla^2\xi^0 = \partial_i h_{0i} + \partial_0\partial_i\xi^i. }[/math]

After performing the gauge transformation using the formula in the previous section, the strain becomes spatially transverse:

[math]\displaystyle{ \partial_i s^{ij}_{(\epsilon)} = 0, }[/math]

with the additional property:

[math]\displaystyle{ \partial_i h^{0i}_{(\epsilon)} = 0. }[/math]

Synchronous gauge

The synchronous gauge simplifies the perturbation metric by requiring that the metric not distort measurements of time. More precisely, the synchronous gauge is chosen such that the non-spatial components of [math]\displaystyle{ h^{(\epsilon)}_{\mu\nu} }[/math] are zero, namely

[math]\displaystyle{ h^{(\epsilon)}_{0\nu} = 0. }[/math]

This can be achieved by requiring the time component of [math]\displaystyle{ \xi^\mu }[/math] to satisfy

[math]\displaystyle{ \partial_0\xi^0 = -h_{00} }[/math]

and requiring the spatial components to satisfy

[math]\displaystyle{ \partial_0\xi^i = \partial_i\xi^0 - h_{0i}. }[/math]

Harmonic gauge

The harmonic gauge (also referred to as the Lorenz gauge[note 2]) is selected whenever it is necessary to reduce the linearized field equations as much as possible. This can be done if the condition

[math]\displaystyle{ \partial_\mu h^\mu_\nu = \frac{1}{2}\partial_\nu h }[/math]

is true. To achieve this, [math]\displaystyle{ \xi_\mu }[/math] is required to satisfy the relation

[math]\displaystyle{ \square\xi_\mu = -\partial_\nu h^\nu_\mu + \frac{1}{2}\partial_\mu h. }[/math]
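
A sketch of why this choice works, absorbing [math]\displaystyle{ \epsilon }[/math] into [math]\displaystyle{ \xi_\mu }[/math] for brevity (a notational assumption made only here): under [math]\displaystyle{ h_{\mu\nu} \rightarrow h_{\mu\nu} + \partial_\mu\xi_\nu + \partial_\nu\xi_\mu }[/math] the two sides of the gauge condition transform as

[math]\displaystyle{ \partial_\mu h^\mu_\nu \rightarrow \partial_\mu h^\mu_\nu + \square\xi_\nu + \partial_\nu\partial_\mu\xi^\mu, \qquad \frac{1}{2}\partial_\nu h \rightarrow \frac{1}{2}\partial_\nu h + \partial_\nu\partial_\mu\xi^\mu. }[/math]

The [math]\displaystyle{ \partial_\nu\partial_\mu\xi^\mu }[/math] terms cancel in the difference, so the transformed perturbation satisfies the harmonic condition exactly when [math]\displaystyle{ \square\xi_\nu = -\partial_\mu h^\mu_\nu + \frac{1}{2}\partial_\nu h }[/math].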

Consequently, by using the harmonic gauge, the Einstein tensor [math]\displaystyle{ G_{\mu\nu} = R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} }[/math] reduces to

[math]\displaystyle{ G_{\mu\nu} = -\frac{1}{2}\square\left(h^{(\epsilon)}_{\mu\nu} - \frac{1}{2}h^{(\epsilon)}\eta_{\mu\nu}\right). }[/math]

Therefore, by writing it in terms of a "trace-reversed" metric, [math]\displaystyle{ \bar{h}^{(\epsilon)}_{\mu\nu} = h^{(\epsilon)}_{\mu\nu} - \frac{1}{2}h^{(\epsilon)}\eta_{\mu\nu} }[/math], the linearized field equations reduce to

[math]\displaystyle{ \square \bar{h}^{(\epsilon)}_{\mu\nu} = -16\pi GT_{\mu\nu}. }[/math]

These equations can be solved exactly using the wave solutions that define gravitational radiation.
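
As an illustration of such a solution (a standard textbook example; the amplitude [math]\displaystyle{ C_{\mu\nu} }[/math] and wave vector [math]\displaystyle{ k_\sigma }[/math] are notation introduced here), in vacuum, where [math]\displaystyle{ T_{\mu\nu} = 0 }[/math], the equation admits plane waves

[math]\displaystyle{ \bar{h}^{(\epsilon)}_{\mu\nu} = C_{\mu\nu}e^{ik_\sigma x^\sigma}, \qquad k_\sigma k^\sigma = 0, \qquad k^\mu C_{\mu\nu} = 0, }[/math]

where [math]\displaystyle{ C_{\mu\nu} }[/math] is a constant symmetric tensor and the physical perturbation is the real part. The null condition on [math]\displaystyle{ k_\sigma }[/math] follows from applying [math]\displaystyle{ \square }[/math] to the exponential, and the transversality condition is the harmonic gauge condition, which in terms of the trace-reversed perturbation reads [math]\displaystyle{ \partial^\mu\bar{h}^{(\epsilon)}_{\mu\nu} = 0 }[/math].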

Notes

  1. This is assuming that the background spacetime is flat. Perturbation theory applied in spacetime that is already curved can work just as well by replacing this term with the metric representing the curved background.
  2. Not to be confused with Lorentz.

Further reading

  • Sean M. Carroll (2003). Spacetime and Geometry, an Introduction to General Relativity. Pearson. ISBN 978-0805387322.