Astronomy:Aberration of light
The aberration of light (also referred to as astronomical aberration, stellar aberration, or velocity aberration) is an astronomical phenomenon which produces an apparent motion of celestial objects about their true positions, dependent on the velocity of the observer. Aberration causes objects to appear to be displaced towards the direction of motion of the observer compared to when the observer is stationary. The change in angle is typically very small — of the order of v/c where c is the speed of light and v the velocity of the observer. In the case of "stellar" or "annual" aberration, the apparent position of a star to an observer on Earth varies periodically over the course of a year as the Earth's velocity changes as it revolves around the Sun, by a maximum angle of approximately 20 arcseconds in right ascension or declination.
The term aberration has historically been used to refer to a number of related phenomena concerning the propagation of light in moving bodies.[1] Aberration should not be confused with parallax. The latter is a change in the apparent position of a relatively nearby object, as measured by a moving observer, relative to more distant objects that define a reference frame. The amount of parallax depends on the distance of the object from the observer, whereas aberration does not. Aberration is also related to light-time correction and relativistic beaming, although it is often considered separately from these effects.
Aberration is historically significant because of its role in the development of the theories of light, electromagnetism and, ultimately, the theory of special relativity. It was first observed in the late 1600s by astronomers searching for stellar parallax in order to confirm the heliocentric model of the Solar System. However, it was not understood at the time to be a different phenomenon.[2] In 1727, James Bradley provided a classical explanation for it in terms of the finite speed of light relative to the motion of the Earth in its orbit around the Sun,[3][4] which he used to make one of the earliest measurements of the speed of light. However, Bradley's theory was incompatible with 19th century theories of light, and aberration became a major motivation for the aether drag theories of Augustin Fresnel (in 1818) and G. G. Stokes (in 1845), and for Hendrik Lorentz's aether theory of electromagnetism in 1892. The aberration of light, together with Lorentz's elaboration of Maxwell's electrodynamics, the moving magnet and conductor problem, the negative aether drift experiments, as well as the Fizeau experiment, led Albert Einstein to develop the theory of special relativity in 1905, which presents a general form of the equation for aberration in terms of such theory.[5]
Explanation
Aberration may be explained as the difference in angle of a beam of light in different inertial frames of reference. A common analogy is to consider the apparent direction of falling rain. If rain is falling vertically in the frame of reference of a person standing still, then to a person moving forwards the rain will appear to arrive at an angle, requiring the moving observer to tilt their umbrella forwards. The faster the observer moves, the more tilt is needed.
The net effect is that light rays striking the moving observer from the sides in a stationary frame will come angled from ahead in the moving observer's frame. This effect is sometimes called the "searchlight" or "headlight" effect.
In the case of annual aberration of starlight, the direction of incoming starlight as seen in the Earth's moving frame is tilted relative to the angle observed in the Sun's frame. Since the direction of motion of the Earth changes during its orbit, the direction of this tilting changes during the course of the year, and causes the apparent position of the star to differ from its true position as measured in the inertial frame of the Sun.
While classical reasoning gives intuition for aberration, it leads to a number of physical paradoxes observable even at the classical level (see history). The theory of special relativity is required to correctly account for aberration. The relativistic explanation is very similar to the classical one however, and in both theories aberration may be understood as a case of addition of velocities.
Classical explanation
In the Sun's frame, consider a beam of light with velocity equal to the speed of light c, with x and y velocity components [math]\displaystyle{ u_x }[/math] and [math]\displaystyle{ u_y }[/math], at an angle [math]\displaystyle{ \tan(\theta) = u_y/u_x }[/math]. If the Earth is moving at velocity [math]\displaystyle{ v }[/math] in the x direction relative to the Sun, then by velocity addition the x component of the beam's velocity in the Earth's frame of reference is [math]\displaystyle{ u_x' = u_x - v }[/math], and the y velocity is unchanged, [math]\displaystyle{ u_y' = u_y }[/math]. (Note that you need the velocity of the Sun with respect to the Earth which is the negative of the velocity of the Earth with respect to the Sun. Also note that we are only using vectors here without indication of direction.) Thus the angle of the light in the Earth's frame in terms of the angle in the Sun's frame is
- [math]\displaystyle{ \tan(\phi) = \frac{u_y'}{u_x'} = \frac{u_y}{u_x-v} = \frac{\sin(\theta)}{\cos(\theta)-v/c} }[/math]
In the case of [math]\displaystyle{ \theta = 90^\circ }[/math], this result reduces to [math]\displaystyle{ \tan(\theta - \phi) = -v/c }[/math].
Relativistic explanation
The reasoning in the relativistic case is the same except that the relativistic velocity addition formulas must be used, which can be derived from Lorentz transformations between different frames of reference. These formulae are
- [math]\displaystyle{ u_x' = (u_x - v)/(1-u_x v/c^2) }[/math]
- [math]\displaystyle{ u_y' = u_y / \gamma (1-u_x v/c^2) }[/math]
where [math]\displaystyle{ \gamma = 1/\sqrt{1-v^2/c^2} }[/math], giving the components of the light beam in the Earth's frame in terms of the components in the Sun's frame. The angle of the beam in the Earth's frame is thus [6]
- [math]\displaystyle{ \tan(\phi) = \frac{u_y'}{u_x'} = \frac{u_y}{\gamma(u_x-v)} = \frac{\sin(\theta)}{\gamma(\cos(\theta)-v/c)} }[/math]
In the case of [math]\displaystyle{ \theta = 90^\circ }[/math], this result reduces to [math]\displaystyle{ \sin(\theta - \phi) = -v/c }[/math], and in the limit [math]\displaystyle{ v/c \ll 1 }[/math] this may be approximated by [math]\displaystyle{ \theta - \phi = -v/c }[/math]. This relativistic derivation keeps the speed of light [math]\displaystyle{ \sqrt{u_x^2 + u_y^2} = c }[/math] constant in all frames of reference, unlike the classical derivation above.
Relationship to light-time correction and relativistic beaming
Aberration is related to two other phenomena, light-time correction, which is due to the motion of an observed object during the time taken by its light to reach an observer, and relativistic beaming, which is an angling of the light emitted by a moving light source. It can be considered equivalent to them but in a different inertial frame of reference. In aberration, the observer is considered to be moving relative to a (for the sake of simplicity[7]) stationary light source, while in light-time correction and relativistic beaming the light source is considered to be moving relative to a stationary observer.
Consider the case of an observer and a light source moving relative to each other at constant velocity, with a light beam moving from the source to the observer. At the moment of emission, the beam in the observer's rest frame is tilted compared to the one in the source's rest frame, as understood through relativistic beaming. During the time it takes the light beam to reach the observer the light source moves in the observer's frame, and the 'true position' of the light source is displaced relative to the apparent position the observer sees, as explained by light-time correction. Finally, the beam in the observer's frame at the moment of observation is tilted compared to the beam in source's frame, which can be understood as an aberrational effect. Thus, a person in the light source's frame would describe the apparent tilting of the beam in terms of aberration, while a person in the observer's frame would describe it as a light-time effect.
The relationship between these phenomena is only valid if the observer and source's frames are inertial frames. In practice, because the Earth is not an inertial rest frame but experiences centripetal acceleration towards the Sun, many aberrational effects such as annual aberration on Earth cannot be considered light-time corrections. However, if the time between emission and detection of the light is short compared to the orbital period of the Earth, the Earth may be approximated as an inertial frame and aberrational effects are equivalent to light-time corrections.
Types of aberration
There are a number of types of aberration, caused by the differing components of the Earth's and observed object's motion:
- Annual aberration is due to the orbital revolution of the Earth around the Sun.
- Planetary aberration is the combination of aberration and light-time correction.
- Diurnal aberration is due to the rotation of the Earth about its own axis.
- Secular aberration is due to the motion of the Sun and Solar System relative to other stars in our Galaxy.
Annual aberration
Annual aberration is caused by the motion of an observer on Earth as the planet revolves around the Sun. Due to orbital eccentricity, the orbital velocity [math]\displaystyle{ v }[/math] of Earth (in the Sun's rest frame) varies periodically during the year as the planet traverses its elliptic orbit and consequently the aberration also varies periodically, typically causing stars to appear to move in small ellipses.
Approximating Earth's orbit as circular, the maximum displacement of a star due to annual aberration is known as the constant of aberration, conventionally represented by [math]\displaystyle{ \kappa }[/math]. It may be calculated using the relation [math]\displaystyle{ \kappa = \theta-\phi \approx v/c }[/math] substituting the Earth's average speed in the Sun's frame for [math]\displaystyle{ v }[/math] and the speed of light [math]\displaystyle{ c }[/math]. Its accepted value is 20.49552 arcseconds (at J2000).[8]
Assuming a circular orbit, annual aberration causes stars exactly on the ecliptic (the plane of Earth's orbit) to appear to move back and forth along a straight line, varying by [math]\displaystyle{ \kappa }[/math] on either side of their position in the Sun's frame. A star that is precisely at one of the ecliptic poles (at 90° from the ecliptic plane) will appear to move in a circle of radius [math]\displaystyle{ \kappa }[/math] about its true position, and stars at intermediate ecliptic latitudes will appear to move along a small ellipse.
For illustration, consider a star at the northern ecliptic pole viewed by an observer at a point on the Arctic Circle. Such an observer will see the star transit at the zenith, once every day (strictly speaking sidereal day). At the time of the March equinox, Earth's orbit carries the observer in a southwards direction, and the star's apparent declination is therefore displaced to the south by an angle of [math]\displaystyle{ \kappa }[/math]. On the September equinox, the star's position is displaced to the north by an equal and opposite amount. On either solstice, the displacement in declination is 0. Conversely, the amount of displacement in right ascension is 0 on either equinox and at maximum on either solstice.
In actuality, Earth's orbit is slightly elliptic rather than circular, and its speed varies somewhat over the course of its orbit, which means the description above is only approximate. Aberration is more accurately calculated using Earth's instantaneous velocity relative to the barycenter of the Solar System.[8]
Note that the displacement due to aberration is orthogonal to any displacement due to parallax. If parallax were detectable, the maximum displacement to the south would occur in December, and the maximum displacement to the north in June. It is this apparently anomalous motion that so mystified early astronomers.
Solar annual aberration
A special case of annual aberration is the nearly constant deflection of the Sun from its position in the Sun's rest frame by [math]\displaystyle{ \kappa }[/math] towards the west (as viewed from Earth), opposite to the apparent motion of the Sun along the ecliptic (which is from west to east, as seen from Earth). The deflection thus makes the Sun appear to be behind (or retarded) from its rest-frame position on the ecliptic by a position or angle [math]\displaystyle{ \kappa }[/math].
This deflection may equivalently be described as a light-time effect due to motion of the Earth during the 8.3 minutes that it takes light to travel from the Sun to Earth. This is possible since the transit time of sunlight is short relative to the orbital period of the Earth, so the Earth's frame may be approximated as inertial. In the Earth's frame, the Sun moves by a distance [math]\displaystyle{ \Delta x = vt }[/math] in the time it takes light to reach Earth, [math]\displaystyle{ t=R/c }[/math] for the orbit of radius [math]\displaystyle{ R }[/math]. This gives an angular correction [math]\displaystyle{ \tan(\theta) \approx \theta = \Delta x/R }[/math] which can be solved to give [math]\displaystyle{ \theta = v/c = \kappa }[/math], the same as the aberrational correction.
Planetary aberration
Planetary aberration is the combination of the aberration of light (due to Earth's velocity) and light-time correction (due to the object's motion and distance), as calculated in the rest frame of the Solar System. Both are determined at the instant when the moving object's light reaches the moving observer on Earth. It is so called because it is usually applied to planets and other objects in the Solar System whose motion and distance are accurately known.
Diurnal aberration
Diurnal aberration is caused by the velocity of the observer on the surface of the rotating Earth. It is therefore dependent not only on the time of the observation, but also the latitude and longitude of the observer. Its effect is much smaller than that of annual aberration, and is only 0.32 arcseconds in the case of an observer at the Equator, where the rotational velocity is greatest.[9]
Secular aberration
The Sun and Solar System are revolving around the center of the Galaxy. Aberration due to this motion is known as secular aberration and affects the apparent positions of distant stars and extragalactic objects. However, since the galactic year is about 230 million years, the aberration varies very slowly and this change is extremely difficult to observe. Therefore, secular aberration is usually ignored when considering the positions of stars. In other words, star maps show the observed apparent positions of the stars, not their calculated true positions after accounting for secular aberration.
For stars significantly less than 230 million light years away, the Solar System may be approximated as an inertial frame and so the effect of secular aberration is equivalent to a light-time correction. This includes stars in the Milky Way, since the Milky Way is about 100,000 light years in diameter. For these stars the true position of the star is then easily computed from its proper motion and its distance.
Secular aberration is typically a small number of arcminutes, for example the stationary star Groombridge 1830 is displaced by approximately 3 arcminutes,[9] due to secular aberration. This is roughly 8 times the effect of annual aberration, as one would expect since the velocity of the Solar System relative to the center of the Galaxy is about 8 times the velocity of the Earth relative to the Sun.
Discovery and first observations
The discovery of the aberration of light was totally unexpected, and it was only by considerable perseverance and perspicacity that Bradley was able to explain it in 1727. It originated from attempts to discover whether stars possessed appreciable parallaxes.
Search for stellar parallax
The Copernican heliocentric theory of the Solar System had received confirmation by the observations of Galileo and Tycho Brahe and the mathematical investigations of Kepler and Newton.[10] As early as 1573, Thomas Digges had suggested that parallactic shifting of the stars should occur according to the heliocentric model, and consequently if stellar parallax could be observed it would help confirm this theory. Many observers claimed to have determined such parallaxes, but Tycho Brahe and Giovanni Battista Riccioli concluded that they existed only in the minds of the observers, and were due to instrumental and personal errors. However, in 1680 Jean Picard, in his Voyage d’Uranibourg, stated, as a result of ten years' observations, that Polaris, the Pole Star, exhibited variations in its position amounting to 40″ annually. Some astronomers endeavoured to explain this by parallax, but these attempts failed because the motion differed from that which parallax would produce. John Flamsteed, from measurements made in 1689 and succeeding years with his mural quadrant, similarly concluded that the declination of Polaris was 40″ less in July than in September. Robert Hooke, in 1674, published his observations of γ Draconis, a star of magnitude 2m which passes practically overhead at the latitude of London (hence its observations are largely free from the complex corrections due to atmospheric refraction), and concluded that this star was 23″ more northerly in July than in October.[10]
James Bradley's observations
Consequently, when Bradley and Samuel Molyneux entered this sphere of research in 1725, there was still considerable uncertainty as to whether stellar parallaxes had been observed or not, and it was with the intention of definitely answering this question that they erected a large telescope at Molyneux's house at Kew.[4] They decided to reinvestigate the motion of γ Draconis with a telescope constructed by George Graham (1675–1751), a celebrated instrument-maker. This was fixed to a vertical chimney stack in such manner as to permit a small oscillation of the eyepiece, the amount of which (i.e. the deviation from the vertical) was regulated and measured by the introduction of a screw and a plumb line.[10]
The instrument was set up in November 1725, and observations on γ Draconis were made starting in December. The star was observed to move 40″ southwards between September and March, and then reversed its course from March to September. [10] At the same time, 35 Camelopardalis, a star with a right ascension nearly exactly opposite to that of γ Draconis, was 19" more northerly at the beginning of March than in September.[11] These results were completely unexpected and inexplicable by existing theories.
Early hypotheses
Bradley and Molyneux discussed several hypotheses in the hope of finding the solution. Since the apparent motion was evidently caused neither by parallax nor observational errors, Bradley first hypothesized that it could be due to oscillations in the orientation of the Earth's axis relative to the celestial sphere – a phenomenon known as nutation. 35 Camelopardalis was seen to possess an apparent motion which could be consistent with nutation, but since its declination varied only one half as much as that of γ Draconis, it was obvious that nutation did not supply the answer[12] (however, Bradley later went on to discover that the Earth does indeed nutate).[13] He also investigated the possibility that the motion was due to an irregular distribution of the Earth's atmosphere, thus involving abnormal variations in the refractive index, but again obtained negative results.[12]
On August 19, 1727, Bradley embarked upon a further series of observations using a telescope of his own erected at the Rectory, Wanstead. This instrument had the advantage of a larger field of view and he was able to obtain precise positions of a large number of stars over the course of about twenty years. During his first two years at Wanstead, he established the existence of the phenomenon of aberration beyond all doubt, and this also enabled him to formulate a set of rules that would allow the calculation of the effect on any given star at a specified date.
Development of the theory of aberration
Bradley eventually developed his explanation of aberration in about September 1728 and this theory was presented to the Royal Society in mid January the following year. One well-known story was that he saw the change of direction of a wind vane on a boat on the Thames, caused not by an alteration of the wind itself, but by a change of course of the boat relative to the wind direction.[13] However, there is no record of this incident in Bradley's own account of the discovery, and it may therefore be apocryphal.
The following table shows the magnitude of deviation from true declination for γ Draconis and the direction, on the planes of the solstitial colure and ecliptic prime meridian, of the tangent of the velocity of the Earth in its orbit for each of the four months where the extremes are found, as well as expected deviation from true ecliptic longitude if Bradley had measured its deviation from right ascension:
Month | Direction of tangential velocity of Earth on the plane of the solstitial colure | Deviation from true declination of γ Draconis | Direction of tangential velocity of Earth on the plane of the ecliptic prime meridian | Expected deviation from true ecliptic longitude of γ Draconis |
---|---|---|---|---|
December | zero | none | ← (moving toward perihelion at fast velocity) | decrease of more than 20.2" |
March | ← (moving toward aphelion) | 19.5" southward | zero | none |
June | zero | none | → (moving toward aphelion at slow velocity) | increase of less than 20.2" |
September | → (moving toward perihelion) | 19.5" northward | zero | none |
Bradley proposed that the aberration of light not only affected declination, but right ascension as well, so that a star in the pole of the ecliptic would describe a little ellipse with a diameter of about 40", but for simplicity, he assumed it to be a circle. Since he only observed the deviation in declination, and not in right ascension, his calculations for the maximum deviation of a star in the pole of the ecliptic are for its declination only, which will coincide with the diameter of the little circle described by such star. For eight different stars, his calculations are as follows:
Star | Annual Variation (") | Maximum deviation in declination of a star in the pole of the ecliptic (") |
---|---|---|
γ Draconis | 39 | 40.4 |
β Draconis | 39 | 40.2 |
η Ursa Maj. | 36 | 40.4 |
α Cass. | 34 | 40.8 |
τ Persei | 25 | 41.0 |
α Persei | 23 | 40.2 |
35 Camel. | 19 | 40.2 |
Capella | 16 | 40.0 |
MEAN | 40.4 |
Based on these calculations, Bradley was able to estimate the constant of aberration at 20.2", which is equal to 0.00009793 radians, and with this was able to estimate the speed of light at 183,300 miles (295,000 km) per second.[14] By projecting the little circle for a star in the pole of the ecliptic, he could simplify the calculation of the relationship between the speed of light and the speed of the Earth's annual motion in its orbit as follows:
- [math]\displaystyle{ \cos(\frac{1}{2}\pi-0.00009793) = \sin(0.00009793) = \frac{v}{c} }[/math]
Thus, the speed of light to the speed of the Earth's annual motion in its orbit is "10210 to one, from whence it would follow, that light moves, or is propagated as far as from the Sun to the Earth in 8' 12"."[15]
The discovery and elucidation of aberration is now regarded as a classic case of the application of scientific method, in which observations are made to test a theory, but unexpected results are sometimes obtained that in turn lead to new discoveries. It is also worth noting that part of the original motivation of the search for stellar parallax was to test the Copernican theory that the Earth revolves around the Sun, but of course the existence of aberration also establishes the truth of that theory.
Historical theories of aberration
The phenomenon of aberration became a driving force for many physical theories during the 200 years between its observation and the conclusive explanation by Albert Einstein.
The first classical explanation was provided in 1729, by James Bradley as described above, who attributed it to the finite speed of light and the motion of Earth in its orbit around the Sun.[3][4] However, this explanation proved inaccurate once the wave nature of light was better understood, and correcting it became a major goal of the 19th century theories of luminiferous aether. Augustin-Jean Fresnel proposed a correction due to the motion of a medium (the aether) through which light propagated, known as "partial aether drag". He proposed that objects partially drag the aether along with them as they move, and this became the accepted explanation for aberration for some time. George Stokes proposed a similar theory, explaining that aberration occurs due to the flow of aether induced by the motion of the Earth. Accumulated evidence against these explanations, combined with new understanding of the electromagnetic nature of light, led Hendrik Lorentz to develop an electron theory which featured an immobile aether, and he explained that objects contract in length as they move through the aether. Motivated by these previous theories, Albert Einstein then developed the theory of special relativity in 1905, which provides the modern account of aberration.
Bradley's classical explanation
Bradley conceived of an explanation in terms of a corpuscular theory of light in which light is made of particles unaffected by gravity.[1] His classical explanation appeals to the motion of the earth relative to a beam of light-particles moving at a finite velocity, and is developed in the Sun's frame of reference, unlike the classical derivation given above.
Consider the case where a distant star is motionless relative to the Sun, and the star is extremely far away, so that parallax may be ignored. In the rest frame of the Sun, this means light from the star travels in parallel paths to the Earth observer, and arrives at the same angle regardless of where the Earth is in its orbit. Suppose the star is observed on Earth with a telescope, idealized as a narrow tube. The light enters the tube from the star at angle [math]\displaystyle{ \theta }[/math] and travels at speed [math]\displaystyle{ c }[/math] taking a time [math]\displaystyle{ h/c }[/math] to reach the bottom of the tube, where it is detected. Suppose observations are made from Earth, which is moving with a speed [math]\displaystyle{ v }[/math]. During the transit of the light, the tube moves a distance [math]\displaystyle{ vh/c }[/math]. Consequently, for the particles of light to reach the bottom of the tube, the tube must be inclined at an angle [math]\displaystyle{ \phi }[/math] different from [math]\displaystyle{ \theta }[/math], resulting in an apparent position of the star at angle [math]\displaystyle{ \phi }[/math]. As the Earth proceeds in its orbit it changes direction, so [math]\displaystyle{ \phi }[/math] changes with the time of year the observation is made. The apparent angle and true angle are related using trigonometry as:
- [math]\displaystyle{ \tan(\phi) = \frac { h\sin(\theta)}{hv/c + h \cos (\theta)}=\frac { \sin(\theta)}{v/c + \cos (\theta)} }[/math].
In the case of [math]\displaystyle{ \theta = 90^\circ }[/math], this gives [math]\displaystyle{ \tan(\theta - \phi) = v/c }[/math]. While this is different from the more accurate relativistic result described above, in the limit of small angle and low velocity they are approximately the same, within the error of the measurements of Bradley's day. These results allowed Bradley to make one of the earliest measurements of the speed of light.[15][16]
Luminiferous aether
In the early nineteenth century the wave theory of light was being rediscovered, and in 1804 Thomas Young adapted Bradley's explanation for corpuscular light to wavelike light traveling through a medium known as the luminiferous aether. His reasoning was the same as Bradley's, but it required that this medium be immobile in the Sun's reference frame and must pass through the earth unaffected, otherwise the medium (and therefore the light) would move along with the earth and no aberration would be observed. [17] He wrote:
Upon consideration of the phenomena of the aberration of the stars I am disposed to believe that the luminiferous aether pervades the substance of all material bodies with little or no resistance, as freely perhaps as the wind passes through a grove of trees.—Thomas Young, 1804[1]
However, it soon became clear Young's theory could not account for aberration when materials with a non-vacuum index of refraction were present. An important example is of a telescope filled with water. The velocity of the light in such a telescope will be slower than in vacuum, and is given by [math]\displaystyle{ c/n }[/math] rather than [math]\displaystyle{ c }[/math] where [math]\displaystyle{ n }[/math] is the index of refraction of the water. Thus, by Bradley and Young's reasoning the aberration angle is given by
- [math]\displaystyle{ \tan(\phi) = \frac { \sin(\theta)}{v/(c/n) + \cos (\theta)} }[/math].
which predicts a medium-dependent angle of aberration. When refraction at the telescope's objective is taken into account this result deviates even more from the vacuum result. In 1810 François Arago performed a similar experiment and found that the aberration was unaffected by the medium in the telescope, providing solid evidence against Young's theory. This experiment was subsequently verified by many others in the following decades, most accurately by Airy in 1871, with the same result.[17]
Aether drag models
Fresnel's aether drag
In 1818, Augustin Fresnel developed a modified explanation to account for the water telescope and for other aberration phenomena. He explained that the aether is generally at rest in the Sun's frame of reference, but objects partially drag the aether along with them as they move. That is, the aether in an object of index of refraction [math]\displaystyle{ n }[/math] moving at velocity [math]\displaystyle{ v }[/math] is partially dragged with a velocity [math]\displaystyle{ (1-1/n^2)v }[/math] bringing the light along with it. This factor is known as "Fresnel's dragging coefficient". This dragging effect, along with refraction at the telescope's objective, compensates for the slower speed of light in the water telescope in Bradley's explanation.[lower-alpha 1] With this modification Fresnel obtained Bradley's vacuum result even for non-vacuum telescopes, and was also able to predict many other phenomena related to the propagation of light in moving bodies. Fresnel's dragging coefficient became the dominant explanation of aberration for the next decades.
Stokes' aether drag
However, the fact that light is polarized (discovered by Fresnel himself) led scientists such as Cauchy and Green to believe that the aether was a totally immobile elastic solid as opposed to Fresnel's fluid aether. There was thus renewed need for an explanation of aberration consistent both with Fresnel's predictions (and Arago's observations) as well as polarization.
In 1845, Stokes proposed a 'putty-like' aether which acts as a liquid on large scales but as a solid on small scales, thus supporting both the transverse vibrations required for polarized light and the aether flow required to explain aberration. Making only the assumptions that the fluid is irrotational and that the boundary conditions of the flow are such that the aether has zero velocity far from the Earth, but moves at the Earth's velocity at its surface and within it, he was able to completely account for aberration.[lower-alpha 2] The velocity of the aether outside of the Earth would decrease as a function of distance from the Earth so light rays from stars would be progressively dragged as they approached the surface of the Earth. The Earth's motion would be unaffected by the aether due to D'Alembert's paradox.
Both Fresnel and Stokes' theories were popular. However, the question of aberration was put aside during much of the second half of the 19th century as focus of inquiry turned to the electromagnetic properties of aether.
Lorentz' length contraction
In the 1880s once electromagnetism was better understood, interest turned again to the problem of aberration. By this time flaws were known to both Fresnel's and Stokes' theories. Fresnel's theory required that the relative velocity of aether and matter to be different for light of different colors, and it was shown that the boundary conditions Stokes had assumed in his theory were inconsistent with his assumption of irrotational flow.[1][17][18] At the same time, the modern theories of electromagnetic aether could not account for aberration at all. Many scientists such as Maxwell, Heaviside and Hertz unsuccessfully attempted to solve these problems by incorporating either Fresnel or Stokes' theories into Maxwell's new electromagnetic laws.
Hendrik Lorentz spent considerable effort along these lines. After working on this problem for a decade, the issues with Stokes' theory caused him to abandon it and to follow Fresnel's suggestion of a (mostly) stationary aether (1892, 1895). However, in Lorentz's model the aether was completely immobile, like the electromagnetic aethers of Cauchy, Green and Maxwell and unlike Fresnel's aether. He obtained Fresnel's dragging coefficient from modifications of Maxwell's electromagnetic theory, including a modification of the time coordinates in moving frames ("local time"). In order to explain the Michelson–Morley experiment (1887), which apparently contradicted both Fresnel's and Lorentz's immobile aether theories, and apparently confirmed Stokes' complete aether drag, Lorentz theorized (1892) that objects undergo "length contraction" by a factor of [math]\displaystyle{ \sqrt{1-v^2/c^2} }[/math] in the direction of their motion through the aether. In this way, aberration (and all related optical phenomena) can be accounted for in the context of an immobile aether. Lorentz' theory became the basis for much research in the next decade, and beyond. Its predictions for aberration are identical to those of the relativistic theory.[17][19]
Special relativity
Lorentz' theory matched experiment well, but it was complicated and made many unsubstantiated physical assumptions about the microscopic nature of electromagnetic media. In his 1905 theory of special relativity, Albert Einstein reinterpreted the results of Lorentz' theory in a much simpler and more natural conceptual framework which disposed of the idea of an aether. His derivation is given above, and is now the accepted explanation. Robert S. Shankland reported some conversations with Einstein, in which Einstein emphasized the importance of aberration:[20]
He continued to say the experimental results which had influenced him most were the observations of stellar aberration and Fizeau’s measurements on the speed of light in moving water. “They were enough,” he said.
Other important motivations for Einstein's development of relativity were the moving magnet and conductor problem and (indirectly) the negative aether drift experiments, already mentioned by him in the introduction of his first relativity paper. Einstein wrote in a note in 1952:[5]
My own thought was more indirectly influenced by the famous Michelson-Morley experiment. I learned of it through Lorentz’ path breaking investigation on the electrodynamics of moving bodies (1895), of which I knew before the establishment of the special theory of relativity. Lorentz’ basic assumption of a resting ether did not seem directly convincing to me, since it led to an [struck out: to me artificial appearing] interpretation of the Michelson-Morley experiment, which [struck out: did not convince me] seemed unnatural to me. My direct path to the sp. th. rel. was mainly determined by the conviction that the electromotive force induced in a conductor moving in a magnetic field is nothing other than an electric field. But the result of Fizeau’s experiment and the phenomenon of aberration also guided me.
While Einstein's result is the same as Bradley's original equation except for an extra factor of [math]\displaystyle{ \gamma }[/math], it should be emphasized that Bradley's result does not merely give the classical limit of the relativistic case, in the sense that it gives incorrect predictions even at low relative velocities. Bradley's explanation cannot account for situations such as the water telescope, nor for many other optical effects (such as interference) that might occur within the telescope. This is because in the Earth's frame it predicts that the direction of propagation of the light beam in the telescope is not normal to the wavefronts of the beam, in contradiction with Maxwell's theory of electromagnetism. It also does not preserve the speed of light c between frames. However, Bradley did correctly infer that the effect was due to relative velocities.
See also
Wikimedia Commons has media related to Aberration of light. |
- Apparent place
- Stellar parallax
- Astronomical nutation
- Proper motion
- Timeline of electromagnetism and classical optics
- Stellar aberration (derivation from Lorentz transformation)
Notes
- ↑ More in detail, Fresnel explains that the incoming light of angle [math]\displaystyle{ \theta }[/math] is first refracted at the end of the telescope, to a new angle [math]\displaystyle{ \psi }[/math] within the telescope. This may be accounted for by Snell's law, giving [math]\displaystyle{ \sin(\theta - \phi) = n \sin(\psi - \phi) }[/math]. Then drag must be accounted for. Without drag, the x and y components of the light in the telescope are [math]\displaystyle{ (c/n) \sin(\psi) }[/math] and [math]\displaystyle{ (c/n) \cos(\psi) }[/math], but drag modifies the x component to [math]\displaystyle{ (c/n) \cos(\psi) - (1-1/n^2)v }[/math] if the Earth moves with velocity [math]\displaystyle{ v }[/math]. If [math]\displaystyle{ \alpha }[/math] is angle and [math]\displaystyle{ v_l }[/math] is the velocity of the light with these velocity components, then by Bradley's reasoning [math]\displaystyle{ \tan(\phi) = \frac { h \sin(\alpha)}{v t + h \cos (\alpha)} }[/math] where [math]\displaystyle{ h }[/math] is the modified path length through the water and t is the time it takes the light to travel the distance h, [math]\displaystyle{ t = h/v_l }[/math]. Upon solving these equations for [math]\displaystyle{ \phi }[/math] in terms of [math]\displaystyle{ \theta }[/math] one obtains Bradley's vacuum result.
- ↑
Stokes' derivation may be summarized as follows: Consider a wavefront moving in the downwards z direction. Say the aether has velocity field [math]\displaystyle{ u,v,w }[/math] as a function of [math]\displaystyle{ x,y,z }[/math]. Now, motion of the aether in the x and y directions does not affect the wavefront, but the motion in the z direction advances it (in addition to the amount it advances at speed c). If the z velocity of the aether varies over space, for example if it is slower for higher x as shown in the figure, then the wavefront becomes angled, by an angle [math]\displaystyle{ \tan(\alpha) = tdw/dx }[/math]. Now, say in time t the wavefront has moved by a span [math]\displaystyle{ dz \approx c t }[/math] (assuming the speed of the aether is negligible compared to the speed of light). Then for each distance [math]\displaystyle{ dz }[/math] the ray descends, it is bent by an angle [math]\displaystyle{ \alpha \approx (dw/dx) (dz/c) }[/math], and so the total angle by which it has changed after travelling through the entire fluid is
- [math]\displaystyle{ \alpha \approx \frac{1}{c} \int \frac{\partial w}{\partial x} dz }[/math]
If the fluid is irrotational it will satisfy the Cauchy–Riemann equations, one of which is
- [math]\displaystyle{ \frac{\partial w}{\partial x} = \frac{\partial u}{\partial z} }[/math].
Inserting this into the previous result gives an aberration angle [math]\displaystyle{ \alpha = (u_2 - u_1)/c }[/math] where the [math]\displaystyle{ u }[/math]s represent the x component of the aether's velocity at the start and end of the ray. Far from the earth the aether has zero velocity, so [math]\displaystyle{ u_2 = 0 }[/math] and at the surface of the earth it has the earth's velocity [math]\displaystyle{ v }[/math]. Thus we finally get
- [math]\displaystyle{ \alpha \approx \frac{v}{c} }[/math]
which is the known aberration result.
References
- ↑ 1.0 1.1 1.2 1.3 Schaffner, Kenneth F. (1972), Nineteenth-century aether theories, Oxford: Pergamon Press, pp. 99–117 und 255–273, ISBN 0-08-015674-6
- ↑ Williams, M. E. W.. "Flamsteed's Alleged Measurement of Annual Parallax for the Pole Star". Journal for the History of Astronomy 10 (2): 102–116. doi:10.1177/002182867901000203. Bibcode: 1979JHA....10..102W.
- ↑ 3.0 3.1 Bradley, James (1727–1728). "A Letter from the Reverend Mr. James Bradley Savilian Professor of Astronomy at Oxford, and F.R.S. to Dr.Edmond Halley Astronom. Reg. &c. Giving an Account of a New Discovered Motion of the Fix'd Stars.". Phil. Trans. R. Soc. 35: 637–661. doi:10.1098/rstl.1727.0064.
- ↑ 4.0 4.1 4.2 Hirschfeld, Alan (2001). Parallax:The Race to Measure the Cosmos. New York, New York: Henry Holt. ISBN 0-8050-7133-4.
- ↑ 5.0 5.1 Norton, John D., John D. (2004), "Einstein's Investigations of Galilean Covariant Electrodynamics prior to 1905", Archive for History of Exact Sciences 59: 45–105, doi:10.1007/s00407-004-0085-6, Bibcode: 2004AHES...59...45N, archived from the original on 2009-01-11, https://web.archive.org/web/20090111103932/http://philsci-archive.pitt.edu/archive/00001743/
- ↑ Richard A. Mould (2001). Basic Relativity (2nd ed.). Springer. p. 8. ISBN 0-387-95210-1. https://books.google.com/?id=lfGE-wyJYIUC&pg=PA8.
- ↑ In fact, the light source doesn't need to be stationary, consider for example eclipsing binary stars: they are rotating with high speed —and ever changing and different velocity vectors— around each other, but they appear as one spot all the time.
- ↑ 8.0 8.1 Kovalevsky, Jean; Seidelmann, P. Kenneth (2004). Fundamentals of Astrometry. Cambridge: Cambridge University Press. ISBN 0-521-64216-7.
- ↑ 9.0 9.1 Newcomb, Simon (1960). A Compendium of Spherical Astronomy. Macmillan, 1906 – republished by Dover.
- ↑ 10.0 10.1 10.2 10.3 Eppenstein 1911, p. 54.
- ↑ Bradley, James; Rigaud, Stephen Peter (1832). Miscellaneous works and correspondence of the Rev. James Bradley, D.D., F.R.S. Oxford: University Press. p. 11.
- ↑ 12.0 12.1 Eppenstein 1911, p. 55.
- ↑ 13.0 13.1 Berry, Arthur (1961). A Short History of Astronomy. Dover.
- ↑ Hoiberg, Dale H., ed (2010). "aberration, constant of". Encyclopædia Britannica. I: A-ak Bayes (15th ed.). Chicago, IL: Encyclopædia Britannica Inc.. pp. 30. ISBN 978-1-59339-837-8.
- ↑ 15.0 15.1 James Bradley (1729). "An account of a new discovered motion of the fixed stars". Philosophical Transactions of the Royal Society 35: 637–661. doi:10.1098/rstl.1727.0064.
- ↑ Encyclopædia Britannica
- ↑ 17.0 17.1 17.2 17.3 Whittaker, Edmund Taylor (1910), A History of the theories of aether and electricity (1. ed.), Dublin: Longman, Green and Co., archived from the original on 2016-02-15, https://web.archive.org/web/20160215173309/https://archive.org/details/historyoftheorie00whitrich
Whittaker, Edmund Taylor (1953). A History of the Theories of Aether and Electricity (2. ed.). T. Nelson. - ↑ Janssen, Michel; Stachel, John (2010), "The Optics and Electrodynamics of Moving Bodies", in John Stachel, Going Critical, Springer, ISBN 1-4020-1308-6, http://www.mpiwg-berlin.mpg.de/Preprints/P265.PDF
- ↑ Darrigol, Olivier (2000), Electrodynamics from Ampére to Einstein, Oxford: Clarendon Press, ISBN 0-19-850594-9
- ↑ Shankland, R. S. (1963). "Conversations with Albert Einstein". American Journal of Physics 31 (1): 47–57. doi:10.1119/1.1969236. Bibcode: 1963AmJPh..31...47S.
Further reading
- P. Kenneth Seidelmann (Ed.), Explanatory Supplement to the Astronomical Almanac (University Science Books, 1992), 127-135, 700.
- Stephen Peter Rigaud, Miscellaneous Works and Correspondence of the Rev. James Bradley, D.D. F.R.S. (1832).
- Charles Hutton, Mathematical and Philosophical Dictionary (1795).
- H. H. Turner, Astronomical Discovery (1904).
- Thomas Simpson, Essays on Several Curious and Useful Subjects in Speculative and Mix'd Mathematicks (1740).
- :de:August Ludwig Busch, Reduction of the Observations Made by Bradley at Kew and Wansted to Determine the Quantities of Aberration and Nutation (1838).
External links
- Courtney Seligman on Bradley's observations