CHAPTER 5
SPECIAL RELATIVITY
Is
there any point to which you would wish to draw my attention?
To
the curious incident of the dog in the night-time.
The
dog did nothing in the night-time.
That
was the curious incident, remarked Sherlock Holmes.
The Memoirs of Sherlock Holmes (1893)
5.1 What Relativity Means
The discovery of Maxwell’s
equations elevated the description of electromagnetic phenomena to the same
level that the discovery of
We will use the term frame of reference a lot in the ensuing discussion. It typically refers to a set of space and time coordinates that a particular observer uses to depict the occurrence of events. You can think of an observer’s frame of reference as his or her laboratory, equipped with rulers, clocks and other measuring devices moving along with the observer and providing him or her with all the tools necessary to make a measurement and a framework within which the results of those measurements may be described. An inertial frame of reference is one that is not accelerated. At most, it moves at constant velocity relative to another, non-accelerated frame of reference.
Confused?
Do not be confused. Obviously, the behavior of mechanical phenomena
looks different when observed from different frames of reference, but the
underlying laws that describe the behavior do not. Aristotle was wrong when he argued that a
spear, thrown vertically upward, would get left behind if the Earth were
moving. He concluded that, since it
doesn’t, the Earth must be at rest.
Galileo destroyed that conclusion with the following argument that gets
to the heart of the matter … “Shut
yourself up with some friend in the main cabin below decks on some large ship,
and have you there some flies, butterflies, and other small flying
animals. Have a large bowl of water with
some fish in it; hang up a bottle that empties drop by drop into a wide vessel
beneath it. With the ship standing
still, observe carefully how the little animals fly with equal speed to all
sides of the cabin. The fish swim
indifferently in all directions; the drops fall into the vessel beneath; and,
in throwing something to your friend, you need throw it no more strongly in one
direction than another, the distances being equal; jumping with your feet
together, you pass equal spaces in every direction. When you have observed all these things
carefully . . . have the ship proceed with any speed you like, so long as the
motion is uniform and not fluctuating this way and that. You will discover not the least change in all
the effects named, nor could you tell from any of them whether the ship was
moving or standing still.”
Galileo Galilei
What is Galileo saying here? Imagine being inside a train that moving smoothly, free of all vibration, along a track at 60 mph. You flip a coin up in the air. You observe that it leaves your hand, goes straight up and then falls back down and you catch it. Note that, from your perspective, the coin has moved vertically but not horizontally. The behavior is exactly what you would observe if you performed the experiment outside the train on the Earth. In each case, you would use the same laws of physics to calculate the trajectory of the coin. In each case, the calculation would predict the same outcome in agreement with your observation.
Galileo’s words represent the first statement of a relativity principle. What he is saying is that you cannot perform an experiment with moving objects “inside” a particular frame of reference that can tell you whether or not the frame is in a state of constant motion. You can only detect that one frame of reference is in constant motion relative to another frame of reference. You cannot tell which frame is moving in some absolute sense. As we will discuss shortly in more detail, Maxwell’s equations describing electromagnetic phenomena seemed to imply otherwise. Einstein, struggling with this implication, was convinced that something was amiss. He firmly believed that Galileo’s notion was unequivocally true for all physical phenomena including electromagnetic and mechanical ones. With suitable modification, he turned Galileo’s observation into one of the fundamental premises of his theory of special relativity. If one could say that Galileo’s idea was an overture then Einstein’s work was the completion of the entire symphony. The modern basis of special relativity is not just the idea that the appearance of phenomena in different inertial frames of reference is identical, but that all laws of physics that describe the behavior of any natural phenomena are invariant, that is, the laws of physics take on the same mathematical form in different inertial frames of reference.
Let’s extend Galileo’s idea to
see what the phrase, invariance of the laws of physics, means. For example, what would the “Earth” observer
in the above example see when looking at the experiment of flipping the coin
performed by the “train” observer? The
Earth observer would see the coin travel not only up and down but horizontally
as well. Since the coin would leave the train observer’s hand and return to it
after the train had traveled some horizontal distance, the coin would have to
travel this same horizontal distance.
The coin, seen to travel only up and down by the observer in the train,
would travel along a parabolic trajectory as seen from by the observer attached
to the Earth. The behavior of the coin would look different to the two
observers. The appearance of the outcome of the experiment depends on the
relative motion of the two observers. But would they agree that its motion is
still describable in each frame of reference in terms of
5.2 Galilean
Transformations
… Absolute, true and mathematical time, of itself,
and from its own nature, flows equably without relation to anything
external."
We need to clarify what we mean by
the process of transforming
· Insert figure of coordinate system at rest and one moving at constant speed
The accompanying figure shows two frames of reference, one at “rest,” like the Earth in the above example, and one moving at constant speed u along the +x-direction, like the train. We denote position and time in the rest frame by the variables x and t and in the moving frame by x’ and t’. For the sake of simplicity, we will assume that all motion is restricted to just a single dimension --- along the x-axis. Suppose an object is moving in the +x direction. Its position and time at that position would be given by the values of x and t in the rest frame. Its position and time in the moving frame would be given by the corresponding values of x’ and t’. How would these observers “connect” their observations, that is, how would they tell each other where the object would be at any particular time?
Let’s construct a simple
example. Suppose two observers, each
with some kind of ruler and clock that allows each of them to measure positions
and the time of occurrence of events in their own frame of reference, are
located on “
The equations that relate these two different positions and times are called Galilean transformations. Expressed mathematically, the Galilean transformations that relate position and time in the two frames of reference are
The transformation that relates the speeds is given by
![]()
(Obviously the above – sign would be + if the relative direction of motion were changed.)
For example, pick a position x of the skateboarder at any time t as measured in the rest frame, say x = 10 blocks at t = 5 minutes. The transformations tell us that u’ = 1 block/minute, t’ = 5 minutes and x’ = 5 blocks. Note that the form of the Galilean transformations hinge on the assumption that both spatial and time intervals appear to be identical to observers in different inertial frames of reference. In other words, the length of rulers can be compared and clocks can be synchronized when two observers are together and then when one of those observers moves relative to the other, the two different sets of measuring instruments will continue to register correctly any ensuing length and time interval. It all seems reasonable.
The above simple example deals with the limited situation in which the velocity of the moving object, the skateboarder, is a constant. We now address situations in which the object might be accelerating, or changing its velocity. Suppose for example that the skateboarder has some way of gaining speed. Acceleration along a straight path is the rate of change of speed. In the rest frame, the acceleration of the skateboarder is a = Δu/Δt.[1] In the moving frame it is a’ = Δu’/Δt’. Since a time interval Δt’ in the moving frame is equal to a time interval Δt in the rest frame and u’ = u – v (remember, v is constant --- the moving frame of reference travels at a fixed velocity) a small change in the velocity u’ (Δu’) is the same as a small change in the velocity u (Δu) during any interval of time. Thus, accelerations in the two frames of reference are equal
For example, our fearless skateboarder could be picking up speed at the rate of 1 block/minute --- each minute. His acceleration would be a = a’ = 1 block/min2 --- the same in both frames of reference.
In order to complete the
exercise of transforming
![]()
Now, we transform the positions in the moving frame to those of the rest frame, obtaining
![]()
The distance between the
two objects is the same in each frame of reference.
![]()
… assuming that the masses of the mini black hole M and skateboarder m and the universal constant of gravitation G do not depend on any frame of reference.[2]
Any law of force more complicated than Newton’s law of gravitation would also be invariant as long as it depended only upon the relative positions (and/or velocities) of two interacting bodies. If, however, the law of force depended on absolute positions, i.e., if in the above example, the force between the mini black hole and the skateboarder contained terms like xb2 – x2, the force law would no longer be invariant! Nothing like this has ever been discovered. We know of no forces that would make the laws of physics different for an observer in the laboratory and one, say, in a moving train.
We conclude that the laws of motion are indeed invariant under the Galilean transformation. They take on the same mathematical form, F = ma (F’ = ma’) in all inertial frames of reference. Each observer uses the same laws of physics in order to calculate the resultant motion of objects subject to some force.
However, Maxwell’s
equations did not exhibit this feature of invariance. When using the Galilean transformations to
transform them from one inertial frame of reference to another, they changed
form! Maxwell’s theory led to the
unequivocal conclusion that light was a wave that traveled at a very specific
velocity whose value could be calculated from two constants that related directly
to the phenomena of electricity and magnetism.[3] This result implied that the equations of
electrodynamics as written down by Maxwell, unlike
Thus Maxwell’s theory of electromagnetic phenomena implied that different inertial frames of reference were not equivalent --- that all motion was not merely “relative” but that absolute motion did exist and that one ought to be able to find the preferred frame of reference in which the laws of physics were valid by detecting absolute motion through it. For example, an observer traveling in the same direction as a light wave ought to see it slow down, rather like an observer in a car moving at 55 mph looking at another traveling at 65 mph and seeing it recede at only 10 mph. Similarly, the measured speed of the receding light wave would be slower if an observer was trying to catch up with it and the reduction of its speed would determine one’s own absolute speed through the “luminiferous ether.” So it ought to be possible to devise an experiment that could detect changes in the velocity of light, in other words, build an “absolute speedometer,” but one would have to travel fast enough to see an effect and given the enormity of the speed of light, that would not be so easy to do. Nonetheless, in 1886, A. A. Michelson and Edward Morley set out to do just that.
5.3 There Is No Luminiferous Ether!
"… Absolute space, in its own nature, without
relation to anything external, remains always similar and immovable …"
Sir
Isaac Newton, Principia
By the late 19th century it had become evident that the ether would have to have very bizarre properties. First, it would have to be extremely rigid in order to propagate a disturbance as fast as the speed of light. On the other hand, it would have to permeate all of space and the material in that space as well, if the astronomer, James Bradley’s observation of stellar aberration, carried out in 1725, could be explained in terms of a wave theory of light. Bradley had measured a periodic shift in the apparent position of stars as the Earth orbited the Sun. This shift is rather like a runner observing a shift in the angle of rainfall as he or she runs though it. If the Earth dragged the ether along with it, or shoved it aside, in its motion around the Sun, this angle would not continually change as in fact it did. If the ether existed, it would have to pass right through the moving Earth, rather like a ghost passing through the walls of a house --- yet this ghost was an extremely rigid one.
On the other hand, scientists knew that light traveled through transparent media like water --- albeit at a speed slower than its speed in a vacuum. In 1818 the brilliant scientist J.A. Fresnel had suggested that flowing water might partially drag the ether along with it so that the speed of light through the water might depend on its speed of flow. H.L. Fizeau demonstrated the truth of this conjecture in an experiment he performed in 1851.
However, the idea of ether drag was difficult to reconcile with the measurement of the aberration of starlight. On the one hand, the ether is partially dragged along inside a moving transparent fluid if we are to understand Fizeau’s experiment, but that drag cannot be communicated to the ether that lies outside an opaque moving body like the Earth if we are to understand Bradley’s stellar aberration. The only way around this dilemma was to assume that the ether remains at rest in absolute space and that it passes completely through moving bodies. In the case of flowing transparent media, it must be the light wave that is partially dragged by flowing material and not the ether within it.
Thus, no matter how a
body moves, the ether itself always remains at rest in the “absolute space”
that
The device they built is one of the most famous pieces of experimental apparatus ever made. Its purpose was to enable Michelson and Morley to see whether a beam of light moving parallel to the Earth’s motion through the ether traveled at a speed different than that of a beam of light moving perpendicular to it. In essence, the device was designed to serve as a speedometer for the motion of the Earth through the ether. The device is called an interferometer and it was capable of making measurement of the highest precision ever achieved.
· Insert figure of interferometer
In the device, a single beam of light is split in two and each half is directed along one of two mutually perpendicular arms. One arm is oriented parallel to the Earth’s motion and the other perpendicular to it. At the end of each arm is a fixed mirror that reflects the two beams back to a common point where they can be observed with an eyepiece. The time of travel for a round trip is different for each beam if the speed of light depends on the Earth’s motion through the ether. Thus, the light waves will arrive somewhat out of phase and will interfere with each other. If the apparatus is rotated by 90 degrees, the interference pattern should change since each beam switches transit times with each other. The result of the Michelson-Morley experiment can be summed up with a quote from their first publication of the results of their experiment
“The
interpretation of these results is that there is no displacement of the
interference of the interference bands …
The
result of the hypothesis of the stationary ether is thus shown to be incorrect.”
It was, and perhaps still is, the most famous “null result” experiment ever performed. The Earth’s motion through the luminiferous ether could not be detected by looking for observed changes in the speed of light. There was none! Classical physics was in crisis!
5.4 The Postulates of Special Relativity
Even though the Michelson-Morley experiment was a classic and had been performed 18 years prior to Einstein’s publication of the special theory of relativity in 1905, there is great controversy surrounding the role it played in shaping Einstein’s resolution of the problems it created for the classical theory of light. He did not mention the Michelson-Morley experiment at all in his publication and when asked many years later about the basis upon which he built his theory, he claimed that he had no knowledge of Michelson-Morley prior to his publication. But there can be no doubt that he knew about it.[4] Indeed, in 1931 he said … the Michelson-Morley experiment …
“…
uncovered a serious defect in the ether theory of light, as it then existed,
and stimulated the ideas of H.A. Lorentz and Fitzgerald out of which the
special theory of relativity developed.” [5]