Chapter 2: Relativistic Mechanics

Introduction

We know that Newton’s laws of motion hold and are invariant in inertial frames of reference that are related by the Galilean transformation. But it is the Lorentz transformation rather than the Galilean transformation that is correct. That means that the laws of classical mechanics cannot be correct, and we must find new laws of relativistic mechanics.

In doing so, we will be guided by 3 principles:

A correct relativistic law must hold in all inertial frames, i.e., it must be invariant under the Lorentz transformation.
Relativistic definitions and laws must reduce to their nonrelativistic counterparts when applied to systems moving much slower than the speed of light.
Our relativistic laws must agree with experiment.

Mass

How do we define mass m? We know that at slow speeds, mass can be defined classically by m = F∕a. In relativity we will measure the mass m of an object in its rest frame. This is called the rest mass or the proper mass. Observers in different rest frames could measure the mass of an object by bringing the object to rest in their frame of reference, and then measuring its mass. The mass they would measure would be the same in all inertial reference frames.

Relativisitic Momentum

Classically, momentum is defined by

⃗p = m ⃗u

(1)

where m and the mass and velocity of an object. What is the correct relativistic definition of momentum? We have the freedom to define momentum any way we like, but to be useful, we would like to define it such that total momentum is conserved in all inertial frames.

Recall the law of momentum conservation. If there are n bodies with momenta ₁, ..., _n, then, in the absence of external forces, the total momentum is given by

∑ ⃗p = ⃗p1 + ...+ ⃗pn

(2)

cannot change. We would like to define momentum relativistically to preserve conservation of momentum.

As your book describes in section 2.3, we can’t use the classical definition Eq. (1) because of the way velocity transforms between reference frames. For example, the way u_y transforms in going from S to S^′ depends on u_x. The classical definition of momentum is

d⃗r- ⃗p = m⃗u = m dt

(3)

The problem is that both and t are subject to the Lorentz transformation and that makes things messy.

The relativistic way to define momentum is

⃗p = m -d⃗r dt0

(4)

where t₀ is the proper time, i.e., the time measured in the rest frame of the particle or object. Notice that with this definition,

dy- py = m dt0

(5)

which is invariant when one goes between reference frames that are moving relative to one another along the x-axis. That is because t₀ is invariant and y = y^′. Similarly for p_z.

Since dt = γdt₀ where γ = 1∕ ∘ -----2--2-
1 - u ∕c , we can write

∘---m⃗u----- ⃗p = 1 - u2∕c2 = γm ⃗u

(6)

A constant force changes the momentum and hence the velocity. Notice that as u → c, γ and hence p increase without limit, but the speed u never reaches the speed of light (see plot of u versus p in Fig 2.2 of your book). Thus no object can go faster than the speed of light.

u˙vs˙p.eps

Figure 1: Velocity versus momentum.

We can define a relativistic mass M(u) by

-----M----- M (u) = ∘ -----2--2-= M γ 1 - u ∕c

(7)

Your book calls this a variable mass m_var, but it doesn’t want to use this. However, it does make it easy to write the momentum in the traditional form

⃗p = M (u)⃗u

(8)

Notice from Eq. (7) that if the speed u of an object with nonzero mass is equal to the speed of light, the relativistic mass is infinity. If the object’s speed exceeds the speed of light, then the relativistic mass M(u) is imaginary. Clearly, this is absurd and so, once again, the velocity of an object cannot exceed the speed of light. If the object has mass, it cannot travel at the speed of light; its speed must be less than the speed of light.

Relativistic Energy

Now we need to define a relativistic energy. We will use 2 criteria:

When applied to slowly moving bodies, the new definition reduces to the classical definition.
The total energy ∑ E of an isolated system should be conserved in all reference frames.

The relativistic energy that satisfies these requirements turns out to be

mc2 E = ∘-----------= γmc2 1 - u2∕c2

(9)

This applies to any single body, no matter how big or how small. Notice that the units are right. (It’s always important to check units.) γ is dimensionless, and mc² has units of energy. (Recall that kinetic energy is (1∕2)mv².)

Now let’s take the nonrelativistic (slow) limit u ≪ c. Then

( 2)- 1∕2 2 γ = 1 - u-- ≈ 1 + 1-u-- c2 2 c2

(10)

Therefore, when u ≪ c,

( ) 2 1u2 2 2 1 2 E = γmc ≈ 1 + 2c2- mc = mc + 2-mu

(11)

The first term is a constant independent of the speed u, and we can always add a constant to the energy since we can set the zero of the energy (E = 0) anywhere. The second term is just the usual classical expression for the kinetic energy. So our definition of the relativistic energy satisfies the criterion of reducing to the classical kinetic energy in the nonrelativistic (slow) limit.

Now let’s look at that constant mc². If the object is at rest, then u = 0, γ = 1, and Eq. (9) reduces to the famous equation

2 E = mc

(12)

This is called the rest energy of the mass m. If we could convert a mass m completely into energy, then this is the amount of energy we would get. It’s a lot of energy because the speed of light c is so large, and c² is even larger. For example, the rest energy of a 1 kg lump of metal is

E = mc2 = (1 kg) × (3 × 108 m∕s )2 = 9 × 1016 joules

(13)

which is about the energy generated by a large power plant in one year. E = mc² is the basic equation behind nuclear power, and atomic bombs. These are powered by nuclear fission in which, typically, uranium 235 nuclei are split apart, converting about 1/1000 of the rest energy into heat. Thus 1 kg of ²³⁵U can yield a fantastic amount of heat (9 × 10¹³ J).

When an object is not at rest, its energy E = γmc² is the sum of its rest energy mc² plus its kinetic energy K = (E - mc²):

E = mc2 + K

(14)

where

K = E - mc2 = (γ - 1)mc2

(15)

In the nonrelativistic limit K ≈ 1
2 mu², the classical kinetic energy. However, in the relativistic limit of u → c, we get something quite different. Because γ can approach infinity, K can approach infinity even though the speed u can never reach the speed of light c. Notice that K, like its classical counterpart, is always positive (K ≥ 0).

Two Useful Relations

Let’s derive 2 useful relations starting from

⃗p = γm ⃗u

(16)

and

E = γmc2

(17)

If we divide Eq. (16) by Eq. (17), we obtain

-⃗p = ⃗u- E c2

(18)

⃗u ⃗pc ⃗β = c-= -E-

(19)

which give the dimensionless velocity = ∕c in terms of and E. Notice that c has dimensions of energy, so the ratio c∕E is dimensionless, just like ⃗
β . Eq. (19) is one useful relation.

The other is

( )2 E2 = (pc)2 + mc2

(20)

This has the same form as the Pythagorean relation for a right triangle with sides pc and mc² and hypotenuse E, though there isn’t any deep meaning to this. To show that this is correct, we can plug in E = γmc² and p = γmv, solve for γ² and show that this reduces to the definition γ² = (1 - v²∕c²)^-1.

Units

Since we are dealing with small things, we should use appropriate units. For example, if I ask how tall you are, you would tell me so many feet and so many inches. You could give me your height in miles or light years, but those are not appropriate units. Similarly, joules is not an appropriate unit for atomic and subatomic particles like electrons. The commonly used unit is the electron volt or eV. This is defined as the amount of work needed to move an electron (q = -e = -1.60 × 10^-19 Coulomb) through a voltage drop of 1 volt (ΔV = -1 volt); thus

( -19 ) -19 1 eV = qΔV = - 1.60 × 10 C × (- 1 V olt) = 1.60 × 10 J

(21)

Typical atomic energies are on the order of 1 eV or so, while those in nuclear physics are on the order of 10⁶ eV = 1 MeV.

Masses of particles are often given in eV with the understanding that they are referring to the rest energy mc². Techically, the mass is properly in units of eV/c². So, for example, the “mass” of the electron is given as 0.511 MeV, meaning mc² = 0.511 MeV. Similarly, when momentum p is given in units of MeV, they really mean the quantity pc, or the units of [p] is eV/c.

If you look up a table for the mass of atoms, the mass is often given in atomic mass units (denoted u). The conversion is

- 27 1 u = 1.66 × 10 kg

(22)

Conversion of Mass to Energy

The famous equation E = mc² implies that if you can convert mass into energy, then you would get a lot of energy because the speed of light squared is so big. So does matter get converted to energy? The answer is yes. An example where this happens for atomic nuclei is when atomic nuclei break apart, either because they are unstable (radioactive) or because they are forced apart by an incoming particle (e.g., a neutron) in a process called fission. This is the principle behind atomic power plants like San Onofre.

Let’s suppose that a nucleus at rest has a mass M, so that the initial energy is Mc². Then suppose the nucleus breaks apart due to radioactive decay in which pieces of the nucleus go flying apart. The energy then is a combination of the kinetic energy K of the pieces and their rest energy. Suppose that the nucleus breaks into 2 pieces. Since total energy (including the rest mass energy) is conserved, we have

M c2 = E1 + E2 = (K + m c2) + (K + m c2) 1 1 22 22 = (K1 + K2 ) + (m1c + m2c ) (23)

The sum of the masses of the pieces (m₁ + m₂) is less than the initial mass M, so there is a difference between the initial mass and the total final mass:

2 2 2 ΔM c = M c - (m1 + m2 )c

(24)

The missing mass is converted into the kinetic energy of the fragments flying apart:

ΔM c2 = K + K 1 2

(25)

For example, your book considers the radioactive decay of ²³²Th via the reaction

232Th →228 Ra +4 He

(26)

The sum of the masses of ²²⁸Ra and ⁴He nuclei is 0.004 u smaller than the original ²³²Th nucleus. Since 1 u = 1.66 × 10^-27 kg, this corresponds to 7 × 10^-30 kg. This missing mass is converted into kinetic energy because the products (²²⁸Ra and ⁴He) fly apart. The amount of kinetic energy is

( )( ) ΔM c2 = 7 × 10-30 kg 3 × 108 m ∕s 2 = 4 M eV (27)

If we reversed the process and forced the 2 pieces together to form the original nucleus, it would take work. The amount of energy we would have to expend would be ΔMc². In other words, energy we put in would be converted into the additional mass so that the rest mass M of the final nucleus would be greater than the sum m₁ + m₂.

So sometimes we need to add energy to merge or fuse 2 pieces together. However, sometimes, the opposite can happen. In particular, energy can be released when 2 things are brought together to form a more stable (lower energy) thing. For example, if we bring together an electron and a proton to form a hydrogen atom, then 13.6 eV is released. This is called the binding energy since is the amount of energy that would be required to pull apart the proton and electron in a hydrogen atom. If we add the mass m_e of a free electron (free means it is floating around, and is not bound inside an atom), and the mass m_p a free proton, the sum is 13.6 eV/c² greater than the mass M_H of a hydrogen atom. This is another example of mass being converted into energy. In symbols:

2 2 2 mec + mpc = MH c + B

(28)

where B is the binding energy. So the binding energy corresponds to the rest energy of the missing mass:

( 2 2) 2 B = mec + mpc - M c

(29)

Similarly, when atoms are combined to form stable molecules, binding energy is released. Energy is also released when nuclei are forced together to form bigger stable nuclei in a process called fusion. For example, 4 hydrogen atoms can be fused to make 1 helium atom. This releases binding energy. This is what powers the sun and hydrogen bombs.

Example: Λ Decay

The Λ particle is a subatomic particle that can spontaneously decay into a proton and a negatively charged pion.

- Λ → p + π

(30)

The sum of the mass of the proton and the pion will be less than that of the Λ. In a certain experiment, the outgoing particles were traveling in the +x direction with momenta p_p = 581 MeV/c and p_π=256 MeV/c. The rest masses are m_p = 938 Mev/c² and m_π = 140 MeV/c². Find the rest mass m_Λ of the Λ.

To solve this problem, let us start with

E2 = (pc)2 + (mc2 )2

(31)

Solving for the mass of the Λ gives

∘ ------------ E2 - (p c)2 m Λ = ----Λ-----Λ--- c2

(32)

So we need the energy and the momentum of the Λ to find the mass. First we find the energy using Eq. (31)

E2 = (pc)2 + (mc2 )2

(33)

This gives E_p = 1103 MeV and E_π = 292 MeV. So

E Λ = Ep + E π = 1395 M eV

(34)

Conservation of momentum gives

p = p + p = 837 M eV ∕c Λ p π

(35)

where the momenta are all pointing along the positive x axis. Now that we know E_Λ and p_Λ, we can Eq. (32) to find the mass:

∘ ------------ E2 - (p c)2 m Λ = ----Λ--2--Λ---= 1116 M eV ∕c2 c

(36)

This is how many masses of unstable subatomic particles are measured.

Force in Relativity

There are 2 definitions of force in classical mechanics:

F = ma

(37)

and

F = dp- dt

(38)

These are equivalent because p = mu, so dp∕dt = ma. But in relativity, p = γmu, so dp∕dt≠ma. In relativity, the most convenient definition of force turns out to be

dp- F = dt

(39)

Example 2.9 shows that this definition of force allows the work-energy theorem to remain valid. Namely,

F ⋅ dr = dE

(40)

This says that if a mass m, acted on by a force F, moves a small distance dr, the change in its energy, dE, equals the work done by F.

This definition of force allows us to keep the Lorentz force law intact:

F = q(E + u × B )

(41)

One easy application of this equation is when a particle with charge q moves with velocity u perpendicular to a uniform field B. The particle’s energy is constant because no work is done on it. To see this, look at the work-energy theorem. It turns out that the force is perpendicular to the direction of motion of the particle because the particle moves in a circle and the force points radially in towards the center of the circle. Therefore, since the energy (E = γmc²) is constant, the velocity is constant in magnitude and only changes direction. To see that it moves in a circle, start with

dp-= qu × B dt

(42)

which we can write as

du- γm dt = qu × B.

(43)

Use acceleration a = du∕dt and the fact that u is perpendicular to B to write

γma = quB

(44)

For motion in a circle of radius R, the centripetal acceleration a = u²∕R. The reason is the same in relativity as in classical mechanics. Using γmu = p, we can solve for R and write

R = -p- qB

(45)

p = qBR

(46)

This provides a convenient way to measure the momentum of a particle of known charge q. We know the applied field B and we can measure the radius R of the circle.

For example, suppose we want to find the momentum of a proton moving perpendicular to a uniform magnetic field B = 1.0 tesla (T) in a circle of radius 1.4 m. We plug this into Eq. (46) using q = e = 1.6 × 10^-19 C. This gives

- 19 p = qBR = (1.6 × 10 C ) × (1.0 T ) × (1.4 m) = 2.24 × 10-19 kg - m ∕s (47)

Using the conversion 1 MeV/c=5.34 × 10^-22 kg-m/s, we can change units:

p = (2.24 × 10-19 kg - m ∕s) × ------1-M-eV-∕c------- = 420 M eV ∕c. 5.34 × 10-22 kg - m ∕s

(48)

Since the proton’s rest mass is known to be m = 938 MeV/c², we can find its energy from the “Pythagorean relation” to be

∘ -------------- 2 2 2 E = (pc ) + (mc ) = 1030 M eV

(49)

Massless Particles

Is it possible to have massless particles? Classically, the answer is “no.” Momentum (p = mu) and kinetic energy (KE = mu²∕2) are proportional to the mass m. If m = 0, then p = E = 0. One might think the same would be true relativistically since p = γmu and E = γmc². Again, p and E are proportional to m and m = 0. But what if the massless particle travels at the speed of light? Then γ = ∞, and γm = 0 ⋅∞ which is ill-defined.

It’s better to go back to

2 2 2 2 E = (pc) + (mc )

(50)

and

β = u-= pc- c E

(51)

If m = 0, then these equations reduce to

E = pc

(52)

and

pc u β = ---= 1 = -- or u = c E c

(53)

So the massless particle travels at the speed of light. Conversely, if we discovered a particle that traveled at speed c, then β = u∕c would equal 1=pc∕E. That would mean E = pc and hence, that m = 0, i.e., it would be a massless particle. So massless particles travel at the speed of light, and particles that travel at c must be massless. Particles with nonzero mass must travel at less than the speed of light.

Do massless particles exist? Yes. The best known massless particle is the photon which is a particle of light. As we will discuss more later, light can be described in 2 ways: (1) as an electromagnetic wave, and (2) as particles or packets of energy called photons.

To find the energy and momentum of a massless particle, we look at a reaction that it is involved in, e.g., photoionization of hydrogen in which a photon hits the electron in a hydrogen atom, causing the electron to be ejected:

γ + H → e + p

(54)

where γ is the symbol for a photon. We measure the momentum and energy of the particles with mass, then use conservation of energy to deduce the energy and momentum of the massless particle.

Another example of a massless particle is the graviton but there is no experimental evidence for the graviton. It used be accepted that the neutrino had no mass, but now there is experimental evidence that it does have a little bit of mass.

Example: Electron-Positron Annihilation

You’ve probably heard that when matter and antimatter collide, they annihilate and release a lot of energy. The positron is an anti-electron, i.e., it’s the antimatter version of an electron. It is exactly like an electron except that it has a positive charge (+e). The positron has the same mass as an electron. Suppose a positron and an electron are both at rest with no momentum. So they only have their rest mass energies: m_ec² and m_pc² = m_ec². Then when they annihilate, 2 photons are produced. Conservation of momentum requires that their total momentum is zero, and the sum of their total energy is 2m_ec²:

p + p = 0 1 2

(55)

and

E1 + E2 = 2mec2

(56)

So the photons that are created must have equal and opposite momenta:

p1 = - p2

(57)

Since E = pc for photons, the photons must have equal energies: E₁ = E₂, and

E1 = E2 = mc2 = 0.511 M eV

(58)

So all the rest mass energy is converted into electromagnetic energy.

Positrons are used in PET scans. PET stands for Positron Emission Tomography. This is a medical diagnostic probe in which a patient is injected with a radioactive positron emitter (like carbon 11) in a suitably chosen solution like glucose (sugar) that will tend to collect in the brain, for example. Once in the brain, the emitted positrons annihilate with nearby electrons, and photons are emitted that are detected by a ring of detectors. In this way, it is possible to create an accurate map of the areas of interest.

When is nonrelativistic mechanics good enough?

The relativistic equations all reduce to their classical counterparts in the limit of low speeds (v ≪ c). When should one use the relativistic formulae and when is it good enough to use the classical equations? There is no clear-cut answer. It depends on what you need it for. For example, GPS (Global Positioning System) needs an accuracy of 1 ns, so relativistic effects must be taken into account. Table 2.4 of your book compares the values of the kinetic energy calculated with relativistic and classical formulas for various values of the velocity. As a general rule of thumb, you can ignore relativisitic effect if the speeds are 0.1c or less, or if a particle’s kinetic energy K is much less than 1% of its rest energy. (Recall that K goes as u², so this is why a speed of 0.1c corresponds to 0.01% of the energy.) For speeds greater than 0.1c or kinetic energies greater than 0.01mc², relativistic effects probably need to be taken into account, and relativistic equations should probably be used.