MOTION MOUNTAIN
the adventure of physics
www.motionmountain.net
Christoph Schiller
Motion Mountain
Twentieth revision.
τῷ ἐµοὶ δαὶµονι
Die Menschen stärken, die Sachen klären.
Contents
Preface 20
An appetizer 23
Third Part : Motion Withou t Motion – What Are Space, Time and
Particles?
Should one use force? 149 • Complete states – initial conditions 155 • Do surprises
exist? Is the future determined? 156 • A strange summary about motion 160
161 Bibliography
175 3. Global descriptions of motion – the simplicity of complexity
178 Measuring change with action
The principle of least action 181 • Why is motion so often bounded? 185 • Curios-
ities and fun challenges about Lagrangians 188
191 Motion and symmetry
Why can we think and talk? 191 • Viewpoints 192 • Symmetries and groups 194
• Representations 194 • Symmetries, motion and Galilean physics 197 • Reprodu-
cibility, conservation and Noether’s theorem 200 • Curiosities and fun challenges
about motion symmetry 205
205 Simple motions of extended bodies – oscillations and waves
10 contents
Waves and their motion 208 • Why can we talk to each other? – Huygens’ prin-
ciple 212 • Signals 214 • Solitary waves and solitons 215 • Curiosities and fun
challenges about waves and extended bodies 217
222 Do extended bodies exist?
Mountains and fractals 223 • Can a chocolate bar last forever? 224 • How high
can animals jump? 225 • Felling trees 226 • The sound of silence 227 • Little
hard balls 227 • The motion of fluids 230 • Curiosities and fun challenges about
fluids 231 • Curiosities and fun challenges about solids 237
240 What can move in nature?
241 How do objects get warm?
242 Temperature
Entropy 244 • Flow of entropy 246 • Do isolated systems exist? 247 • Why do
balloons take up space? – The end of continuity 247 • Brownian motion 249 •
Entropy and particles 251 • The minimum entropy of nature – the quantum of in-
formation 252 • Why can’t we remember the future? 254 • Is everything made of
• Relativistic films – aberration and Doppler effect 310 • Which is the best seat in a
bus? 313 • How fast can one walk? 313 • Is the speed of shadow greater than the
speed of light? 314 • Parallel to parallel is not parallel – Thomas rotation 317 • A
never-ending story – temperature and relativity 318
318 Relativistic mechanics
Mass in relativity 318 • Why relativistic snooker is more difficult 320 • Mass is
concentrated energy 321 • Collisions, virtual objects and tachyons 323 • Systems
of particles – no centre of mass 325 • Why is most motion so slow? 325 • The
history of the mass–energy equivalence formula of De Pretto and Einstein 326 • 4-
vectors 327 • 4-momentum 330 • 4-force 331 • Rotation in relativity 332 • Wave
motion 333 • The action of a free particle – how do things move? 334 • Conformal
transformations – why is the speed of light constant? 335
337 Accelerating observers
contents 11
ducing universal gravity 438 • Deducing linearized general relativity 439 • How to
calculate the shape of geodesics 439 • Mass in general relativity 441 • Is gravity an
interaction? 441 • The essence of general relativity 442 • Riemann gymnastics 443
• Curiosities and fun challenges about general relativity 445
445 9. Why can we see the stars? – Motion in the universe
Which stars do we see? 446 • What do we see at night? 447 • What is the uni-
verse? 453 • The colour and the motion of the stars 455 • Do stars shine every
night? 457 • A short history of the universe 457 • The history of space-time 461
• Why is the sky dark at night? 466 • Is the universe open, closed or marginal? 468
• Why is the universe transparent? 469 • The big bang and its consequences 470
• Was the big bang a big bang? 471 • Was the big bang an event? 471 • Was the
big bang a beginning? 471 • Does the big bang imply creation? 472 • Why can we
see the Sun? 473 • Why are the colours of the stars different? 474 • Are there dark
12 contents
stars? 475 • Are all stars different? – Gravitational lenses 475 • What is the shape
of the universe? 477 • What is behind the horizon? 478 • Why are there stars all
over the place? – Inflation 479 • Why are there so few stars? – The energy and en-
tropy content of the universe 479 • Why is matter lumped? 480 • Why are stars so
small compared with the universe? 481 • Are stars and galaxies moving apart or is
the universe expanding? 481 • Is there more than one universe? 481 • Why are the
stars fixed? – Arms, stars and Mach’s principle 481 • At rest in the universe 483 •
Does light attract light? 483 • Does light decay? 484
484 10. Black holes – falling forever
Why study black holes? 484 • Horizons 485 • Orbits 487 • Hair and entropy 490
• Black holes as energy sources 492 • Curiosities and fun challenges about black
holes 493 • Formation of and search for black holes 496 • Singularities 497 • A
quiz – is the universe a black hole? 498
499 11. Does space differ from time?
Can space and time be measured? 500 • Are space and time necessary? 501 •
magnetic fields? 565 • Could electrodynamics be different? 566 • The toughest chal-
lenge for electrodynamics 567 • Challenges and fun curiosities about electrodyna-
mics 568
568 14. What is light?
Polarization and electromagnetism 572 • The slowness of progress in physics 577
• Another look at electromagnetic radiation 578 • How does the world look when
riding on a light beam? 579 • Does light always travel in a straight line? 580 • The
concentration of light 584 • Can one touch light? 585 • War, light and lies 587 •
What is colour? 588 • What is the speed of light? – Again 591 • 200 years too late
– negative refraction indices 593 • Signals and predictions 594 • Does the aether
exist? 595 • Challenges and fun curiosities about light 596
597 Images and the eye
How to prove you’re holy 597 • Do we see what exists? 597 • How does one make
contents 13
pictures of the inside of the eye? 600 • How does one make holograms and other
three-dimensional images? 601 • Imaging 602 • Why can we see each other? 603 •
Light as weapon? 605 • Challenges and fun curiosities about the eye and images 606
607 15. Charges are discrete – the limits of classical electrodynamics
How fast do charges move? 608 • Challenges and curiosities about charge discrete-
ness 608
610 16. Electromagnetic effects
Is lightning a discharge? – Electricity in the atmosphere 610 • Does ball lightning
exist? 612 • Does gravity make charges radiate? 613 • Research questions 614 •
Levitation 615 • Matter, levitation and electromagnetic effects 618 • Challenges
and fun curiosities about electromagnetic effects 625
628 A summary of classical electrodynamics and of its limits
628 17. Classical physics in a nutshell – one and a half steps out of three
The future of planet Earth 630 • The essence of classical physics – the infinitely small
implies the lack of surprises 632 • Why have we not yet reached the top of the moun-
so light? 881
882 Curiosities and fun challenges of quantum electrodynamics
How can one move on perfect ice? – The ultimate physics test 886
887 Summary of quantum electrodynamics
Open questions in QED 889
890 27. Quantum mechanics with gravitation – the first approach
Corrections to the Schrödinger equation 891 • A rephrased large number hypo-
thesis 891 • Is quantum gravity necessary? 892 • Limits to disorder 892 • Measur-
ing acceleration with a thermometer – Fulling–Davies–Unruh radiation 893
894 Black holes aren’t black
Gamma ray bursts 897 • Material properties of black holes 899 • How do black
holes evaporate? 900 • The information paradox of black holes 900 • More para-
16 contents
doxes 901
902 Quantum mechanics of gravitation
The gravitational Bohr atom 902 • Decoherence of space-time 903 • Do gravitons
exist? 903 • Space-time foam 904 • No particles 905 • No science fiction 905 •
Not cheating any longer 905
906 Bibliography
1088 36. Maximum force and minimum distance – physics in limit statements
1088 Fundamental limits to all observables
Special relativity in one statement 1088 • Quantum theory in one statement 1089 •
General relativity in one statement 1091 • Deducing general relativity 1092 • De-
ducing universal gravitation 1095 • The size of physical systems in general relativ-
ity 1095 • A mechanical analogy for the maximum force 1095
1096 Units and limit values for all physical observables
Limits to space and time 1098 • Mass and energy limits 1098 • Virtual particles –
a new definition 1099 • Limits in thermodynamics 1099 • Electromagnetic limits
and units 1100 • Vacuum and mass–energy – two sides of the same coin 1101 •
Curiosities and fun challenges about Planck limits 1102
1104 Upper and lower limits to observables
Size and energy dependence 1105 • Angular momentum, action and speed 1105 •
18 contents
Force, power and luminosity 1106 • Acceleration 1107 • Momentum 1107 • Life-
time, distance and curvature 1107 • Mass change 1108 • Mass and density 1108
• The strange charm of the entropy bound 1108 • Temperature 1110 • Electro-
magnetic observables 1111 • Curiosities and fun challenges about limits to observ-
ables 1111
1112 Limits to measurement precision and their challenge to thought
Measurement precision and the existence of sets 1113 • Why are observers
needed? 1114 • A solution to Hilbert’s sixth problem 1114 • Outlook 1115
1116 Bibliography
1119 37. The shape of points – extension in nature
Introduction – vacuum and particles 1120 • How else can we show that matter and
vacuum cannot be distinguished? 1121
1122 Argument 1: The size and shape of elementary particles
Do boxes exist? 1123 • Can the Greeks help? – The limits of knives 1123 • Are cross
sections finite? 1123 • Can one take a photograph of a point? 1124 • What is the
1155 Ch a pter XIII The Top of the Mountain (not yet available)
“ Antiquity
The intensity with which small children explore their environment suggests that there is ”
a drive to grasp the way the world works, a ‘physics instinct’, built into each of us. What
would happen if this drive, instead of being stifled during school education, as it usually
is, were allowed to thrive in an environment without bounds, reaching from the atoms
to the stars? Probably most adolescents would then know more about nature than most
senior physics teachers today. This text tries to provide this possibility to the reader. It
acts as a guide in an exploration, free of all limitations, of physics, the science of motion.
The project is the result of a threefold aim I have pursued since 1990: to present the basics
of motion in a way that is simple, up to date and vivid.
In order to be vivid, a text must be challenging, questioning and daring. This text tries
to startle the reader as much as possible. Reading a book on general physics should be like
going to a magic show. We watch, we are astonished, we do not believe our eyes, we think,
and finally – maybe – we understand the trick. When we look at nature, we often have
the same experience. The text tries to intensify this by following a simple rule: on each
page, there should be at least one surprise or provocation for the reader to think about.
Numerous interesting challenges are proposed. Hints or answers to these are given in an
appendix.
The strongest surprises are those that seem to contradict everyday experience. Most of
the surprises in this text are taken from daily life: in particular, from the things one exper-
iences when climbing a mountain. Observations about trees, stones, the Moon, the sky
and people are used wherever possible; complex laboratory experiments are mentioned
only where necessary. These surprises are organized so as to lead in a natural way to the
most extreme conclusion of all, namely that continuous space and time do not exist. The
concepts of space and time, useful as they may be in everyday life, are only approxima-
tions. Indeed, they turn out to be mental crutches that hinder the complete exploration
of the world.
Giving full rein to one’s curiosity and thought leads to the development of a strong
and dependable character. The motto of the text, a famous statement by Hartmut von
Hentig on pedagogy, translates as: ‘To clarify things, to fortify people.’ Exploring any limit
requires courage; and courage is also needed to abandon space and time as tools for the
description of the world. Changing habits of thought produces fear, often hidden by anger;
but we grow by overcoming our fears. Achieving a description of the world without the
use of space and time may be the most beautiful of all adventures of the mind.
In my experience as a teacher, there was one learning method that never failed to trans-
form unsuccessful pupils into successful ones: if you read a book for study, summarize
every section you read, in your own words, aloud. If you are unable to do so, read the sec-
tion again. Repeat this until you can clearly summarize what you read in your own words,
aloud. You can do this alone in a room, or with friends, or while walking. If you do this
with everything you read, you will reduce your learning and reading time significantly.
In addition, you will enjoy learning from good texts much more and hate bad texts much
less. Masters of the method can use it even while listening to a lecture, in a low voice, thus
avoiding to ever take notes.
A request
The text is and remains free for everybody. In exchange for getting the file for free, please
send me a short email on the following issues:
Copyright © Christoph Schiller November 1997–January 2007
Many people who have kept their gift of curiosity alive have helped to make this project
come true. Most of all, Saverio Pascazio has been – present or not – a constant reference
for this project. Fernand Mayné, Anna Koolen, Ata Masafumi, Roberto Crespi, Serge
Pahaut, Luca Bombelli, Herman Elswijk, Marcel Krijn, Marc de Jong, Martin van der
Mark, Kim Jalink, my parents Peter and Isabella Schiller, Mike van Wijk, Renate Georgi,
Paul Tegelaar, Barbara and Edgar Augel, M. Jamil, Ron Murdock, Carol Pritchard and,
most of all, my wife Britta have all provided valuable advice and encouragement.
Many people have helped with the project and the collection of material. Most use-
ful was the help of Mikael Johansson, Bruno Barberi Gnecco, Lothar Beyer, the numer-
ous improvements by Bert Sierra, the detailed suggestions by Claudio Farinati, the many
improvements by Eric Sheldon, the advice of Andrew Young, the continuous help and
advice of Jonatan Kelu, the corrections of Elmar Bartel, and in particular the extensive,
”
W hat is the most daring and amazing journey we can make in a lifetime?
e can travel to remote places, like adventurers, explorers or cosmonauts;
e can look at even more distant places, like astronomers; we can visit the past, like
historians, archaeologists, evolutionary biologists or geologists; or we can delve deeply
into the human soul, like artists or psychologists. All these voyages lead either to other
places or to other times. However, we can do better.
The most daring trip of all is not the one leading to the most inaccessible place, but
the one leading to where there is no place at all. Such a journey implies leaving the prison
of space and time and venturing beyond it, into a domain where there is no position, no
— Can you explain on the telephone what ‘right’ and ‘left’ mean, or what a mirror is?
— Can you make a telephone appointment with a friend without using any terms of time
or position, such as ‘clock’, ‘hour’, ‘place’, ‘where’, ‘when’, ‘at’, ‘near’, ‘before’, ‘after’, ‘near’,
‘upon’, ‘under’, ‘above’, ‘below’?
— Can you describe the fall of a stone without using the language of space or time?
— Do you know of any observation at all that you can describe without concepts from
the domains of ‘space’, ‘time’ or ‘object’?
* ‘The solution of the riddle of life in space and time lies outside space and time.’
** Solutions to, and comments on, challenges are either given on page 1250 or later on in the text. Challenges
are classified as research level (r), difficult (d), normal student level (n) and easy (e). Challenges for which
no solution has yet been included in the book are marked (ny).
24 an appetizer
— Can you explain what time is? And what clocks are?
— Can you imagine a finite history of the universe, but without a ‘first instant of time’?
— Can you imagine a domain of nature where matter and vacuum are indistinguishable?
— Have you ever tried to understand why motion exists?
This book explains how to achieve these and other feats, bringing to completion an an-
cient dream of the human spirit, namely the quest to describe every possible aspect of
motion.
Why do your shoelaces remain tied? They do so because space has three dimensions.
Why not another number? The question has taxed researchers for thousands of years. The
answer was only found by studying motion down to its smallest details, and by exploring
its limits.
Why do the colours of objects differ? Why does the Sun shine? Why does the Moon not
fall out of the sky? Why is the sky dark at night? Why is water liquid but fire not? Why
is the universe so big? Why is it that birds can fly but men can’t? Why is lightning not
the most amazing descriptions of nature ever imagined. We will find that space is warped,
that light does not usually travel in a straight line, and that time is not the same for every-
body. We will discover that there is a maximum force of gravity, and that gravity is not an
interaction, but rather the change of time with position. We will see how the blackness
of the sky at night proves that the universe has a finite age. We will also discover that
there is a smallest entropy in nature, which prevents us from knowing everything about
a physical system. In addition, we will discover the smallest electrical charge. These and
other strange properties and phenomena of motion are summarized in the first part of
this text, whose topic is classical physics. It leads directly to the next question.
What are things? Things are composites of particles. Not only tangible things, but all
interactions and forces – those of the muscles, those that make the Sun burn, those that
an appetizer 25
make the Earth turn, those that determine the differences between attraction, repulsion,
friction, creation and annihilation – are made of particles as well. The growth of trees,
the colours of the sky, the burning of fire, the warmth of a human body, the waves of the
sea and the mood changes of people are all composed of particles in motion. This story is
told in more detail in the second part of the text, which deals with quantum mechanics.
Here we will learn that there is a smallest change in nature. This minimum value forces
everything to keep changing. In particular, we will learn that it is impossible to completely
fill a glass of wine, that eternal life is impossible, and that light can be transformed into
matter. If you find this boring, you can read about the substantial dangers involved in
Page 979 buying a can of beans.
The first two parts of this text can be summarized with the help of a few limit principles:
In other words, each of the constants of nature k~2, c, c 4 ~4G, ħ~2 and e that appear above
is a limit value. We will discover in each case that the equations of the corresponding
domain of physics follow from this limit property. After these results, the path is prepared
for the final part of our mountain ascent.
What are particles, position and time? The recent results of an age-long search are
making it possible to start answering this question. One just needs to find a description
that explains all limit principles at the same time. This third part is not yet complete,
because the necessary research results are not yet available. Nevertheless, some of the
intermediate results are striking:
— It is known already that space and time are not continuous; that – to be precise –
neither points nor particles exist; and that there is no way to distinguish space from
Page 1020 time, nor vacuum from matter, nor matter from radiation.
— It is known already that nature is not simply made of particles and vacuum.
— It seems that position, time and particles are aspects of a complex, extended entity that
is incessantly varying in shape.
Copyright © Christoph Schiller November 1997–January 2007
— Among the mysteries that should be cleared up in the coming years are the origin of
the three dimensions of space, the origin of time and the details of the big bang.
— Current research indicates that motion is an intrinsic property of matter and radiation
and that, as soon as we introduce these two concepts in our description of nature,
motion appears automatically. Indeed, it is impossible not to introduce these concepts,
because they necessarily appear when we divide nature into parts, an act we cannot
avoid because of the mechanisms of our senses and therefore of our thinking.
— Current research also indicates that the final, completely precise, description of nature
does not use any form of infinity. We find, step by step, that all infinities appearing in
the human description of nature – both the infinitely large and the infinitely small –
result from approximations. ‘Infinity’ turns out to be merely a conceptual convenience
26 an appetizer
that has no place in nature. However, we find that the precise description does not in-
clude any finite quantities either! These and many other astonishing results of modern
physics appear in the third part of this text.
This third and final part of the text thus describes the present state of the search for a
unified theory encompassing general relativity and quantum mechanics. To achieve such
a description, the secrets of space, time, matter and forces have to be unravelled. It is a
fascinating story, assembled piece by piece by thousands of researchers. At the end of the
ascent, at the top of the mountain, the idea of motion will have undergone a complete
transformation. Without space and time, the world will look magical, incredibly simple
and fascinating: pure beauty.
Classical Physics:
How Do Things and Images Move?
W
ham! The lightning striking the tree nearby violently disrupts our quiet forest
alk and causes our hearts to suddenly beat faster. In the top of the tree
e see the fire start and fade again. The gentle wind moving the leaves around
us helps to restore the calmness of the place. Nearby, the water in a small river follows
“
Motion is everywhere: friendly and threatening, terrible and
beautiful. It is fundamental to our human existence. We
Zeno of Elea*
”
need motion for growing, for learning, for thinking and
for enjoying life. We use motion for walking through a
forest, for listening to its noises and for talking about all this.
Like all animals, we rely on motion to get food and to sur-
Could change to: vive dangers. Plants by contrast cannot move (much); for
their self-defence, they developed poisons. Examples of such
Species unable to move plants are the stinging nettle, the tobacco plant, digitalis, bel-
out of the way of ladonna and poppy; poisons include caffeine, nicotine, cur-
danger, such as plants
and sedentary creatures are and many others. Poisons such as these are at the basis of
most medicines. Therefore, most medicines exist essentially
Copyright © Christoph Schiller November 1997–January 2007
* Zeno of Elea (c. 450 bce), one of the main exponents of the Eleatic school of philosophy.
why should we care abou t motion? 29
ASTRONOMY
tom
MATERIAL SCIENCES
CHEMISTRY
part III: mt GEOSCIENCES
MEDICINE
part II: qt Motion
Mountain
Ref. 3 Motion is mysterious. Though found everywhere – in the stars, in the tides, in our
eyelids – neither the ancient thinkers nor myriads of others in the 25 centuries since then
have been able to shed light on the central mystery: what is motion? We shall discover
that the standard reply, ‘motion is the change of place in time’, is inadequate. Just recently
an answer has finally been found. This is the story of the way to find it.
Motion is a part of human experience. If we imagine human experience as an island,
then destiny, symbolized by the waves of the sea, carried us to its shore. Near the centre of
the island an especially high mountain stands out. From its top we can see over the whole
landscape and get an impression of the relationships between all human experiences, in
particular between the various examples of motion. This is a guide to the top of what I
have called Motion Mountain. The hike is one of the most beautiful adventures of the
Terms like
human mind. Clearly, the first question to ask is:
"clearly" and
"obviously" that
imply you know
the readers'
mind and
readiness
should be
avoided. There
will be readers
who do not
perceive why
this question
must come first.
30 i galilean motion • 1. why should we care abou t motion?
good pictures
F I G U R E 3 Illusions of motion: look at the figure on the left and slightly move the page, or look at the
white dot at the centre of the figure on the right and move your head back and forward
Ref. 2
acceptable
To sharpen the mind for the issue of motion’s existence, have a look at Figure 3 and follow
the instructions. In both cases the figures seem to rotate. One can experience similar
effects if one walks over Italian cobblestone in wave patterns or if one looks at the illusions
”
on the web page www.ritsumei.ac.jp/~akitaoka. How can one make sure that real motion
Challenge 3 n is different from these or other similar illusions?**
Many scholars simply argued that motion does not exist at all. Their arguments deeply
Ref. 4 influenced the investigation of motion. For example, the Greek philosopher Parmenides
(born c. 515 bce in Elea, a small town near Naples, in southern Italy) argued that since
nothing comes from nothing, change cannot exist. He underscored the permanence of
Ref. 5 nature and thus consistently maintained that all change and thus all motion is an illusion.
Heraclitus (c. 540 to c. 480 bce ) held the opposite view. He expressed it in his famous
statement πάντα ῥεῖ ‘panta rhei’ or ‘everything flows’.*** He saw change as the essence
of nature, in contrast to Parmenides. These two equally famous opinions induced many
scholars to investigate in more detail whether in nature there are conserved quantities or
whether creation is possible. We will uncover the answer later on; until then, you might
Challenge 4 n ponder which option you prefer.
Copyright © Christoph Schiller November 1997–January 2007
Parmenides’ collaborator Zeno of Elea (born c. 500 bce) argued so intensely against
motion that some people still worry about it today. In one of his arguments he claims –
in simple language – that it is impossible to slap somebody, since the hand first has to
travel halfway to the face, then travel through half the distance that remains, then again
so, and so on; the hand therefore should never reach the face. Zeno’s argument focuses
on the relation between infinity and its opposite, finitude, in the description of motion.
Ref. 6 In modern quantum theory, a similar issue troubles many scientists up to this day.
Zeno also maintained that by looking at a moving object at a single instant of time,
* The riddle does not exist. If a question can be put at all, it can be answered.
** Solutions to challenges are given either on page 1250 or later on in the text. Challenges are classified as
research level (r), difficult (d), normal student level (n) and easy (e). Challenges with no solution as yet are
why should we care abou t motion? 31
F I G U R E 4 How much water is required to make a bucket hang vertically? At what angle does the
pulled reel change direction of motion? (© Luca Gastaldi)
our brain to approximate space and time as continuous entities, and forced by our brain
to describe nature as made of different parts. If any one of these conditions were not ful-
filled, we would not observe motion; motion, then, would not exist. Each of these results
can be uncovered most efficiently if we start with the following question:
marked (ny).
*** Appendix A explains how to read Greek text.
32 i galilean motion • 1. why should we care abou t motion?
F I G U R E 5 A time line of scientific and political personalities in antiquity (the last letter of the name is
aligned with the year of death)
Like any science, the approach of physics is twofold: we advance with precision and with
Charles Baudelaire, La Beauté.*
”
curiosity. Precision makes meaningful communication possible, and curiosity makes it
worthwhile.** Whenever one talks about motion and aims for increased precision or for
more detailed knowledge, one is engaged, whether knowingly or not, in the ascent of
Motion Mountain. With every increase in the precision of the description, one gains some
height. The examples of Figure 4 make the point. When you fill a bucket with a small
amount of water, it does not hang vertically. (Why?) If you continue adding water, it starts
Copyright © Christoph Schiller November 1997–January 2007
Challenge 5 ny to hang vertically at a certain moment. How much water is necessary? When you pull a
thread from a reel in the way shown, the reel will move either forwards or backwards,
depending on the angle at which you pull. What is the limiting angle between the two
possibilities?
High precision means going into fine details. This method actually increases the pleas-
ure of the adventure.*** The higher we get on Motion Mountain, the further we can see
* Charles Baudelaire (b. 1821 Paris, d. 1867 Paris) Beauty: ‘I hate movement, which changes shapes, and
Ref. 7 never do I weep and never do I laugh.’ Beauty.
Ref. 8 ** For a collection of interesting examples of motion in everyday life, see the excellent book by Walker.
Challenge 6 n *** Distrust anybody who wants to talk you out of investigating details. He is trying to deceive you. Details
are important. Also, be vigilant also during this walk.
why should we care abou t motion? 33
motion pictures motion as therapy for cancer, diabetes, acne and de-
pression
motion perception Ref. 22 motion sickness
motion for fitness and wellness motion for meditation
motion control in sport motion ability as health check
perpetual motion motion in dance, music and other arts
motion as proof of various gods Ref. 10 motion of stars and angels Ref. 11
economic efficiency of motion the connection between motional and emotional
habits
motion as help to overcome trauma motion in psychotherapy Ref. 12
and the more our curiosity is rewarded. The views offered are breathtaking, especially
from the very top. The path we will follow – one of the many possible routes – starts from
Ref. 9 the side of biology and directly enters the forest that lies at the foot of the mountain.
Intense curiosity drives us to go straight to the limits: understanding motion requires
exploration of the largest distances, the highest velocities, the smallest particles, the
strongest forces and the strangest concepts. Let us begin.
“
Copyright © Christoph Schiller November 1997–January 2007
Antiquity
The best place to obtain a general overview on the types of motion is a large library; this
is shown in Table 1. The domains in which motion, movements and moves play a role
”
are indeed varied. Already in ancient Greece people had the suspicion that all types of
motion, as well as many other types of change, are related. It is usual to distinguish at
least three categories.
The first category of change is that of material transport, such as a person walking or
a leaf falling from a tree. Transport is the change of position or orientation of objects. To
a large extent, the behaviour of people also falls into this category.
A second category of change groups observations such as the dissolution of salt in wa-
ter, the formation of ice by freezing, the putrefaction of wood, the cooking of food, the
34 i galilean motion • 1. why should we care abou t motion?
coagulation of blood, and the melting and alloying of metals. These changes of colour,
brightness, hardness, temperature and other material properties are all transformations.
Transformations are changes not visibly connected with transport. To this category, a few
ancient thinkers added the emission and absorption of light. In the twentieth century,
these two effects were proven to be special cases of transformations, as were the newly
discovered appearance and disappearance of matter, as observed in the Sun and in radio-
activity. Mind change, such as change of mood, of health, of education and of character,
Ref. 16 is also (mostly) a type of transformation.
Ref. 17 The third and especially important category of change is growth; it is observed for
animals, plants, bacteria, crystals, mountains, stars and even galaxies. In the nineteenth
century, changes in the population of systems, biological evolution, and in the twentieth
century, changes in the size of the universe, cosmic evolution, were added to this category.
Traditionally, these phenomena were studied by separate sciences. Independently they all
arrived at the conclusion that growth is a combination of transport and transformation.
The difference is one of complexity and of time scale.
Copyright © Christoph Schiller November 1997–January 2007
At the beginnings of modern science during the Renaissance, only the study of trans-
port was seen as the topic of physics. Motion was equated to transport. The other two do-
mains were neglected by physicists. Despite this restriction, the field of enquiry remains
large, covering a large part of Experience Island. The obvious temptation is to structure
the field by distinguishing types of transport by their origin. Movements such as those
of the legs when walking are volitional, because they are controlled by one’s will, whereas
movements of external objects, such as the fall of a snowflake, which one cannot influ-
ence by will-power, are called passive. Children are able to make this distinction by about
the age of six, and this marks a central step in the development of every human towards
a precise description of the environment.* From this distinction stems the historical but
* Failure to pass this stage completely can result in a person having various strange beliefs, such as believing
why should we care abou t motion? 35
now outdated definition of physics as the science of the motion of non-living things.
Then, one day, machines appeared. From that moment, the distinction between vo-
litional and passive motion was put into question. Like living beings, machines are self-
moving and thus mimic volitional motion. However, careful observation shows that every
part in a machine is moved by another, so that their motion is in fact passive. Are living
beings also machines? Are human actions examples of passive motion as well? The ac-
cumulation of observations in the last 100 years made it clear that volitional movement*
indeed has the same physical properties as passive motion in non-living systems. (Of
course, from the emotional viewpoint, the differences are important; for example, grace
Ref. 18 can only be ascribed to volitional movements.) The distinction between the two types is
thus not necessary and is dropped in the following. Since passive and volitional motion
have the same properties, through the study of motion of non-living objects we can learn
something about the human condition. This is most evident when touching the topics of
determinism, causality, probability, infinity, time and sex, to name but a few of the themes
we will encounter on the way. Copyright © Christoph Schiller November 1997–January 2007
With the accumulation of observations in the nineteenth and twentieth centuries, even
more of the historical restrictions on the study of motion were put into question. Extens-
ive observations showed that all transformations and all growth phenomena, including
behaviour change and evolution, are also examples of transport. In other words, over
2 000 years of studies have shown that the ancient classification of observations was use-
in the ability to influence roulette balls, as found in compulsive players, or in the ability to move other bod-
ies by thought, as found in numerous otherwise healthy-looking people. An entertaining and informative
account of all the deception and self-deception involved in creating and maintaining these beliefs is given
by James R andi, The Faith Healers, Prometheus Books, 1989. A professional magician, he presents many
similar topics in several of his other books. See also his http://www.randi.org website for more details.
* The word ‘movement’ is rather modern; it was imported into English from the old French and became
popular only at the end of the eighteenth century. It is never used by Shakespeare.
36 i galilean motion • 1. why should we care abou t motion?
Good use of italic to highlight a key idea.
less: all change is transport. In the middle of the twentieth century this culminated in the
confirmation of an even more specific idea already formulated in ancient Greece: every
type of change is due to the motion of particles. It takes time and work to reach this con-
Contradiction. If
clusion, which appears only when one relentlessly pursues higher and higher precision
an idea in the description of nature. The first two parts of this adventure retrace the path to this
Challenge
is"confirmed" 7n
then result. (Do you agree with it?)
it is fact. Leave The last decade of the twentieth century changed this view completely. The particle
out the word idea turns out to be wrong. This new result, already suggested by advanced quantum
"confirmation."
theory, is reached in the third part of our adventure through a combination of careful
observation and deduction. But we still have some way to go before we reach there.
At present, at the beginning of our walk, we simply note that history has shown that
classifying the various types of motion is not productive. Only by trying to achieve max-
imum precision can we hope to arrive at the fundamental properties of motion. Precision,
not classification is the path to follow. As Ernest Rutherford said: ‘All science is either
”
Only wimps study only the general case; real
Another great
quote!
“ scientists pursue examples.
Beresford Parlett
Human beings enjoy perceiving. Perception starts before birth, and we continue enjoying
”
it for as long as we can. That is why television, even when devoid of content, is so success-
Copyright © Christoph Schiller November 1997–January 2007
ful. During our walk through the forest at the foot of Motion Mountain we cannot avoid
perceiving. Perception is first of all the ability to distinguish. We use the basic mental act
of distinguishing in almost every instant of life; for example, during childhood we first
learned to distinguish familiar from unfamiliar observations. This is possible in combin-
ation with another basic ability, namely the capacity to memorize experiences. Memory
gives us the ability to experience, to talk and thus to explore nature. Perceiving, classify-
* The importance of throwing is also seen from the terms derived from it: in Latin, words like subject or
‘thrown below’, object or ‘thrown in front’, and interjection or ‘thrown in between’; in Greek, it led to terms
like symbol or ‘thrown together’, problem or ‘thrown forward’, emblem or ‘thrown into’, and – last but not
least – devil or ‘thrown through’.
** The world is independent of my will.
why should we care abou t motion? 37
ing and memorizing together form learning. Without any one of these three abilities, we
could not study motion.
Children rapidly learn to distinguish permanence from variability. They learn to recog-
nize human faces, even though a face never looks exactly the same each time it is seen.
From recognition of faces, children extend recognition to all other observations. Recog-
nition works pretty well in everyday life; it is nice to recognize friends, even at night, and
even after many beers (not a challenge). The act of recognition thus always uses a form of
generalization. When we observe, we always have a general idea in our mind. We specify
the main ones.
All forests can remind us of the essence of perception. Sitting on the grass in a clear-
ing of the forest at the foot of Motion Mountain, surrounded by the trees and the silence
typical of such places, a feeling of calmness and tranquillity envelops us. Suddenly, some-
thing moves in the bushes; immediately our eyes turn and our attention focuses. The
nerve cells that detect motion are part of the most ancient part of our brain, shared with
* The human eye is rather good at detecting motion. For example, the eye can detect motion of a point of
light even if the change of angle is smaller than that which can be distinguished in a fixed image. Details of
Ref. 22 this and similar topics for the other senses are the domain of perception research.
** The topic of motion perception is full of interesting aspects. An excellent introduction is chapter 6 of the
beautiful text by Donald D. Hoffman, Visual Intelligence – How We Create What We See, W.W. Norton
& Co., 1998. His collection of basic motion illusions can be experienced and explored on the associated
http://aris.ss.uci.edu/cogsci/personnel/hoffman/hoffman.html website.
38 i galilean motion • 1. why should we care abou t motion?
motion
the basic type of change
in relation and in opposition to the environment. The concept of an object is therefore also
a relative concept. But the basic conceptual distinction between localized, isolable objects
and the extended environment is not trivial or unimportant. First, it has the appearance
Challenge 8 n of a circular definition. (Do you agree?) This issue will keep us very busy later on. Second,
we are so used to our ability of isolating local systems from the environment that we take
it for granted. However, as we will see in the third part of our walk, this distinction turns
Page 1039 out to be logically and experimentally impossible!* Our walk will lead us to discover the
Copyright © Christoph Schiller November 1997–January 2007
reason for this impossibility and its important consequences. Finally, apart from moving
entities and the permanent background, we need a third concept, as shown in Table 2.
Wisdom is one thing: to understand the thought
Ref. 23
“ which steers all things through all things.
Heraclitus of Ephesus
”
* Contrary to what is often read in popular literature, the distinction is possible in quantum theory. It be-
comes impossible only when quantum theory is unified with general relativity.
why should we care abou t motion? 39
What distinguishes the various patterns in the lower left corners of this text? In everyday
life we would say: the situation or configuration of the involved entities. The situation
somehow describes all those aspects that can differ from case to case. It is customary to
”
call the list of all variable aspects of a set of objects their (physical) state of motion, or
simply their state.
The situations in the lower left corners differ first of all in time. Time is what makes
opposites possible: a child is in a house and the same child is outside the house. Time
describes and resolves this type of contradiction. But the state not only distinguishes situ-
* Objects, the unalterable, and the subsistent are one and the same. Objects are what is unalterable and
subsistent; their configuration is what is changing and unstable.
** A physical system is a localized entity of investigation. In the classification of Table 2, the term ‘physical
system’ is (almost) the same as ‘object’ or ‘physical body’. Images are usually not counted as physical systems
Challenge 9 n (though radiation is one). Are holes physical systems?
*** The exact separation between those aspects belonging to the object and those belonging to the state
depends on the precision of observation. For example, the length of a piece of wood is not permanent; wood
shrinks and bends with time, due to processes at the molecular level. To be precise, the length of a piece of
wood is not an aspect of the object, but an aspect of its state. Precise observations thus shift the distinction
between the object and its state; the distinction itself does not disappear – at least not for quite while.
40 i galilean motion • 1. why should we care abou t motion?
**
Challenge 11 n Is the motion of a ghost an example of motion?
**
A man climbs a mountain from 9 a.m. to 1 p.m. He sleeps on the top and comes down
the next day, taking again from 9 am to 1 pm for the descent. Is there a place on the path
Challenge 12 n that he passes at the same time on the two days?
**
Challenge 13 n Can something stop moving? How would you show it?
**
Challenge 14 n Does a body moving in straight line for ever show that nature is infinite?
**
Copyright © Christoph Schiller November 1997–January 2007
* Sections entitled ‘curiosities’ are collections of topics and problems that allow one to check and to expand
the usage of concepts already introduced.
why should we care abou t motion? 41
Challenge 18 n What is the lowest speed you have observed? Is there a lowest speed in nature?
**
According to legend, Sessa ben Zahir, the Indian inventor of the game of chess, demanded
from King Shirham the following reward for his invention: he wanted one grain of rice
for the first square, two for the second, four for the third, eight for the fourth, and so on.
Challenge 19 n How much time would all the rice fields of the world take to produce the necessary rice?
**
When a burning candle is moved, the flame lags behind the candle. How does the flame
Challenge 20 n behave if the candle is inside a glass, still burning, and the glass is accelerated?
**
A good way to make money is to build motion detectors. A motion detector is a small box
**
When a block is rolled over the floor over a set of cylinders, how are the speed of the
Challenge 25 n block and that of the cylinders related?
Copyright © Christoph Schiller November 1997–January 2007
**
Ref. 16 Do you dislike formulae? If you do, use the following three-minute method to change
Challenge 26 n the situation. It is worth trying it, as it will make you enjoy this book much more. Life is
short; as much of it as possible, like reading this text, should be a pleasure.
1 - Close your eyes and recall an experience that was absolutely marvellous, a situation
when you felt excited, curious and positive.
2 - Open your eyes for a second or two and look at page 329 – or any other page that
contains many formulae.
3 - Then close your eyes again and return to your marvellous experience.
4 - Repeat the observation of the formulae and the visualization of your memory –
steps 2 and 3 – three more times.
Then leave the memory, look around yourself to get back into the here and now, and test
42 i galilean motion • 2. galilean physics – motion in everyday life
yourself. Look again at page 329. How do you feel about formulae now?
**
In the sixteenth century, Niccolò Tartaglia* proposed the following problem. Three young
couples want to cross a river. Only a small boat that can carry two people is available. The
men are extremely jealous, and would never leave their brides with another man. How
Challenge 27 n many journeys across the river are necessary?
**
Cylinders can be used to roll a flat object over the floor; they keep the object plane always
at the same distance from the floor. What cross-sections other than circular, so-called
curves of constant width, allow you to realize the same feat? How many examples can you
Challenge 28 n find?
“ Menschen.**
Georg Christoph Lichtenberg
The simplest description of motion is the one we all, like cats or monkeys, use uncon- ”
sciously in everyday life: only one thing can be at a given spot at a given time. This general
description can be separated into three assumptions: matter is impenetrable and moves,
time is made of instants, and space is made of points. Without these three assumptions (do
Challenge 29 n you agree with them?) it is not possible to define velocity in everyday life. This description
of nature is called Galilean or Newtonian physics.
Galileo Galilei (1564–1642), Tuscan professor of mathematics,
was a founder of modern physics and is famous for advocating the
importance of observations as checks of statements about nature. By
requiring and performing these checks throughout his life, he was
led to continuously increase the accuracy in the description of mo-
tion. For example, Galileo studied motion by measuring change of
position with a self-constructed stopwatch. His approach changed
the speculative description of ancient Greece into the experimental
Copyright © Christoph Schiller November 1997–January 2007
Ve l o c i t i e s Physical M at h e m at i c a l Definition
can propert y name
Be distinguished distinguishability element of set Page 662
Change gradually continuum real vector space Page 71, Page 1230
Point somewhere direction vector space, dimensionality Page 71
Be compared measurability metricity Page 1221
Be added additivity vector space Page 71
Have defined angles direction Euclidean vector space Page 71
Exceed any limit infinity unboundedness Page 663
“ Jochen Rindt**
Velocity fascinates. To physicists, not only car races are interesting, but any moving entity ”
is. Therefore they first measure as many examples as possible. A selection is given in
Table 4. The units and prefixes used are explained in detail in Appendix B.
Everyday life teaches us a lot about motion: objects can overtake each other, and they
can move in different directions. We also observe that velocities can be added or changed
smoothly. The precise list of these properties, as given in Table 3, is summarized by math-
ematicians in a special term; they say that velocities form a Euclidean vector space.***
Page 71 More details about this strange term will be given shortly. For now we just note that in
describing nature, mathematical concepts offer the most accurate vehicle.
When velocity is assumed to be an Euclidean vector, it is called Galilean velocity. Ve-
locity is a profound concept. For example, velocity does not need space and time meas-
urements to be defined. Are you able to find a means of measuring velocities without
Challenge 30 d measuring space and time? If so, you probably want to skip to page 283, jumping 2000
years of enquiries. If you cannot do so, consider this: whenever we measure a quantity we Copyright © Christoph Schiller November 1997–January 2007
* Newton was born a year after Galileo died. Newton’s other hobby, as master of the Mint, was to supervise
Ref. 24 personally the hanging of counterfeiters. About Newton’s infatuation with alchemy, see the books by Dobbs.
Among others, Newton believed himself to be chosen by god; he took his Latin name, Isaacus Neuutonus,
and formed the anagram Jeova sanctus unus. About Newton and his importance for classical mechanics, see
Ref. 25 the text by Clifford Truesdell.
** Jochen Rindt (1942–1970), famous Austrian Formula One racing car driver, speaking about speed.
*** It is named after Euclid, or Eukleides, the great Greek mathematician who lived in Alexandria around
300 bce. Euclid wrote a monumental treatise of geometry, the Στοιχεῖα or Elements, which is one of the
milestones of human thought. The text presents the whole knowledge on geometry of that time. For the first
time, Euclid introduces two approaches that are now in common use: all statements are deduced from a
small number of basic ‘axioms’ and for every statement a ‘proof ’ is given. The book, still in print today, has
been the reference geometry text for over 2000 years. On the web, it can be found at http://aleph0.clarku.
edu/~djoyce/java/elements/elements.html.
44 i galilean motion • 2. galilean physics – motion in everyday life
assume that everybody is able to do so, and that everybody will get the same result. In
other words, we define measurement as a comparison with a standard. We thus implicitly
assume that such a standard exists, i.e. that an example of a ‘perfect’ velocity can be found.
Historically, the study of motion did not investigate this question first, because for many
centuries nobody could find such a standard velocity. You are thus in good company.
Some researchers have specialized in the study of the lowest velocities found in nature:
Ref. 29 they are called geologists. Do not miss the opportunity to walk across a landscape while
listening to one of them.
Velocity is a profound subject for a second reason: we will discover that all seven prop-
erties of Table 3 are only approximate; none is actually correct. Improved experiments will
uncover limits in every property of Galilean velocity. The failure of the last three proper-
ties of Table 3 will lead us to special and general relativity, the failure of the middle two
to quantum theory and the failure of the first two properties to the unified description of
nature. But for now, we’ll stick with Galilean velocity, and continue with another Galilean
What is time?
Time does not exist in itself, but only through
”
“ the perceived objects, from which the concepts
of past, of present and of future ensue.
Lucretius,** De rerum natura, lib. 1, v. 460 ss.
Ref. 19
In their first years of life, children spend a lot of time throwing objects around. The term
‘object’ is a Latin word meaning ‘that which has been thrown in front.’ Developmental
psychology has shown experimentally that from this very experience children extract
”
the concepts of time and space. Adult physicists do the same when studying motion at
university.
When we throw a stone through the air, we can define a sequence of observations. Our
memory and our senses give us this ability. The sense of hearing registers the various
sounds during the rise, the fall and the landing of the stone. Our eyes track the location
Copyright © Christoph Schiller November 1997–January 2007
of the stone from one point to the next. All observations have their place in a sequence,
with some observations preceding them, some observations simultaneous to them, and
still others succeeding them. We say that observations are perceived to happen at various
instants and we call the sequence of all instants time.
An observation that is considered the smallest part of a sequence, i.e. not itself a se-
quence, is called an event. Events are central to the definition of time; in particular, start-
Challenge 32 n ing or stopping a stopwatch are events. (But do events really exist? Keep this question in
the back of your head as we move on.)
O b s e r va t i o n Ve l o c i t y
O b s e r va t i o n Ti m e
ephemeris time (or one of its successors). An observer who uses his personal watch calls
the reading his proper time; it is often used in the theory of relativity.
Not every movement is a good standard for time. In the year 2000 an Earth rotation
Page 1177 did not take 86 400 seconds any more, as it did in the year 1900, but 86 400.002 seconds.
Can you deduce in which year your birthday will have shifted by a whole day from the
Challenge 34 n time predicted with 86 400 seconds?
All methods for the definition of time are thus based on comparisons of motions. In
order to make the concept as precise and as useful as possible, a standard reference motion
is chosen, and with it a standard sequence and a standard duration is defined. The device
that performs this task is called a clock. We can thus answer the question of the section
title: time is what we read from a clock. Note that all definitions of time used in the various
branches of physics are equivalent to this one; no ‘deeper’ or more fundamental definition
is possible.* Note that the word ‘moment’ is indeed derived from the word ‘movement’.
Language follows physics in this case. Astonishingly, the definition of time just given is
observe that events succeed each other smoothly, apparently without end. In this context,
‘smoothly’ means that observations that are not too distant tend to be not too different.
Yet between two instants, as close as we can observe them, there is always room for other
events. Durations, or time intervals, measured by different people with different clocks
ships, aeroplanes and lorries. For more information, see the http://www.gpsworld.com website. The time-
Challenge 33 n keeping infrastructure is also important for other parts of the modern economy. Can you spot the most
important ones?
* The oldest clocks are sundials. The science of making them is called gnomonics. An excellent and complete
introduction into this somewhat strange world can be found at the http://www.sundials.co.uk website.
Page 848 ** The brain contains numerous clocks. The most precise clock for short time intervals, the internal interval
timer, is more accurate than often imagined, especially when trained. For time periods between a few tenths
Ref. 30 of a second, as necessary for music, and a few minutes, humans can achieve accuracies of a few per cent.
48 i galilean motion • 2. galilean physics – motion in everyday life
I n s ta n t s o f t i m e Physical M at h e m at i c a l Definition
propert y name
“ ”
Challenge 37 n
Copyright © Christoph Schiller November 1997–January 2007
All rotational motions in our society, such as athletic races, horse, bicycle or ice skat-
ing races, turn anticlockwise. Likewise, every supermarket leads its guests anticlockwise
through the hall. Mathematicians call this the positive rotation sense. Why? Most people
are right-handed, and the right hand has more freedom at the outside of a circle. There-
fore thousands of years ago chariot races in stadia went anticlockwise. As a result, all
races still do so to this day. That is why runners move anticlockwise. For the same reason,
helical stairs in castles are built in such a way that defending right-handers, usually from
above, have that hand on the outside.
On the other hand, the clock imitates the shadow of sundials; obviously, this is true
on the northern hemisphere only, and only for sundials on the ground, which were the
galilean physics – motion in everyday life 49
most common ones. (The old trick to determine south by pointing the hour hand of an
horizontal watch to the Sun and halving the angle between it and the direction of 12 o’clock
does not work on the southern hemisphere.) So every clock implicitly continues to state
on which hemisphere it was invented. In addition, it also tells us that sundials on walls
came in use much later than those on the floor.
The expression ‘the flow of time’ is often used to convey that in nature change follows
after change, in a steady and continuous manner. But though the hands of a clock ‘flow’,
”
What is space?
The introduction of numbers as coordinates [...]
“
Copyright © Christoph Schiller November 1997–January 2007
Whenever we distinguish two objects from each other, such as two stars, we first of all dis-
tinguish their positions. Distinguishing positions is the main ability of our sense of sight.
Position is therefore an important aspect of the physical state of an object. A position is
”
* We cannot compare a process with ‘the passage of time’ – there is no such thing – but only with another
process (such as the working of a chronometer).
** Hermann Weyl (1885–1955) was one of the most important mathematicians of his time, as well as an
important theoretical physicist. He was one of the last universalists in both fields, a contributor to quantum
theory and relativity, father of the term ‘gauge’ theory, and author of many popular texts.
50 i galilean motion • 2. galilean physics – motion in everyday life
taken by only one object at a time. Positions are limited. The set of all available positions,
called (physical) space, acts as both a container and a background.
Closely related to space and position is size, the set of positions an objects occupies.
Small objects occupy only subsets of the positions occupied by large ones. We will discuss
size shortly.
How do we deduce space from observations? During childhood, humans (and most
higher animals) learn to bring together the various perceptions of space, namely the
visual, the tactile, the auditory, the kinesthetic, the vestibular etc., into one coherent set
of experiences and description. The result of this learning process is a certain ‘image’ of
space in the brain. Indeed, the question ‘where?’ can be asked and answered in all lan-
guages of the world. Being more precise, adults derive space from distance measurements.
The concepts of length, area, volume, angle and solid angle are all deduced with their help.
Geometers, surveyors, architects, astronomers, carpet salesmen and producers of metre
sticks base their trade on distance measurements. Space is a concept formed to summar-
three dimensions? This is probably the most difficult question of physics; it will be
answered only in the very last part of our walk.
Challenge 41 n It is often said that thinking in four dimensions is impossible. That is wrong. Just try.
For example, can you confirm that in four dimensions knots are impossible?
Like time intervals, length intervals can be described most precisely with the help of
real numbers. In order to simplify communication, standard units are used, so that every-
body uses the same numbers for the same length. Units allow us to explore the general
properties of Galilean space experimentally: space, the container of objects, is continuous,
three-dimensional, isotropic, homogeneous, infinite, Euclidean and unique or ‘absolute’.
In mathematics, a structure or mathematical concept with all the properties just men-
tioned is called a three-dimensional Euclidean space. Its elements, (mathematical) points,
are described by three real parameters. They are usually written as
and are called coordinates. They specify and order the location of a point in space. (For
the precise definition of Euclidean spaces, see page 71.)
What is described here in just half a page actually took 2000 years to be worked out,
mainly because the concepts of ‘real number’ and ‘coordinate’ had to be discovered first.
The first person to describe points of space in this way was the famous mathematician and
philosopher René Descartes*, after whom the coordinates of expression (1) are named
Cartesian.
* René Descartes or Cartesius (1596–1650), French mathematician and philosopher, author of the famous
statement ‘je pense, donc je suis’, which he translated into ‘cogito ergo sum’ – I think therefore I am. In his
view this is the only statement one can be sure of.
52 i galilean motion • 2. galilean physics – motion in everyday life
O b s e r va t i o n D i s ta n c e
“
Μέτρον ἄριστον.*
Cleobulus
(Richardson found other numbers for other coasts.) The number l 0 is the length at scale
Copyright © Christoph Schiller November 1997–January 2007
1 : 1. The main result is that the larger the map, the longer the coastline. What would hap-
pen if the scale of the map were increased even beyond the size of the original? The length
would increase beyond all bounds. Can a coastline really have infinite length? Yes, it can.
In fact, mathematicians have described many such curves; they are called fractals. An
infinite number of them exist, and Figure 12 shows one example.*** Can you construct
* ‘Measure is the best (thing).’ Cleobulus (Κλεοβουλος) of Lindos, (c. 620–550 BCE ) was another of the
proverbial seven sages.
** Lewis Fray Richardson (1881–1953), English physicist and psychologist.
*** Most of these curves are self-similar, i.e. they follow scaling ‘laws’ similar to the above-mentioned. The
term ‘fractal’ is due to the Polish mathematician Benoît Mandelbrot and refers to a strange property: in a
certain sense, they have a non-integral number D of dimensions, despite being one-dimensional by con-
54 i galilean motion • 2. galilean physics – motion in everyday life
F I G U R E 12 A fractal: a self-similar curve of infinite length (far right), and its construction
Challenge 44 e another?
Length has other strange properties. The Italian mathematician Giuseppe Vitali was
the first to discover that it is possible to cut a line segment of length 1 into pieces that
can be reassembled – merely by shifting them in the direction of the segment – into a
line segment of length 2. Are you able to find such a division using the hint that it is only
Challenge 45 d possible using infinitely many pieces?
To sum up, length is well defined for lines that are straight or nicely curved, but not
for intricate lines, or for lines made of infinitely many pieces. We therefore avoid fractals
and other strangely shaped curves in the following, and we take special care when we
needed. They were discovered in 1923 by the famous mathematician Stefan Banach.* He
proved that one can indeed define an area for any set of points whatsoever, even if the bor-
der is not nicely curved but extremely complicated, such as the fractal curve previously
mentioned. Today this generalized concept of area, technically a ‘finitely additive isomet-
rically invariant measure,’ is called a Banach measure in his honour. Mathematicians sum
struction. Mandelbrot saw that the non-integer dimension was related to the exponent e of Richardson by
D = 1 + e, thus giving D = 1.25 in the example above.
Coastlines and other fractals are beautifully presented in Heinz-Otto Peitgen, Hartmu t Jür-
gens & Dietmar Saupe, Fractals for the Classroom, Springer Verlag, 1992, pp. 232–245. It is also available
in several other languages.
* Stefan Banach (Krakow, 1892–Lvov, 1945), important Polish mathematician.
galilean physics – motion in everyday life 55
up this discussion by saying that since in two dimensions there is a Banach measure, there
is a way to define the concept of area – an additive and rigid measure – for any set of points
whatsoever.*
What is the situation in three dimensions, i.e. for
volume? We can start in the same way as for area,
by defining the volume V of a rectangular polyhed-
ron with sides a, b, c as V = abc. But then we en-
counter a first problem: a general polyhedron cannot
be cut into a cube by straight cuts! The limitation
was discovered in 1900 and 1902 by Max Dehn.** He dihedral
angle
found that the possibility depends on the values of
the edge angles, or dihedral angles, as the mathem-
aticians call them. If one ascribes to every edge of a
general polyhedron a number given by its length l
Size does not count!***** Volume is thus not a useful concept at all.
* Actually, this is true only for sets on the plane. For curved surfaces, such as the surface of a sphere, there
are complications that will not be discussed here. In addition, the problems mentioned in the definition of
length of fractals also reappear for area if the surface to be measured is not flat but full of hills and valleys.
A typical example is the area of the human lung: depending on the level of details examined, one finds area
values from a few up to over a hundred square metres.
** Max Dehn (1878–1952), German mathematician, student of David Hilbert.
*** This is also told in the beautiful book by M. Aigler & G.M. Z iegler, Proofs from the Book, Springer
Verlag, 1999. The title is due to the famous habit of the great mathematician Paul Erdös to imagine that all
beautiful mathematical proofs can be assembled in the ‘book of proofs’.
**** Alfred Tarski (b. 1902 Warsaw, d. 1983 Berkeley), Polish mathematician.
***** The proof of the result does not need much mathematics; it is explained beautifully by Ian Stewart in
56 i galilean motion • 2. galilean physics – motion in everyday life
F I G U R E 14 Straight lines found in nature: cerussite (picture width approx. 3 mm, © Stephan Wolfsried)
and selenite (picture width approx. 10 m © Arch. Speleoresearch&Films/La Venta at http://www.laventa.
it and http://www.naica.com.mx)
The Banach–Tarski theorem raises two questions: first, can the result be applied to gold
What is straight?
When you see a solid object with a straight edge, it is a 99 %-safe bet that it is man-made.
Of course, there are exceptions, as shown in Figure 14.* The largest crystals ever found are
Ref. 38 18 m in length. But in general, the contrast between the objects seen in a city – buildings,
furniture, cars, electricity poles, boxes, books – and the objects seen in a forest – trees,
plants, stones, clouds – is evident: in the forest nothing is straight or flat, in the city most
objects are. How is it possible for humans to produce straight objects while there are
almost none to be found in nature?
Copyright © Christoph Schiller November 1997–January 2007
Page 1252 Any forest teaches us the origin of straightness; it presents tall tree trunks and rays
of daylight entering from above through the leaves. For this reason we call a line straight
if it touches either a plumb-line or a light ray along its whole length. In fact, the two
Challenge 50 n definitions are equivalent. Can you confirm this? Can you find another definition? Obvi-
ously, we call a surface flat if for any chosen orientation and position the surface touches
Paradox of the spheres, New Scientist, 14 January 1995, pp. 28–31. The Banach–Tarski paradox also exists in
four dimensions, as it does in any higher dimension. More mathematical detail can be found in the beautiful
Ref. 36 book by Stan Wagon.
* Another famous exception, unrelated to atomic structures, is the well-known Irish geological formation
called the Giant’s Causeway. Other candidates that might come to mind, such as certain bacteria which have
Ref. 37 (almost) square or (almost) triangular shapes are not counter-examples, as the shapes are only approximate.
galilean physics – motion in everyday life 57
A hollow Earth?
Space and straightness pose subtle challenges. Some strange people maintain that all hu-
mans live on the inside of a sphere; they (usually) call this the hollow Earth theory. They
claim that the Moon, the Sun and the stars are all near the centre of the hollow sphere.
They also explain that light follows curved paths in the sky and that when conventional
Copyright © Christoph Schiller November 1997–January 2007
physicists talk about a distance r from the centre of the Earth, the real hollow Earth dis-
Challenge 51 n tance is r he = R Earth
2
~r. Can you show that this model is wrong? Roman Sexl* used to ask
this question to his students and fellow physicists. The answer is simple: if you think you
have an argument to show that this view is wrong, you are mistaken! There is no way of
showing that such a view is wrong. It is possible to explain the horizon, the appearance
Challenge 52 e of day and night, as well as the satellite photographs of the round Earth, such as Figure 15.
To explain what happened during a flight to the Moon is also fun. A coherent hollow
Earth view is fully equivalent to the usual picture of an infinitely extended space. We will
Page 503 come back to this problem in the section on general relativity.
* Roman Sexl, (1939–1986), important Austrian physicist, author of several influential textbooks on gravit-
ation and relativity.
58 i galilean motion • 2. galilean physics – motion in everyday life
**
Imagine a black spot on a white surface. What is the colour of the line separating the spot
Challenge 54 n from the background? This question is often called Peirce’s puzzle.
Copyright © Christoph Schiller November 1997–January 2007
**
Also bread is an (approximate) irregular fractal. The fractal dimension of bread is around
Challenge 55 n 2.7. Try to measure it.
**
Challenge 56 n How often in 24 hours do the hour and minute hands of a clock lie on top of each other?
For clocks that also have a second hand, how often do all three hands lie on top of each
other?
**
How many times in twelve hours can the two hands of a clock be exchanged with the
Challenge 57 n result that the new situation shows a valid time? What happens for clocks that also have
galilean physics – motion in everyday life 59
F I G U R E 17 A
vernier/nonius/clavius
d b
**
How does a vernier work? It is called nonius in other languages. The first name is derived
from a French military engineer* who did not invent it, the second is a play of words
on the Latinized name of the Portuguese inventor of a more elaborate device** and the
Latin word for ‘nine’. In fact, the device as we know it today – shown in Figure 17 – was
**
Fractals in three dimensions bear many surprises. Take a regular tetrahedron; then glue
on every one of its triangular faces a smaller regular tetrahedron, so that the surface of
the body is again made up of many equal regular triangles. Repeat the process, gluing
still smaller tetrahedrons to these new (more numerous) triangular surfaces. What is the
Challenge 69 n shape of the final fractal, after an infinite number of steps?
**
Motoring poses many mathematical problems. A central one is the following parking
issue: what is the shortest distance d from the car in front necessary to leave a parking
Challenge 70 n spot without using reverse gear? (Assume that you know the geometry of your car, as
shown in Figure 18, and its smallest outer turning radius R, which is known for every
car.) Next question: what is the smallest gap required when you are allowed to manoeuvre
Challenge 71 n back and forward as often as you like? Now a problem to which no solution seems to be
available in the literature: How does the gap depend on the number, n, of times you use
Copyright © Christoph Schiller November 1997–January 2007
Challenge 72 n reverse gear? (The author offers 50 euro for the first well-explained solution sent to him.)
**
Scientists use a special way to write large and small numbers, explained in Table 9.
Ref. 39 In 1996 the smallest experimentally probed distance was 10−19 m, achieved between
quarks at Fermilab. (To savour the distance value, write it down without the exponent.)
Challenge 73 n What does this measurement mean for the continuity of space?
1 100
0.1 10−1 10 101
0.2 2 ë 10−1 20 2 ë 101
0.324 3.24 ë 10−1 32.4 3.24 ë 101
0.01 10−2 100 102
0.001 10−3 1000 103
0.000 1 10−4 10 000 104
0.000 01 10−5 etc. 100 000 105 etc.
**
Zeno reflected on what happens to a moving object at a given instant of time. To discuss
with him, you decide to build the fastest possible shutter for a photographic camera that
you can imagine. You have all the money you want. What is the shortest shutter time you
Challenge 74 n would achieve?
**
Can you prove Pythagoras’ theorem by geometrical means alone, without using Copyright © Christoph Schiller November 1997–January 2007
sky yks
horizon noziroh
earth htrae
**
The definition of angle helps to determine the size of a firework display. Measure the time
T, in seconds, between the moment that you see the rocket explode in the sky and the
moment you hear the explosion, measure the (plane) angle α of the ball with your hand.
The diameter D is
D 6 s~° T α . (3)
Challenge 79 e Why? For more about fireworks, see the http://cc.oulu.fi/~kempmp website. By the way,
the angular distance between the knuckles of an extended fist are about 3°, 2° and 3°, the
Challenge 80 n size of an extended hand 20°. Can you determine the other angles related to your hand?
Copyright © Christoph Schiller November 1997–January 2007
**
Measuring angular size with the eye only is tricky. For example, can you say whether the
Challenge 81 e Moon is larger or smaller than the nail of your thumb at the end of your extended arm?
Angular size is not an intuitive quantity; it requires measurement instruments.
A famous example, shown in Figure 20, illustrates the difficulty of estimating angles.
Both the Sun and the Moon seem larger when they are on the horizon. In ancient times,
Ptolemy explained this illusion by an unconscious apparent distance change induced by
the human brain. In fact, the Moon is even further away from the observer when it is just
above the horizon, and thus its image is smaller than it was a few hours earlier, when it
Challenge 82 n was high in the sky. Can you confirm this?
In fact, the Moon’s size changes much more due to another effect: the orbit of the Moon
is elliptical. An example of this is shown in Figure 21.
galilean physics – motion in everyday life 63
wall
blue ladder
of length b
ladder of red ladder
length l of length r
wall wall
height
h?
square
box of
side b height h
distance d ?
F I G U R E 22 Two ladder puzzles: a moderately difficult (left) and a difficult one (right)
**
Galileo also made mistakes. In his famous book, the Dialogues, he says that the curve
formed by a thin chain hanging between two nails is a parabola, i.e. the curve defined
Challenge 83 d by y = x 2 . That is not correct. What is the correct curve? You can observe the shape
(approximately) in the shape of suspension bridges.
**
Draw three circles, of different sizes, that touch each other. Now draw a fourth circle in
the space between, touching the outer three. What simple relation do the inverse radii of
Challenge 84 n the four circles obey?
**
64 i galilean motion • 2. galilean physics – motion in everyday life
**
Challenge 93 n Could time be two-dimensional?
Experiments show that the properties of Galilean time and space are extracted from the
environment by most higher animals and by young children. Later, when children learn
to speak, they put these experiences into concepts, as was just done above. With the help
”
of these concepts, grown-up children then say that motion is change of position with time.
This description is illustrated by rapidly flipping the lower left corners of this book, start-
where t 0 is the time the fall starts, z 0 is the initial height, v 0 is the initial velocity in the
vertical direction and д = 9.8 m~s2 is a constant that is found to be the same, within about
one part in 300, for all falling bodies on all points of the surface of the Earth. Where do the
Ref. 42 value 9.8 m~s2 and its slight variations come from? A preliminary answer will be given
shortly, but the complete elucidation will occupy us during the larger part of this hike.
* Science is written in this huge book that is continuously open before our eyes (I mean the universe) ... It is
written in mathematical language.
** On the world of fireworks, see the frequently asked questions list of the usenet group rec.pyrotechnics, or
search the web. A simple introduction is the article by J.A. Conkling, Pyrotechnics, Scientific American
pp. 66–73, July 1990.
66 i galilean motion • 2. galilean physics – motion in everyday life
collision
F I G U R E 24 Two ways to test that the time of free fall does not depend on horizontal velocity
Equation (4) allows us to determine the depth of a well, given the time a stone takes
Challenge 94 n to reach its bottom. The equation also»gives the speed v with which one hits the ground
after jumping from a tree, namely v = 2дh . A height of 3 m yields a velocity of 27 km~h.
x(t) = x 0 + v x0 (t − t 0 )
y(t) = y 0 + v y0 (t − t 0 ) , (5)
Copyright © Christoph Schiller November 1997–January 2007
a complete description for the path followed by thrown stones results. A path of this shape
is called a parabola; it is shown in Figures 9, 24 and 25. A parabolic shape is also used for
Challenge 97 n light reflectors inside pocket lamps or car headlights. Can you show why?
Ref. 44 Physicists enjoy generalizing the idea of a path. As Figure 25 shows, a path is a trace left
in a diagram by a moving object. Depending on what diagram is used, these paths have
different names. Hodographs are used in weather forecasting. Space-time diagrams are
useful to make the theory of relativity accessible. The configuration space is spanned by
the coordinates of all particles of a system. For many particles, it is has a high number of
dimensions. It plays an important role in self-organization. The difference between chaos
and order can be described as a difference in the properties of paths in configuration
space. The phase space diagram is also called state space diagram. It plays an essential
role in thermodynamics.
how to describe motion – kinematics 67
z z vz mv z
x t vx z
x mv x
t x
F I G U R E 26 Three
superimposed images of a frass
pellet shot away by a caterpillar
(© Stanley Caveney)
Numerous species of moth and butterfly caterpillars shoot away their frass – to put it
Ref. 45 more crudely: their shit – so that its smell does not help predators to locate them. Stanley
Caveney and his team took photographs of this process. Figure 26 shows a caterpillar
(yellow) of the skipper Calpodes ethlius inside a rolled up green leaf caught in the act.
Given that the record distance observed is 1.5 m (though by another species, Epargyreus
Challenge 98 n clarus), what is the ejection speed? How do caterpillars achieve it?
**
What is the horizontal distance one can reach with a stone, given the speed and the angle
Challenge 99 n from the horizontal at which it is thrown?
**
68 i galilean motion • 2. galilean physics – motion in everyday life
y
derivative:
dy/dt
slope: Δ y
Δ y/Δ t
Δ t
t
F I G U R E 27 Derivatives
Challenge 100 n How can the speed of falling rain be measured using an umbrella? This method can also
Page 284 be used to measure the speed of light.
The last two issues arise because equation (4) does not hold in all cases. For example,
leaves or potato crisps do not follow it. As Galileo already knew, this is a consequence of
air resistance; we will discuss it shortly. In fact, even without air resistance, the path of a
Challenge 105 n stone is not always a parabola; can you find such a situation?
What is rest?
In the Galilean description of nature, motion and rest are opposites. In other words, a
body is at rest when its position, i.e. its coordinates, do not change with time. In other
words, (Galilean) rest is defined as
Later we will see that this definition, contrary to first impressions, is not much use and
will have to be modified. The definition of rest implies that non-resting objects can be
distinguished by comparing the rapidity of their displacement. One thus can define the
velocity v of an object as the change of its position x with time t. This is usually written as
v=
dx
. (7)
dt
In this expression, valid for each coordinate separately, d~dt means ‘change with time’;
one can thus say that velocity is the derivative of position with respect to time. The speed
v is the name given to the magnitude of the velocity v. Derivatives are written as fractions
in order to remind the reader that they are derived from the idea of slope. The expression
dy ∆y
is meant as an abbreviation of lim , (8)
a shorthand for saying that the derivative at a point is the limit of the slopes in the neigh-
Challenge 106 e bourhood of the point, as shown in Figure 27. This definition implies the working rules
d(y + z) dy dz d dy d2 y
= + =c = =
d(c y) dy d(yz) dy dz
, , , z+y ,(9)
dt dt dt dt dt dt dt dt 2 dt dt dt
c being any number. This is all one ever needs to know about derivatives. The quantities
dt and dy, sometimes useful by themselves, are called differentials. These concepts are due
to Gottfried Wilhelm Leibniz.* Derivatives lie at the basis of all calculations based on the
continuity of space and time. Leibniz was the person who made it possible to describe
and use velocity in physical formulae and, in particular, to use the idea of velocity at a
given point in time or space for calculations.
The definition of velocity assumes that it makes sense to take
the limit ∆t 0. In other words, it is assumed that infinitely small
time intervals do exist in nature. The definition of velocity with
derivatives is possible only because both space and time are de-
scribed by sets which are continuous, or in mathematical language,
connected and complete. In the rest of our walk we shall not forget
that from the beginning of classical physics, infinities are present
Copyright © Christoph Schiller November 1997–January 2007
* Gottfried Wilhelm Leibniz (b. 1646 Leipzig, d. 1716 Hannover), Saxon lawyer, physicist, mathematician,
philosopher, diplomat and historian. He was one of the great minds of mankind; he invented the differ-
ential calculus (before Newton) and published many successful books in the various fields he explored,
among them De arte combinatoria, Hypothesis physica nova, Discours de métaphysique, Nouveaux essais sur
l’entendement humain, the Théodicée and the Monadologia.
70 i galilean motion • 2. galilean physics – motion in everyday life
dv d2 x
a= = . (10)
dt dt 2
Acceleration is what we feel when the Earth trembles, an aeroplane takes off, or a bicycle
goes round a corner. More examples are given in Table 10. Like velocity, acceleration has
both a magnitude and a direction, properties indicated by the use of bold letters for their
abbreviations. *
Higher derivatives than acceleration can also be defined in the same manner. They
Challenge 110 n add little to the description of nature, because as we will show shortly neither these nor
even acceleration itself are useful for the description of the state of motion of a system.
One aim of the study of motion is to find a complete and precise description of both states
and objects. With the help of the concept of space, the description of objects can be refined
considerably. In particular, one knows from experience that all objects seen in daily life
”
have an important property: they can be divided into parts. Often this observation is
* Such physical quantities are called vectors. In more precise, mathematical language, a vector is an element
of a set, called vector space, in which the following properties hold for all vectors a and b and for all numbers
c and d:
c(a + b) = ca + cb , (c + d)a = ca + da , (cd)a = c(da) and 1a = a . (11)
Another example of vector space is the set of all positions of an object. Does the set of all rotations form a
Challenge 108 n vector space? All vector spaces allow the definition of a unique null vector and of a single negative vector
for each vector in it.
Note that vectors do not have specified points at which they start: two arrows with same direction and
length are the same vector, even if they start at different points in space.
Copyright © Christoph Schiller November 1997–January 2007
In many vector spaces the concept of length (specifying the ‘magnitude’) can be introduced, usually via
an intermediate step. A vector space is called Euclidean if one can define for it a scalar product between two
vectors, a number ab satisfying
O b s e r va t i o n Accelera-
tion
of coordinates, is called a point particle or a point mass. In equation (4), the stone was
assumed to be such a point particle.
Do point-like objects, i.e. objects smaller than anything one can measure, exist in daily
Copyright © Christoph Schiller November 1997–January 2007
life? Yes and no. The most notable examples are the stars. At present, angular sizes as small
as 2 µrad can be measured, a limit given by the fluctuations of the air in the atmosphere.
In space, such as for the Hubble telescope orbiting the Earth, the angular limit is due to
the diameter of the telescope and is of the order of 10 nrad. Practically all stars seen from
Earth are smaller than that, and are thus effectively ‘point-like’, even when seen with the
most powerful telescopes.
As an exception to the general rule, the size of a few large and nearby stars, of red
giant type, can be measured with special instruments.* Betelgeuse, the higher of the two
shoulders of Orion shown in Figure 28, Mira in Cetus, Antares in Scorpio, Aldebaran in
* The website http://www.astro.uiuc.edu/~kaler/sow/sowlist.html gives an introduction to the different
types of stars. The http://www.astro.wisc.edu/~dolan/constellations/constellations.html website provides de-
how to describe motion – kinematics 73
α γ
Betelgeuse
Bellatrix
ε δ Mintaka
ζ Alnilam
Alnitag
β
κ
Rigel
Saiph
Taurus and Sirius in Canis Major are examples of stars whose size has been measured;
F I G U R E 29 How an object can rotate continuously without tangling up the connection to a second
object
they are made of parts. This is the same approach as describing the motion of an animal as
a whole by combining the motion of its various body parts. The simplest description, the
continuum approximation, describes extended bodies as an infinite collection of point
particles. It allows us to understand and to predict the motion of milk and honey, the
motion of the air in hurricanes and of perfume in rooms. The motion of fire and all other
Challenge 118 n tion in your own body? Are you able to see how many cables may be attached to the
Challenge 119 n rotating body of the figure without hindering the rotation?
Despite the possibility of animals having rotating parts, the method of Figure 29 still
Challenge 120 n cannot be used to make a practical wheel or propeller. Can you see why? Evolution had
no choice: it had to avoid animals with parts rotating around axles. That is the reason
that propellers and wheels do not exist in nature. Of course, this limitation does not rule
Ref. 53 out that living bodies move by rotation as a whole: tumbleweed, seeds from various trees,
some insects, certain other animals, children and dancers occasionally move by rolling
or rotating as a whole.
Single bodies, and thus all living beings, can only move through deformation of their
shape: therefore they are limited to walking, running, crawling or flapping wings or fins.
Page 117 Examples of legs are shown in Figure 30 and in Figure 65, examples of wings can be seen
objects and images 75
in Figure 388 on page 993. In contrast, systems of several bodies, such as bicycles, pedal
boats or other machines, can move without any change of shape of their components,
thus enabling the use of axles with wheels, propellers or other rotating devices.*
In summary, whenever we observe a construction in which some part is turning con-
falling snowflakes, and material ejected by volcanoes are examples of motion, since they
all change position over time. For the same reason, the shadow following our body, the
beam of light circling the tower of a lighthouse on a misty night, and the rainbow that
constantly keeps the same apparent distance from the hiker are examples of motion.
* Despite the disadvantage of not being able to use rotating parts and of being restricted to one piece only,
nature’s moving constructions, usually called animals, often outperform human built machines. As an ex-
ample, compare the size of the smallest flying systems built by evolution with those built by humans. (See,
e.g. http://pixelito.reference.be.) There are two reasons for this discrepancy. First, nature’s systems have in-
tegrated repair and maintenance systems. Second, nature can build large structures inside containers with
small openings. In fact, nature is very good at what people do when they build sailing ships inside glass
Challenge 121 n bottles. The human body is full of such examples; can you name a few?
76 i galilean motion • 2. galilean physics – motion in everyday life
push
F I G U R E 31 In which direction
does the bicycle turn?
Everybody who has ever seen an animated cartoon knows that images can move in
more surprising ways than objects. Images can change their size, shape and even colour,
a feat only few objects are able to perform.* Images can appear and disappear without
trace, multiply, interpenetrate, go backwards in time and defy gravity or any other force.
Images, even ordinary shadows, can move faster than light. Images can float in space
and keep the same distance from approaching objects. Objects can do almost none of
Ref. 57 this. In general, the ‘laws of cartoon physics’ are rather different from those in nature. In
* Excluding very slow changes such as the change of colour of leaves in the Fall, in nature only certain
crystals, the octopus, the chameleon and a few other animals achieve this. Of man-made objects, television,
Challenge 122 n computer displays, heated objects and certain lasers can do it. Do you know more examples? An excellent
source of information on the topic of colour is the book by K. Nassau, The Physics and Chemistry of Colour
– the fifteen causes of colour, J. Wiley & Sons, 1983. In the popular science domain, the most beautiful book is
the classic work by the Flemish astronomer Marcel G.J. Minnaert, Light and Colour in the Outdoors,
Springer, 1993, an updated version based on his wonderful book series, De natuurkunde van ‘t vrije veld,
Ref. 56 Thieme & Cie, 1937. Reading it is a must for all natural scientists. On the web, there is also the – simpler, but
excellent – http://webexhibits.org/causesofcolour website.
** One could propose including the requirement that objects may be rotated; however, this requirement gives
difficulties in the case of atoms, as explained on page 770, and with elementary particles, so that rotation is
not made a separate requirement.
objects and images 77
In the same way that objects are made of matter, images are made of radiation. Images
Ref. 58 are the domain of shadow theatre, cinema, television, computer graphics, belief systems
and drug experts. Photographs, motion pictures, ghosts, angels, dreams and many hallu-
cinations are images (sometimes coupled with brain malfunction). To understand images,
we need to study radiation (plus the eye and the brain). However, due to the importance
of objects – after all we are objects ourselves – we study the latter first.
Ref. 59
“ of movement: That resulting from collision.
When a child rides a monocycle, she or he makes use of a general rule in our world:
Aetius, Opinions.
”
one body acting on another puts it in motion. Indeed, in about six hours, anybody can
What is mass?
“
∆ός µοι ποῦ στω καὶ κινῶ τὴν γῆν.
Da ubi consistam, et terram movebo.**
Archimedes
When we push something we are unfamiliar with, such as when we kick an object on
the street, we automatically pay attention to the same aspect that children explore when
they stand in front of a mirror for the first time, or when they see a red laser spot for
” Copyright © Christoph Schiller November 1997–January 2007
the first time. We check whether the unknown entity can be pushed and pay attention to
how the unknown object moves under our influence. The high precision version of the
experiment is shown in Figure 32. Repeating the experiment with various pairs of objects,
we find – as in everyday life – that a fixed quantity m i can be ascribed to every object i.
The more difficult it is to move an object, the higher the quantity; it is determined by the
* This surprising effect obviously works only above a certain minimal speed. Can you determine what this
Challenge 125 n speed is? Be careful! Too strong a push will make you fall.
** ‘Give me a place to stand, and I’ll move the Earth.’ Archimedes (c. 283–212), Greek scientist and engineer.
Ref. 60 This phrase was attributed to him by Pappus. Already Archimedes knew that the distinction used by lawyers
between movable and immovable property made no sense.
78 i galilean motion • 2. galilean physics – motion in everyday life
v1 v2
v'2
v'1
relation
=−
m2 ∆v 1
(13)
m1 ∆v 2
where ∆v is the velocity change produced by the collision. The number m i is called the
mass of the object i.
In order to have mass values that are common to everybody, the mass value for one par-
ticular, selected object has to be fixed in advance. This special object, shown in Figure 33
is called the standard kilogram and is kept with great care under vacuum in a glass con-
tainer in Sèvres near Paris. It is touched only once every few years because otherwise dust,
humidity, or scratches would change its mass. Through the standard kilogram the value
of the mass of every other object in the world is determined.
The mass thus measures the difficulty of getting something moving. High masses are
harder to move than low masses. Obviously, only objects have mass; images don’t. (By the
Ref. 49 way, the word ‘mass’ is derived, via Latin, from the Greek µαζα – bread – or the Hebrew Copyright © Christoph Schiller November 1997–January 2007
The principle of conservation of mass was first stated by Antoine-Laurent Lavoisier.* Con-
* Antoine-Laurent Lavoisier (1743–1794), French chemist and a genius. Lavoisier was the first to under-
stand that combustion is a reaction with oxygen; he discovered the components of water and introduced
mass measurements into chemistry. When he was (unjustly) sentenced to the guillotine during the French
revolution, he decided to use the experience for a scientific experiment; he decided to blink his eyes as fre-
quently as possible after his head was cut off, in order to show others how long it takes to lose consciousness.
objects and images 79
servation of mass implies that the mass of a composite system is the sum of the mass of
the components. In short, Galilean mass is a measure for the quantity of matter.
The product of the velocity vi and the mass m i is called the mo-
mentum of the body. The sum, or total momentum of the system, is
the same before and after the collision; it is a conserved quantity. Mo-
mentum conservation defines mass. The two conservation principles Christiaan Huygens
(14) and (15) were first stated in this way by the important Dutch physicist Christiaan
Huygens.* Some typical momentum values are given in Table 11.
Momentum conservation implies that when a moving sphere hits a resting one of the
same mass and without loss of energy, a simple rule determines the angle between the
Challenge 126 n directions the two spheres take after the collision. Can you find this rule? It is particularly
useful when playing billiards. We will find out later that it is not valid in special relativity.
Copyright © Christoph Schiller November 1997–January 2007
Another consequence is shown in Figure 34: a man lying on a bed of nails with two
large blocks of concrete on his stomach. Another man is hitting the concrete with a heavy
sledgehammer. As the impact is mostly absorbed by the concrete, there is no pain and no
Challenge 127 n danger – unless the concrete is missed. Why?
F I G U R E 34 Is this dangerous?
O b s e r va t i o n Momentum
The above definition of mass has been generalized by the physicist and philosopher
Ernst Mach* in such a way that it is valid even if the two objects interact without contact,
as long as they do so along the line connecting their positions. The mass ratio between
two bodies is defined as a negative inverse acceleration ratio, thus as
=− ,
m2 a1
Copyright © Christoph Schiller November 1997–January 2007
(16)
m1 a2
where a is the acceleration of each body during the interaction. This definition has been
studied in much detail in the physics community, mainly in the nineteenth century. A
few points sum up the results:
* Ernst Mach (1838 Chrlice–1916 Vaterstetten), Austrian physicist and philosopher. The mach unit for aero-
plane speed as a multiple of the speed of sound in air (about 0.3 km~s) is named after him. He developed the
so-called Mach–Zehnder interferometer; he also studied the basis of mechanics. His thoughts about mass
and inertia influenced the development of general relativity, and led to Mach’s principle, which will appear
later on. He was also proud to be the last scientist denying – humorously, and against all evidence – the
existence of atoms.
objects and images 81
— The definition of mass implies the conservation of momentum P mv. Momentum con-
servation is not a separate principle. Conservation of momentum cannot be checked
experimentally, because mass is defined in such a way that the principle holds.
— The definition of mass implies the equality of the products m 1 a 1 and −m 2 a 2 . Such
products are called forces. The equality of acting and reacting forces is not a separate
principle; mass is defined in such a way that the principle holds.
— The definition of mass is independent of whether contact is involved or not, and
whether the origin of the accelerations is due to electricity, gravitation, or other in-
teractions.* Since the interaction does not enter the definition of mass, mass values
defined with the help of the electric, nuclear or gravitational interaction all agree, as
long as momentum is conserved. All known interactions conserve momentum. For
some unfortunate historical reasons, the mass value measured with the electric or
nuclear interactions is called the ‘inertial’ mass and the mass measured using grav-
ity is called the ‘gravitational’ mass. As it turns out, this artificial distinction has no
Challenge 129 n
The definition of mass through momentum conservation implies that when an object
falls, the Earth is accelerated upwards by a tiny amount. If one could measure this tiny
amount, one could determine the mass of the Earth. Unfortunately, this measurement is
Challenge 130 n impossible. Can you find a better way to determine the mass of the Earth?
Summarizing Table 13, the mass of a body is thus most precisely described by a positive
* As mentioned above, only central forces obey the relation (16) used to define mass. Central forces act
Page 91 between the centre of mass of bodies. We give a precise definition later. However, since all fundamental forces
are central, this is not a restriction. There seems to be one notable exception: magnetism. Is the definition
Challenge 128 n of mass valid in this case?
** In particular, in order to define mass we must be able to distinguish bodies. This seems a trivial require-
ment, but we discover that this is not always possible in nature.
82 i galilean motion • 2. galilean physics – motion in everyday life
O b s e r va t i o n Mass
Challenge 131 e than an equivalent positive mass). Are you able to confirm this? However, a small positive
mass object would float away from a large negative-mass body, as you can easily deduce
by comparing the various accelerations involved. A positive and a negative mass of the
same value would stay at constant distance and spontaneously accelerate away along the
Challenge 132 e line connecting the two masses. Note that both energy and momentum are conserved in
all these situations.* Negative-mass bodies have never been observed. Antimatter, which
Page 323, page 778 will be discussed later, also has positive mass.
Is motion eternal?
Every body continues in the state of rest or of
The product p = mv of mass and velocity is called the momentum of a particle; it describes ”
ible fluid. In an interaction, the invisible fluid is transferred from one object to another.
However, the sum is always constant.
* For more curiosities, see R.H. Price, Negative mass can be positively amusing, American Journal of Phys-
ics 61, pp. 216–217, 1993. Negative mass particles in a box would heat up a box made of positive mass while
Page 84 traversing its walls, and accelerating, i.e. losing energy, at the same time. They would allow one to build a
Challenge 133 e perpetuum mobile of the second kind, i.e. a device circumventing the second principle of thermodynamics.
Moreover, such a system would have no thermodynamic equilibrium, because its energy could decrease
forever. The more one thinks about negative mass, the more one finds strange properties contradicting ob-
Challenge 134 n servations. By the way, what is the range of possible mass values for tachyons?
** Arthur Eddington (1882–1944), British astrophysicist.
84 i galilean motion • 2. galilean physics – motion in everyday life
Ref. 61 * Usually adenosine triphosphate (ATP), the fuel of most processes in animals.
Copyright © Christoph Schiller November 1997–January 2007
** Some funny examples of past attempts to built a perpetual motion machine are described in Stanislav
Michel, Perpetuum mobile, VDI Verlag, 1976. Interestingly, the idea of eternal motion came to Europe
from India, via the Islamic world, around the year 1200, and became popular as it opposed the then standard
view that all motion on Earth disappears over time. See also the http://www.geocities.com/mercutio78_99/
pmm.html and the http://www.lhup.edu/~dsimanek/museum/unwork.htm websites. The conceptual mis-
take made by eccentrics and used by crooks is always the same: the hope of overcoming friction. (In fact,
this applied only to the perpetual motion machines of the second kind; those of the first kind – which are
even more in contrast with observation – even try to generate energy from nothing.)
If the machine is well constructed, i.e. with little friction, it can take the little energy it needs for the
sustenance of its motion from very subtle environmental effects. For example, in the Victoria and Albert
Ref. 62 Museum in London one can admire a beautiful clock powered by the variations of air pressure over time.
Low friction means that motion takes a long time to stop. One immediately thinks of the motion of the
Challenge 137 n planets. In fact, there is friction between the Earth and the Sun. (Can you guess one of the mechanisms?)
But the value is so small that the Earth has already circled around the Sun for thousands of millions of years,
objects and images 85
scale. In other words, the disappearance and also the spontaneous appearance of motion
in everyday life is an illusion due to the limitations of our senses. For example, the mo-
tion proper of every living being exists before its birth, and stays after its death. The same
happens with its energy. This result is probably the closest one can get to the idea of ever-
lasting life from evidence collected by observation. It is perhaps less than a coincidence
that energy used to be called vis viva, or ‘living force’, by Leibniz and many others.
Since motion is conserved, it has no origin. Therefore, at this stage of our walk we
cannot answer the fundamental questions: Why does motion exist? What is its origin?
The end of our adventure is nowhere near.
Kinetic energy thus depends on the mass and on the square of the speed v of a body. Kin-
etic energy is the ability that a body has to induce change in bodies it hits. The full name
‘kinetic energy’ was introduced by Gustave-Gaspard Coriolis.* Coriolis also introduced
Challenge 138 n the factor 1/2, in order that the relation dT~dv = p would be obeyed. (Why?) Energy is a
word taken from ancient Greek; originally it was used to describe character, and meant
‘intellectual or moral vigour’. It was taken into physics by Thomas Young (1773–1829) in
1807 because its literal meaning is ‘force within’. (The letters E, W, A and several others are
also used to denote energy.) Another, equivalent definition of energy will become clear
later: energy is what can be transformed into heat.
(Physical) energy is the measure of the ability to generate motion. A body has a lot
of energy if it has the ability to move many other bodies. Energy is a number; it has no
direction. The total momentum of two equal masses moving with opposite velocities is
zero; their total energy increases with velocity. Energy thus also measures motion, but
in a different way than momentum. Energy measures motion in a more global way. An
equivalent definition is the following. Energy is the ability to perform work. Here, the
physical concept of work is just the precise version of what is meant by work in everyday
Copyright © Christoph Schiller November 1997–January 2007
life.**
Do not be surprised if you do not grasp the difference between momentum and energy
straight away: physicists took about two centuries to figure it out. For some time they even
insisted on using the same word for both, and often they didn’t know which situation
required which concept. So you are allowed to take a few minutes to get used to it.
Both energy and momentum measure how systems change. Momentum tells how sys-
tems change over distance, energy measures how systems change over time. Momentum
O b s e r va t i o n Energy
is needed to compare motion here and there. Energy is needed to compare motion now
and later. Some measured energy values are given in Table 14. Copyright © Christoph Schiller November 1997–January 2007
Challenge 143 e became fashionable to use travellers in rockets. (What will come next?) Galileo explained
that only relative velocities between bodies produce effects, not the absolute values of the
velocities. For the senses, there is no difference between constant, undisturbed motion,
O b s e r va t i o n Power
however rapid it may be, and rest. This is now called Galileo’s principle of relativity. In
everyday life we feel motion only if the means of transportation trembles (thus if it ac-
celerates), or if we move against the air. Therefore Galileo concludes that two observers
in straight and undisturbed motion against each other cannot say who is ‘really’ moving.
Whatever their relative speed, neither of them ‘feels’ in motion.*
Copyright © Christoph Schiller November 1997–January 2007
* In 1632, in his Dialogo, Galileo writes: ‘Shut yourself up with some friend in the main cabin below decks
on some large ship, and have with you there some flies, butterflies, and other small flying animals. Have
a large bowl of water with some fish in it; hang up a bottle that empties drop by drop into a wide vessel
beneath it. With the ship standing still, observe carefully how the little animals fly with equal speed to all
sides of the cabin. The fish swim indifferently in all directions; the drops fall into the vessel beneath; and,
in throwing something to your friend, you need throw it no more strongly in one direction than another,
the distances being equal: jumping with your feet together, you pass equal spaces in every direction. When
you have observed all these things carefully (though there is no doubt that when the ship is standing still
everything must happen in this way), have the ship proceed with any speed you like, so long as the motion is
uniform and not fluctuating this way and that, you will discover not the least change in all the effects named,
nor could you tell from any of them whether the ship was moving or standing still. In jumping, you will pass
on the floor the same spaces as before, nor will you make larger jumps toward the stern than toward the prow
even though the ship is moving quite rapidly, despite the fact that during the time you are in the air the floor
under you will be going in a direction opposite to your jump. In throwing something to your companion,
objects and images 89
Daß die Sonne morgen aufgehen wird, ist eine Hypothese; und das heißt:
wir wissen nicht, ob sie aufgehen wird.*
The statement is wrong. Can you explain why Wittgenstein erred here, despite his strong
Rotation
Rotation keeps us alive. Without the change of day and night, we would be either fried or
frozen to death, depending on our location on our planet. A short summary of rotation
is thus appropriate. We saw before that a body is described by its reluctance to move;
similarly, a body also has a reluctance to turn. This quantity is called its moment of inertia
and is often abbreviated Θ. The speed or rate of rotation is described by angular velocity,
usually abbreviated ω. A few values found in nature are given in Table 16.
The observables that describe rotation are similar to those describing linear motion,
as shown in Table 17. Like mass, the moment of inertia is defined in such a way that the
sum of angular momenta L – the product of moment of inertia and angular velocity – is
conserved in systems that do not interact with the outside world:
Q Θ i ω i = Q L i = const . (18)
i i
In the same way that linear momentum conservation defines mass, angular momentum
you will need no more force to get it to him whether he is in the direction of the bow or the stern, with
Copyright © Christoph Schiller November 1997–January 2007
yourself situated opposite. The droplets will fall as before into the vessel beneath without dropping toward
the stern, although while the drops are in the air the ship runs many spans. The fish in their water will swim
toward the front of their bowl with no more effort than toward the back, and will go with equal ease to bait
placed anywhere around the edges of the bowl. Finally the butterflies and flies will continue their flights
indifferently toward every side, nor will it ever happen that they are concentrated toward the stern, as if
tired out from keeping up with the course of the ship, from which they will have been separated during long
intervals by keeping themselves in the air. And if smoke is made by burning some incense, it will be seen
going up in the form of a little cloud, remaining still and moving no more toward one side than the other.
The cause of all these correspondences of effects is the fact that the ship’s motion is common to all the things
contained in it, and to the air also. That is why I said you should be below decks; for if this took place above
in the open air, which would not follow the course of the ship, more or less noticeable differences would be
seen in some of the effects noted.’
* ‘It is a hypothesis that the Sun will rise tomorrow; and this means that we do not know whether it will rise.’
This well-known statement is found in Ludwig Wittgenstein, Tractatus, 6.36311.
90 i galilean motion • 2. galilean physics – motion in everyday life
O b s e r va t i o n Angular velocity
ω = 2π~T
Θ = Q m n r n2 , (19)
n
where r n is the distance from the mass element m n to the axis of rotation. Can you con-
Challenge 145 e firm the expression? Therefore, the moment of inertia of a body depends on the chosen
Challenge 146 n axis of rotation. Can you confirm that this is so for a brick?
Obviously, the value of the moment of inertia also depends on the location of the axis
objects and images 91
L fingers in
rotation
sense;
thumb
r shows
index: "p" angular
A momentum
thumb: "r"
p
used for its definition. For each axis direction, one distinguishes an intrinsic moment of
inertia, when the axis passes through the centre of mass of the body, from an extrinsic
L=rp= L=rp=
2A(T)m 2A(T)m
so that (21)
T T
where p is the momentum of the particle, A(T) is the surface swept by the position vector
r of the particle during time T.** The angular momentum thus points along the rotation
where d is the distance between the centre of mass and the axis of extrinsic rotation. This relation is called
Challenge 147 n Steiner’s parallel axis theorem. Are you able to deduce it? Copyright © Christoph Schiller November 1997–January 2007
** For the curious, the result of the cross product or vector product ab between two vectors a and b is defined
as that vector that is orthogonal to both, whose orientation is given by the right-hand rule, and whose length
is given by ab sin ∢(a, b), i.e. by the surface area of the parallelogram spanned by the two vectors. From the
Challenge 150 e definition you can show that the vector product has the properties
a b = −b a , a (b + c) = a b + a c , λa b = λ(a b) = a λb , a a = 0 ,
a(b c) = b(c a) = c(a b) , a (b c) = b(ac) − c(ab) ,
(a b)(c d) = a(b (c d)) = (ac)(bd) − (bc)(ad) ,
(a b) (c d) = c((a b)d) − d((a b)c) , a (b c) + b (c a) + c (a b) = 0 . (22)
Page 1219 The vector product exists (almost) only in three-dimensional vector spaces. (See Appendix D.) The cross
product vanishes if and only if the vectors are parallel. The parallelepiped spanned by three vectors a, b and
Challenge 151 e c has the volume V = c(a b). The pyramid or tetrahedron formed by the three vectors has one sixth of
that volume.
92 i galilean motion • 2. galilean physics – motion in everyday life
frictionless
axis
F I G U R E 37 How a snake turns itself around its F I G U R E 38 Can the ape reach
axis the banana?
L2
E rot = 21 Θ ω 2 = . (23)
2Θ
The expression is similar to the expression for the kinetic energy of a particle. Can you
guess how much larger the rotational energy of the Earth is compared with the yearly
Challenge 152 n electricity usage of humanity? In fact, if you can find a way to harness this energy, you
will become famous.
As in the case of linear motion, rotational energy and angular momentum are not
always conserved in the macroscopic world: rotational energy can change due to friction,
and angular momentum can change due to external forces (torques). However, for closed
systems both quantities always conserved on the microscopic scale.
On a frictionless surface, as approximated by smooth ice or by a marble floor covered
by a layer of oil, it is impossible to move forward. In order to move, we need to push
against something. Is this also the case for rotation?
Surprisingly, it is possible to turn even without pushing against something. You can
check this on a well-oiled rotating office chair: simply rotate an arm above the head. After Copyright © Christoph Schiller November 1997–January 2007
each turn of the hand, the orientation of the chair has changed by a small amount. Indeed,
conservation of angular momentum and of rotational energy do not prevent bodies from
changing their orientation. Cats learn this in their youth. After they have learned the
trick, if they are dropped legs up, they can turn themselves in such a way that they always
Ref. 65 land feet first. Snakes also know how to rotate themselves, as Figure 37 shows. During
the Olympic Games one can watch board divers and gymnasts perform similar tricks.
Challenge 153 d Rotation is thus different from translation in this aspect. (Why?)
Angular momentum is conserved. This statement is valid for any axis, provided that
friction plays no role. external forces (torques) play no role. To make the point, Jean-Marc
Ref. 3 Lévy-Leblond poses the problem of Figure 38. Can the ape reach the banana without
leaving the plate, assuming that the plate on which the ape rests can turn around the axis
Challenge 154 n without friction?
objects and images 93
ωr ω d = vp
P ωR
r ω R = vaxis
d
R
which shows that a rolling wheel does indeed rotate about its point of contact with the
ground.
Surprisingly, when a wheel rolls, some points on it move towards the wheel’s axis, some
stay at a fixed distance and others move away from it. Can you determine where these
various points are located? Together, they lead to an interesting pattern when a rolling
Copyright © Christoph Schiller November 1997–January 2007
Challenge 156 n
Ref. 66 wheel with spokes, such as a bicycle wheel, is photographed.
Ref. 67 With these results you can tackle the following beautiful challenge. When a turning
bicycle wheel is put on a slippery surface, it will slip for a while and then end up rolling.
Challenge 157 d How does the final speed depend on the initial speed and on the friction?
How do we walk?
Golf is a good walk spoiled.
“
Why do we move our arms when walking or running? To conserve energy. In fact, when
a body movement is performed with as little energy as possible, it is natural and graceful.
Mark Twain
”
94 i galilean motion • 2. galilean physics – motion in everyday life
(This can indeed be taken as the actual definition of grace. The connection is common
Ref. 18 knowledge in the world of dance; it is also a central aspect of the methods used by actors
to learn how to move their bodies as beautifully as possible.)
To convince yourself about the energy savings, try walking or running with your arms
fixed or moving in the opposite direction to usual: the effort required is considerably
higher. In fact, when a leg is moved, it produces a torque around the body axis which
has to be counterbalanced. The method using the least energy is the swinging of arms.
Since the arms are lighter than the legs, they must move further from the axis of the body,
to compensate for the momentum; evolution has therefore moved the attachment of the
arms, the shoulders, farther apart than those of the legs, the hips. Animals on two legs but
no arms, such as penguins or pigeons, have more difficulty walking; they have to move
their whole torso with every step.
Which muscles do most of the work when walking, the motion that experts call gait? In
Ref. 68 1980, Serge Gracovetsky found that in human gait most power comes from the muscles
along the spine, not from the legs. (Indeed, people without legs are also able to walk.) Copyright © Christoph Schiller November 1997–January 2007
When you take a step, the lumbar muscles straighten the spine; this automatically makes it
turn a bit to one side, so that the knee of the leg on that side automatically comes forward.
When the foot is moved, the lumbar muscles can relax, and then straighten again for the
next step. In fact, one can experience the increase in tension in the back muscles when
Challenge 158 e walking without moving the arms, thus confirming where the human engine is located.
Human legs differ from those of apes in a fundamental aspect: humans are able to run.
In fact the whole human body has been optimized for running, an ability that no other
primate has. The human body has shed most of its hair to achieve better cooling, has
evolved the ability to run while keeping the head stable, has evolved the right length of
arms for proper balance when running, and even has a special ligament in the back that
works as a shock absorber while running. In other words, running is the most human of
all forms of motion.
objects and images 95
sphere 5 km
5 km 5 km
Equator
5 km
“
The search for answers to this question gives a beautiful cross section of the history of
Anonymous*
”
classical physics. Around the year 265 bce, the Greek thinker Aristarchos of Samos main-
Ref. 69 tained that the Earth rotates. He had measured the parallax of the Moon (today known to
be up to 0.95°) and of the Sun (today known to be 8.8 ′ ).** The parallax is an interesting
effect; it is the angle describing the difference between the directions of a body in the sky
Copyright © Christoph Schiller November 1997–January 2007
when seen by an observer on the surface of the Earth and when seen by a hypothetical
observer at the Earth’s centre. (See Figure 42.) Aristarchos noticed that the Moon and the
Sun wobble across the sky, and this wobble has a period of 24 hours. He concluded that
the Earth rotates.
Page 284 Measurements of the aberration of light also show the rotation of the Earth; it can
be detected with a telescope while looking at the stars. The aberration is a change of the
expected light direction, which we will discuss shortly. At the Equator, Earth rotation
* ‘And yet she moves’ is the sentence falsely attributed to Galileo about the Earth. It is true, however, that at
his trial he was forced to publicly retract the idea of a moving Earth to save his life (see the footnote on page
225).
** For the definition of angles see page 61 and for the definition of angle units see Appendix B.
96 i galilean motion • 2. galilean physics – motion in everyday life
v = ω (R+h)
h N h
v=ωR
ϕ
N
Equator
E
centre of the rotating Earth. The elliptical shape shows that the path of a thrown stone
does not lie on a plane for an observer standing on Earth; for such an observer, the exact
path thus cannot be drawn on a piece of paper.
In 1835, Gustave-Gaspard Coriolis found a closely related effect that nobody had as yet
noticed in everyday life. Imagine an ball that rolls over a table. For a person on the floor,
* Pierre Louis Moreau de Maupertuis (1698–1759), French physicist and mathematician. He was one of the
key figures in the quest for the principle of least action, which he named in this way. He was also found-
ing president of the Berlin Academy of Sciences. Maupertuis thought that the principle reflected the max-
imization of goodness in the universe. This idea was thoroughly ridiculed by Voltaire in this Histoire du
Docteur Akakia et du natif de Saint-Malo, 1753. Maupertuis (http://www.voltaire-integral.com/Html/23/
08DIAL.htm) performed his measurement of the Earth to distinguish between the theory of gravitation of
Newton and that of Descartes, who had predicted that the Earth is elongated at the poles, instead of flattened.
objects and images 97
the ball rolls in a straight line. Now imagine that the table rotates. For the person on the
floor, the ball still rolls in a straight line. But for a person on the rotating table, the ball
traces a curved path. In short, any object that travels in a rotating background is subject
to a transversal acceleration. This is the so-called Coriolis acceleration (or Coriolis effect).
Now, the Earth is a rotating background. On the northern hemisphere, the rotation
is anticlockwise. As the result, any moving object is slightly deviated to the right (while
the magnitude of its velocity stays constant). On Earth, like on all rotating backgrounds,
the Coriolis acceleration results from the change of distance to the rotation axis. Can you
Challenge 160 n deduce the analytical expression for it, namely aC = −2ω v?
On Earth, the Coriolis acceleration generally has a small value. Therefore it is best ob-
served either in large-scale or high-speed phenomena. Indeed, the Coriolis acceleration
determines the handedness of many large-scale phenomena with a spiral shape, such as
the directions of cyclones and anticyclones in meteorology, the general wind patterns on
Earth and the deflection of ocean currents and tides. These phenomena have opposite
the Earth, but results from the way the water starts to flow out. But let us go on with the
story about the Earth’s rotation.
Finally, in 1851, the French physician-turned-physicist Jean Bernard Léon Foucault
(b. 1819 Paris, d. 1868 Paris) performed an experiment that removed all doubts and
rendered him world-famous practically overnight. He suspended a 67 m long pendulum*
in the Panthéon in Paris and showed the astonished public that the direction of its swing
changed over time, rotating slowly. To anybody with a few minutes of patience to watch
the change of direction, the experiment proved that the Earth rotates. If the Earth did
Challenge 161 d * Why was such a long pendulum necessary? Understanding the reasons allows one to repeat the experiment
Ref. 73 at home, using a pendulum as short as 70 cm, with the help of a few tricks. To observe the effect with a simple
set-up, attach a pendulum to your office chair and rotate it slowly.
98 i galilean motion • 2. galilean physics – motion in everyday life
ψ1
N
Earth's
centre φ
Eq
u ato
r
ψ0
ψ1
not rotate, the swing of the pendulum would always continue in the same direction. On a
TFoucault =
24 h
(26)
sin φ
where φ is the latitude of the location of the pendulum, e.g. 0° at the Equator and 90° at
the North Pole. This formula is one of the most beautiful results of Galilean kinematics.**
Foucault was also the inventor and namer of the gyroscope. He
built the device, shown in Figure 46, in 1852, one year after his pen-
dulum. With it, he again demonstrated the rotation of the Earth.
Once a gyroscope rotates, the axis stays fixed in space – but only
when seen from distant stars or galaxies. (This is not the same as
Challenge 163 n talking about absolute space. Why?) For an observer on Earth, the
axis direction changes regularly with a period of 24 hours. Gyro-
scopes are now routinely used in ships and in aeroplanes to give
the direction of north, because they are more precise and more re-
liable than magnetic compasses. In the most modern versions, one
F I G U R E 46 The
Copyright © Christoph Schiller November 1997–January 2007
* The discovery also shows how precision and genius go together. In fact, the first person to observe the
effect was Vincenzo Viviani, a student of Galilei, as early as 1661! Indeed, Foucault had read about Viviani’s
work in the publications of the Academia dei Lincei. But it took Foucault’s genius to connect the effect to
the rotation of the Earth; nobody had done so before him.
** The calculation of the period of Foucault’s pendulum assumed that the precession rate is constant during
a rotation. This is only an approximation (though usually a good one).
Challenge 164 n *** Can you guess how rotation is detected in this case?
objects and images 99
r
E
m m W
Challenge 168 n
∆φ =
8π Ω A
(27)
cλ
where A is the area (vector) enclosed by the two interfering light rays, λ their wavelength
and c the speed of light. The effect is now called the Sagnac effect, even though it had
Ref. 75 already been predicted 20 years earlier by Oliver Lodge.** Michelson and his team found
* Albert Abraham Michelson (b. 1852 Strelno, d. 1931 Pasadena) Prussian–Polish–US-American physicist,
obsessed by the precise measurement of the speed of light, received the Nobel Prize for Physics in 1907.
** Oliver Lodge (1851–1940) was a British physicist who studied electromagnetic waves and tried to com-
municate with the dead. A strange but influential figure, his ideas are often cited when fun needs to be made
100 i galilean motion • 2. galilean physics – motion in everyday life
a fringe shift with a period of 24 hours and of exactly the magnitude predicted by the
rotation of the Earth. Modern high precision versions use ring lasers with areas of only
a few square metres, but which are able to measure variations of the rotation rates of the
Earth of less than one part per million. Indeed, over the course of a year the length of a
day varies irregularly by a few milliseconds, mostly due to influences from the Sun or the
Ref. 76 Moon, due to weather changes and due to hot magma flows deep inside the Earth.* But
also earthquakes, the el Ninño effect in the climate and the filling of large water dams have
effects on the rotation of the Earth. All these effects can be studied with such precision
interferometers; these can also be used for research into the motion of the soil due to
lunar tides or earthquakes, and for checks on the theory of relativity.
In summary, observations show that the Earth surface rotates at 463 m~s at the Equator,
a larger value than that of the speed of sound in air – about 340 m~s in usual conditions
– and that we are in fact whirling through the universe.
except one, turn around themselves in the same direction, and that they also all turn
Ref. 79 around the Sun in that same direction. But the complete story is outside the scope of this
text.
The rotation around its axis is not the only motion of the Earth; it performs other
motions as well. This was already known long ago. In 128 bce, the Greek astronomer
Hipparchos discovered what is today called the (equinoctial) precession. He compared a
measurement he made himself with another made 169 years before. Hipparchos found
of physicists; for example, he was one of those (rare) physicists who believed that at the end of the nineteenth
century physics was complete.
* The growth of leaves on trees and the consequent change in the Earth’s moment of inertia, already studied
in 1916 by Harold Jeffreys, is too small to be seen, so far.
objects and images 101
nutation period
year 15000: is 18.6 years year 2000:
North pole is North pole is
Vega in Polaris in the
Lyra Ursa minor
precession
N
Moon’s
path
Moon
th
Earth’s pa
that the Earth’s axis points to different stars at different times. He concluded that the sky
was moving. Today we prefer to say that the axis of the Earth is moving. During a period
of 25 800 years the axis draws a cone with an opening angle of 23.5°. This motion, shown
in Figure 49, is generated by the tidal forces of the Moon and the Sun on the equatorial
bulge of the Earth that results form its flattening. The Sun and the Moon try to align the
axis of the Earth at right angles to the Earth’s path; this torque leads to the precession of
the Earth’s axis. (The same effect appears for any spinning top or in the experiment with
Copyright © Christoph Schiller November 1997–January 2007
* The circular motion, a wobble, was predicted by the great Swiss mathematician Leonhard Euler (1707–
1783). Using this prediction and Küstner’s data, in 1891 Seth Chandler claimed to be the discoverer of the
circular component.
102 i galilean motion • 2. galilean physics – motion in everyday life
a period of about 1.2 years due to fluctuations in the pressure at the bottom of the oceans.
In practice, the North Pole moves with an amplitude of 15 m around an average central
Ref. 80 position.
In 1912, the German meteorologist and geophysicist Alfred Wegener (1880–1930) dis-
covered an even larger effect. After studying the shapes of the continental shelves and the
geological layers on both sides of the Atlantic, he conjectured that the continents move,
and that they are all fragments of a single continent that broke up 200 million years ago.*
Even though at first derided across the world, Wegener’s discoveries were correct. Mod-
ern satellite measurements, shown in Figure 50, confirm this model. For example, the
American continent moves away from the European continent by about 10 mm every
year. There are also speculations that this velocity may have been much higher at certain
periods in the past. The way to check this is to look at the magnetization of sedimental
rocks. At present, this is still a hot topic of research. Following the modern version of the
model, called plate tectonics, the continents (with a density of 2.7 ë 103 kg~m3 ) float on the
Copyright © Christoph Schiller November 1997–January 2007
Page 107, page 614 fluid mantle of the Earth (with a density of 3.1 ë 103 kg~m3 ) like pieces of cork on water,
Ref. 81 and the convection inside the mantle provides the driving mechanism for the motion.
* In this old continent, called Gondwanaland, there was a huge river that flowed westwards from the Chad
to Guayaquil in Ecuador. After the continent split up, this river still flowed to the west. When the Andes
appeared, the water was blocked, and many millions of years later, it flowed back. Today, the river still flows
eastwards and is called the Amazon River.
objects and images 103
Earth
Sun
Sun
be, if the Earth travels around the Sun? The distance between the Earth and the Sun has
been known since the seventeenth century, but it was only in 1837 that Friedrich Wilhelm
Bessel* became the first to observe the parallax of a star. This was a result of extremely
careful measurements and complex calculations: he discovered the Bessel functions in
order to realize it. He was able to find a star, 61 Cygni, whose apparent position changed
with the month of the year. Seen over the whole year, the star describes a small ellipse in
the sky, with an opening of 0.588 ′′ (this is the modern value). After carefully eliminating
all other possible explanations, he deduced that the change of position was due to the
motion of the Earth around the Sun, and from the size of the ellipse he determined the
Challenge 169 n distance to the star to be 105 Pm, or 11.1 light years. Copyright © Christoph Schiller November 1997–January 2007
Bessel had thus managed for the first time to measure the distance of a star. By doing
so he also proved that the Earth is not fixed with respect to the stars in the sky and that the
Earth indeed revolves around the Sun. The motion itself was not a surprise. It confirmed
the result of the mentioned aberration of light, discovered in 1728 by James Bradley** and
Page 284 to be discussed shortly; the Earth moves around the Sun.
* Friedrich Wilhelm Bessel (1784–1846), Westphalian astronomer who left a successful business career to
dedicate his life to the stars, and became the foremost astronomer of his time.
** James Bradley, (1693–1762), English astronomer. He was one of the first astronomers to understand the
value of precise measurement, and thoroughly modernized Greenwich. He discovered the aberration of light,
a discovery that showed that the Earth moves and also allowed him to measure the speed of light; he also
discovered the nutation of the Earth.
104 i galilean motion • 2. galilean physics – motion in everyday life
With the improvement of telescopes, other motions of the Earth were discovered. In
1748, James Bradley announced that there is a small regular change of the precession,
which he called nutation, with a period of 18.6 years and an angular amplitude of 19.2 ′′ .
Nutation occurs because the plane of the Moon’s orbit around the Earth is not exactly the
same as the plane of the Earth’s orbit around the Sun. Are you able to confirm that this
Challenge 170 ny situation produces nutation?
Astronomers also discovered that the 23.5° tilt – or obliquity – of the Earth’s axis, the
angle between its intrinsic and its orbital angular momentum, actually changes from 22.1°
to 24.5° with a period of 41 000 years. This motion is due to the attraction of the Sun and
the deviations of the Earth from a spherical shape. During the Second World War, in
1941, the Serbian astronomer Milutin Milankovitch (1879–1958) retreated into solitude
and studied the consequences. In his studies he realized that this 41 000 year period of
the tilt, together with an average period of 22 000 years due to precession,* gives rise to
the more than 20 ice ages in the last 2 million years. This happens through stronger or
rotation.
The next step is to ask whether the Sun itself moves. Indeed it does. Locally, it moves
with a speed of 19.4 km~s towards the constellation of Hercules. This was shown by Wil-
liam Herschel in 1783. But globally, the motion is even more interesting. The diameter of
the galaxy is at least 100 000 light years, and we are located 26 000 light years from the
centre. (This has been known since 1918; the centre of the galaxy is located in the direc-
tion of Sagittarius.) At our position, the galaxy is 1 300 light years thick; presently, we are
Ref. 83 68 light years ‘above’ the centre plane. The Sun, and with it the solar system, takes about
225 million years to turn once around the galactic centre, its orbital velocity being around
* In fact, the 25 800 year precession leads to three insolation periods, of 23 700, 22 400 and 19 000 years, due
to the interaction between precession and perihelion shift.
objects and images 105
our galaxy
220 km~s. It seems that the Sun will continue moving away from the galaxy plane until it
is about 250 light years above the plane, and then move back, as shown in Figure 52. The
oscillation period is estimated to be around 60 million years, and has been suggested as
the mechanism for the mass extinctions of animal life on Earth, possibly because some
gas cloud may be encountered on the way. The issue is still a hot topic of research.
We turn around the galaxy centre because the formation of galaxies, like that of solar
systems, always happens in a whirl. By the way, can you confirm from your own observa-
Challenge 171 n tion that our galaxy itself rotates?
Finally, we can ask whether the galaxy itself moves. Its motion can indeed be observed
because it is possible to give a value for the motion of the Sun through the universe, de-
fining it as the motion against the background radiation. This value has been measured
Ref. 84 to be 370 km~s. (The velocity of the Earth through the background radiation of course
depends on the season.) This value is a combination of the motion of the Sun around
the galaxy centre and of the motion of the galaxy itself. This latter motion is due to the
gravitational attraction of the other, nearby galaxies in our local group of galaxies.*
In summary, the Earth really moves, and it does so in rather complex ways. As Henri
Poincaré would say, if we are in a given spot today, say the Panthéon in Paris, and come
Copyright © Christoph Schiller November 1997–January 2007
back to the same spot tomorrow at the same time, we are in fact 31 million kilometres away.
This state of affairs would make time travel extremely difficult even if it were possible
(which it is not); whenever you went back to the past, you would have to get to the old
spot exactly!
Is rotation relative?
When we turn rapidly, our arms lift. Why does this happen? How can our body detect
whether we are rotating or not? There are two possible answers. The first approach, pro-
* This is roughly the end of the ladder. Note that the expansion of the universe, to be studied later, produces
no motion.
106 i galilean motion • 2. galilean physics – motion in everyday life
moted by Newton, is to say that there is an absolute space; whenever we rotate against this
space, the system reacts. The other answer is to note that whenever the arms lift, the stars
also rotate, and in exactly the same manner. In other words, our body detects rotation
because we move against the average mass distribution in space.
The most cited discussion of this question is due to Newton. Instead of arms, he ex-
plored the water in a rotating bucket. As usual for philosophical issues, Newton’s answer
was guided by the mysticism triggered by his father’s early death. Newton saw absolute
space as a religious concept and was not even able to conceive an alternative. Newton
thus sees rotation as an absolute concept. Most modern scientist have fewer problems
and more common sense than Newton; as a result, today’s consensus is that rotation ef-
fects are due to the mass distribution in the universe: rotation is relative. However, we
have to be honest; the question cannot be settled by Galilean physics. We will need gen-
eral relativity.
Challenge 173 n
A car at a certain speed uses 7 litres of gasoline per 100 km. What is the combined air and
rolling resistance? (Assume that the engine has an efficiency of 25%.)
”
**
When travelling in the train, you can test Galileo’s statement about everyday relativity of
motion. Close your eyes and ask somebody to turn you around many times: are you able
Challenge 174 e to say in which direction the train is running?
**
A train starts to travel at a constant speed of 10 m~s between two cities A and B, 36 km
apart. The train will take one hour for the journey. At the same time as the train, a fast
dove starts to fly from A to B, at 20 m~s. Being faster than the train, the dove arrives at
B first. The dove then flies back towards A; when it meets the train, it backturns again to
city B. It goes on flying back and forward until the train reaches B. What distance did the
dove cover?
Copyright © Christoph Schiller November 1997–January 2007
Challenge 175 e
**
A good bathroom scales, used to determine the weight of objects, does not show a con-
Challenge 176 n stant weight when you step on it and stay motionless. Why not?
**
A cork is attached to a thin string a metre long. The string is passed over a long rod
held horizontally, and a wine glass is attached at the other end. If you let go the cork in
Challenge 177 n Figure 53, nothing breaks. Why not? And what happens?
Challenge 172 n * Thomas Carlyle (1797–1881). Do you agree with the quotation?
objects and images 107
mountain
h
plain ocean
continental crust
**
In 1901, Duncan MacDougalls, a medical doctor, measured the weight of dying people, in
the hope to see whether death leads to a mass change. He found a sudden change of about
20 g at the moment of death, with large variations from person to person. He attributed
Challenge 178 n it to the soul. Is this explanation satisfactory? (If you know a better one, publish it!)
**
The Earth’s crust is less dense (2.7 kg~l) than the Earth’s mantle (3.1 kg~l) and floats on it.
As a result, the lighter crust below a mountain ridge must be much deeper than below a
plain. If a mountain rises 1 km above the plain, how much deeper must the crust be below
Challenge 179 n it? The simple block model shown in Figure 54 works fairly well; first, it explains why, near Copyright © Christoph Schiller November 1997–January 2007
mountains, measurements of the deviation of free fall from the vertical line lead to so
much lower values than those expected without a deep crust. Later, sound measurements
have confirmed directly that the continental crust is indeed thicker beneath mountains.
**
Balance a pencil vertically (tip upwards!) on a piece of paper near the edge of a table. How
Challenge 180 e can you pull out the paper without letting the pencil fall?
**
Take a pile of coins. One can push out the coins, starting with the one at the bottom,
by shooting another coin over the table surface. The method also helps to visualize two-
Challenge 181 e dimensional momentum conservation.
108 i galilean motion • 2. galilean physics – motion in everyday life
F I G U R E 55 A well-known toy
2L,2M L, M
**
In early 2004, two men and a woman earned £ 1.2 million in a single evening in a Lon-
don casino. They did so by applying the formulae of Galilean mechanics. They used the
method pioneered by various physicists in the 1950s who built various small computers
that could predict the outcome of a roulette ball from the initial velocity imparted by the
Ref. 85 croupier. In the case in Britain, the group added a laser scanner to a smart phone that
measured the path of a roulette ball and predicted the numbers where it would arrive.
In this way, they increased the odds from 1 in 37 to about 1 in 6. After six months of
investigations, Scotland Yard ruled that they could keep the money they won.
**
Is a return flight by plane – from a point A to B and back to A – faster if the wind blows
Challenge 182 e or not?
**
The toy of Figure 55 shows interesting behaviour: when a number of spheres are lifted
and dropped to hit the resting ones, the same number of spheres detach on the other side, Copyright © Christoph Schiller November 1997–January 2007
whereas the previously dropped spheres remain motionless. At first sight, all this seems
to follow from energy and momentum conservation. However, energy and momentum
conservation provide only two equations, which are insufficient to explain or determine
the behaviour of five spheres. Why then do the spheres behave in this way? And why do
Challenge 183 d they all swing in phase when a longer time has passed?
**
A surprising effect is used in home tools such as hammer drills. We remember that when a
small ball elastically hits a large one at rest, both balls move after the hit, and the small one
Ref. 87 obviously moves faster than the large one. Despite this result, when a short cylinder hits a
long one of the same diameter and material, but with a length that is some integer multiple
of that of the short one, something strange happens. After the hit, the small cylinder
objects and images 109
wall
ladder
F I G U R E 57 The centre of F I G U R E 58
mass defines stability How does the
ladder fall?
**
Housewives know how to extract a cork of a wine bottle using a cloth. Can you imagine
Challenge 186 n how? They also know how to extract the cork with the cloth if the cork has fallen inside
the bottle. How?
**
The sliding ladder problem, shown schematically in Figure 58, asks for the detailed mo-
tion of the ladder over time. The problem is more difficult than it looks, even if friction
Challenge 187 ny is not taken into account. Can you say whether the lower end always touches the floor?
** Copyright © Christoph Schiller November 1997–January 2007
A homogeneous ladder of length 5 m and mass 30 kg leans on a wall. The angle is 30°; the
static friction coefficient on the wall is negligible, and on the floor it is 0.3. A person of
mass 60 kg climbs the ladder. What is the maximum height the person can climb before
the ladder starts sliding? This and many puzzles about ladders can be found on http://
www.mathematische-basteleien.de/leiter.htm.
**
Ref. 88 A common fly on the stern of a 30 000 ton ship of 100 m length tilts it by less than the
diameter of an atom. Today, distances that small are easily measured. Can you think of at
Challenge 188 n least two methods, one of which should not cost more than 2000 euro?
**
110 i galilean motion • 2. galilean physics – motion in everyday life
The level of acceleration a human can survive depends on the duration over which one
is subjected to it. For a tenth of a second, 30 д = 300 m~s2 , as generated by an ejector
seat in an aeroplane, is acceptable. (It seems that the record acceleration a human has
survived is about 80 д = 800 m~s2 .) But as a rule of thumb it is said that accelerations of
15 д = 150 m~s2 or more are fatal.
**
The highest microscopic accelerations are observed in particle collisions, where one gets
values up to 1035 m~s2 . The highest macroscopic accelerations are probably found in the
collapsing interiors of supernovae, exploding stars which can be so bright as to be visible
in the sky even during the daytime. A candidate on Earth is the interior of collapsing
bubbles in liquids, a process called cavitation. Cavitation often produces light, an effect
Ref. 89 discovered by Frenzel and Schulte in 1934 and called sonoluminescence. (See Figure 59.)
It appears most prominently when air bubbles in water are expanded and contracted by
**
Would travelling through interplanetary space be healthy? People often fantasize about
long trips through the cosmos. Experiments have shown that on trips of long dura-
tion, cosmic radiation, bone weakening and muscle degeneration are the biggest dangers.
Many medical experts question the viability of space travel lasting longer than a couple of
years. Other dangers are rapid sunburn, at least near the Sun, and exposure to the vacuum.
Ref. 91 So far only one man has experienced vacuum without protection. He lost consciousness
after 14 seconds, but survived unharmed.
** Copyright © Christoph Schiller November 1997–January 2007
Challenge 191 n How does the kinetic energy of a rifle bullet compare to that of a running man?
**
Challenge 192 n In which direction does a flame lean if it burns inside a jar on a rotating turntable?
**
A ping-pong ball is attached by a string to a stone, and the whole is put under water in
a jar. The set-up is shown in Figure 60. Now the jar is accelerated horizontally. In which
Challenge 193 n direction does the ball move? What do you deduce for a jar at rest?
**
Challenge 194 n What happens to the size of an egg when one places it in a jar of vinegar for a few days?
objects and images 111
F I G U R E 59 Observation of
sonoluminescence (© Detlev
Lohse)
**
Rotation holds a surprise for anybody who studies it carefully. Angular momentum is
a quantity with a magnitude and a direction. However, it is not a vector, as any mirror
shows. The angular momentum of a body circling in a plane parallel to a mirror behaves Copyright © Christoph Schiller November 1997–January 2007
in a different way from a usual arrow: its mirror image is not reflected if it points towards
Challenge 197 e the mirror! You can easily check this for yourself. For this reason, angular momentum
is called a pseudovector. The fact has no important consequences in classical physics; but
we have to keep it in mind for later occasions.
**
What is the best way to transport a number of full coffee or tea cups while at the same
Challenge 198 n time avoiding spilling any precious liquid?
**
The Moon recedes from the Earth by 3.8 cm a year, due to friction. Can you find the
Challenge 199 n mechanism responsible?
112 i galilean motion • 2. galilean physics – motion in everyday life
ping-pong ball
string
stone
**
Ms =
log(E~1 J) − 4.8
. (28)
1.5
The strange numbers in the expression have been chosen to put the earthquake values as
near as possible to the older, qualitative Mercalli scale (now called EMS98) that classifies Copyright © Christoph Schiller November 1997–January 2007
the intensity of earthquakes. However, this is not fully possible; the most sensitive instru-
ments today detect earthquakes with magnitudes of −3. The highest value every meas-
ured was a Richter magnitude of 10, in Chile in 1960. Magnitudes above 12 are probably
Challenge 202 n impossible. (Can you show why?)
**
Figure 61 shows the so-called Celtic wiggle stone, a stone that starts rotating on a plane
Ref. 87 surface when it is put into oscillation. The size can vary between a few centimetres and a
few metres. By simply bending a spoon one can realize a primitive form of this strange
device, if the bend is not completely symmetrical. The rotation is always in the same
direction. If the stone is put into rotation in the wrong direction, after a while it stops and
Challenge 203 d starts rotating in the other sense! Can you explain the effect?
objects and images 113
F I G U R E 62 How
**
What is the motion of the point on the surface of the Earth that has Sun in its zenith (i.e.,
Challenge 204 ny vertically above it), when seen on a map of the Earth during one day, and day after day?
**
The moment of inertia of a body depends on the shape of the body; usually, angular mo-
mentum and the angular velocity do not point in the same direction. Can you confirm
Challenge 205 n this with an example?
**
Can it happen that a satellite dish for geostationary TV satellites focuses the sunshine
Challenge 206 n onto the receiver?
**
Why is it difficult to fire a rocket from an aeroplane in the direction opposite to the motion
Challenge 207 n of the plane? Copyright © Christoph Schiller November 1997–January 2007
**
You have two hollow spheres: they have the same weight, the same size and are painted in
the same colour. One is made of copper, the other of aluminium. Obviously, they fall with
Challenge 208 ny the same speed and acceleration. What happens if they both roll down a tilted plane?
**
An ape hangs on a rope. The rope hangs over a wheel and is attached to a mass of equal
weight hanging down on the other side, as shown in Figure 62. The rope and the wheel
Challenge 209 n are massless and frictionless. What happens when the ape climbs the rope?
**
114 i galilean motion • 2. galilean physics – motion in everyday life
**
Challenge 211 n How can you determine the speed of a rifle bullet with only a scale and a metre stick?
**
Why does a gun make a hole in a door but cannot push it open, in exact contrast to what
Challenge 212 e a finger can do?
**
Challenge 213 n Can a water skier move with a higher speed than the boat pulling him?
**
Take two cans of the same size and weight, one full of ravioli and one full of peas. Which
**
Challenge 215 n What is the moment of inertia of a homogeneous sphere?
**
The moment of inertia is determined by the values of its three principal axes. These are
all equal for a sphere and for a cube. Does it mean that it is impossible to distinguish a
Challenge 216 n sphere from a cube by their inertial behaviour?
**
You might know the ‘Dynabee’, a hand-held gyroscopic device that can be accelerated to
Challenge 217 d high speed by proper movements of the hand. How does it work?
**
It is possible to make a spinning top with a metal paper clip. It is even possible to make
Challenge 218 n one of those tops with turn on their head when spinning. Can you find out how?
**
Is it true that the Moon in the first quarter in the northern hemisphere looks like the Copyright © Christoph Schiller November 1997–January 2007
**
Challenge 220 n How would Figure 63 look if taken at the Equator?
objects and images 115
**
Running backwards is an interesting sport. The 2006 world records for running back-
wards can be found on http://www.recordholders.org/en/list/backwards_running.html.
You will be astonished how many of these records are faster than your best personal for-
Challenge 221 e ward-running time.
**
Challenge 222 n What is the curve described by the midpoint of a ladder sliding down a wall?
Copyright © Christoph Schiller November 1997–January 2007
**
Since the Earth is round, there are many ways to drive from one point on the Earth to
another along a circle segment. This has interesting consequences for volley balls and for
girl-watching. Take a volleyball and look at its air inlet. If you want to move the inlet to a
different position with a simple rotation, you can choose the rotation axis in may different
Challenge 223 e ways. Can you confirm this? In other words, when we look in a given direction and then
want to look in another, the eye can accomplish this change in different ways. The option
chosen by the human eye had already been studied by medical scientists in the eighteenth
century. It is called Listing’s ‘law’.* It states that all axes that nature chooses lie in one plane.
* If you are interested in learning in more detail how nature and the eye cope with the complexities of three di-
mensions, see the http://schorlab.berkeley.edu/vilis/whatisLL.htm and http://www.med.uwo.ca/physiology/
116 i galilean motion • 2. galilean physics – motion in everyday life
Challenge 224 n Can you imagine its position in space? Men have a real interest that this mechanism is
strictly followed; if not, looking at girls on the beach could cause the muscles moving the
eyes to get knotted up.
cameras and by measurements using aluminium models of the animal’s feet. The experi-
ments show that the feet slapping on the water provides only 25 % of the force necessary
Ref. 93 to run above water; the other 75 % is provided by a pocket of compressed air that the
basilisks create between their feet and the water once the feet are inside the water. In
fact, basilisks mainly walk on air. (Both effects used by basilisks are also found in fast
Ref. 94 canoeing.) It was calculated that humans are also able to walk on water, provided their
feet hit the water with a speed of 100 km~h using the simultaneous physical power of 15
sprinters. Quite a feat for all those who ever did so.
courses/LLConsequencesWeb/ListingsLaw/perceptual2.htm websites.
* In the Middle Ages, the term ‘basilisk’ referred to a mythical monster supposed to appear shortly before
the end of the world. Today, it is a small reptile in the Americas.
objects and images 117
There is a second method of walking and running on water; this second method even
allows its users to remain immobile on top of the water surface. This is what water striders,
insects of the family Gerridae with a overall length of up to 15 mm, are able to do (together
with several species of spiders). Like all insects, the water strider has six legs (spiders have
eight). The water strider uses the back and front legs to hover over the surface, helped by
thousands of tiny hairs attached to its body. The hairs, together with the surface tension
of water, prevent the strider from getting wet. If you put shampoo into the water, the Copyright © Christoph Schiller November 1997–January 2007
water strider sinks and can no longer move. The water strider uses its large middle legs as
oars to advance over the surface, reaching speeds of up to 1 m~s doing so. In short, water
striders actually row over water. The same mechanism is used by the small robots that
Ref. 95 can move over water and were developed by Metin Sitti and his group.
Legs pose many interesting problems. Engineers know that a staircase is comfortable
to walk only if for each step the depth l plus twice the height h is a constant: l + 2h =
Challenge 226 n 0.63 0.02 m. This is the so-called staircase formula. Why does it hold?
Challenge 227 n All animals have an even number of legs. Do you know an exception? Why not? In
fact, one can argue that no animal has less than four legs. Why is this the case?
On the other hand, all animals with two legs have the legs side by side, whereas systems
Challenge 228 e with wheels have them one behind the other. Why is this not the other way round?
Legs also provide simple distance rulers: just count your steps. In 2006, it was dis-
118 i galilean motion • 2. galilean physics – motion in everyday life
covered that this method is used by certain ant species, such as Cataglyphis fortis. They
Ref. 96 can count to at least 25 000, as shown by Matthias Wittlinger and his team. These ants use
the ability to find the shortest way back to their home even in structureless desert terrain.
But let us continue with the study of motion transmitted over distance, without the
use of any contact at all.
The first and main contact-free method to generate motion we discover in our environ- ”
ment is height. Waterfalls, snow, rain and falling apples all rely on it. It was one of the
fundamental discoveries of physics that height has this property because there is an in-
teraction between every body and the Earth. Gravitation produces an acceleration along
* ‘I fell like dead bodies fall.’ Dante Alighieri (1265, Firenze–1321, Ravenna), the powerful Italian poet.
** In several myths about the creation or the organization of the world, such as the biblical one or the Indian
one, the Earth is not an object, but an imprecisely defined entity, such as an island floating or surrounded by
Challenge 229 n water with unclear boundaries and unclear method of suspension. Are you able to convince a friend that the
Earth is round and not flat? Can you find another argument apart from the roundness of the Earth’s shadow
when it is visible on the Moon?
Ref. 97 A famous crook, Robert Peary, claimed to have reached the North Pole in 1909. (In fact, Roald Amundsen
Copyright © Christoph Schiller November 1997–January 2007
reached the both the South and the North Pole first.) Peary claimed to have taken a picture there, but that
Challenge 230 n picture, which went round the world, turned out to be the proof that he had not been there. Can you imagine
how?
By the way, if the Earth is round, the top of two buildings is further apart than their base. Can this effect
Challenge 231 n be measured?
*** Tycho Brahe (1546–1601), famous Danish astronomer, builder of Uraniaborg, the astronomical castle.
He consumed almost 10 % of the Danish gross national product for his research, which produced the first
star catalogue and the first precise position measurements of planets.
**** Johannes Kepler (1571 Weil der Stadt–1630 Regensburg); after helping his mother successfully defend
herself in a trial where she was accused of witchcraft, he studied Protestant theology and became a teacher
of mathematics, astronomy and rhetoric. His first book on astronomy made him famous, and he became
assistant to Tycho Brahe and then, at his teacher’s death, the Imperial Mathematician. He was the first to
use mathematics in the description of astronomical observations, and introduced the concept and field of
‘celestial physics’.
dynamics due to gravitation 119
ingly simple result by the English physicist Robert Hooke:* every body of mass M attracts
any other body towards its centre with an acceleration whose magnitude a is given by
a=G
M
(29)
r2
where r is the centre-to-centre distance of the two bodies. This is called the universal
‘law’ of gravitation, or universal gravity, because it is valid in general. The proportionality
constant G is called the gravitational constant; it is one of the fundamental constants of
nature, like the speed of light or the quantum of action. More about it will be said shortly.
The effect of gravity thus decreases with increasing distance; gravity depends on the in-
verse distance squared of the bodies under consideration. If bodies are small compared
with the distance r, or if they are spherical, expression (29) is correct as it stands; for
non-spherical shapes the acceleration has to be calculated separately for each part of the
bodies and then added together.
* Robert Hooke, (1635–1703), important English physicist and secretary of the Royal Society. Apart from
discovering the inverse square relation and many others, such as Hooke’s ‘law’, he also wrote the Micro-
graphia, a beautifully illustrated exploration of the world of the very small.
** The first precise – but not the first – measurement was achieved in 1752 by the French astronomers
Lalande and La Caille, who simultaneously measured the position of the Moon seen from Berlin and from
Le Cap.
*** This is deduced easily by noting that for an object in circular motion, the magnitude v of the velocity
Challenge 232 n v = dx~dt is given as v = 2πr~T. The drawing of the vector v over time, the so-called hodograph, shows that
it behaves exactly like the position of the object. Therefore the magnitude a of the acceleration a = dv~dt is
given by the corresponding expression, namely a = 2πv~T.
**** This is the hardest quantity to measure oneself. The most surprising way to determine the Earth’s size
Ref. 99 is the following: watch a sunset in the garden of a house, with a stopwatch in hand. When the last ray of the
Sun disappears, start the stopwatch and run upstairs. There, the Sun is still visible; stop the stopwatch when
120 i galilean motion • 2. galilean physics – motion in everyday life
Moon
Earth
figure to be inserted
F I G U R E 67 A physicist’s and an artist’s view of the fall of the Moon: a diagram by Christiaan Huygens
(not to scale) and a marble statue by Auguste Rodin
the Sun disappears again and note the time t. Measure the height distance h of the two eye positions where
Challenge 233 ny the Sun was observed. The Earth’s radius R is then given by R = k h~t 2 , with k = 378 ë 106 s2 .
Ref. 100 There is also a simple way to measure the distance to the Moon, once the size of the Earth is known. Take
Copyright © Christoph Schiller November 1997–January 2007
a photograph of the Moon when it is high in the sky, and call θ its zenith angle, i.e. its angle from the vertical.
Make another photograph of the Moon a few hours later, when it is just above the horizon. On this picture,
Page 62 unlike a common optical illusion, the Moon is smaller, because it is further away. With a sketch the reason
for this becomes immediately clear. If q is the ratio of the two angular diameters, the Earth–Moon distance
Challenge 234 ny 2
d m is given by the relation d m = R 2 + [2Rq cos θ~(1 − q 2 )]2 . Enjoy finding its derivation from the sketch.
Another possibility is to determine the size of the Moon by comparing it with the size of the shadow of
the Earth during an eclipse. The distance to the Moon is then computed from its angular size, about 0.5°.
* Jean Buridan (c. 1295 to c. 1366) was also one of the first modern thinkers to speculate on a rotation of
the Earth about an axis.
** Another way to put it is to use the answer of the Dutch physicist Christiaan Huygens (1629–1695): the
Moon does not fall from the sky because of the centrifugal acceleration. As explained on page 111, this
explanation is nowadays out of favour at most universities.
Ref. 102 There is a beautiful problem connected to the left part of the figure: Which points on the surface of the
Challenge 236 d Earth can be hit by shooting from a mountain? And which points can be hit by shooting horizontally?
dynamics due to gravitation 121
Universal gravity also explains why the Earth and most planets are (almost) spherical.
Since gravity increases with decreasing distance, a liquid body in space will always try to
form a spherical shape. Seen on a large scale, the Earth is indeed liquid. We also know
that the Earth is cooling down – that is how the crust and the continents formed. The
sphericity of smaller solid objects encountered in space, such as the Moon, thus means
that they used to be liquid in older times.
Properties of gravitation
Gravitation implies that the path of a stone is not a parabola, as stated earlier, but actually
an ellipse around the centre of the Earth. This happens for exactly the same reason that
Challenge 237 ny the planets move in ellipses around the Sun. Are you able to confirm this statement?
Universal gravitation allows us to solve a mystery. The puzzling acceleration value
д = 9.8 m~s2 we encountered in equation (4) is thus due to the relation
The equation can be deduced from equation (29) by taking the Earth to be spherical. The
everyday acceleration of gravity д thus results from the size of the Earth, its mass, and the
universal constant of gravitation G. Obviously, the value for д is almost constant on the
surface of the Earth because the Earth is almost a sphere. Expression (30) also explains
why д gets smaller as one rises above the Earth, and the deviations of the shape of the
Earth from sphericity explain why д is different at the poles and higher on a plateau.
Challenge 238 n (What would it be on the Moon? On Mars? On Jupiter?)
By the way, it is possible to devise a simple machine, other than a yo-yo, that slows
down the effective acceleration of gravity by a known amount, so that one can measure
Challenge 239 n its value more easily. Can you imagine it?
Note that 9.8 is roughly π 2 . This is not a coincidence: the metre has been chosen in
such a way to make this correct. The period T of a swinging pendulum, i.e. a back and
Challenge 240 n forward swing, is given by* ¿
Ál
T = 2πÁ À , (31)
д
where l is the length of the pendulum, and д is the gravitational acceleration. (The pen-
Copyright © Christoph Schiller November 1997–January 2007
* Formula (31) is noteworthy mainly for all that is missing. The period of a pendulum does not depend on
the mass of the swinging body. In addition, the period of a pendulum does not depend on the amplitude.
(This is true as long as the oscillation angle is smaller than about 15°.) Galileo discovered this as a student,
when observing a chandelier hanging on a long rope in the dome of Pisa. Using his heartbeat as a clock he
found that even though the amplitude of the swing got smaller and smaller, the time for the swing stayed
the same.
A leg also moves like a pendulum, when one walks normally. Why then do taller people tend to walk
Challenge 241 n faster?
122 i galilean motion • 2. galilean physics – motion in everyday life
ition of the metre; it was made in 1673 by Huygens and repeated in 1790 by Talleyrand,
but was rejected by the conference that defined the metre because variations in the value
of д with geographical position and temperature-induced variations of the length of a
pendulum induce errors that are too large to yield a definition of useful precision.
Finally, the proposal was made to define the metre as 1~40 000 000 of the circum-
ference of the Earth through the poles, a so-called meridian. This proposal was almost
identical to – but much more precise than – the pendulum proposal. The meridian defin-
ition of the metre was then adopted by the French national assembly on 26 March 1791,
with the statement that ‘a meridian passes under the feet of every human being, and all
meridians are equal’. (Nevertheless, the distance from Equator to the poles is not exactly
Ref. 103 10 Mm; that is a strange story. One of the two geographers who determined the size of
the first metre stick was dishonest. The data he gave for his measurements – the general
method of which is shown in Figure 68 – was fabricated. Thus the first official metre stick
in Paris was shorter than it should be.)
F I G U R E 68 The measurements
G = 6.7 ë 10−11 Nm2 ~kg2 = 6.7 ë 10−11 m3 ~kg s2 . (32) that lead to the definition of the
metre (© Ken Alder)
* Henry Cavendish (1731–1810) was one of the great geniuses of physics; rich and solitary, he found many
rules of nature, but never published them. Had he done so, his name would be much more well known.
John Michell (1724–1793) was church minister, geologist and amateur astronomer.
dynamics due to gravitation 123
than that. This simple calculation thus proves that gravitation cannot be the true cause
of people falling in love, and that sexual attraction is not of gravitational origin, but of
a different source. The basis for this other interaction, love, will be studied later in our
walk: it is called electromagnetism.
But gravity has more interesting properties to of-
fer. The effects of gravitation can also be described f (x,y)
by another observable, namely the (gravitational)
potential φ. We then have the simple relation that
the acceleration is given by the gradient of the po-
tential
y
a = −∇φ a = −grad φ .
x
or (33)
grad f
The gradient is just a learned term for ‘slope along F I G U R E 69 The potential and the
φ = −G
M
. (34)
r
A potential considerably simplifies the description of motion, since a potential is additive:
given the potential of a point particle, one can calculate the potential and then the motion
around any other irregularly shaped object.**
The potential φ is an interesting quantity; with a single number at every position in
space we can describe the vector aspects of gravitational acceleration. It automatically
* In two or more dimensions slopes are written ∂φ~∂z – where ∂ is still pronounced ‘d’ – because in those
cases the expression dφ~dz has a slightly different meaning. The details lie outside the scope of this walk.
** Alternatively, for a general, extended body, the potential is found by requiring that the divergence of its
gradient is given by the mass (or charge) density times some proportionality constant. More precisely, one Copyright © Christoph Schiller November 1997–January 2007
has
∆φ = 4πGρ (35)
where ρ = ρ(x, t) is the mass volume density of the body and the operator ∆, pronounced ‘delta’, is defined
as ∆ f = ∇∇ f = ∂ 2 f ~∂x 2 + ∂ 2 f ~∂y 2 + ∂ 2 f ~∂z 2 . Equation (35) is called the Poisson equation for the potential
φ. It is named after Siméon-Denis Poisson (1781–1840), eminent French mathematician and physicist. The
positions at which ρ is not zero are called the sources of the potential. The so-called source term ∆φ of
a function is a measure for how much the function φ(x) at a point x differs from the average value in a
Challenge 245 ny region around that point. (Can you show this, by showing that ∆φ φ̄ − φ(x)?) In other words, the Poisson
equation (35) implies that the actual value of the potential at a point is the same as the average value around
that point minus the mass density multiplied by 4πG. In particular, in the case of empty space the potential
at a point is equal to the average of the potential around that point.
Often the concept of gravitational field is introduced, defined as g = −∇φ. We avoid this in our walk,
because we will discover that, following the theory of relativity, gravity is not due to a field at all; in fact even
the concept of gravitational potential turns out to be only an approximation.
124 i galilean motion • 2. galilean physics – motion in everyday life
gives that gravity in New Zealand acts in the opposite direction to gravity in Paris. In
addition, the potential suggests the introduction of the so-called potential energy U by
setting
U = mφ (36)
and thus allowing us to determine the change of kinetic energy T of a body falling from
a point 1 to a point 2 via
T1 − T2 = U 2 − U 1 m 1 v1 2 − m 2 v2 2 = mφ 2 − mφ 1 .
1 1
or (37)
2 2
In other words, the total energy, defined as the sum of kinetic and potential energy, is
conserved in motion due to gravity. This is a characteristic property of gravitation. Not
all accelerations can be derived from a potential; systems with this property are called
conservative. The accelerations due to friction are not conservative, but those due to elec-
its height is only 6267 m above sea level, is about 20 km farther away from the centre of
the Earth than the top of Mount Sagarmatha* in Nepal, whose height above sea level is
8850 m. The top of Mount Chimborazo is in fact the point on the surface most distant
from the centre of the Earth.
As a consequence, if the Earth stopped rotating (but kept its shape), the water of the
oceans would flow north; all of Europe would be under water, except for the few moun-
tains of the Alps that are higher than about 4 km. The northern parts of Europe would
be covered by between 6 km and 10 km of water. Mount Sagarmatha would be over 11 km
above sea level. If one takes into account the resulting change of shape of the Earth, the
numbers come out smaller. In addition, the change in shape would produce extremely
strong earthquakes and storms. As long as there are none of these effects, we can be sure
Page 89 that the Sun will indeed rise tomorrow, despite what some philosophers might pretend.
painstaking research about the movements of the planets in the zodiac. The three main
ones are as follows:
1. Planets move on ellipses with the Sun located at one focus (1609).
2. Planets sweep out equal areas in equal times (1609).
3. All planets have the same ratio T 2 ~d 3 between the orbit duration T and the semima-
jor axis d (1619).
The main results are given in Figure 72. The sheer
work required to deduce the three ‘laws’ was enorm-
ous. Kepler had no calculating machine available, not
even a slide rule. The calculation technology he used d
was the recently discovered logarithms. Anyone who Sun
d
has used tables of logarithms to perform calculations Copyright © Christoph Schiller November 1997–January 2007
can get a feeling for the amount of work behind these planet
three discoveries.
The second ‘law’ about equal swept areas implies
that planets move faster when they are near the Sun. F I G U R E 72 The motion of a planet
around the Sun, showing its
It is a way to state the conservation of angular mo- semimajor axis d, which is also the
mentum. But now comes the central point. The huge spatial average of its distance from
volume of work by Brahe and Kepler can be summar- the Sun
ized in the expression a = GM~r 2 . Can you confirm
Challenge 249 ny that all three ‘laws’ follow from Hooke’s expression of universal gravity? Publishing this
result was the main achievement of Newton. Try to repeat his achievement; it will show
you not only the difficulties, but also the possibilities of physics, and the joy that puzzles
give.
dynamics due to gravitation 127
Newton solved the puzzle with geometric drawing. Newton was not able to write down,
Ref. 25 let alone handle, differential equations at the time he published his results on gravitation.
In fact, it is well known that Newton’s notation and calculation methods were poor. (Much
poorer than yours!) The English mathematician Godfrey Hardy* used to say that the
insistence on using Newton’s integral and differential notation, rather than the earlier
and better method, still common today, due to his rival Leibniz – threw back English
mathematics by 100 years.
Kepler, Hooke and Newton became famous because they brought order to the descrip-
tion of planetary motion. This achievement, though of small practical significance, was
widely publicized because of the age-old prejudices linked with astrology.
However, there is more to gravitation. Universal gravity explains the motion and shape
of the Milky Way and of the other galaxies, the motion of many weather phenomena and
Challenge 250 n explains why the Earth has an atmosphere but the Moon does not. (Can you explain this?)
In fact, universal gravity explains much more about the Moon.
How long is a day on the Moon? The answer is roughly 14 Earth-days. That is the time
that it takes for the Moon to see the Sun again in the same position.
One often hears that the Moon always shows the same side to the Earth. But this is
wrong. As one can check with the naked eye, a given feature in the centre of the face of
the Moon at full Moon is not at the centre one week later. The various motions leading
to this change are called librations; they are shown in the film in Figure 73.** The mo-
tions appear mainly because the Moon does not describe a circular, but an elliptical orbit
* Godfrey Harold Hardy (1877–1947) was an important English number theorist, and the author of the
well-known A Mathematician’s Apology. He also ‘discovered’ the famous Indian mathematician Srinivasa
Ramanujan, bringing him to Britain.
** The film is in DivX 5 AVI format and requires a software plug-in in Acrobat Reader that can play it.
128 i galilean motion • 2. galilean physics – motion in everyday life
F I G U R E 74 Maps (not photographs) of the near side (left) and far side (right) of
the moon, showing how often the latter saved the Earth from meteorite impacts
(courtesy USGS)
shape due to this effect. (Remember that the shape of the Earth depends on its speed of
rotation.) These changes in shape influence the tectonic activity of the Earth, and maybe
also the drift of the continents.
The Moon has many effects on animal life. A famous example is the midge Clunio,
Ref. 111 which lives on coasts with pronounced tides. Clunio spends between six and twelve weeks
as a larva, then hatches and lives for only one or two hours as an adult flying insect, during
which time it reproduces. The midges will only reproduce if they hatch during the low
tide phase of a spring tide. Spring tides are the especially strong tides during the full and
new moons, when the solar and lunar effects combine, and occur only every 14.8 days. In
1995, Dietrich Neumann showed that the larvae have two built-in clocks, a circadian and
a circalunar one, which together control the hatching to precisely those few hours when
the insect can reproduce. He also showed that the circalunar clock is synchronized by the
brightness of the Moon at night. In other words, the larvae monitor the Moon at night
and then decide when to hatch: they are the smallest known astronomers.
If insects can have circalunar cycles, it should come as no surprise that women also
Ref. 112 have such a cycle; however, in this case the origin of the cycle length is still unknown.
The Moon also helps to stabilize the tilt of the Earth’s axis, keeping it more or less
fixed relative to the plane of motion around the Sun. Without the Moon, the axis would
change its direction irregularly, we would not have a regular day and night rhythm, we
would have extremely large climate changes, and the evolution of life would have been
Ref. 113 impossible. Without the Moon, the Earth would also rotate much faster and we would
Ref. 114 have much less clement weather. The Moon’s main remaining effect on the Earth, the
Orbits
The path of a body orbiting another under the influence of gravity is an ellipse with the
central body at one focus. A circular orbit is also possible, a circle being a special case of
an ellipse. Single encounters of two objects can also be parabolas or hyperbolas, as shown
in Figure 75. Circles, ellipses, parabolas and hyperbolas are collectively known as conic
sections. Indeed each of these curves can be produced by cutting a cone with a knife. Are
Challenge 253 e you able to confirm this?
If orbits are mostly ellipses, it follows that comets
return. The English astronomer Edmund Halley
hyperbola
(1656–1742) was the first to draw this conclusion and
to predict the return of a comet. It arrived at the pre-
dicted date in 1756, and is now named after him. The
period of Halley’s comet is between 74 and 80 years; parabola
the first recorded sighting was 22 centuries ago, and
it has been seen at every one of its 30 passages since,
the last time in 1986. mass
Depending on the initial energy and the initial an- ellipse
circle
gular momentum of the body with respect to the cent-
Copyright © Christoph Schiller November 1997–January 2007
Why does the inverse square ‘law’ lead to conic sections? First, for two bodies, the total
angular momentum L is a constant:
L = mr 2 φ̇ (38)
and therefore the motion lies in a plane. Also the energy E is a constant
E = m( )2 + m(r )2 − G
1 dr 1 dφ mM
. (39)
2 dt 2 dt r
Challenge 255 ny Together, the two equations imply that
L2
r=
1
2
¼ . (40)
Gm M 1 + 1 + 2E L 2 cos φ
r= r=
C C
or (41)
1 + e cos φ 1 − e cos φ
is an ellipse for 0 < e < 1, a parabola for e = 1 and a hyperbola for e A 1, one focus being
at the origin. The quantity e, called the eccentricity, describes how squeezed the curve is.
In other words, a body in orbit around a central mass follows a conic section.
In all orbits, also the heavy mass moves. In fact, both bodies orbit around the common
centre of mass. Both bodies follow the same type of curve (ellipsis, parabola or hyperbola),
Challenge 256 e but the dimensions of the two curves differ.
If more than two objects move under mutual gravitation, many additional possibilities
for motions appear. The classification and the motions are quite complex. In fact, this so-
called many-body problem is still a topic of research, and the results are mathematically
fascinating. Let us look at a few examples.
When several planets circle a star, they also attract each other. Planets thus do not
move in perfect ellipses. The largest deviation is a perihelion shift, as shown in Figure 51.
It is observed for Mercury and a few other planets, including the Earth. Other deviations
from elliptical paths appear during a single orbit. In 1846, the observed deviations of the
Copyright © Christoph Schiller November 1997–January 2007
motion of the planet Uranus from the path predicted by universal gravity were used to pre-
dict the existence of another planet, Neptune, which was discovered shortly afterwards.
Page 81 We have seen that mass is always positive and that gravitation is thus always attractive;
there is no antigravity. Can gravity be used for levitation nevertheless, maybe using more
than two bodies? Yes; there are two examples.* The first are the geostationary satellites,
which are used for easy transmission of television and other signals from and towards
Earth.
The Lagrangian libration points are the second example. Named after their discoverer,
these are points in space near a two-body system, such as Moon–Earth or Earth–Sun,
in which small objects have a stable equilibrium position. A general overview is given in
Challenge 257 ny Figure 76. Can you find their precise position, remembering to take rotation into account?
There are three additional Lagrangian points on the Earth–Moon line. How many of them
Challenge 258 ny are stable?
There are thousands of asteroids, called Trojan aster-
oids, at and around the Lagrangian points of the Sun–
Jupiter system. In 1990, a Trojan asteroid for the Mars–
Sun system was discovered. Finally, in 1997, an ‘almost planet (or Sun)
Trojan’ asteroid was found that follows the Earth on its
way around the Sun (it is only transitionary and follows L5
π/3
a somewhat more complex orbit). This ‘second compan-
Ref. 115 ion’ of the Earth has a diameter of 5 km. Similarly, on
π/3
the main Lagrangian points of the Earth–Moon system π/3 π/3
a high concentration of dust has been observed.
To sum up, the single equation a = −GMr~r 3 cor-
L4
moon (or planet)
Tides
Ref. 117 Why do physics texts always talk about tides? Because, as general relativity will show, tides
prove that space is curved! It is thus useful to study them in a bit more detail. Gravitation
explains the sea tides as results of the attraction of the ocean water by the Moon and the
Sun. Tides are interesting; even though the amplitude of the tides is only about 0.5 m
on the open sea, it can be up to 20 m at special places near the coast. Can you imagine
* Pierre Simon Laplace (b. 1749 Beaumont-en-Auge, d. 1827 Paris), important French mathematician. His
treatise appeared in five volumes between 1798 and 1825. He was the first to propose that the solar system
was formed from a rotating gas cloud, and one of the first people to imagine and explore black holes.
132 i galilean motion • 2. galilean physics – motion in everyday life
t = t1 :
deformed
after
t=0:
spherical
Challenge 259 n why? The soil is also lifted and lowered by the Sun and the Moon, by about 0.3 m, as satel- Copyright © Christoph Schiller November 1997–January 2007
Ref. 42 lite measurements show. Even the atmosphere is subject to tides, and the corresponding
Ref. 118 pressure variations can be filtered out from the weather pressure measurements.
Tides appear for any extended body moving in the gravitational field of another. To
understand the origin of tides, picture a body in orbit, like the Earth, and imagine its
components, such as the segments of Figure 78, as being held together by springs. Uni-
versal gravity implies that orbits are slower the more distant they are from a central body.
As a result, the segment on the outside of the orbit would like to be slower than the central
one; but it is pulled by the rest of the body through the springs. In contrast, the inside seg-
ment would like to orbit more rapidly but is retained by the others. Being slowed down,
the inside segments want to fall towards the Sun. In sum, both segments feel a pull away
from the centre of the body, limited by the springs that stop the deformation. Therefore,
extended bodies are deformed in the direction of the field inhomogeneity.
dynamics due to gravitation 133
For example, as a result of tidal forces, the Moon always has (roughly) the same face
to the Earth. In addition, its radius in direction of the Earth is larger by about 5 m than
the radius perpendicular to it. If the inner springs are too weak, the body is torn into
pieces; in this way a ring of fragments can form, such as the asteroid ring between Mars
and Jupiter or the rings around Saturn.
Let us return to the Earth. If a body is surrounded by water, it will form bulges in the
direction of the applied gravitational field. In order to measure and compare the strength
of the tides from the Sun and the Moon, we reduce tidal effects to their bare minimum.
As shown in Figure 79, we can study the deformation of a body due to gravity by study-
ing the deformation of four pieces. We can study it in free fall, because orbital motion
and free fall are equivalent. Now, gravity makes some of the pieces approach and others
diverge, depending on their relative positions. The figure makes clear that the strength of
the deformation – water has no built-in springs – depends on the change of gravitational
acceleration with distance; in other words, the relative acceleration that leads to the tides
= = 0.033 mm~s2
GM Moon
a Moon 2
(42)
d Moon
and thus the attraction from the Moon is about 178 times weaker than that from the Sun.
When two nearby bodies fall near a large mass, the relative acceleration is proportional
to their distance, and follows da = da~dr dr. The proportionality factor da~dr = ∇a,
called the tidal acceleration (gradient), is the true measure of tidal effects. Near a large
Challenge 260 e spherical mass M, it is given by
=− 3
da 2GM
(43)
dr r
which yields the values
dr d Sun
In other words, despite the much weaker pull of the Moon, its tides are predicted to be
over twice as strong as the tides from the Sun; this is indeed observed. When Sun, Moon
and Earth are aligned, the two tides add up; these so-called spring tides are especially
strong and happen every 14.8 days, at full and new moon.
Tides lead to a pretty puzzle. The fact that the moon tides are much stronger can be
Challenge 261 n used to deduce that the Moon is much denser than the Sun. Why?
Tides also produce friction. The friction leads to a slowing of the Earth’s rotation.
134 i galilean motion • 2. galilean physics – motion in everyday life
Nowadays, the slowdown can be measured by precise clocks (even though short time
Ref. 76 variations due to other effects, such as the weather, are often larger). The results fit well
Page 458 with fossil results showing that 400 million years ago, in the Devonian period, a year had
400 days, and a day about 22 hours. It is also estimated that 900 million years ago, each
of the 481 days of a year were 18.2 hours long. The friction at the basis of this slowdown
also results in an increase in the distance of the Moon from the Earth by about 3.8 cm
Challenge 262 n per year. Are you able to explain why?
As mentioned above, the tidal motion of the soil is also responsible for the triggering
of earthquakes. Thus the Moon can have also dangerous effects on Earth. (Unfortunately,
knowing the mechanism does not allow the prediction of earthquakes.) The most fascin-
ating example of tidal effects is seen on Jupiter’s satellite Io. Its tides are so strong that they
induce intense volcanic activity, as shown in Figure 80, with eruption plumes as high as
500 km. If tides are even stronger, they can destroy the body altogether, as happened to Copyright © Christoph Schiller November 1997–January 2007
the body between Mars and Jupiter that formed the planetoids, or (possibly) to the moons
that led to Saturn’s rings.
In summary, tides are due to relative accelerations of nearby mass points. This has an
Page 385 important consequence. In the chapter on general relativity we will find that time multi-
plied by the speed of light plays the same role as length. Time then becomes an additional
dimension, as shown in Figure 81. Using this similarity, two free particles moving in the
same direction correspond to parallel lines in space-time. Two particles falling side-by-
side also correspond to parallel lines. Tides show that such particles approach each other.
Page 427 In other words, tides imply that parallel lines approach each other. But parallel lines can
approach each other only if space-time is curved. In short, tides imply curved space-time
and space. This simple reasoning could have been performed in the eighteenth century;
however, it took another 200 years and Albert Einstein’s genius to uncover it.
dynamics due to gravitation 135
x
t1
α
b
t2 M
t
“ Aufklärung.
Immanuel Kant*
Towards the end of the seventeenth century people discovered that light has a finite velo-
”
Page 283 city – a story which we will tell in detail later. An entity that moves with infinite velocity
cannot be affected by gravity, as there is no time to produce an effect. An entity with a
finite speed, however, should feel gravity and thus fall.
Does its speed increase when light reaches the surface of the Earth? For almost three
centuries people had no means of detecting any such effect; so the question was not in-
vestigated. Then, in 1801, the Prussian astronomer Johann Soldner (1776–1833) was the
Ref. 119 first to put the question in a different way. Being an astronomer, he was used to measur-
ing stars and their observation angles. He realized that light passing near a massive body
would be deflected due to gravity.
Soldner studied a body on a hyperbolic path, moving with velocity c past a spherical
mass M at distance b (measured from the centre), as shown in Figure 82. Soldner deduced
Challenge 263 ny the deflection angle
α univ. grav. =
2 GM
. (45)
b c2 Copyright © Christoph Schiller November 1997–January 2007
One sees that the angle is largest when the motion is just grazing the mass M. For light
deflected by the mass of the Sun, the angle turns out to be at most a tiny 0.88 ′′ = 4.3 µrad.
In Soldner’s time, this angle was too small to be measured. Thus the issue was forgotten.
Had it been pursued, general relativity would have begun as an experimental science, and
Page 417 not as the theoretical effort of Albert Einstein! Why? The value just calculated is different
from the measured value. The first measurement took place in 1919;** it found the correct
dependence on the distance, but found a deflection of up to 1.75 ′′ , exactly double that of
expression (45). The reason is not easy to find; in fact, it is due to the curvature of space,
as we will see. In summary, light can fall, but the issue hides some surprises.
means in particular that inertial mass, based on electromagnetic interaction, and gravit-
ational mass are identical by definition.
We also note that we have never defined a separate concept of ‘passive gravitational
mass’. The mass being accelerated by gravitation is the inertial mass. Worse, there is no
Challenge 266 ny way to define a ‘passive gravitational mass’. Try it! All methods, such as weighing an ob-
ject, cannot be distinguished from those that determine inertial mass from its reaction to
acceleration. Indeed, all methods of measuring mass use non-gravitational mechanisms.
Scales are a good example.
If the ‘passive gravitational mass’ were different from the inertial mass, we would have
strange consequences. For those bodies for which it were different we would get into
Challenge 265 ny * What are the values shown by a balance for a person of 85 kg juggling three balls of 0.3 kg each?
dynamics due to gravitation 137
Gravity on the Moon is only one sixth of that on the Earth. Why does this imply that
”
it is difficult to walk quickly and to run on the Moon (as can be seen in the TV images
recorded there)?
Copyright © Christoph Schiller November 1997–January 2007
**
Imagine that you have twelve coins of identical appearance, of which one is a forgery. The
forged one has a different mass from the eleven genuine ones. How can you decide which
is the forged one and whether it is lighter or heavier, using a simple balance only three
Challenge 267 e times?
**
For a physicist, antigravity is repulsive gravity; it does not exist in nature. Nevertheless,
* ‘Falling is neither dangerous nor a shame; to keep lying is both.’ Konrad Adenauer (b. 1876 Köln, d.
1967 Rhöndorf), German chancellor.
138 i galilean motion • 2. galilean physics – motion in everyday life
1000 km
the term ‘antigravity’ is used incorrectly by many people, as a short search on the inter-
net shows. Some people call any effect that overcomes gravity, ‘antigravity’. However, this
definition implies that tables and chairs are antigravity devices. Following the definition,
most of the wood, steel and concrete producers are in the antigravity business. The inter-
net definition makes absolutely no sense.
**
Do all objects on Earth fall with the same acceleration of 9.8 m~s2 , assuming that air
resistance can be neglected? No; every housekeeper knows that. You can check this by
yourself. A broom angled at around 35° hits the floor before a stone, as the sounds of
Challenge 268 n impact confirm. Are you able to explain why?
**
Also Bungee jumpers are accelerated more strongly than д. For a rubber of mass m and
Copyright © Christoph Schiller November 1997–January 2007
a = д 1 + (4 + ) .
1m m
(46)
8M M
Challenge 269 n Can you deduce the relation from Figure 84?
**
Challenge 270 n Guess: What is the weight of a ball of cork with a radius of 1 m?
**
Challenge 271 n Guess: A heap of 1000 1 mm diameter iron balls is collected. What is its mass?
dynamics due to gravitation 139
**
How can you use your observations made during your travels to show that the Earth is
Challenge 272 n not flat?
**
Is the acceleration due to gravity constant? Not really. Every day, it is estimated that 108 kg
of material fall onto the Earth in the form of meteorites.
**
Both the Earth and the Moon attract bodies. The centre of mass of the Earth–Moon sys-
tem is 4800 km away from the centre of the Earth, quite near its surface. Why do bodies
Challenge 273 n on Earth still fall towards the centre of the Earth?
**
**
The ratio of the strengths of the tides of Moon and Sun is roughly 7 3. Is it true that this
Challenge 276 ny is also the ratio between the mass densities of the two bodies?
**
The friction between the Earth and the Moon slows down the rotation of both. The Moon
stopped rotating millions of years ago, and the Earth is on its way to doing so as well.
When the Earth stops rotating, the Moon will stop moving away from Earth. How far will
Challenge 277 ny the Moon be from the Earth at that time? Afterwards however, even further in the future,
the Moon will move back towards the Earth, due to the friction between the Earth–Moon
system and the Sun. Even though this effect would only take place if the Sun burned for Copyright © Christoph Schiller November 1997–January 2007
Challenge 278 n ever, which is known to be false, can you explain it?
**
When you run towards the east, you lose weight. There are two different reasons for this:
the ‘centrifugal’ acceleration increases so that the force with which you are pulled down
diminishes, and the Coriolis force appears, with a similar result. Can you estimate the
Challenge 279 ny size of the two effects?
**
What is the time ratio between a stone falling through a distance l and a pendulum
Challenge 280 n swinging though half a circle of radius l? (This problem is due to Galileo.) How many
digits of the number π can one expect to determine in this way?
140 i galilean motion • 2. galilean physics – motion in everyday life
**
Why can a spacecraft accelerate through the slingshot effect when going round a planet,
Challenge 281 n despite momentum conservation? It is speculated that the same effect is also the reason
for the few exceptionally fast stars that are observed in the galaxy. For example, the star
Ref. 104 HE0457-5439 moves with 720 km~s, which is much higher than the 100 to 200 km~s of
most stars in the Milky way. It seems that the role of the accelerating centre was taken by
a black hole.
**
Ref. 105 The orbit of a planet around the Sun has many interesting properties. What is the hodo-
Challenge 282 n graph of the orbit? What is the hodograph for parabolic and hyperbolic orbits?
**
The Galilean satellites of Jupiter, shown in Figure 71 on page 126, can be seen with
**
A simple, but difficult question: if all bodies attract each other, why don’t or didn’t all
Challenge 283 n stars fall towards each other? Indeed, the inverse square expression of universal gravity
has a limitation: it does not allow one to make sensible statements about the matter in
the universe. Universal gravity predicts that a homogeneous mass distribution is unstable;
indeed, an inhomogeneous distribution is observed. However, universal gravity does not
predict the average mass density, the darkness at night, the observed speeds of the distant
galaxies, etc. In fact, ‘universal’ gravity does not explain or predict a single property of
Page 445 the universe. To do this, we need general relativity.
Copyright © Christoph Schiller November 1997–January 2007
**
The acceleration д due to gravity at a depth of 3000 km is 10.05 m~s2 , over 2 % more than
Ref. 121 at the surface of the Earth. How is this possible? Also, on the Tibetan plateau, д is higher
than the sea level value of 9.81 m~s2 , even though the plateau is more distant from the
Challenge 284 n centre of the Earth than sea level is. How is this possible?
**
When the Moon circles the Sun, does its path have sections concave towards the Sun, as
Challenge 285 n shown at the right of Figure 86, or not, as shown on the left? (Independent of this issue,
both paths in the diagram disguise that fact that the Moon path does not lie in the same
plane as the path of the Earth around the Sun.)
dynamics due to gravitation 141
Moon
Sun
**
You can prove that objects attract each other (and that they are not only attracted by the
Earth) with a simple experiment that anybody can perform at home, as described on the
http://www.fourmilab.ch/gravitation/foobar website.
**
It is instructive to calculate the escape velocity of the Earth, i.e. that velocity with which
a body must be thrown so that it never falls back. It turns out to be 11 km~s. What is
the escape velocity for the solar system? By the way, the escape velocity of our galaxy is
129 km~s. What would happen if a planet or a system were so heavy that its escape velocity
Challenge 286 n would be larger than the speed of light?
**
For bodies of irregular shape, the centre of gravity of a body is not the same as the centre Copyright © Christoph Schiller November 1997–January 2007
Challenge 287 n of mass. Are you able to confirm this? (Hint: Find and use the simplest example possible.)
**
Can gravity produce repulsion? What happens to a small test body on the inside of a large
Challenge 288 ny C-shaped mass? Is it pushed towards the centre of mass?
**
Ref. 122 The shape of the Earth is not a sphere. As a consequence, a plumb line usually does not
Challenge 289 ny point to the centre of the Earth. What is the largest deviation in degrees?
**
Challenge 290 n What is the largest asteroid one can escape from by jumping?
142 i galilean motion • 2. galilean physics – motion in everyday life
**
If you look at the sky every day at 6 a.m., the Sun’s position varies during the year. The
result of photographing the Sun on the same film is shown in Figure 87. The curve, called
the analemma, is due to the inclination of the Earth’s axis, as well as the elliptical shape
of the path around the Sun. The shape of the analemma is also built into high quality
sundials. The top and the (hidden) bottom points correspond to the solstices.
**
The constellation in which the Sun stands at noon (at the centre of the time zone) is sup- Copyright © Christoph Schiller November 1997–January 2007
posedly called the ‘zodiacal sign’ of that day. Astrologers say there are twelve of them,
namely Aries, Taurus, Gemini, Cancer, Leo, Virgo, Libra, Scorpius, Sagittarius, Capri-
cornus, Aquarius and Pisces and that each takes (quite precisely) a twelfth of a year or
a twelfth of the ecliptic. Any check with a calendar shows that at present, the midday
Sun is never in the zodiacal sign during the days usually connected to it. The relation has
Page 103 shifted by about a month since it was defined, due to the precession of the Earth’s axis. A
check with a map of the star sky shows that the twelve constellations do not have the same
length and that on the ecliptic there are fourteen of them, not twelve. There is Ophiuchus,
the snake constellation, between Scorpius and Sagittarius, and Cetus, the whale, between
Aquarius and Pisces. In fact, not a single astronomical statement about zodiacal signs is
Ref. 123 correct. To put it clearly, astrology, in contrast to its name, is not about stars. (In some
languages, the term for ‘crook’ is derived from the word ‘astrologer’.)
dynamics due to gravitation 143
dm
r
m
R
dM
F I G U R E 88 The vanishing of
gravitational force inside a
spherical shell of matter
**
tury. Sometimes there are also false alarms. One example was the alleged fall of mini
comets on the Earth. They were supposedly made of a few dozen kilograms of ice and hit-
Ref. 127 ting the Earth every few seconds. It is now known not to happen. On the other hand, it is
known that many tons of asteroids fall on the Earth every day, in the form of tiny particles.
Incidentally, discovering objects hitting the Earth is not at all easy. Astronomers like to
point out that an asteroid as large as the one that led to the extinction of the dinosaurs
could hit the Earth without any astronomer noticing in advance, if the direction is slightly
unusual, such as from the south, where few telescopes are located.
**
Universal gravity allows only elliptical, parabolic or hyperbolic orbits. It is impossible
for a small object approaching a large one to be captured. At least, that is what we have
144 i galilean motion • 2. galilean physics – motion in everyday life
learned so far. Nevertheless, all astronomy books tell stories of capture in our solar system;
Challenge 293 n for example, several outer satellites of Saturn have been captured. How is this possible?
**
How would a tunnel have to be shaped in order that a stone would fall through it without
touching the walls? (Assume constant density.) If the Earth did not rotate, the tunnel
would be a straight line through its centre, and the stone would fall down and up again,
in a oscillating motion. For a rotating Earth, the problem is much more difficult. What is
Challenge 294 n the shape when the tunnel starts at the Equator?
**
The International Space Station circles the Earth every 90 minutes at an altitude of about
380 km. You can see where it is from the website http://www.heavens-above.com. By the
way, whenever it is just above the horizon, the station is the third brightest object in the
**
Is it true that the centre of mass of the solar system, its barycentre, is always inside the
Challenge 296 n Sun? Even though a star or the Sun move very little when planets move around them,
this motion can be detected with precision measurements making use of the Doppler
Page 292 effect for light or radio waves. Jupiter, for example, produces a speed change of 13 m~s
in the Sun, the Earth 1 m~s. The first planets outside the solar system, around the pulsar
PSR1257+12 and the star Pegasi 51, was discovered in this way, in 1992 and 1995. In the
meantime, over 150 planets have been discovered with this method. So far, the smallest
planet discovered has 7 times the mass of the Earth.
**
Not all points on the Earth receive the same number of daylight hours during a year. The
Challenge 297 d effects are difficult to spot, though. Can you find one?
**
Can the phase of the Moon have a measurable effect on the human body? What about
Challenge 298 n the tidal effects of the Moon?
Copyright © Christoph Schiller November 1997–January 2007
**
There is an important difference between the heliocentric system and the old idea that
all planets turn around the Earth. The heliocentric system states that certain planets,
such as Mars and Venus, can be between the Earth and the Sun at certain times, and
behind the Sun at other times. In contrast, the geocentric system states that they are al-
ways in between. Why did such an important difference not immediately invalidate the
Challenge 299 n geocentric system?
**
The strangest reformulation of the description of motion given by ma = ∇U is the almost
Ref. 128 absurd looking equation
∇v = dv~ds (47)
dynamics due to gravitation 145
where s is the motion path length. It is called the ray form of Newton’s equation of motion.
Challenge 300 n Can you find an example of its application?
**
Seen from Neptune, the size of the Sun is the same as that of Jupiter seen from the Earth
Challenge 301 n at the time of its closest approach. True?
**
What is gravity? This is not a simple question. In 1690, Nicolas Fatio de Duillier and in
Ref. 129 1747, Georges-Louis Lesage proposed an explanation for the 1~r 2 dependence. Lesage ar-
gued that the world is full of small particles – he called them ‘corpuscules ultra-mondains’
– flying around randomly and hitting all objects. Single objects do not feel the hits, since
they are hit continuously and randomly from all directions. But when two objects are
near to each other, they produce shadows for part of the flux to the other body, resulting
**
Could one put a satellite into orbit using a cannon? Does the answer depend on the dir-
Challenge 304 ny ection in which one shoots?
Copyright © Christoph Schiller November 1997–January 2007
**
Two computer users share experiences. ‘I threw my Pentium III and Pentium IV out of
the window.’ ‘And?’ ‘The Pentium III was faster.’
**
Challenge 305 n How often does the Earth rise and fall when seen from the Moon? Does the Earth show
phases?
**
Challenge 306 ny What is the weight of the Moon? How does it compare with the weight of the Alps?
**
146 i galilean motion • 2. galilean physics – motion in everyday life
Owing to the slightly flattened shape of the Earth, the source of the Mississippi is about
20 km nearer to the centre of the Earth than its mouth; the water effectively runs uphill.
Challenge 307 n How can this be?
**
If a star is made of high density material, the speed of a planet orbiting near to it could
Challenge 308 n be greater than the speed of light. How does nature avoid this strange possibility?
**
What will happen to the solar system in the future? This question is surprisingly hard
to answer. The main expert of this topic, French planetary scientist Jacques Laskar, simu-
Ref. 130 lated a few hundred million years of evolution using computer-aided calculus. He found
Page 267 that the planetary orbits are stable, but that there is clear evidence of chaos in the evolu-
tion of the solar system, at a small level. The various planets influence each other in subtle
and still poorly understood ways. Effects in the past are also being studied, such as the Copyright © Christoph Schiller November 1997–January 2007
energy change of Jupiter due to its ejection of smaller asteroids from the solar system, or
energy gains of Neptune. There is still a lot of research to be done in this field.
**
One of the open problems of the solar system is the description of planet distances dis-
covered in 1766 by Johann Daniel Titius (1729–1796) and publicized by Johann Elert
Bode (1747–1826). Titius discovered that planetary distances d from the Sun can be ap-
proximated by
d = a + 2n b with a = 0.4 AU , b = 0.3 AU (48)
where distances are measured in astronomical units and n is the number of the planet.
The resulting approximation is compared with observations in Table 18.
dynamics due to gravitation 147
TA B L E 19 The orbital
periods known to the
Babylonians
Body Period
Saturn 29 a
Jupiter 12 a
Mars 687 d
Sun 365 d
Venus 224 d
Mercury 88 d
Moon 29 d
Ref. 133 story was told by Cassius Dio (c. 160 to c. 230). Towards the end of Antiquity, the order-
ing was taken up by the Roman empire. In Germanic languages, including English, the
Latin names of the celestial bodies were replaced by the corresponding Germanic gods.
The order Saturday, Sunday, Monday, Tuesday, Wednesday, Thursday and Friday is thus
a consequence of both the astronomical measurements and the astrological superstitions
of the ancients.
**
In 1722, the great mathematician Leonhard Euler made a mistake in his calculation that
led him to conclude that if a tunnel were built from one pole of the Earth to the other,
a stone falling into it would arrive at the Earth’s centre and then immediately turn and
go back up. Voltaire made fun of this conclusion for many years. Can you correct Euler
148 i galilean motion • 2. galilean physics – motion in everyday life
and show that the real motion is an oscillation from one pole to the other, and can you
Challenge 310 n calculate the time a pole-to-pole fall would take (assuming homogeneous density)?
What would be the oscillation time for an arbitrary straight surface-to-surface tunnel
Challenge 311 n of length l, thus not going from pole to pole?
**
Figure 89 shows a photograph of the 1999 solar eclipse taken by the Russian space station
Mir. It clearly shows that a global view of a phenomenon can be quite different from a local
Challenge 312 n one. What is the speed of the shadow?
**
In 2005, satellite measurements have shown that the water in the Amazon river presses
down the land up to 75 mm more in the season when it is full of water than in the season
Ref. 134 when it is almost empty.
Copyright © Christoph Schiller November 1997–January 2007
**
Assume that wires existed that do not break. How long would such a wire have to be so
Challenge 313 n that, attached to the equator, it would stand upright in the air?
**
Everybody know that there are roughly two tides per day. But there are places, such as on
the coast of , where there is only one tide per day. See http://www.jason.oceanobs.com/
html/applications/marees/marees_m2k1_fr.html.
**
It is sufficient to use the concept of centrifugal force to show that the rings of Saturn
cannot be made of massive material, but must be made of separate pieces. Can you find
what is cl assical mechanics? 149
**
A painting is hanging on the wall of Dr. Dolittle’s waiting room. He hung up the paint-
ing using two nails, and wound the picture wire around the nails in such a way that the
Challenge 315 e painting would fall if either nail were pulled out. How did Dr. Dolittle do it?
“
Copyright © Christoph Schiller November 1997–January 2007
Everybody has to take a stand on this question, even students of physics. Indeed, many
types of forces are used and observed in daily life. One speaks of muscular, gravitational,
”
* This is in contrast to the actual origin of the term ‘mechanics’, which means ‘machine science’. It derives
from the Greek µηκανή, which means ‘machine’ and even lies at the origin of the English word ‘machine’
itself. Sometimes the term ‘mechanics’ is used for the study of motion of solid bodies only, excluding, e.g.,
hydrodynamics. This use fell out of favour in physics in the last century.
** This is not completely correct: in the 1980s, the first case of gravitational friction was discovered: the
Page 411 emission of gravity waves. We discuss it in detail later.
150 i galilean motion • 2. galilean physics – motion in everyday life
O b s e r va t i o n Force
psychic, sexual, satanic, supernatural, social, political, economic and many others. Physi-
cists see things in a simpler way. They call the different types of forces observed between
F=
dp
. (49)
dt
Force is the change or flow of momentum. If a force acts on a body, momentum flows
into it. Indeed, momentum can be imagined to be some invisible and intangible liquid.
Force measures how much of this liquid flows from one body to another per unit time.
Using the Galilean definition of linear momentum p = mv, we can rewrite the defini-
tion of force (for constant mass) as
F = ma , (50)
where F = F(t, x) is the force acting on an object of mass m and where a = a(t, x) =
dv~dt = d2 x~dt 2 is the acceleration of the same object, that is to say its change of velocity.*
The expression states in precise terms that force is what changes the velocity of masses. The
quantity is called ‘force’ because it corresponds in many, but not all aspects to muscular
force. For example, the more force is used, the further a stone can be thrown.
Copyright © Christoph Schiller November 1997–January 2007
However, whenever the concept of force is used, it should be remembered that physical
force is different from everyday force or everyday effort. Effort is probably best approxim-
ated by the concept of (physical) power, usually abbreviated P, and defined (for constant
force) as
P= =Fëv
dW
(51)
dt
* This equation was first written down by the Swiss mathematician and physicist Leonhard Euler (1707–
1783) in 1747, 20 years after the death of Newton, to whom it is usually and falsely ascribed; it was Euler,
not Newton, who first understood that this definition of force is useful in every case of motion, whatever
Ref. 25 the appearance, be it for point particles or extended objects, and be it rigid, deformable or fluid bodies.
Surprisingly and in contrast to frequently-made statements, equation (50) is even correct in relativity, as
shown on page 331.
what is cl assical mechanics? 151
will hardly ever use the term ‘weight’ in the microscopic part of our adventure.
Through its definition, the concept of force is distinguished clearly from ‘mass’, ‘mo-
mentum’, ‘energy’ and ‘power’. But where do forces originate? In other words, which ef-
fects in nature have the capacity to accelerate bodies by pumping momentum into ob-
jects? Table 21 gives an overview.
Every example of motion, from the one that lets us choose the direction of our gaze
to the one that carries a butterfly through the landscape, can be put into one of the two
left-most columns of Table 21. Physically, the two columns are separated by the following
* This stepping stone is so high that many professional physicists do not really take it themselves; this is
confirmed by the innumerable comments in papers that state that physical force is defined using mass, and, at
the same time, that mass is defined using force (the latter part of the sentence being a fundamental mistake).
152 i galilean motion • 2. galilean physics – motion in everyday life
criterion: in the first class, the acceleration of a body can be in a different direction from
its velocity. The second class of examples produces only accelerations that are exactly
opposed to the velocity of the moving body, as seen from the frame of reference of the
braking medium. Such a resisting force is called friction, drag or a damping. All examples
Challenge 321 e in the second class are types of friction. Just check.
Friction can be so strong that all motion of a body against its environment is made
impossible. This type of friction, called static friction or sticking friction, is common and
important: without it, turning the wheels of bicycles, trains or cars would have no effect.
Without static friction, wheels driven by a motor would have no grip. Similarly, not a
single screw would stay tightened. We could neither run nor walk in a forest, as the soil
would be more slippery than polished ice. In fact not only our own motion, but all volun-
tary motion of living beings is based on friction. The same is the case for self-moving ma-
chines. Without static friction, the propellers in ships, aeroplanes and helicopters would
not have any effect and the wings of aeroplanes would produce no lift to keep them in the
F = c w ρAv 2
1
(52)
2
where A is the area of its cross-section and v its velocity relative to the air, ρ is the density
of air; the drag coefficient c w is a pure number that depends on the shape of the moving
object. (A few examples are given in Figure 90. The formula is valid for all fluids, not only
Challenge 323 ny for air.) You may check that aerodynamic resistance cannot be derived from a potential.**
* Recent research suggest that maybe in certain crystalline systems, such as tungsten bodies on silicon, under
ideal conditions gliding friction can be extremely small and possibly even vanish in certain directions of
Ref. 137 motion. This so-called superlubrication is presently a topic of research.
** Such a statement about friction is correct only in three dimensions, as is the case in nature; in the case of
154 i galilean motion • 2. galilean physics – motion in everyday life
friction between disc and the reading head is a central quantity in determining the life-
time.
All forms of friction are accompanied by an increase in the temperature of the mov-
ing body. The reason became clear after the discovery of atoms. Friction is not observed
in few – e.g. 2, 3, or 4 – particle systems. Friction only appears in systems with many
particles, usually millions or more. Such systems are called dissipative. Both the tem-
perature changes and friction itself are due to the motion of large numbers of micro-
scopic particles against each other. This motion is not included in the Galilean descrip-
tion. When it is included, friction and energy loss disappear, and potentials can then be
used throughout. Positive accelerations – of microscopic magnitude – then also appear,
and motion is found to be conserved. As a result, all motion is conservative on a micro-
scopic scale. Therefore, on a microscopic scale it is possible to describe all motion without
the concept of force.* The moral of the story is that one should use force only in one situ-
ation: in the case of friction, and only when one does not want to go into the microscopic
details.**
Et qu’avons-nous besoin de ce moteur, quand
We often describe the motion of a body by specifying the time dependence of its position, ”
for example as
The quantities with an index 0, such as the starting position x0 , the starting velocity v0 ,
etc., are called initial conditions. Initial conditions are necessary for any description of mo-
tion. Different physical systems have different initial conditions. Initial conditions thus
specify the individuality of a given system. Initial conditions also allow us to distinguish
the present situation of a system from that at any previous time: initial conditions specify
the changing aspects of a system. In other words, they summarize the past of a system. Copyright © Christoph Schiller November 1997–January 2007
* The first scientist who eliminated force from the description of nature was Heinrich Rudolf Hertz (b. 1857
Hamburg, d. 1894 Bonn), the famous discoverer of electromagnetic waves, in his textbook on mechanics,
Die Prinzipien der Mechanik, Barth, 1894, republished by Wissenschaftliche Buchgesellschaft, Darmstadt,
1963. His idea was strongly criticized at that time; only a generation later, when quantum mechanics quietly
got rid of the concept for good, did the idea become commonly accepted. (Many have speculated about
the role Hertz would have played in the development of quantum mechanics and general relativity, had he
not died so young.) In his book, Hertz also formulated the principle of the straightest path: particles follow
geodesics. This same description is one of the pillars of general relativity, as we will see later on.
** In the case of human relations the evaluation should be somewhat more discerning, as the research by
Ref. 141 James Gilligan shows.
*** ‘And whatfor do we need this motor, when the reasoned study of nature proves to us that perpetual
motion is the first of its laws?’
Ref. 60 **** ‘What future will be tomorrow, never ask ...’ Horace is Quintus Horatius Flaccus (65–8 bce ), the great
Roman poet.
156 i galilean motion • 2. galilean physics – motion in everyday life
Page 39 Initial conditions are thus precisely the properties we have been seeking for a descrip-
tion of the state of a system. To find a complete description of states we thus need only a
complete description of initial conditions. It turns out that for gravitation, as for all other
microscopic interactions, there is no need for initial acceleration a0 , initial jerk j0 , or
higher-order initial quantities. In nature, acceleration and jerk depend only on the prop-
erties of objects and their environment; they do not depend on the past. For example,
the expression a = GM~r 2 , giving the acceleration of a small body near a large one, does
not depend on the past, but only on the environment. The same happens for the other
fundamental interactions, as we will find out shortly.
Page 71 The complete state of a moving mass point is thus described by specifying its position
and its momentum at all instants of time. Thus we have achieved a complete description
of the intrinsic properties of point objects, namely by their mass, and of their states of mo-
tion, namely by their momentum, energy, position and time. For extended rigid objects
we also need orientation, angular velocity and angular momentum. Can you specify the
“ future is uncertain.
Anonymous
”
* We cannot infer the events of the future from those of the present. Superstition is nothing but belief in the
causal nexus.
what is cl assical mechanics? 157
If, after climbing a tree, we jump down, we cannot halt the jump in the middle of the ”
trajectory; once the jump has begun, it is unavoidable and determined, like all passive
motion. However, when we begin to move an arm, we can stop or change its motion
from a hit to a caress. Voluntary motion does not seem unavoidable or predetermined.
Challenge 333 e Which of these two cases is the general one?
Let us start with the example that we can describe most precisely so far: the fall of a
body. Once the potential φ acting on a particle is given and taken into account, using
we then can determine the motion in advance. The complete trajectory x(t) can be cal-
culated with these two pieces of information. Owing to this possibility, an equation such
Make key as (55) is called an evolution equation for the motion of the object. (Note that the term
terms bold. An ‘evolution’ has different meanings in physics and in biology.) An evolution equation al-
edit of this
paragraph
ways expresses the observation that not all types of change are observed in nature, but
appears in the only certain specific cases. Not all imaginable sequences of events are observed, but only
cover a limited number of them. In particular, equation (55) expresses that from one instant to
memorandum. the next, objects change their motion based on the potential acting on them. Thus, given
an evolution equation and initial state, the whole motion of a system is uniquely fixed;
this property of motion is often called determinism. Since this term is often used with
different meanings, let us distinguish it carefully from several similar concepts, to avoid
misunderstandings.
Motion can be deterministic and at the same time still be unpredictable. The latter
property can have four origins: an impracticably large number of particles involved, the
complexity of the evolution equations, insufficient information about initial conditions,
and strange shapes of space-time. The weather is an example where the first three condi-
tions are fulfilled at the same time.* Nevertheless, its motion is still deterministic. Near
black holes all four cases apply together. We will discuss black holes in the section on
Copyright © Christoph Schiller November 1997–January 2007
the description of states and initial conditions in the microscopic domain are included.
In short, determinism does not contradict (macroscopic) irreversibility. However, on the
microscopic scale, deterministic motion is always reversible.
A final concept to be distinguished from determinism is acausality. Causality is the
requirement that a cause must precede the effect. This is trivial in Galilean physics, but
becomes of importance in special relativity, where causality implies that the speed of light
is a limit for the spreading of effects. Indeed, it seems impossible to have deterministic
motion (of matter and energy) which is acausal, i.e. faster than light. Can you confirm
Challenge 334 n this? This topic will be looked at more deeply in the section on special relativity.
Saying that motion is ‘deterministic’ means that it is fixed in the future and also in
the past. It is sometimes stated that predictions of future observations are the crucial test
for a successful description of nature. Owing to our often impressive ability to influence
the future, this is not necessarily a good test. Any theory must, first of all, describe past
observations correctly. It is our lack of freedom to change the past that results in our lack
everyday life, that of classical physics and that of quantum mechanics, is intrinsically and
inescapably deterministic, since it connects observations of the past and the future, elim-
inating alternatives. In short, the use of time implies determinism, and vice versa. When
drawing metaphysical conclusions, as is so popular nowadays when discussing quantum
Page 851 theory, one should never forget this connection. Whoever uses clocks but denies determ-
inism is nurturing a split personality!*
The idea that motion is determined often produces fear, because we are taught to asso-
ciate determinism with lack of freedom. On the other hand, we do experience freedom
in our actions and call it free will. We know that it is necessary for our creativity and for
our happiness. Therefore it seems that determinism is opposed to happiness.
But what precisely is free will? Much ink has been consumed trying to find a precise
definition. One can try to define free will as the arbitrariness of the choice of initial con-
ditions. However, initial conditions must themselves result from the evolution equations,
so that there is in fact no freedom in their choice. One can try to define free will from the
idea of unpredictability, or from similar properties, such as uncomputability. But these
definitions face the same simple problem: whatever the definition, there is no way to prove
experimentally that an action was performed freely. The possible definitions are useless.
In short, free will cannot be observed. (Psychologists also have a lot of their own data to
support this, but that is another topic.)
No process that is gradual – in contrast to sudden – can be due to free will; gradual
processes are described by time and are deterministic. In this sense, the question about
free will becomes one about the existence of sudden changes in nature. This will be a re-
curring topic in the rest of this walk. Does nature have the ability to surprise? In everyday
life, nature does not. Sudden changes are not observed. Of course, we still have to invest-
ing free, you should change your life! Fear of determinism usually stems from refusal to
take the world the way it is. Paradoxically, it is precisely he who insists on the existence
of free will who is running away from responsibility.
* That free will is a feeling can also be confirmed by careful introspection. The idea of free will always appears
after an action has been started. It is a beautiful experiment to sit down in a quiet environment, with the
intention to make, within an unspecified number of minutes, a small gesture, such as closing a hand. If you
Challenge 337 e carefully observe, in all detail, what happens inside yourself around the very moment of decision, you find
either a mechanism that led to the decision, or a diffuse, unclear mist. You never find free will. Such an
experiment is a beautiful way to experience deeply the wonders of the self. Experiences of this kind might
also be one of the origins of human spirituality, as they show the connection everybody has with the rest of
nature.
Challenge 339 n ** If nature’s ‘laws’ are deterministic, are they in contrast with moral or ethical ‘laws’? Can people still be
held responsible for their actions?
160 i galilean motion • 2. galilean physics – motion in everyday life
”
Darum kann es in der Logik auch nie
“ Überraschungen geben.*
Ludwig Wittgenstein, Tractatus, 6.1251
Classical mechanics describes nature in a rather simple way. Objects are permanent and ”
massive entities localized in space-time. States are changing properties of objects, de-
scribed by position in space and instant in time, by energy and momentum, and by their
rotational equivalents. Time is the relation between events measured by a clock. Clocks
are devices in undisturbed motion whose position can be observed. Space and position
is the relation between objects measured by a metre stick. Metre sticks are devices whose
shape is subdivided by some marks, fixed in an invariant and observable manner. Motion
Biblio graphy
Aiunt enim multum legendum esse, non multa.
“ Plinius, Epistulae.*
1 For a history of science in antiquity, see Lucio Russo, La rivoluzione dimenticata, Fel-
”
trinelli 1996, also available in several other languages. Cited on page 29.
2 An overview of motion illusions can be found on the excellent website http://www.
michaelbach.de/ot. See for example, http://www.michaelbach.de/ot/mot_rotsnake/index.
html. Cited on page 30.
3 A beautiful book explaining physics and its many applications in nature and technology
vividly and thoroughly is Paul G. Hewitt, John Suchocki & Leslie A. Hewitt,
Conceptual Physical Science, Bejamin/Cummings, 1999.
A book famous for its passion for curiosity is R ichard P. Feynman, R.B. Leighton
5 All the known texts by Parmenides and Heraclitus can be found in Jean-Paul Dumont,
Les écoles présocratiques, Folio-Gallimard, 1988. Views about the non-existence of motion
have also been put forward by much more modern and much more contemptible authors,
Page 694 such as in 1710 by Berkeley. Cited on page 30.
6 An example of people worried by Zeno is given by William McL aughlin, Resolving
Zeno’s paradoxes, Scientific American pp. 66–71, November 1994. The actual argument was
not about a hand slapping a face, but about an arrow hitting the target. See also Ref. 48.
Cited on page 30.
* ‘Read much, but not anything.’ Ep. 7, 9, 15. Gaius Plinius Secundus (b. 23/4 Novum Comum, d.
79 Vesuvius eruption), Roman writer, especially famous for his large, mainly scientific work Historia nat-
uralis, which has been translated and read for almost 2000 years.
162 i galilean motion
7 The full text of La Beauté and the other poems from Les fleurs du mal, one of the finest books
of poetry ever written, can be found at the http://hypermedia.univ-paris8.fr/bibliotheque/
Baudelaire/Spleen.html website. Cited on page 32.
8 The most famous text is Jearl Walker, The Flying Circus of Physics, Wiley, 1975. For
more interesting physical effects in everyday life, see Erwein Fl achsel, Hundertfünfzig
Physikrätsel, Ernst Klett Verlag, 1985. The book also covers several clock puzzles, in puzzle
numbers 126 to 128. Cited on page 32.
9 A concise and informative introduction into the history of classical physics is given in the
first chapter of the book by F.K. R ichtmeyer, E.H. Kennard & J.N. Cooper, Intro-
duction to Modern Physics, McGraw–Hill, 1969. Cited on page 33.
10 A good overview over the arguments used to prove the existence of god from motion is given
by Michael Buckley, Motion and Motion’s God, Princeton University Press, 1971. The
intensity of the battles waged around these failed attempts is one of the tragicomic chapters
of history. Cited on page 33.
18 On the topic of grace and poise, see e.g. the numerous books on the Alexander technique,
such as M. Gelb, Body Learning – An Introduction to the Alexander Technique, Aurum
Press, 1981, and R ichard Brennan, Introduction to the Alexander Technique, Little
Brown and Company, 1996. Among others, the idea of the Alexander technique is to re-
turn to the situation that the muscle groups for sustention and those for motion are used
only for their respective function, and not vice versa. Any unnecessary muscle tension, such
as neck stiffness, is a waste of energy due to the use of sustention muscles for movement and
of motion muscles for sustention. The technique teaches the way to return to the natural use
of muscles.
Motion of animals was discussed extensively already in the seventeenth century by
G. B orelli, De motu animalium, 1680. An example of a more modern approach is J.J.
Collins & I. Stewart, Hexapodal gaits and coupled nonlinear oscillator models, Biolo-
gical Cybernetics 68, pp. 287–298, 1993. See also I. Stewart & M. Golubitsky, Fearful
bibliography 163
23 All fragments from Heraclitus are from John Mansley Robinson, An Introduction to
Early Greek Philosophy, Houghton Muffin 1968, chapter 5. Cited on page 38.
24 An introduction to Newton the alchemist are the two books by Betty Jo Teeter Dobbs,
The Foundations of Newton’s Alchemy, Cambridge University Press, 1983, and The Janus Face
of Genius, Cambridge University Press, 1992. Newton is found to be a sort of highly intellec-
tual magician, desperately looking for examples of processes where gods interact with the
material world. An intense but tragic tale. A good overview is provided by R.G. Keesing,
Essay Review: Newton’s Alchemy, Contemporary Physics 36, pp. 117–119, 1995.
Newton’s infantile theology, typical for god seekers who grew up without a father, can be
found in the many books summarizing the letter exchanges between Clarke, his secretary,
and Leibniz, Newton’s rival for fame. Cited on page 43.
25 An introduction to the story of classical mechanics, which also destroys a few of the myths
surrounding it – such as the idea that Newton could solve differential equations or that he
introduced the expression F = ma – is given by Clifford A. Truesdell, Essays in the
History of Mechanics, Springer, 1968. Cited on pages 43, 127, and 150.
pp. 225–232, 2004. The chemical clocks in our body are described in John D. Palmer, The
Living Clock, Oxford University Press, 2002, or in A. Ahlgren & F. Halberg, Cycles of
Nature: An Introduction to Biological Rhythms, National Science Teachers Association, 1990.
See also the http://www.msi.umn.edu/~halberg/introd website. Cited on page 47.
31 This has been shown among others by the work of Anna Wierzbicka mentioned in more de-
tail in the Intermezzo following this chapter, on page 659. The passionate best seller by the
Chomskian author Steven Pinker, The Language Instinct – How the Mind Creates Lan-
guage, Harper Perennial, 1994, also discusses issues related to this matter, refuting amongst
others on page 63 the often repeated false statement that the Hopi language is an exception.
Cited on page 47.
32 Aristotle rejects the idea of the flow of time in chapter IV of his Physics. See the full text on
the http://classics.mit.edu/Aristotle/physics.4.iv.html website. Cited on page 49.
bibliography 165
33 Perhaps the most informative of the books about the ‘arrow of time’ is Hans Dieter Zeh,
The Physical Basis of the Direction of Time, Springer Verlag, 4th edition, 2001. It is still the
best book on the topic. Most other texts – have a look on the internet – lack clarity of ideas.
A typical conference proceeding is J.J. Halliwell, J. Pérez–Mercader & Woj-
ciech H. Zurek, Physical Origins of Time Asymmetry, Cambridge University Press, 1994.
Cited on page 49.
34 On the issue of absolute and relative motion there are many books about few issues. Ex-
amples are John Barbour, Absolute or Relative Motion? Vol. 1: A Study from the Machian
Point of View of the Discovery and the Structure of Spacetime Theories, Cambridge Univer-
sity Press, 1989, John Barbour, Absolute or Relative Motion? Vol. 2: The Deep Structure
of General Relativity, Oxford University Press, 2005, or John Earman, World Enough and
Spacetime: Absolute vs Relational Theories of Spacetime, MIT Press, 1989. Cited on page 53.
35 R. Dougherty & M. Foreman, Banach–Tarski decompositions using sets with the
property of Baire, Journal of the American Mathematical Society 7, pp. 75–124, 1994. See
also Al an L.T. Paterson, Amenability, American Mathematical Society, 1998, and
43 Andrea Frova, La fisica sotto il naso – 44 pezzi facili, Biblioteca Universale Rizzoli, Mil-
ano, 2001. Cited on page 66.
44 On the other hands, other sciences enjoy studying usual paths in all detail. See, for example,
Heini Hediger, editor, Die Straßen der Tiere, Vieweg & Sohn, 1967. Cited on page 66.
45 The study of shooting faeces (i.e., shit) and its mechanisms is a part of modern biology. The
reason that caterpillars do this was determined by M. Weiss, Good housekeeping: why do
shelter-dwelling caterpillars fling their frass?, Ecology Letters 6, pp. 361–370, 2003, who also
gives the present record of 1.5 m for the 24 mg pellets of Epargyreus clarus. The picture of
the flying frass is from S. Caveney, H. McLean & D. Surry, Faecal firing in a skipper
caterpillar is pressure-driven, The Journal of Experimental Biology 201, pp. 121–133, 1998.
Cited on page 67.
46 This was discussed in the Frankfurter Allgemeine Zeitung, 2nd of August, 1997, at the time
of the world athletics championship. The values are for the fastest part of the race of a 100 m
sprinter; the exact values cited were called the running speed world records in 1997, and were
given as 12.048 m~s = 43.372 km~h by Ben Johnson for men, and 10.99 m~s = 39.56 km~h
50 An overview of the latest developments is given by J.T. Armstrong, D.J. Hunter, K.J.
Johnston & D. Mozurkewich, Stellar optical interferometry in the 1990s, Physics
Today pp. 42–49, May 1995. More than 100 stellar diameters were known already in 1995.
Several dedicated powerful instruments are being planned. Cited on page 73.
51 A good biology textbook on growth is Arthur F. Hopper & Nathan H. Hart, Found-
ations of Animal Deveopment, Oxford University Press, 2006. Cited on page 74.
52 This is discussed for example in C.L. Stong, The amateur scientist – how to supply electric
power to something which is turning, Scientific American pp. 120–125, December 1975. It
also discusses how to make a still picture of something rotating simply by using a few prisms,
the so-called Dove prisms. Other examples of attaching something to a rotating body are
given by E. R ieflin, Some mechanisms related to Dirac’s strings, American Journal of
Physics 47, pp. 379–381, 1979. Cited on page 74.
bibliography 167
53 James A. Young, Tumbleweed, Scientific American 264, pp. 82–87, March 1991. The
tumbleweed is in fact quite rare, except in in Hollywood westerns, where all directors feel
obliged to give it a special appearance. Cited on page 74.
54 The first experiments to prove the rotation of the flagella were by M. Silverman & M.I.
Simon, Flagellar rotation and the mechanism of bacterial motility, Nature 249, pp. 73–74,
1974. For some pretty pictures of the molecules involved, see K. Namba, A biological mo-
lecular machine: bacterial flagellar motor and filament, Wear 168, pp. 189–193, 1993, or
the website http://www.nanonet.go.jp/english/mailmag/2004/011a.html. The present record
speed of rotation, 1700 rotations per second, is reported by Y. Magariyama, S. Sug-
iyama, K. Muramoto, Y. Maekawa, I. Kawagishi, Y. Imae & S. Kud o, Very fast
flagellar rotation, Nature 371, p. 752, 1994.
More on bacteria can be learned from David Dusenbery, Life at a Small Scale, Sci-
entific American Library, 1996. Cited on page 75.
55 On shadows, see the agreeable popular text by Roberto Casati, Alla scoperta dell’ombra
– Da Platone a Galileo la storia di un enigma che ha affascinato le grandi menti dell’umanità,
A discussion and a breakdown into electricity usage (14 EJ) and other energy forms, with
variations per country, can be found in S. Benka, The energy challenge, Physics Today
55, pp. 38–39, April 2002, and in E.J. Monitz & M.A. Kenderdine, Meeting energy
challenges: technology and policy, Physics Today 55, pp. 40–46, April 2002. Cited on page
87.
64 For an overview, see the paper by J.F. Mulligan & H.G. Hertz, An unpublished lecture
by Heinrich Hertz: ‘On the energy balance of the Earth’, American Journal of Physics 65,
pp. 36–45, 1997. Cited on page 88.
65 For a beautiful photograph of this feline feat, see the cover of the journal and the article of
J. Darius, A tale of a falling cat, Nature 308, p. 109, 1984. Cited on page 92.
66 Natthi L. Sharma, A new observation about rolling motion, European Journal of Physics
17, pp. 353–356, 1996. Cited on page 93.
168 i galilean motion
67 C. Singh, When physical intuition fails, American Journal of Physics 70, pp. 1103–1109, 2002.
Cited on page 93.
68 Serge Gracovetsky, The Spinal Engine, Springer Verlag, 1990. It is now also knon that
human gait is chaotic. This is explained by M. Perc, The dynamics of human gait, European
Journal of Physics 26, pp. 525–534, 2005. Cited on page 94.
69 Thomas Heath, Aristarchus of Samos – the Ancient Copernicus, Dover, 1981, reprinted
from the original 1913 edition. Aristarchos’ treaty is given in Greek and English. Aristarchos
was the first proposer of the heliocentric system. Aristarchos had measured the length of the
day (in fact, by determining the number of days per year) to the astonishing precision of
less than one second. This excellent book also gives an overview of Greek astronomy before
Aristarchos, explained in detail for each Greek thinker. Aristarchos’ text is also reprinted
in Aristarchos, On the sizes and the distances of the Sun and the Moon, c. 280 bce in
Michael J. Crowe, Theories of the World From Antiquity to the Copernican Revolution,
Dover, 1990, especially on pp. 27–29. Cited on page 95.
70 The influence of the Coriolis effect on icebergs was studied most thoroughly by the Swedish
74 The reference is J.G. Hagen, La rotation de la terre : ses preuves mécaniques anciennes et
nouvelles, Sp. Astr. Vaticana Second. App. Rome, 1910. His other experiment is published as
J.G. Hagen, How Atwood’s machine shows the rotation of the Earth even quantitatively,
International Congress of Mathematics, Aug. 1912. Cited on page 99.
75 R. Anderson, H.R. Bilger & G.E. Stedman, The Sagnac-effect: a century of Earth-
rotated interferometers, American Journal of Physics 62, pp. 975–985, 1994.
See also the clear and extensive paper by G.E. Stedman, Ring laser tests of funda-
mental physics and geophysics, Reports on Progress in Physics 60, pp. 615–688, 1997. Cited
on page 99.
76 About the length of the day, see the http://maia.usno.navy.mil website, or the books by
K. L ambeck, The Earth’s Variable Rotation: Geophysical Causes and Consequences, Cam-
bridge University Press, 1980, and by W.H. Munk & G.J.F. MacDonald, The Rotation
bibliography 169
of the Earth, Cambridge University Press, 1960. About a modern ring laser set-up, see http://
www.wttzell.ifag.de. Cited on pages 100 and 134.
77 One example of data is by C.P. Sonett, E.P. Kvale, A. Z akharian, M.A. Chan &
T.M. Demko, Late proterozoic and paleozoic tides, retreat of the moon, and rotation of
the Earth, Science 273, pp. 100–104, 5 July 1996. They deduce from tidal sediment analysis
that days were only 18 to 19 hours long in the Proterozoic, i.e. 900 million years ago; they
assume that the year was 31 million seconds long from then to today. See also C.P. Sonett
& M.A. Chan, Neoproterozoic Earth-Moon dynamics – rework of the 900 MA Big Cot-
tonwood canyon tidal laminae, Geophysical Research Letters 25, pp. 539–542, 1998. Another
determination was by G.E. Williams, Precambrian tidal and glacial clastic deposits: im-
plications for precambrian Earth–Moon dynamics and palaeoclimate, Sedimentary Geology
120, pp. 55–74, 1998. Using a geological formation called tidal rhythmites, he deduced that
about 600 million years ago there were 13 months per year and a day had 22 hours. Cited
on page 100.
78 The story of this combination of history and astronomy is told in R ichard Stephenson,
the ice ages, Science 194, pp. 1121–1132, 1976. They found the result by literally digging in the
mud that covers the ocean floor in certain places. Note that the web is full of information
on the ice ages. Just look up ‘Milankovitch’ in a search engine. Cited on page 104.
83 R. Humphreys & J. L arsen, The sun’s distance above the galactic plane, Astronomical
Journal 110, pp. 2183–2188, November 1995. Cited on page 104.
84 C.L. Bennet, M.S. Turner & M. White, The cosmic rosetta stone, Physics Today 50,
pp. 32–38, November 1997. Cited on page 105.
85 A good roulette prediction story from the 1970s is told by Thomas A. Bass, The Eu-
daemonic Pie also published under the title The Newtonian Casino, Backinprint, 2000. An
overview up to 1998 is given in the paper Edward O. Thorp, The invention of the first
wearable computer, Proceedings of the Second International Symposium on Wearable Com-
puters (ISWC 1998), 19-20 October 1998, Pittsburgh, Pennsylvania, USA (IEEE Computer Soci-
170 i galilean motion
metic water strider robot, Proceedings of the IEEE International Conference on Robotics
and Automation, Orlando, USA, 2006. S.H. Suhr, Y.S. Song, S.J. Lee & M. Sitti, Bio-
logically inspired miniature water strider robot, Proceedings of the Robotics: Science and
Systems I, Boston, USA, 2005. See also the website http://www.me.cmu.edu/faculty1/sitti/
nano/projects/waterstrider. Cited on page 117.
96 M. Wittlinger, R. Wehner & H. Wolf, The ant odometer: stepping on stilts and
stumps, Science 312, pp. 1965–1967, 2006. Cited on page 118.
97 The material on the shadow discussion is from the book by Robert M. Pryce, Cook and
Peary, Stackpole Books, 1997. See also Wally Herbert, The Noose of Laurels, Doubleday
1989. The sad story of Robert Peary is also told in the the National Geographic Society had
financed Peary in his attempt and had supported him until the US Congress had declared
him the first man at the Pole, the (partial) retraction is noteworthy. (The magazine then
bibliography 171
changed its mind again later on, to sell more copies.) By the way, the photographs of Cook,
who claimed to have been at the North Pole even before Peary, have the same problem with
the shadow length. Both men have a history of cheating about their ‘exploits’. As a result, the
first man at the North Pole was probably Roald Amundsen, who arrived there a few years
later, and who was also the first man at the South Pole. Cited on page 118.
98 The story is told in M. Nauenberg, Hooke, orbital motion, and Newton’s Principia, Amer-
ican Journal of Physics 62, 1994, pp. 331–350. Cited on page 119.
99 More details are given by D. R awlins, in Doubling your sunsets or how anyone can meas-
ure the Earth’s size with wristwatch and meter stick, American Journal of Physics 47, 1979,
pp. 126–128. Another simple measurement of the Earth radius, using only a sextant, is given
by R. O’Keefe & B. Ghavimi–Al agha, in The World Trade Centre and the distance
to the world’s centre, American Journal of Physics 60, pp. 183–185, 1992. Cited on page 119.
100 More details on astronomical distance measurements can be found in the beautiful little
book by A. van Helden, Measuring the Universe, University of Chicago Press, 1985, and in
Nigel Henbest & Heather Cooper, The Guide to the Galaxy, Cambridge University
106 About the measurement of spatial dimensions via gravity – and the failure to find any hint
for a number different from three – see the review by E.G. Adelberger, B.R. Heckel
& A.E. Nelson, Tests of the gravitational inverse-square law, Annual Review of Nuclear
and Particle Science 53, pp. 77–121, 2003, also http://arxiv.org/abs/hep-ph/0307284, or the
review by J.A. Hewett & M. Spiropulu, Particle physics probes of extra spacetime
dimensions, Annual Review of Nuclear and Particle Science 52, pp. 397–424, 2002, http://
arxiv.org/abs/hep-ph/0205106. Cited on page 124.
107 There are many books explaining the origin of the precise shape of the Earth, such as the
pocket book S. Anders, Weil die Erde rotiert, Verlag Harri Deutsch, 1985. Cited on page
124.
108 The shape of the Earth is described most precisely with the World Geodetic System. For
an extensive presentation, see the http://www.eurocontrol.be/projects/eatchip/wgs84/start.
172 i galilean motion
html website. See also the website of the International Earth Rotation Service at http://hpiers.
obspm.fr. Cited on page 124.
109 W.K. Hartman, R.J. Phillips & G.J. Taylor, editors, Origin of the Moon, Lunar and
Planetary Institute, 1986. Cited on page 128.
110 If you want to read about the motion of the Moon in all its fascinating details, have a look
at Martin C. Gu tzwiller, Moon–Earth–Sun: the oldest three body problem, Reviews
of Modern Physics 70, pp. 589–639, 1998. Cited on page 128.
111 Dietrich Neumann, Physiologische Uhren von Insekten – Zur Ökophysiologie lun-
arperiodisch kontrollierter Fortpflanzungszeiten, Naturwissenschaften 82, pp. 310–320, 1995.
Cited on page 128.
112 The origin of the duration of the menstrual cycle is not yet settled; however, there are ex-
planations on how it becomes synchronized with other cycles. For a general explanation see
Arkady Pikovsky, Michael Rosenblum & Jürgen Kurths, Synchronization: A
Universal Concept in Nonlinear Science, Cambridge University Press, 2002. Cited on page
129.
120
& Chemie 59, p. 354, 1896, and by R. von Eőtvős, V. Pekár & E. Fekete, Beiträge
zum Gesetz der Proportionalität von Trägheit und Gravität, Annalen der Physik 4, Leipzig
68, pp. 11–66, 1922. He found agreement to 5 parts in 109 . More experiments were performed
by P.G. Roll, R. Krotkow & R.H. Dicke, The equivalence of inertial and passive grav-
itational mass, Annals of Physics (NY) 26, pp. 442–517, 1964, one of the most interesting and
entertaining research articles in experimental physics, and by V.B. Braginsky & V.I.
Panov, Soviet Physics – JETP 34, pp. 463–466, 1971. Modern results, with errors less than
one part in 1012 , are by Y. Su & al., New tests of the universality of free fall, Physical Review
D50, pp. 3614–3636, 1994. Several experiments have been proposed to test the equality in
space to less than one part in 1016 . Cited on page 136.
121 See L. Hod ges, Gravitational field strength inside the Earth, American Journal of Physics
59, pp. 954–956, 1991. Cited on page 140.
bibliography 173
122 P. Mohazzabi & M.C. James, Plumb line and the shape of the Earth, American Journal
of Physics 68, pp. 1038–1041, 2000. Cited on page 141.
123 From Neil de Gasse Tyson, The Universe Down to Earth, Columbia University Press,
1994. Cited on page 142.
124 This is a small example from the beautiful text by Mark P. Silverman, And Yet It Moves:
Strange Systems and Subtle Questions in Physics, Cambridge University Press, 1993. It is a
treasure chest for anybody interested in the details of physics. Cited on page 143.
125 G.D. Quinl an, Planet X: a myth exposed, Nature 363, pp. 18–19, 1993. Cited on page 143.
126 See http://en.wikipedia.org/wiki/90377_Sedna. Cited on page 143.
127 See R. Matthews, Not a snowball’s chance ..., New Scientist 12 July 1997, pp. 24–27. The
original claim is by Louis A. Frank, J.B. Sigwarth & J.D. Craven, On the influx of
small comets into the Earth’s upper atmosphere, parts I and II, Geophysical Research Letters
13, pp. 303–306, pp. 307–310, 1986. The latest observations have disproved the claim. Cited
on page 143.
Nice use of ”
A
typography! ll over the Earth – even in Australia – people observe that stones fall ‘down’. This
ncient observation led to the discovery of the universal ‘law’ of gravity. To find it,
ll that was necessary was to look for a description of gravity that was valid globally.
The only additional observation that needs to be recognized in order to deduce the result
a = GM~r 2 is the variation of gravity with height.
In short, thinking globally helps us to make our description of motion more precise.
How can we describe motion as globally as possible? It turns out that there are six ap-
proaches to this question, each of which will be helpful on our way to the top of Motion
Mountain. We will start with an overview, and then explore the details of each approach.
— The first global approach to motion arises from a limitation of what we have learned
so far. When we predict the motion of a particle from its current acceleration, we are
using the most local description of motion possible. For example, whenever we use
an evolution equation we use the acceleration of a particle at a certain place and time
to determine its position and motion just after that moment and in the immediate
neighbourhood of that place.
Evolution equations thus have a mental ’horizon’ of radius zero.
Copyright © Christoph Schiller November 1997–January 2007
The opposite approach is illustrated in the famous problem of Figure 91. The chal-
lenge is to find the path that allows the fastest possible gliding motion from a high
Challenge 343 d point to a distant low point. To solve this we need to consider the motion as a whole,
for all times and positions. The global approach required by questions such as this one
will lead us to a description of motion which is simple, precise and fascinating: the
so-called principle of cosmic laziness, also known as the principle of least action.
— The second global approach to motion emerges when we compare the various descrip-
tions of the same system produced by different observers. For example, the observa-
tions by somebody falling from a cliff, a passenger in a roller coaster, and an observer
* Navigare necesse, vivere non necesse. ‘To navigate is necessary, to live is not.’ Gnaeus Pompeius Magnus
Ref. 145 (106–48 bce ), as cited by Plutarchus (c. 45 to c. 125).
176 i galilean motion • 3. the global simplicity of complexity
bicycle rope
wheel
a b b
F C P
a b b
on the ground will usually differ. The relationships between these observations lead
us to a global description, valid for everybody. This approach leads us to the theory of
relativity.
— The third global approach to motion is to exploring the motion of extended and ri-
gid bodies, rather than mass points. The counter-intuitive result of the experiment in
Figure 93 shows why this is worthwhile.
In order to design machines, it is essential to understand how a group of rigid bod-
ies interact with one another. As an example, the mechanism in Figure 94 connects
the motion of points C and P. It implicitly defines a circle such that one always has the
relation r C = 1~r P between the distances of C and P from its centre. Can you find that
Challenge 344 ny circle?
Ref. 146 Another famous challenge is to devise a wooden carriage, with gearwheels that
connect the wheels to an arrow in such a way that whatever path the carriage takes,
Copyright © Christoph Schiller November 1997–January 2007
Challenge 345 d the arrow always points south (see Figure 95). The solution to this is useful in helping
us to understand general relativity, as we will see.
Another interesting example of rigid motion is the way that human movements,
such as the general motions of an arm, are composed from a small number of basic
Ref. 147 motions. All these examples are from the fascinating field of engineering; unfortu-
nately, we will have little time to explore this topic in our hike.
— The fourth global approach to motion is the description of non-rigid extended bodies.
For example, fluid mechanics studies the flow of fluids (like honey, water or air) around
solid bodies (like spoons, ships, sails or wings). Fluid mechanics thus seeks to explain
how insects, birds and aeroplanes fly,* why sailing-boats can sail against the wind, what
* The mechanisms of insect flight are still a subject of active research. Traditionally, fluid dynamics has
the global simplicity of complexity 177
? or ?
Ref. 148 happens when a hard-boiled egg is made to spin on a thin layer of water, or how a bottle
Challenge 346 n full of wine can be emptied in the fastest way possible.
As well as fluids, we can study the behaviour of deformable solids. This area of re-
search is called continuum mechanics. It deals with deformations and oscillations of
extended structures. It seeks to explain, for example, why bells are made in particular
Challenge 347 n shapes; how large bodies – such as falling chimneys – break when under stress; and
how cats can turn themselves the right way up as they fall. During the course of our
journey we will repeatedly encounter issues from this field, which impinges even upon
general relativity and the world of elementary particles.
— The fifth global approach to motion is the study of the motion of huge numbers of
particles. This is called statistical mechanics. The concepts needed to describe gases,
such as temperature and pressure (see Figure 97), will be our first steps towards the
understanding of black holes.
Copyright © Christoph Schiller November 1997–January 2007
— The sixth global approach to motion involves all of the above-mentioned view-
points at the same time. Such an approach is needed to understand everyday ex-
perience, and life itself. Why does a flower form a specific number of petals? How
does an embryo differentiate in the womb? What makes our hearts beat? How do
Another nice mountains ridges and cloud patterns emerge? How do stars and galaxies evolve?
use of How are sea waves formed by the wind?
typography
concentrated on large systems, like boats, ships and aeroplanes. Indeed, the smallest human-made object
that can fly in a controlled way – say, a radio-controlled plane or helicopter – is much larger and heavier
than many flying objects that evolution has engineered. It turns out that controlling the flight of small things
requires more knowledge and more tricks than controlling the flight of large things. There is more about this
topic on page 991.
178 i galilean motion • 3. the global simplicity of complexity
All these are examples of self-organization; life scientists simply speak of growth.
Whatever we call these processes, they are characterized by the spontaneous appear-
ance of patterns, shapes and cycles. Such processes are a common research theme
across many disciplines, including biology, chemistry, medicine, geology and engin-
eering.
We will now give a short introduction to these six global approaches to motion. We will
begin with the first approach, namely, the global description of moving point-like ob-
jects. The beautiful method described below was the result of several centuries of collect-
ive effort, and is the highlight of mechanics. It also provides the basis for all the further
descriptions of motion that we will meet later on.
* Note that this ‘action’ is not the same as the ‘action’ appearing in statements such as ‘every action has an
equal and opposite reaction’. This last usage, coined by Newton, has not stuck; therefore the term has been
recycled. After Newton, the term ‘action’ was first used with an intermediate meaning, before it was finally
given the modern meaning used here. This last meaning is the only meaning used in this text.
Another term that has been recycled is the ‘principle of least action’. In old books it used to have a different
meaning from the one in this chapter. Nowadays, it refers to what used to be called Hamilton’s principle in
the Anglo-Saxon world, even though it is (mostly) due to others, especially Leibniz. The old names and
meanings are falling into disuse and are not continued here.
Behind these shifts in terminology is the story of an intense two-centuries-long attempt to describe mo-
tion with so-called extremal or variational principles: the objective was to complete and improve the work
initiated by Leibniz. These principles are only of historical interest today, because all are special cases of the
Ref. 149 principle of least action described here.
measuring change with action 179
Change A p p r ox i m at e a c t i o n
va l u e
average L
integral
∫ L(t)dt
Copyright © Christoph Schiller November 1997–January 2007
t
∆t t m
ti tf
elapsed time
F I G U R E 99 Defining a total effect as an
accumulation (addition, or integral) of small effects
over time
180 i galilean motion • 3. the global simplicity of complexity
f
L(t) dt = lim Q L(t m )∆t = L ë (t f − t i ) .
tf
∫ti ∆t 0 m=i
(58)
In other words, the integral is the limit, as the time slices get smaller, of the sum of the
areas of the individual rectangular strips that approximate the function.** Since the P
sign also means a sum, and since an infinitesimal ∆t is written dt, we can understand the
notation used for integration. Integration is a sum over slices. The notation was developed
by Gottfried Leibniz to make exactly this point. Physically speaking, the integral of the
Lagrangian measures the effect that L builds up over time. Indeed, action is called ‘effect’
in some languages, such as German. Copyright © Christoph Schiller November 1997–January 2007
In short, then, action is the integral of the Lagrangian over time. The unit of action,
and thus of physical change, is the unit of energy (the Joule), times the unit of time (the
second). Thus change is measured in Js. A large value means a big change. Table 22 shows
some approximate values of actions.
* It is named after Giuseppe Lodovico Lagrangia (b. 1736 Torino, d. 1813 Paris), better known as
Joseph Louis Lagrange. He was the most important mathematician of his time; he started his career in Turin,
then worked for 20 years in Berlin, and finally for 26 years in Paris. Among other things he worked on num-
ber theory and analytical mechanics, where he developed most of the mathematical tools used nowadays for
calculations in classical mechanics and classical gravitation. He applied them successfully to many motions
in the solar system.
** For more details on integration see Appendix D.
measuring change with action 181
F I G U R E 100 The
minimum of a curve has
vanishing slope
To understand the definition of action in more detail, we will start with the simplest
case: a system for which the potential energy is zero, such as a particle moving freely. Ob-
viously, a large kinetic energy means a lot of change. If we observe the particle at two in-
* In fact, in some pathological situations the action is maximal, so that the snobbish form of the principle is
that the action is ‘stationary,’ or an ‘extremum,’ meaning minimal or maximal. The condition of vanishing
variation, given below, encompasses both cases.
182 i galilean motion • 3. the global simplicity of complexity
When gravity is present, a thrown stone flies along a parabola (or more precisely, along
an ellipse) because any other path, say one in which the stone makes a loop in the air,
Challenge 351 e would imply a larger action. Again you might want to verify this for yourself.
All observations support this simple and basic statement: things always move in a way
that produces the smallest possible value for the action. This statement applies to the full
path and to any of its segments. Betrand Russell called it the ‘law of cosmic laziness’.
It is customary to express the idea of minimal change in a different way. The action
varies when the path is varied. The actual path is the one with the smallest action. You will
recall from school that at a minimum the derivative of a quantity vanishes: a minimum
has a horizontal slope. In the present case, we do not vary a quantity, but a complete path;
hence we do not speak of a derivative or slope, but of a variation. It is customary to write
the variation of action as δS. The principle of least action thus states:
The quantity being varied has no name; it represents a generalized notion of change. You might want to
Challenge 352 ny check that it leads to the correct evolution equations. Thus, although proper Lagrangian descriptions exist
only for conservative systems, for dissipative systems the principle can be generalized and remains useful.
Many physicists will prefer another approach. What a mathematician calls a generalization is a special
case for a physicist: the principle (61) hides the fact that all friction results from the usual principle of min-
Copyright © Christoph Schiller November 1997–January 2007
imal action, if we include the complete microscopic details. There is no friction in the microscopic domain.
Friction is an approximate, macroscopic concept.
Nevertheless, more mathematical viewpoints are useful. For example, they lead to interesting limitations
for the use of Lagrangians. These limitations, which apply only if the world is viewed as purely classical –
which it isn’t – were discovered about a hundred years ago. In those times computers where not available,
and the exploration of new calculation techniques was important. Here is a summary.
The coordinates used in connection with Lagrangians are not necessarily the Cartesian ones. Generalized
coordinates are especially useful when there are constraints on the motion. This is the case for a pendulum,
where the weight always has to be at the same distance from the suspension, or for an ice skater, where the
Ref. 150 skate has to move in the direction in which it is pointing. Generalized coordinates may even be mixtures of
positions and momenta. They can be divided into a few general types.
Generalized coordinates are called holonomic–scleronomic if they are related to Cartesian coordinates in
a fixed way, independently of time: physical systems described by such coordinates include the pendulum
and a particle in a potential. Coordinates are called holonomic–rheonomic if the dependence involves time.
measuring change with action 183
of variations. The condition δS = 0 implies that the action, i.e. the area under the curve
in Figure 99, is a minimum. A little bit of thinking shows that if the Lagrangian is of the
Challenge 353 ny form L(x n , v n ) = T(v n ) − U(x n ), then
=
d ∂T ∂U
(62)
dt ∂v n ∂x n
where n counts all coordinates of all particles.* For a single particle, these Lagrange’s equa-
Challenge 354 e tions of motion reduce to
ma = ∇U . (64)
This is the evolution equation: it says that the force on a particle is the gradient of the
potential energy U. The principle of least action thus implies the equation of motion.
Challenge 355 n (Can you show the converse?)
An example of a rheonomic systems would be a pendulum whose length depends on time. The two terms
Page 249 rheonomic and scleronomic are due to Ludwig Boltzmann. These two cases, which concern systems that are
only described by their geometry, are grouped together as holonomic systems. The term is due to Heinrich
Page 570 Hertz.
The more general situation is called anholonomic, or nonholonomic. Lagrangians work well only for holo-
nomic systems. Unfortunately, the meaning of the term ‘nonholonomic’ has changed. Nowadays, the term
is also used for certain rheonomic systems. The modern use calls nonholonomic any system which involves
velocities. Therefore, an ice skater or a rolling disc is often called a nonholonomic system. Care is thus ne-
cessary to decide what is meant by nonholonomic in any particular context.
Even though the use of Lagrangians, and of action, has its limitations, these need not bother us at micro-
Copyright © Christoph Schiller November 1997–January 2007
scopic level, since microscopic systems are always conservative, holonomic and scleronomic. At the funda-
mental level, evolution equations and Lagrangians are indeed equivalent.
* The most general form for a Lagrangian L(q n , q̇ n , t), using generalized holonomic coordinates q n , leads
to Lagrange equations of the form
=
d ∂L ∂L
. (63)
dt ∂ q̇ n ∂q n
In order to deduce these equations, we also need the relation δ q̇ = d~dt(δq). This relation is valid only for
holonomic coordinates introduced in the previous footnote and explains their importance.
It should also be noted that the Lagrangian for a moving system is not unique; however, the study of how
Ref. 151 the various Lagrangians for a given moving system are related is not part of this walk.
By the way, the letter q for position and p for momentum were introduced in physics by the mathem-
atician Carl Jacobi (b. 1804 Potsdam, d. 1851 Berlin).
** This idea was ridiculed by the French philosopher Voltaire (1694–1778) in his lucid writings, notably in
184 i galilean motion • 3. the global simplicity of complexity
we should still be able to feel the fascination of the issue. Leibniz was so excited about the
principle of least action because it was the first time that actual observations were distin-
guished from all other imaginable possibilities. For the first time, the search for reasons
why things are the way they are became a part of physical investigation. Could the world
be different from what it is? In the principle of least action, we have a hint of a negative
Challenge 357 n answer. (What do you think?) The final answer will emerge only in the last part of our
adventure.
As a way to describe motion, the Lagrangian has several advantages over the evolu-
tion equation. First of all, the Lagrangian is usually more compact than writing the corres-
ponding evolution equations. For example, only one Lagrangian is needed for one system,
however many particles it includes. One makes fewer mistakes, especially sign mistakes,
as one rapidly learns when performing calculations. Just try to write down the evolution
equations for a chain of masses connected by springs; then compare the effort with a deriv-
Challenge 358 ny ation using a Lagrangian. (The system behaves like a chain of atoms.) We will encounter
the brilliant book Candide, written in 1759, and still widely available.
measuring change with action 185
The principle of least action states that the action is minimal when the end point of
Ref. 152 the motion, and in particular the time between them, are fixed. It is less well known that
the reciprocal principle also holds: if the action is kept fixed, the elapsed time is maximal.
Challenge 360 ny Can you show this?
Even though the principle of least action is not an explanation of motion, it somehow
calls for one. We need some patience, though. Why nature follows the principle of least
action, and how it does so, will become clear when we explore quantum theory.
Never confuse movement with action.
Ref. 153
”
The optimist thinks this is the best of all
charges, and the strong nuclear interaction. Similarly, only three types of repulsion are
observed: rotation, pressure, and the Pauli exclusion principle (which we will encounter
Page 806 later on). Of the nine possible combinations of attraction and repulsion, not all appear in
Challenge 361 n nature. Can you find out which ones are missing from Figure 101 and Table 23, and why?
Together, attraction, friction and repulsion imply that change and action are minim-
ized when objects come and stay together. The principle of least action thus implies the
stability of aggregates. By the way, formation history also explains why so many aggreg-
Challenge 362 ny ates rotate. Can you tell why?
¼
Page 1173 * The Planck mass is given by m Pl = ħc~G = 21.767(16) µg.
** Figure 101 suggests that domains beyond physics exist; we will discover later on that this is not the case,
as mass and size are not definable in those domains.
186 i galilean motion • 3. the global simplicity of complexity
universe
mass
[kg]
galaxy
1040 black
holes star cluster
Sun
it
lim
Earth
e
neutron
ol
1020
kh
bl :
e ce
star
th ien
ac
nd sc
beyond science: beyond Planck length limit
mountain
yo nd
Interesting
in
diagram
ter l
mat
human
100
mon
com
cell
heavy DNA
10-20 nucleus
uranium
muon hydrogen
proton
electron
Aggregates
m
10-40 neutrino
icr
os
co
pi
ca
gg
Elementary
re
ga
particles lightest
te
lim
10 -60 imaginable
it
aggregate
But why does friction exist at all? And why do attractive and repulsive interactions
exist? And why is it – as it would appear from the above – that in some distant past matter
Copyright © Christoph Schiller November 1997–January 2007
was not found in lumps? In order to answer these questions, we must first study another
global property of motion: symmetry.
A g g r e g at e Size O b s. C o n st i t u e n t s
(diameter) num.
A g g r e g at e Size O b s. C o n st i t u e n t s
(diameter) num.
A g g r e g at e Size O b s. C o n st i t u e n t s
(diameter) num.
exposition of mechanics without a single figure. Obviously the book was difficult to read
and was not a sales success. Therefore his methods took another generation to come into
general use.
**
Given that action is the basic quantity describing motion, we can define energy as action
per unit time, and momentum as action per unit distance. The energy of a system thus
describes how much it changes over time, and the momentum how much it changes over
Challenge 363 n distance. What are angular momentum and rotational energy?
**
‘In nature, effects of telekinesis or prayer are impossible, as in most cases the change inside
measuring change with action 189
the brain is much smaller than the change claimed in the outside world.’ Is this argument
Challenge 364 n correct?
**
In Galilean physics, the Lagrangian is the difference between kinetic and potential energy.
Later on, this definition will be generalized in a way that sharpens our understanding of
this distinction: the Lagrangian becomes the difference between a term for free particles
and a term due to their interactions. In other words, particle motion is a continuous
compromise between what the particle would do if it were free and what other particles
want it to do. In this respect, particles behave a lot like humans beings.
**
Challenge 365 ny Explain: why is T + U constant, whereas T − U is minimal?
ism via a branched network of vessels: a few large ones, and increasingly many smaller
ones. Secondly, the vessels all have the same minimum size. And thirdly, the networks
are optimized in order to minimize the energy needed for transport. Together, these rela-
tions explain many additional scaling rules; they might also explain why animal lifespan
scales as m−1~4 , or why most mammals have roughly the same number of heart beats in
a lifetime.
A competing explanation, using a different minimization principle, states that quarter
powers arise in any network built in order that the flow arrives to the destination by the
Ref. 158 most direct path.
* The rest of the mass comes from the CO2 in the air.
190 i galilean motion • 3. the global simplicity of complexity
α air
water
F I G U R E 102 Refraction of
light is due to travel-time
optimization
**
**
In Galilean physics, the value of the action depends on the speed of the observer, but not
on his position or orientation. But the action, when properly defined, should not depend Copyright © Christoph Schiller November 1997–January 2007
on the observer. All observers should agree on the value of the observed change. Only
special relativity will fulfil the requirement that action be independent of the observer’s
Challenge 372 n speed. How will the relativistic action be defined?
**
Measuring all the change that is going on in the universe presupposes that the universe
Challenge 373 n is a physical system. Is this the case?
**
One motion for which action is particularly well minimized in nature is dear to us: walk-
Ref. 159 ing. Extensive research efforts try to design robots which copy the energy saving func-
tioning and control of human legs. For an example, see the website by Tao Geng at http://
motion and symmetry 191
www.cn.stir.ac.uk/~tgeng/research.html.
Challenge 374 n ability to talk about it! Moreover, the symmetry of nature is considerably higher than that
of a forget-me-not. We will discover that this high symmetry is at the basis of the famous
expression E 0 = mc 2 .
Ref. 160
“ apparent.
Heraclitus of Ephesos, about 500 bce
Why can we understand somebody when he is talking about the world, even though we ”
are not in his shoes? We can for two reasons: because most things look similar from differ-
ent viewpoints, and because most of us have already had similar experiences beforehand.
192 i galilean motion • 3. the global simplicity of complexity
‘Similar’ means that what we and what others observe somehow correspond. In other
words, many aspects of observations do not depend on viewpoint. For example, the num-
ber of petals of a flower has the same value for all observers. We can therefore say that this
quantity has the highest possible symmetry. We will see below that mass is another such
example. Observables with the highest possible symmetry are called scalars in physics.
Other aspects change from observer to observer. For example, the apparent size varies
with the distance of observation. However, the actual size is observer-independent. In
general terms, any type of viewpoint-independence is a form of symmetry, and the obser-
vation that two people looking at the same thing from different viewpoints can under-
stand each other proves that nature is symmetric. We start to explore the details of this
symmetry in this section and we will continue during most of the rest of our hike.
In the world around us, we note another general property: not only does the same
phenomenon look similar to different observers, but different phenomena look similar
to the same observer. For example, we know that if fire burns the finger in the kitchen, it
Viewpoints
Tolerance ... is the suspicion that the other
“ might be right.
Kurt Tucholski (1890–1935), German writer
Tolerance – a strength one mainly wishes to
”
“ political opponents.
Wolfram Weidner (b. 1925) German journalist
When a young human starts to meet other people in childhood, it quickly finds out that ” Copyright © Christoph Schiller November 1997–January 2007
certain experiences are shared, while others, such as dreams, are not. Learning to make
this distinction is one of the adventures of human life. In these pages, we concentrate
on a section of the first type of experiences: physical observations. However, even among
these, distinctions are to be made. In daily life we are used to assuming that weights,
volumes, lengths and time intervals are independent of the viewpoint of the observer.
We can talk about these observed quantities to anybody, and there are no disagreements
over their values, provided they have been measured correctly. However, other quantities
do depend on the observer. Imagine talking to a friend after he jumped from one of the
trees along our path, while he is still falling downwards. He will say that the forest floor
is approaching with high speed, whereas the observer below will maintain that the floor
is stationary. Obviously, the difference between the statements is due to their different
motion and symmetry 193
viewpoints. The velocity of an object (in this example that of the forest floor or of the
friend himself) is thus a less symmetric property than weight or size. Not all observers
agree on its value.
In the case of viewpoint-dependent observations, understanding is still possible with
the help of a little effort: each observer can imagine observing from the point of view of
the other, and check whether the imagined result agrees with the statement of the other.*
If the statement thus imagined and the actual statement of the other observer agree, the
observations are consistent, and the difference in statements is due only to the different
viewpoints; otherwise, the difference is fundamental, and they cannot agree or talk. Using
this approach, you can even argue whether human feelings, judgements, or tastes arise
Challenge 376 n from fundamental differences or not.
The distinction between viewpoint-independent (invariant) and viewpoint-
dependent quantities is an essential one. Invariant quantities, such as mass or shape,
describe intrinsic properties, and quantities depending on the observer make up the
is that of the behaviour of a system under exchange of its parts. The associated invariance
is called permutation symmetry. It is a discrete symmetry, and we will encounter it in the
second part of our adventure.
The three consistency requirements described above are called ‘principles’ because
these basic statements are so strong that they almost completely determine the ‘laws’ of
physics, as we will see shortly. Later on we will discover that looking for a complete de-
scription of the state of objects will also yield a complete description of their intrinsic
properties. But enough of introduction: let us come to the heart of the topic.
* Humans develop the ability to imagine that others can be in situations different from their own at the
Ref. 161 age of about four years. Therefore, before the age of four, humans are unable to conceive special relativity;
afterwards, they can.
194 i galilean motion • 3. the global simplicity of complexity
associativity, i.e. (a X b) X c = a X (b X c)
Any set that fulfils these three defining properties, or axioms, is called a (mathematical)
group. Historically, the notion of group was the first example of a mathematical structure
which was defined in a completely abstract manner.* Can you give an example of a group
Challenge 377 n taken from daily life? Groups appear frequently in physics and mathematics, because
Ref. 162 symmetries are almost everywhere, as we will see.** Can you list the symmetry operations
Challenge 378 n of the pattern of Figure 104?
Representations
Looking at a symmetric and composed system such as the one shown in Figure 104, we
Challenge 379 e notice that each of its parts, for example each red patch, belongs to a set of similar objects,
usually called a multiplet. Taken as a whole, the multiplet has (at least) the symmetry
properties of the whole system. For some of the coloured patches in Figure 104 we need
four objects to make up a full multiplet, whereas for others we need two, or only one, as
in the case of the central star. In fact, in any symmetric system each part can be classified
according to what type of multiplet it belongs to. Throughout our mountain ascent we will
perform the same classification with every part of nature, with ever-increasing precision.
Copyright © Christoph Schiller November 1997–January 2007
A multiplet is a set of parts that transform into each other under all symmetry trans-
formations. Mathematicians often call abstract multiplets representations. By specifying
* The term is due to Evariste Galois (1811–1832), the structure to Augustin-Louis Cauchy (1789–1857) and
the axiomatic definition to Arthur Cayley (1821–1895).
** In principle, mathematical groups need not be symmetry groups; but it can be proven that all groups can
be seen as transformation groups on some suitably defined mathematical space, so that in mathematics we
can use the terms ‘symmetry group’ and ‘group’ interchangeably.
A group is called Abelian if its concatenation operation is commutative, i.e. if a X b = b X a for all pairs of
elements a and b. In this case the concatenation is sometimes called addition. Do rotations form an abelian
group?
A subset G 1 ⊂ G of a group G can itself be a group; one then calls it a subgroup and often says sloppily
that G is larger than G 1 or that G is a higher symmetry group than G 1 .
motion and symmetry 195
F I G U R E 104 A Hispano–Arabic ornament from the Governor’s Palace in Sevilla (© Christoph Schiller)
D(refl) = ,
0 1
(66)
1 0
since every point (x, y) becomes transformed to (y, x) when multiplied by the matrix
Challenge 380 e D(refl). Therefore, for a mathematician a representation of a symmetry group G is an
assignment of a matrix D(a) to each group element a such that the representation of
the concatenation of two elements a and b is the product of the representations D of the
196 i galilean motion • 3. the global simplicity of complexity
elements:
D(a X b) = D(a)D(b) . (67)
For example, the matrix of equation (66), together with the corresponding matrices for
all the other symmetry operations, have this property.*
For every symmetry group, the construction and classification of all possible represent-
ations is an important task. It corresponds to the classification of all possible multiplets a
symmetric system can be made of. In this way, understanding the classification of all mul-
tiplets and parts which can appear in Figure 104 will teach us how to classify all possible
parts of which an object or an example of motion can be composed!
A representation D is called unitary if all matrices D(a) are unitary.** Almost all rep-
resentations appearing in physics, with only a handful of exceptions, are unitary: this
term is the most restrictive, since it specifies that the corresponding transformations are
one-to-one and invertible, which means that one observer never sees more or less than
* There are some obvious, but important, side conditions for a representation: the matrices D(a) must be
invertible, or non-singular, and the identity operation of G must be mapped to the unit matrix. In even more
compact language one says that a representation is a homomorphism from G into the group of non-singular
or invertible matrices. A matrix D is invertible if its determinant det D is not zero.
In general, if a mapping f from a group G to another G ′ satisfies
the mapping f is called an homomorphism. A homomorphism f that is one-to-one (injective) and onto
Copyright © Christoph Schiller November 1997–January 2007
(surjective) is called a isomorphism. If a representation is also injective, it is called faithful, true or proper.
In the same way as groups, more complex mathematical structures such as rings, fields and associative
algebras may also be represented by suitable classes of matrices. A representation of the field of complex
numbers is given in Appendix D.
** The transpose AT of a matrix A is defined element-by-element by (AT )ik = A ki . The complex conjugate A
of a matrix A is defined by (A )ik = (A ik ) . The adjoint A† of a matrix A is defined by A† = (AT ) . A matrix
is called symmetric if AT = A, orthogonal if AT = A−1 , Hermitean or self-adjoint (the two are synonymous
in all physical applications) if A† = A (Hermitean matrices have real eigenvalues), and unitary if A† = A−1 .
Unitary matrices have eigenvalues of norm one. Multiplication by a unitary matrix is a one-to-one mapping;
since the time evolution of physical systems is a mapping from one time to another, evolution is always
described by a unitary matrix. A real matrix obeys A = A, an antisymmetric or skew-symmetric matrix
is defined by AT = −A, an anti-Hermitean matrix by A† = −A and an anti-unitary matrix by A† = −A−1 .
All the mappings described by these special types of matrices are one-to-one. A matrix is singular, i.e. not
one-to-one, if det A = 0.
motion and symmetry 197
With these concepts we are ready to talk about motion with improved precision.
be possible for motion as well. We deduced the classification of the ribbons in the Arabic
pattern into singlets, doublets, etc. from the various possible observation viewpoints. For
a moving system, the building blocks, corresponding to the ribbons, are the observables.
Since we observe that nature is symmetric under many different changes of viewpoint,
we can classify all observables. To do so, we need to take the list of all viewpoint trans-
formations and deduce the list of all their representations.
Our everyday life shows that the world stays unchanged after changes in position, ori-
entation and instant of observation. One also speaks of space translation invariance, rota-
tion invariance and time translation invariance. These transformations are different from
those of the Arabic pattern in two respects: they are continuous and they are unboun-
ded. As a result, their representations will generally be continuously variable and without
bounds: they will be quantities or magnitudes. In other words, observables will be con-
structed with numbers. In this way we have deduced why numbers are necessary for any
description of motion.*
Earth is not a vector, since that place does not stay fixed with respect to the environment,
when the observer changes.
Mathematicians say that vectors are directed entities staying invariant under coordin-
ate transformations. Velocities of objects, accelerations and field strength are examples of
Challenge 385 e vectors. (Can you confirm this?) The magnitude of a vector is a scalar: it is the same for
any observer. By the way, a famous and baffling result of nineteenth-century experiments
is that the velocity of light is not a vector for Galilean transformations. This mystery will
be solved shortly.
* Only scalars, in contrast to vectors and higher-order tensors, may also be quantities which only take a
Challenge 383 e discrete set of values, such as +1 or −1 only. In short, only scalars may be discrete observables.
** Later on, spinors will be added to, and complete, this list.
motion and symmetry 199
Tensors are generalized vectors. As an example, take the moment of inertia of an object.
Page 90 It specifies the dependence of the angular momentum on the angular velocity. For any
object, doubling the magnitude of angular velocity doubles the magnitude of angular
momentum; however, the two vectors are not parallel to each other if the object is not a
Page 113 sphere. In general, if any two vector quantities are proportional, in the sense that doubling
the magnitude of one vector doubles the magnitude of the other, but without the two
vectors being parallel to each other, then the proportionality ‘factor’ is a (second order)
tensor. Like all proportionality factors, tensors have a magnitude. In addition, tensors
have a direction and a shape: they describe the connection between the vectors they relate.
Just as vectors are the simplest quantities with a magnitude and a direction, so tensors
are the simplest quantities with a magnitude and with a direction depending on a second,
chosen direction. Vectors can be visualized as oriented arrows; tensors can be visualized
Challenge 387 n as oriented ellipsoids.* Can you name another example of tensor?
Let us get back to the description of motion. Table 24 shows that in physical systems we
L = α a i b i + β c jk d jk + γ e l mn f l mn + ... (70)
where the indices attached to the variables a, b, c etc. always come in matching pairs to be
summed over. (Therefore summation signs are usually simply left out.) The Greek letters
represent constants. For example, the action of a free point particle in Galilean physics
was given as
S = L dt =
m
∫ 2
v 2 dt ∫ (71)
which is indeed of the form just mentioned. We will encounter many other cases during
our study of motion.**
Galileo already understood that motion is also invariant under change of viewpoints
Page 87 with different velocity. However, the action just given does not reflect this. It took some Copyright © Christoph Schiller November 1997–January 2007
* A rank-n tensor is the proportionality factor between a rank-1 tensor, i.e. between a vector, and an rank-
(n − 1) tensor. Vectors and scalars are rank 1 and rank 0 tensors. Scalars can be pictured as spheres, vectors
as arrows, and rank-2 tensors as ellipsoids. Tensors of higher rank correspond to more and more complex
shapes.
A vector has the same length and direction for every observer; a tensor (of rank 2) has the same determ-
inant, the same trace, and the same sum of diagonal subdeterminants for all observers.
A vector is described mathematically by a list of components; a tensor (of rank 2) is described by a matrix
of components. The rank or order of a tensor thus gives the number of indices the observable has. Can you
Challenge 386 e show this?
** By the way, is the usual list of possible observation viewpoints – namely different positions, different
observation instants, different orientations, and different velocities – also complete for the action (71)? Sur-
Ref. 163 prisingly, the answer is no. One of the first who noted this fact was Niederer, in 1972. Studying the quantum
theory of point particles, he found that even the action of a Galilean free point particle is invariant under
200 i galilean motion • 3. the global simplicity of complexity
years to find out the correct generalization: it is given by the theory of special relativity.
But before we study it, we need to finish the present topic.
“ science.
The reproducibility of observations, i.e. the symmetry under change of instant of time
Graffito
”
or ‘time translation invariance’, is a case of viewpoint-independence. (That is not obvi-
Challenge 389 ny ous; can you find its irreducible representations?) The connection has several important
consequences. We have seen that symmetry implies invariance. It turns out that for con-
tinuous symmetries, such as time translation symmetry, this statement can be made more
precise: for any continuous symmetry of the Lagrangian there is an associated conserved
constant of motion and vice versa. The exact formulation of this connection is the the-
x ( x+b (74)
some additional transformations. If the two observers use the coordinates (t, x) and (τ, ξ), the action (71)
Challenge 388 ny is invariant under the transformations
Rx + x0 + vt αt + β T
ξ= and τ= with R R=1 and αδ − βγ = 1 . (72)
γt + δ γt + δ
where R describes the rotation from the orientation of one observer to the other, v the velocity between the
two observers, and x0 the vector between the two origins at time zero. This group contains two important
Copyright © Christoph Schiller November 1997–January 2007
The latter, three-parameter group includes spatial inversion, dilations, time translation and a set of time-
dependent transformations such as ξ = x~t, τ = 1~t called expansions. Dilations and expansions are rarely
mentioned, as they are symmetries of point particles only, and do not apply to everyday objects and systems.
They will return to be of importance later on, however.
* Emmy Noether (b. 1882 Erlangen, d. 1935 Bryn Mayr), German mathematician. The theorem is only a
sideline in her career which she dedicated mostly to number theory. The theorem also applies to gauge
symmetries, where it states that to every gauge symmetry corresponds an identity of the equation of motion,
and vice versa.
motion and symmetry 201
= p = const ;
∂T
(76)
∂v
in short, symmetry under change of position implies conservation of momentum. The
converse is also true.
Challenge 390 ny In the case of symmetry under shift of observation instant, we find
T + U = const ; (77)
in other words, time translation invariance implies constant energy. Again, the converse
is also correct. One also says that energy and momentum are the generators of time and
space translations.
The conserved quantity for a continuous symmetry is sometimes called the Noether
overview of all the symmetries of nature we will encounter. Their main properties are also
listed. Except for those marked as ‘approximate’ or ‘speculative’, an experimental proof of
incorrectness of any of them would be a big surprise indeed.
202 i galilean motion • 3. the global simplicity of complexity
TA B L E 25 The symmetries of relativity and quantum theory with their properties; also the complete list
of logical inductions used in the two fields
Symmetry Ty pe S pa c e G r o u p Pos- Con- Va - Main
[num- of ac-topo- si b l e se r v e d cuum/ effect
ber of tion logy rep- q ua n t - m at -
pa r a - r e s e nt- ity ter is
met- ations sy m -
e r s] metric
spinors
Dynamic, interaction-dependent symmetries: gravity
1~r 2 gravity SO(4) config. as SO(4) vector pair perihelion yes/yes closed
[6 par.] space direction orbits
Diffeomorphism [ª par.] space- involved space- local yes/no perihelion
invariance time times energy– shift
momentum
Dynamic, classical and quantum-mechanical motion symmetries
motion and symmetry 203
SU(2)
[3 par.]
Colour gauge non- Hilbert as SU(3) coloured colour yes/yes massless
abelian Lie space quarks gluons
SU(3)
[8 par.]
Chiral discrete fermions discrete left, right helicity approxi- ‘massless’
symmetry mately fermions a
Permutation symmetries
204 i galilean motion • 3. the global simplicity of complexity
For details about the connection between symmetry and induction, see page 690. The explanation
of the terms in the table will be completed in the rest of the walk. The real numbers are denoted
as R.
a. Only approximate; ‘massless’ means that m m Pl , i.e. that m 22 µg.
b. N = 1 supersymmetry, but not N = 1 supergravity, is probably a good approximation for nature
Copyright © Christoph Schiller November 1997–January 2007
at everyday energies.
c. i = 1 .. N.
In summary, since we can talk about nature we can deduce several of its symmetries, in
particular its symmetry under time and space translations. From nature’s symmetries, us-
ing Noether’s theorem, we can deduce the conserved charges, such as energy or linear and
angular momentum. In other words, the definition of mass, space and time, together with
their symmetry properties, is equivalent to the conservation of energy and momentum.
Conservation and symmetry are two ways to express the same property of nature. To put it
simply, our ability to talk about nature means that energy and momentum are conserved.
In general, the most elegant way to uncover the ‘laws’ of nature is to search for nature’s
simple motions of extended bodies – oscill ations and waves 205
symmetries. In many historical cases, once this connection had been understood, phys-
ics made rapid progress. For example, Albert Einstein discovered the theory of relativity
in this way, and Paul Dirac started off quantum electrodynamics. We will use the same
method throughout our walk; in its third part we will uncover some symmetries which
are even more mind-boggling than those of relativity. Now, though, we will move on to
the next approach to a global description of motion.
L = mv 2 − kx 2 ,
1
(78)
2
where k is a quantity characterizing the spring, the so-called spring constant. The Lag-
Challenge 399 e rangian is due to Robert Hooke, in the seventeenth century. Can you confirm it?
The motion that results from this Lagrangian is periodic, as shown in Figure 105. The
Lagrangian describes the oscillation of the spring length. The motion is exactly the same
as that of a long pendulum. It is called harmonic motion, because an object vibrating
rapidly in this way produces a completely pure – or harmonic – musical sound. (The
musical instrument producing the purest harmonic waves is the transverse flute. This
instrument thus gives the best idea of how harmonic motion ‘sounds’.) The graph of a
206 i galilean motion • 3. the global simplicity of complexity
position
period T
amplitude A
phase φ
time
period T
O b s e r va t i o n Frequency
harmonic or linear oscillation, shown in Figure 105, is called a sine curve; it can be seen as
the basic building block of all oscillations. All other, non-harmonic oscillations in nature
Page 207 can be composed from sine curves, as we shall see shortly.
Every oscillating motion continuously transforms kinetic energy into potential energy
and vice versa. This is the case for the tides, the pendulum, or any radio receiver. But many
oscillations also diminish in time: they are damped. Systems with large damping, such as
the shock absorbers in cars, are used to avoid oscillations. Systems with small damping
are useful for making precise and long-running clocks. The simplest measure of damping
is the number of oscillations a system takes to reduce its amplitude to 1~e 1~2.718 times
the original value. This characteristic number is the so-called Q-factor, named after the
abbreviation of ‘quality factor’. A poor Q-factor is 1 or less, an extremely good one is
100 000 or more. (Can you write down a simple Lagrangian for a damped oscillation with
simple motions of extended bodies – oscill ations and waves 207
position
wavelength λ
crest or peak
amplitude A
Challenge 400 ny a given Q-factor?) In nature, damped oscillations do not usually keep constant frequency;
however, for the simple pendulum this remains the case to a high degree of accuracy.
The reason is that for a pendulum, the frequency does not depend significantly on the
amplitude (as long as the amplitude is smaller than about 20°). This is one reason why
pendulums are used as oscillators in mechanical clocks.
Obviously, for a good clock, the driving oscillation must not only show small damp-
ing, but must also be independent of temperature and be insensitive to other external
influences. An important development of the twentieth century was the introduction of Copyright © Christoph Schiller November 1997–January 2007
quartz crystals as oscillators. Technical quartzes are crystals of the size of a few grains of
sand; they can be made to oscillate by applying an electric signal. They have little temper-
ature dependence and a large Q-factor, and therefore low energy consumption, so that
precise clocks can now run on small batteries.
Every harmonic oscillation is described by three quantities: the amplitude, the period
(the inverse of the frequency) and the phase. The phase distinguishes oscillations of the
same amplitude and period; it defines at what time the oscillation starts. Figure 105 shows
how a harmonic oscillation is related to an imaginary rotation. As a result, the phase is
best described by an angle between 0 and 2π.
All systems that oscillate also emit waves. In fact, oscillations only appear in exten-
ded systems, and oscillations are only the simplest of motions of extended systems. The
general repetitive motion of an extended system is the wave.
208 i galilean motion • 3. the global simplicity of complexity
Wa v e Ve l o c i t y
city, a wavelength, and a phase, as can be deduced from Figure 106. Low-amplitude water
waves show this most clearly. In a harmonic wave, every position performs a harmonic
oscillation. The phase of a wave specifies the position of the wave (or a crest) at a given
time. It is an angle between 0 and 2π. How are frequency and wavelength related in a
Challenge 402 e wave?
Waves appear inside all extended bodies, be they solids, liquids, gases or plasmas. In-
side fluid bodies, waves are longitudinal, meaning that the wave motion is in the same
direction as the wave oscillation. Sound in air is an example of a longitudinal wave. Inside
solid bodies, waves can also be transverse; in that case the wave oscillation is perpendicu-
lar to the travelling direction.
Waves appear also on interfaces between bodies: water–air interfaces are a well-known
case. Even a saltwater–freshwater interface, so-called dead water, shows waves: they
simple motions of extended bodies – oscill ations and waves 209
can appear even if the upper surface of the water is immobile. Any flight in an aero-
plane provides an opportunity to study the regular cloud arrangements on the interface
between warm and cold air layers in the atmosphere. Seismic waves travelling along the
boundary between the sea floor and the sea water are also well-known. General surface
waves are usually neither longitudinal nor transverse, but of a mixed type.
On water surfaces, one classifies waves according to the force that restores the plane
where д is the acceleration due to gravity (and an amplitude much smaller than the
wavelength is assumed). The formula shows two limiting regimes. First, short or deep
waves appear when the water » depth is larger than half the wavelength; for deep waves,
the phase velocity is c дλ~2π , thus wavelength dependent, and the group velocity
» or
Copyright © Christoph Schiller November 1997–January 2007
is about half the phase velocity. Shorter deep waves are thus slower. Secondly, shallow
long waves appear when the depth is less than 5% of the wavelength; in this case, c дd ,
there is no dispersion, and the group velocity is about the same as the phase velocity.
The most impressive shallow waves are tsunamis, the large waves triggered by submarine
earthquakes. (The Japanese name is composed of tsu, meaning harbour, and nami, mean-
ing wave.) Since tsunamis are shallow waves, they show little dispersion and thus travel
over long distances; they can go round the Earth several times. Typical oscillation times
are between 6 and 60 minutes, giving wavelengths between 70 and 700 km and speeds in
Challenge 403 e the open sea of 200 to 250 m~s, similar to that of a jet plane. Their amplitude on the open
sea is often of the order of 10 cm; however, the amplitude scales with depth d as 1~d 4 and
heights up to 40 m have been measured at the shore. This was the order of magnitude of
the large and disastrous tsunami observed in the Indian Ocean on 26 December 2004.
210 i galilean motion • 3. the global simplicity of complexity
Waves can also exist in empty space. Both light and gravity waves are examples. The
exploration of electromagnetism and relativity will tell us more about their properties.
Any study of motion must include the study of wave motion. We know from experi-
ence that waves can hit or even damage targets; thus every wave carries energy and mo-
mentum, even though (on average) no matter moves along the wave propagation direc-
tion. The energy E of a wave is the sum of its kinetic and potential energy. The kinetic
energy (density) depends on the temporal change of the displacement u at a given spot:
rapidly changing waves carry a larger kinetic energy. The potential energy (density) de-
pends on the gradient of the displacement, i.e. on its spatial change: steep waves carry
a larger potential energy than shallow ones. (Can you explain why the potential energy
Challenge 404 n does not depend on the displacement itself?) For harmonic waves propagating along the
direction z, each type of energy is proportional to the square of its respective displace-
Ref. 166 ment change:
E ( )2 + v 2 ( )2 .
∂u ∂u
(80)
These effects, called superposition and interference, are strongly tied to the linearity of
most waves.
— Transverse waves in three dimensions can oscillate in different directions: they show
polarization.
— Waves, such as sound, can go around corners. This is called diffraction.
— Waves change direction when they change medium. This is called refraction.
— Waves can have a frequency-dependent propagation speed. This is called dispersion.
— Often, the wave amplitude decreases over time: waves show damping.
Material bodies in everyday life do not behave in these ways when they move. These six
wave effects appear because wave motion is the motion of extended entities. The famous
debate whether electrons or light are waves or particles thus requires us to check whether
simple motions of extended bodies – oscill ations and waves 211
Interference
Polarisation
Diffraction Refraction
these effects specific to waves can be observed or not. This is one topic of quantum theory.
Before we study it, can you give an example of an observation that implies that a motion
Challenge 408 n surely cannot be a wave?
As a result of having a frequency f and a propagation velocity v, all sine waves are
characterized by the distance λ between two neighbouring wave crests: this distance is
called the wavelength. All waves obey the basic relation
λf = v . (82)
Copyright © Christoph Schiller November 1997–January 2007
In many cases the wave velocity v depends on the wavelength of the wave. For example,
this is the case for water waves. This change of speed with wavelength is called dispersion.
In contrast, the speed of sound in air does not depend on the wavelength (to a high degree
of accuracy). Sound in air shows almost no dispersion. Indeed, if there were dispersion
for sound, we could not understand each other’s speech at larger distances.
In everyday life we do not experience light as a wave, because the wavelength is only
around one two-thousandth of a millimetre. But light shows all six effects typical of wave
Page 568 motion. A rainbow, for example, can only be understood fully when the last five wave
effects are taken into account. Diffraction and interference can even be observed with
Challenge 409 n your fingers only. Can you tell how?
Like every anharmonic oscillation, every anharmonic wave can be decomposed into
sine waves. Figure 106 gives examples. If the various sine waves contained in a disturbance
212 i galilean motion • 3. the global simplicity of complexity
propagate differently, the original wave will change in shape while it travels. That is the
reason why an echo does not sound exactly like the original sound; for the same reason,
a nearby thunder and a far-away one sound different.
All systems which oscillate also emit waves. Any radio or TV receiver contains oscillat-
ors. As a result, any such receiver is also a (weak) transmitter; indeed, in some countries
the authorities search for people who listen to radio without permission listening to the ra-
dio waves emitted by these devices. Also, inside the human ear, numerous tiny structures,
the hair cells, oscillate. As a result, the ear must also emit sound. This prediction, made
in 1948 by Tommy Gold, was confirmed only in 1979 by David Kemp. These so-called
otoacoustic emissions can be detected with sensitive microphones; they are presently be-
ing studied in order to unravel the still unknown workings of the ear and in order to
Ref. 167 diagnose various ear illnesses without the need for surgery.
Since any travelling disturbance can be decomposed into sine waves, the term ‘wave’ is
used by physicists for all travelling disturbances, whether they look like sine waves or not.
secondary
waves
primary envelope of
wave secondary waves
But the description of wave fronts as envelopes of secondary waves has an import-
ant limitation. It is not correct in two dimensions (even though Figure 109 is two-
dimensional!). In particular, it does not apply to water waves. Water wave propagation
cannot be calculated in this way in an exact manner. (It is only possible if the situation
is limited to a waves of a single frequency.) It turns out that for water waves, second-
ary waves do not only depend on the wave front of the primary waves, but depend also
on their interior. The reason is that in two (and other even) dimensions, waves of differ-
ent frequency necessarily have different speeds. And a stone falling into water generates
waves of many frequencies. In contrast, in three (and larger odd) dimensions, waves of
all frequencies have the same speed.
We can also say that Huygens’ principle holds if the wave equation is solved by a circu-
lar wave leaving no amplitude behind it. Mathematicians translate this by requiring that
the evolving delta function δ(c 2 t 2 − r 2 ) satisfies the wave equation, i.e. that ∂ 2t δ = c 2 ∆δ. Copyright © Christoph Schiller November 1997–January 2007
The delta function is that strange ‘function’ which is zero everywhere except at the origin,
where it is infinite. A few more properties describe the precise way in which this hap-
pens.* It turns out that the delta function is a solution of the wave equation only if the
space dimension is odd and at least three. In other words, while a spherical wave pulse is
possible, a circular pulse is not: there is no way to keep the centre of an expanding wave
quiet. (See Figure 110.) That is exactly what the stone experiment shows. You can try to
produce a circular pulse (a wave that has only a few crests) the next time you are in the
bathroom or near a lake: you will not succeed.
In summary, the reason a room gets dark when we switch off the light, is that we live
in a space with a number of dimensions which is odd and larger than one.
* The main property is ∫ δxdx = 1. In mathematically precise terms, the delta ‘function’ is a distribution.
214 i galilean motion • 3. the global simplicity of complexity
Signals
A signal is the transport of information. Every signal is motion of energy. Signals can
be either objects or waves. A thrown stone can be a signal, as can a whistle. Waves are a
more practical form of communication because they do not require transport of matter:
it is easier to use electricity in a telephone wire to transport a statement than to send a
messenger. Indeed, most modern technological advances can be traced to the separation
between signal and matter transport. Instead of transporting an orchestra to transmit
music, we can send radio signals. Instead of sending paper letters we write email messages.
Instead of going to the library we browse the internet.
The greatest advances in communication have resulted from the use of signals to
transport large amounts of energy. That is what electric cables do: they transport energy
without transporting any (noticeable) matter. We do not need to attach our kitchen ma-
chines to the power station: we can get the energy via a copper wire.
For all these reasons, the term ‘signal’ is often meant to imply waves only. Voice, sound,
Challenge 412 e test this relation with any wave in your environment.
Similarly, there is a relation between the position error ∆x and the wave vector error
∆k = 2π~∆λ of a signal:
∆x ∆k E .
1
(84)
2
Like the previous case, also this indeterminacy relation expresses that it is impossible
to specify both the position of a signal and its wavelength with full precision. Also this
position–wave-vector indeterminacy relation is a feature of any wave phenomenon.
Every indeterminacy relation is the consequence of a smallest entity. In the case of
waves, the smallest entity of the phenomenon is the period (or cycle, as it used to be
called). Whenever there is a smallest unit in a natural phenomenon, an indeterminacy
simple motions of extended bodies – oscill ations and waves 215
Figure to be included
relation results. We will encounter other indeterminacy relations both in relativity and in
quantum theory. As we will find out, they are due to smallest entities as well.
Whenever signals are sent, their content can be lost. Each of the six characteristics of
through the channel was suddenly stopped, a strange water wave departed from it. It
consisted of a single crest, about 10 m long and 0.5 m high, moving at about 4 m~s. He
followed that crest, shown in a reconstruction in Figure 112, with his horse for several
kilometres: the wave died out only very slowly. Russell did not observe any dispersion, as
is usual in water waves: the width of the crest remained constant. Russell then started pro-
Ref. 169 ducing such waves in his laboratory, and extensively studied their properties. He showed
that the speed depended on the amplitude, in contrast to linear, harmonic waves. He also
found that the depth d of the water canal was an important parameter. In fact, the speed
v, the amplitude A and the width L of these single-crested waves are related by
¾
¼
v= дd 1 + L=
A 4d 3
and . (85)
2d 3A
216 i galilean motion • 3. the global simplicity of complexity
As shown by these expressions, and noted by Russell, high waves are narrow and fast,
whereas shallow waves are slow and wide. The shape of the waves is fixed during their
motion. Today, these and all other stable waves with a single crest are called solitary waves.
They appear only where the dispersion and the nonlinearity of the system exactly com-
pensate for each other. Russell also noted that the solitary waves in water channels can
cross each other unchanged, even when travelling in opposite directions; solitary waves
with this property are called solitons. Solitons are stable against encounters, as shown in
Figure 113, whereas solitary waves in general are not.
simple motions of extended bodies – oscill ations and waves 217
Only sixty years later, in 1895, Korteweg and de Vries found out that solitary waves in
water channels have a shape described by
∂u d 2 ∂ 3 u
» + (1 + u) =0.
1 ∂u 3
+ (87)
дd ∂t 2d ∂x 6 ∂x 3
This equation for the elongation u is called the Korteweg–de Vries equation in their hon-
our.* The surprising stability of the solitary solutions is due to the opposite effect of the
two terms that distinguish the equation from linear wave equations: for the solitary solu-
of both concepts. For this reason, solitons are now an essential part of any description of
elementary particles, as we will find out later on.
”
* The equation can be simplified by transforming the variable u; most concisely, it can be rewritten as u t +
u x x x = 6uu x . As long as the solutions are sech functions, this and other transformed versions of the equation
are known by the same name.
218 i galilean motion • 3. the global simplicity of complexity
When the frequency of a tone is doubled, one says that the tone is higher by an octave.
Two tones that differ by an octave, when played together, sound pleasant to the ear. Two
other agreeable frequency ratios – or ‘intervals’, as musicians say – are quarts and quints.
Challenge 414 e What are the corresponding frequency ratios? (Note: the answer was one of the oldest
discoveries in physics; it is attributed to Pythagoras, around 500 bce.)
**
An orchestra is playing music in a large hall. At a distance of 30 m, somebody is listening
to the music. At a distance of 3000 km, another person is listening to the music via the
Challenge 415 n radio. Who hears the music first?
**
What is the period of a simple pendulum, i.e. a mass m attached to a massless string of
Challenge 416 ny length l? What is the period if the string is much longer than the radius of the Earth?
**
Light is a wave, as we will discover later on. As a result, light reaching the Earth from
space is refracted when it enters the atmosphere. Can you confirm that as a result, stars
Challenge 419 e appear somewhat higher in the night sky than they really are?
**
What are the highest sea waves? This question has been researched systematically only
Ref. 172 recently, using satellites. The surprising result is that sea waves with a height of 25 m and
more are common: there are a few such waves on the oceans at any given time. This result
confirms the rare stories of experienced ship captains and explains many otherwise ship
sinkings. Copyright © Christoph Schiller November 1997–January 2007
Surfers may thus get many chances to ride 30 m waves. (The record is just below this
size.) But maybe the most impressive waves to surf are those of the Pororoca, a series of
4 m waves that move from the sea into the Amazon River every spring, against the flow
of the river. These waves can be surfed for tens of kilometres.
**
All waves are damped, eventually. This effect is often frequency-dependent. Can you
Challenge 420 n provide a confirmation of this dependence in the case of sound in air?
**
When you make a hole with a needle in black paper, the hole can be used as a magnifying
Challenge 421 e lens. (Try it.) Diffraction is responsible for the lens effect. By the way, the diffraction of
simple motions of extended bodies – oscill ations and waves 219
air
air water
coin
light by holes was noted by Francesco Grimaldi in the seventeenth century; he deduced
that light is a wave. His observations were later discussed by Newton, who wrongly dis-
missed them.
**
Put an empty cup near a lamp, in such a way that the bottom of the cup remains in the
**
The group velocity of water waves (in deep water) is less than the velocity of the individual
waves. As a result, when a group of wave crests travels, within the group the crests move
from the back to the front, appearing at the back, travelling forward and then dying out
at the front.
**
One can hear the distant sea or a distant highway more clearly in the evening than in
the morning. This is an effect of refraction. Sound speed decreases with temperature. In
the evening, the ground cools more quickly than the air above. As a result, sound leaving
the ground and travelling upwards is refracted downwards, leading to the long hearing
220 i galilean motion • 3. the global simplicity of complexity
distance. In the morning, usually the air is cold above and warm below. Sound is refrac-
ted upwards, and distant sound does not reach a listener on the ground. Refraction thus
implies that mornings are quiet, and that one can hear more distant sounds in the even-
ings. Elephants use the sound situation during evenings to communicate over distances
of more than 10 km. (They also use sound waves in the ground to communicate, but that
is another story.)
**
Refraction also implies that there is a sound channel in the ocean, and in the atmosphere.
Sound speed decreases with temperature, and increases with pressure. At an ocean depth
of 1 km, or at an atmospheric height of 13 to 17 km (that is at the top of the tallest cu-
mulonimbus clouds or equivalently, at the middle of the ozone layer) sound has minimal
speed. As a result, sound that starts from that level and tries to leave is channelled back to
it. Whales use the sound channel to communicate with each other with beautiful songs;
**
On windy seas, the white wave crests have several important effects. The noise stems
from tiny exploding and imploding water bubbles. The noise of waves on the open sea Copyright © Christoph Schiller November 1997–January 2007
is thus the superposition of many small explosions. At the same time, white crests are
the events where the seas absorb carbon dioxide from the atmosphere, and thus reduce
global warming.
**
Challenge 425 n Why are there many small holes in the ceilings of many office buildings?
**
Which quantity determines the wavelength of water waves emitted when a stone is
Challenge 426 ny thrown into a pond?
**
simple motions of extended bodies – oscill ations and waves 221
Ref. 3 Yakov Perelman lists the following four problems in his delightful physics problem book.
(1) A stone falling into a lake produces circular waves. What is the shape of waves
Challenge 427 n produced by a stone falling into a river, where the water flows in one direction?
(2) It is possible to build a lens for sound, in the same way as it is possible to build
Challenge 428 n lenses for light. What would such a lens look like?
Challenge 429 ny (3) What is the sound heard inside a shell?
(4) Light takes about eight minutes to travel from the Sun to the Earth. What con-
Challenge 430 n sequence does this have for a sunrise?
**
Can you describe how a Rubik’s Cube is built? And its generalizations to higher numbers
Challenge 431 n of segments? Is there a limit to the number of segments? These puzzles are even tougher
than the search for a rearrangement of the cube. Similar puzzles can be found in the study
of many mechanisms, from robots to textile machines.
**
Infrasound, inaudible sound below 20 Hz, is a modern topic of research. In nature, infra-
sound is emitted by earthquakes, volcanic eruptions, wind, thunder, waterfalls, falling
meteorites and the surf. Glacier motion, seaquakes, avalanches and geomagnetic storms
Ref. 175 also emit infrasound. Human sources include missile launches, traffic, fuel engines and
air compressors.
It is known that high intensities of infrasound lead to vomiting or disturbances of the
sense of equilibrium (140 dB or more for 2 minutes), and even to death (170 dB for 10
minutes). The effects of lower intensities on human health are not yet known.
Infrasound can travel several times around the world before dying down, as the ex-
plosion of the Krakatoa volcano showed in 1883. With modern infrasound detectors, sea Copyright © Christoph Schiller November 1997–January 2007
surf can be detected hundreds of kilometres away. Sea surf leads to a constant ‘hum’ of
the Earth’s crust at frequencies between 3 and 7 mHz. The Global infrasound Network uses
infrasound to detect nuclear weapon tests, earthquakes and volcanic eruptions, and can
count meteorites. Only very rarely can meteorites be heard with the human ear.
**
The method used to deduce the sine waves contained in a signal, as shown in Figure 106, is
called the Fourier transformation. It is of importance throughout science and technology.
In the 1980s, an interesting generalization became popular, called the wavelet transforma-
tion. In contrast to Fourier transformations, wavelet transformations allow us to localize
signals in time. Wavelet transformations are used to compress digitally stored images in
Ref. 176 an efficient way, to diagnose aeroplane turbine problems, and in many other applications.
222 i galilean motion • 3. the global simplicity of complexity
**
If you like engineering challenges, here is one that is still open. How can one make a
Challenge 433 r robust and efficient system that transforms the energy of sea waves into electricity?
**
If you are interested in ocean waves, you might also enjoy the science of oceanography.
For an introduction, see the open source textbooks at http://oceanworld.tamu.edu/.
**
In our description of extended bodies, we assumed that each spot of a body can be fol-
lowed separately throughout its motion. Is this assumption justified? What would happen
Challenge 434 r if it were not?
**
**
Every soliton is a one-dimensional structure. Do two-dimensional analogues exist? This
Ref. 178 issue was open for many years. Finally, in 1988, Boiti, Leon, Martina and Fumagalli found
that a certain evolution equation, the so-called Davey–Stewartson equation, can have Copyright © Christoph Schiller November 1997–January 2007
solutions that are localized in two dimensions. These results were generalized by Fokas
and Santini and further generalized by Hietarinta and Hirota. Such a solution is today
called a dromion. Dromions are bumps that are localized in two dimensions and can move,
without disappearing through diffusion, in non-linear systems. An example is shown in
Figure 115. However, so far, no such solution has be observed in experiments; this is one
of the most important experimental challenges left open in non-linear science.
Over the centuries, it has reappeared again and again, at each improvement of the descrip-
tion of motion; the answer has alternated between the affirmative and the negative. Many
thinkers have been imprisoned, and many still are being persecuted, for giving answers
that are not politically correct! In fact, the issue already arises in everyday life.
in Hollywood films. If this description were correct, mountains would be extended, but
not continuous.
But mountains could also be fractals of a different sort, as shown in the left side of
Figure 116. Mountain surfaces could have an infinity of small and smaller holes. In fact,
one could also imagine that mountains are described as three-dimensional versions of the
left side of the figure. Mountains would then be some sort of mathematical Swiss cheese.
Can you devise an experiment to decide whether fractals provide the correct description
Challenge 436 n for mountains? To settle the issue, a chocolate bar can help.
n=1
n=2
n=5
n=∞
“ Atlantic or a Niagara.
Arthur Conan Doyle, A Study in Scarlet
Any child knows how to make a chocolate bar last forever: eat half the remainder every ”
day. However, this method only works if matter is scale-invariant. In other words, the
method only works if matter is either fractal, as it then would be scale-invariant for a
discrete set of zoom factors, or continuous, in which case it would be scale-invariant for
any zoom factor. Which case, if either, applies to nature?
Page 53 We have already encountered a fact making continuity a questionable assumption: con-
tinuity would allow us, as Banach and Tarski showed, to multiply food and any other
matter by clever cutting and reassembling. Continuity would allow children to eat the
same amount of chocolate every day, without ever buying a new bar. Matter is thus not
continuous. Now, fractal chocolate is not ruled out in this way; but other experiments
settle the question. Indeed, we note that melted materials do not take up much smaller Copyright © Christoph Schiller November 1997–January 2007
volumes than solid ones. We also find that even under the highest pressures, materials do
not shrink. Thus matter is not a fractal. What then is its structure?
To get an idea of the structure of matter we can take fluid chocolate, or even just some
oil – which is the main ingredient of chocolate anyway – and spread it out over a large
surface. For example, we can spread a drop of oil onto a pond on a day without rain or
wind; it is not difficult to observe which parts of the water are covered by the oil and
which are not. A small droplet of oil cannot cover a surface larger than – can you guess
Challenge 437 n the value? Trying to spread the film further inevitably rips it apart. The child’s method
of prolonging chocolate thus does not work for ever: it comes to a sudden end. The oil
experiment shows that there is a minimum thickness of oil films, with a value of about
2 nm. This simple experiment can even be conducted at home; it shows that there is a
smallest size in matter. Matter is made of tiny components. This confirms the observa-
d o extended bodies exist? 225
tions made by Joseph Loschmidt* in 1865, who was the first person to measure the size of
the components of matter.** In 1865, it was not a surprise that matter was made of small
components, as the existence of a smallest size – but not its value – had already been
deduced by Galileo, when studying some other simple questions.***
condemnation of Galileo to life imprisonment for his views on the Earth’s motion was organized by his
friend the Pope to protect him from a sure condemnation to death over a different issue.
The reasons for his arrest, as shown by the denunciation, were not his ideas on astronomy and on the
motion of the Earth, but his statements on matter. Galileo defended the view that since matter is not scale
invariant, it must be made of ‘atoms’ or, as he called them, piccolissimi quanti – smallest quanta. This was
and still is a heresy. A true Catholic is still not allowed to believe in atoms. Indeed, the theory of atoms is
not compatible with the change of bread and wine into human flesh and blood, called transsubstantiation,
which is a central tenet of the Catholic faith. In Galileo’s days, church tribunals punished heresy, i.e. deviating
personal opinions, by the death sentence. Despite being condemned to prison in his trial, Galileo published
his last book, written as an old man under house arrest, on the scaling issue. Today, the Catholic Church still
refuses to publish the proceedings and other documents of the trial. Its officials carefully avoid the subject
of atoms, as any statement on this subject would make the Catholic Church into a laughing stock. Indeed,
quantum theory, named after the term used by Galileo, has become the most precise description of nature
yet.
226 i galilean motion • 3. the global simplicity of complexity
three
mono-
atomic
steps
lamp eye
photograph to be included
Felling trees
The gentle lower slopes of Motion Mountain are covered by trees. Trees are fascinating
structures. Take their size. Why do trees have limited size? Already in the sixteenth cen-
tury, Galileo knew that it is not possible to increase tree height without limits: at some
point a tree would not have the strength to support its own weight. He estimated the max-
imum height to be around 90 m; the actual record, unknown to him at the time, seems to
be 150 m, for the Australian tree Eucalyptus regnans. But why does a limit exist at all? The
answer is the same as for bones: wood has a finite strength because it is not scale invariant;
and it is not scale invariant because it is made of small constituents, namely atoms.*
In fact, the derivation of the precise value of the height limit is more involved. Trees
must not break under strong winds. Wind resistance limits the height-to-thickness ratio
h~d to about 50 for normal-sized trees (for 0.2 m < d < 2 m). Can you say why? Thinner
Copyright © Christoph Schiller November 1997–January 2007
Challenge 439 ny
trees are limited in height to less than 10 m by the requirement that they return to the
Ref. 182 vertical after being bent by the wind.
Such studies of natural constraints also answer the question of why trees are made
from wood and not, for example, from steel. You could check for yourself that the max-
imum height of a column of a given mass is determined by the ratio E~ρ 2 between the
Challenge 440 n elastic module and the square of the mass density. Wood is actually the material for which
Ref. 183 this ratio is highest. Only recently have material scientists managed to engineer slightly
better ratios with fibre composites.
Ref. 181 * There is another important limiting factor: the water columns inside trees must not break. Both factors
seem to yield similar limiting heights.
d o extended bodies exist? 227
Why do materials break at all? All observations yield the same answer and confirm
Galileo’s reasoning: because there is a smallest size in materials. For example, bodies un-
der stress are torn apart at the position at which their strength is minimal. If a body were
completely homogeneous, it could not be torn apart; a crack could not start anywhere. If
a body had a fractal Swiss-cheese structure, cracks would have places to start, but they
would need only an infinitesimal shock to do so.
A simple experiment that shows that solids have a smallest size is shown in Figure 117.
A cylindrical rod of pure, single crystal aluminium shows a surprising behaviour when
it is illuminated from the side: its brightness depends on how the rod is oriented, even
though it is completely round. This angular dependence is due to the atomic arrangement
of the aluminium atoms in the rod.
It is not difficult to confirm experimentally the existence of smallest size in solids. It is
sufficient to break a single crystal, such as a gallium arsenide wafer, in two. The breaking
surface is either completely flat or shows extremely small steps, as shown in Figure 118.
a large number of equal, small and discrete entities. Thus, simply listening to noise proves
that electric current is made of electrons, that air and liquids are made of molecules, and
that light is made of photons. In a sense, the sound of silence is the sound of atoms. Shot
noise would not exist in continuous systems.
”
* The human ear can detect pressure variations at least as small as 20 µPa.
228 i galilean motion • 3. the global simplicity of complexity
Precise observations show that matter is neither continuous nor a fractal: matter is made
of smallest basic particles. Galileo, who deduced their existence by thinking about giants
and trees, called them ‘smallest quanta.’ Today they are called ‘atoms’, in honour of a fam-
ous argument of the ancient Greeks. Indeed, 2500 years ago, the Greeks asked the follow-
ing question. If motion and matter are conserved, how can change and transformation
exist? The philosophical school of Leucippus and Democritus of Abdera* studied two
particular observations in special detail. They noted that salt dissolves in water. They also
noted that fish can swim in water. In the first case, the volume of water does not increase
when the salt is dissolved. In the second case, when fish advance, they must push water
aside. Leucippus and Democritus deduced that there is only one possible explanation
that satisfies observations and also reconciles conservation and transformation: nature is
made of void and of small, indivisible and conserved particles.** In this way any example
of motion, change or transformation is due to rearrangements of these particles; change
and conservation are thus reconciled.
* Leucippus of Elea (Λευκιππος) (c. 490 to c. 430 bce ), Greek philosopher; Elea was a small town south
of Naples. It lies in Italy, but used to belong to the Magna Graecia. Democritus (∆εµοκριτος) of Abdera
(c. 460 to c. 356 or 370 bce ), also a Greek philosopher, was arguably the greatest philosopher who ever lived.
Together with his teacher Leucippus, he was the founder of the atomic theory; Democritus was a much
admired thinker, and a contemporary of Socrates. The vain Plato never even mentions him, as Democritus
Copyright © Christoph Schiller November 1997–January 2007
was a danger to his own fame. Democritus wrote many books which all have been lost; they were not copied
during the Middle Ages because of his scientific and rational world view, which was felt to be a danger by
religious zealots who had the monopoly on the copying industry. Nowadays, it has become common to
claim – incorrectly – that Democritus had no proof for the existence of atoms. That is a typical example of
disinformation with the aim of making us feel superior to the ancients.
** The story is told by Lucretius, in full Titus Lucretius Carus, in his famous text De rerum natura, around
60 bce. (An English translation can be found on http://perseus.uchicago.edu/hopper/text.jsp?doc=Perseus:
text:1999.02.0131.) Lucretius relates many other proofs; in Book 1, he shows that there is vacuum in solids –
as proven by porosity and by density differences – and in gases – as proven by wind. He shows that smells are
Challenge 442 ny due to particles, and that so is evaporation. (Can you find more proofs?) He also explains that the particles
cannot be seen due to their small size, but that their effects can be felt and that they allow to consistently
explain all observations.
Especially if we the imagine particles as little balls, we cannot avoid calling this a typically male idea.
Challenge 443 d (What would be the female approach?)
d o extended bodies exist? 229
position-sensitive
laser
(segmented)
diode
photo- lens
detector
vertical cantilever
piezo
controller tip
horizontal sample
piezo
controllers
like everyday matter, atoms have mass, size, shape and colour. Single atoms have even
Ref. 186 been used as lamps and lasers.
Modern researchers in several fields have fun playing with atoms in the same way that
children play with Lego. Maybe the most beautiful demonstration of these possibilities is Copyright © Christoph Schiller November 1997–January 2007
Ref. 187 provided by the many applications of the atomic force microscope. If you ever have the
opportunity to see one, do not miss it!* It is a simple device which follows the surface of
an object with an atomically sharp needle; such needles, usually of tungsten, are easily
Ref. 188 manufactured with a simple etching method. The changes in the height of the needle
along its path over the surface are recorded with the help of a deflected light ray. With a
little care, the atoms of the object can be felt and made visible on a computer screen. With
special types of such microscopes, the needle can be used to move atoms one by one to
specified places on the surface. It is also possible to scan a surface, pick up a given atom
Ref. 189 and throw it towards a mass spectrometer to determine what sort of atom it is.
* A cheap version costs only a few thousand euro, and will allow you to study the difference between a silicon
wafer – crystalline – a flour wafer – granular-amorphous – and a consecrated wafer.
230 i galilean motion • 3. the global simplicity of complexity
Figures to be inserted
ρv + p + ρдz = const
1 2
(88)
2
where z is the height above ground. This is called Bernoulli’s equation. In this equation,
the first term is the kinetic energy (per volume) of the fluid, and the other two terms are
* Studying matter in even more detail yields the now well-known idea that matter, at higher and higher
magnifications, is made of molecules, atoms, nuclei, protons and neutrons, and finally, quarks. Atoms also
contain electrons. A final type of matter, neutrinos, is observed coming from the Sun and from certain types
of radioactive materials. Even though the fundamental bricks have become smaller with time, the basic idea
remains: matter is made of smallest entities, nowadays called elementary particles. In the second part of our
Page 943 mountain ascent we will explore this idea in detail. Appendix C lists the measured properties of all known
elementary particles.
d o extended bodies exist? 231
potential energies (per volume). The last term is only important if the fluid rises against
ground. The second term is the potential energy (per volume) resulting from the com-
Challenge 444 e pression of the fluid. Indeed, pressure is a potential energy per volume.
Energy conservation implies that the lower the pressure is, the larger the speed of a
fluid becomes. One can use this relation to measure the speed of a stationary water flow
in a tube. One just has to narrow the tube somewhat at one location along the tube, and
º
measure the pressure difference before and at the tube restriction. One finds that the
Challenge 445 n sped v is given as v = k p 1 − p 2 . (What is the constant k?) A device using this method
is called a Venturi gauge.
If the geometry of a system is kept fixed and the fluid speed is increased, at a certain
speed one observes a transition: the liquid loses its clarity, the flow is not stationary any
more. This is seen whenever a water tap is opened. The flow has changed from laminar
to turbulent. At this point, Bernoulli’s equation is not valid any more.
The description of turbulence might be the toughest of all problems in physics. When
The different speeds of the air above and below the wing lead to vortices at the end of
every wing. These vortices are especially important for the takeoff of any insect, bird and
Page 991 aeroplane. More details on wings are discussed later on.
* They are named after Claude Navier (b. 1785 Dijon, d. 1836 Paris), important French engineer and bridge
builder, and Georges Gabriel Stokes (b. 1819 Skreen, d. 1903 Cambridge), important Irish physicist and
mathematician.
232 i galilean motion • 3. the global simplicity of complexity
water
**
Challenge 448 n A ship leaves a river and enter the sea. What happens?
**
Put a rubber air balloon over the end of a bottle and let it hang inside the bottle. How
Challenge 449 e much can you blow up the balloon inside the bottle?
**
Put a small paper ball into the neck of a horizontal bottle and try to blow it into the bottle.
Challenge 450 e The paper will fly towards you. Why?
**
It is possible to blow an egg from one egg-cup to a second one just behind it. Can you
Challenge 451 e perform this trick?
**
In the seventeenth century, engineers who needed to pump water faced a challenge. To
pump water from mine shafts to the surface, no water pump managed more than 10 m Copyright © Christoph Schiller November 1997–January 2007
of height difference. For twice that height, one always needed two pumps in series, con-
Challenge 452 n nected by an intermediate reservoir. Why? How then do trees manage to pump water
upwards for larger heights?
**
When hydrogen and oxygen are combined to form water, the amount of hydrogen needed
is exactly twice the amount of oxygen, if no gas is to be left over after the reaction. How
Challenge 453 n does this observation confirm the existence of atoms?
**
Challenge 454 n How are alcohol-filled chocolate pralines made? Note that the alcohol is not injected into
them afterwards, because there would be no way to keep the result tight enough.
d o extended bodies exist? 233
**
How often can a stone jump when it is thrown over the surface of water? The present world
record was achieved in 2002: 40 jumps. More information is known about the previous
Ref. 192 world record, achieved in 1992: a palm-sized, triangular and flat stone was thrown with
a speed of 12 m~s (others say 20 m~s) and a rotation speed of about 14 revolutions per
second along a river, covering about 100 m with 38 jumps. (The sequence was filmed
with a video recorder from a bridge.)
What would be necessary to increase the number of jumps? Can you build a machine
Challenge 455 r that is a better thrower than yourself?
**
The biggest component of air is nitrogen (about 78 %). The second biggest component is
Challenge 456 n oxygen (about 21 %). What is the third biggest one?
**
Water can flow uphill: Heron’s fountain shows this most clearly. Heron of Alexandria Copyright © Christoph Schiller November 1997–January 2007
(c. 10 to c. 70) described it 2000 years ago; it is easily built at home, using some plastic
Challenge 457 n bottles and a little tubing. How does it work?
**
A light bulb is placed, underwater, in a stable steel cylinder with a diameter of 16 cm. A
Fiat Cinquecento (500 kg) is placed on a piston pushing onto the water surface. Will the
Challenge 458 n bulb resist?
**
Challenge 459 ny What is the most dense gas? The most dense vapour?
**
234 i galilean motion • 3. the global simplicity of complexity
Every year, the Institute of Maritime Systems of the University of Rostock organizes a
contest. The challenge is to build a paper boat with the highest carrying capacity. The
paper boat must weigh at most 10 g; the carrying capacity is measured by pouring lead
small shot onto it, until the boat sinks. The 2002 record stands at 2.6 kg. Can you achieve
Challenge 460 e this value? (For more information, see the http://www.paperboat.de website.)
**
A modern version of an old question – already posed by Daniel Colladon (1802–1893) –
is the following. A ship of mass m in a river is pulled by horses walking along the riverbank
attached by ropes. If the river is of superfluid helium, meaning that there is no friction
between ship and river, what energy is necessary to pull the ship upstream along the river
Challenge 461 n until a height h has been gained?
**
**
Several humans have survived free falls from aeroplanes for a thousand metres or more,
Challenge 465 n even though they had no parachute. How was this possible?
Copyright © Christoph Schiller November 1997–January 2007
**
Liquid pressure depends on height. If the average human blood pressure at the height of
Challenge 466 n the heart is 13.3 kPa, can you guess what it is inside the feet when standing?
**
The human heart pumps blood at a rate of about 0.1 l~s. A capillary has the diameter of a
red blood cell, around 7 µm, and in it the blood moves at a speed of half a millimetre per
Challenge 467 n second. How many capillaries are there in a human?
**
A few drops of tea usually flow along the underside of the spout of a teapot (or fall onto the
table). This phenomenon has even been simulated using supercomputer simulations of
d o extended bodies exist? 235
Ref. 193 the motion of liquids, by Kistler and Scriven, using the Navier–Stokes equations. Teapots
are still shedding drops, though.
**
**
A drop of water that falls into a pan containing hot oil dances on the surface for a consid-
erable time, if the oil is above 220°C. Cooks test the temperature of oil in this way. Why
Challenge 469 ny does this so-called Leidenfrost effect* take place?
**
Challenge 470 n Why don’t air molecules fall towards the bottom of the container and stay there?
**
Challenge 471 n Which of the two water funnels in Figure 125 is emptied more rapidly? Apply energy
Ref. 194 conservation to the fluid’s motion (also called Bernoulli’s ‘law’) to find the answer.
**
As we have seen, fast flow generates an underpressure. How do fish prevent their eyes Copyright © Christoph Schiller November 1997–January 2007
**
Glass is a solid. Nevertheless, many textbooks say that glass is a liquid. This error has been
propagated for about a hundred years, probably originating from a mistranslation of a
**
The recognized record height reached by a helicopter is 12 442 m above sea level, though
12 954 m has also been claimed. (The first height was reached in 1972, the second in 2002,
both by French pilots in French helicopters.) Why, then, do people still continue to use
their legs in order to reach the top of Mount Sagarmatha, the highest mountain in the
Challenge 475 n world?
**
A loosely knotted sewing thread lies on the surface of a bowl filled with water. Putting
a bit of washing-up liquid into the area surrounded by the thread makes it immediately
Challenge 476 e become circular. Why?
**
Have you ever dropped a Mentos candy into a Diet Coca Cola bottle? You will get an
Challenge 478 e interesting effect. (Do it at your own risk...) Is it possible to build a rocket in this way?
**
Challenge 479 e A needle can swim on water, if you put it there carefully. Just try, using a fork.
**
Fluids exhibit many complex motions. To see an overview, have a look at the beautiful
Challenge 480 e collection on the web site http://serve.me.nus.edu.sg/limtt. One of the most famous ex-
amples of fluid motion is the leapfrogging of vortex rings, shown in Figure 126. Lim Tee
d o extended bodies exist? 237
**
Comic books have difficulties with the concept of atoms. Could Asterix really throw Ro-
mans into the air using his fist? Are Lucky Luke’s precise revolver shots possible? Can
Spiderman’s silk support him in his swings from building to building? Can the Roadrun-
ner stop running in three steps? Can the Sun be made to stop in the sky by command?
Can space-ships hover using fuel? Take any comic-book hero and ask yourself whether
Challenge 482 e matter made of atoms would allow him the feats he seems capable of. You will find that
most cartoons are comic precisely because they assume that matter is not made of atoms,
but continuous! In a sense, atoms make life a serious adventure.
**
Can humans start earthquakes? What would happen if 1000 million Indians were to jump
238 i galilean motion • 3. the global simplicity of complexity
Challenge 483 n at the same time from the kitchen table to the floor?
In fact, several strong earthquakes have been triggered by humans. This has happened
when water dams have been filled, or when water has been injected into drilling holes.
It has been suggested that the extraction of deep underground water also causes earth-
quakes. If this is confirmed, a sizeable proportion of all earthquakes could be human-
triggered.
**
Challenge 484 n How can a tip of a stalactite be distinguished from a tip of a stalagmite? Does the differ-
ence exist also for icicles?
**
How much more weight would your bathroom scales show if you stood on them in a
Challenge 485 n vacuum?
distinguish three main types of steel, depending on the crystalline structure. Ferritic steels
have a body-centred cubic structure, austenitic steels have a face-centred cubic structure,
and martensitic steels have a body-centred tetragonal structure. Table 28 gives further
details.
**
A simple phenomenon which requires a complex explanation is the cracking of a whip.
Ref. 195 Since the experimental work of Peter Krehl it has been known that the whip cracks when
Challenge 487 ny the tip reaches a velocity of twice the speed of sound. Can you imagine why?
**
A bicycle chain is an extended object with no stiffness. However, if it is made to rotate
d o extended bodies exist? 239
rapidly, it acquires dynamical stiffness, and can roll down an inclined plane or along
the floor. This surprising effect can be watched at http://www.iwf.de/NR/rdonlyres/
EEFA7FDC-DDDC-490C-9C49-4537A925EFE6/718/C14825.asx or http://www.iwf.
de/NR/rdonlyres/EEFA7FDC-DDDC-490C-9C49-4537A925EFE6/793/C148292.smil.
**
Mechanical devices are not covered in this text. There is a lot of progress in the area even
Ref. 197 at present. For example, people have built robots that are able to ride a unicycle. But even
Ref. 198 the physics of human unicycling is not simple.
**
240 i galilean motion • 3. the global simplicity of complexity
There are many arguments against the existence of atoms as hard balls. Thomson-Kelvin
Ref. 199 put it in writing: “the monstruous assumption of infinitely strong and infinitely rigid
Challenge 488 n pieces of matter.” Even though Thomson was right in his comment, atoms do exist. Why?
ics, and the motion of the vacuum. Once we have explored these domains, we will have
completed the first step of our description of motion: continuum physics. In continuum
physics, motion and moving entities are described with continuous quantities that can
take any value, including arbitrarily small or arbitrarily large values.
But nature is not continuous. We have already seen that matter cannot be indefinitely
divided into ever-smaller entities. In fact, we will discover that there are precise experi-
ments that provide limits to the observed values for every domain of continuum physics.
There is a limit to mass, to speed, to angular momentum, to force, to entropy and to
change of charge. The consequences of these discoveries form the second step in our de-
scription of motion: quantum theory and relativity. Quantum theory is based on lower
limits; relativity is based on upper limits. The third and last step of our description of
motion will be formed by the unification of quantum theory and general relativity.
Every domain of physics, regardless of which one of the above steps it belongs to, de-
scribes change in terms two quantities: energy, and an extensive quantity characteristic
pV T . (89)
Copyright © Christoph Schiller November 1997–January 2007
The proportionality constant is fixed by the amount of gas used. (More about it shortly.)
The ideal gas relation allows us to determine temperature by measuring pressure and
volume. This is the way (absolute) temperature has been defined and measured for about
Ref. 204 a century. To define the unit of temperature, one only has to fix the amount of gas used. It
Page 1170 is customary to fix the amount of gas at 1 mol; for oxygen this is 32 g. The proportionality
constant, called the ideal gas constant R, is defined to be R = 8.3145 J~mol K. This number
has been chosen in order to yield the best approximation to the independently defined
Celsius temperature scale. Fixing the ideal gas constant in this way defines 1 K, or one
Kelvin, as the unit of temperature. In simple terms, a temperature increase of one Kelvin
is defined as the temperature increase that makes the volume of an ideal gas increase –
242 i galilean motion • 3. the global simplicity of complexity
F I G U R E 127 Braking generates heat on the floor and in the tire (© Klaus-Peter Möllmann and
Michael Vollmer)
Temperature
The temperature achieved by a civilization can be used as a measure of its technological
achievements. One can define the Bronze Age (1.1 kK, 3500 bce) , the Iron Age (1.8 kK,
1000 bce), the Electric Age (3 kK from c. 1880) and the Atomic Age (several MK, from
1944) in this way. Taking into account also the quest for lower temperatures, one can
Ref. 205 define the Quantum Age (4 K, starting 1908).
Heating implies flow of energy. For example, friction heats up and slows down mov-
ing bodies. In the old days, the ‘creation’ of heat by friction was even tested experiment-
ally. It was shown that heat could be generated from friction, just by continuous rubbing,
Ref. 201 without any limit; an example is shown in Figure 127. This ‘creation’ implies that heat is
not a material fluid extracted from the body – which in this case would be consumed Copyright © Christoph Schiller November 1997–January 2007
after a certain time – but something else. Indeed, today we know that heat, even though
it behaves in some ways like a fluid, is due to disordered motion of particles. The conclu-
sion of these studies is simple. Friction is the transformation of mechanical energy into
thermal energy.
To heat 1 kg of water by 1 K by friction, 4.2 kJ of mechanical energy must be trans-
formed through friction. The first to measure this quantity with precision was, in 1842,
the German physician Julius Robert Mayer (1814–1878). He regarded his experiment as
proof of the conservation of energy; indeed, he was the first person to state energy con-
servation! It is something of an embarrassment to modern physics that a medical doctor
was the first to show the conservation of energy, and furthermore, that he was ridiculed
by most physicists of his time. Worse, conservation of energy was accepted only when
it was repeated many years later by two authorities: Hermann von Helmholtz – himself
temperature 243
O b s e r va t i o n Te m p e r at u r e
also a physician turned physicist – and William Thomson, who also cited similar, but
later experiments by James Joule.* All of them acknowledged Mayer’s priority. Publicity
by William Thomson eventually led to the naming of the unit of energy after Joule.
In short, the sum of mechanical energy and thermal energy is constant. This is usu-
ally called the first principle of thermodynamics. Equivalently, it is impossible to produce
mechanical energy without paying for it with some other form of energy. This is an im-
portant statement, because among others it means that humanity will stop living one day.
Indeed, we live mostly on energy from the Sun; since the Sun is of finite size, its energy
Challenge 490 n content will eventually be consumed. Can you estimate when this will happen?
There is also a second (and the mentioned third) principle of thermodynamics, which
will be presented later on. The study of these topics is called thermostatics if the systems
concerned are at equilibrium, and thermodynamics if they are not. In the latter case, we
distinguish situations near equilibrium, when equilibrium concepts such as temperature
can still be used, from situations far from equilibrium, such as self-organization, where
Entropy
– It’s irreversible.
“ – Like my raincoat!
Mel Brooks, Spaceballs, 1987
Every domain of physics describes change in terms of two quantities: energy, and an ex- ”
Ref. 202 tensive quantity characteristic of the domain. Even though heat is related to energy, the
quantity physicists usually call heat is not an extensive quantity. Worse, what physicists
call heat is not the same as what we call heat in our everyday speech. The extensive quant-
ity corresponding to what we call ‘heat’ in everyday speech is called entropy.*** Entropy
Copyright © Christoph Schiller November 1997–January 2007
describes heat in the same way as momentum describes motion. When two objects differ-
ing in temperature are brought into contact, an entropy flow takes place between them,
* Hermann von Helmholtz (b. 1821 Potsdam, d. 1894 Berlin), important Prussian scientist. William Thom-
son (later William Kelvin) (1824–1907), important Irish physicist. James Prescott Joule (1818–1889), Eng-
lish physicist. Joule is pronounced so that it rhymes with ‘cool’, as his descendants like to stress. (The pro-
nunciation of the name ‘Joule’ varies from family to family.)
** This might change in future, when mass measurements improve in precision, thus allowing the detection
Page 322 of relativistic effects. In this case, temperature increase may be detected through its related mass increase.
However, such changes are noticeable only with twelve or more digits of precision in mass measurements.
*** The term ‘entropy’ was invented by the German physicist Rudolph Clausius (1822–1888) in 1865. He
formed it from the Greek ἐν ‘in’ and τρόπος ‘direction’, to make it sound similar to ‘energy’. It has always had
the meaning given here.
temperature 245
Pro cess/System E n t r o p y va l u e
like the flow of momentum that take place when two objects of different speeds collide.
Let us define the concept of entropy more precisely and explore its properties in some
more detail.
are called processes. Central to thermostatics is the distinction between reversible pro-
cesses, such as the flight of a thrown stone, and irreversible processes, such as the afore-
mentioned tumbling rock. Irreversible processes are all those processes in which friction
and its generalizations play a role. They are those which increase the sharing or mixing of
energy. They are important: if there were no friction, shirt buttons and shoelaces would
Ref. 206 not stay fastened, we could not walk or run, coffee machines would not make coffee, and
Page 826 maybe most importantly of all, we would have no memory.
Irreversible processes, in the sense in which the term is used in thermostatics, trans-
form macroscopic motion into the disorganized motion of all the small microscopic com-
ponents involved: they increase the sharing and mixing of energy. Irreversible processes
are therefore not strictly irreversible – but their reversal is extremely improbable. We can
say that entropy measures the ‘amount of irreversibility’: it measures the degree of mixing
246 i galilean motion • 3. the global simplicity of complexity
Flow of entropy
We know from daily experience that transport of an extensive quantity always involves
friction. Friction implies generation of entropy. In particular, the flow of entropy itself
produces additional entropy. For example, when a house is heated, entropy is produced
in the wall. Heating means to keep a temperature difference ∆T between the interior and
the exterior of the house. The heat flow J traversing a square metre of wall is given by
where κ is a constant characterizing the ability of the wall to conduct heat. While con-
ducting heat, the wall also produces entropy. The entropy production σ is proportional to
the difference between the interior and the exterior entropy flows. In other words, one
Copyright © Christoph Schiller November 1997–January 2007
has
(Ti − Te )2
σ= =κ
J J
− . (91)
Te Ti Ti Te
Note that we have assumed in this calculation that everything is near equilibrium in each
slice parallel to the wall, a reasonable assumption in everyday life. A typical case of a good
wall has κ = 1 W~m2 K in the temperature range between 273 K and 293 K. With this value,
one gets an entropy production of
Can you compare the amount of entropy that is produced in the flow with the amount that
Challenge 493 ny is transported? In comparison, a good goose-feather duvet has κ = 1.5 W~m2 K, which in
shops is also called 15 tog.*
There are two other ways, apart from heat conduction, to transport entropy: convection,
Challenge 494 n to surprising results. (What do you think?)** We’ll take the first steps towards the answer
shortly.
* That unit is not as bad as the official (not a joke) BthU ë h~sqft~cm~°F used in some remote provinces of
our galaxy.
The insulation power of materials is usually measured by the constant λ = κd which is independent of
the thickness d of the insulating layer. Values in nature range from about 2000 W~K m for diamond, which
is the best conductor of all, down to between 0.1 W~K m and 0.2 W~K m for wood, between 0.015 W~K m
and 0.05 W~K m for wools, cork and foams, and the small value of 5 ë 10−3 W~K m for krypton gas.
** A strange hint: your answer is almost surely wrong.
248 i galilean motion • 3. the global simplicity of complexity
* By the way, the word gas is a modern construct. It was coined by the Brussels alchemist and physician
Johan Baptista van Helmont (1579–1644), to sound similar to ‘chaos’. It is one of the few words which have
been invented by one person and then adopted all over the world.
** Daniel Bernoulli (b. 1700 Bâle, d. 1782 Bâle), important Swiss mathematician and physicist. His father
Johann and his uncle Jakob were famous mathematicians, as were his brothers and some of his nephews.
Daniel Bernoulli published many mathematical and physical results. In physics, he studied the separation
of compound motion into translation and rotation. In 1738 he published the Hydrodynamique, in which
he deduced all results from a single principle, namely the conservation of energy. The so-called Bernoulli’s
principle states that (and how) the pressure of a fluid decreases when its speed increases. He studied the tides
and many complex mechanical problems, and explained the Boyle–Mariotte gas ‘law’. For his publications
he won the prestigious prize of the French Academy of Sciences – a forerunner of the Nobel Prize – ten
times.
temperature 249
Now you can take up the following challenge: how can you measure the weight of a
Challenge 499 n car or a bicycle with a ruler only?
The picture of gases as being made of hard constituents without any long-distance
interactions breaks down at very low temperatures. However, the ideal gas relation (93)
can be improved to overcome these limitations by taking into account the deviations due
Brownian motion
It is easy to observe, under a microscope, that small particles (such as pollen) in a liquid
never come to rest. They seem to follow a random zigzag movement. In 1827, the English
botanist Robert Brown (1773–1858) showed with a series of experiments that this obser-
vation is independent of the type of particle and of the type of liquid. In other words,
Brown had discovered a fundamental noise in nature. Around 1860, this motion was
attributed to the molecules of the liquid colliding with the particles. In 1905 and 1906,
Ref. 209 Marian von Smoluchowski and, independently, Albert Einstein argued that this theory
could be tested experimentally, even though at that time nobody was able to observe mo-
lecules directly. The test makes use of the specific properties of thermal noise.
It had already been clear for a long time that if molecules, i.e. indivisible matter
particles, really existed, then heat had to be disordered motion of these constituents
and temperature had to be the average energy per degree of freedom of the constituents. Copyright © Christoph Schiller November 1997–January 2007
Bernoulli’s model of Figure 128 implies that for monoatomic gases the kinetic energy Tkin
Challenge 500 ny per particle is given by
Tkin = kT
3
(94)
2
where T is temperature. The so-called Boltzmann constant k = 1.4 ë 10−23 J~K is the stand-
ard conversion factor between temperature and energy.*At a room temperature of 293 K,
the kinetic energy is thus 6 zJ.
* The important Austrian physicist Ludwig Boltzmann (b. 1844 Vienna, d. 1906 Duino) is most famous for
his work on thermodynamics, in which he explained all thermodynamic phenomena and observables, in-
cluding entropy, as results of the behaviour of molecules. Planck named the Boltzmann constant after his
investigations. He was one of the most important physicists of the late nineteenth century and stimulated
250 i galilean motion • 3. the global simplicity of complexity
b
b
F I G U R E 130 Example paths for particles in Brownian motion and their displacement distribution
`d 2 e = nl 2 . (95)
`d 2 e = nl 2 = vl t . (96)
In other words, the average square displacement increases proportionally with time. Of
Copyright © Christoph Schiller November 1997–January 2007
course, this is only valid if the liquid is made of separate molecules. Repeatedly measuring
the position of a particle should give the distribution shown in Figure 130 for the probab-
ility that the particle is found at a given distance from the starting point. This is called the
Ref. 210 (Gaussian) normal distribution. In 1908, Jean Perrin* performed extensive experiments
many developments that led to quantum theory. It is said that Boltzmann committed suicide partly because
of the resistance of the scientific establishment to his ideas. Nowadays, his work is standard textbook mater-
ial.
* Jean Perrin (1870–1942), important French physicist, devoted most of his career to the experimental proof
of the atomic hypothesis and the determination of Avogadro’s number; in pursuit of this aim he perfected
the use of emulsions, Brownian motion and oil films. His Nobel Prize speech (http://nobelprize.org/physics/
laureates/1926/perrin-lecture.html) tells the interesting story of his research. He wrote the influential book
temperature 251
M at e r i a l Entropy per
pa r t i c l e
in order to test this prediction. He found that equation (96) corresponded completely
with observations, thus convincing everybody that Brownian motion is indeed due to
collisions with the molecules of the surrounding liquid, as Smoluchowski and Einstein
had predicted.* Perrin received the 1926 Nobel Prize for these experiments.
Einstein also showed that the same experiment could be used to determine the number
of molecules in a litre of water (or equivalently, the Boltzmann constant k). Can you work
Challenge 503 d out how he did this?
swers of which would tell us everything about the system, i.e. about its microscopic
state.
— Entropy measures the (logarithm of the) number W of possible microscopic states.
A given macroscopic state can have many microscopic realizations. The logarithm of
this number, multiplied by the Boltzmann constant k, gives the entropy.**
Les atomes and founded the Centre National de la Recherche Scientifique. He was also the first to speculate,
in 1901, that an atom is similar to a small solar system.
Ref. 211 * In a delightful piece of research, Pierre Gaspard and his team showed in 1998 that Brownian motion is
Page 267 also chaotic, in the strict physical sense given later on.
** When Max Planck went to Austria to search for the anonymous tomb of Boltzmann in order to get him
buried in a proper grave, he inscribed the formula S = k ln W on the tombstone. (Which physicist would
252 i galilean motion • 3. the global simplicity of complexity
In short, the higher the entropy, the more microstates are possible. Through either
of these definitions, entropy measures the quantity of randomness in a system. In other
words, it measures the transformability of energy: higher entropy means lower transform-
ability. Alternatively, entropy measures the freedom in the choice of microstate that a sys-
tem has. High entropy means high freedom of choice for the microstate. For example,
when a molecule of glucose (a type of sugar) is produced by photosynthesis, about 40
bits of entropy are released. This means that after the glucose is formed, 40 additional
yes-or-no questions must be answered in order to determine the full microscopic state
of the system. Physicists often use a macroscopic unit; most systems of interest are large,
and thus an entropy of 1023 bits is written as 1 J~K.*
To sum up, entropy is thus a specific measure for the characterization of disorder of
Ref. 212 thermal systems. Three points are worth making here. First of all, entropy is not the meas-
ure of disorder, but one measure of disorder. It is therefore not correct to use entropy as
a synonym for the concept of disorder, as is often done in the popular literature. Entropy
whenever the granularity of matter plays a role; it expresses the fact that matter is made
of small basic entities. The most striking way to put this statement is the following: There
is a smallest entropy in nature. Indeed, for all systems, the entropy obeys
SE
k
. (97)
2
This result is almost 100 years old; it was stated most clearly (with a different numerical
Ref. 214 factor) by the Hungarian–German physicist Leo Szilard. The same point was made by the
Ref. 215 French physicist Léon Brillouin (again with a different numerical factor). The statement
can also be taken as the definition of the Boltzmann constant.
The existence of a smallest entropy in nature is a strong idea. It eliminates the possibil-
ity of the continuity of matter and also that of its fractality. A smallest entropy implies that
matter is made of a finite number of small components. The limit to entropy expresses
the fact that matter is made of particles.* The limit to entropy also shows that Galilean
physics cannot be correct: Galilean physics assumes that arbitrarily small quantities do
exist. The entropy limit is the first of several limits to motion that we will encounter until
we finish the second part of our ascent. After we have found all limits, we can start the
third and final part, leading to unification.
The existence of a smallest quantity implies a limit on the precision of measurement.
Measurements cannot have infinite precision. This limitation is usually stated in the form
of an indeterminacy relation. Indeed, the existence of a smallest entropy can be rephrased
as an indeterminacy relation between the temperature T and the inner energy U of a
∆U E .
1 k
∆ (98)
T 2
Ref. 216 This relation** was given by Niels Bohr; it was discussed by Werner Heisenberg, who
Page 1099 called it one of the basic indeterminacy relations of nature. The Boltzmann constant (di-
Ref. 218 vided by 2) thus fixes the smallest possible entropy value in nature. For this reason, Gilles
Ref. 215 Cohen-Tannoudji calls it the quantum of information and Herbert Zimmermann calls it
the quantum of entropy.
The relation (98) points towards a more general pattern. For every minimum value for
an observable, there is a corresponding indeterminacy relation. We will come across this
several times in the rest of our adventure, most importantly in the case of the quantum
Page 720 of action and Heisenberg’s indeterminacy relation.
The existence of a smallest entropy has numerous consequences. First of all, it sheds
light on the third principle of thermodynamics. A smallest entropy implies that absolute
zero temperature is not achievable. Secondly, a smallest entropy explains why entropy
values are finite instead of infinite. Thirdly, it fixes the absolute value of entropy for every
system; in continuum physics, entropy, like energy, is only defined up to an additive con-
stant. The entropy limit settles all these issues.
The existence of a minimum value for an observable implies that an indeterminacy re-
Copyright © Christoph Schiller November 1997–January 2007
lation appears for any two quantities whose product yields that observable. For example,
entropy production rate and time are such a pair. Indeed, an indeterminacy relation con-
nects the entropy production rate P = dS~dt and the time t:
∆P ∆t E
k
. (99)
2
From this and the previous relation (98) it is possible to deduce all of statistical physics,
* The minimum entropy implies that matter is made of tiny spheres; the minimum action, which we will
encounter in quantum theory, implies that these spheres are actually small clouds.
Ref. 217 ** It seems that the historical value for the right hand side, given by k, has to be corrected to k~2.
254 i galilean motion • 3. the global simplicity of complexity
Ref. 218, Ref. 217 i.e., the precise theory of thermostatics and thermodynamics. We will not explore this
further here. (Can you show that the zeroth principle follows from the existence of a
Challenge 506 ny smallest entropy?) We will limit ourselves to one of the cornerstones of thermodynamics:
the second principle.
Page 44
“ backwards.
Lewis Carroll, Alice in Wonderland
When we first discussed time, we ignored the difference between past and future. But ob- ”
viously, a difference exists, as we do not have the ability to remember the future. This is
not a limitation of our brain alone. All the devices we have invented, such as tape record-
ers, photographic cameras, newspapers and books, only tell us about the past. Is there a
way to build a video recorder with a ‘future’ button? Such a device would have to solve a
Ref. 219
“ atoms.
George Wald
Historically, the study of statistical mechanics has been of fundamental importance for ”
Copyright © Christoph Schiller November 1997–January 2007
physics. It provided the first demonstration that physical objects are made of interacting
particles. The story of this topic is in fact a long chain of arguments showing that all the
properties we ascribe to objects, such as size, stiffness, colour, mass density, magnetism,
thermal or electrical conductivity, result from the interaction of the many particles they
consist of. The discovery that all objects are made of interacting particles has often been
called the main result of modern science.
Page 240 How was this discovery made? Table 29 listed the main extensive quantities used in
physics. Extensive quantities are able to flow. It turns out that all flows in nature are com-
posed of elementary processes, as shown in Table 33. We have seen that the flow of mass,
volume, charge, entropy and substance are composed. Later, quantum theory will show
temperature 255
O b s e r va t i o n M i n i m u m va l u e
the same for the flow of linear and angular momentum. All flows are made of particles.
This success of this idea has led many people to generalize it to the statement:
‘Everything we observe is made of parts.’ This approach has been applied with success Copyright © Christoph Schiller November 1997–January 2007
Ref. 220 to chemistry with molecules, materials science and geology with crystals, electricity with
electrons, atoms with elementary particles, space with points, time with instants, light
with photons, biology with cells, genetics with genes, neurology with neurons, mathem-
atics with sets and relations, logic with elementary propositions, and even to linguist-
ics with morphemes and phonemes. All these sciences have flourished on the idea that
everything is made of related parts. The basic idea seems so self-evident that we find it
Challenge 509 ny difficult even to formulate an alternative. Just try!
However, in the case of the whole of nature, the idea that nature is a sum of related
Page 1067 parts is incorrect. It turns out to be a prejudice, and a prejudice so entrenched that it
retarded further developments in physics in the latter decades of the twentieth century. In
particular, it does not apply to elementary particles or to space-time. Finding the correct
description for the whole of nature is the biggest challenge of our adventure, as it requires
256 i galilean motion • 3. the global simplicity of complexity
match
head
F I G U R E 132
The fire pump
classical physics predicts. Solids, like stones, have 6 thermodynamic degrees of freedom
and should show a specific thermal energy of 3N k. At high temperatures, this is indeed
observed. But measurements of solids at room temperature yield lower values, and the
lower the temperature, the lower the values become. Even gases show values lower than
those just mentioned, when the temperature is sufficiently low. In other words, molecules
* Every statement about complexes can be resolved into a statement about their constituents and into the
propositions that describe the complexes completely.
** A thermodynamic degree of freedom is, for each particle in a system, the number of dimensions in which
it can move plus the number of dimensions in which it is kept in a potential. Atoms in a solid have six,
particles in monoatomic gases have only three; particles in diatomic gases or rigid linear molecules have
Ref. 221 five. The number of degrees of freedom of larger molecules depends on their shape.
temperature 257
and atoms behave differently at low energies: atoms are not immutable little hard balls.
The deviation of these values is one of the first hints of quantum theory.
Challenge 511 ny hours after death, what is the minimum number of calories needed per day in our food?
**
The energy contained in thermal motion is not negligible. A 1 g bullet travelling at the
speed of sound has a kinetic energy of only 0.01 kcal.
**
3
Challenge 512 n How does a typical, 1500 m hot-air balloon work?
**
If you do not like this text, here is a proposal. You can use the paper to make a cup, as
258 i galilean motion • 3. the global simplicity of complexity
1 2 3 4
shown in Figure 133, and boil water in it over an open flame. However, to succeed, you
Challenge 513 n have to be a little careful. Can you find out in what way?
ancient units more familiar to nutritionists, 9 kcal). That is the same value as that of petrol.
Challenge 518 n Why are people and butter less dangerous than petrol?
**
In 1992, the Dutch physicist Martin van der Mark invented a loudspeaker which works by
heating air with a laser beam. He demonstrated that with the right wavelength and with
a suitable modulation of the intensity, a laser beam in air can generate sound. The effect
at the basis of this device, called the photoacoustic effect, appears in many materials. The
best laser wavelength for air is in the infrared domain, on one of the few absorption lines
of water vapour. In other words, a properly modulated infrared laser beam that shines
through the air generates sound. Such light can be emitted from a small matchbox-sized
semiconductor laser hidden in the ceiling and shining downwards. The sound is emitted
temperature 259
invisible pulsed
laser beam
emitting sound
laser
cable
to amplifier
**
A famous exam question: How can you measure the height of a building with a barometer,
Challenge 519 n a rope and a ruler? Find at least six different ways.
**
What is the approximate probability that out of one million throws of a coin you get
Challenge 520 ny exactly 500 000 heads and as many tails? º
You may want to use Stirling’s formula n! 2πn (n~e)n to calculate the result.*
**
Challenge 521 n Does it make sense to talk about the entropy of the universe?
Copyright © Christoph Schiller November 1997–January 2007
**
Challenge 522 ny Can a helium balloon lift the tank which filled it?
**
All friction processes, such as osmosis, diffusion, evaporation, or decay, are slow. They
take a characteristic time. It turns out that any (macroscopic) process with a time-scale is
irreversible. This is no real surprise: we know intuitively that undoing things always takes
more time than doing them. That is again the second principle of thermodynamics.
»
(2n + 1~3)π (n~e)n . Another
º º
* There are many improvements to Stirling’s formula. A simple one is n!
is 2πn (n~e)n e ~(12n+1) < n! < 2πn (n~e)n e ~(12n) .
1 1
260 i galilean motion • 3. the global simplicity of complexity
**
It turns out that storing information is possible with negligible entropy generation. How-
Ref. 225 ever, erasing information requires entropy. This is the main reason why computers, as well
as brains, require energy sources and cooling systems, even if their mechanisms would
otherwise need no energy at all.
**
When mixing hot rum and cold water, how does the increase in entropy due to the mixing
Challenge 523 ny compare with the entropy increase due to the temperature difference?
**
Why aren’t there any small humans, e.g. 10 mm in size, as in many fairy tales? In fact,
Challenge 524 n there are no warm-blooded animals of that size. Why not?
**
What is the rough probability that all oxygen molecules in the air would move away from
Challenge 525 ny a given city for a few minutes, killing all inhabitants?
**
If you pour a litre of water into the sea, stir thoroughly through all the oceans and then
Challenge 526 ny take out a litre of the mixture, how many of the original atoms will you find?
**
Challenge 527 ny How long would you go on breathing in the room you are in if it were airtight? Copyright © Christoph Schiller November 1997–January 2007
**
Challenge 528 ny What happens if you put some ash onto a piece of sugar and set fire to the whole? (Warn-
ing: this is dangerous and not for kids.)
**
Entropy calculations are often surprising. For a system of N particles with two states
each, there are Wall = 2 N states. For its most probable configuration, with exactly half the
particles in one state, and the other half in the other state, we have Wmax = N!~((N~2)!)2 .
Now, for a macroscopic system of particles, we might typically have N = 1024 . That gives
Wall Q Wmax ; indeed, the former is 1012 times larger than the latter. On the other hand, we
Challenge 529 ny find that ln Wall and ln Wmax agree for the first 20 digits! Even though the configuration
temperature 261
with exactly half the particles in each state is much more rare than the general case, where
Challenge 530 ny the ratio is allowed to vary, the entropy turns out to be the same. Why?
**
If heat is due to motion of atoms, our built-in senses of heat and cold are simply detectors
Challenge 531 ny of motion. How could they work?
By the way, the senses of smell and taste can also be seen as motion detectors, as they
Challenge 532 ny signal the presence of molecules flying around in air or in liquids. Do you agree?
**
The Moon has an atmosphere, although an extremely thin one, consisting of sodium (Na)
and potassium (K). This atmosphere has been detected up to nine Moon radii from its sur-
face. The atmosphere of the Moon is generated at the surface by the ultraviolet radiation
Challenge 533 n from the Sun. Can you estimate the Moon’s atmospheric density?
**
Thermometers based on mercury can reach 750°C. How is this possible, given that mer-
Challenge 536 n cury boils at 357°C?
Copyright © Christoph Schiller November 1997–January 2007
**
Challenge 537 n What does a burning candle look like in weightless conditions?
**
It is possible to build a power station by building a large chimney, so that air heated by
the Sun flows upwards in it, driving a turbine as it does so. It is also possible to make a
power station by building a long vertical tube, and letting a gas such as ammonia rise into
it which is then liquefied at the top by the low temperatures in the upper atmosphere; as
it falls back down a second tube as a liquid – just like rain – it drives a turbine. Why are
Challenge 538 n such schemes, which are almost completely non-polluting, not used yet?
**
262 i galilean motion • 3. the global simplicity of complexity
cold air
hot air
**
It is easy to cook an egg in such a way that the white is hard but the yolk remains liquid.
Challenge 540 n Can you achieve the opposite?
**
Thermoacoustic engines, pumps and refrigerators provide many strange and fascinating
applications of heat. For example, it is possible to use loud sound in closed metal cham-
bers to move heat from a cold place to a hot one. Such devices have few moving parts and
Ref. 226 are being studied in the hope of finding practical applications in the future.
** Copyright © Christoph Schiller November 1997–January 2007
Challenge 541 ny Does a closed few-particle system contradict the second principle of thermodynamics?
**
What happens to entropy when gravitation is taken into account? We carefully left grav-
itation out of our discussion. In fact, many problems appear – just try to think about
the issue. For example, Jakob Bekenstein has discovered that matter reaches its highest
Challenge 542 ny possible entropy when it forms a black hole. Can you confirm this?
**
The numerical values (but not the units!) of the Boltzmann constant k = 1.38 ë 10−23 J~K
and the combination h~ce agree in their exponent and in their first three digits, where h is
Challenge 543 ny Planck’s constant and e is the electron charge. Can you dismiss this as mere coincidence?
self-organization and chaos 263
Figures to be added
Ref. 227
“ zoology the study of non-elephant animals.
Stanislaw Ulam
In our list of global descriptions of motion, the high point is the study of self-organization. ”
Self-organization is the appearance of order. Order is a term that includes shapes, such as
Ref. 228 the complex symmetry of snowflakes; patterns, such as the stripes of zebras; and cycles,
such as the creation of sound when singing. Every example of what we call beauty is a Copyright © Christoph Schiller November 1997–January 2007
Challenge 544 n combination of shapes, patterns and cycles. (Do you agree?) Self-organization can thus
be called the study of the origin of beauty.
The appearance of order is a general observation across nature. Fluids in particular ex-
hibit many phenomena where order appears and disappears. Examples include the more
or less regular flickering of a burning candle, the flapping of a flag in the wind, the regular
stream of bubbles emerging from small irregularities in the surface of a champagne glass,
and the regular or irregular dripping of a water tap.
The appearance of order is found from the cell differentiation in an embryo inside a
woman’s body; the formation of colour patterns on tigers, tropical fish and butterflies; the
symmetrical arrangements of flower petals; the formation of biological rhythms; and so
on.
All growth processes are self-organization phenomena. Have you ever pondered the
264 i galilean motion • 3. the global simplicity of complexity
n = 21
n = 23
incredible way in which teeth grow? A practically inorganic material forms shapes in the
upper and the lower rows fitting exactly into each other. How this process is controlled
is still a topic of research. Also the formation, before and after birth, of neural networks
in the brain is another process of self-organization. Even the physical processes at the
basis of thinking, involving changing electrical signals, is to be described in terms of self-
organization.
Biological evolution is a special case of growth. Take the evolution of animal shapes.
It turns out that snake tongues are forked because that is the most efficient shape for
Ref. 229 following chemical trails left by prey and other snakes of the same species. (Snakes smell
with help of the tongue.) The fixed numbers of fingers in human hands or of petals of
Page 715 flowers are also consequences of self-organization.
Many problems of self-organization are mechanical problems: for example, the form- Copyright © Christoph Schiller November 1997–January 2007
ation of mountain ranges when continents move, the creation of earthquakes, or the cre-
ation of regular cloud arrangements in the sky. It can be fascinating to ponder, during
an otherwise boring flight, the mechanisms behind the formation of the clouds you see
Challenge 545 e from the aeroplane.
Studies into the conditions required for the appearance or disappearance of order have
shown that their description requires only a few common concepts, independently of the
details of the physical system. This is best seen looking at a few examples.
All the richness of self-organization reveals itself in the study of plain sand. Why do
sand dunes have ripples, as does the sand floor at the bottom of the sea? We can also
study how avalanches occur on steep heaps of sand and how sand behaves in hourglasses,
in mixers, or in vibrating containers. The results are often surprising. For example, as
Ref. 230 recently as 1996 Paul Umbanhowar and his colleagues found that when a flat container
self-organization and chaos 265
holding tiny bronze balls (around 0.165 mm in diameter) is shaken up and down in va-
cuum at certain frequencies, the surface of this bronze ‘sand’ forms stable heaps. They are
shown in Figure 137. These heaps, so-called oscillons, also bob up and down. Oscillons
can move and interact with one another.
Oscillons in sand are simple example for a general effect in nature: discrete systems
with nonlinear interactions can exhibit localized excitations. This fascinating topic is just
Ref. 231 beginning to be researched. It might well be that it will yield results relevant to our under-
standing of elementary particles.
Sand shows many other pattern-forming processes. A mixture of sand and sugar, when
poured onto a heap, forms regular layered structures that in cross section look like zebra
stripes. Horizontally rotating cylinders with binary mixtures inside them separate the
mixture out over time. Or take a container with two compartments separated by a 1 cm
wall. Fill both halves with sand and rapidly shake the whole container with a machine.
Over time, all the sand will spontaneously accumulate in one half of the container. As
another example of self-organization in sand, people have studied the various types of
Ref. 232 sand dunes that ‘sing’ when the wind blows over them. In fact, the behaviour of sand
and dust is proving to be such a beautiful and fascinating topic that the prospect of each Copyright © Christoph Schiller November 1997–January 2007
Secondly, patterns, shapes and cycles are due to the organized motion of large num-
bers of small constituents. Systems which self-organize are always composite: they are
cooperative structures.
Thirdly, all these systems obey evolution equations which are nonlinear in the config-
uration variables. Linear systems do not self-organize. Many self-organizing systems also
show chaotic motion.
Fourthly, the appearance and disappearance of order depends on the strength of a
driving force, the so-called order parameter. Often, chaotic motion appears when the driv-
ing is increased beyond the value necessary for the appearance of order. An example of
chaotic motion is turbulence, which appears when the order parameter, which is propor-
tional to the speed of the fluid, is increased to high values.
Moreover, all order and all structure appears when two general types of motion com-
pete with each other, namely a ‘driving’, energy-adding process, and a ‘dissipating’, brak-
ing mechanism. Thermodynamics plays a role in all self-organization. Self-organizing
* To describe the ‘mystery’ of human life, terms like ‘fire’, ‘river’ or ‘tree’ are often used as analogies. These
are all examples of self-organized systems: they have many degrees of freedom, have competing driving
and braking forces, depend critically on their initial conditions, show chaos and irregular behaviour, and
sometimes show cycles and regular behaviour. Humans and human life resemble them in all these respects;
thus there is a solid basis to their use as metaphors. We could even go further and speculate that pure beauty
is pure self-organization. The lack of beauty indeed often results from a disturbed equilibrium between
external braking and external driving.
self-organization and chaos 267
oscillation, quasiperiodic
fixed point limit cycle motion chaotic motion
state value
system 1
time
and its numerous variations allow us to describe many phenomena, ranging from spir-
Ref. 235 als, waves, hexagonal patterns, and topological defects, to some forms of turbulence. For
every physical system under study, the main task is to distil the observable A and the
parameters λ, µ and κ from the underlying physical processes.
Self-organization is a vast field which is yielding new results almost by the week. To
discover new topics of study, it is often sufficient to keep one’s eye open; most effects are
Challenge 548 ny comprehensible without advanced mathematics. Good hunting!
Most systems that show self-organization also show another type of motion. When the
driving parameter of a self-organizing system is increased to higher and higher values,
order becomes more and more irregular, and in the end one usually finds chaos. For
physicists, c ha o T c motion is the most irregular type of motion.* Chaos can be defined
i
positive Lyapounov exponent. The weather is such a system, as are dripping water-taps, the
fall of dice, and many other common systems. For example, research on the mechanisms
by which the heart beat is generated has shown that the heart is not an oscillator, but a
chaotic system with irregular cycles. This allows the heart to be continuously ready for
demands for changes in beat rate which arise once the body needs to increase or decrease
Ref. 236 its efforts.
* On the topic of chaos, see the beautiful book by H.-O. Peitgen, H. Jürgens & D. Saupe, Chaos and
Fractals, Springer Verlag, 1992. It includes stunning pictures, the necessary mathematical background, and
some computer programs allowing personal exploration of the topic. ‘Chaos’ is an old word: according to
Greek mythology, the first goddess, Gaia, i.e. the Earth, emerged from the chaos existing at the beginning.
She then gave birth to the other gods, the animals and the first humans.
268 i galilean motion • 3. the global simplicity of complexity
Incidentally, can you give a simple argument to show that the so-called butterfly effect
Challenge 549 n does not exist? This ‘effect’ is often cited in newspapers: the claim is that nonlinearities
imply that a small change in initial conditions can lead to large effects; thus a butterfly
wing beat is alleged to be able to induce a tornado. Even though nonlinearities do indeed
lead to growth of disturbances, the butterfly effect has never been observed; it does not
exist.
There is chaotic motion also in machines: chaos appears in the motion of trains on the
rails, in gear mechanisms, and in fire-fighter’s hoses. The precise study of the motion in
Challenge 550 ny a zippo cigarette lighter will probably also yield an example of chaos. The mathematical
description of chaos – simple for some textbook examples, but extremely involved for
others – remains an important topic of research.
All the steps from disorder to order, quasiperiodicity and finally to chaos, are examples
of self-organization. These types of motion, illustrated in Figure 140, are observed in many
fluid systems. Their study should lead, one day, to a deeper understanding of the mysteries
the eye, are being studied intensely. The ongoing work in this domain is fascinating. If
Challenge 552 ny you plan to become a scientist, consider taking this path.
Such studies provide the final arguments that confirm what J. Offrey de la Mettrie
in 1748 stated and explored in his famous book L’homme machine: humans are complex
machines. Indeed, the lack of understanding of complex systems in the past was due
mainly to the restrictive teaching of the subject of motion, which usually concentrated –
as we do in this walk – on examples of motion in simple systems. Even though the subject
* Already small versions of Niagara Falls, namely dripping water taps, show a large range of cooperative
Ref. 238 phenomena, including the chaotic, i.e. non-periodic, fall of water drops. This happens when the water flow
Challenge 551 ny has the correct value, as you can verify in your own kitchen. Several cooperative fluid phenomena have been
simulated even on the molecular level.
self-organization and chaos 269
water
pipe
pearls
λ finger
of self-organization provides fascinating insights, and will do so for many years to come,
we now leave it. We continue with our own adventure exploring the basics of motion.*
Ich sage euch: man muss noch Chaos in sich
**
When a fine stream of water leaves a water tap, putting a finger in the stream leads to a
Challenge 556 d wavy shape, as shown in Figure 143. Why?
**
When water emerges from a oblong opening, the stream forms a braid pattern, as shown
in Figure 144. This effect results from the interplay and competition between inertia and
Ref. 240 surface tension: inertia tends to widen the stream, while surface tension tends to narrow
it. Predicting the distance from one narrow region to the next is still a topic of research.
If the experiment is done in free air, without a plate, one usually observes an additional
effect: there is a chiral braiding at the narrow regions, induced by the asymmetries of the
water flow. You can observe this effect in the toilet! Scientific curiosity knows no limits:
Challenge 557 ny are you a right-turner or a left-turner, or both? On every day?
ematical analysis showed that the laminar flow is not always stable, and that the transition
to turbulence in a long pipe occurs with travelling waves. Then, in 2004, careful experi-
ments showed that these travelling waves indeed appear when water is flowing through
Ref. 242 a pipe at large Reynolds numbers.
**
For some beautiful pictures on self-organization in fluids, see the http://serve.me.nus.edu.
sg/limtt website. Among others, it shows that a circular vortex can ‘suck in’ a second one
behind it, and that the process can then repeat.
**
Also dance is an example of self-organization. Self-organization takes part in the brain.
from the limitations of physics to the limits of motion 271
Like for all complex movements, learning then is often a challenge. Nowadays there are
beautiful books that tell how physics can help you improve your dancing skills and the
Ref. 243 grace of your movements.
**
Do you want to enjoy working on your PhD? Go into a scientific toy shop, and look for
a toy that moves in a complex way. There are high chances that the motion is chaotic;
explore the motion and present a thesis about it. (Or simply explore the motion of a
hanging wire whose upper end is externally driven.)
Socrates’ saying applies also to Galilean physics, despite its general success in engineering ”
and in the description of everyday life. We will now give a short overview of the limita-
tions of the field.
What is contact?
Democritus declared that there is a unique sort
”
Ref. 245
Of the questions unanswered by classical physics, the details of contact and collisions are
among the most pressing. Indeed, we defined mass in terms of velocity changes during
Page 77 collisions. But why do objects change their motion in such instances? Why are collisions
between two balls made of chewing gum different from those between two stainless-steel
272 i galilean motion • 4. from the limitations of physics
Increasing the accuracy and precision of our description of nature implies leaving behind
us all the mistakes we have made so far. That is our aim in the following.
* For measurements, both precision and accuracy are best described by their standard deviation, as explained
in Appendix B, on page 1180.
to the limits of motion 273
such a book should contain all information contained in the universe. Is this possible?
Let us check the options.
If nature requires an infinitely long book to be fully described, such a publication obvi-
ously cannot exist. In this case, only approximate descriptions of nature are possible and
a perfect physics book is impossible.
If nature requires a finite amount of information for its description, there are two op-
tions. One is that the information of the universe cannot be summarized in a book; them
a perfect physics book is again impossible. The other option is that the universe does
contain a finite amount of information and that it can be summarized in a few small
statements. This would imply that the rest of the universe would not add to the informa-
tion already contained in the perfect physics book. In this case, it seems that the entropy
of the book and the entropy of the universe must be similar. This seems quite unlikely.
We note that the answer to this puzzle also implies the answer to another puzzle:
whether a brain can contain a full description of nature. In other words, the real question
or optical – interactions to achieve its function. Can you confirm that the same applies
Challenge 564 n to speed and to angle measurements? In summary, whatever method we use, in order to
measure velocity, length, time, and mass, interactions other than gravity are needed. Our
ability to measure shows that gravity is not all there is.
Is motion unlimited?
Galilean physics does not explain the ability to measure. In fact, it does not even explain
the existence of standards. Why do objects have fixed lengths? Why do clocks work with
regularity? Galilean physics cannot explain these observations.
Galilean physics also makes no clear statements on the universe as a whole. It seems
to suggest that it is infinite. Finitude does not fit with the Galilean description of mo-
274 i galilean motion • 4. from the limitations of physics
tion. Galilean physics is thus limited in its explanations because it disregards the limits
of motion.
We also note that the existence of infinite speeds in nature would not allow us to
define time sequences. Clocks would then be impossible. In other words, a description of
nature that allows unlimited speeds is not precise. Precision requires limits. To achieve
the highest possible precision, we need to discover all limits to motion. So far, we have
discovered only one: there is a smallest entropy. We now turn to another, more striking
one: the limit for speed. To understand this limit, we will explore the most rapid motion
we know: the motion of light.
Biblio graphy
145 Giuseppe Fumagalli, Chi l’ha detto?, Hoepli, 1983. Cited on page 175.
146 The beautiful story of the south-pointing carriage is told in Appendix B of James Foster
& J.D. Nightingale, A Short Course in General Relativity, Springer Verlag, 2nd edition,
1998. Such carriages have existed in China, as told by the great sinologist Joseph Needham,
but their construction is unknown. The carriage described by Foster and Nightingale is the
one reconstructed in 1947 by George Lancaster, a British engineer. Cited on page 176.
147 See for example Z. Ghahramani, Building blocks of movement, Nature 407, pp. 682–683,
2000. Researchers in robot control are also interested in such topics. Cited on page 176.
148 G. Gu tierrez, C. Fehr, A. Calzadill a & D. Figueroa, Fluid flow up the wall of
a spinning egg, American Journal of Physics 66, pp. 442–445, 1998. Cited on page 177.
149 A historical account is given in Wolf gang Yourgray & Stanley Mandelstam,
Variational Principles in Dynamics and Quantum Theory, Dover, 1968. Cited on pages 178
the same issue. The rules governing branching properties of blood vessels, of lymph systems
and of vessel systems in plants are explained. For more about plants, see also the paper G.B.
West, J.H. Brown & B.J. Enquist, A general model for the structure and allometry of
plant vascular systems, Nature 400, pp. 664–667, 1999. Cited on page 189.
158 J.R. Banavar, A. Martin & A. R inald o, Size and form in efficient transportation
networks, Nature 399, pp. 130–132, 1999. Cited on page 189.
159 N. Moreira, New striders - new humanoids with efficient gaits change the robotics land-
scape, Science News Online 6th of August, 2005. Cited on page 190.
160 John Mansley Robinson, An Introduction to Early Greek Philosophy, Houghton Muffin
1968, chapter 5. Cited on page 191.
161 See e.g. B. B ower, A child’s theory of mind, Science News 144, pp. 40–41. Cited on page
193.
276 i galilean motion
162 The most beautiful book on this topic is the text by Branko Grünbaum & G.C. Shep-
hard, Tilings and Patterns, W.H. Freeman and Company, New York, 1987. It has been trans-
lated into several languages and republished several times. Cited on page 194.
163 U. Niederer, The maximal kinematical invariance group of the free Schrödinger equation,
Helvetica Physica Acta 45, pp. 802–810, 1972. See also the introduction by O. Jahn & V.V.
Sreedhar, The maximal invariance group of Newton’s equations for a free point particle,
http://arxiv.org/abs/math-ph/0102011. Cited on page 199.
164 The story is told in the interesting biography of Einstein by A. Pais, ‘Subtle is the Lord...’ –
The Science and the Life of Albert Einstein, Oxford University Press, 1982. Cited on page 200.
165 W. Zürn & R. Widmer-Schnidrig, Globale Eigenschwingungen der Erde, Physik
Journal 1, pp. 49–55, 2002. Cited on page 206.
166 N. Gau thier, What happens to energy and momentum when two oppositely-moving
wave pulses overlap?, American Journal of Physics 71, pp. 787–790, 2003. Cited on page 210.
167 An informative and modern summary of present research about the ear and the details of
its function is http://www.physicsweb.org/article/world/15/5/8. Cited on page 212.
179 M. Ausloos, Proceedings of the Royal Society in London A 400, pp. 331–350, 1985. Cited
on page 223.
180 T.A. McMahob & J. Tyler B onner, Form und Leben – Konstruktion vom Reißbrett
der Natur, Spektrum Verlag, 1985. Cited on page 225.
181 G.W. Koch, S.C. Sillett, G.M. Jennings & S.D. Davis, The limits to tree height,
Nature 428, pp. 851–854, 2004. Cited on page 226.
182 A simple article explaining the tallness of trees is A. Mineyev, Trees worthy of Paul
Bunyan, Quantum pp. 4–10, January–February 1994. (Paul Bunyan is a mythical giant lum-
berjack who is the hero of the early frontier pioneers in the United States.) Note that the
transport of liquids in trees sets no limits on their height, since water is pumped up along
tree stems (except in spring, when it is pumped up from the roots) by evaporation from the
leaves. This works almost without limits because water columns, when nucleation is care-
fully avoided, can be put under tensile stresses of over 100 bar, corresponding to 1000 m.
See also P. Nobel, Plant Physiology, Academic Press, 2nd Edition, 1999. Cited on page
188 See for example C. Schiller, A.A. Koomans, T.L. van Rooy, C. Schönenberger
& H.B. Elswijk, Decapitation of tungsten field emitter tips during sputter sharpening,
Surface Science Letters 339, pp. L925–L930, 1996. Cited on page 229.
189 U. Weierstall & J.C.H. Spence, An STM with time-of-flight analyzer for atomic spe-
cies identification, MSA 2000, Philadelphia, Microscopy and Microanalysis 6, Supplement 2,
p. 718, 2000. Cited on page 229.
190 The present state of our understanding of turbulence is described in ... Cited on page 231.
191 K. Weltner, A comparison of explanations of aerodynamical lifting force, American
Journal of Physics 55, pp. 50–54, 1987, K. Weltner, Aerodynamic lifting force, The Phys-
ics Teacher 28, pp. 78–82, 1990. See also the http://user.uni-frankfurt.de/~weltner/Flight/
PHYSIC4.htm and the http://www.av8n.com/how/htm/airfoils.html websites. Cited on
page 231.
278 i galilean motion
192 Lydéric B ocquet, The physics of stone skipping, American Journal of Physics 17, pp. 150–
155, 2003. The present recod holder is Kurt Steiner, with 40 skips. See http://pastoneskipping.
com/steiner.htm and http://www.stoneskipping.com. The site http://www.yeeha.net/nassa/
guin/g2.html is by the a previous world record holder, Jerdome Coleman–McGhee. Cited
on page 233.
193 S.F. Kistler & L.E. Scriven, The teapot effect: sheetforming flows with deflection, wet-
ting, and hysteresis, Journal of Fluid Mechanics 263, pp. 19–62, 1994. Cited on page 235.
194 E. Holl ander, Over trechters en zo ..., Nederlands tijdschrift voor natuurkunde 68, p. 303,
2002. Cited on page 235.
195 P. Krehl, S. Engemann & D. Schwenkel, The puzzle of whip cracking – uncovered
by a correlation of whip-tip kinematics with shock wave emission, Shock Waves 8, pp. 1–9,
1998. The authors used high-speed cameras to study the motion of the whip. A new aspect
has been added by A. Goriely & T. McMillen, Shape of a cracking whip, Physical
Review Letters 88, p. 244301, 2002. This article focuses on the tapered shape of the whip.
However, the neglection of the tuft – a piece at the end of the whip which is required to
education, The Physics Teacher 39, pp. 371–376, 2001, Cited on page 242.
202 F. Herrmann, Mengenartige Größen im Physikunterricht, Physikalische Blätter 54,
pp. 830–832, September 1998. See also his lecture notes on general introductory physics
on the website http://www-tfp.physik.uni-karlsruhe.de/~didaktik. Cited on pages 241
and 244.
203 Thermostatics is difficult to learn also because it was not discovered in a systematic way.
See C. Truesdell, The Tragicomical History of Thermodynamics 1822–1854, Springer Ver-
lag, 1980. An excellent advanced textbook on thermostatics and thermodynamics is Linda
R eichl, A Modern Course in Statistical Physics, Wiley, 2nd edition, 1998. Cited on page
241.
204 Gas expansion was the main method used for the definition of the official temperature scale.
Only in 1990 were other methods introduced officially, such as total radiation thermometry
bibliography 279
(in the range 140 K to 373 K), noise thermometry (2 K to 4 K and 900 K to 1235 K), acoustical
thermometry (around 303 K), magnetic thermometry (0.5 K to 2.6 K) and optical radiation
thermometry (above 730 K). Radiation thermometry is still the central method in the range
from about 3 K to about 1000 K. This is explained in detail in R.L. Rusby, R.P. Hudson,
M. Durieux, J.F. Schooley, P.P.M. Steur & C.A. Swenson, The basis of the ITS-
90, Metrologia 28, pp. 9–18, 1991. On the water boiling point see also page 1188. Cited on
page 241.
205 See for example the captivating text by Gino Segrè, A Matter of Degrees: What Temperat-
ure Reveals About the Past and Future of Our Species, Planet and Universe, Viking, New York,
2002. Cited on page 242.
206 B. Polster, What is the best way to lace your shoes?, Nature 420, p. 476, 5 December 2002.
Cited on page 245.
207 See for example the article by H. Preston-Thomas, The international temperature scale
of 1990 (ITS-90), Metrologia 27, pp. 3–10, 1990, and the errata H. Preston-Thomas, The
international temperature scale of 1990 (ITS-90), Metrologia 27, p. 107, 1990, Cited on page
252.
213 This is a disappointing result of all efforts so far, as Grégoire Nicolis always stresses in his
university courses. Seth Lloyd has compiled a list of 31 proposed definitions of complexity,
containing among others, fractal dimension, grammatical complexity, computational com-
plexity, thermodynamic depth. See, for example, a short summary in Scientific American
p. 77, June 1995. Cited on page 252.
214 Minimal entropy is discussed by L. Szil ard, Über die Entropieverminderung in einem
thermodynamischen System bei Eingriffen intelligenter Wesen, Zeitschrift für Physik 53,
pp. 840–856, 1929. This classic paper can also be found in English translation in his collected
works. Cited on page 252.
215 G. Cohen-Tannoudji, Les constantes universelles, Pluriel, Hachette, 1998. See also L.
Brillouin, Science and Information Theory, Academic Press, 1962. Cited on page 253.
280 i galilean motion
216 See for example A.E. Shalyt-Margolin & A.Ya. Tregubovich, Generalized un-
certainty relation in thermodynamics, http://arxiv.org/abs/gr-qc/0307018, or J. Uffink &
J. van Lith-van Dis, Thermodynamic uncertainty relations, Foundations of Physics 29,
p. 655, 1999. Cited on page 253.
217 H.W. Zimmermann, Particle entropies and entropy quanta IV: the ideal gas, the second
law of thermodynamics, and the P-t uncertainty relation, Zeitschrift für physikalische Chemie
217, pp. 55–78, 2003, and H.W. Zimmermann, Particle entropies and entropy quanta V:
the P-t uncertainty relation, Zeitschrift für physikalische Chemie 217, pp. 1097–1108, 2003.
Cited on pages 253 and 254.
218 B. L avenda, Statistical Physics: A Probabilistic Approach, Wiley-Interscience, New York,
1991. Cited on pages 253 and 254.
219 The quote is found in the introduction by George Wald to the text by L awrence J.
Henderson, The Fitness of the Environment, Macmillan, New York, 1913, reprinted 1958.
Cited on page 254.
228 For more about the shapes of snowflakes, see the famous book by W.A. Bentlye & W.J.
Humphreys, Snow Crystals, Dover Publications, New York, 1962. This second printing of
the original from 1931 shows a large part of the result of Bentley’s lifelong passion, namely
several thousand photographs of snowflakes. Cited on page 263.
229 K. Schwenk, Why snakes have forked tongues, Science 263, pp. 1573–1577, 1994. Cited on
page 264.
230 P.B. Umbanhowar, F. Melo & H.L. Swinney, Localized excitations in a vertically
vibrated granular layer, Nature 382, pp. 793–796, 29 August 1996. Cited on page 264.
231 D.K. Campbell, S. Fl ach & Y.S. Kivshar, Localizing energy through nonlinearity
and discreteness, Physics Today 57, pp. 43–49, January 2004. Cited on page 265.
232 B. Andreotti, The song of dunes as a wave-particle mode locking, Physical Review Letters
92, p. 238001, 2004. Cited on page 265.
bibliography 281
233 K. Kötter, E. Goles & M. Markus, Shell structures with ‘magic numbers’ of spheres
in a swirled disk, Physical Review E 60, pp. 7182–7185, 1999. Cited on page 265.
234 A good introduction is the text by Daniel Walgraef, Spatiotemporal Pattern Formation,
With Examples in Physics, Chemistry and Materials Science, Springer 1996. Cited on page
265.
235 For an overview, see the Ph.D. thesis by Joceline Lega, Défauts topologiques associés à la
brisure de l’invariance de translation dans le temps, Université de Nice, 1989. Cited on page
267.
236 An idea of the fascinating mechanisms at the basis of the heart beat is given by A. Babloy-
antz & A. Destexhe, Is the normal heart a periodic oscillator?, Biological Cybernetics
58, pp. 203–211, 1989. Cited on page 267.
237 For a short, modern overview of turbulence, see L.P. Kadanoff, A model of turbulence,
Physics Today 48, pp. 11–13, September 1995. Cited on page 268.
238 For a clear introduction, see T. Schmidt & M. Mahrl, A simple mathematical model of
There are limitations on motion that are missed by the Galilean description. The first
limitation we discover is the existence of a maximal speed in nature. The maximum speed
implies many fascinating results: it leads to observer-varying time and length intervals,
to an intimate relation between mass and energy, and to the existence of event horizons.
We explore them now.
“ ”
L ight is indispensable for a precise description of motion. To check whether a
ine or a path of motion is straight, we must look along it. In other words, we use
ight to define straightness. How do we decide whether a plane is flat? We look across
it,** again using light. How do we measure length to high precision? With light. How do
we measure time to high precision? With light: once it was light from the Sun that was
Page 1170 used; nowadays it is light from caesium atoms.
In other words, light is important because it is the standard for undisturbed motion.
Physics would have evolved much more rapidly if, at some earlier time, light propagation
had been recognized as the ideal example of motion.
But is light really a phenomenon of motion? This was already known in ancient Greece,
from a simple daily phenomenon, the shadow. Shadows prove that light is a moving en-
tity, emanating from the light source, and moving in straight lines.*** The obvious con-
Copyright © Christoph Schiller November 1997–January 2007
clusion that light takes a certain amount of time to travel from the source to the surface
* ‘Nothing is faster than rumour.’ This common sentence is a simplified version of Virgil’s phrase: fama,
malum qua non aliud velocius ullum. ‘Rumour, the evil faster than all.’ From the Aeneid, book IV, verses 173
and 174.
** Note that looking along the plane from all sides is not sufficient for this: a surface that a light beam touches
right along its length in all directions does not need to be flat. Can you give an example? One needs other
Challenge 565 n methods to check flatness with light. Can you specify one?
*** Whenever a source produces shadows, the emitted entities are called rays or radiation. Apart from light,
other examples of radiation discovered through shadows were infrared rays and ultraviolet rays, which em-
anate from most light sources together with visible light, and cathode rays, which were found to be to the
motion of a new particle, the electron. Shadows also led to the discovery of X-rays, which again turned out
to be a version of light, with high frequency. Channel rays were also discovered via their shadows; they turn
out to be travelling ionized atoms. The three types of radioactivity, namely α-rays (helium nuclei), β-rays
284 ii special rel ativity • 5. speed, rest and light
Jupiter and Io
(second measurement)
Earth (second
measurement)
(again electrons), and γ-rays (high-energy X-rays) also produce shadows. All these discoveries were made
between 1890 and 1910: those were the ‘ray days’ of physics.
* Ole (Olaf) Rømer (1644 Aarhus – 1710 Copenhagen), Danish astronomer. He was the teacher of the
Dauphin in Paris, at the time of Louis XIV. The idea of measuring the speed of light in this way was due
to the Italian astronomer Givanni Cassini, whose assistant Rømer had been. Rømer continued his measure-
ments until 1681, when Rømer had to leave France, like all protestants (such as Christiaan Huygens), so that
his work was interrupted. Back in Denmark, a fire destroyed all his measurement notes. As a result, he was
not able to continue improving the precision of his method. Later he became an important administrator
and reformer of the Danish state.
maximum speed, observers at rest, and motion of light 285
c
c
earth v
v
Sun
α
c
c = v~ tan α . (101)
The same measurement can be made for light; we just need to measure the angle at which
the light from a star above Earth’s orbit arrives at the Earth. Because the Earth is moving
relative to the Sun and thus to the star, the angle is not a right one. This effect is called
the aberration of light; the angle is found most easily by comparing measurements made
six months apart. The value of the angle is 20.5 ′′ ; nowadays it can be measured with
a precision of five decimal digits. Given that the speed of the Earth around the Sun is
v = 2πR~T = 29.7 km~s, the speed of light must therefore be c = 3.00 ë 108 m~s.* This is
Copyright © Christoph Schiller November 1997–January 2007
* Umbrellas were not common in Britain in 1726; they became fashionable later, after being introduced from
China. The umbrella part of the story is made up. In reality, Bradley had his idea while sailing on the Thames,
when he noted that on a moving ship the apparent wind has a different direction from that on land. He had
observed 50 stars for many years, notably Gamma Draconis, and during that time he had been puzzled by
the sign of the aberration, which was opposite to the effect he was looking for, namely the star parallax. Both
the parallax and the aberration for a star above the ecliptic make them describe a small ellipse in the course
Challenge 568 n of an Earth year, though with different rotation senses. Can you see why?
By the way, it follows from special relativity that the formula (101) is wrong, and that the correct formula
Challenge 569 n is c = v~ sin α; can you see why?
To determine the speed of the Earth, we first have to determine its distance from the Sun. The simplest
method is the one by the Greek thinker Aristarchos of Samos (c. 310 to c. 230 bce ). We measure the angle
between the Moon and the Sun at the moment when the Moon is precisely half full. The cosine of that angle
gives the ratio between the distance to the Moon (determined, for example, by the methods of page 119)
Challenge 570 n and the distance to the Sun. The explanation is left as a puzzle for the reader.
286 ii special rel ativity • 5. speed, rest and light
half-silvered
mirror
mirror light
source
an astonishing value, especially when compared with the highest speed ever achieved by
a man-made object, namely the Voyager satellites, which travel at 52 Mm~h = 14 km~s,
with the growth of children, about 3 nm~s, or with the growth of stalagmites in caves,
about 0.3 pm~s. We begin to realize why measurement of the speed of light is a science
in its own right.
The first precise measurement of the speed of light was made in 1849 by the French
physicist Hippolyte Fizeau (1819–1896). His value was only 5 % greater than the modern
one. He sent a beam of light towards a distant mirror and measured the time the light
took to come back. How did Fizeau measure the time without any electric device? In fact,
Page 58 he used the same ideas that are used to measure bullet speeds; part of the answer is given
Challenge 572 n in Figure 147. (How far away does the mirror have to be?) A modern reconstruction of
Ref. 250 his experiment by Jan Frercks has achieved a precision of 2 %. Today, the experiment is
much simpler; in the chapter on electrodynamics we will discover how to measure the
Page 570 speed of light using two standard UNIX or Linux computers connected by a cable.
The speed of light is so high that it is even difficult to prove that it is finite. Perhaps the
most beautiful way to prove this is to photograph a light pulse flying across one’s field of Copyright © Christoph Schiller November 1997–January 2007
view, in the same way as one can photograph a car driving by or a bullet flying through
The angle in question is almost a right angle (which would yield an infinite distance), and good instru-
Ref. 248 ments are needed to measure it with precision, as Hipparchos noted in an extensive discussion of the prob-
lem around 130 bce. Precise measurement of the angle became possible only in the late seventeenth century,
when it was found to be 89.86°, giving a distance ratio of about 400. Today, thanks to radar measurements
Page 1183 of planets, the distance to the Sun is known with the incredible precision of 30 metres. Moon distance vari-
Challenge 571 n ations can even be measured to the nearest centimetre; can you guess how this is achieved?
Ref. 249 Aristarchos also determined the radius of the Sun and of the Moon as multiples of those of the Earth.
Aristarchos was a remarkable thinker: he was the first to propose the heliocentric system, and perhaps the
first to propose that stars were other, faraway suns. For these ideas, several of his contemporaries proposed
that he should be condemned to death for impiety. When the Polish monk and astronomer Nicolaus Co-
pernicus (1473–1543) again proposed the heliocentric system two thousand years later, he did not mention
Aristarchos, even though he got the idea from him.
maximum speed, observers at rest, and motion of light 287
red
shutter
switch
beam
10 mm
F I G U R E 148 A photograph of a light pulse moving from right to left through a bottle
with milky water, marked in millimetres (photograph © Tom Mattick)
Ref. 251 the air. Figure 148 shows the first such photograph, produced in 1971 with a standard
off-the-shelf reflex camera, a very fast shutter invented by the photographers, and, most
noteworthy, not a single piece of electronic equipment. (How fast does such a shutter have
Challenge 573 n to be? How would you build such a shutter? And how would you make sure it opened at
the right instant?) Copyright © Christoph Schiller November 1997–January 2007
A finite speed of light also implies that a rapidly rotating light beam behaves as shown
as in Figure 149. In everyday life, the high speed of light and the slow rotation of light-
houses make the effect barely noticeable.
In short, light moves extremely rapidly. It is much faster than lightning, as you might
Challenge 574 n like to check yourself. A century of increasingly precise measurements of the speed have
culminated in the modern value
In fact, this value has now been fixed exactly, by definition, and the metre has been defined
in terms of c. Table 35 gives a summary of what is known today about the motion of light.
Two surprising properties were discovered in the late nineteenth century. They form the
288 ii special rel ativity • 5. speed, rest and light
O b s e r va t i o n s a b o u t l i g h t
Can one play tennis using a laser pulse as the ball and mirrors
as rackets?
Et nihil est celerius annis.*
“ Ovid, Metamorphoses.
We all know that in order to throw a stone as far as possible, we run as we throw it; we ”
know instinctively that in that case the stone’s speed with respect to the ground is higher.
However, to the initial astonishment of everybody, experiments show that light emitted
from a moving lamp has the same speed as light emitted from a resting one. Light (in
vacuum) is never faster than light; all light beams have the same speed. Many specially
Ref. 253 designed experiments have confirmed this result to high precision. The speed of light can
Copyright © Christoph Schiller November 1997–January 2007
be measured with a precision of better than 1 m~s; but even for lamp speeds of more than
Challenge 575 n 290 000 000 m~s no differences have been found. (Can you guess what lamps were used?)
In everyday life, we know that a stone arrives more rapidly if we run towards it. Again,
for light no difference has been measured. All experiments show that the velocity of light
has the same value for all observers, even if they are moving with respect to each other
or with respect to the light source. The speed of light is indeed the ideal, perfect measure-
ment standard.**
Ref. 256 There is also a second set of experimental evidence for the
constancy of the speed of light. Every electromagnetic device,
such as an electric toothbrush, shows that the speed of light is
Page 544 constant. We will discover that magnetic fields would not res-
ult from electric currents, as they do every day in every motor
and in every loudspeaker, if the speed of light were not constant.
This was actually how the constancy was first deduced, by sev-
eral researchers. Only after understanding this, did the German–
Swiss physicist Albert Einstein* show that the constancy is also
in agreement with the motion of bodies, as we will do in this
section. The connection between electric toothbrushes and re-
Page 544 lativity will be described in the chapter on electrodynamics.(For Albert Einstein
information about direct influences of relativity on machine
Ref. 258 design, see the interesting textbook by Van Bladel.) In simple terms, if the speed of light
This relation is the basis of special relativity; in fact, the full theory of special relativity
is contained in it. Einstein often regretted that the theory was called ‘Relativitätstheorie’
or ‘theory of relativity’; he preferred the name ‘Invarianztheorie’ or ‘theory of invariance’,
Ref. 259 but was not able to change the name.
observation of a supernova in 1987, when the flash and the neutrino pulse arrived a 12 seconds apart. (It
is not known whether the difference is due to speed differences or to a different starting point of the two
flashes.) What is the first digit for which the two speed values could differ, knowing that the supernova was
Challenge 576 n 1.7 ë 105 light years away?
Experiments also show that the speed of light is the same in all directions of space, to at least 21 digits of
Ref. 254 precision. Other data, taken from gamma ray bursts, show that the speed of light is independent of frequency,
Ref. 255 to at least 20 digits of precision.
* Albert Einstein (b. 1879 Ulm, d. 1955 Princeton); one of the greatest physicists ever. He published three
important papers in 1905, one about Brownian motion, one about special relativity, and one about the idea
Copyright © Christoph Schiller November 1997–January 2007
of light quanta. Each paper was worth a Nobel Prize, but he was awarded the prize only for the last one. Also
in 1905, he proved the famous formula E 0 = mc 2 (published in early 1906), possibly triggered by an idea of
Page 326 Olinto De Pretto. Although Einstein was one of the founders of quantum theory, he later turned against it.
His famous discussions with his friend Niels Bohr nevertheless helped to clarify the field in its most counter-
intuitive aspects. He explained the Einstein–de Haas effect which proves that magnetism is due to motion
inside materials. In 1915 and 1916, he published his highest achievement: the general theory of relativity,
one of the most beautiful and remarkable works of science.
Being Jewish and famous, Einstein was a favourite target of attacks and discrimination by the National
Socialist movement; in 1933 he emigrated to the USA. He was not only a great physicist, but also a great
Ref. 257 thinker; his collection of thoughts about topics outside physics are worth reading.
Anyone interested in emulating Einstein should know that he published many papers, and that many
of them were wrong; he would then correct the results in subsequent papers, and then do so again. This
happened so frequently that he made fun of himself about it. Einstein realizes the famous definition of a
genius as a person who makes the largest possible number of mistakes in the shortest possible time.
290 ii special rel ativity • 5. speed, rest and light
The constancy of the speed of light is in complete contrast with Galilean mechanics,
and proves that the latter is wrong at high velocities. At low velocities the description
remains good, because the error is small. But if we want a description valid at all velocities,
we have to discard Galilean mechanics. For example, when we play tennis we use the fact
that by hitting the ball in the right way, we can increase or decrease its speed. But with
light this is impossible. Even if we take an aeroplane and fly after a light beam, it still
moves away with the same speed. Light does not behave like cars. If we accelerate a bus
we are driving, the cars on the other side of the road pass by with higher and higher
speeds. For light, this is not so: light always passes by with the same speed.*
Why is this result almost unbelievable, even though the measurements show it un-
ambiguously? Take two observers O and Ω (pronounced ‘omega’) moving with relative
velocity v, such as two cars on opposite sides of the street. Imagine that at the moment
they pass each other, a light flash is emitted by a lamp in O. The light flash moves through
positions x(t) for O and through positions ξ(τ) (pronounced ‘xi of tau’) for Ω. Since the
=c= .
x ξ
(104)
t τ
However, in the situation described, we obviously have x x ξ. In other words, the con-
stancy of the speed of light implies that t x τ, i.e. that time is different for observers moving
Challenge 577 e relative to each other. Time is thus not unique. This surprising result, which has been con-
Ref. 260 firmed by many experiments, was first stated clearly in 1905 by Albert Einstein. Though
many others knew about the invariance of c, only the young Einstein had the courage to
say that time is observer-dependent, and to face the consequences. Let us do so as well.
Already in 1895, the discussion of viewpoint invariance had been called the theory of
relativity by Henri Poincaré.** Einstein called the description of motion without gravity
Ref. 256 the theory of special relativity, and the description of motion with gravity the theory of
general relativity. Both fields are full of fascinating and counter-intuitive results. In par-
ticular, they show that everyday Galilean physics is wrong at high speeds.
The speed of light is a limit speed. We stress that we are not talking of the situation
where a particle moves faster than the speed of light in matter, but still slower than the
speed of light in vacuum. Moving faster than the speed of light in matter is possible. If the
particle is charged, this situation gives rise to the so-called Čerenkov radiation. It corres- Copyright © Christoph Schiller November 1997–January 2007
ponds to the V-shaped wave created by a motor boat on the sea or the cone-shaped shock
wave around an aeroplane moving faster than the speed of sound. Čerenkov radiation is
regularly observed; for example it is the cause of the blue glow of the water in nuclear re-
actors. Incidentally, the speed of light in matter can be quite low: in the centre of the Sun,
* Indeed, even with the current measurement precision of 2 ë 10−13 , we cannot discern any changes of the
Ref. 254 speed of light with the speed of the observer.
** Henri Poincaré (1854–1912), important French mathematician and physicist. Poincaré was one of the
most productive men of his time, advancing relativity, quantum theory, and many parts of mathematics.
The most beautiful and simple introduction to relativity is still that given by Albert Einstein himself, for
example in Über die spezielle und allgemeine Relativitätstheorie, Vieweg, 1997, or in The Meaning of Relativity,
Methuen, London, 1951. It has taken a century for books almost as beautiful to appear, such as the text by
Ref. 261, Ref. 262 Taylor and Wheeler.
maximum speed, observers at rest, and motion of light 291
the speed of light is estimated to be only around 10 km~year, and even in the laboratory,
Ref. 263, Ref. 264 for some materials, it has been found to be as low as 0.3 m~s. In the following, when we
use the term ‘speed of light’, we mean the speed of light in vacuum. The speed of light in
air in smaller than that in vacuum only by a fraction of a percent, so that in most cases,
the difference can be neglected.
=¼ = γ(v) .
t1 1
(106)
t2 1− v2
c2
Time intervals for a moving observer are shorter by this factor γ; the time dilation factor
is always larger than 1. In other words, moving clocks go slower. For everyday speeds the
first second
time time
Challenge 580 e effect is tiny. That is why we do not detect time differences in everyday life. Nevertheless,
Galilean physics is not correct for speeds near that of light. The same factor γ also appears
in the formula E = γmc 2 , which we will deduce below. Expression (105) or (106) is the
only piece of mathematics needed in special relativity: all other results derive from it.
such particles massive particles. If light had mass, it would be necessary to distinguish
the ‘massless energy speed’ c from the speed of light c L , which would be lower and would
depend on the kinetic energy of those massive particles. The speed of light would not be
constant, but the massless energy speed would still be so. Massive light particles could
be captured, stopped and stored in a box. Such boxes would make electric illumination
unnecessary; it would be sufficient to store some daylight in them and release the light,
slowly, during the following night, maybe after giving it a push to speed it up.*
Physicists have tested the possibility of massive light in quite some detail. Observations
Ref. 266, Ref. 267 now put any possible mass of light (particles) at less than 1.3 ë 10−52 kg from terrestrial
* Incidentally, massive light would also have longitudinal polarization modes. This is in contrast to observa-
tions, which show that light is polarized exclusively transversally to the propagation direction.
maximum speed, observers at rest, and motion of light 293
sender receiver
z
sender θs
v x
experiments, and at less than 4 ë 10−62 kg from astrophysical arguments (which are a bit
less strict). In other words, light is not heavy, light is light.
But what happens when light hits a moving mirror? If the speed of light does not
change, something else must. The situation is akin to that of a light source moving with re-
spect to the receiver: the receiver will observe a different colour from that observed by the
sender. This is called the Doppler effect. Christian Doppler* was the first to study the fre-
quency shift in the case of sound waves – the well-known change in whistle tone between
Copyright © Christoph Schiller November 1997–January 2007
approaching and departing trains – and to extend the concept to the case of light waves.
As we will see later on, light is (also) a wave, and its colour is determined by its frequency,
or equivalently, by its wavelength λ. Like the tone change for moving trains, Doppler real-
ized that a moving light source produces a colour at the receiver that is different from the
colour at the source. Simple geometry, and the conservation of the number of maxima
Challenge 582 e and minima, leads to the result
* Christian Andreas Doppler (b. 1803 Salzburg, d. 1853 Venezia), Austrian physicist. Doppler studied the
effect named after him for sound and light. In 1842 he predicted (correctly) that one day we would be able
to use the effect to measure the motion of distant stars by looking at their colours.
294 ii special rel ativity • 5. speed, rest and light
=» (1 − cos θ r ) = γ (1 − cos θ r ) .
λr 1 v v
(107)
λs 1 − v 2 ~c 2 c c
The variables v and θ r in this expression are defined in Figure 152. Light from an approach-
ing source is thus blue-shifted, whereas light from a departing source is red-shifted. The
first observation of the Doppler effect for light was made by Johannes Stark* in 1905,
who studied the light emitted by moving atoms. All subsequent experiments confirmed
the calculated colour shift within measurement errors; the latest checks have found agree-
Ref. 268 ment to within two parts per million. In contrast to sound waves, a colour change is also
found when the motion is transverse to the light signal. Thus, a yellow rod in rapid mo-
tion across the field of view will have a blue leading edge and a red trailing edge prior to
the closest approach to the observer. The colours result from a combination of the longit-
udinal (first-order) Doppler shift and the transverse (second-order) Doppler shift. At a
particular angle θ unshifted the colours will be the same. (How does the wavelength change
Challenge 585 n Can you imagine how the number z is determined? Typical values for z for light sources
in the sky range from −0.1 to 3.5, but higher values, up to more than 10, have also been
Challenge 586 n found. Can you determine the corresponding speeds? How can they be so high?
In summary, whenever one tries to change the speed of light, one only manages to
Copyright © Christoph Schiller November 1997–January 2007
* Johannes Stark (1874–1957), discovered in 1905 the optical Doppler effect in channel rays, and in 1913
the splitting of spectral lines in electrical fields, nowadays called the Stark effect. For these two discoveries
he received the 1919 Nobel Prize for physics. He left his professorship in 1922 and later turned into a full-
blown National Socialist. A member of the NSDAP from 1930 onwards, he became known for aggressively
criticizing other people’s statements about nature purely for ideological reasons; he became rightly despised
by the academic community all over the world.
Challenge 584 n ** At what speed does a red traffic light appear green?
maximum speed, observers at rest, and motion of light 295
Challenge 589 e
“ ”
for Lucky Luke to achieve the feat shown in Figure 153, his bullet has to move faster than
the speed of light. (What about his hand?) In order to emulate Lucky Luke, we could take
Copyright © Christoph Schiller November 1997–January 2007
the largest practical amount of energy available, taking it directly from an electrical power
station, and accelerate the lightest ‘bullets’ that can be handled, namely electrons. This ex-
periment is carried out daily in particle accelerators such as the Large Electron Positron
ring, the LEP, of 27 km circumference, located partly in France and partly in Switzerland,
near Geneva. There, 40 MW of electrical power (the same amount used by a small city)
accelerates electrons and positrons to energies of over 16 nJ (104.5 GeV) each, and their
Ref. 271 speed is measured. The result is shown in Figure 154: even with these impressive means
* There are still people who refuse to accept these results, as well as the ensuing theory of relativity. Every
physicist should enjoy the experience, at least once in his life, of conversing with one of these men. (Strangely,
no woman has yet been reported as belonging to this group of people.) This can be done, for example, via
Ref. 272 the internet, in the sci.physics.relativity newsgroup. See also the http://www.crank.net website. Crackpots
are a fascinating lot, especially since they teach the importance of precision in language and in reasoning,
which they all, without exception, neglect. Encounters with several of them provided the inspiration for this
chapter.
maximum speed, observers at rest, and motion of light 297
t
second
first
observer third
observer
(e.g. train) observer
(e.g. Earth)
(e.g. stone)
kseT
kteT
O
x
The observation of speed of light as a limit speed for objects is easily seen to be a
consequence of its constancy. Bodies that can be at rest in one frame of reference obviously
move more slowly than the maximum velocity (light) in that frame. Now, if something
moves more slowly than something else for one observer, it does so for all other observers
Challenge 592 d as well. (Trying to imagine a world in which this would not be so is interesting: funny
things would happen, such as things interpenetrating each other.) Since the speed of light
is the same for all observers, no object can move faster than light, for every observer.
We follow that the maximum speed is the speed of massless entities. Electromagnetic
waves, including light, are the only known entities that can travel at the maximum speed.
Gravitational waves are also predicted to achieve maximum speed. Though the speed
of neutrinos cannot be distinguished experimentally from the maximum speed, recent
Ref. 273 experiments suggest that they do have a tiny mass.
Conversely, if a phenomenon exists whose speed is the limit speed for one observer,
Challenge 593 e then this limit speed must necessarily be the same for all observers. Is the connection
Challenge 594 r between limit property and observer invariance generally valid in nature? Copyright © Christoph Schiller November 1997–January 2007
* By taking the (natural) logarithm of this equation, one can define a quantity, the rapidity, that measures
298 ii special rel ativity • 5. speed, rest and light
v se =
v st + v te
.
1 + v st v te ~c 2
(109)
Challenge 596 e This is called the velocity composition formula. The result is never larger than c and is
always smaller than the naive sum of the velocities.* Expression (109) has been confirmed
Page 321, page 544 by all of the millions of cases for which it has been checked. You may check that it reduces
Ref. 267 to the naive sum for everyday life values.
vDc (110)
Lorentz contraction from Maxwell’s equation for the electrodynamic field. He was the first to understand,
long before quantum theory confirmed the idea, that Maxwell’s equations for the vacuum also describe
matter and all its properties, as long as moving charged point particles – the electrons – are included. He
showed this in particular for the dispersion of light, for the Zeeman effect, for the Hall effect and for the
Faraday effect. He gave the correct description of the Lorentz force. In 1902, he received the physics Nobel
Prize, together with Pieter Zeeman. Outside physics, he was active in the internationalization of scientific
collaborations. He was also instrumental in the creation of the largest human-made structures on Earth: the
polders of the Zuyder Zee.
*** Albert Abraham Michelson (b. 1852 Strelno, d. 1931 Pasadena), Prussian–Polish–US-American physi-
cist, awarded the Nobel Prize in physics in 1907. Michelson called the set-up he devised an interferometer, a
term still in use today. Edward William Morley (1838–1923), US-American chemist, was Michelson’s friend
and long-time collaborator.
Page 87 **** This point is essential. For example, Galilean physics states that only relative motion is physical. Galilean
physics also excludes various mathematically possible ways to realize a constant light speed that would con-
maximum speed, observers at rest, and motion of light 299
AOM Power
30 driver servo
angle/3 [deg]
Laser 1 PD
10 Nd: YAG FC
T °C
FC
0 T °C Laser 2 Res B PD
Nd: YAG Fiber BS
PZT AOM DBM
10 FC
∑ PD
PD
Frequency Local
20 servo oscillator PD Res A
0 100 200 300 400 500 600 Frequency Local
DBM
servo oscillator
time since begin of rotation [s]
∑
AOM Power
driver
servo
— In a closed free-floating room, there is no way to tell the speed of the room.
— There is no notion of absolute rest (or space): rest (like space) is an observer-dependent
concept.*
— Time depends on the observer; time is not absolute.
More interesting and specific conclusions can be drawn when two additional conditions
are assumed. First, we study situations where gravitation can be neglected. (If this not the
case, we need general relativity to describe the system.) Secondly, we also assume that the
Copyright © Christoph Schiller November 1997–January 2007
data about the bodies under study – their speed, their position, etc. – can be gathered
without disturbing them. (If this not the case, we need quantum theory to describe the
system.)
To deduce the precise way in which the different time intervals and lengths measured
by two observers are related to each other, we take an additional simplifying step. We start
observer (greek)
v
light
c
observer (roman)
L L
with a situation where no interaction plays a role. In other words, we start with relativistic
kinematics of bodies moving without disturbance.
If an undisturbed body is observed to travel along a straight line with a constant ve-
locity (or to stay at rest), one calls the observer inertial, and the coordinates used by the
observer an inertial frame of reference. Every inertial observer is itself in undisturbed
motion. Examples of inertial observers (or frames) thus include – in two dimensions –
those moving on a frictionless ice surface or on the floor inside a smoothly running train
or ship; for a full example – in all three spatial dimensions – we can take a cosmonaut
travelling in a space-ship as long as the engine is switched off. Inertial observers in three
dimensions might also be called free-floating observers. They are thus not so common.
Challenge 598 e Non-inertial observers are much more common. Can you confirm this? Inertial observ-
ers are the most simple ones, and they form a special set:
— Any two inertial observers move with constant velocity relative to each other (as longs
as gravity plays no role, as assumed above).
Copyright © Christoph Schiller November 1997–January 2007
— All inertial observers are equivalent: they describe the world with the same equations.
Because it implies the lack of absolute space and time, this statement was called the
principle of relativity by Henri Poincaré. However, the essence of relativity is the exist-
ence of a limit speed.
To see how measured length and space intervals change from one observer to the other,
we assume two inertial observers, a Roman one using coordinates x, y, z and t, and a
Greek one using coordinates ξ, υ, ζ and τ,* that move with velocity v relative to each
other. The axes are chosen in such a way that the velocity points in the x-direction. The
* They are read as ‘xi’, ‘upsilon’, ‘zeta’ and ‘tau’. The names, correspondences and pronunciations of all Greek
letters are explained in Appendix A.
maximum speed, observers at rest, and motion of light 301
constancy of the speed of light in any direction for any two observers means that for the
motion of light the coordinate differentials are related by
Assume also that a flash lamp at rest for the Greek observer, thus with dξ = 0, produces
two flashes separated by a time interval dτ. For the Roman observer, the flash lamp moves
with speed v, so that dx = vdt. Inserting this into the previous expression, and assuming
linearity and speed direction independence for the general case, we find that intervals are
Challenge 599 e related by
dτ + vdξ~c 2
dt = γ(dτ + vdξ~c 2 ) = » with v = dx~dt
1 − v 2 ~c 2
dx = γ(dξ + vdτ) = »
dξ + vdτ
These expressions describe how length and time intervals measured by different observ-
ers are related. At relative speeds v that are small compared to the velocity of light, such
as occur in everyday life, the time intervals are essentially equal; the stretch factor or re-
lativistic correction or relativistic contraction γ is then equal to 1 for all practical purposes.
However, for velocities near that of light the measurements of the two observers give
different values. In these cases, space and time mix, as shown in Figure 158.
The expressions (112) are also strange in another respect. When two observers look
Challenge 600 n at each other, each of them claims to measure shorter intervals than the other. In other
words, special relativity shows that the grass on the other side of the fence is always shorter
– if one rides along beside the fence on a bicycle and if the grass is inclined. We explore
this bizarre result in more detail shortly.
The stretch factor γ is equal to 1 for most practical purposes in everyday life. The largest
value humans have ever achieved is about 2 ë 105 ; the largest observed value in nature is
Challenge 601 n about 1012 . Can you imagine where they occur?
Once we know how space and time intervals change, we can easily deduce how coordin-
Copyright © Christoph Schiller November 1997–January 2007
ates change. Figures 157 and 158 show that the x coordinate of an event L is the sum of
two intervals: the ξ coordinate and the length of the distance between the two origins. In
other words, we have
ξ = γ(x − vt) and v =
dx
. (113)
dt
Using the invariance of the space-time interval, we get
Henri Poincaré called these two relations the Lorentz transformations of space and time
302 ii special rel ativity • 5. speed, rest and light
after their discoverer, the Dutch physicist Hendrik Antoon Lorentz.* In one of the most
Ref. 278 beautiful discoveries of physics, in 1892 and 1904, Lorentz deduced these relations from
Page 555 the equations of electrodynamics, where they had been lying, waiting to be discovered,
since 1865.** In that year James Clerk Maxwell had published the equations in order to
describe everything electric and magnetic. However, it was Einstein who first understood
that t and τ, as well as x and ξ, are equally correct and thus equally valid descriptions of
space and time.
The Lorentz transformation describes the change of viewpoint from one inertial frame
to a second, moving one. This change of viewpoint is called a (Lorentz) boost. The formu-
lae (113) and (114) for the boost are central to the theories of relativity, both special and
general. In fact, the mathematics of special relativity will not get more difficult than that:
if you know what a square root is, you can study special relativity in all its beauty.
Many alternative formulae for boosts have been explored, such as expressions in which
Ref. 279 the relative acceleration of the two observers is included, as well as the relative velocity.
What is space-time?
Von Stund’ an sollen Raum für sich und Zeit für
The Lorentz transformations tell us something important: that space and time are two
aspects of the same basic entity. They ‘mix’ in different ways for different observers. This
fact is commonly expressed by stating that time is the fourth dimension. This makes sense
”
because the common basic entity – called space-time – can be defined as the set of all
events, events being described by four coordinates in time and space, and because the set
Challenge 602 n of all events has the properties of a manifold.**** (Can you confirm this?)
In other words, the existence of a maximum speed in nature forces us to introduce
a space-time manifold for the description of nature. In the theory of special relativity,
the space-time manifold is characterized by a simple property: the space-time interval di
Ref. 280 between two nearby events, defined as
Copyright © Christoph Schiller November 1997–January 2007
v2
di 2 = c 2 dt 2 − dx 2 − dy 2 − dz 2 = c 2 dt 2 (1 − ), (115)
c2
time diagram.
Clearly, two events can be placed in sequence only if one event is the cause of the other.
But this connection can only apply if the events exchange energy (e.g. through a signal).
In other words, a relation of cause and effect between two events implies that energy or
signals can travel from one event to the other; therefore, the speed connecting the two
events must not be larger than the speed of light. Figure 159 shows that event E at the
origin of the coordinate system can only be influenced by events in quadrant IV (the past
light cone, when all space dimensions are included), and can itself influence only events
* Hermann Minkowski (1864–1909), German mathematician. He had developed similar ideas to Einstein,
but the latter was faster. Minkowski then developed the concept of space-time. Minkowski died suddenly at
the age of 44.
304 ii special rel ativity • 5. speed, rest and light
time t
ne
lig
II
lig
ne
T future T
co
ht
ht
co
future
ht
co
co
ht
lig
ne
ne
lig
III I
E elsewhere E elsewhere
elsewhere space y
IV x
past
past
F I G U R E 159 A space-time diagram for a moving object T seen from an inertial observer O in the case
in quadrant II (the future light cone). Events in quadrants I and III neither influence nor
are influenced by event E. The light cone defines the boundary between events that can
be ordered with respect to their origin – namely those inside the cone – and those that
cannot – those outside the cones, happening elsewhere for all observers. (Some people
call all the events happening elsewhere the present.) So, time orders events only partially.
For example, for two events that are not causally connected, their temporal order (or their
simultaneity) depends on the observer!
In particular, the past light cone gives the complete set of events that can influence
what happens at the origin. One says that the origin is causally connected only to the past
light cone. This statement reflects the fact that any influence involves transport of energy,
and thus cannot travel faster than the speed of light. Note that causal connection is an
invariant concept: all observers agree on whether or not it applies to two given events.
Challenge 604 n Can you confirm this?
A vector inside the light cone is called timelike; one on the light cone is called lightlike
or null; and one outside the cone is called spacelike. For example, the world-line of an
observer, i.e. the set of all events that make up its past and future history, consists of time- Copyright © Christoph Schiller November 1997–January 2007
like events only. Time is the fourth dimension; it expands space to space-time and thus
‘completes’ space-time. This is the relevance of the fourth dimension to special relativity,
no more and no less.
Special relativity thus teaches us that causality and time can be defined only because
light cones exist. If transport of energy at speeds faster than that of light did exist, time
could not be defined. Causality, i.e. the possibility of (partially) ordering events for all
observers, is due to the existence of a maximal speed.
If the speed of light could be surpassed in some way, the future could influence the past.
Challenge 605 n Can you confirm this? In such situations, one would observe acausal effects. However,
there is an everyday phenomenon which tells that the speed of light is indeed maximal:
our memory. If the future could influence the past, we would also be able to remember the
future. To put it in another way, if the future could influence the past, the second principle
curiosities of special rel ativity 305
of thermodynamics would not be valid and our memory would not work.* No other data
from everyday life or from experiments provide any evidence that the future can influence
the past. In other words, time travel to the past is impossible. How the situation changes in
quantum theory will be revealed later on. Interestingly, time travel to the future is possible,
as we will see shortly.
The distance d is larger than ct already for v A 0.71c, and, if v is chosen large enough, it
increases beyond all bounds! In other words, relativity does not limit the distance we can
travel in a lifetime, and not even the distance we can travel in a single second. We could,
in principle, roam the entire universe in less than a second. In situations such as these it
makes sense to introduce the concept of proper velocity w, defined as
w = d~t = » =γv.
v
(117)
1 − v 2 ~c 2
As we have just seen, proper velocity is not limited by the speed of light; in fact the proper
velocity of light itself is infinite.**
apart, even if they are at rest with respect to each other in an inertial reference frame. For
example, if we have two similar watches showing the same time, and if we carry one of
them for a walk and back, they will show different times afterwards. This experiment has
* Another related result is slowly becoming common knowledge. Even if space-time had a nontrivial shape,
such as a cylindrical topology with closed time-like curves, one still would not be able to travel into the
Ref. 281 past, in contrast to what many science fiction novels suggest. This is made clear by Stephen Blau in a recent
pedagogical paper.
** Using proper velocity, the relation given in equation (109) for the superposition of two velocities wa = γ a va
Challenge 607 e and wb = γ b vb simplifies to
w sY = γ a γ b (v a + v bY ) and w sÙ = w bÙ , (118)
where the signs Y and Ù designate the component in the direction of and the component perpendicular to
Ref. 282 va , respectively. One can in fact express all of special relativity in terms of ‘proper’ quantities.
306 ii special rel ativity • 5. speed, rest and light
first
twin
trip of
Earth second twin
time time
comparison
and
first change of
twin rocket
Ref. 283, Ref. 284 actually been performed several times and has fully confirmed the prediction of special
relativity. The time difference for a person or a watch in an aeroplane travelling around the
Earth once, at about 900 km~h, is of the order of 100 ns – not very noticeable in everyday
life. In fact, the delay is easily calculated from the expression
=γ.
t
(119)
t′
Human bodies are clocks; they show the elapsed time, usually called age, by various
changes in their shape, weight, hair colour, etc. If a person goes on a long and fast trip, on
her return she will have aged less than a second person who stayed at her (inertial) home.
The most famous illustration of this is the famous twin paradox (or clock paradox).
An adventurous twin jumps on a relativistic rocket that leaves Earth and travels for many
years. Far from Earth, he jumps on another relativistic rocket going the other way and
returns to Earth. The trip is illustrated in Figure 160. At his arrival, he notes that his
twin brother on Earth is much older than himself. Can you explain this result, especially
Copyright © Christoph Schiller November 1997–January 2007
the asymmetry between the two brothers? This result has also been confirmed in many
Ref. 285 experiments.
Special relativity thus confirms, in a surprising fashion, the well-known observation
that those who travel a lot remain younger. The price of the retained youth is, however,
that everything around one changes very much more quickly than if one is at rest with
the environment.
The twin paradox can also be seen as a confirmation of the possibility of time travel
to the future. With the help of a fast rocket that comes back to its starting point, we can
arrive at local times that we would never have reached within our lifetime by staying
home. Alas, we can never return to the past.*
Ref. 286 * There are even special books on time travel, such as the well researched text by Nahin. Note that the concept
curiosities of special rel ativity 307
Ref. 288 made by Einstein himself, is incorrect. The centrifugal acceleration leads to a reduction in
gravitational acceleration that exactly cancels the increase due to the velocity. This story
serves as a reminder to be careful when applying special relativity in situations involving
gravity. Special relativity is only applicable when space-time is flat, not when gravity is
present.
In short, a mother can stay younger than her daughter. We can also conclude that we
cannot synchronize clocks at rest with respect to each other simply by walking, clock in
hand, from one place to another. The correct way to do so is to exchange light signals.
Challenge 610 n Can you describe how?
of time travel has to be clearly defined; otherwise one has no answer to the clerk who calls his office chair a
time machine, as sitting on it allows him to get to the future.
308 ii special rel ativity • 5. speed, rest and light
observations
observations by the pilot
by the farmer
pilot
time
farmer
time
plane ends
Length contraction
The length of an object measured by an observer attached to the object is called its proper Copyright © Christoph Schiller November 1997–January 2007
length. According to special relativity, the length measured by an inertial observer passing
by is always smaller than the proper length. This result follows directly from the Lorentz
Challenge 612 e transformations.
For a Ferrari driving at 300 km~h or 83 m~s, the length is contracted by 0.15 pm: less
than the diameter of a proton. Seen from the Sun, the Earth moves at 30 km~s; this gives
a length contraction of 6 cm. Neither of these effects has ever been measured. But larger
effects could be. Let us explore some examples.
Imagine a pilot flying through a barn with two doors, one at each end. The plane
is slightly longer than the barn, but moves so rapidly that its relativistically contracted
length is shorter than the length of the barn. Can the farmer close the barn (at least for
a short time) with the plane completely inside? The answer is positive. But why can the
pilot not say the following: relative to him, the barn is contracted; therefore the plane
curiosities of special rel ativity 309
ski ski
h
trap trap
v
B rope F
does not fit inside the barn? The answer is shown in Figure 162. For the farmer, the doors
close (and reopen) at the same time. For the pilot, they do not. For the farmer, the pilot is
in the dark for a short time; for the pilot, the barn is never dark. (That is not completely
Challenge 613 n true: can you work out the details?)
We now explore some variations of the general case. Can a rapid snowboarder fall into
a hole that is a bit shorter than his board? Imagine him boarding so fast that the length
contraction factor γ = d~d ′ is 4.* For an observer on the ground, the snowboard is four
times shorter, and when it passes over the hole, it will fall into it. However, for the boarder,
it is the hole which is four times shorter; it seems that the snowboard cannot fall into it.
Ref. 289 More careful analysis shows that, in contrast to the observation of the hole digger, the
snowboarder does not experience the board’s shape as fixed: while passing over the hole,
the boarder observes that the board takes on a parabolic shape and falls into the hole,
Challenge 615 e as shown in Figure 163. Can you confirm this? In other words, shape is not an observer-
invariant concept. (However, rigidity is observer-invariant, if defined properly; can you
Challenge 616 n confirm this?)
This explanation, though published, is not correct, as Harald van Lintel and Christian
Copyright © Christoph Schiller November 1997–January 2007
Ref. 290 Gruber have pointed out. One should not forget to estimate the size of the effect. At re-
lativistic speeds the time required for the hole to affect the full thickness of the board
cannot be neglected. The snowboarder only sees his board take on a parabolic shape if
it is extremely thin and flexible. For usual boards moving at relativistic speeds, the snow-
Challenge 617 ny boarder has no time to fall any appreciable height h or to bend into the hole before passing
it. Figure 163 is so exaggerated that it is incorrect. The snowboarder would simply speed
over the hole.
The paradoxes around length contraction become even more interesting in the case of
Ref. 291 a conductive glider that makes electrical contact between two rails, as shown in Figure 164.
The two rails are parallel, but one rail has a gap that is longer than the glider. Can you
Challenge 614 n * Even the Earth contracts in its direction of motion around the Sun. Is the value measurable?
310 ii special rel ativity • 5. speed, rest and light
work out whether a lamp connected in series stays lit when the glider moves along the
Challenge 618 n rails with relativistic speed? (Make the simplifying and not fully realistic assumption that
electrical current flows as long and as soon as the glider touches the rails.) Do you get
the same result for all observers? And what happens when the glider is longer than the
detour? (Warning: this problem gives rise to heated debates!) What is unrealistic in this
experiment?
Ref. 292 Another example of length contraction appears when two objects, say two cars, are
connected over a distance d by a straight rope, as shown in Figure 165 Imagine that both
are at rest at time t = 0 and are accelerated together in exactly the same way. The observer
» the same distance apart. On the other hand,
at rest will maintain that the two cars remain
the rope needs to span a distance d = d~ 1 − v 2 ~c 2 , and thus has to expand when the
′
two cars are accelerating. In other words, the rope will break. Is this prediction confirmed
Challenge 619 n by observers on each of the two cars?
A funny – but quite unrealistic – example of length contraction is that of a submarine
F I G U R E 166 Flying through twelve vertical columns (shown in the two uppermost images) with 0.9
times the speed of light as visualized by Nicolai Mokros and Norbert Dragon, showing the effect of
speed and position on distortions (© Nicolai Mokros)
312 ii special rel ativity • 5. speed, rest and light
F I G U R E 167 Flying through three straight and vertical columns with 0.9 times the speed of light as
visualized by Daniel Weiskopf: on the left with the original colours; in the middle including the
Doppler effect; and on the right including brightness effects, thus showing what an observer would
actually see (© Daniel Weiskopf )
F I G U R E 168 What a researcher standing and one running rapidly through a corridor observe
(ignoring colour effects) (© Daniel Weiskopf )
We note that the shape of the image seen by a moving observer is a distorted version
of that seen by one at rest at the same point. A moving observer, however, does not see
different things than a resting one at the same point. Indeed, light cones are independent
of observer motion.
The Lorentz contraction is measurable; however, it cannot be photographed. This sur-
Ref. 294 prising distinction was discovered only in 1959. Measuring implies simultaneity at the ob-
ject’s position; photographing implies simultaneity at the observer’s position. On a photo-
graph, the Lorentz contraction is modified by the effects due to different light travel times
curiosities of special rel ativity 313
from the different parts of an object; the result is a change in shape that is reminiscent of,
but not exactly the same as, a rotation. The total deformation is an angle-dependent aber-
Page 285 ration. We discussed aberration at the beginning of this section. Aberration transforms
circles into circles: such transformations are called conformal.
The images of Figure 168, produced by Daniel Weiskopf, also include the Doppler ef-
fect and the brightness changes. They show that these effects are at least as striking as the
distortion due to aberration.
This leads to the ‘pearl necklace paradox’. If the relativistic motion transforms spheres
into spheres, and rods into shorter rods, what happens to a pearl necklace moving along
Challenge 622 n its own long axis? Does it get shorter?
There is much more to be explored using relativistic films. For example, the author
Challenge 623 r predicts that films of rapidly rotating spheres in motion will reveal interesting effects.
Also in this case, optical observation and measurement results will differ. For certain
combinations of relativistic rotations and relativistic boosts, it is predicted* that the sense
which means that the front twin has aged more than the back twin! Thus, in accelerated
systems, ageing is position-dependent.
Copyright © Christoph Schiller November 1997–January 2007
For choosing a seat in a bus, though, this result does not help. It is true that the best
seat in an accelerating bus is the back one, but in a decelerating bus it is the front one. At
the end of a trip, the choice of seat does not matter.
Is it correct to deduce that people on high mountains age faster than people in valleys,
Challenge 626 n so that living in a valley helps postponing grey hair?
* In July 2005.
314 ii special rel ativity • 5. speed, rest and light
moving t'
judge
J light signal
x' moving
judge
J
light signal x'
F I G U R E 169 For the athlete on the left, the judge moving in the opposite
direction sees both feet off the ground at certain times, but not for the
athlete on the right
they are disqualified if they break it. A student athlete was thinking about the theoretical
maximum speed he could achieve in the Olympics. The ideal would be that each foot
accelerates instantly to (almost) the speed of light. The highest walking speed is achieved
by taking the second foot off the ground at exactly the same instant at which the first is put
down. By ‘same instant’, the student originally meant ‘as seen by a competition judge at
rest with respect to Earth’. The motion of the feet is shown in the left diagram of Figure 169;
it gives a limit speed for walking of half the speed of light. But then the student noticed
that a moving judge will see both feet off the ground and thus disqualify the athlete for
Ref. 295 running. To avoid disqualification by any judge, the second foot has to wait for a light
signal from the first. The limit speed for Olympic walking is thus only one third of the
speed of light.
Is the speed of shadow greater than the speed of light? Copyright © Christoph Schiller November 1997–January 2007
Actually, motion faster than light does exist and is even rather common. Special relativity
only constrains the motion of mass and energy. However, non-material points or non-
energy-transporting features and images can move faster than light. There are several
Page 305 simple examples. To be clear, we are not talking about proper velocity, which in these
Challenge 627 n cases cannot be defined anyway. (Why?)
The following examples show speeds that are genuinely higher than the speed of light
in vacuum.
Consider the point marked X in Figure 170, the point at which scissors cut paper. If
the scissors are closed rapidly enough, the point moves faster than light. Similar examples
can also be found in every window frame, and in fact in any device that has twisting parts.
Another example of superluminal motion is a music record – an old-fashioned LP –
disappearing into its sleeve, as shown in Figure 171. The point where the edge of the record
curiosities of special rel ativity 315
The Beatles
The Beatles
v
X
The Beatles
meets the edge of the sleeve can travel faster than light.
Another example suggests itself when we remember that we live on a spherical planet.
Imagine you lie on the floor and stand up. Can you show that the initial speed with which
Challenge 628 n the horizon moves away from you can be larger than that of light?
Finally, a standard example is the motion of a spot of light produced by shining a laser
beam onto the Moon. If the laser is moved, the spot can easily move faster than light. The
same applies to the light spot on the screen of an oscilloscope when a signal of sufficiently
high frequency is fed to the input.
All these are typical examples of the speed of shadows, sometimes also called the speed
of darkness. Both shadows and darkness can indeed move faster than light. In fact, there
Challenge 629 n is no limit to their speed. Can you find another example?
In addition, there is an ever-increasing number of experimental set-ups in which the Copyright © Christoph Schiller November 1997–January 2007
phase velocity or even the group velocity of light is higher than c. They regularly make
headlines in the newspapers, usually along the lines of ‘light moves faster than light’. We
Page 591 will discuss this surprising phenomenon in more detail later on. In fact, these cases can
also be seen – with some imagination – as special cases of the ‘speed of shadow’ phe-
nomenon.
For a different example, imagine we are standing at the exit of a tunnel of length l. We
see a car, whose speed we know to be v, entering the other end of the tunnel and driving
towards us. We know that it entered the tunnel because the car is no longer in the Sun or
because its headlights were switched on at that moment. At what time t, after we see it
entering the tunnel, does it drive past us? Simple reasoning shows that t is given by
time
observer
emitted or reflected light
tachyon
light cone
v appr = =
l vc
, (122)
t c−v
which is higher than c for any car velocity v higher than c~2. For cars this does not happen
too often, but astronomers know a type of bright object in the sky called a quasar (a
contraction of ‘quasi-stellar object’), which sometimes emits high-speed gas jets. If the
emission is in or near the direction of the Earth, its apparent speed – even the purely
transverse component – is higher than c. Such situations are now regularly observed with
Ref. 296 telescopes.
Note that to a second observer at the entrance of the tunnel, the apparent speed of the
car moving away is given by
v leav =
vc
, (123)
c+v
Copyright © Christoph Schiller November 1997–January 2007
which is never higher than c~2. In other words, objects are never seen departing with
more than half the speed of light.
The story has a final twist. We have just seen that motion faster than light can be ob-
served in several ways. But could an object moving faster than light be observed at all?
Surprisingly, it could be observed only in rather unusual ways. First of all, since such
an imaginary object, usually called a tachyon, moves faster than light, we can never see
it approaching. If it can be seen at all, a tachyon can only be seen departing. Seeing a
tachyon would be similar to hearing a supersonic jet. Only after a tachyon has passed
nearby, assuming that it is visible in daylight, could we notice it. We would first see a
flash of light, corresponding to the bang of a plane passing with supersonic speed. Then
we would see two images of the tachyon, appearing somewhere in space and departing
curiosities of special rel ativity 317
R v G
u
O w
in opposite directions, as can be deduced from Figure 172. Even if one of the two images
were approaching us, it would be getting fainter and smaller. This is, to say the least, rather
unusual behaviour. Moreover, if you wanted to look at a tachyon at night, illuminating
observers are in different directions, as is the case when the velocity vectors form a loop.
The simplest set-up is shown in Figure 173. In special relativity, a general concatenation
Ref. 298 of pure boosts does not give a pure boost, but a boost plus a rotation. As a result, the
endpoints of chains of parallel sticks are usually not parallel.
An example of this effect appears in rotating motion. If we walk in a fast circle holding
a stick, always keeping the stick parallel to the direction it had just before, at the end of the
circle the stick will have an angle with respect to the original direction. Similarly, the axis
of a rotating body circling a second body will not be pointing in the same direction after
one turn. This effect is called Thomas precession, after Llewellyn Thomas, who discovered
it in 1925, a full 20 years after the birth of special relativity. It had escaped the attention of
dozens of other famous physicists. Thomas precession is important in the inner working
of atoms; we will return to it in a later section of our adventure. These surprising phenom-
318 ii special rel ativity • 5. speed, rest and light
ena are purely relativistic, and are thus measurable only in the case of speeds comparable
to that of light.
A moving observer thus always measures lower values than a resting one.
In 1908, Max Planck used this expression, together with the corresponding transform-
ation for heat, to deduce that the entropy is invariant under Lorentz transformations.
Being the discoverer of the Boltzmann constant k, Planck proved in this way that the
Mass in relativity
Page 77 In Galilean physics, the mass ratio between two bodies was defined using collisions; it
was given by the negative inverse of the velocity change ratio
=−
m2 ∆v 1
. (125)
m1 ∆v 2
However, experiments show that the expression must be different for speeds near that of
Challenge 631 ny light. In fact, experiments are not needed: thinking alone can show this. Can you do so?
rel ativistic mechanics 319
There is only one solution to this problem. The two Galilean conservation theorems
Ref. 300 P i m i v i = const for momentum and P i m i = const for mass have to be changed into
Q γ i m i v i = const (126)
i
and
Q γ i m i = const . (127)
i
These expressions, which will remain valid throughout the rest of our ascent of Motion
Mountain, imply, among other things, that teleportation is not possible in nature. (Can
Challenge 632 n you confirm this?) Obviously, in order to recover Galilean physics, the relativistic correc-
tion (factors) γ i have to be almost equal to 1 for everyday velocities, that is, for velocities
nowhere near the speed of light.
γv = »
1 M
. (129)
1 − v 2 ~c 2 F I G U R E 174 An inelastic collision of two
identical particles seen from two
different inertial frames of reference
With this expression, and a generalization of
the situation of Galilean physics, the mass ratio
between two colliding particles is defined as the ratio
Copyright © Christoph Schiller November 1997–January 2007
∆(γ 2 v 2 )
=−
m1
.
∆(γ 1 v 1 )
(130)
m2
(We do not give here the generalized mass definition, mentioned in the chapter on Ga-
Page 80 lilean mechanics, that is based on acceleration ratios, because it contains some subtleties,
which we will discover shortly.) The correction factors γ i ensure that the mass defined
by this equation is the same as the one defined in Galilean mechanics, and that it is the
same for all types of collision a body may have.* In this way, mass remains a quantity
Challenge 634 e * The results below also show that γ = 1 + T~mc 2 , where T is the kinetic energy of a particle.
320 ii special rel ativity • 5. speed, rest and light
before
pA
A B
after
θ
pA ϕ
p = γmv (131)
the (linear) relativistic (three-) momentum of a particle. Again, the total momentum is a
conserved quantity for any system not subjected to external influences, and this conserva-
tion is a direct consequence of the way mass is defined.
For low speeds, or γ 1, relativistic momentum is the same as that of Galilean physics,
and is proportional to velocity. But for high speeds, momentum increases faster than
velocity, tending to infinity when approaching light speed.
where the angles are defined in Figure 176. It follows that the sum φ + θ is smaller than
a right angle in the relativistic case. Relativistic speeds thus completely change the game
of snooker. Indeed, every accelerator physicist knows this: for electrons or protons, these
angles can easily be deduced from photographs taken in cloud chambers, which show
the tracks left by particles when they move through them. All such photographs confirm
Ref. 267 the above expression. In fact, the shapes of detectors are chosen according to expression
(132), as sketched in Figure 176. If the formula – and relativity – were wrong, most of these
detectors would not work, as they would miss most of the particles after the collision. In
rel ativistic mechanics 321
θ
ϕ
F I G U R E 176 The dimensions of detectors in particle accelerators are based on the relativistic
snooker angle rule
fact, these experiments also prove the formula for the composition of velocities. Can you
In other words, the mass of the final system is larger than the sum of the two original
masses m. In contrast to Galilean mechanics, the sum of all masses in a system is not a
conserved quantity. Only the sum P i γ i m i of the corrected masses is conserved.
Relativity provides the solution to this puzzle. Everything falls into place if, for the
energy E of an object of mass m and velocity v, we use the expression
mc 2
E = γmc 2 = » , (134)
1 − v 2 ~c 2
applying it both to the total system and to each component. The conservation of the cor-
rected mass can then be read as the conservation of energy, simply without the factor c 2 . Copyright © Christoph Schiller November 1997–January 2007
In the example of the two identical masses sticking to each other, the two particles are
thus each described by mass and energy, and the resulting system has an energy E given
by the sum of the energies of the two particles. In particular, it follows that the energy E 0
of a body at rest and its mass m are related by
E 0 = mc 2 , (135)
which is perhaps the most beautiful and famous discovery of modern physics. Since c 2 is
so large, we can say that mass is concentrated energy. In other words, special relativity says
that every mass has energy, and that every form of energy in a system has mass. Increasing
the energy of a system increases its mass, and decreasing the energy content decreases the
mass. In short, if a bomb explodes inside a closed box, the mass, weight and momentum
322 ii special rel ativity • 5. speed, rest and light
of the box are the same before and after the explosion, but the combined mass of the
debris inside the box will be smaller than before. All bombs – not only nuclear ones –
thus take their energy from a reduction in mass. In addition, every action of a system –
such a caress, a smile or a look – takes its energy from a reduction in mass.
The kinetic energy T is thus given by
1 ë 3 v4 1 ë 3 ë 5 v6
T = γmc 2 − mc 2 = mv 2 +
1
m + m + ... (136)
2 2 ë 4 c2 2 ë 4 ë 6 c4
Challenge 638 e (using the binomial theorem) which reduces to the Galilean value only for low speeds.
The mass–energy equivalence E = γmc 2 implies that taking any energy from matter
results in a mass decrease. When a person plays the piano, thinks or runs, its mass de-
creases. When a cup of tea cools down or when a star shines, its mass decreases. The
mass–energy equivalence pervades all of nature.
The mass–energy equivalence has been confirmed by all experiments performed so far.
The measurement is simplest for the nuclear mass defect. The most precise experiment,
from 2005, confirmed the mass–energy relation to more than 6 digits, by comparing the
masses difference of nuclei before and after neutron capture one one hand, and emitted
Ref. 301 gamma ray energy on the other hand.
Modern methods of mass measurement of single molecules have even made it pos-
sible to measure the chemical mass defect, by comparing the mass of a single molecule
with that of its constituent atoms. David Pritchard’s group has developed so-called Pen-
ning traps, which allow masses to be determined from the measurement of frequencies;
* There may be two extremely diluted, yet undiscovered, form of energy, called dark matter and (confusingly)
Page 450 dark energy, scattered throughout the universe. They are deduced from (quite difficult) mass measurements.
The issue has not yet been finally resolved.
rel ativistic mechanics 323
E 0 = mc 2 . (137)
m 2 c 4 = E 2 − p2 c 2 (138)
for all relativistic systems, be they objects or, as we will see below, radiation. For the mo-
mentum vector we get the other important relation
p=
E
v, (139)
c2
Challenge 642 e which is equally valid for any type of moving energy, be it an object or a beam or a pulse of
radiation.* We will use both relations often in the rest of our ascent of Motion Mountain,
including the following discussion.
mentum are intrinsic consequences of the definition of mass. Let us now have a look at
collisions in more detail, using these new concepts. A collision is a process, i.e. a series of
events, for which
— the total momentum before the interaction and after the interaction is the same;
— the momentum is exchanged in a small region of space-time;
— for small velocities, the Galilean description is valid.
In everyday life an impact, i.e. a short-distance interaction, is the event at which both
objects change momentum. But the two colliding objects are located at different points
Ref. 303 when this happens. A collision is therefore described by a space-time diagram such as the
time t τ
E'2 p'2
E'1 p'1
E
p
E2 p2
E1 p1
object 1
object 2 object 1 object 2
space x ξ
1 − v 1 v 1′
m 2 c 4 = (E 1 − E 1′ )2 − (p 1 − p′1 )2 c 2 = 2m 12 c 4 − 2E 1 E 1′ ( )<0. (140)
c2
This is a strange result, because it means that the unknown mass is an imaginary num-
ber!!* On top of that, we also see directly from the second graph that the exchanged object
moves faster than light. It is a tachyon, from the Greek ταχύς ‘rapid’. In other words, colli-
sions involve motion that is faster than light! We will see later that collisions are indeed the
only processes where tachyons play a role in nature. Since the exchanged objects appear
Copyright © Christoph Schiller November 1997–January 2007
only during collisions, never on their own, they are called virtual objects, to distinguish
them from the usual, real objects, which can move freely without restriction.** We will
study their properties later on, when we come to discuss quantum theory.
»
* It is usual to change the mass–energy and mass–momentum relation of tachyons to E = mc 2 ~ v 2 ~c 2 − 1
»
and p = mv~ v 2 ~c 2 − 1 ; this amounts to a redefinition of m. After the redefinition, tachyons have real
mass. The energy and momentum relations show that tachyons lose energy and momentum when they get
faster. (Provocatively, a single tachyon in a box could provide us with all the energy we need.) Both signs
for the energy and momentum relations must be retained, because otherwise the equivalence of all inertial
observers would not be generated. Tachyons thus do not have a minimum energy or a minimum momentum.
** More precisely, a virtual particle does not obey the relation m 2 c 4 = E 2 − p2 c 2 , valid for real particles.
rel ativistic mechanics 325
In nature, a tachyon is always a virtual object. Real objects are always bradyons – from
the Greek βραδύς ‘slow’ – or objects moving slower than light. Note that tachyons, despite
their high velocity, do not allow the transport of energy faster than light; and that they
do not violate causality if and only if they are emitted or absorbed with equal probability.
Challenge 645 ny Can you confirm all this?
When we study quantum theory, we will also discover that a general contact interac-
tion between objects is described not by the exchange of a single virtual object, but by
a continuous stream of virtual particles. For standard collisions of everyday objects, the
interaction turns out to be electromagnetic. In this case, the exchanged particles are vir-
tual photons. In other words, when one hand touches another, when it pushes a stone, or
when a mountain supports the trees on it, streams of virtual photons are continuously
Page 750 exchanged.
There is an additional secret hidden in collisions. In the right-hand side of Figure 177,
the tachyon is emitted by the first object and absorbed by the second one. However, it
practically equal; only at large relative speeds, typically at more than a few per cent of the
speed of light, is there a noticeable difference. Most such situations are microscopic. We
have already mentioned the electrons inside a television tube or inside a particle accel-
erator. The particles making up cosmic radiation are another example: their high energy
has produced many of the mutations that are the basis of evolution of animals and plants
on this planet. Later we will discover that the particles involved in radioactivity are also
relativistic.
But why don’t we observe any rapid macroscopic bodies? Moving bodies, including
observers, with relativistic velocities have a property not found in everyday life: when
they are involved in a collision, part of their energy is converted into new matter via
E = γmc 2 . In the history of the universe this has happened so many times that practically
all the bodies still in relativistic motion are microscopic particles.
326 ii special rel ativity • 5. speed, rest and light
A CM-0 B
v v
transformed CM
A CM-1 B
v=0 v 2v/(1+v2/c2 )
geometrical CM
A CM-2 B
v=0 2 2 2v/(1+v2/c2 )
v/(1+v /c )
A second reason for the disappearance of rapid relative motion is radiation damping.
Challenge 647 n Can you imagine what happens to charges during collisions, or in a bath of light?
In short, almost all matter in the universe moves with small velocity relative to other
matter. The few known counter-examples are either very old, such as the quasar jets men-
tioned above, or stop after a short time. The huge energies necessary for macroscopic
relativistic motion are still found in supernova explosions, but they cease to exist after
only a few weeks. In summary, the universe is mainly filled with slow motion because it
Page 457 is old. We will determine its age shortly.
Albert Einstein took several months after his first paper on special relativity to deduce
the expression
E = γmc 2 (141)
which is often called the most famous formula of physics. He published it in a second,
Ref. 256 separate paper towards the end of 1905. Arguably, the formula could have been discovered
thirty years earlier, from the theory of electromagnetism.
In fact, at least one person did deduce the result before Einstein. In 1903 and 1904,
before Einstein’s first relativity paper, a little-known Italian engineer, Olinto De Pretto,
was the first to calculate, discuss and publish the formula E = mc 2 .* It might well be that
* Umberto Bartocci, mathematics professor of the University of Perugia in Italy, published the details of
rel ativistic mechanics 327
lig
ne
future T
ht
co
co
ht
ne
lig
E elsewhere y
past
Einstein got the idea for the formula from De Pretto, possibly through Einstein’s friend
Michele Besso or other Italian-speaking friends he met when he visited his parents, who
were living in Italy at the time. Of course, the value of Einstein’s efforts is not diminished
by this.
Ref. 305 In fact, a similar formula had also been deduced in 1904 by Friedrich Hasenöhrl and
published again in Annalen der Physik in 1905, before Einstein, though with an incorrect
numerical prefactor, due to a calculation mistake. The formula E = mc 2 is also part of
several expressions in two publications in 1900 by Henri Poincaré. The real hero in the
story might well be Tolver Preston, who discussed the equivalence of mass and energy
already in 1875, in his book Physics of the Ether. The mass–energy equivalence was thus
indeed floating in the air, only waiting to be discovered.
In the 1970s, there was a similar story: a simple relation between the gravitational
acceleration and the temperature of the vacuum was discovered. The result had been
waiting to be discovered for over 50 years. Indeed, a number of similar, anterior results
were found in the libraries. Could other simple relations be hidden in modern physics
Challenge 648 n waiting to be found?
4-vectors
Copyright © Christoph Schiller November 1997–January 2007
To describe motion consistently for all observers, we have to introduce some new quant-
ities. First of all, motion of particles is seen as a sequence of events. To describe events
with precision, we use event coordinates, also called 4-coordinates. These are written as
this surprising story in several papers. The full account is found in his book Umberto Bartocci, Albert
Einstein e Olinto De Pretto: la vera storia della formula più famosa del mondo, Ultreja, 1998.
328 ii special rel ativity • 5. speed, rest and light
called X 1 = x, the second, X 2 = y, and the third, X 3 = z. One can then define a distance d
between events as the length of the difference vector. In fact, one usually uses the square
of the length, to avoid those unwieldy square roots. In special relativity, the magnitude
(‘squared length’) of a vector is always defined through
XX = X 0 2 − X 1 2 − X 2 2 − X 3 2 = ct 2 − x 2 − y 2 − z 2 = X a X a = η ab X a X b = η ab X a X b .(143)
In this equation we have introduced for the first time two notations that are useful in
relativity. First of all, we automatically sum over repeated indices. Thus, X a X a means the
sum of all products X a X a as a ranges over all indices. Secondly, for every 4-vector X
we distinguish two ways to write the coordinates, namely coordinates with superscripts
and coordinates with subscripts. (In three dimensions, we only use subscripts.) They are
related by the following general relation
1 0 0 0
= η ab = .
0 −1 0 0
ab
η (145)
0 0 −1 0
0 0 0 −1
Don’t panic: this is all, and it won’t get more difficult! We now go back to physics.
The magnitude of a position or distance vector, also called the space-time interval, is
essentially the proper time times c. The proper time is the time shown by a clock moving
in a straight line and with constant velocity from the starting point to the end point in
space-time. The difference from the usual 3-vectors is that the magnitude of the interval
can be positive, negative or even zero. For example, if the start and end points in space-
time require motion with the speed of light, the proper time is zero (this is required for
Page 304 null vectors). If the motion is slower than the speed of light, the squared proper time is
positive and the distance is timelike. For negative intervals and thus imaginary proper
times, the distance is spacelike.** A simplified overview is given by Figure 179.
Now we are ready to calculate and measure motion in four dimensions. The meas-
Copyright © Christoph Schiller November 1997–January 2007
urements are based on one central idea. We cannot define the velocity of a particle as
the derivative of its coordinates with respect to time, since time and temporal sequences
depend on the observer. The solution is to define all observables with respect to the just-
mentioned proper time τ, which is defined as the time shown by a clock attached to the ob-
ject. In relativity, motion and change are always measured with respect to clocks attached
to the moving system. In particular, the relativistic velocity or 4-velocity U of a body is
* Note that 30 % of all physics textbooks use the negative of η as the metric, the so-called spacelike convention,
and thus have opposite signs in this definition. In this text, as in 70 % of all physics texts, we use the timelike
convention.
** In the latter case, the negative of the magnitude, which is a positive number, is called the squared proper
distance. The proper distance is the length measured by an odometer as the object moves along.
rel ativistic mechanics 329
thus defined as the rate of change of the event coordinates or 4-coordinates X = (ct, x)
with respect to proper time, i.e. as
U = dX~dτ . (146)
The coordinates X are measured in the coordinate system defined by the inertial observer
chosen. The value of the velocity U depends on the observer or coordinate system used;
so the velocity depends on the observer, as it does in everyday life. Using dt = γ dτ and
thus
= =γ , where as usual γ = »
dx dx dt dx 1
, (147)
dτ dt dτ dt 1 − v 2 ~c 2
u 0 = γc , u i = γv i U = (γc, γv) .
For small velocities we have γ 1, and then the last three components of the 4-velocity
are those of the usual, Galilean 3-velocity. For the magnitude of the 4-velocity U we find
UU = U a U a = η ab U a U b = c 2 , which is therefore independent of the magnitude of the
3-velocity v and makes it a timelike vector, i.e. a vector inside the light cone.*
Note that the magnitude of a 4-vector can be zero even though all its components
are different from zero. Such a vector is called null. Which motions have a null velocity
Challenge 650 n vector?
Similarly, the relativistic acceleration or 4-acceleration B of a body is defined as
Using dγ~dτ = γdγ~dt = γ 4 va~c 2 , we get the following relations between the four com-
Ref. 306 ponents of B and the 3-acceleration a = dv~dt:
(va)v i
B0 = γ 4 B i = γ2 a i + γ4
va
, . (151)
c c2
h′0 = γ V (h 0 − h 1 V ~c)
h 1 = γ V (h 1 − h 0 V ~c)
′
h′2 = h 2
′
h3 = h3 (149)
when changing from one inertial observer to another moving with a relative velocity V in the x direction; the
corresponding generalizations for the other coordinates are understood. This relation allows one to deduce
the transformation laws for any 3-vector. Can you deduce the velocity composition formula (109) from this
Challenge 649 n definition, applying it to 4-velocity?
330 ii special rel ativity • 5. speed, rest and light
time
(E/c , p)
space
4-momentum
To describe motion, we also need the concept of momentum. The 4-momentum is defined
as
P = mU (154)
Copyright © Christoph Schiller November 1997–January 2007
Challenge 652 e For the relation with the 3-jerk j = da~dt we then get
γ5 2 (va) γ5 2 (va) v i
2 2
J = (J , J ) = (jv + a + 4γ ) , γ j i + 2 ((jv)v i + a v i + 4γ + 3(va)a i )
0 i 2 3 2
2 2
(153)
c c c c
Challenge 653 ny which we will use later on. Surprisingly, J does not vanish when j vanishes. Why not?
rel ativistic mechanics 331
For this reason 4-momentum is also called the energy–momentum 4-vector. In short, the
4-momentum of a body is given by mass times 4-displacement per proper time. This is the
simplest possible definition of momentum and energy. The concept was introduced by
Max Planck in 1906. The energy–momentum 4-vector, also called momenergy, like the
4-velocity, is tangent to the world line of a particle. This connection, shown in Figure 180,
follows directly from the definition, since
E 2 ~c 2 − p2 = m 2 c 2 , (157)
thus confirming a result given above. We have already mentioned that energies or situ-
ations are called relativistic if the kinetic energy T = E − E 0 is not negligible when com-
pared to the rest energy E 0 = mc 2 . A particle whose kinetic energy is much higher than
its rest mass is called ultrarelativistic. Particles in accelerators or in cosmic rays fall into
Challenge 654 n this category. (What is their energy–momentum relation?)
In contrast to Galilean mechanics, relativity implies an absolute zero for the energy.
One cannot extract more energy than mc 2 from a system of mass m. In particular, a zero
value for potential energy is fixed in this way. In short, relativity shows that energy is
bounded from below.
Note that by the term ‘mass’ m we always mean what is sometimes called the rest mass.
This name derives from the bad habit of many science fiction and secondary-school books
of calling the product γm the relativistic mass. Workers in the field usually (but not unan-
Ref. 307 imously) reject this concept, as did Einstein himself, and they also reject the often-heard
expression that ‘(relativistic) mass increases with velocity’. Relativistic mass and energy
would then be two words for the same concept: this way to talk is at the level of the tabloid
press. Copyright © Christoph Schiller November 1997–January 2007
Not all Galilean energy contributes to mass. Potential energy in an outside field does
not. Relativity forces us into precise energy bookkeeping. ‘Potential energy’ in relativity
is an abbreviation for ‘energy reduction of the outside field’.
Can you show that for two particles with momenta P1 and P2 , one has P1 P2 = m 1 E 2 =
Challenge 655 n M 2 E 1 = c 2 γv 12 m 1 m 2 , where v 12 is their relative velocity?
4-force
The 4-force K is defined as
K = dP~dτ = mB . (158)
332 ii special rel ativity • 5. speed, rest and light
Therefore force remains equal to mass times acceleration in relativity. From the definition
Ref. 306, Ref. 308 of K we deduce the relation with 3-force f = dp~dt = md(γv)~dt, namely*
With this clarification, we can now briefly consider rotation in relativity. The first ques-
tion is how lengths and times change in a rotating frame of reference. You may want
to check that an observer in a rotating frame agrees with a non-rotating colleague on
the radius of a rotating body; however, both find that the rotating body, even if it is ri-
Challenge 658 e gid, has a circumference different from the one it had before it started rotating. Sloppily
speaking, the value of π changes for rotating observers. The ratio between the circumfer-
Challenge 659 e ence c and the radius r turns out to be c~r = 2πγ: it increases with rotation speed. This
* Some authors define 3-force as dp~dτ; then K looks slightly different. In any case, it is important to note that
in relativity, 3-force f = dp~dt is indeed proportional to 3-acceleration a; however, force and acceleration are
Challenge 656 n not parallel to each other. In fact, for rest-mass-preserving forces one finds f = γma + (fv)v~c 2 . In contrast,
in relativity 3-momentum is not proportional to 3-velocity, although it is parallel to it.
rel ativistic mechanics 333
Ref. 309 counter-intuitive result is often called Ehrenfest’s paradox. Among other things, it shows
that space-time for an observer on a rotating disc is not the Minkowski space-time of
special relativity.
Rotating bodies behave strangely in many ways. For ex-
ample, one gets into trouble when one tries to synchronize O3 O2 O
1
clocks mounted on a rotating circle,as shown in Figure 182 On
If one starts synchronizing the clock at O2 with that at O1 , On–1
and so on, continuing up to clock On , one finds that the last
clock is not synchronized with the first. This result reflects
the change in circumference just mentioned. In fact, a care-
ful study shows that the measurements of length and time
intervals lead all observers Ok to conclude that they live in
a rotating space-time. Rotating discs can thus be used as an
introduction to general relativity, where this curvature and F I G U R E 182 Observers on a
l ab = x a pb − x b p a . (160)
In other words, 4-angular momentum is a tensor, not a vector, as shown by its two indices.
Challenge 661 ny Angular momentum is conserved in special relativity. The moment of inertia is naturally
defined as the proportionality factor between angular velocity and angular momentum.
Obviously, for a rotating particle, the rotational energy is part of the rest mass. You may
Challenge 662 ny want to calculate the fraction for the Earth and the Sun. It is not large. By the way, how
Challenge 663 ny would you determine whether a microscopic particle, too small to be seen, is rotating?
In relativity, rotation and translation combine in strange ways. Imagine a cylinder in
uniform rotation along its axis, as seen by an observer at rest. As Max von Laue has dis-
cussed, the cylinder will appear twisted to an observer moving along the rotation axis.
Challenge 664 e Can you confirm this?
Here is a last puzzle about rotation. Velocity is relative; this means that the measured
Copyright © Christoph Schiller November 1997–January 2007
Challenge 665 ny value depends on the observer. Is this the case also for angular velocity?
Wave motion
In Galilean physics, a wave is described by a wave vector and a frequency. In special
relativity, the two are combined in the wave 4-vector, given by
L= ( , n) ,
1 ω
(161)
λ c
334 ii special rel ativity • 5. speed, rest and light
where λ is the wavelength, ω the wave velocity, and n the normed direction vector. Sup-
pose an observer with 4-velocity U finds that a wave L has frequency ν. Show that
ν = LU (162)
Challenge 666 ny must be obeyed. Interestingly, the wave velocity ω transforms in a different way than
Ref. 261 particle velocity except in the case ω = c. Also the aberration formula for wave motion
Challenge 667 ny differs from that for particles, except in the case ω = c.
S = −mc 2
τ2
∫τ1
dτ , (163)
where τ is the proper time along its path. This is indeed the correct expression. It im-
plies conservation of (relativistic) energy and momentum, as the change in proper time
Challenge 668 ny is maximal for straight-line motion with constant velocity. Can you confirm this? Indeed,
in nature, all particles move in such a way that their proper time is maximal. In other
words, we again find that in nature things change as little as possible. Nature is like a
wise old man: its motions are as slow as possible. If you prefer, every change is maximally
effective. As we mentioned before, Bertrand Russell called this the law of cosmic laziness.
The expression (163) for the action is due to Max Planck. In 1906, by exploring it in
detail, he found that the quantum of action ħ, which he had discovered together with the
Boltzmann constant, is a relativistic invariant (like the Boltzmann constant k). Can you
Challenge 669 ny imagine how he did this?
The action can also be written in more complex, seemingly more frightening ways.
These equivalent ways to write it are particularly appropriate to prepare for general re-
lativity:
Copyright © Christoph Schiller November 1997–January 2007
¾
º
S= ∫ L dt = −mc ∫ dt = −mc u a u a dτ = −mc
t2 1 τ2 dx a dx bs2
2
t1 γ τ1
∫ s1
∫
ds ds
ds , η ab
(164)
where s is some arbitrary, but monotonically increasing, function of τ, such as τ itself. As
usual, the metric η α β of special relativity is
1 0 0 0
= η ab = .
0 −1 0 0
ab
η (165)
0 0 −1 0
0 0 0 −1
rel ativistic mechanics 335
You can easily confirm the form of the action (164) by deducing the equation of motion
Challenge 670 ny in the usual way.
In short, nature is not in a hurry: every object moves in a such way that its own clock
shows the longest delay possible, compared with any alternative motion nearby.* This
general principle is also valid for particles under the influence of gravity, as we will see
in the section on general relativity, and for particles under the influence of electric or
magnetic interactions. In fact, it is valid in all cases of (macroscopic) motion found in
nature. For the moment, we just note that the longest proper time is realized when the
Challenge 672 ny difference between kinetic and potential energy is minimal. (Can you confirm this?) For
the Galilean case, the longest proper time thus implies the smallest average difference
between the two energy types. We thus recover the principle of least action in its Galilean
formulation.
Page 178 Earlier on, we saw that the action measures the change going on in a system. Special re-
lativity shows that nature minimizes change by maximizing proper time. In nature, proper
xa
xa ( (167)
x2
xa ( xa + ba , (168)
* If neutrinos were massless, the action (164) would not be applicable for them. Why? Can you find an
Challenge 671 ny alternative for this (admittedly academic) case?
336 ii special rel ativity • 5. speed, rest and light
xa + ba x 2 xa xa
xa ( or ( 2 + ba . (169)
1 + 2b a x a + b 2 x 2 x 2 x
These transformations are called conformal because they do not change angles of (infin-
Challenge 674 ny itesimally) small shapes, as you may want to check. They therefore leave the form (of in-
finitesimally small objects) unchanged. For example, they transform infinitesimal circles
into infinitesimal circles. They are called special because the full conformal group includes
the dilations and the inhomogeneous Lorentz transformations as well.*
Note that the way in which special conformal transformations leave light cones invari-
Challenge 676 ny ant is rather subtle.
Since dilations do not commute with time translations, there is no conserved quantity
associated with this symmetry. (The same is true of Lorentz boosts.) In contrast, rotations
Challenge 675 ny * The set of all special conformal transformations forms a group with four parameters; adding dilations
and the inhomogeneous Lorentz transformations one gets fifteen parameters for the full conformal group.
Copyright © Christoph Schiller November 1997–January 2007
The conformal group is locally isomorphic to SU(2,2) and to the simple group SO(4,2): these concepts are
Page 1210 explained in Appendix D. Note that all this is true only for four space-time dimensions; in two dimensions
– the other important case, especially in string theory – the conformal group is isomorphic to the group of
arbitrary analytic coordinate transformations, and is thus infinite-dimensional.
** The conformal group does not appear only in the kinematics of special relativity: it is the symmetry
group of all physical interactions, such as electromagnetism, provided that all the particles involved have
zero mass, as is the case for the photon. A field that has mass cannot be conformally invariant; therefore
conformal invariance is not an exact symmetry of all of nature. Can you confirm that a mass term mφ 2 in a
Challenge 678 ny Lagrangian is not conformally invariant?
However, since all particles observed up to now have masses that are many orders of magnitude smaller
than the Planck mass, it can be said that they have almost vanishing mass; conformal symmetry can then
be seen as an approximate symmetry of nature. In this view, all massive particles should be seen as small
corrections, or perturbations, of massless, i.e. conformally invariant, fields. Therefore, for the construction
of a fundamental theory, conformally invariant Lagrangians are often assumed to provide a good starting
accelerating observers 337
Accelerating observers
So far, we have only studied what inertial, or free-flying, observers say to each other when
they talk about the same observation. For example, we saw that moving clocks always
run slow. The story gets even more interesting when one or both of the observers are
accelerating.
One sometimes hears that special relativity cannot be used to describe accelerating
observers. That is wrong, just as it is wrong to say that Galilean physics cannot be used for
accelerating observers. Special relativity’s only limitation is that it cannot be used in non-
flat, i.e. curved, space-time. Accelerating bodies do exist in flat space-times, and therefore
they can be discussed in special relativity.
As an appetizer, let us see what an acceler-
ating, Greek, observer says about the clock
Ref. 310 of an inertial, Roman, one, and vice versa. observer (Greek) v
Assume that the Greek observer, shown in
∆τ dτ »
= = 1 − v 2 ~c 2 =
1
, (170)
∆t dt γv
Challenge 679 ny a formula we are now used to. We find again that moving clocks run slow.
Ref. 310 For accelerated motions, the differential version of the above reasoning is necessary.
The Roman/Greek clock rate ratio is again dτ~dt, and τ and τ + dτ are calculated in the
same way from the times t and t + dt. Assume again that the Greek observer moves along
the path x(t), as measured by the Roman one. We find directly that Copyright © Christoph Schiller November 1997–January 2007
τ = t − x(t)v(t)~c 2 (171)
and thus
τ + dτ = (t + dt) − [x(t) − dtv(t)][v(t) + dta(t)]~c 2 . (172)
approximation.
* These sets form what mathematicians call hypersurfaces.
338 ii special rel ativity • 5. speed, rest and light
This shows that accelerated clocks can run fast or slow, depending on their position x and
the sign of their acceleration a. There are quotes in the above equation because we can
see directly that the Greek observer notes
‘dt~dτ’ = γv , (174)
which is not the inverse of equation (173). This difference becomes most apparent in the
simple case of two clocks with the same velocity, one of which has a constant acceleration
д towards the origin, whereas the other moves inertially. We then have
and
‘dt~dτ’ = 1 . (176)
This relation shows that accelerations are not Lorentz invariant, unless the velocities are
small compared to the speed of light. This is in contrast to our everyday experience, where
accelerations are independent of the speed of the observer.
Expression (177) simplifies if the accelerations are measured at a time t at which ω
vanishes – i.e. if they are measured by the so-called comoving inertial observer. In that
case the acceleration relation is given by
and the acceleration a c = α is also called proper acceleration, as its value describes what
the Greek, comoving observer feels: proper acceleration describes the experience of being
pushed into the back of the accelerating seat.
In general, the observer’s speed and the acceleration are not parallel. We can calculate
Ref. 311 how the value of 3-acceleration a measured by a general inertial observer is related to the
value ac measured by the comoving observer using expressions (151) and (149). We get
the generalization of (178):
vac = vaγv3 (179)
accelerating observers 339
and
(1 − γv )(vac )v γv (vac )v
a= ac − .
1
2
− (180)
γv v2 c2
(ac v)2
a2 = a
1 2
− (181)
γv4 c c2
Page 329 which we know already in a slightly different form. It shows (again) that the comoving
or proper 3-acceleration is always larger than the 3-acceleration measured by an outside
inertial observer. The faster the outside inertial observer is moving, the smaller the accele-
Challenge 680 e ration he observes. Acceleration is not a relativistic invariant. The expression also shows
that whenever the speed is perpendicular to the acceleration, a boost yields a factor γv2 ,
ever, a more general one. There are other, non-inertial, situations where this is still the
case.
Non-inertial frames, or accelerating frames, are a useful concept in special relativity. In
fact, we all live in such a frame. We can use special relativity to describe it in the same
way that we used Galilean physics to describe it at the beginning of our journey.
A general frame of reference is a continuous set of observers remaining at rest with
respect to each other. Here, ‘at rest with respect to each other’ means that the time for a
light signal to go from one observer to another and back again is constant over time, or
equivalently, that the rod distance between the two observers is constant. Any frame of
reference can therefore also be called a rigid collection of observers. We therefore note
that a general frame of reference is not the same as a set of coordinates; the latter is usu-
340 ii special rel ativity • 5. speed, rest and light
τ
t
n
II
izo
ξ
r
ho
re
tu
fu
Ω
O
III c2/g x
I
pa
st
ho
IV
rizo
n
F I G U R E 184 The hyperbolic motion of an
rectilinearly, uniformly accelerating observer Ω
B ë B = −д 2 (182)
Copyright © Christoph Schiller November 1997–January 2007
Ref. 314 where д is a constant independent of t. The simplest case is uniformly accelerating motion
that is also rectilinear, i.e. for which the acceleration a is parallel to v at one instant of time
Challenge 682 ny and (therefore) for all other times as well. In this case we can write, using 3-vectors,
Ref. 312 * There are essentially only two other types of rigid coordinate frames, apart from the inertial frames:
γ3a = g =g.
dγv
or (183)
dt
Taking the direction we are talking about to be the x-axis, and solving for v(t), we get
v=¼
дt
, (184)
д2 t2
1 + c2
where it was assumed that v(0) = 0. We note that for small times we get v = дt and for
large times v = c, both as expected. The momentum of the accelerated observer increases
Challenge 683 ny linearly with time, again as expected. Integrating, we find that the accelerated observer
moves along the path ¾
c2
x(t) =
д2 t 2
1+ 2 , (185)
д c
for the relationship between proper time τ and the time t and position x measured by
the external, inertial Roman observer. We will encounter this relation again during our
study of black holes.
Does all this sound boring? Just imagine accelerating on a motorbike at д = 10 m~s2
for the proper time τ of 25 years. That would bring you beyond the end of the known
universe! Isn’t that worth a try? Unfortunately, neither motorbikes nor missiles that ac-
Challenge 684 n celerate like this exist, as their fuel tanks would have to be enormous. Can you confirm
this?
Copyright © Christoph Schiller November 1997–January 2007
Ref. 316 * Use your favourite mathematical formula collection – every student should have one – to deduce this. The
hyperbolic sine and the»hyperbolic cosine are defined by sinh y = (e»
y
− e−y )~2 and cosh y = (e y + e−y )~2.
They imply that ∫ dy~ y + a = arsinh y~a = Arsh y~a = ln(y + y 2 + a 2 ).
2 2
342 ii special rel ativity • 5. speed, rest and light
t = ( + ) sinh
c ξ дτ
д c c
c2
x = ( + ξ) cosh
дτ
д c
y=υ
z=ζ, (187)
where τ now is the time coordinate in the Greek frame. We note also that the space-time
interval dσ satisfies
dσ 2 = (1 + дξ~c 2 )2 c 2 dτ 2 − dξ 2 − dυ 2 − dζ 2 = c 2 dt 2 − dx 2 − dy 2 − dz 2 , (188)
which, surprisingly enough, is constant in time! In other words, the Greek observer will
observe that he stays at a constant distance from the Roman one, in complete contrast to
what the Roman observer says. Take your time to check this strange result in some other
way. We will need it again later on, to explain why the Earth does not explode. (Can you
Challenge 686 n guess how that is related to this result?)
The composition theorem for accelerations is more complex than for velocities. The
Ref. 319 best explanation of this was published by Mishra. If we call a nm the acceleration of sys- Copyright © Christoph Schiller November 1997–January 2007
y
Observer 1
x
a22 : proper acceleration
v22 = 0
v0n : object speed seen by observer n
Observer 2
x a0n : object acceleration
Object seen by observer n
(191)
Challenge 688 ny and you might enjoy checking the expression.
Page 319 Can you state how the acceleration ratio enters into the definition of mass in special
Challenge 689 ny relativity?
Event horizons
There are many surprising properties of accelerated motion. Of special interest is the
trajectory, in the coordinates ξ and τ of the rigidly accelerated frame, of an object located
Challenge 690 ny at the departure point x = x 0 = c 2 ~д at all times t. One gets the two relations*
c2
ξ=− (1 − sech )
дτ
д c
dξ~dτ = −c sech
дτ дτ
Copyright © Christoph Schiller November 1997–January 2007
tanh . (193)
c c
These equations are strange. For large times τ the coordinate ξ approaches the limit value
−c 2 ~д and dξ~dτ approaches zero. The situation is similar to that of a car accelerating
away from a woman standing on a long road. Seen from the car, the woman moves away;
however, after a while, the only thing one notices is that she is slowly approaching the
* The functions appearing above, the hyperbolic secant and the hyperbolic tangent, are defined using the
expressions from the footnote on page 341:
1 sinh y
sech y = and tanh y = . (192)
cosh y cosh y
344 ii special rel ativity • 5. speed, rest and light
τ
t
n
II
izo
ξ
r
ho
re
tu
fu
Ω
O
III c2/g x
I
pa
st
ho
IV
rizo
n
F I G U R E 186 Hyperbolic motion and event
horizons
Challenge 693 n the shape of the horizon seen by a uniformly accelerated observer?
д0 x 2 2 2
dσ 2 = 1 + c dt (194)
c2
accelerating observers 345
where h is the rod distance between the source and the receiver, and where дs = д0 ~(1 +
д0 x s ~c 2 ) and дr = д0 ~(1 + дo x r ~c 2 ) are the proper accelerations measured at the source
and at the detector. In short, the frequency of light decreases when light moves in the
direction of acceleration. By the way, does this have an effect on the colour of trees along
Challenge 695 n their vertical extension?
The formula usually given, namely
=1− 2 ,
fr дh
(196)
fs c
v light = c (1 + )
дh
(197)
c2
which is higher than c for light moving in front of or ‘above’ him, and lower than c for
light moving behind or ‘below’ him. This strange result follows from a basic property of
any accelerating frame of reference. In such a frame, even though all observers are at rest
with respect to each other, clocks do not remain synchronized. This change of the speed
of light has also been confirmed by experiment.* Thus, the speed of light is only constant
when it is defined as c = dx~dt, and if dx and dt are measured with a ruler located at
a point inside the interval dx and a clock read off during the interval dt. If the speed
of light is defined as ∆x~∆t, or if the ruler defining distances or the clock measuring
Copyright © Christoph Schiller November 1997–January 2007
times is located away from the propagating light, the speed of light is different from c for
accelerating observers! This is the same effect you can experience when you turn around
your vertcial axis at night: the star velocities you observe are much higher than the speed
of light.
Note that this result does not imply that signals or energy can be moved faster than c.
Challenge 697 n You may want to check this for yourself.
In fact, all these effects are negligible for distances l that are much less than c 2 ~a. For
an acceleration of 9.5 m~s2 (about that of free fall), distances would have to be of the order
Page 419 * The propagation delays to be discussed in the chapter on general relativity can be seen as confirmations of
this effect.
346 ii special rel ativity • 5. speed, rest and light
time
clock 1 clock 2
t3
t2
t1
space
of one light year, or 9.5 ë 1012 km, in order for any sizable effects to be observed. In short,
c is the speed of light relative to nearby matter only.
By the way, everyday gravity is equivalent to a constant acceleration. So, why then do
Challenge 698 n distant objects, such as stars, not move faster than light, following expression (197)?
t 3 − t 1 = 2 (t 2 − t 1 ) . (198)
Here it is assumed that the clocks have been synchronised according to the prescription
on page 307. If the factor were not exactly two, the speed of light would not be constant.
In fact, all experiments so far have yielded a factor of two, within measurement errors.*
* The subtleties of the one-way and two-way speed of light will remain a point of discussion for a long time.
Many experiments are explained and discussed in Ref. 267. Zhang says in his summary on page 171, that the
one-way velocity of light is indeed independent of the light source; however, no experiment really shows that
Ref. 321 it is equal to the two-way velocity. Moreover, most so called ‘one-way’ experiments are in fact still ‘two-way’
experiments (see his page 150).
accelerating observers 347
This result is sometimes expressed by saying that it is impossible to measure the one-
Challenge 699 n way velocity of light; only the two-way velocity of light is measurable. Do you agree?
l a < c2 , (200)
where c is the speed of sound, which is the speed limit for the material parts of solids. Let
Ref. 322 us now repeat the argument in relativity, using the speed of light instead of that of sound.
Imagine accelerating the front of a solid body with some proper acceleration a. The back
end cannot move with an acceleration α equal or larger than infinity, or if one prefers, it
Challenge 700 n cannot move with more than the speed of light. A quick check shows that therefore the
length l of a solid body must obey
l α < c 2 ~2 , (201)
where c is now the speed of light. The speed of light thus limits the size of solid bodies. For
example, for 9.8 m~s2 , the acceleration of good motorbike, this expression gives a length
limit of 9.2 Pm, about a light year. Not a big restriction: most motorbikes are shorter.
However, there are other, more interesting situations. The highest accelerations achiev-
able today are produced in particle accelerators. Atomic nuclei have a size of a few femo-
Copyright © Christoph Schiller November 1997–January 2007
Challenge 701 ny tometres. Can you deduce at which energies they break when smashed together in an
accelerator? In fact, inside a nucleus, the nucleons move with accelerations of the order
of v 2 ~r ħ 2 ~m 2 r 3 1031 m~s2 ; this is one of the highest values found in nature.
Note that Galilean physics and relativity produce a similar conclusion: a limiting speed,
be it that of sound or that of light, makes it impossible for solid bodies to be rigid. When
we push one end of a body, the other end always moves a little bit later.
A puzzle: does the speed limit imply a relativistic ‘indeterminacy relation’
∆l ∆a D c 2 (202)
* The (longitudinal) speed of sound is about 5.9 km~s for glass, iron or steel; about 4.5 km~s for gold; and
about 2 km~s for lead. Other sound speeds are given on page 208.
348 ii special rel ativity • 5. speed, rest and light
4πε 0 c 2 d 2 m
l< . (203)
e2
The nearer electrons can get, the smaller they must be. The present experimental limit
gives a size smaller than 10−19 m. Can electrons be exactly point-like? We will come back
to this question during our study of general relativity and quantum theory.
of course! Just look at what happens when the value of c is changed in formulae.’ (In fact,
there have even been attempts to build ‘variable speed of light theories’.) However, this
often-heard statement is wrong.
Since the speed of light enters into our definition of time and space, it thus enters,
even if we do not notice it, into the construction of all rulers, all measurement standards
and all measuring instruments. Therefore there is no way to detect whether the value
actually varies. No imaginable experiment could detect a variation of the limit speed, as
Challenge 703 n the limit speed is the basis for all measurements. ‘That is intellectual cruelty!’, you might
say. ‘All experiments show that the speed of light is invariant; we had to swallow one
counter-intuitive result after another to accept the constancy of the speed of light, and
now we are supposed to admit that there is no other choice?’ Yes, we are. That is the irony
of progress in physics. The observer-invariance of the speed of light is counter-intuitive
special rel ativity in four sentences 349
Biblio graphy
246 Aristotle, On sense and the sensible, section 1, part 1, 350 bce. Cited in Jean-Paul
Dumont, Les écoles présocratiques, Folio Essais, Gallimard, p. 157, 1991. Cited on page 284.
247 The history of the measurement of the speed of light can be found in chapter 19 of the text
by Francis A. Jenkins & Harvey E. White, Fundamentals of Optics, McGraw-Hill,
New York, 1957. Cited on page 284.
248 On the way to perform such measurements, see Sydney G. Brewer, Do-it-yourself As-
tronomy, Edinburgh University Press, 1988. Kepler himself never measured the distances of
planets to the Sun, but only ratios of planetary distances. The parallax of the Sun from two
points of the Earth is at most 8.79 ′′ ; it was first measured in the eighteenth century. Cited
on page 286.
249 Aristarchos, On the sizes and the distances of the Sun and the Moon, c. 280 bce, in
Michael J. Crowe, Theories of the World From Antiquity to the Copernican Revolution,
Some of these experiments are not completely watertight, however. There is a competing
theory of electrodynamics, due to Ritz, which maintains that the speed of light is c only
when measured with respect to the source; The light from stars, however, passes through
the atmosphere, and its speed might thus be reduced to c.
The famous experiment with light emitted from rapid pions at CERN is not subject to this
criticism. It is described in T. Alväger, J.M. Bailey, F.J.M. Farley, J. Kjellman &
I. Wallin, Test of the second postulate of relativity in the GeV region, Physics Letters 12,
pp. 260–262, 1964. See also T. Alväger & al., Velocity of high-energy gamma rays, Arkiv
för Fysik 31, pp. 145–157, 1965.
Another precise experiment at extreme speeds is described by G.R. Kalbfleisch, N.
Baggett, E.C. Fowler & J. Alspector, Experimental comparison of neutrino, anti-
neutrino, and muon velocities, Physical Review Letters 43, pp. 1361–1364, 1979. Cited on
page 288.
bibliography 351
254 See e.g. C. Will, Theory and Experiment in Gravitational Physics, Revised edition, Cam-
bridge University Press, 1993. Cited on pages 289 and 290.
255 B.E. Schaefer, Severe limits on variations of the speed of light with frequency, Physical
Review Letters 82, pp. 4964–4966, 21 June 1999. Cited on page 289.
256 The beginning of the modern theory of relativity is the famous paper by Albert Ein-
stein, Zur Elektrodynamik bewegter Körper, Annalen der Physik 17, pp. 891–921, 1905. It
still well worth reading, and every physicist should have done so. The same can be said of the
famous paper, probably written after he heard of Olinto De Pretto’s idea, found in Albert
Einstein, Ist die Trägheit eines Körpers von seinem Energieinhalt abhängig?, Annalen der
Physik 18, pp. 639–641, 1905. See also the review Albert Einstein, Über das Relativitäts-
prinzip und die aus demselben gezogenen Folgerungen, Jahrbuch der Radioaktivität und
Elektronik 4, pp. 411–462, 1907. These papers are now available in many languages. A later,
unpublished review is available in facsimile and with an English translation as Albert
Einstein, Hanoch Gutfreund, ed., Einstein’s 1912 Manuscript on the Theory of Relativity,
George Braziller, 2004. Cited on pages 289, 290, and 326.
by the star matter. The most common estimate for the Sun is an escape time of 40 000 to 1
million years, but estimates between 17 000 years and 50 million years can be found in the
literature. Cited on page 291.
264 L. Vestergaard Hau, S.E. Harris, Z. Du tton & C.H. Behroozi, Light speed
reduction to 17 meters per second in an ultracold atomic gas, Nature 397, pp. 594–598, 1999.
See also Ref. 26. Cited on page 291.
265 The method of explaining special relativity by drawing a few lines on paper is due to Her-
mann B ondi, Relativity and Common Sense: A New Approach to Einstein, Dover, New
York, 1980. See also Dierck-Ekkehard Liebscher, Relativitätstheorie mit Zirkel und
Lineal, Akademie-Verlag Berlin, 1991. Cited on page 291.
266 Rod S. L akes, Experimental limits on the photon mass and cosmic vector potential, Phys-
ical Review Letters 80, pp. 1826–1829, 1998. The speed of light is independent of frequency
352 ii special rel ativity
within a factor of 6ë10−21 , as was shown from gamma ray studies by B.E. Schaefer, Severe
limits on variations of the speed of light with frequency, Physical Review Letters 82, pp. 4964–
4966, 1999. Cited on page 292.
267 An overview of experimental results is given in Yuan Zhong Zhang, Special Relativity
and its Experimental Foundations, World Scientific, 1998. Cited on pages 292, 298, 307, 320,
346, and 353.
268 R.W. McGowan & D.M. Giltner, New measurement of the relativistic Doppler shift
in neon, Physical Review Letters 70, pp. 251–254, 1993. Cited on page 294.
269 The present record for clock synchronization seems to be 1 ps for two clocks distant 3 km
from each other. See A. Valencia, G. Scarcelli & Y. Shih, Distant clock synchron-
ization using entangled photon pairs, Applied Physics Letters 85, pp. 2655–2657, 2004, or
http://arxiv.org/abs/quant-ph/0407204. Cited on page 295.
270 J. Frenkel & T. Kontorowa, Über die Theorie der plastischen Verformung, Physikali-
sche Zeitschrift der Sowietunion 13, p. 1, 1938. F.C. Frank, On the equations of motion of
bury Neutrino Observatory, Physical Review Letters 89, p. 011301, 2002, also confirm that
neutrinos have a mass in the 1 eV region. Cited on pages 297 and 1276.
274 B. Rothenstein & G. Eckstein, Lorentz transformations directly from the speed of
light, American Journal of Physics 63, p. 1150, 1995. See also the comment by E. Kapuścik,
Comment on “Lorentz transformations directly from the speed of light,” by B. Rothenstein
and G. Eckstein, American Journal of Physics 65, p. 1210, 1997. Cited on page 298.
275 See e.g. the 1922 lectures by Lorentz at Caltech, published as H.A. Lorentz, Problems of
Modern Physics, edited by H. Bateman, Ginn and Company, page 99, 1927. Cited on page
298.
276 A.A. Michelson & E.W. Morley, On the relative motion of the Earth and the lumini-
ferous ether, American Journal of Science (3rd series) 34, pp. 333–345, 1887. Michelson pub-
lished many other papers on the topic after this one. Cited on page 298.
bibliography 353
277 S. Stephan, P. Antonini & M. Okhapkin, A precision test of the isotropy of the speed
of light using rotating cryogenic resonators, http://arxiv.org/abs/physics/0510169. Cited on
page 298.
278 H.A. Lorentz, De relative beweging van de aarde en dem aether, Amst. Versl. 1, p. 74,
1892, and also H.A. Lorentz, Electromagnetic phenomena in a system moving with any
velocity smaller than that of light, Amst. Proc. 6, p. 809, 1904, or Amst. Versl. 12, p. 986, 1904.
Cited on page 302.
279 A general refutation of such proposals is discussed by S.R. Mainwaring & G.E. Sted-
man, Accelerated clock principles, Physical Review A 47, pp. 3611–3619, 1993. Experiments
on muons at CERN in 1968 showed that accelerations of up to 1020 m~s2 have no effect, as
explained by D.H. Perkins, Introduction to High Energy Physics, Addison-Wesley, 1972,
or by J. Bailey & al., Il Nuovo Cimento 9A, p. 369, 1972. Cited on page 302.
280 W. R indler, General relativity before special relativity: an unconventional overview of
relativity theory, American Journal of Physics 62, pp. 887–893, 1994. Cited on page 302.
mesotrons with momentum, Physical Review 59, pp. 223–228, 1941. ‘Mesotron’ was the old
name for muon. Cited on page 307.
288 A. Harvey & E. Schucking, A small puzzle from 1905, Physics Today, pp. 34–36, March
2005. Cited on page 307.
289 W. R indler, Length contraction paradox, American Journal of Physics 29, pp. 365–366,
1961. For a variation without gravity, see R. Shaw, Length contraction paradox, American
Journal of Physics 30, p. 72, 1962. Cited on page 309.
290 H. van Lintel & C. Gruber, The rod and hole paradox re-examined, European Journal
of Physics 26, pp. 19–23, 2005. Cited on page 309.
291 This situation is discussed by G.P. Sastry, Is length contraction paradoxical?, American
Journal of Physics 55, 1987, pp. 943–946. This paper also contains an extensive literature list
covering variants of length contraction paradoxes. Cited on page 309.
354 ii special rel ativity
292 S.P. B oughn, The case of the identically accelerated twins, American Journal of Physics 57,
pp. 791–793, 1989. Cited on pages 310 and 313.
293 J.M. Supplee, Relativistic buoyancy, American Journal of Physics 57 1, pp. 75–77, January
1989. See also G.E.A. Matsas, Relativistic Arquimedes law for fast moving bodies and the
general-relativistic resolution of the ‘submarine paradox’, Physical Review D 68, p. 027701,
2003, or http://arxiv.org/abs/gr-qc/0305106. Cited on page 310.
294 The distinction was first published by J. Terrell, Invisibility of Lorentz contraction, Phys-
ical Review 116, pp. 1041–1045, 1959, and R. Penrose, The apparent shape of a relativistic-
ally moving sphere, Proceedings of the Cambridge Philosophical Society 55, pp. 137–139, 1959.
Cited on page 312.
295 G.R. Rybicki, Speed limit on walking, American Journal of Physics 59, pp. 368–369, 1991.
Cited on page 314.
296 The first examples of such astronomical observations were provided by A.R. Whitney &
al., Quasars revisited: rapid time variations observed via very-long-baseline interferometry,
Science 173, pp. 225–230, 1971, and by M.H. Cohen & al., The small-scale structure of
299 See for example S.S. Costa & G.E.A. Matsas, Temperature and relativity, preprint avail-
able at http://arxiv.org/abs/gr-qc/9505045. Cited on page 318.
300 R.C. Tolman & G.N. Lewis, The principle of relativity and non-Newtonian mechanics,
Philosophical Magazine 18, pp. 510–523, 1909, and R.C. Tolman, Non-Newtonian mech-
anics: the mass of a moving body, Philosophical Magazine 23, pp. 375–380, 1912. Cited on
page 319.
301 S. R ainville, J.K. Thompson, E.G. Myers, J.M. Brown, M.S. Dewey, E.G.
Kessler, R.D. Desl attes, H.G. B örner, M. Jentschel, P. Mu tti & D.E.
Pritchard, World year of physics: a direct test of E = mc 2 , Nature 438, pp. 1096–1097,
2005. Cited on page 322.
302 This information is due to a private communication by Frank DiFilippo; part of the story
is given in F. DiFilippo, V. Natarajan, K.R. B oyce & D.E. Pritchard, Accurate
bibliography 355
atomic masses for fundamental metrology, Physical Review Letters 73, pp. 1481–1484, 1994.
These measurements were performed with Penning traps; a review of the possibilities they
offer is given by R.C. Thompson, Precision measurement aspects of ion traps, Measure-
ment Science and Technology 1, pp. 93–105, 1990. The most important experimenters in the
field of single particle levitation were awarded the Nobel Prize in 1989. One of the Nobel
Prize lectures can be found in W. Paul, Electromagnetic traps for neutral and charged
particles, Reviews of Modern Physics 62, pp. 531–540, 1990. Cited on page 323.
303 J.L. Synge, Relativity: The Special Theory, North-Holland, 1956, pp. 208–213. More about
antiparticles in special relativity can be found in J.P. Costell a, B.H.J. McKell ar &
A.A. R awlinson, Classical antiparticles, American Journal of Physics 65, pp. 835–841,
1997. See also Ref. 318. Cited on page 323.
304 A. Papapetrou, Drehimpuls- und Schwerpunktsatz in der relativistischen Mechanik,
Praktika Acad. Athenes 14, p. 540, 1939, and A. Papapetrou, Drehimpuls- und Schwer-
punktsatz in der Diracschen Theorie, Praktika Acad. Athenes 15, p. 404, 1940. See also
M.H.L. Pryce, The mass-centre in the restricted theory of relativity and its connexion
312 The impossibility of defining rigid coordinate frames for non-uniformly accelerating observ-
ers is discussed by Charles Misner, Kip Thorne & John A. Wheeler, Gravitation,
Freeman, p. 168, 1973. Cited on page 340.
313 E.A. Desloge & R.J. Philpott, Uniformly accelerated reference frames in special re-
lativity, American Journal of Physics 55, pp. 252–261, 1987. Cited on page 340.
314 R.H. Good, Uniformly accelerated reference frame and twin paradox, American Journal
of Physics 50, pp. 232–238, 1982. Cited on pages 340, 341, and 344.
315 J. Dwayne Hamilton, The uniformly accelerated reference frame, American Journal of
Physics 46, pp. 83–89, 1978. Cited on page 341.
316 The best and cheapest mathematical formula collection remains the one by K. Rottmann,
Mathematische Formelsammlung, BI Hochschultaschenbücher, 1960. Cited on page 341.
317 C.G. Adler & R.W. Brehme, Relativistic solutions to a falling body in a uniform gravit-
ation field, American Journal of Physics 59, pp. 209–213, 1991. Cited on page 342.
318 See for example the excellent lecture notes by D.J. R aymond, A radically modern ap-
G
eneral relativity is easy. Nowadays, it can be made as intuitive as universal
ravity and its inverse square law – by using the right approach. The main ideas of
eneral relativity, like those of special relativity, are accessible to secondary-school
students. Black holes, gravitational waves, space-time curvature and the limits of the uni-
c4
FD = 3.0 ë 1043 N . (204)
4G
358 iii gravitation and rel ativity • 6. maximum force
F I G U R E 188 Effects of gravity: a stalactite (© Richard Cindric) and the rings of Saturn, photographed
when the Sun is hidden (CICLOPS, JPL, ESA, NASA)
c5
PD = 9.1 ë 1051 W . (205)
4G
No power of any lamp, engine or explosion can exceed this value. The maximum power
is realized when a (Schwarzschild) black hole is radiated away in the time that light takes
to travel along a length corresponding to its diameter. We will see below precisely what
black holes are and why they are connected to these limits.
The existence of a maximum force or power implies the full theory of general relativ-
ity. In order to prove the correctness and usefulness of this approach, a sequence of argu-
ments is required. The sequence is the same as for the establishment of the limit speed in
special relativity. First of all, we have to gather all observational evidence for the claimed
Copyright © Christoph Schiller November 1997–January 2007
limit. Secondly, in order to establish the limit as a principle of nature, we have to show
that general relativity follows from it. Finally, we have to show that the limit applies in all
possible and imaginable situations. Any apparent paradoxes will need to be resolved.
These three steps structure this introduction to general relativity. We start the story by
explaining the origin of the idea of a limiting value.
Page 376 situations, the inverse square law of universal gravitation. When the factor is properly
taken into account, the maximum force (or power) is simply given by the (corrected)
Planck energy divided by the (corrected) Planck length or Planck time.
The expression for the maximum force involves the speed of light c and the gravita-
tional constant G; it thus qualifies as a statement on relativistic gravitation. The funda-
mental principle of special relativity states that speed v obeys v D c for all observers. Ana-
logously, the basic principle of general relativity states that in all cases force F and power
* Observers in general relativity, like in special relativity, are massive physical systems that are small enough
so that their influence on the system under observation is negligible.
Page 722 ** When Planck discovered the quantum of action, he had also noticed the possibility to define natural units.
On a walk with his seven-year-old son in the forest around Berlin, he told him that he had made a discovery
as important as the discovery of universal gravity.
360 iii gravitation and rel ativity • 6. maximum force
P obey F D c 4 ~4G and P D c 5 ~4G. It does not matter whether the observer measures the
force or power while moving with high velocity relative to the system under observation,
during free fall, or while being strongly accelerated. However, we will see that it is essen-
tial that the observer records values measured at his own location and that the observer
is realistic, i.e., made of matter and not separated from the system by a horizon. These
conditions are the same that must be obeyed by observers measuring velocity in special
relativity.
Since physical power is force times speed, and since nature provides a speed limit,
the force bound and the power bound are equivalent. We have already seen that force
Page 331 and power appear together in the definition of 4-force; we can thus say that the upper
bound is valid for every component of a force, as well as for its magnitude. The power
bound limits the output of car and motorcycle engines, lamps, lasers, stars, gravitational
radiation sources and galaxies. It is equivalent to 1.2 ë 1049 horsepowers. The maximum
power principle states that there is no way to move or get rid of energy more quickly than
c3
D = 1.0 ë 1035 kg~s .
dm
(206)
dt 4G
This bound imposes a limit on pumps, jet engines and fast eaters. Indeed, the rate of flow
of water or any other material through tubes is limited. The mass flow limit is obviously
Copyright © Christoph Schiller November 1997–January 2007
values larger than the stated limit. Many people have claimed to have produced speeds
larger than that of light. So far, nobody has ever claimed to have produced a force larger
than the limit value.
The large accelerations that particles undergo in collisions inside the Sun, in the most
powerful accelerators or in reactions due to cosmic rays correspond to force values much
smaller than the force limit. The same is true for neutrons in neutron stars, for quarks
inside protons, and for all matter that has been observed to fall towards black holes. Fur-
thermore, the search for space-time singularities, which would allow forces to achieve or
exceed the force limit, has been fruitless.
In the astronomical domain, all forces between stars or galaxies are below the limit
value, as are the forces in their interior. Not even the interactions between any two halves
of the universe exceed the limit, whatever physically sensible division between the two
Page 374 halves is taken. (The meaning of ‘physically sensible division’ will be defined below; for
divisions that are not sensible, exceptions to the maximum force claim can be constructed.
It is necessary to show that they imply the complete theory of general relativity. (This sec-
tion is only for readers who already know the field equations of general relativity. Other
Page 366 readers may skip to the next section.)
In order to derive the theory of relativity we need to study those systems that realize
the limit under scrutiny. In the case of the special theory of relativity, the main system
that realizes the limit speed is light. For this reason, light is central to the exploration of
special relativity. In the case of general relativity, the systems that realize the limit are less
obvious. We note first that a maximum force (or power) cannot be realized throughout
a volume of space. If this were possible, a simple boost* could transform the force (or
power) to a higher value. Therefore, nature can realize maximum force and power only
on surfaces, not volumes. In addition, these surfaces must be unattainable. These unat-
Ref. 324 tainable surfaces are basic to general relativity; they are called horizons. Maximum force
Page 344 and power only appear on horizons. We have encountered horizons in special relativity,
where they were defined as surfaces that impose limits to observation. (Note the contrast
through any horizon is always finite in extension, when measured along the propagation
direction. One can thus speak more specifically of an energy pulse. Any energy pulse
through a horizon is thus characterized by an energy E and a proper length L. When the
energy pulse flows perpendicularly through a horizon, the rate of momentum change, or
force, for an observer at the horizon is
F=
E
. (207)
L
Our goal is to show that the existence of a maximum force implies general relativity. Now,
* A boost was defined in special relativity as a change of viewpoint to a second observer moving in relation
to the first.
a simple principle encompassing general rel ativity 363
maximum force is realized on horizons. We thus need to insert the maximum possible
values on both sides of equation (207) and to show that general relativity follows.
Using the maximum force value and the area 4πR 2 for a spherical horizon we get
c4
=
E
4πR 2 . (208)
4G LA
The fraction E~A is the energy per area flowing through any area A that is part of a horizon.
The insertion of the maximum values is complete when one notes that the length L of
the energy pulse is limited by the radius R. The limit L D R follows from geometrical
considerations: seen from the concave side of the horizon, the pulse must be shorter than
the radius of curvature. An independent argument is the following. The length L of an
Ref. 327 object accelerated by a is limited, by special relativity, by L D c 2 ~2a. Special relativity
already shows that this limit is related to the appearance of a horizon. Together with
c2
E= aA. (209)
8πG
This horizon equation relates the energy flow E through an area A of a spherical horizon
with surface gravity a. It states that the energy flowing through a horizon is limited, that
this energy is proportional to the area of the horizon, and that the energy flow is propor-
tional to the surface gravity. (The horizon equation is also called the first law of black hole
Ref. 328 mechanics or the first law of horizon mechanics.)
The above derivation also yields the intermediate result
c4 A
ED . (210)
16πG L
This form of the horizon equation states more clearly that no surface other than a horizon
can achieve the maximum energy flow, when the area and pulse length (or surface gravity)
are given. No other domain of physics makes comparable statements: they are intrinsic
to the theory of gravitation.
An alternative derivation of the horizon equation starts with the emphasis on power
Copyright © Christoph Schiller November 1997–January 2007
c2
δE = a δA . (211)
8πG
This differential relation – it might be called the general horizon equation – is valid for any
horizon. It can be applied separately for every piece δA of a dynamic or spatially changing
horizon. The general horizon equation (211) has been known to be equivalent to general
Ref. 329 relativity at least since 1995, when this equivalence was (implicitly) shown by Jacobson.
We will show that the differential horizon equation has the same role for general relativity
as the equation dx = c dt has for special relativity. From now on, when we speak of the
horizon equation, we mean the general, differential form (211) of the relation.
It is instructive to restate the behaviour of energy pulses of length L in a way that holds
for any surface, even one that is not a horizon. Repeating the above derivation, one gets
To generalize the general horizon equation, one introduces the general surface element
dΣ and the local boost Killing vector field k that generates the horizon (with suitable
norm). Jacobson uses these two quantities to rewrite the left-hand side of the general
horizon equation (211) as
δE = ∫T ab k
a
dΣ b , (213)
where Tab is the energy–momentum tensor. This expression obviously gives the energy
at the horizon for arbitrary coordinate systems and arbitrary energy flow directions.
Jacobson’s main result is that the factor a δA in the right hand side of the general hori-
zon equation (211) can be rewritten, making use of the (purely geometric) Raychaudhuri
a simple principle encompassing general rel ativity 365
equation, as
a δA = c 2 ∫R ab k
a
dΣ b , (214)
where R ab is the Ricci tensor describing space-time curvature. This relation describes
how the local properties of the horizon depend on the local curvature.
Combining these two steps, the general horizon equation (211) becomes
c4
∫ Tab k a dΣ b = ∫R ab k
a
dΣ b . (215)
8πG
Jacobson then shows that this equation, together with local conservation of energy (i.e.,
vanishing divergence of the energy–momentum tensor) can only be satisfied if
c4
Tab = R ab − ( + Λ)дab ,
R
where R is the Ricci scalar and Λ is a constant of integration the value of which is not
determined by the problem. The above equations are the full field equations of general
relativity, including the cosmological constant Λ. The field equations thus follow from
the horizon equation. They are therefore shown to be valid at horizons.
Since it is possible, by choosing a suitable coordinate transformation, to position a
horizon at any desired space-time point, the field equations must be valid over the whole
of space-time. This observation completes Jacobson’s argument. Since the field equations
follow, via the horizon equation, from the maximum force principle, we have also shown
that at every space-time point in nature the same maximum force holds: the value of the
maximum force is an invariant and a constant of nature.
In other words, the field equations of general relativity are a direct consequence of the
limit on energy flow at horizons, which in turn is due to the existence of a maximum
force (or power). In fact, as Jacobson showed, the argument works in both directions.
Maximum force (or power), the horizon equation, and general relativity are equivalent.
In short, the maximum force principle is a simple way to state that, on horizons, energy
flow is proportional to area and surface gravity. This connection makes it possible to de-
duce the full theory of general relativity. In particular, a maximum force value is sufficient
to tell space-time how to curve. We will explore the details of this relation shortly. Note
Copyright © Christoph Schiller November 1997–January 2007
that if no force limit existed in nature, it would be possible to ‘pump’ any desired amount
of energy through a given surface, including any horizon. In this case, the energy flow
would not be proportional to area, horizons would not have the properties they have, and
general relativity would not hold. We thus get an idea how the maximum flow of energy,
the maximum flow of momentum and the maximum flow of mass are all connected to
horizons. The connection is most obvious for black holes, where the energy, momentum
Page 486 or mass are those falling into the black hole.
By the way, since the derivation of general relativity from the maximum force principle
or from the maximum power principle is now established, we can rightly call these limits
horizon force and horizon power. Every experimental or theoretical confirmation of the
field equations indirectly confirms their existence.
366 iii gravitation and rel ativity • 6. maximum force
Space-time is curved
Imagine two observers who start moving parallel to each other and who continue straight
ahead. If after a while they discover that they are not moving parallel to each other any
Challenge 706 n more, then they can deduce that they have moved on a curved surface (try it!) or in a
curved space. In particular, this happens near a horizon. The derivation above showed
that a finite maximum force implies that all horizons are curved; the curvature of horizons
in turn implies the curvature of space-time. If nature had only flat horizons, there would
be no space-time curvature. The existence of a maximum force implies that space-time
is curved.
A horizon so strongly curved that it forms a closed boundary, like the surface of a
Page 484 sphere, is called a black hole. We will study black holes in detail below. The main property
of a black hole, like that of any horizon, is that it is impossible to detect what is ‘behind’
the boundary.*
The analogy between special and general relativity can thus be carried further. In spe-
modulus G (or µ) describes how difficult it is to move two parallel surfaces of a material
against each other. If the force F is needed to move two parallel surfaces of area A and
length l against each other by a distance ∆l, one defines the shear modulus G by
=G
F ∆l
. (217)
A l
The shear modulus for metals and alloys ranges between 25 and 80 GPa. The continuum
theory of solids shows that for any crystalline solid without any defect (a ‘perfect’ solid)
* Analogously, in special relativity it is impossible to detect what moves faster than the light barrier.
a simple principle encompassing general rel ativity 367
there is a so-called theoretical shear stress: when stresses higher than this value are ap-
plied, the material breaks. The theoretical shear stress, in other words, the maximum
stress in a material, is given by
G tss =
G
. (218)
2π
The maximum stress is thus essentially given by the shear modulus. This connection is
similar to the one we found for the vacuum. Indeed, imagining the vacuum as a material
Ref. 330 that can be bent is a helpful way to understand general relativity. We will use it regularly
in the following.
What happens when the vacuum is stressed with the maximum force? Is it also torn
apart like a solid? Yes: in fact, when vacuum is torn apart, particles appear. We will find
out more about this connection later on: since particles are quantum entities, we need
to study quantum theory first, before we can describe the effect in the last part of our
mountain ascent.
force. The reverse, the deduction of general relativity’s field equations from the maximum
power of force, is correct only under the assumption that gravity is purely geometric. This
is the essential statement of general relativity. If the mechanism of gravity would be based
on other fields, such as hitherto unknown particles, the equivalence between gravity and
a maximum force would not be given.
368 iii gravitation and rel ativity • 6. maximum force
The last, but central, step in our discussion of the force limit is the same as in the discus-
sion of the speed limit. We need to show that any imaginable experiment – not only any
real one – satisfies the hypothesis. Following a tradition dating back to the early twen-
”
tieth century, such an imagined experiment is called a Gedanken experiment, from the
German Gedankenexperiment, meaning ‘thought experiment’.
In order to dismiss all imaginable attempts to exceed the maximum speed, it is suf-
* ‘When an idea is just rising on the horizon, the soul’s temperature with respect to it is usually very cold.
Only gradually does the idea develop its warmth, and it is hottest (which is to say, exerting its greatest in-
fluence) when belief in the idea is already once again in decline.’ Friedrich Nietzsche (1844–1900), German
philosopher and scholar. This is aphorism 207 – Sonnenbahn der Idee – from his text Menschliches Allzu-
menschliches – Der Wanderer und sein Schatten.
a simple principle encompassing general rel ativity 369
ample by pulling two ends of a rope in opposite directions. We assume for simplicity that
an unbreakable rope exists. To produce a force exceeding the limit value, we need to store
large (elastic) energy in the rope. This energy must enter from the ends. When we increase
the tension in the rope to higher and higher values, more and more (elastic) energy must
be stored in smaller and smaller distances. To exceed the force limit, we would need to
add more energy per distance and area than is allowed by the horizon equation. A hori-
zon thus inevitably appears. But there is no way to stretch a rope across a horizon, even if
it is unbreakable. A horizon leads either to the breaking of the rope or to its detachment
from the pulling system. Horizons thus make it impossible to generate forces larger than
the force limit. In fact, the assumption of infinite wire strength is unnecessary: the force
limit cannot be exceeded even if the strength of the wire is only finite.
We note that it is not important whether an applied force pulls – as for ropes or wires
– or pushes. In the case of pushing two objects against each other, an attempt to increase
the force value without end will equally lead to the formation of a horizon, due to the
**
The brick attempt. The force and power limits can also be tested with more concrete
Gedanken experiments. We can try to exceed the force limit by stacking weight. But even
building an infinitely high brick tower does not generate a sufficiently strong force on its
foundations: integrating the weight, taking into account its decrease with height, yields
a finite value that cannot reach the force limit. If we continually increase the mass dens-
ity of the bricks, we need to take into account that the tower and the Earth will change
into a black hole. And black holes, as mentioned above, do not allow the force limit to be
exceeded.
**
The boost attempt. A boost can apparently be chosen in such a way that a force value F in
Ref. 332 one frame is transformed into any desired value F ′ in another frame. However, this result
is not physical. To be more concrete, imagine a massive observer, measuring the value F,
**
The divergence attempt. The force on a test mass m at a radial distance d from a Schwarz-
Ref. 326 schild black hole (for Λ = 0) is given by
F= ¼
GMm
. (220)
d 2 1 − 2G
dc
M
2
In addition, the inverse square law of universal gravitation states that the force between
two masses m and M is
F=
GMm
. (221)
d2
a simple principle encompassing general rel ativity 371
Both expressions can take any value; this suggest that no maximum force limit exists.
A detailed investigation shows that the maximum force still holds. Indeed, the force
in the two situations diverges only for non-physical point-like masses. However, the max-
imum force implies a minimum approach distance to a mass m given by
d min =
2Gm
. (222)
c2
The minimum approach distance – in simple terms, this would be the corresponding
black hole radius – makes it impossible to achieve zero distance between two masses or
between a horizon and a mass. This implies that a mass can never be point-like, and that
there is a (real) minimum approach distance, proportional to the mass. If this minimum
approach distance is introduced in equations (220) and (221), one gets
c4 c4
F= D
1
¼
Mm
and
c4 c4
F= D
Mm
4G (M + m)2 4G
. (224)
The maximum force value is thus never exceeded, as long as we take into account the size
of observers.
**
The consistency problem. If observers cannot be point-like, one might question whether
it is still correct to apply the original definition of momentum change or energy change
as the integral of values measured by observers attached to a given surface. In general
relativity, observers cannot be point-like, but they can be as small as desired. The original
definition thus remains applicable when taken as a limit procedure for ever-decreasing
observer size. Obviously, if quantum theory is taken into account, this limit procedure
comes to an end at the Planck length. This is not an issue for general relativity, as long as
the typical dimensions in the situation are much larger than this value.
Copyright © Christoph Schiller November 1997–January 2007
**
The quantum problem. If quantum effects are neglected, it is possible to construct surfaces
Challenge 708 ny with sharp angles or even fractal shapes that overcome the force limit. However, such
surfaces are not physical, as they assume that lengths smaller than the Planck length can
be realized or measured. The condition that a surface be physical implies that it must have
Ref. 324 an intrinsic uncertainty given by the Planck length. A detailed study shows that quantum
effects do not allow the horizon force to be exceeded.
**
The relativistically extreme observer attempt. Any extreme observer, whether in rapid
inertial or in accelerated motion, has no chance to beat the limit. In classical physics
372 iii gravitation and rel ativity • 6. maximum force
we are used to thinking that the interaction necessary for a measurement can be made
as small as desired. This statement, however, is not valid for all observers; in particular,
extreme observers cannot fulfil it. For them, the measurement interaction is large. As a
result, a horizon forms that prevents the limit from being exceeded.
**
The microscopic attempt. We can attempt to exceed the force limit by accelerating a small
particle as strongly as possible or by colliding it with other particles. High forces do in-
deed appear when two high energy particles are smashed against each other. However,
if the combined energy of the two particles became high enough to challenge the force
limit, a horizon would appear before they could get sufficiently close.
In fact, quantum theory gives exactly the same result. Quantum theory by itself already
Ref. 333 provides a limit to acceleration. For a particle of mass m it is given by
**
Copyright © Christoph Schiller November 1997–January 2007
The force addition attempt. In special relativity, composing velocities by a simple vector
addition is not possible. Similarly, in the case of forces such a naive sum is incorrect; any
attempt to add forces in this way would generate a horizon. If textbooks on relativity had
explored the behaviour of force vectors under addition with the same care with which
they explored that of velocity vectors, the force bound would have appeared much earlier
in the literature. (Obviously, general relativity is required for a proper treatment.)
**
Challenge 709 r Can you propose and resolve another attempt to exceed the force or power limit?
a simple principle encompassing general rel ativity 373
6000 m
mountain
nuclei
surface A
Gedanken experiments with the power limit and the mass flow
limit
Like the force bound, the power bound must be valid for all imaginable systems. Here are
some attempts to refute it.
**
The cable-car attempt. Imagine an engine that accelerates a mass with an unbreakable
and massless wire (assuming that such a wire could exist). As soon as the engine reached
the power bound, either the engine or the exhausts would reach the horizon equation.
When a horizon appears, the engine cannot continue to pull the wire, as a wire, even
an infinitely strong one, cannot pass a horizon. The power limit thus holds whether the
engine is mounted inside the accelerating body or outside, at the end of the wire pulling Copyright © Christoph Schiller November 1997–January 2007
it.
**
The mountain attempt. It is possible to define a surface that is so strangely bent that
it passes just below every nucleus of every atom of a mountain, like the surface A in
Figure 189. All atoms of the mountain above sea level are then just above the surface,
barely touching it. In addition, imagine that this surface is moving upwards with almost
the speed of light. It is not difficult to show that the mass flow through this surface is
higher than the mass flow limit. Indeed, the mass flow limit c 3 ~4G has a value of about
1035 kg~s. In a time of 10−22 s, the diameter of a nucleus divided by the speed of light, only
1013 kg need to flow through the surface: that is the mass of a mountain.
This surface seems to provide a counter-example to the limit. However, a closer look
374 iii gravitation and rel ativity • 6. maximum force
shows that this is not the case. The problem is the expression ‘just below’. Nuclei are
quantum particles and have an indeterminacy in their position; this indeterminacy is
essentially the nucleus–nucleus distance. As a result, in order to be sure that the surface
of interest has all atoms above it, the shape cannot be that of surface A in Figure 189. It
must be a flat plane that remains below the whole mountain, like surface B in the figure.
However, a flat surface beneath a mountain does not allow the mass change limit to be
exceeded.
**
The multiple atom attempt. One can imagine a number of atoms equal to the number of
the atoms of a mountain that all lie with large spacing (roughly) in a single plane. Again,
the plane is moving upwards with the speed of light. But also in this case the uncertainty
in the atomic positions makes it impossible to say that the mass flow limit has been ex-
ceeded.
**
The luminosity attempt. The existence of a maximum luminosity bound has been dis-
Ref. 326 cussed by astrophysicists. In its full generality, the maximum bound on power, i.e. on
energy per time, is valid for any energy flow through any physical surface whatsoever. The
physical surface may even run across the whole universe. However, not even bringing
together all lamps, all stars and all galaxies of the universe yields a surface which has a
larger power output than the proposed limit.
The surface must be physical.* A surface is physical if an observer can be placed on each
of its points. In particular, a physical surface may not cross a horizon, or have local detail
Page 1025 finer than a certain minimum length. This minimum length will be introduced later on;
it is given by the corrected Planck length. If a surface is not physical, it may provide a
Challenge 710 n counter-example to the power or force limits. However, these counter-examples make no
statements about nature. (Ex falso quodlibet.*)
**
The many lamp attempt, or power paradox. An absolute power limit imposes a limit on the
rate of energy transport through any imaginable surface. At first sight, it may seem that
the combined power emitted by two radiation sources that each emit 3/4 of the maximum
value should give 3/2 times that value. However, two such lamps would be so massive that
they would form a black hole. No amount of radiation that exceeds the limit can leave.
Again, since the horizon limit (212) is achieved, a horizon appears that swallows the light
and prevents the force or power limit from being exceeded.
**
**
The black hole attempt. One possible system in nature that actually achieves the power
limit is the final stage of black hole evaporation. However, even in this case the power
limit is not exceeded, but only equalled.
**
The water flow attempt. One could try to pump water as rapidly as possible through a
large tube of cross-section A. However, when a tube of length L filled with water flowing
at speed v gets near to the mass flow limit, the gravity of the water waiting to be pumped
through the area A will slow down the water that is being pumped through the area. The
limit is again reached when the cross-section A turns into a horizon. Copyright © Christoph Schiller November 1997–January 2007
Checking that no system – from microscopic to astrophysical – ever exceeds the max-
imum power or maximum mass flow is a further test of general relativity. It may seem easy
to find a counter-example, as the surface may run across the whole universe or envelop
any number of elementary particle reactions. However, no such attempt succeeds.
In summary, in all situations where the force, power or mass-flow limit is challenged,
whenever the energy flow reaches the black hole mass–energy density in space or the
corresponding momentum flow in time, an event horizon appears; this horizon makes it
impossible to exceed the limits. All three limits are confirmed both in observation and
in theory. Values exceeding the limits can neither be generated nor measured. Gedanken
experiments also show that the three bounds are the tightest ones possible. Obviously, all
three limits are open to future tests and to further Gedanken experiments. (If you can
Challenge 711 r think of a good one, let the author know.)
The concepts of horizon force and horizon power can be used as the basis for a direct,
intuitive approach to general relativity.
”
**
What is gravity? Of the many possible answers we will encounter, we now have the first:
gravity is the ‘shadow’ of the maximum force. Whenever we experience gravity as weak,
we can remember that a different observer at the same point and time would experience
the maximum force. Searching for the precise properties of that observer is a good exer-
cise. Another way to put it: if there were no maximum force, gravity would not exist. Copyright © Christoph Schiller November 1997–January 2007
**
The maximum force implies universal gravity. To see this, we study a simple planetary
system, i.e., one with small velocities and small forces. A simple planetary system of size
L consists of a (small) satellite circling a central mass M at a radial distance R = L~2.
º implies the 2condition aL c ,
2
Let a be the acceleration of the object. Small velocity
deduced from special relativity; small force implies 4GMa c , deduced from the
force limit. These conditions are valid for the system as a whole and for all its components.
Both expressions have the dimensions of speed squared. Since the system has only one
* ‘We all live under the same sky, but we do not have the same horizon.’ Konrad Adenauer (1876–1967),
West German chancellor.
a simple principle encompassing general rel ativity 377
º
characteristic speed, the two expressions aL = 2aR and 4GMa must be proportional,
yielding
a= f 2 ,
GM
(226)
R
where the numerical factor f must still be determined. To determine it, we study the
escape velocity necessary to leave the central body. The escape velocity must be smaller
than the speed of light for any body larger than a black hole. The escape velocity, derived
from expression (226), from a body of mass M and radius R is given by v esc 2
= 2 f GM~R.
The minimum radius R of objects, given by R = 2GM~c , then implies that f = 1. There-
2
fore, for low speeds and low forces, the inverse square law describes the orbit of a satellite
around a central mass.
**
**
If curvature and energy are linked, the maximum speed must also hold for gravitational
energy. Indeed, we will find that gravity has a finite speed of propagation. The inverse
square law of everyday life cannot be correct, as it is inconsistent with any speed limit.
More about the corrections induced by the maximum speed will become clear shortly.
In addition, since gravitational waves are waves of massless energy, we would expect the
Page 411 maximum speed to be their propagation speed. This is indeed the case, as we will see.
**
A body cannot be denser than a (non-rotating) black hole of the same mass. The max-
imum force and power limits that apply to horizons make it impossible to squeeze mass
into smaller horizons. The maximum force limit can therefore be rewritten as a limit for
the size L of physical systems of mass m: Copyright © Christoph Schiller November 1997–January 2007
LE
4Gm
. (227)
c2
If we call twice the radius of a black hole its ‘size’, we can state that no physical system
of mass m is smaller than this value.* The » size limit plays an important role in general
relativity. The opposite inequality, m E A~16π c 2 ~G, which describes the maximum
‘size’ of black holes, is called the Penrose inequality and has been proven for many physic-
Ref. 334, Ref. 335, ally realistic situations. The Penrose inequality can be seen to imply the maximum force
Ref. 336 limit, and vice versa. The maximum force principle, or the equivalent minimum size of
* The maximum value for the mass to size limit is obviously equivalent to the maximum mass change given
above.
378 iii gravitation and rel ativity • 6. maximum force
matter–energy systems, thus prevents the formation of naked singularities, and implies
the validity of the so-called cosmic censorship.
**
There is a power limit for all energy sources. In particular, the value c 5 ~4G limits the lu-
minosity of all gravitational sources. Indeed, all formulae for gravitational wave emission
Ref. 326 imply this value as an upper limit. Furthermore, numerical relativity simulations never
exceed it: for example, the power emitted during the simulated merger of two black holes
is below the limit.
**
Perfectly plane waves do not exist in nature. Plane waves are of infinite extension. But
neither electrodynamic nor gravitational waves can be infinite, since such waves would
carry more momentum per time through a plane surface than is allowed by the force
**
In nature, there are no infinite forces. There are thus no naked singularities in nature.
Horizons prevent the appearance of naked singularities. In particular, the big bang was
not a singularity. The mathematical theorems by Penrose and Hawking that seem to imply
the existence of singularities tacitly assume the existence of point masses – often in the
form of ‘dust’ – in contrast to what general relativity implies. Careful re-evaluation of
each such proof is necessary.
**
The force limit means that space-time has a limited stability. The limit suggests that space-
time can be torn into pieces. This is indeed the case. However, the way that this happens
is not described by general relativity. We will study it in the third part of this text.
**
The maximum force is the standard of force. This implies that the gravitational constant
G is constant in space and time – or at least, that its variations across space and time
Ref. 337 cannot be detected. Present data support this claim to a high degree of precision. Copyright © Christoph Schiller November 1997–January 2007
**
The maximum force principle implies that gravitational energy – as long as it can be
defined – falls in gravitational fields in the same way as other type of energy. As a result,
Ref. 326 the maximum force principle predicts that the Nordtvedt effect vanishes. The Nordtvedt
effect is a hypothetical periodical change in the orbit of the Moon that would appear if the
gravitational energy of the Earth–Moon system did not fall, like other mass–energy, in
the gravitational field of the Sun. Lunar range measurements have confirmed the absence
of this effect.
**
If horizons are surfaces, we can ask what their colour is. This question will be explored
a simple principle encompassing general rel ativity 379
**
Page 1097 Later on we will find that quantum effects cannot be used to exceed the force or power
Challenge 712 e limit. (Can you guess why?) Quantum theory also provides a limit to motion, namely
a lower limit to action; however, this limit is independent of the force or power limit.
(A dimensional analysis already shows this: there is no way to define an action by com-
binations of c and G.) Therefore, even the combination of quantum theory and general
relativity does not help in overcoming the force or power limits.
c3
= ρ 0 4πR 02 c =
dm
, (228)
dt 4G
Ref. 338 a relation also predicted by the Friedmann models. The precision measurements of the
cosmic background radiation by the WMAP satellite confirm that the present-day total
energy density ρ 0 (including dark matter and dark energy) and the horizon radius R 0
just reach the limit value. The maximum force limit thus predicts the observed size of the
universe.
A finite power limit also suggests that a finite age for the universe can be deduced. Can
Challenge 713 ny you find an argument?
380 iii gravitation and rel ativity • 6. maximum force
c4
D
∆E
. (229)
∆x 4G
For example, a position error of 1 mm gives a mass error of below 3 ë 1023 kg. In everyday
life, all measurements comply with this relation. Indeed, the left side is so much smaller
c5
4πR 2 I D . (230)
4G
Obviously, this statement is difficult to check experimentally, whatever the frequency
and type of wave might be, because the value appearing on the right-hand side is ex-
a simple principle encompassing general rel ativity 381
tremely large. Possibly, future experiments with gravitational wave detectors, X-ray de-
tectors, gamma ray detectors, radio receivers or particle detectors might allow us to test
relation (230) with precision. (You might want to predict which of these experiments will
Challenge 714 e confirm the limit first.)
The lack of direct experimental tests of the force and power limits implies that indirect
tests become particularly important. All such tests study the motion of matter or energy
and compare it with a famous consequence of the force and power limits: the field equa-
tions of general relativity. This will be our next topic.
Challenge 716 ny maximum and a minimum force in nature. (Can you find a smaller force?)
Proving the minimum force conjecture is more involved than for the case of the max-
imum force. So far, only some hints are possible. Like the maximum force, the minimum
force must be compatible with gravitation, must not be contradicted by any experiment,
and must withstand any Gedanken experiment. A quick check shows that the minimum
force, as we have just argued, allows us to deduce gravitation, is an invariant, and is not
contradicted by any experiment. There are also hints that there may be no way to generate
or measure a smaller value. For example, the minimum force corresponds to the energy
per length contained by a photon with a wavelength of the size of the universe. It is hard
– but maybe not impossible – to imagine the production of a still smaller force.
We have seen that the maximum force principle and general relativity fail to fix the
382 iii gravitation and rel ativity • 6. maximum force
value of the cosmological constant. Only a unified theory can do so. We thus get two re-
quirements for such a theory. First, any unified theory must predict the same upper limit
to force. Secondly, a unified theory must fix the cosmological constant. The appearance of
ħ in the conjectured expression for the minimum force suggests that the minimum force
is determined by a combination of general relativity and quantum theory. The proof of
this suggestion and the direct measurement of the minimum force are two important
challenges for our ascent beyond general relativity.
We are now ready to explore the consequences of general relativity and its field equa-
tions in more detail. We start by focusing on the concept of space-time curvature in every-
day life, and in particular, on its consequences for the observation of motion.
Acknowledgement
The author thanks Steve Carlip, Corrado Massa, Tom Helmond, Gary Gibbons, Heinrich
Neumaier and Peter Brown for interesting discussions on these topics.
Biblio graphy
323 It seems that the first published statement of the principle was in the year 2000 edition
of this text, in the chapter on gravitation and relativity. The present author discovered the
maximum force principle in 1998, when searching for a way to derive the results of chapter
XI that would be so simple that it would convince even a secondary-school student. The
reference is Christoph Schiller, Motion Mountain – The Adventure of Physics, found
at http://www.motionmountain.net. The idea of a maximum force was also proposed by
Gary Gibbons in 2002 (see reference below). Nowadays Gary Gibbons is more cautious than
the author about whether the maximum force can be seen as an actual physical principle
(despite the title of his paper). The approach of a maximum force was discussed in various
usenet discussion groups in the early twenty-first century. These discussion showed that the
idea of a maximum force (and a maximum power) were known to some people, but that
nobody before Gibbons and the author had put it in writing. Also this discovery was thus
made much too late. In short, only the idea to raise maximum force or power to a principle
335 G. Huisken & T. Ilmanen, The Riemannian Penrose inequality, International Mathem-
atics Research Notices 59, pp. 1045–1058, 1997. Cited on page 377.
336 S.A. Hay ward, Inequalities relating area, energy, surface gravity and charge of black holes,
Physical Review Letters 81, pp. 4557–4559, 1998. Cited on page 377.
337 C. Will, Was Einstein Right? – Putting General Relativity to the Test, Oxford University
Press, 1993. See also his paper http://arxiv.org/abs/gr-qc/9811036. Cited on page 378.
338 The measurement results by the WMAP satellite are summarized on the website http://map.
gsfc.nasa.gov/m_mm.html; the papers are available at http://lambda.gsfc.nasa.gov/product/
map/current/map_bibliography.cfm. Cited on page 379.
“
Gravitational influences do transport energy.** Our description of motion must therefore
Horace*
”
be precise enough to imply that this transport can happen at most with the speed of light.
Henri Poincaré stated this requirement as long ago as 1905. The results following from this
principle will be fascinating: we will find that empty space can move, that the universe has
a finite age and that objects can be in permanent free fall. It will turn out that empty space
can be bent, although it is much stiffer than steel. Despite these strange consequences, the
theory and all its predictions have been confirmed by all experiments.
The theory of universal gravitation, which describes motion due to gravity using the
relation a = GM~r 2 , allows speeds higher than that of light. Indeed, the speed of a mass
he encountered the first substantial layers of air. That was when his free fall started to be
disturbed. Later, after four and a half minutes of fall, his special parachute opened; and
nine minutes later he landed in New Mexico.
Kittinger and all other observers in free fall, such as the cosmonauts circling the Earth
or the passengers in parabolic aeroplane flights,*** make the same observation: it is im-
possible to distinguish anything happening in free fall from what would happen at rest.
This impossibility is called the principle of equivalence; it is one of the starting points of
general relativity. It leads to the most precise – and final – definition of rest: rest is free
fall. Rest is lack of disturbance; so is free fall.
The set of all free-falling observers at a point in space-time generalizes the special-
relativistic notion of the set of the inertial observers at a point. This means that we must
describe motion in such a way that not only inertial but also freely falling observers can
talk to each other. In addition, a full description of motion must be able to describe gravit-
ation and the motion it produces, and it must be able to describe motion for any observer
imaginable. General relativity realizes this aim.
As a first step, we put the result in simple words: true motion is the opposite of free fall.
This statement immediately rises a number of questions: Most trees or mountains are not
Challenge 718 n in free fall, thus they are not at rest. What motion are they undergoing? And if free fall is
rest, what is weight? And what then is gravity anyway? Let us start with the last question.
* Gravity is also the uneven length of metre bars at different places, as we will see below. Both effects are
needed to describe it completely; but for daily life on Earth, the clock effect is sufficient, since it is much
Challenge 719 n larger than the length effect, which can usually be neglected. Can you see why?
the new ideas on space, time and gravity 387
= 2 .
∆ f д∆h
(232)
f c
The sign of the frequency change depends on whether the light motion and the train
acceleration are in the same or in opposite directions. For actual trains or buses, the
Challenge 723 n frequency change is quite small; nevertheless, it is measurable. Acceleration induces fre-
quency changes in light. Let us compare this effect of acceleration with the effects of grav-
ity.
To measure time and space, we use light. What happens to light when gravity is in-
Ref. 343 volved? The simplest experiment is to let light fall or rise. In order to deduce what must
happen, we add a few details. Imagine a conveyor belt carrying masses around two wheels,
a low and a high one, as shown in Figure 191. The descending, grey masses are slightly lar-
ger. Whenever such a larger mass is near the bottom, some mechanism – not shown in the
figure – converts the mass surplus to light, in accordance with the equation E = mc 2 , and
by measuring the red-shift of light emitted by the Sun, using the famous Fraunhofer lines
Page 772 as colour markers. The results of the first experiments, by Schwarzschild and others, were
unclear or even negative, due to a number of other effects that induce colour changes at
* The expression v = дt is valid only for non-relativistic speeds; nevertheless, the conclusion of this section
Challenge 722 e is not affected by this approximation.
** As in special relativity, here and in the rest of our mountain ascent, the term ‘mass’ always refers to rest
mass.
Challenge 724 n *** Can this process be performed with 100% efficiency?
**** The precise relation between energy and frequency of light is described and explained in our discussion
on quantum theory, on page 737. But we know already from classical electrodynamics that the energy of light
depends on its intensity and on its frequency.
Challenge 725 ny ***** How does this argument change if you include the illumination by the Sun?
388 iii gravitation and rel ativity • 7. the new ideas
high temperatures. But in 1920 and 1921, Grebe and Bachem, and independently Perot,
confirmed the gravitational red-shift with careful experiments. In later years, technolo-
gical advances made the measurements much easier, until it was even possible to measure
the effect on Earth. In 1960, in a classic experiment using the Mössbauer effect, Pound and
Ref. 345 Rebka confirmed the gravitational red-shift in their university tower using γ radiation.
But our two thought experiments tell us much more. Let
us use the same arguments as in the case of special relativity:
a colour change implies that clocks run differently at differ- m
ent heights, just as they run differently in the front and in
the back of a train. The time difference ∆τ is predicted to
depend on the height difference ∆h and the acceleration of
m+E/c2
gravity д according to
= = 2 .
∆τ ∆ f д∆h h
(233)
Do these experiments show that time changes or are they simply due to clocks that
Challenge 730 e function badly? Take some time and try to settle this question. We will give one argument
only: gravity does change the colour of light, and thus really does change time. Clock
precision is not an issue here.
In summary, gravity is indeed the uneven running of clocks at different heights. Note
that an observer at the lower position and another observer at the higher position agree
on the result: both find that the upper clock goes faster. In other words, when gravity is
present, space-time is not described by the Minkowski geometry of special relativity, but
by some more general geometry. To put it mathematically, whenever gravity is present,
the 4-distance ds 2 between events is different from the expression without gravity:
ds 2 x c 2 dt 2 − dx 2 − dy 2 − dz 2 . (234)
the new ideas on space, time and gravity 389
Gravity is the presence of tidal effects. The absence of tidal effects implies the absence of
gravity. Tidal effects are the everyday consequence of height-dependent time. Isn’t this a
beautiful conclusion?
In principle, Kittinger could have felt gravitation during his free fall, even with his eyes
closed, had he paid attention to himself. Had he measured the distance change between
his two hands, he would have found a tiny decrease which could have told him that he
was falling. This tiny decrease would have forced Kittinger to a strange conclusion. Two
inertially moving hands should move along two parallel lines, always keeping the same
distance. Since the distance changes, he must conclude that in the space around him lines
starting out in parallel do not remain so. Kittinger would have concluded that the space
around him was similar to the surface of the Earth, where two lines starting out north,
parallel to each other, also change distance, until they meet at the North Pole. In other
words, Kittinger would have concluded that he was in a curved space.
By studying the change in distance between his hands, Kittinger could even have con-
On the 7th of November 1919, Albert Einstein became world-famous. On that day, an
article in the Times newspaper in London announced the results of a double expedition
to South America under the heading ‘Revolution in science / new theory of the universe
”
/ Newtonian ideas overthrown’. The expedition had shown unequivocally – though not
for the first time – that the theory of universal gravity, essentially given by a = GM~r 2 ,
was wrong, and that instead space was curved. A worldwide mania started. Einstein was
Copyright © Christoph Schiller November 1997–January 2007
presented as the greatest of all geniuses. ‘Space warped’ was the most common headline.
Einstein’s papers on general relativity were reprinted in full in popular magazines. People
could read the field equations of general relativity, in tensor form and with Greek indices,
in Time magazine. Nothing like this has happened to any other physicist before or since.
What was the reason for this excitement?
The expedition to the southern hemisphere had performed an experiment proposed
Ref. 350 by Einstein himself. Apart from seeking to verify the change of time with height, Einstein
had also thought about a number of experiments to detect the curvature of space. In the
one that eventually made him famous, Einstein proposed to take a picture of the stars
* ‘When an insect walks over the surface of a sphere it probably does not notice that the path it walks is
curved. I, on the other hand, had the luck to notice it.’
the new ideas on space, time and gravity 391
near the Sun, as is possible during a solar eclipse, and compare it with a picture of the
same stars at night, when the Sun is far away. Einstein predicted a change in position of
1.75′ (1.75 seconds of arc) for star images at the border of the Sun, a value twice as large
Page 135 as that predicted by universal gravity. The prediction, corresponding to about 1~40 mm
on the photographs, was confirmed in 1919, and thus universal gravity was ruled out.
Does this result imply that space is curved? Not by itself. In fact, other explanations
could be given for the result of the eclipse experiment, such as a potential differing from
the inverse square form. However, the eclipse results are not the only data. We already
know about the change of time with height. Experiments show that two observers at dif-
ferent heights measure the same value for the speed of light c near themselves. But these
experiments also show that if an observer measures the speed of light at the position of
the other observer, he gets a value differing from c, since his clock runs differently. There
is only one possible solution to this dilemma: metre bars, like clocks, also change with
height, and in such a way as to yield the same speed of light everywhere.
image image
of star
position
star of star
Sun
Sun
Mercury Earth
Earth
Copyright © Christoph Schiller November 1997–January 2007
F I G U R E 193 The mattress model of space: the path of a light beam and of a satellite near a spherical
mass
But beware: the right-hand figure, although found in many textbooks, can be
Ref. 351 misleading. It can easily be mistaken for a reproduction of a potential around a body.
Indeed, it is impossible to draw a graph showing curvature and potential separately.
Challenge 736 n (Why?) We will see that for small curvatures, it is even possible to explain the change in
metre bar length using a potential only. Thus the figure does not really cheat, at least in
the case of weak gravity. But for large and changing values of gravity, a potential cannot
be defined, and thus there is indeed no way to avoid using curved space to describe grav-
ity. In summary, if we imagine space as a sort of generalized mattress in which masses
392 iii gravitation and rel ativity • 7. the new ideas
Curved space-time
Figure 193 shows the curvature of space only, but in fact space-time is curved. We will
shortly find out how to describe both the shape of space and the shape of space-time, and
how to measure their curvature.
Let us have a first attempt to describe nature with the idea of curved space-time. In the
case of Figure 193, the best description of events is with the use of the time t shown by a
clock located at spatial infinity; that avoids problems with the uneven running of clocks at
different distances from the central mass. For the radial coordinate r, the most practical
choice to avoid problems with the curvature of space is to use the circumference of a circle
around the central body, divided by 2π. The curved shape of space-time is best described
Page 303 by the behaviour of the space-time distance ds, or by the wristwatch time dτ = ds~c,
between two neighbouring points with coordinates (t, r) and (t + dt, r + dr). As we saw
Page 389 above, gravity means that in spherical coordinates we have
ds 2
dτ 2 = x dt 2 − dr 2 ~c 2 − r 2 dφ 2 ~c 2 . (235)
c2
Copyright © Christoph Schiller November 1997–January 2007
The inequality expresses the fact that space-time is curved. Indeed, the experiments on
time change with height confirm that the space-time interval around a spherical mass is
given by
ds 2 dr 2 r2 2
dτ 2 = 2 = 1 −
2GM 2
dt − − dφ . (236)
c rc 2 c 2 − 2GrM c 2
This expression is called the Schwarzschild metric after one of its discoverers.* The metric
(236) describes the curved shape of space-time around a spherical non-rotating mass.
* Karl Schwarzschild (1873–1916), important German astronomer; he was one of the first people to un-
derstand general relativity. He published his formula in December 1915, only a few months after Einstein
the new ideas on space, time and gravity 393
Challenge 738 n It is well approximated by the Earth or the Sun. (Why can their rotation be neglected?)
Expression (236) also shows that gravity’s strength around a body of mass M and radius
R is measured by a dimensionless number h defined as
h=
2G M
. (237)
c2 R
This ratio expresses the gravitational strain with which lengths and the vacuum are de-
formed from the flat situation of special relativity, and thus also determines how much
clocks slow down when gravity is present. (The ratio also reveals how far one is from any
possible horizon.) On the surface of the Earth the ratio h has the small value of 1.4 ë 10−9 ;
on the surface of the Sun is has the somewhat larger value of 4.2 ë 10−6 . The precision
of modern clocks allows one to detect such small effects quite easily. The various con-
sequences and uses of the deformation of space-time will be discussed shortly.
We note that if a mass is highly concentrated, in particular when its radius becomes
RS =
2GM
, (238)
c2
the Schwarzschild metric behaves strangely: at that location, time disappears (note that
t is time at infinity). At the Schwarzschild radius, the wristwatch time (as shown by a
clock at infinity) stops – and a horizon appears. What happens precisely will be explored
Page 487 below. This situation is not common: the Schwarzschild radius for a mass like the Earth is
8.8 mm, and for the Sun is 3.0 km; you might want to check that the object size for every
Challenge 739 e system in everyday life is larger than its Schwarzschild radius. Bodies which reach this
Ref. 353 limit are called black holes; we will study them in detail shortly. In fact, general relativity
states that no system in nature is smaller than its Schwarzschild size, in other words that
the ratio h defined by expression (237) is never above unity.
In summary, the results mentioned so far make it clear that mass generates curvature.
The mass–energy equivalence we know from special relativity then tells us that as a con-
sequence, space should also be curved by the presence of any type of energy–momentum.
Every type of energy curves space-time. For example, light should also curve space-time.
However, even the highest-energy beams we can create correspond to extremely small
masses, and thus to unmeasurably small curvatures. Even heat curves space-time; but in
Copyright © Christoph Schiller November 1997–January 2007
most systems, heat is only about a fraction of 10−12 of total mass; its curvature effect is
thus unmeasurable and negligible. Nevertheless it is still possible to show experimentally
that energy curves space. In almost all atoms a sizeable fraction of the mass is due to the
electrostatic energy among the positively charged protons. In 1968 Kreuzer confirmed
Ref. 354 that energy curves space with a clever experiment using a floating mass.
It is straightforward to imagine that the uneven running of clock is the temporal equi-
Challenge 740 ny valent of spatial curvature. Taking the two together, we conclude that when gravity is
present, space-time is curved.
had published his field equations. He died prematurely, at the age of 42, much to Einstein’s distress. We
will deduce the form of the metric later on, directly from the field equations of general relativity. The other
Ref. 352 discoverer of the metric, unknown to Einstein, was the Dutch physicist J. Droste.
394 iii gravitation and rel ativity • 7. the new ideas
Let us sum up our chain of thoughts. Energy is equivalent to mass; mass produces grav-
ity; gravity is equivalent to acceleration; acceleration is position-dependent time. Since
light speed is constant, we deduce that energy–momentum tells space-time to curve. This
statement is the first half of general relativity.
We will soon find out how to measure curvature, how to calculate it from energy–
momentum and what is found when measurement and calculation are compared. We
will also find out that different observers measure different curvature values. The set of
transformations relating one viewpoint to another in general relativity, the diffeomorph-
ism symmetry, will tell us how to relate the measurements of different observers.
Since matter moves, we can say even more. Not only is space-time curved near masses,
it also bends back when a mass has passed by. In other words, general relativity states that
Ref. 355 space, as well as space-time, is elastic. However, it is rather stiff: quite a lot stiffer than steel.
To curve a piece of space by 1 % requires an energy density enormously larger than to
Challenge 741 ny curve a simple train rail by 1 %. This and other interesting consequences of the elasticity
“
We continue on the way towards precision in our understanding of gravitation. All our
theoretical and empirical knowledge about gravity can be summed up in just two general
”
statements. The first principle states:
vDc (239)
The theory following from this first principle, special relativity, is extended to general
relativity by adding a second principle, characterizing gravitation. There are several equi-
valent ways to state this principle. Here is one. Copyright © Christoph Schiller November 1997–January 2007
c4
FD , (240)
4G
where G is the universal constant of gravitation.
E 2 .
L 4G
(241)
M c
In other words, a massive system cannot be more concentrated than a non-rotating black
hole of the same mass. Another way to express the principle of gravitation is the following:
c5
PD . (242)
4G
Why does a stone thrown into the air fall back to Earth? –
Geodesics
A genius is somebody who makes all possible
”
* This didactic approach is unconventional. It is possible that is has been pioneered by the present author.
Ref. 356 The British physicist Gary Gibbons also developed it independently. Earlier references are not known.
** Or it would be, were it not for a small deviation called quantum theory.
396 iii gravitation and rel ativity • 7. the new ideas
In our discussion of special relativity, we saw that inertial or free-floating motion is the
Page 334 motion which connecting two events that requires the longest proper time. In the absence
of gravity, the motion fulfilling this requirement is straight (rectilinear) motion. On the
Page 56 other hand, we are also used to thinking of light rays as being straight. Indeed, we are all
accustomed to check the straightness of an edge by looking along it. Whenever we draw
the axes of a physical coordinate system, we imagine either drawing paths of light rays or
drawing the motion of freely moving bodies.
In the absence of gravity, object paths and light paths coincide. However, in the pres-
ence of gravity, objects do not move along light paths, as every thrown stone shows. Light
does not define spatial straightness any more. In the presence of gravity, both light and
matter paths are bent, though by different amounts. But the original statement remains
valid: even when gravity is present, bodies follow paths of longest possible proper time.
For matter, such paths are called timelike geodesics. For light, such paths are called lightlike
or null geodesics.
c2
r= 9.2 ë 1015 m . (243)
д
the new ideas on space, time and gravity 397
height
c · time
slow, steep throw
h
d
throw distance
The large value of the radius, corresponding to a low curvature, explains why we do not
notice it in everyday life. The parabolic shape typical of the path of a stone in everyday
S = −mc 2
B
∫ dτ .
Copyright © Christoph Schiller November 1997–January 2007
(244)
A
That is all we need to know about the free fall of objects. As a consequence, any deviation
from free fall keeps you young. The larger the deviation, the younger you stay.
Page 506 As we will see below, the minimum action description of free fall has been tested ex-
Ref. 357 tremely precisely, and no difference from experiment has ever been observed. We will
also find out that for free fall, the predictions of general relativity and of universal grav-
ity differ substantially both for particles near the speed of light and for central bodies of
high density. So far, all experiments have shown that whenever the two predictions dif-
fer, general relativity is right, and universal gravity and other alternative descriptions are
wrong.
All bodies fall along geodesics. This tells us something important. The fall of bodies
398 iii gravitation and rel ativity • 7. the new ideas
does not depend on their mass. The geodesics are like ‘rails’ in space-time that tell bod-
ies how to fall. In other words, space-time can indeed be imagined as a single, giant, de-
formed entity. Space-time is not ‘nothing’; it is an entity of our thinking. The shape of this
entity tells objects how to move. Space-time is thus indeed like an intangible mattress; this
deformed mattress guides falling objects along its networks of geodesics.
Moreover, bound energy falls in the same way as mass, as is proven by comparing the
fall of objects made of different materials. They have different percentages of bound en-
Challenge 748 n ergy. (Why?) For example, on the Moon, where there is no air, cosmonauts dropped steel
balls and feathers and found that they fell together, alongside each other. The independ-
Ref. 358 ence on material composition has been checked and confirmed over and over again.
Challenge 750 n
Take a plastic bottle and make some holes in it near the bottom. Fill the bottle with water,
closing the holes with your fingers. If you let the bottle go, no water will leave the bottle
during the fall. Can you explain how this experiment confirms the equivalence of rest
” Copyright © Christoph Schiller November 1997–January 2007
* ‘If you do not take the answer too seriously and regard it only for amusement, I can explain it to you in the
following way: in the past it was thought that if all things were to disappear from the world, space and time
would remain. But following relativity theory, space and time would disappear together with the things.’
400 iii gravitation and rel ativity • 7. the new ideas
**
Gravity has the same properties in the whole universe – except in the US patent office. In
2005, it awarded a patent, Nr. 6 960 975, for an antigravity device that works by distorting
space-time in such a way that gravity is ‘compensated’ (see http://patft.uspto.gov). Do you
Challenge 752 n know a simpler device?
**
The radius of curvature of space-time at the Earth’s surface is 9.2 ë 1015 m. Can you confirm
Challenge 753 e this value?
**
Challenge 754 ny A piece of wood floats on water. Does it stick out more or less in a lift accelerating up-
**
Page 313 We saw in special relativity that if two twins are identically accelerated in the same dir-
ection, with one twin some distance ahead of the other, then the twin ahead ages more
than the twin behind. Does this happen in a gravitational field as well? And what happens
Challenge 755 ny when the field varies with height, as on Earth?
**
A maximum force and a maximum power also imply a maximum flow of mass. Can you
Challenge 756 ny show that no mass flow can exceed 1.1 ë 1035 kg~s?
**
The experiments of Figure 190 and 191 differ in one point: one happens in flat space, the
other in curved space. One seems to be connected with energy conservation, the other
Challenge 757 ny not. Do these differences invalidate the equivalence of the observations?
**
Challenge 758 n How can cosmonauts weigh themselves to check whether they are eating enough?
Copyright © Christoph Schiller November 1997–January 2007
**
Is a cosmonaut in orbit really floating freely? No. It turns out that space stations and satel-
lites are accelerated by several small effects. The important ones are the pressure of the
light from the Sun, the friction of the thin air, and the effects of solar wind. (Micrometeor-
ites can usually be neglected.) These three effects all lead to accelerations of the order of
10−6 m~s2 to 10−8 m~s2 , depending on the height of the orbit. Can you estimate how long
it would take an apple floating in space to hit the wall of a space station, starting from the
Challenge 759 n middle? By the way, what is the magnitude of the tidal accelerations in this situation?
**
Page 81 There is no negative mass in nature, as discussed in the beginning of our walk (even
antimatter has positive mass). This means that gravitation cannot be shielded, in contrast
the new ideas on space, time and gravity 401
**
Can two hydrogen atoms circle each other, in their mutual gravitational field? What
Challenge 763 n would the size of this ‘molecule’ be?
**
Challenge 764 n Can two light pulses circle each other, in their mutual gravitational field?
**
The various motions of the Earth mentioned in the section on Galilean physics, such as
Page 87 its rotation around its axis or around the Sun, lead to various types of time in physics
and astronomy. The time defined by the best atomic clocks is called terrestrial dynamical
402 iii gravitation and rel ativity • 7. the new ideas
time. By inserting leap seconds every now and then to compensate for the bad definition
Page 1177 of the second (an Earth rotation does not take 86 400, but 86 400.002 seconds) and, in
minor ways, for the slowing of Earth’s rotation, one gets the universal time coordinate or
UTC. Then there is the time derived from this one by taking into account all leap seconds.
One then has the – different – time which would be shown by a non-rotating clock in
the centre of the Earth. Finally, there is barycentric dynamical time, which is the time that
Ref. 361 would be shown by a clock in the centre of mass of the solar system. Only using this latter
time can satellites be reliably steered through the solar system. In summary, relativity says
goodbye to Greenwich Mean Time, as does British law, in one of the rare cases were the
law follows science. (Only the BBC continues to use it.)
**
Space agencies thus have to use general relativity if they want to get artificial satellites to
Mars, Venus, or comets. Without its use, orbits would not be calculated correctly, and
dτ 2 2GM r 2 dφ 2 2GM v 2
=1− − ( ) = 1 − − 2 . (246)
dt rc 2 c 2 dt rc 2 c
* To give an idea of what this means, the unparametrized post-Newtonian formalism, based on general
relativity, writes the equation of motion of a body of mass m near a large mass M as a deviation from the
Copyright © Christoph Schiller November 1997–January 2007
GM GM v 2 GM v 4 Gm v 5
a= 2
+ f2 2 2 + f4 2 4 + f5 2 5 + ë ë ë (245)
r r c r c r c
Here the numerical factors f n are calculated from general relativity and are of order one. The first two odd
terms are missing because of the (approximate) reversibility of general relativistic motion: gravity wave
emission, which is irreversible, accounts for the small term f 5 ; note that it contains the small mass m instead
of the large mass M. All factors f n up to f 7 have now been calculated. However, in the solar system, only the
term f 2 has ever been detected. This situation might change with future high-precision satellite experiments.
Page 416 Higher-order effects, up to f 5 , have been measured in the binary pulsars, as discussed below.
In a parametrized post-Newtonian formalism, all factors f n , including the uneven ones, are fitted through
the data coming in; so far all these fits agree with the values predicted by general relativity.
** For more information, see the http://www.gpsworld.com website.
the new ideas on space, time and gravity 403
Challenge 765 e For the relation between satellite time and Earth time we then get
2
v sat
dt sat 2 1 − r2G M
=
2 − c2
sat c
2 . (247)
dt Earth 2G M
1 − r Earth −
v Earth
c2 c2
Can you deduce how many microseconds a satellite clock gains every day, given that the
Challenge 766 n GPS satellites orbit the Earth once every twelve hours? Since only three microseconds
would give a position error of one kilometre after a single day, the clocks in the satellites
Ref. 363 must be adjusted to run slow by the calculated amount. The necessary adjustments are
monitored, and so far have confirmed general relativity every single day, within experi-
mental errors, since the system began operation.
**
**
Could our experience that we live in only three spatial dimensions be due to a limitation
Challenge 768 n of our senses? How?
**
Challenge 769 ny Can you estimate the effect of the tides on the colour of the light emitted by an atom?
**
The strongest possible gravitational field is that of a small black hole. The strongest grav-
Ref. 365 itational field ever observed is somewhat less though. In 1998, Zhang and Lamb used the
X-ray data from a double star system to determine that space-time near the 10 km sized
neutron star is curved by up to 30 % of the maximum possible value. What is the corres-
ponding gravitational acceleration, assuming that the neutron star has the same mass as
Challenge 770 ny the Sun?
** Copyright © Christoph Schiller November 1997–January 2007
Ref. 366 Light deflection changes the angular size δ of a mass M with radius r when observed at
Challenge 771 e distance d. The effect leads to the pretty expression
»
r 1 − R S ~d
δ = arcsin( » ) RS =
2GM
where . (248)
d 1 − R S ~r c2
Challenge 772 ny What percentage of the surface of the Sun can an observer at infinity see? We will examine
Page 494 this issue in more detail shortly.
404 iii gravitation and rel ativity • 7. the new ideas
What is weight?
There is no way for a single (and point-like) observer to distinguish the effects of gravity
from those of acceleration. This property of nature allows one to make a strange state-
ment: things fall because the surface of the Earth accelerates towards them. Therefore,
the weight of an object results from the surface of the Earth accelerating upwards and
pushing against the object. That is the principle of equivalence applied to everyday life.
For the same reason, objects in free fall have no weight.
Let us check the numbers. Obviously, an accelerating surface of the Earth produces a
weight for each body resting on it. This weight is proportional to the inertial mass. In other
words, the inertial mass of a body is identical to the gravitational mass. This is indeed
Ref. 367 observed in experiments, and to the highest precision achievable. Roland von Eőtvős*
performed many such high-precision experiments throughout his life, without finding
any discrepancy. In these experiments, he used the fact that the inertial mass determines
centrifugal effects and the gravitational mass determines free fall. (Can you imagine how
“
Copyright © Christoph Schiller November 1997–January 2007
Vergilius**
An accelerating car will soon catch up with an object thrown forward from it. For the
same reason, the surface of the Earth soon catches up with a stone thrown upwards, be-
cause it is continually accelerating upwards. If you enjoy this way of seeing things, ima-
”
gine an apple falling from a tree. At the moment when it detaches, it stops being acceler-
ated upwards by the branch. The apple can now enjoy the calmness of real rest. Because
* Roland von Eőtvős (b. 1848 Budapest, d. 1919 Budapest), Hungarian physicist. He performed many high-
precision gravity experiments; among other discoveries, he discovered the effect named after him. The uni-
versity of Budapest is named after him.
** ‘Going it acquires strength.’ Publius Vergilius Maro (b. 70 bce Andes, d. 19 bce Brundisium), from the
Aeneid 4, 175.
motion in general rel ativity – bent light and wobbling vacuum 405
of our limited human perception, we call this state of rest free fall. Unfortunately, the
accelerating surface of the Earth approaches mercilessly and, depending on the time for
which the apple stayed at rest, the Earth hits it with a greater or lesser velocity, leading to
more or less severe shape deformation.
Falling apples also teach us not to be disturbed any more by the statement that gravity
is the uneven running of clocks with height. In fact, this statement is equivalent to saying
that the surface of the Earth is accelerating upwards, as the discussion above shows.
Can this reasoning be continued indefinitely? We can go on for quite a while. It is
fun to show how the Earth can be of constant radius even though its surface is acceler-
Challenge 774 ny ating upwards everywhere. We can thus play with the equivalence of acceleration and
gravity. However, this equivalence is only useful in situations involving only one acceler-
ating body. The equivalence between acceleration and gravity ends as soon as two falling
objects are studied. Any study of several bodies inevitably leads to the conclusion that
gravity is not acceleration; gravity is curved space-time.
Before we tackle the details of general relativity, we will explore how the motion of objects ”
and light differs from that predicted by universal gravity, and how these differences can
be measured.
Weak fields
Copyright © Christoph Schiller November 1997–January 2007
Gravity is strong near horizons. This happens when the mass M and the distance scale R
obey
1.
2GM
(249)
Rc 2
Therefore, gravity is strong mainly in three situations: near black holes, near the horizon
of the universe, and at extremely high particle energies. The first two cases are explored
later on, while the last will be explored in the third part of our mountain ascent. In con-
trast, in most parts of the universe there are no nearby horizons; in these cases, gravity
is a weak effect. Despite the violence of avalanches or of falling asteroids, in everyday life
gravity is much weaker than the maximum force. On the Earth the ratio just mentioned
406 iii gravitation and rel ativity • 8. motion of light and vacuum
THIRRING EFFECT
universal gravity prediction relativistic prediction
Moon a
m
Earth M
THIRRING-LENSE EFFECT
universal gravity prediction relativistic prediction
Foucault's pendulum
Earth
Earth
universe or mass shell
is only about 10−9 . In this and all cases of everyday life, gravitation can still be approx-
imated by a field, despite what was said above. These weak field situations are interesting
because they are simple to understand; they mainly require for their explanation the dif-
ferent running of clocks at different heights. Weak field situations allow us to mention
space-time curvature only in passing, and allow us to continue to think of gravity as a
source of acceleration. However, the change of time with height already induces many
new and interesting effects. The only thing we need is a consistent relativistic treatment.
Ref. 368 in universal gravity, but they both appear in general relativity. Figure 196 shows these
predictions.
In the first example, nowadays called the Thirring effect, centrifugal accelerations as
well as Coriolis accelerations for masses in the interior of a rotating mass shell are pre-
dicted. Thirring showed that if an enclosing mass shell rotates, masses inside it are attrac-
ted towards the shell. The effect is very small; however, this prediction is in stark contrast
to that of universal gravity, where a spherical mass shell – rotating or not – has no effect
on masses in its interior. Can you explain this effect using the figure and the mattress
Challenge 775 ny analogy?
The second effect, the Thirring–Lense effect,* is more famous. General relativity pre-
* Even though the order of the authors is Lense and Thirring, it is customary (but not universal) to stress
weak fields 407
dicts that an oscillating Foucault pendulum, or a satellite circling the Earth in a polar
orbit, does not stay precisely in a fixed plane relative to the rest of the universe, but that
the rotation of the Earth drags the plane along a tiny bit. This frame-dragging, as the ef-
fect is also called, appears because the Earth in vacuum behaves like a rotating ball in a
foamy mattress. When a ball or a shell rotates inside the foam, it partly drags the foam
along with it. Similarly, the Earth drags some vacuum with it, and thus turns the plane
of the pendulum. For the same reason, the Earth’s rotation turns the plane of an orbiting
satellite.
The Thirring–Lense or frame-dragging effect is ex-
tremely small. It was measured for the first time in 1998
by an Italian group led by Ignazio Ciufolini, and then
again by the same group in the years up to 2004. They
followed the motion of two special artificial satellites –
shown in Figure 197 – consisting only of a body of steel
Gravitomagnetism*
Frame-dragging and the Thirring–Lense effect can be seen as special cases of gravitomag-
netism. (We will show the connection below.) This approach to gravity, already studied
Ref. 371 in the nineteenth century by Holzmüller and by Tisserand, has become popular again in
recent years, especially for its didactic advantages. As mentioned above, talking about a
gravitational field is always an approximation. In the case of weak gravity, such as occurs
Copyright © Christoph Schiller November 1997–January 2007
in everyday life, the approximation is very good. Many relativistic effects can be described
in terms of the gravitational field, without using the concept of space curvature or the met-
ric tensor. Instead of describing the complete space-time mattress, the gravitational-field
model only describes the deviation of the mattress from the flat state, by pretending that
the deviation is a separate entity, called the gravitational field. But what is the relativistic-
ally correct way to describe the gravitational field?
In other words, the change of speed is due to electric fields E, whereas magnetic fields B
give a velocity-dependent change of the direction of velocity, without changing the speed
itself. Both changes depend on the value of the charge q. In the case of gravity this expres-
sion becomes
mẍ = mG − mẋ H . (251)
The role of charge is taken by mass. In this expression we already know the field G, given
by
G = ∇φ = ∇ =− 3 .
GM GMx
(252)
r r
As usual, the quantity φ is the (scalar) potential. The field G is the usual gravitational
field of universal gravity, produced by every mass, and in this context is called the grav-
itoelectric field; it has the dimension of an acceleration. Masses are the sources of the
gravitoelectric field. The gravitoelectric field obeys ∆G = −4πGρ, where ρ is the mass
Copyright © Christoph Schiller November 1997–January 2007
* The approximation requires low velocities, weak fields, and localized and stationary mass–energy distribu-
tions.
weak fields 409
M rod
m particle
F I G U R E 198 The
reality of
gravitomagnetism
H = ∇ A = 16πN ρv (254)
where ρ is mass density of the source of the field and N is a proportionality constant. The
quantity A is called the gravitomagnetic vector potential. In nature, there are no sources for
the gravitomagnetic field; it thus obeys ∇H = 0. The gravitomagnetic field has dimension
of inverse time, like an angular velocity.
Challenge 777 ny When the situation in Figure 198 is evaluated, we find that the proportionality constant
N is given by
N = 2 = 7.4 ë 10−28 m~kg ,
G
(255)
c
an extremely small value. We thus find that as in the electrodynamic case, the gravito-
magnetic field is weaker than the gravitoelectric field by a factor of c 2 . It is thus hard to
observe. In addition, a second aspect renders the observation of gravitomagnetism even
more difficult. In contrast to electromagnetism, in the case of gravity there is no way to
Challenge 778 n observe pure gravitomagnetic fields (why?); they are always mixed with the usual, gravito-
electric ones. For these reasons, gravitomagnetic effects were measured for the first time Copyright © Christoph Schiller November 1997–January 2007
only in the 1990s. We see that universal gravity is the approximation of general relativity
that arises when all gravitomagnetic effects are neglected.
In summary, if a mass moves, it also produces a gravitomagnetic field. How can one
imagine gravitomagnetism? Let’s have a look at its effects. The experiment of Figure 198
showed that a moving rod has the effect to slightly accelerate a test mass in the same dir-
ection. In our metaphor of the vacuum as a mattress, it looks as if a moving rod drags the
vacuum along with it, as well as any test mass that happens to be in that region. Gravito-
magnetism can thus be seen as vacuum dragging. Because of a widespread reluctance to
think of the vacuum as a mattress, the expression frame dragging is used instead.
In this description, all frame dragging effects are gravitomagnetic effects. In particular,
a gravitomagnetic field also appears when a large mass rotates, as in the Thirring–Lense
effect of Figure 196. For an angular momentum J the gravitomagnetic field H is a dipole
410 iii gravitation and rel ativity • 8. motion of light and vacuum
field; it is given by
H = ∇ h = ∇ −2
Jx
(256)
r3
exactly as in the electrodynamic case. The gravitomagnetic field around a spinning mass
has three main effects.
First of all, as in electromagnetism, a spinning test particle with angular momentum
S feels a torque if it is near a large spinning mass with angular momentum J. This torque
T is given by
T= = SH .
dS 1
(257)
dt 2
The torque leads to the precession of gyroscopes. For the Earth, this effect is extremely
small: at the North Pole, the precession has a conic angle of 0.6 milli-arcseconds and a
rotation rate of the order of 10−10 times that of the Earth.
Ω̇ = − =− 2 3 + 2 = 2 3
H G J G 3(Jx)x G 2J
c SxS c SxS c a (1 − e 2 )3~2
(258)
2 5
which is the prediction of Lense and Thirring.* The effect is extremely small, giving a
change of only 8 ′′ per orbit for a satellite near the surface of the Earth. Despite this small-
ness and a number of larger effects disturbing it, Ciufolini’s team have managed to con-
Ref. 369 firm the result.
As a third effect of gravitomagnetism, a rotating mass leads to the precession of the
periastron. This is a similar effect to the one produced by space curvature on orbiting
masses even if the central body does not rotate. The rotation just reduces the precession
due to space-time curvature. This effect has been fully confirmed for the famous binary
pulsar PSR B1913+16, discovered in 1974, as well as for the ‘real’ double pulsar PSR J0737-3039,
discovered in 2003. This latter system shows a periastron precession of 16.9°~a, the largest
Ref. 374 value observed so far.
The split into gravitoelectric and gravitomagnetic effects is thus a useful approxim-
Copyright © Christoph Schiller November 1997–January 2007
ation to the description of gravity. It also helps to answer questions such as: How can
gravity keep the Earth orbiting around the Sun, if gravity needs 8 minutes to get from
Challenge 781 ny the Sun to us? To find the answer, thinking about the electromagnetic analogy can help.
In addition, the split of the gravitational field into gravitoelectric and gravitomagnetic
components allows a simple description of gravitational waves.
Challenge 780 ny * A homogeneous spinning sphere has an angular momentum given by J = 25 MωR 2 .
weak fields 411
Gravitational waves
One of the most fantastic predictions of physics is the existence of gravitational waves.
Gravity waves* prove that empty space itself has the ability to move and vibrate. The basic
idea is simple. Since space is elastic, like a large mattress in which we live, space should
be able to oscillate in the form of propagating waves, like a mattress or any other elastic
medium.
Frequency Wa v e l e n g t h N a m e Expected
appearance
Ref. 375 Jørgen Kalckar and Ole Ulfbeck have given a simple argument for the necessity of
Copyright © Christoph Schiller November 1997–January 2007
gravitational waves based on the existence of a maximum speed. They studied two equal
masses falling towards each other under the effect of gravitational attraction, and ima-
gined a spring between them. Such a spring will make the masses bounce towards each
other again and again. The central spring stores the kinetic energy from the falling masses.
The energy value can be measured by determining the length by which the spring is com-
pressed. When the spring expands again and hurls the masses back into space, the gravit-
ational attraction will gradually slow down the masses, until they again fall towards each
other, thus starting the same cycle again.
However, the energy stored in the spring must get smaller with each cycle. Whenever
a sphere detaches from the spring, it is decelerated by the gravitational pull of the other
sphere. Now, the value of this deceleration depends on the distance to the other mass;
but since there is a maximal propagation velocity, the effective deceleration is given by
the distance the other mass had when its gravity effect started out towards the second
mass. For two masses departing from each other, the effective distance is thus somewhat
smaller than the actual distance. In short, while departing, the real deceleration is larger
than the one calculated without taking the time delay into account.
* To be strict, the term ‘gravity wave’ has a special meaning: gravity waves are the surface waves of the sea,
where gravity is the restoring force. However, in general relativity, the term is used interchangeably with
‘gravitational wave’.
412 iii gravitation and rel ativity • 8. motion of light and vacuum
Similarly, when one mass falls back towards the other, it is accelerated by the other
mass according to the distance it had when the gravity effect started moving towards it.
Therefore, while approaching, the acceleration is smaller than the one calculated without
time delay.
Therefore, the masses arrive with a smaller energy than they departed with. At every
bounce, the spring is compressed a little less. The difference between these two energies
is lost by each mass: it is taken away by space-time, in other words, it is radiated away as
gravitational radiation. The same thing happens with mattresses. Remember that a mass
deforms the space around it as a metal ball on a mattress deforms the surface around it.
(However, in contrast to actual mattresses, there is no friction between the ball and the
mattress.) If two metal balls repeatedly bang against each other and then depart again,
until they come back together, they will send out surface waves on the mattress. Over
time, this effect will reduce the distance that the two balls depart from each other after
each bang. As we will see shortly, a similar effect has already been measured, where the
∇ G = −4πGρ , ∇G=−
∂H
∂t
∇H = 0 , ∇ H = −16πGρv +
N ∂G
. (259)
G ∂t
We have met two of these equations already. The two other equations are expanded ver-
sions of what we have encountered, taking time-dependence into account. Except for a
factor of 16 instead of 4 in the last equation, the equations for gravtitodynamics are the
same as Maxwell’s equations for electrodynamics.* These equations have a simple prop-
erty: in vacuum, one can deduce from them a wave equation for the gravitoelectric and
Challenge 783 ny the gravitomagnetic fields G and H. (It is not hard: try!) In other words, gravity can be-
have like a wave: gravity can radiate. All this follows from the expression of universal
gravity when applied to moving observers, with the requirement that neither observers Copyright © Christoph Schiller November 1997–January 2007
nor energy can move faster than c. Both the above argument involving the spring and
the present mathematical argument use the same assumptions and arrive at the same
conclusion.
* The additional factor reflects the fact that the ratio between angular momentum and energy (the ‘spin’)
of gravity waves is different from that of electromagnetic waves. Gravity waves have spin 2, whereas electro-
magnetic waves have spin 1. Note that since gravity is universal, there can exist only a single kind of spin 2
radiation particle in nature. This is in strong contrast to the spin 1 case, of which there are several examples
in nature.
By the way, the spin of radiation is a classical property. The spin of a wave is the ratio E~Lω, where E is
the energy, L the angular momentum, and ω is the angular frequency. For electromagnetic waves, this ratio
is equal to 1; for gravitational waves, it is 2.
Ref. 377 Note that due to the approximation at the basis of the equations of gravitodynamics, the equations are
neither gauge-invariant nor generally covariant.
weak fields 413
No wave
(all times)
Challenge 784 e A few manipulations show that the speed of these waves is given by
¾
c=
G
. (260)
N Copyright © Christoph Schiller November 1997–January 2007
c=º
1
. (261)
ε0 µ0
The same letter has been used for the two speeds, as they are identical. Both influences
travel with the speed common to all energy with vanishing rest mass. (We note that this is,
strictly speaking, a prediction: the speed of gravitational waves has not yet been measured.
Ref. 378 A claim from 2003 to have done so has turned out to be false.)
Ref. 380 How should one imagine these waves? We sloppily said above that a gravitational wave
corresponds to a surface wave of a mattress; now we have to do better and imagine that
we live inside the mattress. Gravitational waves are thus moving and oscillating deform-
414 iii gravitation and rel ativity • 8. motion of light and vacuum
ations of the mattress, i.e., of space. Like mattress waves, it turns out that gravity waves
are transverse. Thus they can be polarized. (Surface waves on mattresses cannot, because
in two dimensions there is no polarization.) Gravity waves can be polarized in two inde-
pendent ways. The effects of a gravitational wave are shown in Figure 200, for both linear
and circular polarization.* We note that the waves are invariant under a rotation by π
and that the two linear polarizations differ by an angle π~4; this shows that the particles
corresponding to the waves, the gravitons, are of spin 2. (In general, the classical radi-
ation field for a spin S particle is invariant under a rotation by 2π~S. In addition, the two
orthogonal linear polarizations of a spin S particle form an angle π~2S. For the photon,
for example, the spin is 1; indeed, its invariant rotation angle is 2π and the angle formed
by the two polarizations is π~2.)
If we image empty space as a mattress that fills space, gravitational waves are wobbling
deformations of the mattress. More precisely, Figure 200 shows that a wave of circular po-
larization has the same properties as a corkscrew advancing through the mattress. We will
* A (small amplitude) plane gravity wave travelling in the z-direction is described by a metric д given by
1 0 0 0
−1 + h x x
д=
0 hx y 0
(262)
0 hx y −1 + h x x 0
0 0 0 −1
where its two components, whose amplitude ratio determine the polarization, are given by
as in all plane harmonic waves. The amplitudes B ab , the frequency ω and the phase φ are determined by
the specific physical system. The general dispersion relation for the wave number k resulting from the wave
equation is
ω
=c (264)
k
and shows that the waves move with the speed of light.
In another gauge, a plane wave can be written as
c (1 + 2φ)
2
A1 A2 A3
д=
A1 −1 + 2φ hx y 0
(265)
A2 hx y −1 + h x x 0
A3 0 0 −1
cylindrically symmetric around its rotation axis will do so. As a result, rotating an arm
leads to gravitational wave emission. Most of these statements also apply to masses in
Challenge 786 ny mattresses. Can you point out the differences?
Einstein found that the amplitude h of waves at a distance r from a source is given, to
Ref. 379 a good approximation, by the second derivative of the retarded quadrupole moment Q:
h ab = = 4 d tt Q ab (t − r~c) .
2G 1 ret 2G 1
4
d tt Q ab (266)
c r c r
This expression shows that the amplitude of gravity waves decreases only with 1~r, in
contrast to naive expectations. However, this feature is the same as for electromagnetic
waves. In addition, the small value of the prefactor, 1.6 ë 10−44 Wm~s, shows that truly
gigantic systems are needed to produce quadrupole moment changes that yield any de-
tectable length variations in bodies. To be convinced, just insert a few numbers, keep-
ing in mind that the best present detectors are able to measure length changes down to
m1 m2 2 4 6
P=− = = l ω
dE G ... ret ... ret 32 G
Q Q (267)
dt 45c 5 ab ab 5 c 5 m1 + m2
32 G 4 (m 1 m 2 )2 (m 1 + m 2 )
P= . (268)
5 c5 l5
Ref. 340 For elliptical orbits, the rate increases with the ellipticity, as explained by Goenner. Insert-
ing the values for the case of the Earth and the Sun, we get a power of about 200 W, and
a value of 400 W for the Jupiter–Sun system. These values are so small that their effect
cannot be detected at all.
For all orbiting systems, the frequency of the waves is twice the orbital frequency, as
you might want to check. These low frequencies make it even more difficult to detect
Copyright © Christoph Schiller November 1997–January 2007
Challenge 788 ny
them.
As a result, the only observation of effects of gravitational waves to date is in binary
pulsars. Pulsars are small but extremely dense stars; even with a mass equal to that of
the Sun, their diameter is only about 10 km. Therefore they can orbit each other at small
distances and high speeds. Indeed, in the most famous binary pulsar system, PSR 1913+16,
the two stars orbit each other in an amazing 7.8 h, even though their semimajor axis is
gravitation, a monopole is a point-like or two spherical masses, and, since masses cannot be negative, a
quadrupole is formed by two monopoles. A flattened sphere, such as the Earth, can be approximated by the
sum of a monopole and a quadrupole. The same is valid for an elongated sphere.
Page 561 * Gravitomagnetism and gravitoelectricity allow one to define a gravitational Poynting vector. It is as easy
Ref. 373 to define and use as in the case of electrodynamics.
416 iii gravitation and rel ativity • 8. motion of light and vacuum
mirror
L1
mirror
light L2
source
F I G U R E 202 Detection of
gravitational waves
about 700 Mm, just less than twice the Earth–Moon distance. Since their orbital speed is
up to 400 km~s, the system is noticeably relativistic.
Pulsars have a useful property: because
of their rotation, they emit extremely reg- time
precision, consider that this experiment de- F I G U R E 201 Comparison between measured
tected a reduction of the orbital diameter time delay for the periastron of the binary pulsar
Copyright © Christoph Schiller November 1997–January 2007
* In 1993 he shared the Nobel Prize in physics for his life’s work.
weak fields 417
is perpendicular to the line connecting the other pair. Then measure the distance changes
of each pair. If a gravitational wave comes by, one pair will increase in distance and the
other will decrease, at the same time.
Since detectable gravitational waves cannot be produced by humans, wave detection
first of all requires the patience to wait for a strong enough wave to come by. Secondly, a
system able to detect length changes of the order of 10−22 or better is needed – in other
words, a lot of money. Any detection is guaranteed to make the news on television.*
It turns out that even for a body around a black hole, only about 6 % of the rest mass
can be radiated away as gravitational waves; furthermore, most of the energy is radiated
during the final fall into the black hole, so that only quite violent processes, such as black
hole collisions, are good candidates for detectable gravity wave sources.
Gravitational waves are a fascinating area of study. They still provide many topics to
Challenge 791 r explore. For example: can you find a method to measure their speed? A well-publicized
Ref. 378 but false claim appeared in 2003. Indeed, any correct measurement that does not simply
α=
ª ∂v
∫−ª ∂x
dy , (269)
Copyright © Christoph Schiller November 1997–January 2007
Challenge 793 ny where v is the speed of light measured by a distant observer. (Can you confirm this?) The
next step is to use the Schwarzschild metric
dr 2 r2 2
dτ 2 = (1 − )dt
2GM 2
dφ
(c 2 − 2GrM ) c 2
− − (270)
rc 2
Challenge 794 ny and transform it into (x, y) coordinates to first order. That gives
Ref. 383 * The topic of gravity waves is full of interesting sidelines. For example, can gravity waves be used to power
Challenge 790 ny a rocket? Yes, say Bonnor and Piper. You might ponder the possibility yourself.
418 iii gravitation and rel ativity • 8. motion of light and vacuum
dτ 2 = (1 − )dt 2 − (1 + ) (dx 2 + dy 2 )
2GM 2GM 1
2
(271)
rc rc 2 c 2
which again to first order leads to
= (1 − )c .
∂v 2GM
(272)
∂x rc 2
This confirms what we know already, namely that distant observers see light slowed down
when passing near a mass. Thus we can also speak of a height-dependent index of refrac-
tion. In other words, constant local light speed leads to a global slowdown.
Challenge 795 ny Inserting the last result in (269) and using a clever substitution, we
get a deviation angle α given by
α=
4GM 1 α
dio sources in the sky which lie on the path of the Sun. They have confirmed general
Ref. 362, Ref. 339, relativity’s prediction within a few per cent.
Ref. 340 So far, bending of radiation has also been observed near Jupiter, near certain stars,
Page 475 near several galaxies and near galaxy clusters. For the Earth, the angle is at most 3 nrad,
too small to be measured yet, even though this may be feasible in the near future. There
is a chance to detect this value if, as Andrew Gould proposes, the data of the satellite
Hipparcos, which is taking precision pictures of the night sky, are analysed properly in
the future.
Page 426 Of course, the bending of light also confirms that in a triangle, the sum of the angles
does not add up to π (two right angles), as is predicted later for curved space. (What is
Challenge 797 ny the sign of the curvature?)
weak fields 419
10 May 1970
Earth orbit
31 March 1970
Sun Mariner 6
orbit
a: semimajor periastron
axis
240 a
Time delay (µs)
180
M
120
0
Jan Feb Mar Apr May Jun
1970
F I G U R E 204 Time delay in radio signals – one F I G U R E 205 The orbit around a
of the experiments by Irwin Shapiro central body in general relativity
Time delay
The above calculation of the bending of light near masses shows that for a distant observer,
light is slowed down near a mass. Constant local light speed leads to a global light speed
slowdown. If light were not slowed down near a mass, it would have to go faster than c
Ref. 386 for an observer near the mass!* In 1964, Irwin Shapiro had the idea to measure this effect.
He proposed two methods. The first was to send radar pulses to Venus, and measure the
time taken for the reflection to get back to Earth. If the signals pass near the Sun, they
will be delayed. The second was to use an artificial satellite communicating with Earth.
Ref. 387 The first measurement was published in 1968, and directly confirmed the prediction of
general relativity within experimental errors. All subsequent tests of the same type, such
as the one shown in Figure 204, have also confirmed the prediction within experimental
Copyright © Christoph Schiller November 1997–January 2007
errors, which nowadays are of the order of one part in a thousand. The delay has also
Ref. 388 been measured in binary pulsars, as there are a few such systems in the sky for which the
line of sight lies almost precisely in the orbital plane.
The simple calculations presented here suggest a challenge: Is it also possible to de-
scribe full general relativity – thus gravitation in strong fields – as a change of the speed
Challenge 799 ny of light with position and time induced by mass and energy?
* A nice exercise is to show that the bending of a slow particle gives the Soldner value, whereas with increas-
Challenge 798 e ing speed, the value of the bending approaches twice that value. In all these considerations, the rotation
of the mass has been neglected. As the effect of frame dragging shows, rotation also changes the deviation
angle; however, in all cases studied so far, the influence is below the detection threshold.
420 iii gravitation and rel ativity • 8. motion of light and vacuum
Effects on orbits
Astronomy allows precise measurements of motions. So, Einstein first of all tried to ap-
ply his results to the motion of planets. He looked for deviations of their motions from
the predictions of universal gravity. Einstein found such a deviation: the precession of
the perihelion of Mercury. The effect is shown in Figure 205. Einstein said later that the
moment he found out that his calculation for the precession of Mercury matched obser-
vations was one of the happiest moments of his life.
The calculation is not difficult. In universal gravity, orbits are calculated by setting
a grav = a centri , in other words, by setting GM~r 2 = ω 2 r and fixing energy and angular
momentum. The mass of the orbiting satellite does not appear explicitly.
In general relativity, the mass of the orbiting satellite is made to disappear by rescaling
Ref. 339, Ref. 340 energy and angular momentum as e = E~mc 2 and j = J~m. Next, the space curvature
Page 392 needs to be included. We use the Schwarzschild metric (270) mentioned above to deduce
that the initial condition for the energy e, together with its conservation, leads to a rela-
=
dt e
, (274)
dτ 1 − 2GM~rc 2
whereas the initial condition on the angular momentum j and its conservation imply that
= 2 .
dφ j
(275)
dτ r
These relations are valid for any particle, whatever its mass m. Inserting all this into the
Schwarzschild metric, we find that the motion of a particle follows
( ) + V 2 ( j, r) = e 2
dr 2
(276)
cdτ
where the effective potential V is given by
j2
V 2 (J, r) = (1 − )(1 ).
2GM
+ (277)
rc 2 r2 c2
Copyright © Christoph Schiller November 1997–January 2007
Challenge 801 ny The expression differs slightly from the one in universal gravity, as you might want to
Challenge 802 e check. We now need to solve for r(φ). For circular orbits we get two possibilities
6GM~c 2
r = ¼ (278)
1 1 − 12( GcMj )2
º
where the minus sign gives a stable and the plus sign an unstable orbit. If c j~GM < 2 3 ,
º
no stable orbit exists; the object will impact the surface or, for a black hole, be swallowed.
There is a stable circular orbit only if the angular momentum j is larger than 2 3 GM~c.
We thus find that in general relativity, in contrast to universal gravity, there is a smallest
weak fields 421
stable circular orbit. The radius of this smallest stable circular orbit is 6GM~c 2 = 3R S .
What is the situation for elliptical orbits? Setting u = 1~r in (276) and differentiating,
the equation for u(φ) becomes
u′ + u =
GM 3GM 2
+ 2 u . (279)
j2 c
Without the nonlinear correction due to general relativity on the far right, the solutions
Challenge 803 e are the famous conic sections
u 0 (φ) = (1 + ε cos φ) ,
GM
(280)
j2
i.e. ellipses, parabolas or hyperbolas. The type of conic section depends on the value of
the parameter ε, the so-called eccentricity. We know the shapes of these curves from uni-
3G 2 M 2
u 1 (φ) = [1
GM
+ ε cos(φ − φ)] . (281)
j2 j2 c 2
The hyperbolas and parabolas of universal gravity are thus slightly deformed. Instead of
elliptical orbits we get the famous rosetta path shown in Figure 205. Such a path is above
all characterized by a periastron shift. The periastron, or perihelion in the case of the Sun,
is the nearest point to the central body reached by an orbiting body. The periastron turns
Challenge 805 e around the central body by an angle
α 6π
GM
a(1 − ε 2 )c 2
(282)
for every orbit, where a is the semimajor axis. For Mercury, the value is 43 ′′ per century.
Around 1900, this was the only known effect that was unexplained by universal gravity;
when Einstein’s calculation led him to exactly that value, he was overflowing with joy for
many days.
Copyright © Christoph Schiller November 1997–January 2007
To be sure about the equality between calculation and experiment, all other effects
leading to rosetta paths must be eliminated. For some time, it was thought that the quad-
rupole moment of the Sun could be an alternative source of this effect; later measurements
ruled out this possibility.
In the meantime, the perihelion shift has been measured also for the orbits of Icarus,
Venus and Mars around the Sun, as well as for several binary star systems. In binary
Ref. 388 pulsars, the periastron shift can be as large as several degrees per year. In all cases, expres-
sion (282) describes the motion within experimental errors.
We note that even the rosetta orbit itself is not really stable, due to the emission of
gravitational waves. But in the solar system, the power lost this way is completely negli-
Page 415 gible even over thousands of millions of years, as we saw above, so that the rosetta path
422 iii gravitation and rel ativity • 8. motion of light and vacuum
geodesic
precession
Earth
start
Lense– S
after one
Thirring
orbit
precession
The angle change is called the geodesic effect – ‘geodetic’ in other languages. It is a further
consequence of the split into gravitoelectric and gravitomagnetic fields, as you may want
Challenge 806 e to show. Obviously, it does not exist in universal gravity.
In cases where the pointing of the orbiting body is realized by an intrinsic rotation,
such as a spinning satellite, the geodesic effect produces a precession of the axis. Thus the
effect is comparable to spin–orbit coupling in atomic theory. (The Thirring–Lense effect
mentioned above is analogous to spin–spin coupling.)
Ref. 389 The geodesic effect, or geodesic precession, was predicted by Willem de Sitter in 1916; Copyright © Christoph Schiller November 1997–January 2007
in particular, he proposed detecting that the Earth–Moon system would change its point-
ing direction in its fall around the Sun. The effect is tiny; for the axis of the Moon the
precession angle is about 0.019 arcsec per year. The effect was first detected in 1987 by an
Ref. 390 Italian team for the Earth–Moon system, through a combination of radio-interferometry
and lunar ranging, making use of the Cat’s-eyes deposited by Lunakhod and Apollo on
the Moon. Experiments to detect it in artificial satellites are also under way.
At first sight, geodesic precession is similar to the Thomas precession found in special
Page 317 relativity. In both cases, a transport along a closed line results in the loss of the original
direction. However, a careful investigation shows that Thomas precession can be added
to geodesic precession by applying some additional, non-gravitational interaction, so the
analogy is shaky.
This completes our discussion of weak gravity effects. We now turn to strong gravity,
how is curvature measured? 423
where curvature cannot be neglected and where the fun is even more intense.
**
Challenge 809 ny Would two parallel beams of gravitational waves attract each other?
intrinsic curvature, if she takes the limit for vanishingly small circles and if she normalizes
the values correctly. In other words, the ant can imagine to cut out a little disc around the
point she is on, to iron it flat and to check whether the disc would tear or produce folds.
Any two-dimensional surface is intrinsically curved whenever ironing is not able to make
a flat street map out of it. The ‘density’ of folds or tears is related to the curvature.
This means that we can recognize intrinsic curvature also by checking whether two
parallel lines stay parallel, approach each other, or depart from each other. In the first case,
such as lines on a paper cylinder, the surface is said to have vanishing intrinsic curvature;
Challenge 811 e * Unless the mountain has the shape of a perfect cone. Can you confirm this?
** Note that the answer to this question also tells us how to distinguish real curvature from curved coordinate
systems on a flat space. This question is often asked by those approaching general relativity for the first time.
424 iii gravitation and rel ativity • 8. motion of light and vacuum
C = 2πr(1 − A = πr 2 (1 −
K 2 K 2
r + ...) and r + ...) (284)
6 12
where the dots imply higher-order terms. This allows one to define the intrinsic curvature
K, also called the Gaussian curvature, for a general point on a two-dimensional surface
in either of the following two equivalent ways:
K = 6 lim(1 − ) K = 12 lim(1 − ) .
C 1 A 1
or (285)
Copyright © Christoph Schiller November 1997–January 2007
r 0 2πr r 2 r 0 πr 2 r 2
These expressions allow an ant to measure the intrinsic curvature at each point for any
smooth surface.* From now on in this text, curvature will always mean intrinsic curvature.
Note that the curvature can be different from place to place, and that it can be positive, as
right
angle direction of
maximal curvature
for an egg, or negative, as for the part of a torus nearest to the hole. A saddle is another ex-
ample of the latter case, but, unlike the torus, its curvature changes along all directions. In
fact, it is not possible at all to fit a two-dimensional surface of constant negative curvature
A = 4πr 2 (1 − V= r (1 − r 2 + ...) ,
K 2 4π 3 K
r + ...) and (287)
3 3 5
where K is the curvature for an isotropic point. This leads to
»
K = 3 lim(1 − ) 2 = 6 lim = 6 lim 3 ,
A 1 r − A~4π r excess
(288)
r 0 4πr r
2 r 0 r 3 r 0 r
»
defining the excess radius as r excess = r − A~4π . We thus find that for a three-
dimensional space, the average curvature is six times the excess radius of a small sphere
divided by the cube of the radius. A positive curvature is equivalent to a positive excess
radius, and similarly for vanishing and negative cases.
426 iii gravitation and rel ativity • 8. motion of light and vacuum
Of course, a curvature value defined in this way is only an average over all possible
directions. The precise definition of curvature involves a disc. For points that are not
isotropic, the value yielded by a disc will differ from the value calculated by using a sphere,
as it will depend on the orientation of the disc. In fact, there is a relationship between all
Challenge 817 ny possible disc curvatures at a given point; taken together, they must form a tensor. (Why?)
For a full description of curvature, we thus have to specify, as for any tensor in three
dimensions, the main curvature values in three orthogonal directions.*
What are the curvature values in the space around us? Already in 1827, the math-
ematician and physicist Carl-Friedrich Gauss** is said to have checked whether the
three angles formed by three mountain peaks near his place of residence added up to
π. Nowadays we know that the deviation δ from the angle π on the surface of a body of
mass M and radius r is given by
δ = π − (α + β + γ) A triangle K = A triangle
GM
. (289)
“ de changer de direction.***
Francis Picabia
”
* These three disc values are not independent however, since together, they must yield the just-mentioned
average volume curvature K. In total, there are thus three independent scalars describing the curvature in
three dimensions (at each point). With the metric tensor дab and the Ricci tensor R ab to be introduced below,
one possibility is to take for the three independent numbers the values R = −2K, R ab R ab and detR~detд.
Copyright © Christoph Schiller November 1997–January 2007
** Carl-Friedrich Gauß (b. 1777 Braunschweig, d. 1855 Göttingen), German mathematician. Together with
the Leonhard Euler, the most important mathematician of all times. A famous enfant prodige, when he was
19 years old, he constructed the regular heptadecagon with compass and ruler (see http://www.mathworld.
wolfram.com/Heptadecagon.html). He was so proud of this result that he put a drawing of the figure on his
tomb. Gauss produced many results in number theory, topology, statistics, algebra, complex numbers and
differential geometry which are part of modern mathematics and bear his name. Among his many accom-
plishments, he produced a theory of curvature and developed non-Euclidean geometry. He also worked on
electromagnetism and astronomy.
Gauss was a difficult character, worked always for himself, and did not found a school. He published
little, as his motto was: pauca sed matura. As a consequence, when another mathematician published a new
result, he regularly produced a notebook in which he had noted the very same result already years before.
His notebooks are now available online at http://www.sub.uni-goettingen.de.
*** ‘Our head is round in order to allow our thougths to change direction.’ Francis Picabia (b. 1879 Paris, d.
1953 Paris) French dadaist and surrealist painter.
how is curvature measured? 427
In nature, with four space-time dimensions, specifying curvature requires a more in-
volved approach. First of all, the use of space-time coordinates automatically introduces
the speed of light c as limit speed, which is a central requirement in general relativity.
Furthermore, the number of dimensions being four, we expect a value for an average
d2 s
= −Ks + higher orders (290)
dl 2
where l measures the length along the geodesic, and K is the curvature, in other words,
the inverse squared curvature radius. In space-time, this relation is extended by sub-
Copyright © Christoph Schiller November 1997–January 2007
stituting proper time (times the speed of light) for proper length. Thus separation and
curvature are related by
d2 s
= −Kc 2 s + higher orders . (291)
dτ 2
But this is the definition of an acceleration. In other words, what in the purely spatial
case is described by curvature, in the case of space-time becomes the relative acceleration
of two particles freely falling from nearby points. Indeed, we have encountered these
Page 131 accelerations already: they describe tidal effects. In short, space-time curvature and tidal
effects are precisely the same.
Obviously, the magnitude of tidal effects, and thus of curvature, will depend on the
orientation – more precisely on the orientation of the space-time plane formed by the
428 iii gravitation and rel ativity • 8. motion of light and vacuum
two particle velocities. The definition also shows that K is a tensor, so that later on we
Challenge 819 ny will have to add indices to it. (How many?) The fun is that we can avoid indices for a
Ref. 393 while by looking at a special combination of spatial curvatures. If we take three planes in
space, all orthogonal to each other and intersecting at a given point, the sum of the three
so-called sectional curvature values does not depend on the observer. (This corresponds
to the tensor trace.) Can you confirm this, by using the definition of the curvature just
Challenge 820 ny given?
The sum of the three sectional curvatures defined for mutually orthogonal planes K(12),
Challenge 821 ny K(23) and K(31) , is related to the excess radius defined above. Can you find out how?
If a surface has constant (intrinsic) curvature, i.e. the same curvature at all locations,
Challenge 822 e geometrical objects can be moved around without deforming them. Can you picture this?
In summary, curvature is not such a difficult concept. It describes the deformation
Ref. 394 of space-time. If we imagine space (-time) as a big blob of rubber in which we live, the
curvature at a point describes how this blob is squeezed at that point. Since we live inside
where W (0) is the proper energy density at the point. The lower indices indicate the mixed
curvatures defined by the three orthogonal directions 1, 2 and 3. This is all of general
relativity in one paragraph.
Challenge 823 e An equivalent expression is easily found using the excess radius defined above, by
Copyright © Christoph Schiller November 1997–January 2007
introducing the mass M = V W (0) ~c 2 . For the surface area A of the volume V containing
the mass, we get
»
r excess = r − A~4π = 2 M .
G
(293)
3c
In short, general relativity affirms that for every observer, the excess radius of a small
sphere is given by the mass inside the sphere.*
Ref. 395 * Another, equivalent formulation is that for small radii the area A is given by
A = 4πr (1 + r R)
2 1 2
(294)
9
where R is the Ricci scalar, to be introduced later on.
how is curvature measured? 429
Note that the above expression implies that the average space curvature at a point in
empty space vanishes. As we will see shortly, this means that near a spherical mass the
negative of the curvature towards the mass is equal to twice the curvature around the
mass; the total sum is thus zero.
Curvature will also differ from point to point. In particular, the expression implies that
if energy moves, curvature will move with it. In short, both space curvature and, as we
will see shortly, space-time curvature change over space and time.
We note in passing that curvature has an annoying effect: the relative velocity of distant
Challenge 824 ny observers is undefined. Can you provide the argument? In curved space, relative velocity
is defined only for nearby objects – in fact only for objects at no distance at all. Only in
flat space are relative velocities of distant objects well defined.
The quantities appearing in expression (292) are independent of the observer. But often
people want to use observer-dependent quantities. The relation then gets more involved;
the single equation (292) must be expanded to ten equations, called Einstein’s field equa-
Universal gravity
The only reason which keeps me here is gravity.
“ Anonymous
For small velocities and low curvature values, the temporal curvatures K(0 j) turn out to ”
have a special property. In this case, they can be defined as the second spatial derivatives
Challenge 825 e of a single scalar function φ. In other words, we can write
∂2 φ
K(0 j) = .
∂(x j )2
(295)
In everyday situations, the function φ turns out to be the gravitational potential. Indeed,
universal gravity is the limiting case of general relativity for small speeds and small spatial
curvature. These two limits imply, making use of W (0) = ρc 2 and c ª, that
In other words, for small speeds, space is flat and the potential obeys Poisson’s equation.
Universal gravity is thus indeed the low speed and low curvature limit of general relativity.
Can you show that relation (292) between curvature and energy density indeed means
Challenge 826 ny that time near a mass depends on the height, as stated in the beginning of this chapter?
– CS – more to be inserted – CS –
430 iii gravitation and rel ativity • 8. motion of light and vacuum
K rφ = K rθ = − and K θ φ = 2 2 3
GM GM
2
c r 3 c r
K tφ = K tθ = 2 3 and K tr = −2 2 3
GM GM
(297)
c r c r
Ref. 393 everywhere. The dependence on 1~r 3 follows from the general dependence of all tidal
Page 131 effects; we have already calculated them in the chapter on universal gravity. The factors
G~c 2 are due to the maximum force of gravity; only the numerical prefactors need to be
calculated from general relativity. The average curvature obviously vanishes, as it does
Challenge 828 ny for all vacuum. As expected, the values of the curvatures near the surface of the Earth are
exceedingly small.
**
Challenge 830 e Where are the points of highest and lowest Gaussian curvature on an egg?
Now that we have a feeling for curvature, we want to describe it in a way that allows any
observer to talk to any other observer. Unfortunately, this means using formulae with
tensors. These formulae look daunting. The challenge is to see in each of the expressions
”
the essential point (e.g. by forgetting all indices for a while) and not to be distracted by
Copyright © Christoph Schiller November 1997–January 2007
“
We mentioned above that a 4-dimensional space-time is described by 2-curvature, 3-
curvature and 4-curvature. Many texts on general relativity start with 3-curvature. These
André Gide
”
** ‘Every street urchin in our mathematical Göttingen knows more about four-dimensional geometry than
Einstein. Nevertheless, it was Einstein who did the work, not the great mathematicians.’
*** ‘One has to follow one’s inclination, especially if it climbs upwards.’
all observers – heavier mathematics 431
curvatures describing the distinction between the 3-volume calculated from a radius and
the actual 3-volume. They are described by the Ricci tensor.* With an argument we en-
countered already for the case of geodesic deviation, it turns out that the Ricci tensor
describes how the shape of a spherical cloud of freely falling particles is deformed on its
path.
– CS – to be expanded – CS –
In short, the Ricci tensor is the general-relativistic version of ∆φ, or better, of 2φ.
The most global, but least detailed, definition of curvature is the one describing the
distinction between the 4-volume calculated from a measured radius and the actual 4-
volume. This is the average curvature at a space-time point and is represented by the so-
called Ricci scalar R, defined as
R = −2K = −
2
It turns out that the Ricci scalar can be derived from the Ricci tensor by a so-called con-
traction, which is a precise averaging procedure. For tensors of rank two, contraction is
the same as taking the trace:
R = Rλ λ = дλµ Rλµ . (299)
The Ricci scalar describes the curvature averaged over space and time. In the image of
a falling spherical cloud, the Ricci scalar describes the volume change of the cloud. The
Ricci scalar always vanishes in vacuum. This result allows one, on the surface of the Earth,
Challenge 831 ny to relate the spatial curvature to the change of time with height.
Now comes an idea discovered by Einstein after two years of hard work. The import-
ant quantity for the description of curvature in nature is not the Ricci tensor R ab , but
a tensor built from it. This Einstein tensor G ab is defined mathematically (for vanishing
cosmological constant) as
G ab = R ab − дab R .
1
(300)
2
It is not difficult to understand its meaning. The value G 00 is the sum of sectional
curvatures in the planes orthogonal to the 0 direction and thus the sum of all spatial
Copyright © Christoph Schiller November 1997–January 2007
sectional curvatures:
G 00 = K(12) + K(23) + K(31) . (301)
Similarly, for each dimension i the diagonal element G i i is the sum (taking into consid-
eration the minus signs of the metric) of sectional curvatures in the planes orthogonal to
the i direction. For example, we have
* Gregorio Ricci-Cubastro (b. 1853 Lugo , d. 1925 Bologna), Italian mathematician. He is the father of
absolute differential calculus, once also called ‘Ricci calculus’. Tullio Levi-Civita was his pupil.
432 iii gravitation and rel ativity • 8. motion of light and vacuum
The distinction between the Ricci tensor and the Einstein tensor thus lies in the way in
which the sectional curvatures are combined: discs containing the coordinate in ques-
tion in one case, discs orthogonal to the coordinate in the other case. Both describe the
curvature of space-time equally well, and fixing one means fixing the other. (What are
Challenge 832 d the trace and the determinant of the Einstein tensor?)
The Einstein tensor is symmetric, which means that it has ten independent compon-
ents. Most importantly, its divergence vanishes; it therefore describes a conserved quant-
ity. This was the essential property which allowed Einstein to relate it to mass and energy
in mathematical language.
The surface is assumed to be characterized by its normal vector dA. Since the energy–
momentum density is a proportionality factor between two vectors, T is a tensor.
Of course, we are talking about 4-flows and 4-surfaces here. Therefore the energy–
momentum density tensor can be split in the following way:
(304)
S2 t 21 t 22 t 23 energy flow or momentum
S3 t 31 t 32 t 33 momentum density flow density
where w = T00 is a 3-scalar, S a 3-vector and t a 3-tensor. The total quantity T is called the
energy–momentum (density) tensor. It has two essential properties: it is symmetric and
its divergence vanishes.
The vanishing divergence of the tensor T, often written as
∂ a T ab = 0 or abbreviated T ab , a = 0 , (305)
expresses the fact that the tensor describes a conserved quantity. In every volume, en-
all observers – heavier mathematics 433
ergy can change only via flow through its boundary surface. Can you confirm that the
description of energy–momentum with this tensor satisfies the requirement that any two
observers, differing in position, orientation, speed and acceleration, can communicate
Challenge 833 ny their results to each other?
The energy–momentum density tensor gives a full description of the distribution of
energy, momentum and mass over space and time. As an example, let us determine the
energy–momentum density for a moving liquid. For a liquid of density ρ, a pressure p
and a 4-velocity u, we have
T ab = (ρ 0 + p)u a u b − pд ab (306)
where ρ 0 is the density measured in the comoving frame, the so-called proper density.*
Obviously, ρ, ρ 0 and p depend on space and time.
Of course, for a particular material fluid, we need to know how pressure p and density
p = p(ρ) . (308)
This relation is a material property and thus cannot be determined from relativity. It has
to be derived from the constituents of matter or radiation and their interactions. The
simplest possible case is dust, i.e. matter made of point particles** with no interactions at
all. Its energy–momentum tensor is given by
T ab = ρ 0 u a u b . (309)
Challenge 834 ny Can you explain the difference from the liquid case?
The divergence of the energy–momentum tensor vanishes for all times and positions,
Challenge 835 ny as you may want to check. This property is the same as for the Einstein tensor presented
above. But before we elaborate on this issue, a short remark. We did not take into account
gravitational energy. It turns out that gravitational energy cannot be defined in general.
Gravity is not an interaction and does not have an associated energy.***
ρ0 c
2
0 0 0
0
= .
p 0 0
0
ab
T (307)
0 p 0
0 0 0 p
** Even though general relativity expressly forbids the existence of point particles, the approximation is
useful in cases when the particle distances are large compared to their own size.
*** In certain special circumstances, such as weak fields, slow motion, or an asymptotically flat space-time,
we can define the integral of the G 00 component of the Einstein tensor as negative gravitational energy. Grav-
itational energy is thus only defined approximately, and only for our everyday environment. Nevertheless,
Page 479 this approximation leads to the famous speculation that the total energy of the universe is zero. Do you
Challenge 836 ny agree?
434 iii gravitation and rel ativity • 8. motion of light and vacuum
– CS – to be finished – CS –
The field equations for empty space-time also show scale symmetry. This is the in-
variance of the equations after multiplication of all coordinates by a common numerical
factor. In 1993, Torre and Anderson showed that diffeomorphism symmetry and trivial
Ref. 396 scale symmetry are the only symmetries of the vacuum field equations.
Apart from diffeomorphism symmetry, full general relativity, including mass–energy,
has an additional symmetry which is not yet fully elucidated. This symmetry connects
the various possible initial conditions of the field equations; the symmetry is extremely
Ref. 397 complex and is still a topic of research. These fascinating investigations should give new
insights into the classical description of the big bang.
all observers – heavier mathematics 435
Page 361
Einstein’s famous field equations were the basis of many religious worries. They contain
the full description of general relativity. As explained above, they follow from the max-
”
G ab = −κ Tab
or
R ab − дab R − Λдab = −κ T ab .
1
(311)
2
The constant κ, called the gravitational coupling constant, has been measured to be
κ= = 2.1 ë 10−43 ~N
8πG
(312)
c4
and its small value – 2π divided by the maximum force c 4 ~4G – reflects the weakness of
gravity in everyday life, or better, the difficulty of bending space-time. The constant Λ,
the so-called cosmological constant, corresponds to a vacuum energy volume density, or
Page 468 pressure Λ~κ. Its low value is quite hard to measure. The currently favoured value is
Ref. 398 Current measurements and simulations suggest that this parameter, even though it is
Copyright © Christoph Schiller November 1997–January 2007
numerically near to the square of the present radius of the universe, is a constant of nature
that does not vary with time.
In summary, the field equations state that the curvature at a point is equal to the flow of
energy–momentum through that point, taking into account the vacuum energy density.
In other words, energy–momentum tells space-time how to curve.*
* Einstein arrived at his field equations using a number of intellectual guidelines that are called principles in
the literature. Today, many of them are not seen as central any more. Nevertheless, we give a short overview.
- Principle of general relativity: all observers are equivalent; this principle, even though often stated, is
probably empty of any physical content.
- Principle of general covariance: the equations of physics must be stated in tensorial form; even though
Ref. 399 it is known today that all equations can be written with tensors, even universal gravity, in many cases they
436 iii gravitation and rel ativity • 8. motion of light and vacuum
The field equations of general relativity can be simplified for the case in which speeds
are small. In that case T00 = ρc 2 and all other components of T vanish. Using the defini-
Challenge 838 ny tion of the constant κ and setting φ = (c 2 ~2)h 00 in дab = η ab + h ab , we find
d2 x
∇2 φ = 4πρ and = −∇φ (314)
dt 2
which we know well, since it can be restated as follows: a body of mass m near a body of
mass M is accelerated by
a=G 2,
M
(315)
r
a value which is independent of the mass m of the falling body. And indeed, as noted
already by Galileo, all bodies fall with the same acceleration, independently of their size,
their mass, their colour, etc. In general relativity also, gravitation is completely demo-
require unphysical ‘absolute’ elements, i.e. quantities which affect others but are not affected themselves.
Page 692 This unphysical idea is in contrast with the idea of interaction, as explained above.
- Principle of minimal coupling: the field equations of gravity are found from those of special relativity
by taking the simplest possible generalization. Of course, now that the equations are known and tested
experimentally, this principle is only of historical interest.
- Equivalence principle: acceleration is locally indistinguishable from gravitation; we used it to argue that
space-time is semi-Riemannian, and that gravity is its curvature.
- Mach’s principle: inertia is due to the interaction with the rest of the universe; this principle is correct,
even though it is often maintained that it is not fulfilled in general relativity. In any case, it is not the essence
Page 482 of general relativity.
- Identity of gravitational and inertial mass: this is included in the definition of mass from the outset, but
restated ad nauseam in general relativity texts; it is implicitly used in the definition of the Riemann tensor.
- Correspondence principle: a new, more general theory, such as general relativity, must reduce to previous
theories, in this case universal gravity or special relativity, when restricted to the domains in which those
are valid.
* Here is another way to show that general relativity fits with universal gravity. From the definition of the
Copyright © Christoph Schiller November 1997–January 2007
Riemann tensor we know that relative acceleration b a and speed of nearby particles are related by
c d
∇ e b a = R ced a v v . (316)
From the symmetries of R we know there is a φ such that b a = −∇ a φ. That means that
∇ e b a = ∇ e ∇ a φ = R ced
a
vc vd (317)
Challenge 839 e differentiable and thus deterministic. Note that only the complete motion, of space-time
and matter and energy, has these properties. For particle motion only, motion is in fact
irreversible, since some gravitational radiation is usually emitted.
By contracting the field equations we find, for vanishing cosmological constant, the
following expression for the Ricci scalar:
R = −κT . (320)
This result also implies the relation between the excess radius and the mass inside a
Challenge 840 ny sphere.
The field equations are nonlinear in the metric д, meaning that sums of solutions usu-
ally are not solutions. That makes the search for solutions rather difficult. For a complete
solution of the field equations, initial and boundary conditions should be specified. The
Ref. 400 ways to do this form a specialized part of mathematical physics; it is not explored here.
– CS – more to be added – CS –
Another area of application concerns gravitational waves. The full field equations show
that waves are not harmonic, but nonlinear. Sine waves exist only approximately, for small
amplitudes. Even more interestingly, if two waves collide, in many cases singularities are Copyright © Christoph Schiller November 1997–January 2007
predicted to appear. This whole theme is still a research topic and might provide new
insights for the quantization of general relativity in the coming years.
We end this section with a side note. Usually, the field equations are read in one sense
only, as stating that energy–momentum produces curvature. One can also read them in
the other way, calculating the energy–momentum needed to produces a given curvature.
When one does this, one discovers that not all curved space-times are possible, as some
would lead to negative energy (or mass) densities. Such solutions would contradict the
mentioned limit on length-to-mass ratios for physical systems.
2GM = Rc 2 (1 − R )
Λ 2
(321)
3
implies that a radius-independent maximum force is valid only for positive or zero cos-
mological constant. For a negative cosmological constant the force limit would only be
valid for infinitely small black holes. In the following, we take a pragmatic approach and
note that a maximum force limit can be seen to imply a vanishing or positive cosmolo-
gical constant. Obviously, the force limit does not specify the value of the constant; to
mental test of the horizon equation for static horizons only is not sufficient to confirm
general relativity; such a test rules out only some, but not all, scalar–tensor theories.
explore its consequences and compare them with the known properties of nature. The
maximum force limit can also be visualized in another way. If the gravitational attraction
between a central body and a satellite were stronger than it is, black holes would be smaller
than they are; in that case the maximum force limit and the maximum speed could be
exceeded. If, on the other hand, gravitation were weaker than it is, a fast and accelerating
observer would not be able to determine that the two bodies interact. In summary, a
maximum force of c 4 ~4G implies universal gravity. There is no difference between stating
that all bodies attract through gravitation and stating that there is a maximum force with
the value c 4 ~4G.
to move.
The curvature of space-time for weak gravitational fields is fixed by the inverse square
law of gravity. Space curvature is thus present in the right amount around each mass.
Ref. 402 As Richard Feynman explains, by extending this result to all possible observers, we can
deduce all low-curvature effects of gravitation. In particular, this implies the existence of
linear (low-amplitude) gravitational waves and of the Thirring–Lense effect. Linearized
general relativity thus follows from the maximum force principle.
late these trajectories.* To start, one needs to know the shape of space-time, the notion
of ‘shape’ being generalized from its familiar two-dimensional meaning. For a being liv-
ing on the surface, it is usually described by the metric дab , which defines the distances
between neighbouring points through
It is a famous exercise of calculus to show from this expression that a curve x a (s) de-
pending on a well behaved (affine) parameter s is a timelike or spacelike (metric) geodesic,
Challenge 842 ny i.e. the longest possible path between the two events,** only if
dx d 1 ∂дbc dx b dx c
(дad )=
d
, (323)
ds ds 2 ∂x a ds ds
d2 x a b c
+ Γ a dx dx
bc =0 (327)
du 2 du du
with the different condition
dx a dx b
дab =0. (328)
du du
* This is a short section for the more curious; it can be skipped at first reading.
** We remember that in space in everyday life, geodesics are the shortest possible paths; however, in space-
Copyright © Christoph Schiller November 1997–January 2007
time in general relativity, geodesics are the longest possible paths. In both cases, they are the ‘straightest’
possible paths.
*** This is often written as
d2 x a b
a dx dx
c
+ Γbc =0 (324)
ds 2 ds ds
where the condition
dx a dx b
дab =1 (325)
ds ds
must be fulfilled, thus simply requiring that all the tangent vectors are unit vectors, and that ds x 0 all along
the path. The symbols Γ appearing above are given by
Γ a bc = = д ad (∂ b дd c + ∂ c дdb − ∂ d дbc ) ,
a 1
(326)
bc 2
and are called Christoffel symbols of the second kind or simply the metric connection.
all observers – heavier mathematics 441
Given all these definitions of various types of geodesics, what are the lines drawn in
Challenge 844 ny Figure 193 on page 391?
Is gravity an interaction?
We tend to answer this question affirmatively, as in Galilean physics gravity was seen
as an influence on the motion of bodies. In fact, we described it by a potential, imply-
ing that gravity produces motion. But let us be careful. A force or an interaction is what
changes the motion of objects. However, we just saw that when two bodies attract each
other through gravitation, both always remain in free fall. For example, the Moon circles
the Earth because it continuously falls around it. Since any freely falling observer con-
tinuously remains at rest, the statement that gravity changes the motion of bodies is not
correct for all observers. Indeed, we will soon discover that in a sense to be discussed
shortly, the Moon and the Earth both follow ‘straight’ paths.
Is this correction of our idea of gravity only a question of words? Not at all. Since
gravity is not an interaction, it is not due to a field and there is no potential.
Page 692 Let us check this strange result in yet another way. The most fundamental definition Copyright © Christoph Schiller November 1997–January 2007
of ‘interaction’ is as the difference between the whole and the sum of its parts. In the case
* This definition was formalized by Arnowitt, Deser and Misner, and since then has often been called the
ADM mass. The idea is to use the metricдi j and to take the integral
(дi j, i ν j − дi i , j ν j )dA
1
m=
16π ∫SR
(329)
where S R is the coordinate sphere of radius R, ν is the unit vector normal to the sphere and dA is the area
element on the sphere. The limit exists if space-time is asymptotically flat and if the mass distribution is suffi-
Ref. 404 ciently concentrated. Mathematical physicists have also shown that for any manifold whose metric changes
at infinity as
дi j = (1 + f ~r + O(1~r ))δ i j
2
(330)
the total mass is given by M = 2 f .
442 iii gravitation and rel ativity • 8. motion of light and vacuum
of gravity, an observer in free fall could claim that nothing special is going on, independ-
ently of whether the other body is present or not, and could claim that gravity is not an
interaction.
However, that is going too far. An interaction transports energy between systems. We
have indeed seen that gravity can be said to transport energy only approximately. Grav-
itation is thus an interaction only approximately. But that is not a sufficient reason to
abandon this characterization. The concept of energy is not useful for gravity beyond the
domain of everyday life. For the general case, namely for a general observer, gravity is
thus fundamentally different from electricity or magnetism.
Another way to look at the issue is the following. Take a satellite orbiting Jupiter with
energy–momentum p = mu. If we calculate the energy–momentum change along its path
Challenge 846 ny s, we get
du a de a a du a
=m = m(e a u ) = me a ( + Γ a bd ub uc ) = 0
dp du
+ (331)
* Refraction, the slowdown of light inside matter, is not a counter-example. Strictly speaking, light inside
matter is constantly being absorbed and re-emitted. In between these processes, light still propagates with
the speed of light in vacuum. The whole process only looks like a slowdown in the macroscopic limit. The
same applies to diffraction and to reflection. A list of apparent ways to bend light can be found on page 580;
details of the quantum-mechanical processes at their basis can be found on page 745.
all observers – heavier mathematics 443
For special relativity, we found that all its main effects – such as a limit speed, Lorentz
contraction or energy–mass equivalence – are also found for dislocations in solids. Do sys-
tems analogous to general relativity exist? Attempts to find such systems have only been
partially successful. Several equations and ideas of general relativity are applicable to de-
formations of solids, since general relativity describes the deformation of the space-time
Ref. 406 mattress. Kröner has studied this analogy in great detail. Other systems with horizons,
and thus with observables analogous to curvature, are found in certain liquids – where
vortices play the role of black holes – and in certain quantum fluids for the propagation of
Ref. 407 light. Exploring such systems has become a research topic in its own right. A full analogy
of general relativity in a macroscopic system was discovered only a few years ago. This
analogy will be presented in the third part of our adventure; we will need an additional
ingredient that is not visible at this point of our ascent.
Riemann gymnastics
where, as usual, Latin indices a, b, c, d, etc. run from 0 to 3, as do Greek indices here,
and a summation is implied when an index name appears twice. Obviously R is a tensor,
of rank 4. This tensor thus describes only the intrinsic curvature of a space-time. In con-
trast, the metric д describes the complete shape of the surface, not only the curvature.
The curvature is thus the physical quantity of relevance locally, and physical descriptions
Copyright © Christoph Schiller November 1997–January 2007
therefore use only the Riemann** tensor R or quantities derived from it.***
* This is a short section for the more curious; it can be skipped at first reading.
** Bernhard Riemann (b. 1826 Breselenz, d. 1866 Selasca), important German mathematician.
*** We showed above that space-time is curved by noting changes in clock rates, in metre bar lengths and
in light propagation. Such experiments are the easiest way to determine the metric д. We know that space-
time is described by a 4-dimensional manifold M with a metric дab that locally, at each space-time point,
is a Minkowski metric. Such a manifold is called a Riemannian manifold. Only such a metric allows one to
define a local inertial system, i.e. a local Minkowski space-time at every space-time point. In particular, we
have
ab b a a
дab = 1~д and дa = д b = δ b . (333)
How are curvature and metric related? The solution to this question usually occupies a large number of
444 iii gravitation and rel ativity • 8. motion of light and vacuum
But we can forget the just-mentioned definition of curvature. There is a second, more
physical way to look at the Riemann tensor. We know that curvature means gravity. As
we said above, gravity means that when two nearby particles move freely with the same
Challenge 851 e velocity and the same direction, the distance between them changes. In other words, the
local effect of gravity is relative acceleration of nearby particles.
It turns out that the tensor R describes precisely this relative acceleration, i.e. what we
called the tidal effects earlier on. Obviously, the relative acceleration b increases with the
Challenge 852 ny separation d and the square (why?) of the speed u of the two particles. Therefore we can
also define R as a (generalized) proportionality factor among these quantities:
The components of the Riemann curvature tensor have the dimensions of inverse square
length. Since it contains all information about intrinsic curvature, we conclude that if R
∂Γ a bd ∂Γ a bc
R a bcd = − + Γ a ec Γ e bd − Γ a f d Γ f bc . (334)
∂x c ∂x d
The curvature tensor is built from the second derivatives of the metric. On the other hand, we can also
Copyright © Christoph Schiller November 1997–January 2007
determine the metric if the curvature is known. An approximate relation is given below.
* This second definition is also called the definition through geodesic deviation. It is of course not evident
Ref. 408 that it coincides with the first. For an explicit proof, see the literature. There is also a third way to picture
the tensor R, a more mathematical one, namely the original way Riemann introduced it. If one parallel-
transports a vector w around a parallelogram formed by two vectors u and v, each of length ε, the vector w
is changed to w + δw. One then has
2
δw = −ε R u v w + higher-order terms . (336)
More can be learned about the geodesic deviation by studying the behaviour of the famous south-pointing
Page 176 carriage. This device, common in China before the compass was discovered, only works if the world is flat.
Indeed, on a curved surface, after following a large closed path, it will show a different direction than at the
Challenge 854 ny start of the trip. Can you explain why?
** The free-fall definition shows that the Riemann tensor is symmetric in certain indices and antisymmetric
why can we see the stars? – motion in the universe 445
R bc = R a bac . (340)
Its components, like those of the Riemann tensor, are inverse square lengths. The values
a
of the tensor R bc , or those of R bcd , are independent of the sign convention used in the
Challenge 856 e Minkowski metric, in contrast to R abcd .
Challenge 857 ny Can you confirm the relation R abcd R abcd = 48m 2 ~r 6 for the Schwarzschild solution?
On clear nights, between two and five thousand stars are visible with the naked eye. Sev-
eral hundred of them have names. Indeed, in all parts of the world, the stars and the con-
stellations they form are seen as memories of ancient events, and stories are told about
”
them.** But the simple fact that we can see the stars is the basis for a story much more Copyright © Christoph Schiller November 1997–January 2007
Note that the order of the indices is not standardized in the literature. The list of invariants which can be
constructed from R is long. We mention that 21 ε abcd R cd e f R ab e f , namely the product R R of the Riemann
tensor with its dual, is the invariant characterizing the Thirring–Lense effect.
* ‘Two things fill the mind with ever new and increasing admiration and awe, the more often and persistently
Ref. 410 thought considers them: the starred sky above me and the moral law inside me.’
** About the myths around the stars and the constellations, see e.g. the text by G. Fasching, Sternbilder und
ihre Mythen, Springer Verlag, 1993. On the internet there are also the beautiful http://www.astro.wisc.edu/
~dolan/constellations/constellations.html and http://www.astro.uiuc.edu/~kaler/sow/sow.html websites.
446 iii gravitation and rel ativity • 9. why can we see the stars?
”
Ref. 411
The stars we see on a clear night are mainly the brightest of our nearest neighbours in
the surrounding region of the Milky Way. They lie at distances between four and a few
thousand light years from us. Roughly speaking, in our environment there is a star about
every 400 cubic light years.
Almost all visible stars are from our own galaxy. The only extragalactic object con-
stantly visible to the naked eye in the northern hemisphere is the so-called Andromeda
nebula, shown enlarged in Figure 210. It is a whole galaxy like our own, as Immanuel Kant
had already conjectured in 1755. Several extragalactic objects are visible with the naked
eye in the southern hemisphere: the Tarantula nebula, as well as the large and the small Copyright © Christoph Schiller November 1997–January 2007
Magellanic clouds. The Magellanic clouds are neighbour galaxies to our own. Other, tem-
porary exceptions are the rare novae, exploding stars which can be seen if they appear
in nearby galaxies, or the still rarer supernovae, which can often be seen even in faraway
galaxies.
In fact, the visible stars are special in other respects also. For example, telescopes show
that about half of them are in fact double: they consist of two stars circling around each
other, as in the case of Sirius. Measuring the orbits they follow around each other allows
Challenge 859 ny one to determine their masses. Can you explain how?
Is the universe different from our Milky Way? Yes, it is. There are several arguments to
demonstrate this. First of all, our galaxy – th word galaxy is just the original Greek term
for ‘Milky Way’ – is flattened, because of its rotation. If the galaxy rotates, there must be
other masses which determine the background with respect to which this rotation takes
why can we see the stars? – Motion in the universe 447
place. In fact, there is a huge number of other galaxies – about 1011 – in the universe, a
Clouds are systems in which the matter density diminishes with the distance from the
centre, with no definite border and with no definite size. Most astrophysical objects are
best described as clouds.
The Earth is also a cloud, if we take its atmosphere, its magnetosphere and the dust
ring around it as part of it. The Sun is a cloud. It is a gas ball to start with, but is even
more a cloud if we take into consideration its protuberances, its heliosphere, the solar
wind it generates and its magnetosphere. The solar system is a cloud if we consider its
comet cloud, its asteroid belt and its local interstellar gas cloud. The galaxy is a cloud if
* The Milky Way, or galaxy in Greek, was said to have originated when Zeus, the main Greek god, tried to let
his son Heracles feed at Hera’s breast in order to make him immortal; the young Heracles, in a sign showing
his future strength, sucked so forcefully that the milk splashed all over the sky.
448 iii gravitation and rel ativity • 9. why can we see the stars?
F I G U R E 213 The colliding galaxies M51 and M51B (NASA), 65 000 al across, 31 Mal
away.
we remember its matter distribution and the cloud of cosmic radiation it is surrounded
by. In fact, even people can be seen as clouds, as every person is surrounded by gases,
little dust particles from skin, vapour, etc.
Ref. 412 In the universe, almost all clouds are plasma clouds. A plasma is an ionized gas, such
why can we see the stars? – Motion in the universe 449
F I G U R E 215 Rotating clouds emitting jets along their axis; top row: a
composite image (visible and infrared) of the galaxy 0313-192, the galaxy
3C296, and the Vela pulsar; bottom row: the star in formation HH30, the
star in formation DG Tauri B, and a black hole jet from the galaxy M87 (all
Copyright © Christoph Schiller November 1997–January 2007
NASA)
as fire, lightning, the inside of neon tubes, or the Sun. At least 99.9 % of all matter in the
universe is in the form of plasma clouds. Only a very small percentage exists in solid or
liquid form, such as toasters, subways or their users.
Clouds in the universe have certain common properties. First, clouds seen in the uni-
verse, when undisturbed by collisions or other interactions from neighbouring objects,
are rotating. Most clouds are therefore flattened and are in shape of discs. Secondly, in
many rotating clouds, matter is falling towards the centre: most clouds are accretion discs.
Finally, undisturbed accretion discs usually emit something along the rotation axis: they
possess jets. This basic cloud structure has been observed for young stars, for pulsars, for
450 iii gravitation and rel ativity • 9. why can we see the stars?
galaxies, for quasars and for many other systems. Figure 215 gives some examples. (Does
the Sun have a jet? Does the Milky Way have a jet? So far, none has been detected – there
Challenge 861 r is still room for discovery.)
In summary, at night we see mostly rotating, flattened plasma clouds emitting jets
along their axes. A large part of astronomy and astrophysics collects information about
Ref. 413 them. An overview about the observations is given in Table 37.
A sp e c t Main Va l u e
properties
Phenomena
galaxy formation observed by Hubble several times
trigger event unknown
A sp e c t Main Va l u e
properties
galaxy groups size 100 Zm
number of galaxies between a dozen and 1000
our local group number of galaxies 30
galaxies size 0.5 to 2 Zm
number c. 1011 inside horizon
containing 10 to 400 globular clusters
containing typically 1011 stars each
containing typically one supermassive and several
intermediate-mass black holes
our galaxy diameter 1.0(0.1) Zm
mass 1042 kg or 5 ë 1011 solar masses Ref. 414
A sp e c t Main Va l u e
properties
emitters of X-ray X-ray emission
bursts
pulsars periodic radio emission
mass up to around 25 solar masses
magnetars high magnetic fields up to 1011 T and higher Ref. 418
(soft gamma repeaters, anomalous X-ray pulsars)
mass above 25 solar masses Ref. 419
black holes horizon radius r = 2GM~c 2 , observed mass range
from 1 to 100 million solar masses
General properties
cosmic horizon distance c. 1026 m = 100 Ym
energy density
neutrinos energy density Ω ν unknown
average temperature photons 2.725(2) K
neutrinos not measured, predicted value is 2 K
perturbations photon anisotropy ∆T~T = 1 ë 10−5
density amplitude A = 0.8(1)
spectral index n = 0.97(3)
tensor-to-scalar ratio r < 0.53 with 95% confidence
ionization optical depth τ = 0.15(7)
decoupling z = 1100
why can we see the stars? – Motion in the universe 453
But while we are speaking of what we see in the sky, we need to clarify a general issue.
The term universe implies turning. The universe is what turns around us at night. For a
physicist, at least three definitions are possible for the term ‘universe’:
”
— The (visible) universe is the totality of all observable mass and energy. This includes
everything inside the cosmological horizon. Since the horizon is moving away from
Copyright © Christoph Schiller November 1997–January 2007
us, the amount of observable mass and energy is constantly increasing. The content of
the term ‘visible universe’ is thus not fixed in time. (What is the origin of this increase?
We will come back to this issue later on.)
— The (believed) universe is the totality of all mass and energy, including any that is not
visible. Numerous books on general relativity state that there definitely exists matter
or energy beyond the observation boundaries. We will explain the origin of this belief
below.
— The (full) universe is the sum of matter and energy as well as space-time itself.
These definitions are often mixed up in physical and philosophical discussions. There
is no generally accepted consensus on the terms, so one has to be careful. In this text,
when we use the term ‘universe’, we imply the last definition only. We will discover re-
454 iii gravitation and rel ativity • 9. why can we see the stars?
F I G U R E 217 An atlas of our cosmic environment: illustrations at scales up to 12.5, 50, 250, 5 000, 50 000,
500 000, 5 million, 100 million, 1 000 million and 14 000 million light years (© Richard Powell, http://
www.atlasoftheuniverse.com)
why can we see the stars? – Motion in the universe 455
peatedly that without clear distinction between the definitions the complete ascent of
Motion Mountain becomes impossible. (For example: Is the amount of matter and en-
Challenge 862 ny ergy in the full universe the same as in the visible universe?)
Note that the ‘size’ of the visible universe, or better, the distance to its horizon, is a
quantity which can be imagined. The value of 1026 m is not beyond imagination. If one
took all the iron from the Earth’s core and made it into a wire reaching to the edge of the
Challenge 863 ny visible universe, how thick would it be? The answer might surprise you. Also, the content
of the universe is clearly finite. There are about as many visible galaxies in the universe as
there are grains in a cubic metre of sand. To expand on the comparison, can you deduce
how much space you would need to contain all the flour you would get if every little speck
Challenge 864 ny represented one star?
Obviously, the universe is full of motion. To get to know the universe a bit, it is useful to
measure the speed and position of as many objects in it as possible. In the twentieth cen-
”
tury, a large number of such observations were obtained from stars and galaxies. (Can you
Challenge 865 ny imagine how distance and velocity are determined?) This wealth of data can be summed
up in two points.
First of all, on large scales, i.e. averaged over about five hundred million light years,
the matter density in the universe is homogeneous and isotropic. Obviously, at smaller
scales inhomogeneities exist, such as galaxies or cheesecakes. Our galaxy for example is
Ref. 420 neither isotropic nor homogeneous. But at large scales the differences average out. This
large-scale homogeneity of matter distribution is often called the cosmological principle.
The second point about the universe is even more important. In the 1920s, independ-
Ref. 421 ently, Carl Wirtz, Knut Lundmark and Gustaf Stromberg showed that on the whole, galax-
ies move away from the Earth, and the more so, the more they were distant. There are a
few exceptions for nearby galaxies, such as the Andromeda nebula itself; but in general,
the speed of flight v of an object increases with distance d. In 1929, the US-American as-
tronomer Edwin Hubble** published the first measurement of the relation between speed
and distance. Despite his use of incorrect length scales he found a relation
v=Hd,
Copyright © Christoph Schiller November 1997–January 2007
(341)
where the proportionality constant H is today called the Hubble constant. A graph of
the relation is given in Figure 218. The Hubble constant is known today to have a value
around 71 km s−1 Mpc−1 . (Hubble’s own value was so far from this value that it is not cited
* ‘Verily, at first chaos came to be ...’. The Theogony, attributed to the probably mythical Hesiodos, was final-
ized around 700 bce. It can be read in English and Greek on the http://www.perseus.tufts.edu website. The
famous quotation here is from verse 117.
** Edwin Powell Hubble (1889–1953), important US-American astronomer. After being an athlete and tak-
ing a law degree, he returned to his childhood passion of the stars; he finally proved Immanuel Kant’s 1755
conjecture that the Andromeda nebula was a galaxy like our own. He thus showed that the Milky Way is
only a tiny part of the universe.
456 iii gravitation and rel ativity • 9. why can we see the stars?
any more.) For example, a star at a distance of 2 Mpc* is moving away from Earth with a
speed between of around 142 km~s, and proportionally more for stars further away.
In fact, the discovery by Wirtz, Lundmark and Stromberg implies that every galaxy
Challenge 866 ny moves away from all the others. (Why?) In other words, the matter in the universe is ex-
panding. The scale of this expansion and the enormous dimensions involved are amazing.
The motion of all the thousand million galaxy groups in the sky is described by the single
equation (341)! Some deviations are observed for nearby galaxies, as mentioned above,
and for faraway galaxies, as we will see.
The cosmological principle and the expansion taken together imply that the universe
cannot have existed before time when it was of vanishing size; the universe thus has a
finite age. Together with the evolution equations, as explained in more detail below, the
Hubble constant points to an age value of around 13 700 million years. The expansion also
means that the universe has a horizon, i.e. a finite maximum distance for sources whose
signals can arrive on Earth. Signals from sources beyond the horizon cannot reach us.
Since the universe is expanding, in the past it has been much smaller and thus much
denser than it is now. It turns out that it has also been hotter. George Gamow** predicted
Copyright © Christoph Schiller November 1997–January 2007
Ref. 422 in 1948 that since hot objects radiate light, the sky cannot be completely black at night,
but must be filled with black-body radiation emitted when it was ‘in heat’. That radiation,
called the background radiation, must have cooled down due to the expansion of the uni-
Challenge 867 ny verse. (Can you confirm this?) Despite various similar predictions by other authors, in
one of the most famous cases of missed scientific communication, the radiation was
found only much later, by two researchers completely unaware of all this work. A fam-
Ref. 423 ous paper in 1964 by Doroshkevich and Novikov had even stated that the antenna used
by the (unaware) later discoverers was the best device to search for the radiation! In any
case, only in 1965 did Arno Penzias and Robert Wilson discover the radiation. It was in
one of the most beautiful discoveries of science, for which both later received the Nobel
Ref. 424 Prize for physics. The radiation turns out to be described by the black-body radiation for
a body with a temperature of 2.7 K; it follows the black-body dependence to a precision
of about 1 part in 104 .
But apart from expansion and cooling, the past fourteen thousand million years have
also produced a few other memorable events.
Stars seem to be there for ever. In fact, every now and then a new star appears in the sky: ”
they shine because they are hot. They are hot because of nuclear reactions in their interior.
Page 944 We will discuss these processes in more detail in the chapter on the nucleus.
The adventures the universe has experienced, or better, the adventures the matter and radi- ”
Ref. 426 ation inside it have experienced, are summarized in Table 38. The steps not yet discussed
will be studied in quantum theory. This history table has applications no theoretical phys-
icist would have imagined. The sequence is so beautiful and impressive that nowadays it
is used in certain psychotherapies to point out to people the story behind their existence
and to remind them of their own worth. Enjoy.
Ti m e Ti m e Event Te m pe r -
from from big at u r e
n o wa b a n gb
14 ë 109 a t Pl b
Time, space, matter and initial conditions 1032 K TPl
indeterminate
13 ë 109 a c. 800 t Pl Distinction of space-time from matter and radiation, 1030 K
10−42 s initial conditions determinate
10−35 s to Inflation & GUT epoch starts; strong and 5 ë 1026 K
10−32 s electroweak interactions diverge
10−12 s Antiquarks annihilate; electromagnetic and weak 1015 K
interaction separate
why can we see the stars? – Motion in the universe 459
Ti m e Ti m e Event Te m pe r -
from from big at u r e
n o wa b a n gb
2 ë 10−6 s Quarks get confined into hadrons; universe is a 1013 K
plasma
Positrons annihilate
0.3 s Universe becomes transparent for neutrinos 1010 K
a few seconds Nucleosynthesis: D, 4 He, 3 He and 7 Li nuclei form; 109 K
radiation still dominates
2500 a Matter domination starts; density perturbations 75 000 K
magnify
z = 1100 380 000 a Recombination: during these latter stages of the big 3000 K
bang, H, He and Li atoms form, and the universe
becomes ‘transparent’ for light, as matter and
Ti m e Ti m e Event Te m pe r -
from from big at u r e
n o wa b a n gb
800 ë 106 a Earth is completely covered with ice for the first time
(reason still unknown) Ref. 428
600 to Earth is completely covered with ice for the last time
540 ë 106 a
540(5) ë 106 a Paleozoic era (Palaeozoicum, ‘age of old life’) starts,
after a gigantic ice age: animals appear, oldest fossils
(with 540(5) start of Cambrian, 495(5) Ordovician,
440(5) Silurian, 417(5) Devonian, 354(5)
Carboniferous and 292(5) Permian periods)
450 ë 106 a Land plants appear
370 ë 106 a Wooden trees appear
Alluvium) epoch)
50 ë 106 a Large mammals appear
7(1) ë 106 a Hominids appears
3 ë 106 a Supernova explodes, with following consequences:
more intense cosmic radiation, higher formation rate
of clouds, Earth cools down drastically, high
evolutionary pressure on the hominids and as a
result, Homo appears Ref. 429
500 000 a Formation of youngest stars in galaxy
500 000 a Homo sapiens appears
100 000 a Beginning of last ice age
90 000 a Homo sapiens sapiens appears
why can we see the stars? – Motion in the universe 461
Ti m e Ti m e Event Te m pe r -
from from big at u r e
n o wa b a n gb
11 800 a End of last ice age, start of Holocene
6 000 a First written texts
2 500 a Physics starts
500 a Use of coffee, pencil and modern physics starts
200 a Electricity use begins
100 a Einstein publishes
10 to 120 a You are a unicellular being
present c. 14 ë 109 a You are reading this Tγ = 2.73 K,
Tν 1.6 K,
Tb 0 K
a. The time coordinate used here is the one given by the coordinate system defined by the microwave back-
ground radiation, as explained on page 465. A year is abbreviated ‘a’ (Latin ‘annus’). Errors in the last digits
are given between parentheses.
b. This quantity is not exactly defined since the big bang is not a space-time event. More on this issue on
page 1055.
c. The history of the atoms on Earth shows that we are made from the leftovers of a supernova. We truly are
made of stardust.
The geological time scale is the one of the International Commission on Stratigraphy; the times are
measured through radioactive dating.
Despite its length and its interest, this table has its limitations. For example, what
happened elsewhere in the last few thousand million years? There is still a story to be
written of which next to nothing is known. For obvious reasons, investigations have been
rather Earth-centred.
Research in astrophysics is directed at discovering and understanding all phenomena
observed in the skies. Here we skip most of this fascinating topic, since as usual, we want
to focus on motion. Interestingly, general relativity allows us to explain many of the gen-
eral observations about motion in the universe. Copyright © Christoph Schiller November 1997–January 2007
Challenge 868 n
“ point in various directions, all with the same
speed. While running, one rabbit turns its head,
and makes a startling observation. What does it
see?
”
The data showing that the universe is sprinkled with stars all over lead to a simple conclu-
sion: the universe cannot be static. Gravity always changes the distances between bodies;
the only exceptions are circular orbits. Gravity also changes the average distances between
bodies: gravity always tries to collapse clouds. The biggest cloud of all, the one formed by
all the matter in the universe, must therefore either be collapsing, or still be in expansion.
462 iii gravitation and rel ativity • 9. why can we see the stars?
Ref. 430 The first to dare to draw this conclusion was Aleksander Friedmann.* In 1922 he de-
duced the detailed evolution of the universe in the case of homogeneous, isotropic mass
distribution. His calculation is a classic example of simple but powerful reasoning. For a
universe which is homogeneous and isotropic for every point, the line element is given
Challenge 869 ny by
ds 2 = c 2 dt 2 − a 2 (t)(dx 2 + dy 2 + dz 2 ) (342)
and matter is described by a density ρ M and a pressure p M . Inserting all this into the field
equations, we get two equations
ȧ 2 k
+ 2 =
8πG Λ
ρM + and (343)
a a 3 3
ä = − (ρ M + 3p M ) a + a
4πG Λ
(344)
3 3
3H 02
ρc = (8 2) ë 10−27 kg~m3 (346)
8πG
corresponding to about 8, give or take 2, hydrogen atoms per cubic metre. On Earth, one
would call this value an extremely good vacuum. Such are the differences between every-
day life and the universe as a whole. In any case, the critical density characterizes a matter
distribution leading to an evolution of the universe just between never-ending expansion
Copyright © Christoph Schiller November 1997–January 2007
and collapse. In fact, this density is the critical one, leading to a so-called marginal evolu-
tion, only in the case of vanishing cosmological constant. Despite this restriction, the term
is now used for this expression in all other cases as well. One thus speaks of dimensionless
* Aleksander Aleksandrowitsch Friedmann (1888–1925), Russian physicist who predicted the expansion of
the universe. Following his early death from typhus, his work remained almost unknown until Georges A.
Lemaître (b. 1894 Charleroi, d. 1966 Leuven), Belgian priest and cosmologist, took it up and expanded
it in 1927, focusing, as his job required, on solutions with an initial singularity. Lemaître was one of the
propagators of the (erroneous!) idea that the big bang was an ‘event’ of ‘creation’ and convinced his whole
Page 471, page 472 organization of it. The Friedmann–Lemaître solutions are often erroneously called after two other physicists,
who studied them again much later, in 1935 and 1936, namely H.P. Robertson and A.G. Walker.
why can we see the stars? – Motion in the universe 463
The cosmological constant can also be related to this critical density by setting
Λc 2 Λc 2
ΩΛ = = =
ρΛ
. (348)
ρ c 8πGρ c 3H 02
ΩM + ΩΛ + ΩK = 1 . (350) 2 experimental
values
on
In the past, when data were lacking, physicists nsi
e xpa ion
were divided into two camps: the claustrophobics 1 ed ans
rat
believing that Ω K A 0 and the agoraphobics believ- ccele ed exp
a rat
ing that Ω K < 0. More details about the measured
ΩΛ ele
dec
ansion
eternal exp
values of these parameters will be given shortly. 0
collapse
The diagram of Figure 220 shows the most inter-
clo t
fla en
se
-1 too
ive
scale scale
factor factor
a a
ct
a(t) present
l Pl
experimental
quantum uncertainties
effects
t Pl time t
present time t
q0 = − = ΩM − ΩΛ .
ä 0 1
(352)
a 0 H 02 2
The parameter q 0 is positive if the expansion is slowing down, and negative if the expan-
sion is accelerating. These possibilities are also shown in the diagram.
An even clearer way to picture the expansion of the universe for vanishing pressure is
to rewrite equation (343) using τ = t H 0 and x(τ) = a(t)~a(t 0 ), yielding
( ) + U(x) = Ω K
dx 2
dτ
with U(x) = −Ω Λ x − Ω Λ x 2 (353)
Copyright © Christoph Schiller November 1997–January 2007
This looks like the evolution equation for the motion of a particle with mass 1, with total
energy Ω K in a potential U(x). The resulting evolutions are easily deduced.
For vanishing Ω Λ , the universe either expands for ever, or recollapses, depending on
the value of the mass–energy density.
For non-vanishing (positive) Ω Λ , the potential has exactly one maximum; if the
particle has enough energy to get over the maximum, it will accelerate continuously. That
is the situation the universe seems to be in today.
For a certain time range, the result is shown in Figure 221. There are two points to be
noted: first the set of possible curves is described by two parameters, not one. In addition,
lines cannot be drawn down to zero size. There are two main reasons: we do not yet
understand the behaviour of matter at very high energy, and we do not understand the
behaviour of space-time at very high energy. We return to this important issue later on.
why can we see the stars? – Motion in the universe 465
k = –1
k=0
k = +1
F I G U R E 222 The long-term evolution of the universe’s scale factor a for various parameters
The main conclusion to be drawn from Friedmann’s work is that a homogeneous and
isotropic universe is not static: it either expands or contracts. In either case, it has a finite
age. This profound idea took many years to spread around the cosmology community;
even Einstein took a long time to get accustomed to it.
Note that due to its isotropic expansion, the universe has a preferred reference frame:
the frame defined by average matter. The time measured in that frame is the time listed
in Table 38 and is the one we assume when we talk about the age of the universe.
An overview of the possibilities for the long time evolution is given in Figure 222. Copyright © Christoph Schiller November 1997–January 2007
The evolution can have various outcomes. In the early twentieth century, people decided
among them by personal preference. Albert Einstein first preferred the solution k = 1 and
Λ = a −2 = 4πGρ M . It is the unstable solution found when x(τ) remains at the top of the
potential U(x).
In 1917, the Dutch physicist Willem de Sitter had found, much to Einstein’s personal
dismay, that an empty universe with ρ M = p M = 0 and k = 1 is also possible. This type of
Challenge 872 ny universe expands for large times. The de Sitter universe shows that in special cases, matter
is not needed for space-time to exist.
Lemaître had found expanding universes for positive mass, and his results were also
contested by Einstein at first. When later the first measurements confirmed the calcu-
lations, the idea of a massive and expanding universe became popular. It became the
standard model in textbooks. However, in a sort of collective blindness that lasted from
466 iii gravitation and rel ativity • 9. why can we see the stars?
First of all, the sky is not black at night. It has the same intrinsic colour as during the day,
as any long-exposure photograph shows. (See, for example, Figure 63.) But that colour,
like the colour of the sky during the day, is not due to the temperature of the sky, but
”
to scattered light from the stars. If we look for the real colour of the sky, we need to
look for its thermal radiation. Indeed, measurements show that even the empty sky is
not completely cold or black at night. It is filled with radiation of around 200 GHz; more
precise measurements show that the radiation corresponds to the thermal emission of a
body at 2.73 K. This background radiation is the thermal radiation left over from the big
Copyright © Christoph Schiller November 1997–January 2007
bang.
Ref. 431 The universe is indeed colder than the stars. But why is this so? If the universe were
homogeneous on large scales and infinitely large, it would have an infinite number of
stars. Looking in any direction, we would see the surface of a star. The night sky would
Challenge 874 n be as bright as the surface of the Sun! Can you convince your grandmother about this?
In a deep forest, one sees a tree in every direction. Similarly, in a ‘deep’ universe, we
would see a star in every direction. Now, the average star has a surface temperature of
Challenge 873 ny * In this case, for Ω M E 1, the age of the universe follows t 0 D 2~(3H 0 ), where the limits correspond. For
vanishing mass density one has t 0 = 1~H o .
** ‘At night, a person is dressed only with a nightgown, and directly under it there is the character.’ Robert
Musil (b. 1880 Klagenfurt, d. 1942 Geneva), German writer.
why can we see the stars? – Motion in the universe 467
about 6000 K. If we lived in a deep and old universe, we would effectively live inside an
oven with a temperature of around 6000 K, making it impossible to enjoy ice cream.
This paradox was most clearly formulated in 1823 by the astronomer Wilhelm Olbers.*
As he extensively discussed the question, it is also called Olbers’ paradox. Today we know
that even if all matter in the universe were converted into radiation, the universe would
still not be as bright as just calculated. In other words, the power and lifetime of stars are
Ref. 432 much too low to produce the oven brightness just mentioned. So something is wrong.
In fact, two main effects can be invoked to avoid the contradiction. First, since the uni-
verse is finite in age, distant stars are shining for less time. We see them in a younger stage
or even during their formation, when they were darker. As a result, the share of bright-
ness of distant stars is smaller than that of nearby stars, so that the average temperature
of the sky is reduced.** Secondly, we could imagine that the radiation of distant stars is
red-shifted and that the volume the radiation must fill is increasing continuously, so that
the average temperature of the sky is also reduced.
In a young universe, we would thus not be able to see the stars, even if they existed.
From the brightness of the sky at night, measured to be about 3 ë 10−13 times that of an
average star like the Sun, we can deduce something interesting: the density of stars in the
universe must be much smaller than in our galaxy. The density of stars in the galaxy can
be deduced by counting the stars we see at night. But the average star density in the galaxy
would lead to much higher values for the night brightness if it were constant throughout
Copyright © Christoph Schiller November 1997–January 2007
Ref. 432 the universe. We can thus deduce that the galaxy is much smaller than the universe simply
by measuring the brightness of the night sky and by counting the stars in the sky! Can
Challenge 877 ny you make the explicit calculation?
* Heinrich Wilhelm Matthäus Olbers (b. 1758 Arbergen, d. 1840 Bremen), astronomer. He discovered two
planetoids, Pallas and Vesta, and five comets; he developed the method of calculating parabolic orbits
for comets which is still in use today. Olbers also actively supported the mathematician and astronomer
Page 103 Friedrich Wilhelm Bessel in his career choice. The paradox is named after Olbers, though others had made
similar points before, such as the Swiss astronomer Jean Philippe Loÿs de Cheseaux in 1744 and Johannes
Kepler in 1610.
** Can you explain that the sky is not black just because it is painted black or made of black chocolate? Or
more generally, that the sky is not made of and does not contain any dark and cold substance, as Olbers
468 iii gravitation and rel ativity • 9. why can we see the stars?
In summary, the sky is black at night because space-time and matter are of finite, but
Challenge 878 ny old age. As a side issue, here is a quiz: is there an Olbers’ paradox also for gravitation?
“ feel small?
- I can feel small without any help from the
universe.
Anonymous
Sometimes the history of the universe is summed up in two words: bang!...crunch. But
will the universe indeed recollapse, or will it expand for ever? Or is it in an intermediate,
marginal situation? The parameters deciding its fate are the mass density and cosmolo-
”
gical constant.
The main news of the last decade of twentieth-century astrophysics are the experi-
where the errors are of the order of 0.1 or less. The values imply that the universe is spatially
flat, its expansion is accelerating and there will be no big crunch. However, no definite
Page 477 statement on the topology is possible. We will return to this last issue shortly.
Copyright © Christoph Schiller November 1997–January 2007
In particular, the data show that the density of matter, including all dark matter, is only
about one third of the critical value.* Two thirds are given by the cosmological term. For
the cosmological constant Λ one gets the value
3H 02
Λ = ΩΛ 10−52 ~m2 . (356)
c2
This value has important implications for quantum theory, since it corresponds to a va-
cuum energy density
Λc 4 10−46 (GeV)4
ρΛ c2 = 0.5 nJ~m3 .
(ħc)3
(357)
8πG
But the cosmological term also implies a negative vacuum pressure p Λ = −ρ Λ c 2 . Inserting
Page 435 this result into the relation for the potential of universal gravity deduced from relativity
a= − c r = 2 − Ω Λ H 02 r ,
GM Λ 2 GM
(360)
r 2 3 r
which shows that a positive vacuum energy indeed leads to a repulsive gravitational effect.
Inserting the mentioned value (356) for the cosmological constant Λ we find that the
repulsive effect is small even for the distance between the Earth and the Sun. In fact, the
order of magnitude of the repulsive effect is so much smaller than that of attraction that
one cannot hope for a direct experimental confirmation of this deviation from universal
Challenge 880 ny gravity at all. Probably astrophysical determinations will remain the only possible ones.
A positive gravitational constant manifests itself through a positive component in the
expansion rate, as we will see shortly.
But the situation is puzzling. The origin of this cosmological constant is not explained
by general relativity. This mystery will be solved only with the help of quantum theory.
In any case, the cosmological constant is the first local and quantum aspect of nature
detected by astrophysical means.
Ref. 438 popular books in order to explain rain? No. Even if it were filled with air, the total mass
would never have allowed the universe to reach the present size; it would have recollapsed
Challenge 881 ny much earlier and we would not exist.
The universe is thus transparent because it is mostly empty. But why is it so empty? First
of all, in the times when the size of the universe was small, all antimatter annihilated with
the corresponding amount of matter. Only a tiny fraction of matter, which originally was
slightly more abundant than antimatter, was left over. This 10−9 fraction is the matter we
Page 963 see now. As a consequence, there are 109 as many photons in the universe as electrons or
quarks.
In addition, 380 000 years after antimatter annihilation, all available nuclei and elec-
trons recombined, forming atoms, and their aggregates, like stars and people. No free
470 iii gravitation and rel ativity • 9. why can we see the stars?
charges interacting with photons were lurking around any more, so that from that period
onwards light could travel through space as it does today, being affected only when it hits
a star or dust particle.
If we remember that the average density of the universe is 10−26 kg~m3 and that most
of the matter is lumped by gravity in galaxies, we can imagine what an excellent va-
cuum lies in between. As a result, light can travel along large distances without noticeable
hindrance.
But why is the vacuum transparent? That is a deeper question. Vacuum is transparent
because it contains no electric charges and no horizons: charges or horizons are indispens-
able in order to absorb light. In fact, quantum theory shows that vacuum does contain
Page 876 so-called virtual charges. However, virtual charges have no effects on the transmission of
light.
“
Μελέτη θανάτου.
Plato, Phaedo, 81a.
Above all, the big bang model, which is deduced from the colour of the stars and galaxies,
states that about fourteen thousand million years ago the whole universe was extremely
”
Page 706 small. This fact gave the big bang its name. The term was created (with a sarcastic un-
Ref. 439 dertone) in 1950 by Fred Hoyle, who by the way never believed that it applies to nature.
Nevertheless, the term caught on. Since the past smallness of the universe be checked
directly, we need to look for other, verifiable consequences. The central ones are the fol-
lowing:
— all matter moves away from all other matter;
— the mass of the universe is made up of about 75% hydrogen and 23% helium;
— there is thermal background radiation of about 2.7 K;
— the maximal age for any system in the universe is around fourteen thousand million
years;
— there are background neutrinos with a temperature of about 2 K;*
— for non-vanishing cosmological constant, Newtonian gravity is slightly reduced.
All predictions except the last two have been confirmed by observations. Technology will
probably not allow us to check the last two in the foreseeable future; however, there is no
Copyright © Christoph Schiller November 1997–January 2007
* The theory states that Tν ~Tγ (4~11)1~3 . These neutrinos appeared about 0.3 s after the big bang.
why can we see the stars? – Motion in the universe 471
which of course is incorrect, as just explained. But the analogy is better than it looks: in
fact, there is no precise North Pole, since quantum theory shows that there is a funda-
mental indeterminacy as to its position. There is also a corresponding indeterminacy for
the big bang.
In fact, it does not take more than three lines to show with quantum theory that time
and space are not defined either at or near the big bang. We will give this simple argument
Page 1023 in the first chapter of the third part of our mountain ascent. The big bang therefore cannot
be called a ‘beginning’ of the universe. There never was a time when the scale factor a(t)
of the universe was zero.
* Many physicists are still wary of making such strong statements on this point. The first sections of the third
Page 1018 part of our mountain ascent give the precise arguments leading to them.
472 iii gravitation and rel ativity • 9. why can we see the stars?
This conceptual mistake is frequently encountered. In fact, quantum theory shows that
near the big bang, events can neither be ordered nor even be defined. More bluntly, there
is no beginning; there has never been an initial event or singularity.
Obviously the concept of time is not defined ‘outside’ or ‘before’ the existence of the
Ref. 441 universe; this fact was already clear to thinkers over a thousand years ago. It is then tempt-
ing to conclude that time must have started. But as we saw, that is a logical mistake as well:
first of all, there is no starting event, and secondly, time does not flow, as clarified already
Page 49 in the beginning of our walk.
A similar mistake lies behind the idea that the universe had certain ‘initial condi-
Page 155 tions.’ Initial conditions by definition make sense only for objects or fields, i.e. for entities
which can be observed from the outside, i.e. for entities which have an environment. The
universe does not comply with this requirement; it thus cannot have initial conditions.
Nevertheless, many people still insist on thinking about this issue; interestingly, Stephen
Ref. 442 Hawking sold millions of copies of a book explaining that a description without initial
Creation, i.e. the appearance of something out of nothing, needs an existing concept of
A witch hunter
”
Page 699 space and time to make sense. The concept of ‘appearance’ makes no sense otherwise. But
whatever the description of the big bang, be it classical, as in this chapter, or quantum
mechanical, as in later ones, this condition is never fulfilled. Even in the present, classical
description of the big bang, which gave rise to its name, there is no appearance of mat-
ter, nor of energy, nor of anything else. And this situation does not change in any later,
improved description, as time or space are never defined before the appearance of matter.
In fact, all properties of a creation are missing: there is no ‘moment’ of creation, no
appearance from nothing, no possible choice of any ‘initial’ conditions out of some set of
possibilities, and as we will see in more detail later on, not even any choice of particular
physical ‘laws’ from any set of possibilities.
Copyright © Christoph Schiller November 1997–January 2007
In summary, the big bang does not imply nor harbour a creation process. The big bang
was not an event, not a beginning and not a case of creation. It is impossible to continue
Challenge 882 ny the ascent of Motion Mountain if one cannot accept each of these three conclusions. To
deny them is to continue in the domain of beliefs and prejudices, thus effectively giving
up on the mountain ascent.
Note that this requirement is not new. In fact, it was already contained in equation (1)
Page 51 at the start of our walk, as well as in all the following ones. It appears even more clearly at
this point. But what then is the big bang? We’ll find out in the third part. We now return
to the discussion of what the stars can tell us about nature.
* This statement will still provoke strong reactions among physicists; it will be discussed in more detail in
the section on quantum theory.
why can we see the stars? – Motion in the universe 473
* Air scattering makes the sky blue also at night, as can be proven by long-exposure photographs. (See, for
example, Figure 63.) However, our eyes are not able to perceive this, and the low levels of light make it appear
black to us.
474 iii gravitation and rel ativity • 9. why can we see the stars?
C l as s Te m pe r - Example L o c at i o n Colour
at u r e
O 30 kK Mintaka δ Orionis blue-violet
O 31(10) kK Alnitak ζ Orionis blue-violet
B 22(6) kK Bellatrix γ Orionis blue
B 26 kK Saiph κ Orionis blue-white
B 12 kK Rigel β Orionis blue-white
B 25 kK Alnilam ε Orionis blue-white
B 17(5) kK Regulus α Leonis blue-white
A 9.9 kK Sirius α Canis Majoris blue-white
A 8.6 kK Megrez δ Ursae Majoris white
A 7.6(2) kK Altair α Aquilae yellow-white
Note. White dwarfs, or class-D stars, are remnants of imploded stars, with a size of only a few tens of
kilometres. Not all are white; they can be yellow or red. They comprise 5% of all stars. None is visible with
the naked eye. Temperature uncertainties in the last digit are given between parentheses.
The size of all other stars is an independent variable and is sometimes added as roman numerals at the end
of the spectral type. (Sirius is an A1V star, Arcturus a K2III star.) Giants and supergiants exist in all classes
from O to M.
To accommodate brown dwarfs, two new star classes, L and T, have been proposed.
shift due to a relative speed with respect to the observer, and the gravitational red-shift.
Not all stars are good approximations to black bodies, so that the black-body radiation
Page 603 law does not always accurately describe their colour. However, most stars are reasonable
approximations of black bodies. The temperature of a star depends mainly on its size,
Ref. 443 its mass, its composition and its age, as astrophysicists are happy to explain. Orion is a
good example of a coloured constellation: each star has a different colour. Long-exposure
Page 73 photographs beautifully show this.
The basic colour determined by temperature is changed by two effects. The first, the
Challenge 883 ny Doppler red-shift z, depends on the speed v between source and observer as
½
z= = −1=
∆λ fS c+v
−1 . (361)
λ fO c−v
why can we see the stars? – Motion in the universe 475
Such shifts play a significant role only for remote, and thus faint, stars visible through the
telescope. With the naked eye, Doppler shifts cannot be seen. But Doppler shifts can make
distant stars shine in the infrared instead of in the visible domain. Indeed, the highest
Doppler shifts observed for luminous objects are larger than 5.0, corresponding to a re-
Challenge 884 ny cessional speed of more than 94 % of the speed of light. Note that in the universe, the
red-shift is also related to the scale factor R(t) by
R(t 0 )
z= −1 .
R(t emission )
(362)
Light at a red-shift of 5.0 was thus emitted when the universe was one sixth of its present
age.
The other colour-changing effect, the gravitational red-shift z g , depends on the matter
density of the source and is given by
Challenge 885 ny It is usually quite a bit smaller than the Doppler shift. Can you confirm this?
No other red-shift processes are known; moreover, such processes would contradict
Page 484 all the known properties of nature. But the colour issue leads to the next question.
Are all stars different? – Gravitational lenses Copyright © Christoph Schiller November 1997–January 2007
“
Are we sure that at night, two stars are really different? The answer is no. Recently, it
was shown that two ‘stars’ were actually two images of the same object. This was found
”
by comparing the flicker of the two images. It was found that the flicker of one image
was exactly the same as the other, just shifted by 423 days. This result was found by the
Estonian astrophysicist Jaan Pelt and his research group while observing two images of
Ref. 444 quasars in the system Q0957+561.
* ‘Through hardship to the stars.’ A famous Latin motto. Often incorrectly given as ‘per ardua at astra’.
476 iii gravitation and rel ativity • 9. why can we see the stars?
star star
Earth
galaxy
Earth
second image
second image
The two images are the result of gravitational lensing, as shown in Figure 225. Indeed, a
large galaxy can be seen between the two images, much nearer to the Earth. This effect had
been already considered by Einstein; however he did not believe that it was observable.
Ref. 445 The real father of gravitational lensing is Fritz Zwicky, who predicted in 1937 that the effect
would be quite common and easy to observe, if lined-up galaxies instead of lined-up stars Copyright © Christoph Schiller November 1997–January 2007
problem is tricky. For single stars, the issue is not so important, seen overall. Reassuringly,
only about 80 multiple star images have been identified so far. But when whole galaxies
are seen as several images at once (and several dozens are known so far) we might start to
get nervous. In the case of the galaxy cluster CL0024+1654, shown in Figure 227, seven
thin, elongated, blue images of the same distant galaxy are seen around the yellow, nearer,
elliptical galaxies.
But multiple images can be created not only by gravitational lenses; the shape of the
universe could also play some tricks.
an additional dimension.
Ref. 446 But what is the experimental evidence for this statement? There is none. Nothing is
yet known about the shape of the universe. It is extremely hard to determine it, simply
because of its sheer size.
What do experiments say? In the nearby region of the universe, say within a few mil-
lion light years, the topology is simply connected. But for large distances, almost nothing
is certain. Maybe research into gamma-ray bursts will tell us something about the topo-
logy, as these bursts often originate from the dawn of time.* Maybe even the study of
fluctuations of the cosmic background radiation can tell us something. All this research
is still in its infancy.
* The story is told from the mathematical point of view by B ob Osserman, Poetry of the Universe, 1996.
478 iii gravitation and rel ativity • 9. why can we see the stars?
Since little is known, we can ask about the range of possible answers. As just mentioned,
in the standard model with k = 1, space-time is usually assumed to be a product of linear
time, with the topology R of the real line, and a sphere S 3 for space. That is the simplest
possible shape, corresponding to a simply-connected universe. For k = 0, the simplest
topology of space is three-dimensional real space R3 , and for k = −1 it is a hyperbolic
manifold H3 .
Page 463 In addition, Figure 220 showed that depending on the value of the cosmological con-
stant, space could be finite and bounded, or infinite and unbounded. In Friedmann–
Lemaître calculations, simple-connectedness is usually tacitly assumed, even though it
is not at all required.
It could well be that space-time is multiply connected, like a higher-dimensional ver-
sion of a torus. It could also have even more complex topologies.* In these cases, it could
even be that the actual number of galaxies is much smaller than the observed number.
This situation would correspond to a kaleidoscope, where a few beads produce a large
The horizon is a tricky entity. In fact, all cosmological models show that it moves rapidly ”
Ref. 448 away from us. A detailed investigation shows that for a matter-dominated universe the
Challenge 887 ny horizon moves away from us with a velocity
v horizon = 3c . (364)
A pretty result, isn’t it? Obviously, since the horizon does not transport any signal, this is
not a contradiction of relativity. But what is behind the horizon?
If the universe is open or marginal, the matter we see at night is predicted by naively
Copyright © Christoph Schiller November 1997–January 2007
applied general relativity to be a – literally – infinitely small part of all matter existing.
Indeed, an open or marginal universe implies that there is an infinite amount of matter
Challenge 888 ny behind the horizon. Is such a statement verifiable?
In a closed universe, matter is still predicted to exist behind the horizon; however, in
this case it is only a finite amount.
In short, the standard model of cosmology states that there is a lot of matter behind
the horizon. Like most cosmologists, we sweep the issue under the rug and take it up
* The Friedmann–Lemaître metric is also valid for any quotient of the just-mentioned simple topologies by
a group of isometries, leading to dihedral spaces and lens spaces in the case k = 1, to tori in the case k = 0,
Ref. 447 and to any hyperbolic manifold in the case k = −1.
why can we see the stars? – Motion in the universe 479
only later in our walk. A precise description of the topic is provided by the hypothesis of
inflation.
Why are there so few stars? – The energy and entropy content of
the universe
Die Energie der Welt ist constant. Die Entropie
famous statement? Let us have a look at what general relativity has to say about all this.
In general relativity, a total energy can indeed be defined, in contrast to localized energy,
which cannot. The total energy of all matter and radiation is indeed a constant of motion.
It is given by the sum of the baryonic, luminous and neutrino parts:
c 2 M0 c2
E = Eb + Eγ + E ν + ... + ... + ... . (365)
T0 G
* ‘The energy of the universe is constant. Its entropy tends towards a maximum.’
480 iii gravitation and rel ativity • 9. why can we see the stars?
This value is constant only when integrated over the whole universe, not when just the
inside of the horizon is taken.*
Many people also add a gravitational energy term. If one tries to do so, one is obliged
to define it in such a way that it is exactly the negative of the previous term. This value
for the gravitational energy leads to the popular speculation that the total energy of the
universe might be zero. In other words, the number of stars could also be limited by this
relation.
However, the discussion of entropy puts a strong question mark behind all these seem-
ingly obvious statements. Many people have tried to give values for the entropy of the
Ref. 450 universe. Some have checked whether the relation
kc 3 A kG
S= = 4πM 2 , (366)
Għ 4 ħc
which is correct for black holes, also applies to the universe. This assumes that all the
an energy value. This is in fact the only correct conclusion one can draw about the issue.
density. Gravitation will thus cause the denser regions to increase in density and the re-
gions of lower density to be depleted. Can you confirm the instability, simply by assuming
Challenge 891 ny a space filled with dust and a = GM~r 2 ? In summary, even a tiny quantum fluctuation in
the mass density will lead, after a certain time, to lumped matter.
But how did the first inhomogeneities form? That is one of the big problems of mod-
ern physics and astrophysics, and there is no accepted answer yet. Several modern ex-
periments are measuring the variations of the cosmic background radiation spectrum
with angular position and with polarization; these results, which will be available in the
Ref. 451 coming years, might provide some information on the way to settle the issue.
then it is unique, or it is not everything, and then it is not the universe. We will discover
Page 832 that quantum theory does not change this conclusion, despite recurring reports to the
contrary.
Whoever speaks of many universes is talking gibberish.
Why are the stars fixed? – Arms, stars and Mach’s principle
Si les astres étaient immobiles, le temps et
”
* ‘If the stars were immobile, time and space would not exist any more.’ Maurice Maeterlink (1862–1949) is
a famous Belgian dramatist.
482 iii gravitation and rel ativity • 9. why can we see the stars?
The two arms possessed by humans have played an important role in discussions about
motion, and especially in the development of relativity. Looking at the stars at night, we
can make a simple observation, if we keep our arms relaxed. Standing still, our arms hang
down. Then we turn rapidly. Our arms lift up. In fact they do so whenever we see the stars
turning. Some people have spent a large part of their lives studying this phenomenon.
Why?
Ref. 453 Stars and arms prove that motion is relative, not absolute.* This observation leads to
two possible formulations of what Einstein called Mach’s principle.
— Inertial frames are determined by the rest of the matter in the universe.
This idea is indeed realized in general relativity. No question about it.
— Inertia is due to the interaction with the rest of the universe.
Ref. 455 incorrectness of Mach’s principle will persist, like that of the derision of Columbus.
In fact, Mach’s principle is valuable. As an example, take our galaxy. Experiments show
that it is flattened and rotating. The Sun turns around its centre in about 250 million years.
Indeed, if the Sun did not turn around the galaxy’s centre, we would fall into it in about 20
million years. As the physicist Dennis Sciama pointed out, from the shape of our galaxy
* The original reasoning by Newton and many others used a bucket and the surface of the water in it; but
the arguments are the same.
** For another example, at school one usually hears that Columbus was derided because he thought the
Earth to be spherical. But he was not derided at all for this reason; there were only disagreements on the size
of the Earth, and in fact it turned out that his critics were right, and that he was wrong in his own, much too
small, estimate of the radius.
why can we see the stars? – Motion in the universe 483
we can draw a powerful conclusion: there must be a lot of other matter, i.e. a lot of other
Challenge 895 n stars and galaxies in the universe. Can you confirm his reasoning?
any energy attracts other energy through gravitation. In particular, light is electromag-
netic energy, and experiments have shown that all electromagnetic energy is subject to
gravitation. Could two light beams that are advancing with a small angle between them
converge, because of mutual gravitational attraction? That could have measurable and
possibly interesting effects on the light observed from distant stars.
The simplest way to explore the issue is to study the following question: Do parallel
light beams remain parallel? Interestingly, a precise calculation shows that mutual gravit-
Ref. 457 ation does not alter the path of two parallel light beams, even though it does alter the path
of antiparallel light beams.* The reason is that for parallel beams moving at light speed,
Challenge 897 ny the gravitomagnetic component exactly cancels the gravitoelectric component.
Since light does not attract light moving along, light is not disturbed by its own gravity
during the millions of years that it takes to reach us from distant stars. Light does not
attract or disturb light moving alongside. So far, all known quantum-mechanical effects
also confirm this conclusion.
“
Black holes are the most extreme gravitational phenomena. They realize nature’s limit
Alanus de Insulis
of length-to-mass ratios. They produce the highest force value possible in nature; as a
”
Copyright © Christoph Schiller November 1997–January 2007
result, they produce high space-time curvatures. Therefore, black holes cannot be studied
without general relativity. In addition, the study of black holes is a major stepping stone
towards unification and the final description of motion.
Ref. 353 ‘Black hole’ is shorthand for ‘gravitationally completely collapsed object’. For many
years it was unclear whether or not they exist. But the available experimental data have
now led most experts to conclude that there is a black hole at the centre of most galaxies,
Ref. 460 including our own. Black holes are also suspected at the heart of quasars and of gamma
* ‘He who lies on the ground cannot fall down from it.’ The author’s original name is Alain de Lille (c. 1128–
1203).
bl ack holes – falling forever 485
ray bursters. It seems that the evolution of galaxies is strongly tied to the evolution of
black holes. In addition, half a dozen smaller black holes have been identified elsewhere
in our galaxy. For these and many other reasons, black holes, the most impressive, the
Ref. 461 most powerful and the most relativistic systems in nature, are a fascinating subject of
study.
Horizons
The escape velocity is the speed needed to launch an projectile in such a way that it never
falls back down. The escape velocity depends on the mass and the size of the planet from
which the launch takes place. What happens when a planet or star has an escape velocity
that is larger than the speed of light c? Such objects were first imagined by the British
geologist John Michell in 1784, and independently by the French mathematician Pierre
Ref. 462 Laplace in 1795, long before general relativity was developed. Michell and Laplace realized
something fundamental: even if an object with such a high escape velocity were a hot star,
RS =
2GM
(367)
c2
called the Schwarzschild radius. The formula is valid both in universal gravity and in gen-
eral relativity, provided that in general relativity we take the radius as meaning the cir-
cumference divided by 2π. Such a body realizes the limit value for length-to-mass ratios
in nature. For this and other reasons to be given shortly, we will call R S also the size of
the black hole of mass M. (But note that it is only half the diameter. In addition, the term
‘size’ has to be taken with some grain of salt.) In principle, it is possible to imagine an ob-
ject with a smaller length-to-mass ratio; but nobody has yet observed one. In fact, we will
discover that there is no way to observe an object smaller than the Schwarzschild radius,
just as an object moving faster than the speed of light cannot be observed. However, we
can observe black holes – the limit case – just as we can observe entities moving at the
speed of light.
Copyright © Christoph Schiller November 1997–January 2007
When a test mass approaches the critical radius R S , two things happen. First, the local
proper acceleration for (imaginary) point masses increases without bound. For realistic
objects of finite size, the black hole realizes the highest force possible in nature. Something
that falls into a black hole cannot be pulled back out. A black hole thus swallows all matter
that falls into it. It acts like a cosmic vacuum cleaner.
At the surface of a black hole, the red-shift factor for a distant observer also increases
without bound. The ratio between the two quantities is called the surface gravity of a black
* John Archibald Wheeler (1911–), US-American physicist, important expert on general relativity and au-
thor of several excellent textbooks, among them the beautiful John A. Wheeler, A Journey into Gravity
and Spacetime, Scientific American Library & Freeman, 1990, in which he explains general relativity with
passion and in detail, but without any mathematics.
486 iii gravitation and rel ativity • 10. bl ack holes – falling forever
A black hole thus does not allow any light to leave it.
A surface that realizes the force limit and an in-
finite red-shift makes it is impossible to send light,
matter, energy or signals of any kind to the outside
world. A black hole is thus surrounded by a hori-
zon. We know that a horizon is a limit surface. In
fact, a horizon is a limit in two ways. First, a hori- event horizon
zon is a limit to communication: nothing can com-
municate across it. Secondly, a horizon is a surface
of maximum force and power. These properties are
observer remains constant! But that is not all. Furthermore, because of this permanent
free fall, black holes are the only state of matter in thermodynamic equilibrium! In a sense,
floors and all other every-day states of matter are metastable: these forms are not as stable
as black holes.
The characterizing property of a black hole is thus its horizon. The first time we en-
Page 343 countered horizons was in special relativity, in the section on accelerated observers. The
* Robert Oppenheimer (1904–1967), important US-American physicist. He can be called the father of the-
oretical physics in the USA. He worked on quantum theory and atomic physics. He then headed the team
that developed the nuclear bomb during the Second World War. He was also the most prominent (innocent)
victim of one of the greatest witch-hunts ever organized in his home country. See also the http://www.nap.
edu/readingroom/books/biomems/joppenheimer.html website.
bl ack holes – falling forever 487
horizons due to gravitation are similar in all their properties; the section on the maximum
force and power gave a first impression. The only difference we have found is due to the
neglect of gravitation in special relativity. As a result, horizons in nature cannot be planar,
in contrast to what is suggested by the observations of the imagined point-like observers
assumed to exist in special relativity.
Both the maximum force principle and the field equations imply that the space-time
around a rotationally symmetric (thus non-rotating) and electrically neutral mass is de-
Page 393 scribed by
dr 2
di 2 = (1 − )dt − r 2 dφ 2 ~c 2 .
2GM 2
− (369)
rc 2 1 − 2G
rc
M
2
we find that an observer at the horizon would have vanishing proper time. In other words,
at the horizon the red-shift is infinite. (In fact, the surface of infinite red-shift and the hori-
zon coincide only for non-rotating black holes. For rotating black holes, the two surfaces
are distinct.) Everything happening at the horizon goes on infinitely slowly, as observed
by a distant observer. In other words, for a distant observer observing what is going on
at the horizon itself, nothing at all ever happens.
In the same way that observers cannot reach the speed of light, observers cannot reach
a horizon. For a second observer, it can only happen that the first is moving almost as fast
as light; in the same way, for a second observer, it can only happen that the first has almost
reached the horizon. In addition, a traveller cannot feel how much he is near the speed of
light for another, and experiences light speed as unattainable; in the same way, a traveller
(into a large black hole) cannot feel how much he is near a horizon and experiences the
horizon as unattainable.
In general relativity, horizons of any kind are predicted to be black. Since light cannot
escape from them, classical horizons are completely dark surfaces. In fact, horizons are
Copyright © Christoph Schiller November 1997–January 2007
the darkest entities imaginable: nothing in nature is darker. Nonetheless, we will later
discover that physical horizons are not completely black.
Orbits
Ref. 458 Since black holes curve space-time strongly, a body moving near a black hole behaves in
more complicated ways than predicted by universal gravity. In universal gravity, paths
are either ellipses, parabolas, or hyperbolas; all these are plane curves. It turns out that
488 iii gravitation and rel ativity • 10. bl ack holes – falling forever
impact
parameter
whole horizon. In fact, the result is valid for any extended body. This property of black
holes will be of importance later on, when we will discuss the size of elementary particles.
For falling bodies coming from infinity, the situation near black holes is even more in-
teresting. Of course there are no hyperbolic paths, only trajectories similar to hyperbolas
for bodies passing far enough away. But for small, but not too small impact parameters,
a body will make a number of turns around the black hole, before leaving again. The
* For such paths, Kepler’s rule connecting the average distance and the time of orbit
GMt 3
= r3
(2π)2
(371)
Challenge 903 ny still holds, provided the proper time and the radius measured by a distant observer are used.
bl ack holes – falling forever 489
limit orbit
limit orbit
number of turns increases beyond all bounds with decreasing impact parameter, until a
If we were located on that orbit, we would see the back of our head by looking forward!
Challenge 907 ny However, this orbit is unstable. The surface containing all orbits inside the circular one
is called the photon sphere. The photon sphere thus divides paths leading to capture from
those leading to infinity. Note that there is no stable orbit for light around a black hole.
Challenge 908 ny Are there any rosetta paths for light around a black hole?
For light around a rotating black hole, paths are much more complex. Already in the
equatorial plane there are two possible circular light paths: a smaller one in the direction
Challenge 909 ny of the rotation, and a larger one in the opposite direction.
For charged black holes, the orbits for falling charged particles are even more complex.
The electrical field lines need to be taken into account. Several fascinating effects appear
which have no correspondence in usual electromagnetism, such as effects similar to elec-
trical versions of the Meissner effect. The behaviour of such orbits is still an active area of
490 iii gravitation and rel ativity • 10. bl ack holes – falling forever
J 2 GQ 2 GM 2
+ D . (372)
cM 4πε 0 c 4 c2
Challenge 911 ny This follows from the limit on length-to-mass ratios at the basis of general relativity. Ro-
tating black holes realizing the limit (372) are called extremal black holes. The limit (372)
* The existence of three basic characteristics is reminiscent of particles. We will find out more about the
connection between black holes and particles in the third part of our mountain ascent.
** Mainly for marketing reasons, non-rotating and electrically neutral black holes are often called Schwarz-
Ref. 464 schild black holes; uncharged and rotating ones are often called Kerr black holes, after Roy Kerr, who
discovered the corresponding solution of Einstein’s field equations in 1963. Electrically charged but non-
rotating black holes are often called Reissner–Nordström black holes, after the German physicist Hans Re-
issner and the Finnish physicist Gunnar Nordström. The general case, charged and rotating, is sometimes
Ref. 465 named after Kerr and Newman.
Ref. 353 *** Wheeler claims that he was inspired by the difficulty of distinguishing between bald men; however, Feyn-
man, Ruffini and others had a clear anatomical image in mind when they stated that ‘black holes, in contrast
to their surroundings, have no hair.’
**** More about the still hypothetical magnetic charge later on. In black holes, it enters like an additional
type of charge into all expressions in which electric charge appears.
bl ack holes – falling forever 491
For example, for a black hole with the mass and half the angular momentum of the Sun,
namely 2 ë 1030 kg and 0.45 ë 1042 kg m2 ~s, the charge limit is about 1.4 ë 1020 C.
How does one distinguish rotating from non-rotating
black holes? First of all by the shape. Non-rotating black rotation axis
holes must be spherical (any non-sphericity is radiated
Ref. 468 away as gravitational waves) and rotating black holes have event horizon
a slightly flattened shape, uniquely determined by their
angular momentum. Because of their rotation, their sur- ergosphere
face of infinite gravity or infinite red-shift, called the static
16πG 2 2
A= M . (374)
c4
The relation between surface area and mass for a rotating and charged black hole is more
complex: it is given by
¾
8πG 2
A = 4 M2 1 +
J2 c2 Q2
1−
M 4 G 2 4πε 0 GM 2
− (375)
Copyright © Christoph Schiller November 1997–January 2007
where J is the angular momentum and Q the charge. In fact, the relation
A=
8πG
Mr h (376)
c2
is valid for all black holes. Obviously, in the case of an electrically charged black hole, the
rotation also produces a magnetic field around it. This is in contrast with non-rotating
black holes, which cannot have a magnetic field.
492 iii gravitation and rel ativity • 10. bl ack holes – falling forever
E2 Q2 J2 c2 Q2 J2 1
M2 = = (m irr + ) 2
+ 2
= (m irr + ) 2
+ 2 c2
(377)
c4 16πε 0 Gm irr 4m irr G 2 8πε 0 ρ irr ρ irr
which shows how the electrostatic and the rotational energy enter the mass of a black
hole. In the expression, m irr is the irreducible mass defined as
A(M, Q = 0, J = 0) c 4
2
c2
2
m irr = = ρ irr (378)
16π G2 2G Copyright © Christoph Schiller November 1997–January 2007
* And it would be much more dangerous, since any small object would hit such an against-the-stream satel-
Challenge 913 ny lite at about 15.8 km~s, thus transforming the object into a dangerous projectile. In fact, any power wanting
to destroy satellites of the enemy would simply have to load a satellite with nuts or bolts, send it into space
the wrong way, and distribute the bolts into a cloud. It would make satellites impossible for many decades
to come.
** It is also possible to extract energy from rotational black holes through gravitational radiation.
bl ack holes – falling forever 493
reversible, and all others irreversible. In fact, the area of black holes behaves like the en-
tropy of a closed system: it never decreases. That the area in fact is an entropy was first
Ref. 471 stated in 1970 by Jakob Bekenstein. He deduced that only when an entropy is ascribed to
a black hole, is it possible to understand where the entropy of all the material falling into
it is collected.
The black hole entropy is a function only of the mass, the angular momentum and
the charge of the black hole. You might want to confirm Bekenstein’s deduction that the
Challenge 917 ny entropy S is proportional to the horizon area. Later it was found, using quantum theory,
that
A kc 3 A k
S= = 2 . (379)
4 ħG 4 l Pl
This famous relation cannot be deduced without quantum theory, as the absolute value
of entropy, as for any other observable, is never fixed by classical physics alone. We will
“ Anonymous
Black holes have many counter-intuitive properties. We will first have a look at the clas- ”
Page 901 sical effects, leaving the quantum effects for later on.
Following universal gravity, light could climb upwards from the surface of a black hole
and then fall back down. In general relativity, a black hole does not allow light to climb
Challenge 919 ny up at all; it can only fall. Can you confirm this?
**
What happens to a person falling into a black hole? An outside observer gives a clear
answer: the falling person never arrives there since she needs an infinite time to reach the
Challenge 920 ny horizon. Can you confirm this result? The falling person, however, reaches the horizon
Challenge 921 ny in a finite amount of her own time. Can you calculate it?
* No translation possible.
494 iii gravitation and rel ativity • 10. bl ack holes – falling forever
observer dense
star
This result is surprising, as it means that for an outside observer in a universe with
finite age, black holes cannot have formed yet! At best, we can only observe systems that
are busy forming black holes. In a sense, it might be correct to say that black holes do not
exist. Black holes could have existed right from the start in the fabric of space-time. On
the other hand, we will find out later why this is impossible. In short, it is important to
keep in mind that the idea of black hole is a limit concept but that usually, limit concepts
E 2 .
L 4G
(380)
M c
**
Interestingly, the size of a person falling into a black hole is experienced in vastly different
ways by the falling person and a person staying outside. If the black hole is large, the in-
falling observer feels almost nothing, as the tidal effects are small. The outside observer
makes a startling observation: he sees the falling person spread all over the horizon of the
black hole. In-falling, extended bodies cover the whole horizon. Can you explain this fact,
Challenge 922 ny for example by using the limit on length-to-mass ratios?
This strange result will be of importance later on in our exploration, and lead to im-
portant results about the size of point particles.
**
An observer near a (non-rotating) black hole, or in fact near any object smaller than 7~4
times its gravitational radius, can even see the complete back side of the object, as shown
Challenge 923 ny in Figure 232. Can you imagine what the image looks like? Note that in addition to the Copyright © Christoph Schiller November 1997–January 2007
paths shown in Figure 232, light can also turn several times around the black hole before
reaching the observer! Therefore, such an observer sees an infinite number of images of
the black hole. The resulting formula for the angular size of the innermost image was
Page 403 given above.
In fact, the effect of gravity means that it is possible to observe more than half the
surface of any spherical object. In everyday life, however, the effect is small: for example,
light bending allows us to see about 50.0002 % of the surface of the Sun.
**
A mass point inside the smallest circular path of light around a black hole, at 3R~2, cannot
stay in a circle, because in that region, something strange happens. A body which circles
another in everyday life always feels a tendency to be pushed outwards; this centrifugal
bl ack holes – falling forever 495
effect is due to the inertia of the body. But at values below 3R~2, a circulating body is
Ref. 472 pushed inwards by its inertia. There are several ways to explain this paradoxical effect. The
simplest is to note that near a black hole, the weight increases faster than the centrifugal
Challenge 924 ny force, as you may want to check yourself. Only a rocket with engines switched on and
pushing towards the sky can orbit a black hole at 3R~2.
**
By the way, how can gravity, or an electrical field, come out of a black hole, if no signal
Challenge 925 n and no energy can leave it?
**
Do white holes exist, i.e. time-inverted black holes, in which everything flows out of, in-
Challenge 926 ny stead of into, some bounded region?
ds 2 dr 2 r2 2
dτ 2 = = 1 − − r dt 2 −
2GM Λ 2
Λc 2 2
− dφ . (381)
c 2 rc 2 3 c2 − 2G M
r
− 3
r c2
Note that this metric does not turn into the Minkowski metric for large values of r.
º
However, in the case that Λ is small, the metric is almost flat for values of r that satisfy
1~ Λ Q r Q 2Gm~c 2 .
As a result, the inverse square law is also modified:
Gm c 2 Λ
F=− + r. (382)
r2 6
With the known values of the cosmological constant, the second term is negligible inside
the solar system.
**
In quantum theory, the gyromagnetic ratio is an important quantity for any rotating
Challenge 928 ny charged system. What is the gyromagnetic ratio for rotating black holes?
Copyright © Christoph Schiller November 1997–January 2007
**
A large black hole is, as the name implies, black. Still, it can be seen. If we were to travel
towards it in a spaceship, we would note that the black hole is surrounded by a bright
circle, like a thin halo. The ring at the radial distance of the photon sphere is due to those
photons which come from other luminous objects, then circle the hole, and finally, after
Challenge 929 n one or several turns, end up in our eye. Can you confirm this result?
**
Challenge 930 ny Do moving black holes Lorentz-contract? Black holes do shine a little bit. It is true that
the images they form are complex, as light can turn around them a few times before
496 iii gravitation and rel ativity • 10. bl ack holes – falling forever
reaching the observer. In addition, the observer has to be far away, so that the effects of
curvature are small. All these effects can be taken into account; nevertheless, the question
remains subtle. The reason is that the concept of Lorentz contraction makes no sense in
general relativity, as the comparison with the uncontracted situation is difficult to define
precisely.
**
Challenge 931 ny Can you confirm that black holes imply a limit to power? Power is energy change over
time. General relativity limits power to P D c 5 ~4G. In other words, no engine in nature
can provide more than 0.92 ë 1052 W or 1.2 ë 1049 horsepower.
black holes is to look for extremely intense X-ray emission from point sources, using
space-based satellites or balloon-based detectors. If the distance to the object is known,
its absolute brightness can be deduced; if it is above a certain limit, it must be a black hole,
since normal matter cannot produce an unlimited amount of light. This method is being
perfected with the aim of directly observing of energy disappearing into a horizon. This
Ref. 476 may in fact have been observed.
To sum up the experimental situation, measurements show that in all galaxies studied
so far – more than a dozen – a supermassive black hole seems to be located at their centre.
The masses vary: the black hole at the centre of our own galaxy has about 2.6 million solar
Ref. 460 masses, while the central black hole of the galaxy M87 has 3000 million solar masses.
About a dozen stellar black holes of between 4 and 20 solar masses are known in the
Ref. 460 rest of our own galaxy; all have been discovered since 1971, when Cygnus X-1 was found.
In the year 2000, intermediate-mass black holes were found. Astronomers are also study-
ing how large numbers of black holes in star clusters behave, how often they collide, and
Singularities
Solving the equations of general relativity for various initial conditions, one finds that
a cloud of dust usually collapses to a singularity, i.e. to a point of infinite density. The
same conclusion appears when one follows the evolution of the universe backwards in
time. In fact, Roger Penrose and Stephen Hawking have proved several mathematical
theorems on the necessity of singularities for many classical matter distributions. These
theorems assume only the continuity of space-time and a few rather weak conditions on
Ref. 477 the matter in it. The theorems state that in expanding systems such as the universe itself, or
in collapsing systems such as black holes in formation, events with infinite matter density
should exist somewhere in the past, or in the future, respectively. This result is usually
summarized by saying that there is a mathematical proof that the universe started in a
singularity.
In fact, the derivation of the initial singularities makes a hidden, but strong assumption
about matter: that dust particles have no proper size. In other words, it is assumed that
dust particles are singularities. Only with this assumption can one deduce the existence
of initial singularities. However, we have seen that the maximum force principle can be
reformulated as a minimum size principle for matter. The argument that there must have
Copyright © Christoph Schiller November 1997–January 2007
been an initial singularity of the universe is thus flawed. The experimental situation is
clear: there is overwhelming evidence for an early state of the universe that was extremely
hot and dense; but there is no evidence for infinite temperature or density.
Mathematically inclined researchers distinguish two types of singularities: those with
and without a horizon. The latter ones, the so-called naked singularities, are especially
strange: for example, a toothbrush can fall into a naked singularity and disappear without
leaving any trace. Since the field equations are time invariant, we could thus expect
that every now and then, naked singularities emit toothbrushes. (Can you explain why
Challenge 932 ny ‘dressed’ singularities are less dangerous?)
To avoid the spontaneous appearance of toothbrushes, over the years many people
have tried to discover some theoretical principles forbidding the existence of naked sin-
498 iii gravitation and rel ativity • 10. bl ack holes – falling forever
gularities. It turns out that there are two such principles. The first is the maximum force
or maximum power principle we encountered above. The maximum force implies that
no infinite force values appear in nature; in other words, there are no naked singularities
Ref. 478 in nature. This statement is often called cosmic censorship. Obviously, if general relativity
were not the correct description of nature, naked singularities could still appear. Cosmic
censorship is thus still discussed in research articles. The experimental search for naked
singularities has not yielded any success; in fact, there is not even a candidate observation
for the less abstruse dressed singularities. But the theoretical case for ‘dressed’ singular-
ities is also weak. Since there is no way to interact with anything behind a horizon, it is
futile to discuss what happens there. There is no way to prove that behind a horizon a
singularity exists. Dressed singularities are articles of faith, not of physics.
In fact, there is another principle preventing singularities, namely quantum theory.
Whenever we encounter a prediction of an infinite value, we have extended our descrip-
tion of nature to a domain for which it was not conceived. To speak about singularities,
m0 = 72πGρ 0 ct 03 = 6 ë 1026 m
4π 2Gm 0
ρ o r 03 whence (384)
3 c2
r0
2Gm 0
, (385)
Copyright © Christoph Schiller November 1997–January 2007
c2
Challenge 933 ny which is similar to the black hole relation r S = 2Gm~c 2 . Is this a coincidence? No, it is not:
all systems with high curvature more or less obey this relation. But are we nevertheless
Challenge 934 ny falling into a large black hole? You can answer that question by yourself.
* Many physicists are still wary of making such strong statements at this point; and there are still some who
claim that space and time are continuous even down to the smallest distances. Our discussion of quantum
Page 1018 theory, and the first sections of the third part of our mountain ascent, will give the precise arguments leading
to the opposite conclusion.
d oes space differ from time? 499
“ Seneca
People in a bad mood say that time is our master. Nobody says that of space. Time and
space are obviously different in everyday life. But what is the precise difference between
”
them in general relativity? And do we need them at all? In general relativity it is assumed
that we live in a (pseudo-Riemannian) space-time of variable curvature. The curvature is
an observable and is related to the distribution and motion of matter and energy in the
way described by the field equations.
However, there is a fundamental problem. The equations of general relativity are in-
variant under numerous transformations which mix the coordinates x 0 , x 1 , x 2 and x 3 .
For example, the viewpoint transformation
is allowed in general relativity, and leaves the field equations invariant. You might want
Challenge 935 ny to search for other examples.
This has a consequence that is clearly in sharp contrast with everyday life: diffeomorph-
ism invariance makes it impossible to distinguish space from time inside general relativity.
More explicitly, the coordinate x 0 cannot simply be identified with the physical time t, as
implicitly done up to now. This identification is only possible in special relativity. In spe-
cial relativity the invariance under Lorentz (or Poincaré) transformations of space and
time singles out energy, linear momentum and angular momentum as the fundamental
observables. In general relativity, there is no (non-trivial) metric isometry group; con-
sequently, there are no basic physical observables singled out by their characteristic of
being conserved. But invariant quantities are necessary for communication! In fact, we
can talk to each other only because we live in an approximately flat space-time: if the
angles of a triangle did not add up to π (two right angles), there would be no invariant
quantities and we would not be able to communicate.
Copyright © Christoph Schiller November 1997–January 2007
How have we managed to sweep this problem under the rug so far? We have done so in
several ways. The simplest was to always require that in some part of the situation under
consideration space-time was our usual flat Minkowski space-time, where x 0 can be iden-
tified with t. We can fulfil this requirement either at infinity, as we did around spherical
masses, or in zeroth approximation, as we did for gravitational radiation and for all other
perturbation calculations. In this way, we eliminate the free mixing of coordinates and
the otherwise missing invariant quantities appear as expected. This pragmatic approach
is the usual way out of the problem. In fact, it is used in some otherwise excellent texts
Ref. 434 on general relativity that preclude any deeper questioning of the issue.
* ‘Care about time.’ Lucius Annaeus Seneca (c. 4 bce–65), Epistolae 88, 39.
500 iii gravitation and rel ativity • 11. d oes space differ from time?
A common variation of this trick is to let the distinction ‘sneak’ into the calculations
by the introduction of matter and its properties, or by the introduction of radiation. The
material properties of matter, for example their thermodynamic state equations, always
distinguish between space and time. Radiation does the same, by its propagation. Obvi-
ously this is true also for those special combinations of matter and radiation called clocks
and metre bars. In fact, the method of introducing matter is the same as the method of
introducing Minkowski space-time, if one looks closely: properties of matter are always
defined using space-time descriptions of special relativity.*
Another variation of the pragmatic approach is the use of the cosmological time co-
ordinate. An isotropic and homogeneous universe does have a preferred time coordinate,
Page 458, page 630 namely the one used in all the tables on the past and the future of the universe. This
method is in fact a combination of the previous two.
But we are on a special quest here. We want to understand motion in principle, not
only to calculate it in practice. We want a fundamental answer, not a pragmatic one. And
problem persists.
In fact, we need electrodynamics to solve it. It is only be using the elementary charge e
Ref. 479 that we can form length scales, of which the simplest one is
º
l emscale = º 1.4 ë 10−36 m .
e G
(387)
4πε 0 c2
Page 532 Here, ε 0 is the permittivity of free space. Alternatively, we can argue that quantum mech-
* We note something astonishing here: the inclusion of some condition at small distances (matter) has the
Challenge 936 ny same effect as the inclusion of some condition at infinity. Is this just coincidence? We will come back to this
Page 1072 issue in the third part of our mountain ascent.
d oes space differ from time? 501
anics provides a length scale, since one can use the quantum of action ħ to define the
length scale ¾
l qtscale = 1.6 ë 10−35 m ,
ħG
(388)
c3
which is called the Planck length or Planck’s natural length unit. However, this does not
change the argument, because one needs electrodynamics to measure the value of ħ. The
equivalence of the two arguments is shown by rewriting the elementary charge e as a
combination of nature’s fundamental constants:
¼
e = 4πε 0 cħα . (389)
Here, α 1~137.06 is the fine-structure constant that characterizes the strength of electro-
magnetism. In terms of α, expression (387) becomes
Summing up, every length measurement is based on the electromagnetic coupling con-
stant α and on the Planck length. Of course, the same is true for time and mass meas-
Challenge 937 e urements. There is thus no way to define or measure lengths, times and masses using
gravitation or general relativity only.*
Given this sobering result, we can ask whether in general relativity space and time are
really required at all.
(a 3 − v) = (a 1 − v) + (a 2 − v) .
Copyright © Christoph Schiller November 1997–January 2007
(391)
Such an observable is called the vacuum. Geroch shows how to use such an observable to
construct the derivatives of observables. Then the so-called Einstein algebra can be built,
which comprises the whole of general relativity.
Usually one describes motion by deducing space-time from matter observables, by
calculating the evolution of space-time, and then by deducing the motion of matter fol-
lowing from it. Geroch’s description shows that the middle step, and thus the use of space
Ref. 480 * In the past, John Wheeler used to state that his geometrodynamic clock, a device which measures time
by bouncing light back and forth between two parallel mirrors, was a counter-example; that is not correct,
Challenge 938 n however. Can you confirm this?
502 iii gravitation and rel ativity • 11. d oes space differ from time?
one, as the curvature inside the hole is different. This difference thus implies that the
curvature outside a region does not determine the curvature inside it. That is extremely
unsatisfactory. Worse, if we generalize this operation to the time domain, we seem to get
the biggest nightmare possible in physics: determinism is lost.
On the other hand, general relativity is diffeomorphism invariant. The deformation
shown in the figure is a diffeomorphism; so the new situation must be physically equival-
ent to the original situation.
Which argument is correct? Einstein first favoured the first point of view, and therefore
dropped the whole idea of diffeomorphism invariance for about a year. Only later did he
understand that the second assessment is correct, and that the first statement makes a
fundamental mistake: it assumes an independent existence of the coordinate axes x and y,
as shown in the figure. But during that deformation, the coordinates x and y automatically
d oes space differ from time? 503
change as well, so that there is no physical difference between the two situations.
brought up to believe that light travels in straight lines. Get rid of this wrong belief, they
say, and the hollow Earth appears in all its glory.
Interestingly, the reasoning is correct. There is no way to disprove this sort of de-
scription of the universe. In fact, as the great Austrian physicist Roman Sexl used to
Ref. 483 explain, the diffeomorphism invariance of general relativity even proclaims the equival-
ence between the two views. The fun starts when either of the two camps wants to tell
the other that only its own description can be correct. You might check that any such
argument is wrong; it is fun to slip into the shoes of such an eccentric and to defend the
Challenge 939 e hollow Earth hypothesis against your friends. It is easy to explain the appearance of day
and night, of the horizon, and of the satellite images of the Earth. It is easy to explain what
happened during the flight to the Moon. You can drive many bad physicists crazy in this
way. The usual description and the hollow Earth description are exactly equivalent. Can
504 iii gravitation and rel ativity • 11. d oes space differ from time?
you confirm that even quantum theory, with its introduction of length scales into nature,
Challenge 940 n does not change this situation?
Such investigations show that diffeomorphism invariance is not an easy symmetry to
swallow. But it is best to get used to it now, as the rest of our adventure will throw up
even more surprises. Indeed, in the third part of our walk we will discover that there is
an even larger symmetry of nature that is similar to the change in viewpoint from the
hollow Earth view to the standard view. This symmetry, space-time duality, is valid not
only for distances measured from the centre of the Earth, but for distances measured
Page 1072 from any point in nature.
between space and time in general relativity. Pragmatic distinctions, using matter, radi-
ation or space-time at infinity, are the only possible ones.
In the third part of our adventure we will discover that even the inclusion of quantum
theory is consistent with this view. We will show explicitly that no distinction is possible
in principle. We will discover that mass and space-time are on an equal footing, and that,
Page 1035 in a sense, particles and vacuum are made of the same substance. Distinctions between
space and time turn out to be possible only at low, everyday energies.
Page 160 In the beginning of our mountain ascent we found that we needed matter to define
space and time. Now we have found that we even need matter to distinguish between
space and time. Similarly, in the beginning we found that space and time are required to
Page 441 define matter; now we have found that we even need flat space-time to define it.
In summary, general relativity does not provide a way out of the circular reasoning
general rel ativity in ten points – a summary for the l ayman 505
we discovered in Galilean physics. Indeed, it makes the issue even less clear than before.
Continuing the mountain ascent is really worth the effort.
“ ”
General relativity is the final description of paths of motion, or if one prefers, of mac-
roscopic motion. General relativity describes how the observations of motion of any two
observers are related to each other; it also describes motion due to gravity. In fact, general
relativity is based on the following observations:
— Space and time are not distinguished globally, but only locally. Matter is required to
make the distinction.
In addition, all the matter and energy we observe in the sky lead us to the following
conclusions:
— On the cosmological scale, everything moves away from everything else: the universe
is expanding. This expansion of space-time is described by the field equations.
— The universe has a finite age; this is the reason for the darkness of the sky at night. A
horizon limits the measurable space-time intervals to about fourteen thousand million
years.
kilogram in Paris, instead of defining some standard curvature and fixing the value of
G. Indeed, no useful terrestrial curvature experiment has ever been carried out. A break-
through in this domain would make the news. The terrestrial curvature methods cur-
rently available would not even allow one to define a kilogram of gold or of oranges with
a precision of a single kilogram!
Another way to check general relativity is to search for alternative descriptions of
gravitation. Quite a number of alternative theories of gravity have been formulated and
Ref. 485, Ref. 488 studied, but so far, only general relativity is in agreement with all experiments.
In summary, as Thibault Damour likes to explain, general relativity is at least
99.999 999 999 9 % correct concerning the motion of matter and energy, and at least
Ref. 484 99.9 % correct about the way matter and energy curve and move space-time. No excep-
general rel ativity in ten points – a summary for the l ayman 507
tions, no anti-gravity and no unclear experimental data are known. All motion on Earth
and in the skies is described by general relativity. The importance of Albert Einstein’s
achievement cannot be understated.
We note that general relativity has not been tested for microscopic motion. In this
context, microscopic motion is any motion for which the action is around the quantum of
action, namely 10−34 Js. This issue is central to the third and last part of our adventure.
**
The most interesting experimental studies at present are those of double pulsars, the
search for gravitational waves and various dedicated satellites; among others a special
satellite will capture all possible pulsars of the galaxy. All these experiments will allow
experimental tests in domains that have not been accessible up to now.
**
The description of collisions and many-body problems, invloving stars, neutron stars and
* There is even a free and excellent internet-based research journal, called Living Reviews in Relativity, to be
found at the http://www.livingreviews.org website.
508 iii gravitation and rel ativity • 12. a summary for the l ayman
black holes helps astrophysicists to improve their understanding of the rich behaviour
Ref. 490 they observe in their telescopes.
**
The study of the early universe and of elementary particle properties, with phenomena
such as inflation, a short period of accelerated expansion during the first few seconds, is
Ref. 491 still an important topic of investigation.
**
The study of chaos in the field equations is of fundamental interest in the study of the
early universe, and may be related to the problem of galaxy formation, one of the biggest
Ref. 492 open problems in physics.
**
The inclusion of torsion in the field equations, a possible extension of the theory, is one of
Ref. 496 the more promising attempts to include particle spin in general relativity. The inclusion
of torsion in general relativity does not require new fundamental constants; indeed, the
absence of torsion was assumed in the Raychaudhuri equation used by Jacobson. The
Ref. 497 use of the extended Raychaudhuri equation, which includes torsion, should allow one to
deduce the full Einstein–Cartan theory from the maximum force principle. This issue is
a topic for future research.
**
Solutions with non-trivial topology, such as wormholes and particle-like solutions, con-
Ref. 498 stitue a fascinating field of enquiry, related to string theory.
general rel ativity in ten points – a summary for the l ayman 509
**
Other formulations of general relativity, describing space-time with quantities other than
the metric, are continuously being developed, in the hope of clarifying its relationship to
Ref. 499 the quantum world. The so-called Ashtekar variables are such a modern description.
**
The unification of quantum physics and general relativity, the topic of the third part of
Ref. 500 this mountain ascent, will occupy researchers for many years to come.
**
Finally, the teaching of general relativity, which for many decades has been hidden behind
Greek indices, differential forms and other antididactic methods, will benefit greatly from
Ref. 501 future improvements focusing more on the physics and less on the formalism.
tensor is compatible with non-vanishing curvature. On the other hand, more than three
dimensions give deviations from the inverse square ‘law’ of gravitation. So far, there are
no data pointing in this direction.
Could the equations of general relativity be different? Despite their excellent fit with
experiment, there is one issue that still troubles some people. The rotation speed of matter
far from the centre of galaxies does not seem to be consistent with the inverse square
dependence. There could be many reasons for this effect, and a change in the equations
for large distances might be one of them. This issue is still open.
Theoreticians have explored many alternative theories, such as scalar-tensor theories,
theories with torsion, or theories which break Lorentz invariance. However, none of the
alternative theories proposed so far seem to fit experimental data.
510 iii gravitation and rel ativity • 12. a summary for the l ayman
“
Copyright © Christoph Schiller November 1997–January 2007
”
bibliography 511
Biblio graphy
A man will turn over half a library to make one
“ book.
Samuel Johnson*
”
339 The simplest historical source is Albert Einstein, Sitzungsberichte der Preussischen
Akademie der Wissenschaften II pp. 844–846, 1915. It is the first explanation of the general
theory of relativity, in only three pages. The theory is then explained in detail in the famous
article Albert Einstein, Die Grundlage der allgemeinen Relativitätstheorie, Annalen
der Physik 49, pp. 769–822, 1916. The historic references can be found in German and Eng-
lish in John Stachel, ed., The Collected Papers of Albert Einstein, Volumes 1–9, Princeton
University Press, 1987–2004.
Below is a selection of English-language textbooks for deeper study, in ascending order
of depth and difficulty:
ters of the field is Wolf gang R indler, Relativity – Special, General and Cosmological,
Oxford University Press, 2001.
— A classic is Steven Weinberg, Gravitation and Cosmology, Wiley, 1972.
— The passion of general relativity can be experienced also in John Kl auder, ed., Magic
without Magic: John Archibald Wheeler – A Collection of Essays in Honour of His Sixtieth
Birthday, W.H. Freeman & Co., 1972.
— An extensive text is Kip S. Thorne, Black Holes and Time Warps – Einstein’s Outrageous
Legacy, W.W. Norton, 1994.
— The most mathematical – and toughest – text is Robert M. Wald, General Relativity,
University of Chicago Press, 1984.
— Much information about general relativity is available on the internet. As a good starting
point for US-American material, see the http://math.ucr.edu/home/baez/relativity.html
website.
There is still a need for a large and modern textbook on general relativity, with colour
material, that combines experimental and theoretical aspects.
For texts in other languages, see the next reference. Cited on pages 385, 418, 420, 438,
and 439.
340 A beautiful German teaching text is the classic G. Falk & W. Ruppel, Mechanik, Relativ-
ität, Gravitation – ein Lehrbuch, Springer Verlag, third edition, 1983.
A practical and elegant booklet is Ulrich E. Schröder, Gravitation – Einführung in
die allgemeine Relativitätstheorie, Verlag Harri Deutsch, Frankfurt am Main, 2001.
A modern reference is Torsten Fliessbach, Allgemeine Relativitätstheorie,
Akademischer Spektrum Verlag, 1998.
Excellent is Hubert Goenner, Einführung in die spezielle und allgemeine Relativität-
lativistic time gains, Science 177, pp. 166–167, and Around-the-world atomic clocks: ob-
served relativistic time gains, pp. 168–170, 14 July 1972. Cited on page 388.
347 R.F.C. Vessot & al., Test of relativistic gravitation with a space-borne hydrogen maser,
Physical Review Letters 45, pp. 2081–2084, 1980. The experiment was performed in 1976;
there are more than a dozen co-authors involved in this work, which involved shooting a
maser into space with a scout missile to a height of c. 10 000 km. Cited on page 388.
348 L. Briatore & S. Leschiu tta, Evidence for Earth gravitational shift by direct atomic-
time-scale comparison, Il Nuovo Cimento 37B, pp. 219–231, 1977. Cited on page 388.
349 More information about tides can be found in E.P. Cl ancy, The Tides, Doubleday, New
York, 1969. Cited on page 389.
350 The expeditions had gone to two small islands, namely to Sobral, north of Brazil, and to Prin-
cipe, in the gulf of Guinea. The results of the expedition appeared in The Times before they
bibliography 513
appeared in a scientific journal. Today this would be seen as a gross violation of scientific
honesty. The results were published as F.W. Dyson, A.S. Eddington & C. Davidson,
Philosophical Transactions of the Royal Society (London) 220A, p. 291, 1920, and Memoirs of
the Royal Astronomical Society 62, p. 291, 1920. Cited on page 391.
351 A good source for images of space-time is the text by G.F.R. Ellis & R. Williams, Flat
and Curved Space-times, Clarendon Press, Oxford, 1988. Cited on page 391.
352 J. Droste, Het veld van een enkel centrum in Einstein’s theorie der zwaartekracht, en de
beweging van een stoffelijk punt, Verslag gew. Vergad. Wiss. Amsterdam 25, pp. 163–180, 1916.
Cited on page 393.
353 The name black hole was introduced in 1967 at a pulsar conference, as described in his autobi-
ography by John A. Wheeler, Geons, Black Holes, and Quantum Foam: A Life in Physics,
W.W. Norton, 1998, pp. 296–297: ‘In my talk, I argued that we should consider the possibil-
ity that at the center of a pulsar is a gravitationally completely collapsed object. I remarked
that one couldn’t keep saying “gravitationally completely collapsed object” over and over.
One needed a shorter descriptive phrase. “How about black hole?” asked someone in the
valence principle, has been checked by many experiments, down to the 10−13 level. The
most precise experiments use so-called torsion balances. See, for example, the website of
the Eőt-Wash group at http://www.npl.washington.edu/eotwash/experiments/experiments.
html. Cited on page 397.
358 So far, the experiments confirm that electrostatic and (strong) nuclear energy fall like mat-
ter to within one part in 108 , and weak (nuclear) energy to within a few per cent. This is
summarized in Ref. 362. Cited on page 398.
359 J. Soldner, Berliner Astronomisches Jahrbuch auf das Jahr 1804, 1801, p. 161. Cited on page
398.
360 See for example K.D. Olum, Superluminal travel requires negative energies, Physical Re-
view Letters 81, pp. 3567–3570, 1998, or M. Alcubierre, The warp drive: hyper-fast travel
within general relativity, Classical and Quantum Gravity 11, pp. L73–L77, 1994. See also
514 iii gravitation and rel ativity
Chris Van Den Broeck, A warp drive with more reasonable total energy requirements,
Classical and Quantum Gravity 16, pp. 3973–3979, 1999. Cited on page 401.
361 See the Astronomical Almanac, and its Explanatory Supplement, H.M. Printing Office, Lon-
don and U.S. Government Printing Office, Washington, 1992. For the information about
various time coordinates used in the world, such as barycentric coordinate time, the time
at the barycentre of the solar system, see also the http://tycho.usno.navy.mil/systime.html
web page. It also contains a good bibliography. Cited on page 402.
362 An overview is given in C. Will, Theory and Experiment in Gravitational Physics, chapter
14.3, Cambridge University Press, revised edition, 1993. (Despite being a standard reference,
his view the role of tides and the role of gravitational energy within the principle of equi-
valence has been criticised by other researchers.) See also C. Will, Was Einstein Right? –
Putting General Relativity to the Test, Oxford University Press, 1993. See also his paper http://
arxiv.org/abs/gr-qc/9811036. Cited on pages 402, 418, and 513.
363 The calculation omits several smaller effects, such as rotation of the Earth and red-shift. For
insky & V.I. Panov, Soviet Physics – JETP 34, pp. 463–466, 1971. Modern results, with
errors less than one part in 1012 , are by Y. Su & al., New tests of the universality of free fall,
Physical Review D50, pp. 3614–3636, 1994. Several experiments have been proposed to test
the equality in space to less than one part in 1016 . Cited on pages 404 and 507.
368 The Thirring effect was predicted in H. Thirring, Über die Wirkung rotierender ferner
Massen in der Einsteinschen Gravitationstheorie, Physikalische Zeitschrift 19, pp. 33–39,
1918, and in H. Thirring, Berichtigung zu meiner Arbeit: “Über die Wirkung rotierender
Massen in der Einsteinschen Gravitationstheorie”, Physikalische Zeitschrift 22, p. 29, 1921.
The Thirring–Lense effect was predicted in J. Lense & H. Thirring, Über den Einfluß
der Eigenrotation der Zentralkörper auf die Bewegung der Planeten und Monde nach der
Einsteinschen Gravitationstheorie, Physikalische Zeitschrift 19, pp. 156–163, 1918. See also
Ref. 389. Cited on page 406.
bibliography 515
369 The feat used the LAGEOS and LAGEOS II satellites and is told in Ignazio Ciufolini, The
1995–99 measurements of the Thirring–Lense effect using laser-ranged satellites, Classical
and Quantum Gravity 17, pp. 2369–2380, 2000. See also I. Ciufolini & E.C. Pavlis, A
confirmation of the general relativistic prediction of the Lense–Thirring effect, Nature 431,
pp. 958–960, 2004. Cited on pages 406, 410, and 507.
370 The detection of the Thirring–Lense effect in binary pulsars is presented in R.D. Bl and-
ford, Lense–Thirring precession of radio pulsars, Journal of Astrophysics and Astronomy
16, pp. 191–206, 1995. Cited on page 407.
371 G. Holzmüller, Zeitschrift für Mathematik und Physik 15, p. 69, 1870, F. Tisserand,
Comptes Rendus 75, p. 760, 1872, and Comptes Rendus 110, p. 313, 1890. Cited on page 407.
372 B. Mashhoon, Gravitoelectromagnetism, http://arxiv.org/abs/gr-qc/0011014. See also its
extensive reference list on gravitomagnetism. Cited on page 408.
373 D. Bedford & P. Krumm, On relativistic gravitation, American Journal of Physics 53,
pp. 889–890, 1985, and P. Krumm & D. Bedford, The gravitational Poynting vector and
energy transfer, American Journal of Physics 55, pp. 362–363, 1987. Cited on pages 408
381 The beautiful summary by Daniel Kleppner, The gem of general relativity, Physics
Today 46, pp. 9–11, April 1993, appeared half a year before the authors of the cited work,
Joseph Taylor and Russel Hulse, received the Nobel Prize for the discovery of millisecond
pulsars. A more detailed review article is J.H. Taylor, Pulsar timing and relativistic grav-
ity, Philosophical Transactions of the Royal Society, London A 341, pp. 117–134, 1992. The
original paper is J.H. Taylor & J.M. Weisberg, Further experimental tests of relativ-
istic gravity using the binary pulsar PSR 1913+16, Astrophysical Journal 345, pp. 434–450,
1989. See also J.M. Weisberg, J.H. Taylor & L.A. Fowler, Pulsar PSR 1913+16 sen-
det Gravitationswellen, Spektrum der Wissenschaft, pp. 53–61, December 1981. Cited on page
416.
382 D.R. Lorimer, Binary and millisecond pulsars, in http://www.livingreviews.org/
lrr-2005-7, and J.M. Weisberg & J.H. Taylor, The relativistic binary pulsar B1913+16:
516 iii gravitation and rel ativity
thirty years of observations and analysis, pp. 25–31, in F.A. R asio & I.H. Stairs, editors,
Binary Radio Pulsars, Proceedings of a meeting held at the Aspen Center for Physics, USA,
12 Janaury - 16 January 2004, volume 328 of ASP Conference Series, Astronomical Society
of the Pacific, 2005. Cited on page 416.
383 W.B. B onnor & M.S. Piper, The gravitational wave rocket, Classical and Quantum Grav-
ity 14, pp. 2895–2904, 1997, or http://arxiv.org/abs/gr-qc/9702005. Cited on page 417.
384 L. Lerner, A simple calculation of the deflection of light in a Schwarzschild gravitational
field, American Journal of Physics 65, pp. 1194–1196, 1997. Cited on page 417.
385 A. Einstein, Über den Einfluß der Schwerkraft auf die Ausbreitung des Lichtes, Annalen
der Physik 35, p. 898, 1911. Cited on page 418.
386 I.I. Shapiro & al., Fourth test of general relativity, Physical Review Letters 13, pp. 789–
792, 1964. Cited on page 419.
387 I.I. Shapiro & al., Fourth test of general relativity: preliminary results, Physical Review
Letters 20, pp. 1265–1269, 1968. Cited on page 419.
399 Arguments for the emptiness of general covariance are given by John D. Norton, Gen-
eral covariance and the foundations of general relativity, Reports on Progress in Physics 56,
pp. 791–858, 1993. The opposite point, including the discussion of ‘absolute elements’, is
made in the book by J.L. Anderson, Principles of Relativity Physics, chapter 4, Academic
Press, 1967. Cited on page 435.
400 For a good introduction to mathematical physics, see the famous three-women text in
two volumes by Yvonne Choquet-Bruhat, Cecile DeWitt-Morette & Mar-
garet Dill ard-Bleick, Analysis, Manifolds, and Physics, North-Holland, 1996 and
2001. The first edition of this classic appeared in 1977. Cited on page 437.
401 See for example R.A. Knop & al., New constraints on Ω M , Ω Λ , and w from an independ-
ent set of eleven high-redshift supernovae observed with HST, Astrophysical Journal 598,
pp. 102–137, 2003. Cited on page 438.
402 R.P. Feynman, R.B. Leighton & M. Sands, The Feynman Lectures on Physics, Addison
Wesley, 1977, volume II, p. 42–14. Cited on page 439.
415 D.R. Lorimer & al., The Parkes multibeam pulsar survey: VI. Discovery and timing of
142 pulsars and a Galactic population analysis, Monthly Notices of the Royal Astronomical
Society preprint at http://arxiv.org/abs/astro-ph/0607640. Cited on page 451.
416 D. Figer, An upper limit to the masses of stars, Nature 434, pp. 192–194, 2005. Cited on
page 451.
417 G. Basri, The discovery of brown dwarfs, Scientific American 282, pp. 77–83, April 2001.
Cited on page 451.
418 P.M. Woods & C. Thompson, Soft gamma repeaters and anomalous X-ray pulsars: mag-
netar candidates, http://arxiv.org/abs/astro-ph/0406133. Cited on page 452.
419 B.M. Gaensler, N.M. McClure-Griffiths, M.S. Oey, M. Haverkorn, J.M.
Dickey & A.J. Green, A stellar wind bubble coincident with the anomalous X-ray pulsar
1E 1048.1-5937: are magnetars formed from massive progenitors?, The Astrophysical Journal
(Letters) 620, pp. L95–L98, 2005, or http://arxiv.org/abs/astro-ph/0501563. Cited on page
452.
427 The first oxygen seems to have appeared in the atmosphere, produced by microorganisms,
2.32 thousand million years ago. See A. Becker & al., Dating the rise of atmospheric
oxygen, Nature 427, pp. 117–120, 2003. Cited on page 459.
428 Gabriele Walker, Snowball Earth – The Story of the Great Global Catastrophe That
Spawned Life as We Know It, Crown Publishing, 2003. Cited on pages 460 and 843.
429 K. Knie, Spuren einer Sternexplosion, Physik in unserer Zeit 36, p. 8, 2005. The first step of
this connection is found in K.6 Knie, G. Korschinek, T. Faestermann, E.A. Dorfi,
G. Rugel & A. Wallner, 0 Fe anomaly in a deep-sea manganese crust and implications
for a nearby supernova source, Physics Review Letters 93, p. 171103, 2004, the second step in
N.D. Marsh & H. Svensmark, Low cloud properties influenced by cosmic rays, Physics
Review Letters 85, pp. 5004–5007, 2000, and the third step in P.B. de Menocal, Plio-
Pleistocene African climate, Science 270, pp. 53–59, 1995. Cited on page 460.
bibliography 519
430 A. Friedman, Über die Krümmung des Raumes, Zeitschrift für Physik 10, pp. 377–386,
1922, and A. Friedmann, Über die Möglichkeit einer Welt mit konstanter negativer Krüm-
mung des Raumes, Zeitschrift für Physik 21, pp. 326–332, 1924. (In the Latin transliteration,
the author aquired a second ‘n’ in his second paper.) Cited on page 462.
431 H. Knu tsen, Darkness at night, European Journal of Physics 18, pp. 295–302, 1997. Cited
on pages 466 and 467.
432 See for example P.D. Peşić, Brightness at night, American Journal of Physics 66, pp. 1013–
1015, 1998. Cited on page 467.
433 Paul Wesson, Olbers’ paradox and the spectral intensity of extra-galactic background
light, Astrophysical Journal 367, p. 399, 1991. Cited on page 467.
434 Steven Weinberg, Gravitation and Cosmology, John Wiley, 1972. An excellent book writ-
ten with a strong personal touch and stressing most of all the relation with experimental
data. It does not develop a strong feeling for space-time curvature, and does not address the
basic problems of space and time in general relativity. Excellent for learning how to actually
before he made Heaven and Earth?’ is not ‘He was preparing Hell for people who pry into
mysteries’. This frivolous retort has been made before now, so we are told, in order to evade
the point of the question. But it is one thing to make fun of the questioner and another to
find the answer. So I shall refrain from giving this reply. [...] But if before Heaven and Earth
there was no time, why is it demanded what you [god] did then? For there was no “then”
when there was no time.’ (Book XI, chapter 12 and 15). Cited on page 472.
442 Stephen Hawking, A Brief History of Time – From the Big Bang to Black Holes, 1988.
Reading this bestseller is almost a must for any physicist, as it is a frequent topic at dinner
parties. Cited on page 472.
443 Star details are explained in many texts. See for example ... Cited on page 474.
444 J. Pelt, R. Kayser, S. R ef sdal & T. Schramm, The light curve and the time delay of
QSO 0957+561, Astronomy and Astrophysics 305, p. 97, 1996. Cited on page 475.
520 iii gravitation and rel ativity
445 F. Zwicky, Nebulae as gravitational lenses, Physical Review Letters 51, p. 290, and F.
Zwicky, On the probability to detect nebulae which act as gravitational lenses, p. 679,
1937. The negative view by Einstein is found in A. Einstein, Lens-like action of a star
by the deviatioin of light in the gravitational field, Science 84, pp. 506–507, 1936. A review
on gravitational lensing can even be found online, in the paper by J. Wambsganss, Grav-
itational lensing in astronomy, Living Reviews in Relativity 1-12, pp. 1–80, 1998, to be found
on the http://www.livingreviews.org/Articles/Volume1/1998-12wamb website.
There is also the book by P. Schneider, J. Ehlers & E.E. Falco, Gravitational
Lenses, Springer Verlag, Berlin, 1992. Cited on page 476.
446 M. L achièze-R ey & J.-P. Luminet, Cosmic topology, Physics Reports 254, pp. 135–214,
1995. See also B.F. Roukema, The topology of the universe, http://arxiv.org/abs/astro-ph/
0010185 preprint. Cited on page 477.
447 Thanks to Steve Carlip for clarifying this point. Cited on page 478.
448 G.F.R. Ellis & T. Rothmann, Lost horizons, American Journal of Physics 61, pp. 883–
893, 1993. Cited on page 478.
455 The present status is given in the conference proceedings by Julian Barbour &
Herbert Pfister, eds., Mach’s Principle: From Newton’s Bucket to Quantum Gravity,
Birkhäuser, 1995. Various formulations of Mach’s principle – in fact, 21 different ones – are
compared on page 530.
In a related development, in 1953, Dennis Sciama published a paper in which he argues
that inertia of a particle is due to the gravitational attraction of all other matter in the uni-
verse. The paper is widely quoted, but makes no new statements on the issue. See D.W.
Sciama, On the origin of inertia, Monthly Notices of the Royal Astronomical Society 113,
pp. 34–42, 1953. Cited on page 482.
456 Information on the rotation of the universe is given in A. Kogu t, G. Hinshaw & A.J.
Banday, Limits to global rotation and shear from the COBE DMR four-year sky maps,
Physical Review D 55, pp. 1901–1905, 1997. Earlier information is found in J.D. Barrow,
bibliography 521
R. Juszkiewicz & D.H. Sonoda, Universal rotation: how large can it be?, Monthly No-
tices of the Royal Astronomical Society 213, pp. 917–943, 1985. See also J.D. Barrow, R.
Juszkiewicz & D.H. Sonoda, Structure of the cosmic microwave background, Nature
309, pp. 397–402, 1983, or E.F. Bunn, P.G. Fereira & J. Silk, How anisotropic is the
universe?, Physical Review Letters 77, pp. 2883–2886, 1996. Cited on page 483.
457 The issue has been discussed within linearized gravity by R ichard Tolman, in his text-
book Relativity, Thermodynamics, and Cosmology, Clarendon Press, 1934, on pp. 272–290.
The exact problem has been solved by A. Peres, Null electromagnetic fields in general re-
lativity theory, Physical Review 118, pp. 1105–1110, 1960, and by W.B. B onnor, The gravita-
tional field of light, Commun. Math. Phys. 13, pp. 163–174, 1969. See also N.V. Mitskievic
& K.K. Kumaradtya, The gravitational field of a spinning pencil of light, Journal of Math-
ematical Physics 30, pp. 1095–1099, 1989, and P.C. Aichelburg & R.U. Sexl, On the
gravitational field of a spinning particle, General Relativity and Gravitation 2, pp. 303–312,
1971. Cited on page 483.
458 See the delightful popular account by Igor Novikov, Black Holes and the Universe, Cam-
tweiler, Black Holes – Selected Reprints, American Association of Physics Teachers, 1982.
Cited on page 485.
463 The beautiful paper is R. Oppenheimer & H. Snyder, On continued gravitational con-
traction, Physical Review 56, pp. 455–459, 1939. Cited on page 486.
464 R.P. Kerr, Gravitational field of a spinning mass as an example of algebraically special
metrics, Physical Review Letters 11, pp. 237–238, 1963. Cited on page 490.
465 E.T. Newman, E. Couch, R. Chinnapared, A. Exton, A. Prakash & R. Tor-
rence, Metric of a rotating, charged mass, Journal of Mathematical Physics 6, pp. 918–919,
1965. Cited on page 490.
466 For a summary, see P.O. Mazur, Black hole uniqueness theorems, pp. 130–157, in M.A.H.
MacCallum, editor, General Relativity and Gravitation, Cambridge University Press,
1987, or the update at http://arxiv.org/abs/hep-th/0101012. See also D.C. Robinson, Four
522 iii gravitation and rel ativity
to R. Penrose, Gravitational collapse: the role of general relativity, Rivista del Nuovo Ci-
mento 1, pp. 252–276, 1969. Cited on page 498.
479 G.J. Stoney, On the physical units of nature, Philosophical Magazine 11, pp. 381–391, 1881.
Cited on page 500.
480 The geometrodynamic clock is discussed in D.E. Brahm & R.P. Gruber, Limitations of
the geometrodynamic clock, General Relativity and Gravitation 24, pp. 297–303, 1992. The
clock itself was introduced by R.F. Marzke, in his Ph.D. thesis The theory of measurement
in general relativity, 1959, with John Wheeler as thesis adviser. Cited on page 501.
481 R. Geroch, Einstein algebras, Commun. Math. Phys. 26, pp. 271–275, 1972. Cited on page
501.
482 A. Macd onald, Einstein’s hole argument, American Journal of Physics 69, pp. 223–225,
2001. Cited on page 502.
bibliography 523
498 Wormholes and nontrivial topologies ... A basic approach is the one by T. Diemer &
M. Hadley, Charge and the topology of spacetime, Classical and Quantum Gravity 16,
pp. 3567–3577, 1999, or http://www.arxive.org/abs/gr-qc/9905069 and M. Hadley, Spin
half in classical general relativity, Classical and Quantum Gravity 17, pp. 4187–4194, 2000,
or http://www.arxive.org/abs/gr-qc/0004029. Cited on page 508.
499 An important formulation of relativity is A. Ashtekar, New variables for classical and
quantum gravity, Physical Review Letters 57, pp. 2244–2247, 1986. Cited on page 509.
500 A well written text on the connections between the big bang and particle physics is by
I.L. Rozental, Big Bang – Big Bounce, How Particles and Fields Drive Cosmic Evolution,
Springer, 1988. For another connection, see M. Nagano & A.A. Watson, Observations
and implications of the ultrahigh energy cosmic rays, Reviews of Modern Physics 72, pp. 689–
732, 2000. Cited on page 509.
501 Teaching will benefit in particular from new formulations, from concentration on principles
and their consequences, as has happened in special relativity, from simpler descriptions at
the weak field level, and from future research in the theory of general relativity. The newer
W
hat is light? The study of relativity left us completely in the dark, even though
e had embarked in it precisely to find an answer to that question. True,
e have learned how the motion of light compares with that of objects. We also
learned that light is a moving entity that cannot be stopped; but we haven’t learned any-
Ref. 502 internal electricity; also the emission of light is an electrical process. As these aspects
are part of everyday life, we will leave aside all complications due to gravity and curved
space-time. The most productive way to study electrical motion is to start, as in the case
of gravity, with those types of motion which are generated without any contact between
the bodies involved.
Miletus, one of the original seven sages, in the sixth century bce. The same observation
regions in central Asia. When two stones of this mineral are put rubbed
comb
near each other, they attract or repel each other, depending on
their relative orientation. In addition, these stones attract objects
made of cobalt, nickel or iron.
Today we also find various small objects in nature with more
sophisticated properties, as shown in Figure 235. Some objects
enable you to switch on a television, others unlock car doors, still
others allow you to talk with far away friends.
All these observations show that in nature there are situations F I G U R E 236 How to
amaze kids
where bodies exert influence on others at a distance. The space
surrounding a body exerting such an influence is said to contain a field. A (physical) field
is thus an entity that manifests itself by accelerating other bodies in its region of space. A
Copyright © Christoph Schiller November 1997–January 2007
field is some ‘stuff ’ taking up space. Experiments show that fields have no mass. The field
surrounding the mineral found in Magnesia is called a magnetic field and the stones are
called magnets.* The field around amber – called ἤλεκτρον in Greek, from a root meaning
‘brilliant, shining’ – is called an electric field. The name is due to a proposal by the famous
English part-time physicist William Gilbert (1544–1603) who was physician to Queen
Elizabeth I. Objects surrounded by a permanent electric field are called electrets. They
are much less common than magnets; among others, they are used in certain loudspeaker
systems.**
* A pretty book about the history of magnetism and the excitement it generates is James D. L ivingston,
Driving Force – the Natural Magic of Magnets, Harvard University Press, 1996.
** The Kirlian effect, which allows one to make such intriguingly beautiful photographs, is due to a time-
amber, lodestone and mobile phones 527
The field around a mobile phone is called a radio field or, as we will see later, an elec-
tromagnetic field. In contrast to the previous fields, it oscillates over time. We will find
out later that many other objects are surrounded by such fields, though these are often
very weak. Objects that emit oscillating fields, such as mobile phones, are called radio
transmitters or radio emitters.
Fields influence bodies over a distance, without any material support. For a long time, Copyright © Christoph Schiller November 1997–January 2007
this was rarely found in everyday life, as most countries have laws to restrict machines
that use and produce such fields. The laws require that for any device that moves, pro-
duces sound, or creates moving pictures, the fields need to remain inside them. For this
reason a magician moving an object on a table via a hidden magnet still surprises and
entertains his audience. To feel the fascination of fields more strongly, a deeper look into
a few experimental results is worthwhile.
O b s e r va t i o n Magnetic field
Figure 237 show that the tip of a lightning flash advance with an average speed of around
600 km~s. But what is moving? To find out, we have to find a way of making lightning for
ourselves.
In 1995, the car company General Motors accidentally rediscovered an old and simple
method of achieving this. Their engineers had inadvertently built a spark generating
mechanism into their cars; when filling the petrol tank, sparks were generated, which
Ref. 504 sometimes lead to the explosion of the fuel. They had to recall 2 million vehicles of its
Opel brand.
amber, lodestone and mobile phones 529
What had the engineers done wrong? They had unwittingly copied the conditions for
a electrical device which anyone can build at home and which was originally invented by
William Thomson:* the Kelvin generator. Repeating his experiment today, we would take
two water taps, four empty bean or coffee cans, of which two have been opened at both
Ref. 505 sides, some nylon rope and some metal wire.
Putting this all together as shown in Figure 238, and letting the water flow, we find a
strange effect: large sparks periodically jump between the two copper wires at the point
where they are nearest to each other, giving out loud bangs. Can you guess what condition
for the flow has to be realized for this to work? And what did Opel do to repair the cars
Challenge 947 n they recalled?
If we stop the water flowing just before the next spark is due, we find that both buckets
are able to attract sawdust and pieces of paper. The generator thus does the same that
rubbing amber does, just with more bang for the buck(et). Both buckets are surrounded
by electric fields. The fields increase with time, until the spark jumps. Just after the spark,
* William Thomson (1824–1907), important Irish Unionist physicist and professor at Glasgow University.
He worked on the determination of the age of the Earth, showing that it was much older than 6000 years, as
several sects believed. He strongly influenced the development of the theory of magnetism and electricity, the
description of the aether and thermodynamics. He propagated the use of the term ‘energy’ as it is used today,
instead of the confusing older terms. He was one of the last scientists to propagate mechanical analogies for
the explanation of phenomena, and thus strongly opposed Maxwell’s description of electromagnetism. It
was mainly for this reason that he failed to receive a Nobel Prize. He was also one of the minds behind the
laying of the first transatlantic telegraphic cable. Victorian to his bones, when he was knighted, he chose the
name of a small brook near his home as his new name; thus he became Baron Kelvin of Largs. Therefore the
unit of temperature obtained its name from a small Scottish river.
** In fact, there are many other ways to produces sparks or even arcs, i.e. sustained sparks; there is even
a complete subculture of people who do this as a hobby at home. Those who have a larger budget do it
professionally, in particle accelerators. See the http://www.kronjaeger.com/hv website.
530 iv cl assical electrodynamics • 13. electricity and fields
water
nylon ropes pipe nylon ropes
metal cylinders
bang!
metal wires
metal cans
on the roof
pendulum
with metal
ball
in the hall
in the ground
Of course, one has to check whether natural lightning is actually electrical in origin.
In 1752, experiments performed in France, following a suggestion by Benjamin Franklin,
Copyright © Christoph Schiller November 1997–January 2007
published in London in 1751, showed that one can indeed draw electricity from a thun-
derstorm via a long rod.* These French experiments made Franklin famous worldwide;
they were also the start of the use of lightning rods all over the world. Later, Franklin had
Ref. 506 a lightning rod built through his own house, but of a somewhat unusual type, as shown
in Figure 239. Can you guess what it did in his hall during bad weather, all parts being
Challenge 948 n made of metal? (Do not repeat this experiment; the device can kill.)
* The details of how lightning is generated and how it propagates are still a topic of research. An introduction
is given on page 610.
amber, lodestone and mobile phones 531
=
q ma
, (392)
q ref m ref a ref
i.e., by comparing it with the acceleration and mass of the reference charge. This definition
reflects the observation that mass alone is not sufficient for a complete characterization
of a body. For a full description of motion we need to know its electric charge; charge is
therefore the second intrinsic property of bodies that we discover in our walk.
Nowadays the unit of charge, the coulomb, is defined through a standard flow through
metal wires, as explained in Appendix B. This is possible because all experiments show
that charge is conserved, that it flows, that it flows continuously and that it can accumu- Copyright © Christoph Schiller November 1997–January 2007
late. Charge thus behaves like a fluid substance. Therefore we are forced to use for its
description a scalar quantity q, which can take positive, vanishing, or negative values.
In everyday life these properties of electric charge, listed also in Table 43, describe ob-
servations with sufficient accuracy. However, as in the case of all previously encountered
classical concepts, these experimental results for electrical charge will turn out to be only
approximate. More precise experiments will require a revision of several properties. How-
ever, no counter-example to charge conservation has as yet been observed.
A charged object brought near a neutral one polarizes it. Electrical polarization is the
separation of the positive and negative charges in a body. For this reason, even neutral
objects, such as hair, can be attracted to a charged body, such as a comb. Generally, both
insulators and conductors can be polarized; this occurs for whole stars down to single
molecules.
532 iv cl assical electrodynamics • 13. electricity and fields
O b s e r va t i o n Charge
taken at every point in space x. The definition of the electric field is thus based on how it
Challenge 950 e moves charges.* The field is measured in multiples of the unit N~C or V~m.
To describe the motion due to electricity completely, we need a relation explaining how
charges produce electric fields. This relation was established with precision (but not for
the first time) by Charles-Augustin de Coulomb on his private estate, during the French
Revolution.** He found that around any small-sized or any spherical charge Q at rest
there is an electric field. At a position r, the electric field E is given by
Later we will extend the relation for a charge in motion. The bizarre proportionality con-
stant, built around the so-called permittivity of free space ε 0 , is due to the historical way
Copyright © Christoph Schiller November 1997–January 2007
the unit of charge was defined first.*** The essential point of the formula is the decrease of
Challenge 952 n the field with the square of the distance; can you imagine the origin of this dependence?
The two previous equations allow one to write the interaction between two charged
Challenge 949 ny * Does the definition of electric field given here assume a charge speed that is much less than that of light?
** Charles-Augustin de Coulomb (b. 1736 Angoulême, d. 1806 Paris), French engineer and physicist. His
careful experiments on electric charges provided a firm basis for the study of electricity.
*** Other definitions of this and other proportionality constants to be encountered later are possible,
leading to unit systems different from the SI system used here. The SI system is presented in detail in
Appendix B. Among the older competitors, the Gaussian unit system often used in theoretical calculations,
the Heaviside–Lorentz unit system, the electrostatic unit system and the electromagnetic unit system are
Ref. 507 the most important ones.
amber, lodestone and mobile phones 533
O b s e r va t i o n Electric field
bodies as
= =−
dp1 1 q1 q2 r dp2
, (395)
dt 4πε 0 r r
2 dt
where dp is the momentum change, and r is the vector connecting the two centres of mass.
This famous expression for electrostatic attraction and repulsion, also due to Coulomb,
is valid only for charged bodies that are of small size or spherical, and most of all, that are Copyright © Christoph Schiller November 1997–January 2007
at rest.
Electric fields have two main properties: they contain energy and they can polarize
bodies. The energy content is due to the electrostatic interaction between charges. The
strength of the interaction is considerable. For example, it is the basis for the force of our
muscles. Muscular force is a macroscopic effect of equation 395. Another example is the
material strength of steel or diamond. As we will discover, all atoms are held together
by electrostatic attraction. To convince yourself of the strength of electrostatic attraction,
answer the following: What is the force between two boxes with a gram of protons each,
Challenge 953 n located on the two poles of the Earth? Try to guess the result before you calculate the
astonishing value.
Coulomb’s relation for the field around a charge can be rephrased in a way that helps
to generalize it to non-spherical bodies. Take a closed surface, i.e., a surface than encloses
534 iv cl assical electrodynamics • 13. electricity and fields
a certain volume. Then the integral of the electric field over this surface, the electric flux,
is the enclosed charge Q divided by ε 0 :
E dA =
Q
∫closedsurface ε0
. (396)
Challenge 954 n This mathematical relation, called Gauss’s ‘law’, follows from the result of Coulomb. (In
the simplified form given here, it is valid only for static situations.) Since inside conduct-
ors the electrical field is zero, Gauss’s ‘law’ implies, for example, that if a charge q is sur-
rounded by a uncharged metal sphere, the outer surface of the metal sphere shows the
Challenge 955 e same charge q.
Owing to the strength of electromagnetic interactions, separating charges is not an
easy task. This is the reason that electrical effects have only been commonly used for
about a hundred years. We had to wait for practical and efficient devices to be invented
tric charge should fall, have inertia and be separable from matter. Indeed, each of these
effects has been observed.** For example, when a long metal rod is kept vertically, we can
measure an electrical potential difference, a voltage, between the top and the bottom. In
other words, we can measure the weight of electricity in this way. Similarly, we can meas-
Ref. 508 ure the potential difference between the ends of an accelerated rod. Alternatively, we can
measure the potential difference between the centre and the rim of a rotating metal disc.
The last experiment was, in fact, the way in which the ratio q~m for currents in metals
be subject to centrifugation
a resist acceleration
for all metals, with small variations in the second digit. In short, electrical current has
mass. Therefore, whenever we switch on an electrical current, we get a recoil. This simple
Ref. 509 effect can easily be measured and confirms the mass to charge ratio just given. Also, the
emission of current into air or into vacuum is observed; in fact, every television tube
Copyright © Christoph Schiller November 1997–January 2007
uses this principle to generate the beam producing the picture. It works best for metal
Ref. 510 objects with sharp, pointed tips. The rays created this way – we could say that they are
‘free’ electricity – are called cathode rays. Within a few per cent, they show the same mass
to charge ratio as expression (397). This correspondence thus shows that charges move
almost as freely in metals as in air; this is the reason metals are such good conductors.
If electric charge falls inside vertical metal rods, we can make the astonishing deduc-
tion that cathode rays – as we will see later, they consist of free electrons* – should not
* The name ‘electron’ is due to George Stoney. Electrons are the smallest and lightest charges moving in
metals; they are, usually – but not always – the ‘atoms’ of electricity – for example in metals. Their charge
is small, 0.16 aC, so that flows of charge typical of everyday life consist of large numbers of electrons; as a
result, electrical charge behaves like a continuous fluid. The particle itself was discovered and presented in
536 iv cl assical electrodynamics • 13. electricity and fields
O b s e r va t i o n Current
electricity using electrons but by using ions. The finer details were clarified only in the
twentieth century. Nerve signals propagate using the motion of sodium and potassium
ions in the cell membrane of the nerve. The resulting signal speed is between 0.5 m~s and
120 m~s, depending on the type of nerve. This speed is sufficient for the survival of most
species – it signals the body to run away in case of danger.
Being electrically controlled, all mammals can sense strong electric fields. Humans
can sense fields down to around 10 kV~m, when hair stands on end. In contrast, several
animals can sense weak electric and magnetic fields. Sharks, for example, can detect fields
down to 1 µV~m using special sensors, the Ampullae of Lorenzini, which are found around
their mouth. Sharks use them to detect the field created by prey moving in water; this
allows them to catch their prey even in the dark. Several freshwater fish are also able to
detect electric fields. The salamander and the platypus, the famous duck-billed mammal,
can also sense electric fields. Like sharks, they use them to detect prey in water which is
too muddy to see through. Certain fish, the so-called weakly-electric fish, even generate a
The study of magnetism progressed across the world independently of the study of elec-
tricity. Towards the end of the twelfth century, the compass came into use in Europe. At
that time, there were heated debates on whether it pointed to the north or the south. In
1269, the French military engineer Pierre de Maricourt (1219–1292) published his study of
Ref. 513 magnetic materials. He found that every magnet has two points of highest magnetization,
and he called them poles. He found that even after a magnet is cut, the resulting pieces
always retain two poles: one points to the north and the other to the south when the stone
is left free to rotate. Magnets are dipoles. Atoms are either dipoles or unmagnetic. There
* It took until the year 2000 for technology to make use of the same effect. Nowadays, airbag sensors in cars
often use electric fields to sense whether the person sitting in the seat is a child or an adult, thus changing
the way that the bag behaves in an accident.
538 iv cl assical electrodynamics • 13. electricity and fields
It is known that honey bees, sharks, pigeons, salmon, trout, sea turtles and certain bac- ”
Ref. 514 teria can feel magnetic fields. One speaks of the ability for magnetoreception. All these
life forms use this ability for navigation. The most common detection method is the use
of small magnetic particles inside a cell; the cell then senses how these small built-in mag-
nets move in a magnetic field. The magnets are tiny, typically around 50 nm in size. These
small magnets are used to navigate along the magnetic field of the Earth. For higher an-
imals, the variations of the magnetic field of the Earth, 20 to 70 µT, produce a landscape
Copyright © Christoph Schiller November 1997–January 2007
that is similar to the visible landscape for humans. They can remember it and use it for
navigation.
Can humans feel magnetic fields? Magnetic material seems to be present in the human
brain, but whether humans can feel magnetic fields is still an open issue. Maybe you can
Challenge 962 r devise a way to check this?
Are magnetism and electricity related? François Arago* found out that they were. He
observed that a ship that had survived a bad thunderstorm and had been struck by light-
ning, needed a new compass. Thus lightning has the ability to demagnetize compasses.
Arago knew, like Franklin, that lightning is an electrical phenomena. In other words, elec-
current-
carrying
metal
wire N S
N S battery
current-carrying compass
magnet
metal wire wire needle
tricity and magnetism must be related. More precisely, magnetism must be related to the
The reason for Lenin’s famous statement were two discoveries. One was made in 1820
Lenin.*
”
by the Danish physicist Hans Christian Oersted (1777–1851) and the other in 1831 by the
English physicist Michael Faraday.** The consequences of these experiments changed the
world completely in less than one century.
On the 21st of July of 1821, Oersted published a leaflet, in Latin, which took Europe by
storm. Oersted had found (during a lecture demonstration to his students) that when a
current is sent through a wire, a nearby magnet is put into motion. In other words, he
found that the flow of electricity can move bodies.
Further experiments show that two wires in which charges flow attract or repel each
other, depending on whether the currents are parallel or antiparallel. These and other
experiments show that wires in which electricity flows behave like magnets.*** In other
words, Oersted had found the definite proof that electricity could be turned into magnet-
ism.
Copyright © Christoph Schiller November 1997–January 2007
* Lenin (b. 1870 Simbirsk, d. 1924 Gorki), founder of the Union of Soviet Socialist Republics, in 1920 stated
this as the centre of his development plan for the country. In Russian, the local councils of that time were
called soviets.
** Michael Faraday (b. 1791 Newington Butts, d. 1867 London) was born to a simple family, without school-
ing, and of deep and naive religious ideas. As a boy he became assistant to the most famous chemist of his
time, Humphry Davy (1778–1829). He had no mathematical training, but late in his life he became member
of the Royal Society. A modest man, he refused all other honours in his life. He worked on chemical topics,
the atomic structure of matter and, most of all, he developed the idea of (magnetic) fields and field lines.
He used fields to describe all his numerous experimental discoveries about electromagnetism, such as the
Faraday effect. Fields were later described mathematically by Maxwell, who at that time was the only person
in Europe to take over Faraday’s field concept.
*** In fact, if one imagines tiny currents moving in circles inside magnets, one gets a unique description for
all magnetic fields observed in nature.
540 iv cl assical electrodynamics • 13. electricity and fields
Shortly afterwards, Ampère* found that coils increase these effects dramatically. Coils
behave like small magnets. In particular, coils, like magnetic fields, always have two poles,
usually called the north and the south pole. Opposite poles attract, like poles repel each
other. As is well known, the Earth is itself a large magnet, with its magnetic north pole
near the geographic south pole, and vice versa. (The magnetic field of the Earth is not due
to a solid permanent magnet inside it. The Earth’s solid core is to hot to be a permanent
magnet; instead, the magnetic field is due to circulating currents in the outer, liquid core.)
Moving electric charge produces magnetic fields. This result explains why magnetic
fields always have two poles. The lack of magnetic monopoles thus becomes clear. But
one topic is strange. If magnetic fields are due to the motion of charges, this must be also
the case for a normal magnet. Can this be shown?
In 1915, two men in the Netherlands found a simple way to prove that
even in a magnet, something is moving. They suspended a metal rod
from the ceiling by a thin thread and then put a coil around the rod, as
* André-Marie Ampère (b. 1775 Lyon, d. 1836 Marseille), French physicist and mathematician. Autodidact,
he read the famous Encyclopédie as a child; in a life full of personal tragedies, he wandered from maths to
chemistry and physics, worked as a school teacher, and published nothing of importance until 1820. Then
the discovery of Oersted reached all over Europe: electrical current can deviate magnetic needles. Ampère
worked for years on the problem, and in 1826 published the summary of his findings, which lead Maxwell
to call him the ‘Newton of electricity’. Ampère named and developed many areas of electrodynamics. In
1832, he and his technician also built the first dynamo, or rotative current generator. Of course, the unit of
electrical current is named after him.
** Wander Johannes de Haas (1878–1960), Dutch physicist. De Haas is best known for two additional
magneto-electric effects named after him, the Shubnikov–de Haas effect (the strong increase of the mag-
netic resistance of bismuth at low temperatures and high magnetic fields) and the de Haas–Van Alphen
effect (the diamagnetic susceptibility of bismuth at low temperatures is a periodic function of the magnetic
field).
*** A ferromagnetic material is a special kind of paramagnetic material that has a permanent magnetization.
amber, lodestone and mobile phones 541
magnet magnet
diamagnetic paramagnetic
material material
(usually) play a larger role in magnetism than their orbital angular momentum.
0 −B z B y
B = Bz 0 −B x , (398)
−B y B x 0 Copyright © Christoph Schiller November 1997–January 2007
called an antisymmetric tensor. In summary, magnetic fields are defined by the accelera-
tion they impart on moving charges. This acceleration turns out to follow
a= vB = v B
e e
(399)
m m
a relation which is often called Lorentz acceleration, after the important Dutch physicist
* The quantity B was not called the ‘magnetic field’ until recently. We follow here the modern, logical defin-
ition, which supersedes the traditional one, where B was called the ‘magnetic flux density’ or ‘magnetic
induction’ and another quantity, H, was called – incorrectly, but for over a century – the magnetic field. This
quantity H will not appear in this walk, but it is important for the description of magnetism in materials.
542 iv cl assical electrodynamics • 13. electricity and fields
Hendrik A. Lorentz (b. 1853 Arnhem, d. 1928 Haarlem) who first stated it clearly.* (The
relation is also called the Laplace acceleration.) The unit of the magnetic field is called
tesla and is abbreviated T. One has 1 T = 1 N s~C m = 1 V s~m2 = 1 V s2 ~A m.
The Lorentz acceleration is the effect at the root of any electric motor. An electric motor
is a device that uses a magnetic field as efficiently as possible to accelerate charges flowing
in a wire. Through the motion of the charges, the wire is then also moved. Electricity is
thus transformed into magnetism and then into motion. The first efficient motor was
built back in 1834.
As in the electric case, we need to know how the strength of a magnetic field is de-
termined. Experiments such as Oersted’s show that the magnetic field is due to moving
charges, and that a charge moving with velocity v produces a field B given by
also mixed up when changing from one inertial frame to the other. The theory of spe-
cial relativity thus tells us that there must be a single concept, an electromagnetic field,
describing them both. Investigating the details, one finds that the electromagnetic field F
surrounding charged bodies has to be described by an antisymmetric 4-tensor
0 −E x ~c −E y ~c −E z ~c 0 E x ~c E y ~c E z ~c
E ~c −E x ~c By
= x or F µν = .
0 −B z By 0 −B z
E y ~c −E y ~c B z −B x
F µν
Bz 0 −B x 0
E z ~c −B y Bx 0 −E z ~c −B y Bx 0
(401)
Challenge 964 ny * Does the definition of magnetic field given here assume a charge speed much lower than that of light?
amber, lodestone and mobile phones 543
Obviously, the electromagnetic field F, and thus every component of these matrices, de-
pends on space and time. The matrices show that electricity and magnetism are two faces
of the same effect.* In addition, since electric fields appear only in the topmost row and
leftmost column, the expressions show that in everyday life, for small speeds, electricity
Challenge 967 n and magnetism can be separated. (Why?)
The total influence of electric and magnetic fields on fixed or moving charges is then
given by the following expression for the relativistic force-acceleration relation K = mb:
mb = qFu or
du µ
m = qF µ ν u ν or
dτ
γc 0 E x ~c E y ~c E z ~c γc
d γv x E x ~c
m = q
0 Bz −B y γv x
~c
or
dτ γv E −B 0 Bx γv y
γv z E z ~c B y
which show how the work W and the three-force d p~dt depend on the electric and mag-
netic fields. All four expressions describe the same content; the simplicity of the first one
is the reason for the involved matrices (401) of the electromagnetic field. In fact, the exten-
ded Lorentz relation (402) is the definition of the electromagnetic field, since the field is
defined as that ‘stuff ’ which accelerates charges. In particular, all devices that put charges
into motion, such as batteries and dynamos, as well as all devices that are put into motion
by flowing charges, such as electric motors and muscles, are described by this relation.
That is why this relation is usually studied, in simple form, already in school. The Lorentz
relation describes all cases in which the motion of objects can be seen by the naked eye or
felt by our senses, such as the movement of an electrical motor in a high speed train, in a
lift and in a dental drill, the motion of the picture generating electron beam in a television
Ref. 517, Ref. 518 tube, or the travelling of an electrical signal in a cable and in the nerves of the body.
In equation (402) it is understood that one sums over indices that appear twice. The
electromagnetic field tensor F is an antisymmetric 4-tensor. (Can you write down the
Challenge 968 ny relation between F µν , F µν and F µ ν ?) Like any such tensor, it has two invariants, i.e., two
deduced properties that are the same for every observer: the expression B 2 − E 2 ~c 2 =
1
tr F2 and the product 4EB = −c tr F F. (Can you confirm this, using the definition of
Copyright © Christoph Schiller November 1997–January 2007
2
Challenge 969 n trace as the sum of the diagonal elements?)
The first invariant expression turns out to be the Lagrangian of the electromagnetic
field. It is a scalar and implies that if E is larger, smaller, or equal to cB for one observer,
it also is for all other observers. The second invariant, a pseudoscalar, describes whether
the angle between the electric and the magnetic field is acute or obtuse for all observers.**
The application of electromagnetic effects to daily life has opened up a whole new
world that did not exist before. Electrical light, electric motors, radio, telephone, X-rays,
television and computers have changed human life completely in less than one century.
º
* Actually, the expression for the field contains everywhere the expression 1~ µ o ε 0 instead of the speed of
light c. We will explain the reason for this substitution shortly.
** There is in fact a third Lorentz invariant, much less known. It is specific to the electromagnetic field and
544 iv cl assical electrodynamics • 13. electricity and fields
For example, the installation of electric lighting in city streets has almost eliminated the
previously so common night assaults. These and all other electrical devices exploit the
fact that charges can flow in metals and, in particular, that electromagnetic energy can
be transformed
— into mechanical energy – as used in loudspeakers, motors, piezo crystals;
— into light – as in lamps and lasers;
— into heat – as in electric ovens, blankets and tea pots;
— into chemical effects – as in hydrolysis, battery charging and electroplating;
— into coldness – as in refrigerators and Peltier elements;
— into radiation signals – as in radio and television;
— into stored information – as in magnetic records and in computers.
All electric motors are based on the result that electric currents interact with magnetic
Michael Faraday
”
fields. The simplest example is the attraction of two wires carrying parallel currents. This
observation alone, made in 1820 by Ampère, is sufficient to make motion larger than a
Ref. 520 certain maximal speed impossible.
The argument is beautifully simple. We
change the original experiment and imagine
v
two long, electrically charged rods of mass m,
d charged rods
moving in the same direction with velocity v
and separation d. An observer moving with
v
the rods would see an electrostatic repulsion
Challenge 971 e between the rods given by F I G U R E 246 The relativistic aspect of
magnetism
1 2λ 2
ma e = − (404)
4πε 0 d
where λ is the charge per length of the rods. A second, resting observer sees two effects:
the electrostatic repulsion and the attraction discovered by Ampère. The second observer
Copyright © Christoph Schiller November 1997–January 2007
This expression must be consistent with the observation of the first observer. This is the
case only if both observers find repulsions. It is easy to check that the second observer
sees a repulsion, as does the first one, only if
v2 <
1
. (406)
ε0 µ0
This maximum speed, with a value of 0.3 GM~s, is thus valid for any object carrying
charges. But all everyday objects contain charges: there is thus a maximum speed for
matter.
Are you able to extend the argument for a maximum speed to neutral particles as well?
“ Augustine of Hippo
Before we study the motion of an electromagnetic field in detail, let’s have some fun with ”
electricity.
**
Page 608 * ‘Electrons move in metal with a speed of about 1 µm~s; thus if I walk with the same speed along a cable
Challenge 974 ny carrying a constant current, I should not be able to sense any magnetic field.’ What is wrong with this argu-
ment?
546 iv cl assical electrodynamics • 13. electricity and fields
O b s e r va t i o n Vo l t a g e
capacitive head
(c.10-20 pF to earth)
F I G U R E 247 The schematics, the realization and the operation of a Tesla coil, including spark
and corona discharges (photographs © Robert Billon)
Nowadays, having fun with sparks is straightforward. Tesla coils, named after Nikola
amber, lodestone and mobile phones 547
Tesla * are the simplest devices that allow long sparks to be produced at home. Atten-
tion: this is dangerous; that is the reason that such devices cannot be bought (almost)
anywhere. The basic diagram and an example is shown in Figure 247. Tesla coils look
like large metal mushrooms (to avoid unwanted discharges) and plans for their construc-
tion can be found on numerous websites or from numerous enthusiast’s clubs, such as
http://www.stefan-kluge.de.
**
In 1722, George Graham discovered, by watching a compass needle, that the magnetic
Challenge 975 n field of the Earth shows daily variations. Can you imagine why these variations occur?
**
If even knocking on a wooden door is an electric effect, we should be able to detect fields
Challenge 976 ny when doing so. Can you devise an experiment to check this?
**
How do you wire up a light bulb to the mains and three switches so that the light can
Challenge 979 n be switched on at any of the switches and off at any other switch? And for four switches?
Nobody will take a physicist seriously who is able to write Maxwell’s equations but cannot
solve this little problem.
**
The first appliances built to generate electric currents were large rubbing machines. Then,
in 1799 the Italian scientist Alessandro Volta (1745–1827) invented a new device to gen-
erate electricity and called it a pile; today it is called a (voltaic) cell or, less correctly, a Copyright © Christoph Schiller November 1997–January 2007
battery. Voltaic cells are based on chemical processes; they provide much more current
and are smaller and easier to handle than electrostatic machines. The invention of the
battery changed the investigation of electricity so profoundly that Volta became world
famous. At last, a simple and reliable source of electricity was available for use in experi-
ments; unlike rubbing machines, piles are compact, work in all weather conditions and
make no noise.
* Никола Тесла (1856 Smiljan–1943 New York City), Serbian engineer and inventor. He invented and pro-
moted the polyphase alternating current system, the alternating current electric motor, wireless communic-
ation, fluorescent lighting and many other applications of electricity. He is also one of the inventors of radio.
The SI unit of the magnetic field is named after him. A flamboyant character, his ideas were sometimes
unrealistic; for example he imagined that Tesla coils could be used for wireless power transmission.
548 iv cl assical electrodynamics • 13. electricity and fields
An apple or a potato with a piece of copper and one of zinc inserted is one of the
simplest possible voltaic cells. It provides a voltage of about 1 V and can be used to run
digital clocks or to produce clicks in headphones. Volta was also the discoverer of the
charge law q = CU of capacitors (C being the capacity, and U the voltage) and the inventor
of the high sensitivity capacitor electroscope. A modest man, nevertheless, the unit of
electrical potential, or ‘tension’, as Volta used to call it, was deduced from his name. A
‘battery’ is a large number of voltaic cells; the term was taken from an earlier, almost
purely military use.* A battery in a mobile phone is just an elaborated replacement for a
number of apples or potatoes.
**
A PC or a telephone can communicate without wires, by using radio waves. Why are
these and other electrical appliances not able to obtain their power via radio waves, thus
Challenge 981 n eliminating power cables?
**
Electromagnetism is full of surprises and offers many effects that can be reproduced at
home. The internet is full of descriptions of how to construct Tesla coils to produce sparks, Copyright © Christoph Schiller November 1997–January 2007
coil guns or rail guns to shoot objects, electrostatic machines to make your hair stand on
end, glass spheres with touch-sensitive discharges and much more. If you like experi-
ments, just search for these terms.
**
A high voltage can lead to current flow through air, because air becomes conductive in
high electric fields. In such discharges, air molecules are put in motion. As a result, one
can make objects that are attached to a pulsed high tension source lift up in the air, if one
optimizes this air motion so that it points downwards everywhere. The high tension is
* A pile made of sets of a zinc plate, a sheet of blotting paper soaked with salt water and a copper coin is
Challenge 980 ny easily constructed at home.
amber, lodestone and mobile phones 549
thus effectively used to accelerate ionized air in one direction and, as a result, an object
will move in the opposite direction, using the same principle as a rocket. An example is
shown in Figure 248, using the power supply of a PC monitor. (Watch out: danger!) Nu-
merous websites explain how to build these so-called lifters at home; in Figure 248, the
bottle and the candle are used as high voltage insulator to keep one of the two thin high
voltage wires (not visible in the photograph) high enough in the air, in order to avoid dis-
charges to the environment or to interfere with the lifter’s motion. Unfortunately, the ma-
jority of websites – not all – give incorrect or confused explanations of the phenomenon.
These websites thus provide a good challenge for one to learn to distinguish fact from
Challenge 984 e speculation.
**
The electric effects produced by friction and by liquid flow are usually small. However, in
the 1990s, a number oil tankers disappeared suddenly. The sailors had washed out the oil
tanks by hosing sea water onto the tank walls. The spraying led to charging of the tank; Copyright © Christoph Schiller November 1997–January 2007
a discharge then led to the oil fumes in the tank igniting. This led to an explosion and
subsequently the tankers sank. Similar accidents also happen regularly when chemicals
are moved from one tank to another.
**
Rubbing a plastic spoon with a piece of wool charges it. Such a charged spoon can be
used to extract pepper from a salt–pepper mixture by holding the spoon over the mixture.
Challenge 985 n Why?
**
When charges move, they produce a magnetic field. In particular, when ions inside the
Earth move due to heat convection, they produce the Earth’s magnetic field. When the
550 iv cl assical electrodynamics • 13. electricity and fields
ions high up in the atmosphere are moved by solar wind, a geomagnetic storm appears; its
field strength can be as high as that of the Earth itself. In 2003, an additional mechanism
was discovered. When the tides move the water of the oceans, the ions in the salt water
produce a tiny magnetic field; it can be measured by highly sensitive magnetometers in
satellites orbiting the Earth. After two years of measurements from a small satellite it was
Ref. 521 possible to make a beautiful film of the oceanic flows. Figure 249 gives an impression.
**
The names electrode, electrolyte, ion, anode and cathode were suggested by William
Whewell (1794–1866) on demand of Michael Faraday; Faraday had no formal education
and asked his friend Whewell to form two Greek words for him. For anode and cathode, Copyright © Christoph Schiller November 1997–January 2007
Whewell took words that literally mean ‘upward street’ and ‘downward street’. Faraday
then popularized these terms, like the other words mentioned above.
**
The shortest light pulse produced so far had a length of 100 as. To how many wavelengths
Challenge 986 n of green light would that correspond?
**
Why do we often see shadows of houses and shadows of trees, but never shadows of the
Challenge 987 n electrical cables hanging over streets?
**
amber, lodestone and mobile phones 551
suspending
battery wire
mercury
**
The ionosphere around the Earth has a resonant frequency of 7 Hz; for this reason any
apparatus measuring low frequencies always gets a strong signal at this value. Can you
Challenge 990 n give an explanation of the frequency?
**
Challenge 991 n The Sun is visible to the naked eye only up to a distance of 50 light years. Is this true?
**
At home, electricity is mostly used as alternating current. In other words, no electrons
actually flow through cables; as the drift speed of electrons in copper wires is of the order
552 iv cl assical electrodynamics • 13. electricity and fields
electrical hydraulic
component component
current, mass flow,
voltage pressure
wire tube
flexible &
capacitor elastic
closure
battery pump
activated
transistor valve
inductor challenge
Page 608 of 1 µm~s, electrons just move back and forward by 20 nm. Nothing flows in or out of the
cables! Why do the electricity companies require a real flow of money in return, instead
Challenge 992 e of being satisfied with a back and forth motion of money? Copyright © Christoph Schiller November 1997–January 2007
**
Comparing electricity with water is a good way of understanding electronics. Figure 252
shows a few examples that even a teenager can use. Can you fill in the correspondence
Challenge 993 n for the coil, and thus for a transformer?
**
Do electrons and protons have the same charge? Experiments show that the values are
Challenge 994 ny equal to within at least twenty digits. How would you check this?
**
Challenge 995 ny Charge is also velocity-independent. How would you check this?
amber, lodestone and mobile phones 553
**
Magnets can be used, even by school children, to climb steel walls. Have a look at the
http://www.physicslessons.com/TPNN.htm website.
**
Extremely high magnetic fields have strange effects. At fields of 1010 T, vacuum becomes
effectively birefringent, photons can split and coalesce, and atoms get squeezed. Hydro-
gen atoms, for example, are estimated to get two hundred times narrower in one direction.
Fortunately, these conditions exist only in specific neutron stars, called magnetars.
**
A good way to make money is to produce electricity and sell it. In 1964, a completely new
method was invented by Fletcher Osterle. The method was presented to a larger public in
Ref. 524 a beautiful experiment in 2003. One can take a plate of glass, add a conducting layers on
**
Ohm’s law, the observation that for almost all materials the current is proportional to
the voltage, is due to a school teacher. Georg Simon Ohm* explored the question in great Copyright © Christoph Schiller November 1997–January 2007
depth; in those days, such measurements were difficult to perform. This has changed now.
Recently, even the electrical resistance of single atoms has been measured: in the case of
Ref. 580 xenon it turned out to be about 105 Ω. It was also found that lead atoms are ten times
Challenge 997 ny more conductive than gold atoms. Can you imagine why?
**
Ref. 581 The charges on two capacitors in series are not generally equal, as naive theory states.
* Georg Simon Ohm (b. 1789 Erlangen, d. 1854 München), Bavarian school teacher and physicist. His ef-
forts were recognized only late in his life, and he eventually was promoted to professor at the University in
München. Later the unit of electrical resistance, the proportionality factor between voltage and current, was
named after him.
554 iv cl assical electrodynamics • 13. electricity and fields
insulators
C1 high tension line
C2 wires
neon lamp
F I G U R E 253 F I G U R E 254 Small neon lamps on a high
Capacitors in voltage cable
series
For perfect, leak-free capacitors the voltage ratio is given by the inverse capacity ratio
V1 ~V2 = C 2 ~C 1 , due to the equality of the electric charges stored. This is easily deduced
from Figure 253. However, in practice this is only correct for times between a few and a
**
Electric polarizability is the property of matter responsible for the deviation of water flow-
Page 526 ing from a tap caused by a charged comb. It is defined as the strength of electric dipole
induced by an applied electric field. The definition simply translates the observation that
many objects acquire a charge when an electric field is applied. Incidentally, how precisely
combs get charged when rubbed, a phenomenon called electrification, is still one of the
mysteries of modern science.
**
A pure magnetic field cannot be transformed into a pure electric field by change of ob-
servation frame. The best that can be achieved is a state similar to an equal mixture of
Challenge 999 ny magnetic and electric fields. Can you provide an argument elucidating this relation?
common basic principles, as shown in Figures 255 and 256. Twenty years later, Heaviside
and Hertz extracted the main points of Maxwell ideas, calling their summary Maxwell’s
Page 1217 theory of the electromagnetic field. It consists of two equations (four in the non-relativistic
case).
The first equation is the precise statement that electromagnetic fields originate at
* James Clerk Maxwell (b. 1831 Edinburgh, d. 1879 Cambridge), Scottish physicist. He founded electro-
magnetism by theoretically unifying electricity and magnetism, as described in this chapter. His work on
thermodynamics forms the second pillar of his activity. In addition, he studied the theory of colours and
developed the colour triangle; he was one of the first people to make a colour photograph. He is regarded
by many as the greatest physicist ever. Both ‘Clerk’ and ‘Maxwell’ were his family names.
the description of electromagnetic field evolu tion 555
speed v N S
object with
charge ρ
magnetic field B
∇E = = µ0 j .
ρ 1 ∂E
and ∇ B − 2
ε0 c ∂t
Each of these four equivalent ways to write the equation makes a simple statement: elec-
trical charge carries the electromagnetic field. This statement, including its equations, are
equivalent to the three basic observations of Figure 255. It describes Coulomb’s relation, Copyright © Christoph Schiller November 1997–January 2007
Ampère’s relation, and the way changing electrical fields induce magnetic effects, as you
Challenge 1000 ny may want to check for yourself.
The second half of equation (407) contains the right hand rule for magnetic fields
Challenge 1001 e around wires, through the vector product. The equation also states that changing electric
fields induce magnetic fields. The effect is essential for the primary side of transformers.
The factor 1~c 2 implies that the effect is small; that is why coils with many windings or
strong electric currents are needed to find it. Due to the vector product, all induced mag-
netic field lines are closed lines.
* Maxwell generalized this equation to cases where the charges are not surrounded by vacuum, but located
inside matter. We will not explore these situations in our walk because, as we will see during our mountain
ascent, the apparently special case of vacuum in fact describes all of nature.
556 iv cl assical electrodynamics • 13. electricity and fields
The second result by Maxwell is the precise description of how changing electric fields
create magnetic fields, and vice versa. In particular, an electric field can have vortices only
when there is a changing magnetic field. In addition it expresses the observation that in
nature there are no magnetic charges, i.e. that magnetic fields have no sources. All these
results are described by the relation variously written
d F = 0 with = ε ρσ µν F µν or
ρσ 1
F
2
ε µνρ ∂ µ Fνρ = ∂ µ Fνρ + ∂ ν F ρ µ + ∂ ρ F µν = 0 or
0
γ c1 ∂ t Bx By Bz 0
−B x −E z ~c E y ~c
=
γ∂ x 0 0
−B y E z ~c −E x ~c
or (408)
γ∂ y 0 0
−B z −E y ~c E x ~c
γ∂ z 0 0
∇B = 0 and ∇ E = −
∂B
The relation expresses the lack of sources for the dual field tensor, usually written F: there
are no magnetic charges, i.e. no magnetic monopoles in nature. In practice, this equation
Challenge 1002 ny is always needed together with the previous one. Can you see why?
Since there are no magnetic charges, magnetic field lines are always closed; they never
start or end. For example, field lines continue inside magnets. This is often expressed
mathematically by stating that the magnetic flux through a closed surface S – such as a
sphere or a cube – always vanishes: ∫S BdA = 0.
The second half of equation (408), also shown in Figure 256, expresses that changes
in magnetic fields produce electric fields: this effect is used in the secondary side of trans-
formers and in dynamos. The cross product in the expression implies that an electric field
generated in this way – also called an electromotive field – has no start and endpoints. The
electromotive field lines thus run in circles: in most practical cases they run along elec-
tric circuits. The minus sign is essential to ensure energy conservation and has a special
name: it is called Lenz’s rule.
Together with Lorentz’ evolution equation (402), which describes how charges move
given the motion of the fields, Maxwell’s evolution equations (407) and (408) describe
all electromagnetic phenomena occurring on everyday scales, from mobile phones, car
batteries, to personal computers, lasers, lightning, holograms and rainbows. We now have
Copyright © Christoph Schiller November 1997–January 2007
m, q m, q
v v
0 distance r
they are moving away from one another. Imagine also that the two masses are large, so
that the acceleration due to their electrical repulsion is small. For an observer at the centre
of gravity of the two, each particle feels an acceleration from the electric field of the other.
q(1 − v 2 ~c 2 )
E= . (409)
4πe 0 r 2
We conclude that any sort of field leading to particle interactions must carry energy
and momentum, as the argument applies to all such cases. In particular, it applies to nuc-
lear interactions. Indeed, in the second part of our mountain ascent we will even find
an additional result: all fields are themselves composed of particles. The energy and mo-
mentum of fields then become an obvious state of affairs.
vector S
potential
They fascinate many.* However, in this mountain ascent we keep the discussion focused
When a charged particle moves through a magnetic potential A(x), its momentum
changes by q∆A; it changes by the difference between the potential values at the start
and end points, multiplied by its charge. Owing to this definition, the vector potential
has the property that
B = ∇ A = curl A (411) Copyright © Christoph Schiller November 1997–January 2007
i.e. that the magnetic field is the curl of the magnetic potential. In most languages the curl
is called the rotation and abbreviated rot. To visualize what the curl or rotation is, imagine
that the field vectors are the velocity vectors of flowing air. Now put a tiny paddle-wheel at
a point. If it turns, the curl is non-zero. The rotation speed of the paddle-wheel is maximal
for some direction of the axis; this maximal speed defines both the magnitude and the
direction of the curl at the point. (The right-hand rule is implied.) For example, the curl
Challenge 1006 ny for the velocities of a rotating solid body is everywhere 2ω, or twice the angular velocity.
Ref. 528 The vector potential for a long straight current-carrying wire is parallel to the wire; it
Challenge 1005 n * What is the relation, for static fields, between field lines and (equi-) potential surfaces? Can a field line
cross a potential surface twice? For more details on topics such as these, see the free textbook by B o Thidé,
the description of electromagnetic field evolu tion 559
which depends on the radial distance r from the wire and an integration constant r 0 . This
expression for the vector potential, pictured in Figure 258, shows how the moving current
produces a linear momentum in the (electro-) magnetic field around it. In the case of a
solenoid, the vector potential ‘circulates’ around the solenoid. The magnitude obeys
A(r) = −
Φ 1
, (413)
4π r
where Φ is the magnetic flux inside the solenoid. We see that, in general, the vector poten-
tial is dragged along by moving charges. The dragging effect decreases for larger distances.
This fits well with the image of the vector potential as the momentum of the electromag-
where Λ(t, x) is some scalar function, is also a vector potential for the same situation.
Copyright © Christoph Schiller November 1997–January 2007
(The magnetic field B stays the same, though.) Worse, can you confirm that the corres-
Challenge 1009 ny ponding (absolute) momentum values also change? This unavoidable ambiguity, called
gauge invariance, is a central property of the electromagnetic field. We will explore it in
more detail below.
Not only the momentum, but also the energy of the electromagnetic field is defined
ambiguously. Indeed, the second step in the specification of a state for the electromagnetic
Ref. 527 field is the definition of the electric potential as the energy U per charge:
φ=
U
(416)
q
In other words, the potential φ(x) at a point x is the energy needed to move a unit charge
to the point x starting from a point where the potential vanishes. The potential energy
is thus given by qφ. From this definition, the electric field E is simply the change of the
potential with position corrected by the time dependence of momentum, i.e.
E = −∇φ −
∂
A, (417)
∂t
Obviously, there is a freedom in the choice of the definition of the potential. If φ(x) is a
possible potential, then
φ′ (x) = φ(x) − Λ
∂
(418)
It is easy to see that the description of the field is complete, since we have
F = d A or F µν = ∂ µ A ν − ∂ ν A µ , (421)
which means that the electromagnetic field F is completely specified by the 4-potential A.
But as just said, the 4-potential itself is not uniquely defined. Indeed, any other equivalent
4-potential A′ is related to A by the gauge transformation
A′ µ = Aµ + ∂ µ Λ (422)
* This connection also shows why the expression P µ − qAµ appears so regularly in formulae; indeed, it plays
a central role in the quantum theory of a particle in the electromagnetic field.
the description of electromagnetic field evolu tion 561
where Λ = Λ(t, x) is any arbitrarily chosen scalar field. The new field A′ leads to the same
electromagnetic field, and to the same accelerations and evolutions. The 4-potential A is
thus an overdescription of the physical situation as several different gauge choices corres-
pond to the same physical situation.* Therefore we have to check that all measurement
results are independent of gauge transformations, i.e. that all observables are gauge invari-
ant quantities. Such gauge invariant quantities are, as we just saw, the fields F and F, and
in general all classical quantities. We add that many theoretical physicists use the term
‘electromagnetic field’ loosely for both the quantities F µν and A µ .
There is a simple image, due to Maxwell, to help overcoming the conceptual difficulties
of the vector potential. It turns out that the closed line integral over A µ is gauge invariant,
Challenge 1011 e because
y A µ dx = y (A µ + ∂ µ Λ)dx = y A µ dx .
∫ ∫ ∫
µ µ ′ µ
(423)
In other words, if we picture the vector potential as a quantity allowing one to associate
P=
ε0
4π ∫ E B dV . (425)
L=
ε0
4π ∫ E A dV , (426)
* Choosing a function Λ is often called choosing a gauge; the 4-potential A is also called the gauge field. This
strange usage has historic reasons and is now common to all of physics.
Ref. 529 ** In the second part of the text, on quantum mechanics, we will see that the exponent of this expression,
namely exp(iq x∫ A µ dx µ )~ħ, usually called the phase factor, can indeed be directly observed in experiments.
*** John Henry Poynting (1852–1914) introduced the concept in 1884.
562 iv cl assical electrodynamics • 13. electricity and fields
S CED = −mc 2 ∫ dτ − 1
4µ 0 ∫ F, F − ∫ j, A , (427)
−ª
∫ η µν
ds ds
ds −
M 4µ 0 ∫ (428)
δS = 0 when xµ = xµ + δµ and A µ = A µ + δ ′µ ,
provided δx µ (θ) 0 for SθS ª
and δA µ (x ν ) 0 for Sx ν S ª . (429)
Page 182 In the same way as in the case of mechanics, using the variational method for the two
Challenge 1013 ny variables A and x, we recover the evolution equations for particle and fields
¾
b = Fν u ν =j ε µνρσ ∂ ν F ρσ = 0 ,
µq µ µν ν µ0
, ∂µ F , and (430)
m ε0
which we know already. Obviously, they are equivalent to the variational principle based Copyright © Christoph Schiller November 1997–January 2007
on S CED . Both descriptions have to be completed by specifying initial conditions for the
particles and the fields, as well as boundary conditions for the latter. We need the first
and zeroth derivatives of the position of the particles, and the zeroth derivative for the
electromagnetic field.
Are you able to specify the Lagrangian of the pure electrodynamic field using the fields
Challenge 1014 ny E and B instead of F and F?
The form of the Lagrangian implies that electromagnetism is time reversible. This
means that every example of motion due to electric or magnetic causes can also take
* The product described by the symbol ,, ‘wedge’ or ‘hat’, has a precise mathematical meaning, defined for
Ref. 530 this case in equation (428). Its background, the concept of (mathematical) form, carries us too far from our
walk.
the description of electromagnetic field evolu tion 563
place backwards. This is easily deduced from the properties of the Lagrangian. On the
other hand, everyday life shows many electric and magnetic effects which are not time
invariant, such as the breaking of bodies or the burning of electric light bulbs. Can you
Challenge 1015 ny explain how this fits together?
In summary, with the Lagrangian (427) all of classical electrodynamics can be de-
scribed and understood. For the rest of this chapter, we look at some specific topics from
this vast field.
u S~c = cp 0 ε 0 cE B
= = ε 0 cë −ε 0 E i E j − B i B j ~µ 0 (431)
cp T E B 1~2δ i j (ε 0 E 2 + B 2 ~µ 0 )
The energy–momentum tensor shows again that electrodynamics is both Lorentz and
gauge invariant.
Both the Lagrangian and the energy–momentum tensor show that electrodynamics is
symmetric under motion inversion. If all charges change direction of motion – a situation
often incorrectly called ‘time inversion’ – they move backwards along the exact paths they
took when moving forward. Copyright © Christoph Schiller November 1997–January 2007
We also note that charges and mass destroy a symmetry of the vacuum that we men-
tioned in special relativity: only the vacuum is invariant under conformal symmetries. In
particular, only the vacuum is invariant under the spatial inversion r 1~r.
To sum up, electrodynamic motion, like all other examples of motion that we have
encountered so far, is deterministic, reversible and conserved. This is no big surprise.
Nevertheless, two symmetries of electromagnetism deserve special mention.
What is a mirror?
We will study the strange properties of mirrors several times during our walk. We start
with the simplest one first. Everybody can observe, by painting each of their hands in a
different colour, that a mirror does not exchange right and left, as little as it exchanges up
564 iv cl assical electrodynamics • 13. electricity and fields
and down; however, a mirror does exchange right and left handedness. In fact, it does so
by exchanging front and back.
To explore further this strange aspect of nature, try the following experiment: imagine
you are exchanging radio messages with a Martian; are you able to explain to him what
* The most famous is the position of the heart. The mechanisms leading to this disposition are still being
investigated. Recent research suggests that the oriented motion of the cilia on embryos, probably in the
Ref. 531 region called the node, determines the right–left asymmetry. The deep origin of this asymmetry is not yet
elucidated, however.
Most human bodies have more muscles on the right side for right-handers, such as Albert Einstein and
Pablo Picasso, and correspondingly on the left side for left-handers, such as Charlie Chaplin and Peter
Ustinov. This asymmetry reflects an asymmetry of the human brain, called lateralization, which is essen-
tial to human nature.
Another asymmetry of the human body is the hair whirl on the back of the head; the majority of humans
have only one, and in 80 % of the cases it is left turning. But many people have more than one.
the description of electromagnetic field evolu tion 565
right and left are, so that when you meet, you are sure you are talking about the same
Challenge 1018 n thing?
Actually, the mirror symmetry of everyday nature – also called its parity invariance
Ref. 532 – is so pervasive that most animals cannot distinguish left from right in a deeper sense.
Most animals react to mirror stimuli with mirror responses. It is hard to teach them dif-
ferent ways to react, and it is possible almost only for mammals. The many experiments
performed in this area gave the result that animals have symmetrical nervous systems,
and possibly only humans show lateralization, i.e. a preferred hand and different uses for
the left and the right parts of the brain.
To sum up this digression, classical electrodynamics is left–right symmetric, or parity
Challenge 1019 n invariant. Can you show this using its Lagrangian?
A concave mirror shows an inverted image; so does a plane mirror if it is partly folded
Challenge 1020 n along the horizontal. What happens if this mirror is rotated around the line of sight?
Why do metals provide good mirrors? Metals are strong absorbers of light. Any strong
E cB
Copyright © Christoph Schiller November 1997–January 2007
B −E~c (432)
to
cρ e ρm , ρm −cρ e . (433)
It was one of the great discoveries of theoretical physics that even though classical electro-
dynamics with matter is not symmetric under duality, nature is. In 1977, Claus Montonen
and David Olive showed that quantum theory allows duality transformations even with
the inclusion of matter. It has been known since the 1930s that quantum theory allows
magnetic monopoles. We will discover the important ramifications of this result in the
third part of the text. This duality turns out to be one of the essential stepping stones
that leads to a unified description of motion. (A somewhat difficult question: extending
this duality to quantum theory, can you deduce what transformation is found for the fine
Challenge 1022 ny structure constant, and why it is so interesting?)
Duality, by the way, is a symmetry that works only in Minkowski space-time, i.e. in
An inverse square dependence implies a vanishing mass of light and light particles,
Ref. 534 the photons. Is the mass really zero? The issue has been extensively studied. A massive
photon would lead to a wavelength dependence of the speed of light in vacuum, to de-
viations from the inverse square ‘law’, to deviations from Ampère’s ‘law’, to the existence
of longitudinal electromagnetic waves and more. No evidence for these effects has ever
been found. A summary of these studies shows that the photon mass is below 10−53 kg,
or maybe 10−63 kg. Some arguments are not universally accepted, thus the limit varies
somewhat from researcher to researcher.
* This can be deduced from special relativity from the reasoning of page 544 or from the formula in the
footnote of page 332.
the description of electromagnetic field evolu tion 567
A small non-vanishing mass for the photon would change electrodynamics some-
what. The inclusion of a tiny mass poses no special problems, and the corresponding
Ref. 534 Lagrangian, the so-called Proca Lagrangian, has already been studied, just in case.
Strictly speaking, the photon mass cannot be said to vanish. In particular, a photon
with a Compton wavelength of the radius of the visible universe cannot be distinguished
from one with zero mass through any experiment. This gives a mass of 10−69 kg for the
photon. One notes that the experimental limits are still much larger. Photons with such
a small mass value would not invalidate electrodynamics as we know it.
Interestingly, a non-zero mass of the photon implies the lack of magnetic monopoles,
as the symmetry between electric and magnetic fields is broken. It is therefore important
on the one hand to try to improve the experimental mass limit, and on the other hand
to explore whether the limit due to the universe’s size has any implications for this issue.
This question is still open.
this challenge might be the most complex of all challenges that science is facing. Clearly,
the experiment will require involved and expensive machinery, so that there is no danger
for a misuse of the technique. It could also be that the spatial resolution required is beyond
the abilities of technology. However, the understanding and modelling of the brain will
be a useful technology in other aspects of daily life as well.*
* This vision, formulated here in 2005, is so far from realization that it is unclear whether it will come true
in the twenty-first or in any subsequent century.
568 iv cl assical electrodynamics • 14. what is light?
∂2 φ ∂2 A x ∂2 A y ∂2 A z
2A = 0 or, equivalently ε0 µ0 + + + =0. (434)
∂t 2 ∂x 2 ∂y 2 ∂z 2
Challenge 1028 e
which are commonly called plane waves. Such a wave satisfies equation (434) for any
value of amplitude A 0 , of phase δ, and of angular frequency ω, provided the wave vector
k satisfies the relation
º
ω(k) = º ω(k) = º
1 1
k or k2 . (436)
ε0 µ0 ε0 µ0
what is light? 569
space
Electric field
wavelength λ
Magnetic Field
The relation ω(k) between the angular frequency and the wave vector, the so-called dis-
persion relation, is the main property of any type of wave, be it a sound wave, a water
wave, an electromagnetic wave, or any other kind. Relation (436) specifically character-
izes electromagnetic waves in empty space, and distinguishes them from all other types
of waves.*
Equation (434) for the electromagnetic field is linear in the field; this means that the
sum of two situations allowed by it is itself an allowed situation. Mathematically speak-
ing, any superposition of two solutions is also a solution. For example, this means that
two waves can cross each other without disturbing each other, and that waves can travel
Copyright © Christoph Schiller November 1997–January 2007
across static electromagnetic fields. Linearity also means that any electromagnetic wave
can be described as a superposition of pure sine waves, each of which is described by ex-
pression (435). The simplest possible electromagnetic wave, a harmonic plane wave with
Page 572 linear polarization, is illustrated in Figure 260. Note that for this simplest type of waves,
the electric and the magnetic field are in phase. (Can you prove this experimentally and
by calculation?) The surfaces formed by all points of maximal field intensity are parallel
planes, spaced by (half the) wavelength, which move along the direction of the propaga-
tion.
After Maxwell predicted the existence of electromagnetic waves, in the years between
Page 208 * For completeness, we remember that a wave in physics is any propagating imbalance.
570 iv cl assical electrodynamics • 14. what is light?
this half second, the signal goes up to the geostationary satellite, down again and returns
the same way. This half second gives a speed of c 4 ë 36 000 km~0.5 s 3 ë 105 km~s,
which is close to the precise value. Radio amateurs who reflect their signals from the
Moon can perform the same experiment and achieve higher precision.
But Maxwell did more. He strengthened earlier predictions that light itself is a solu-
tion of equation (435) and therefore an electromagnetic wave, albeit with a much higher
frequency. Let us see how we can check this.
* Heinrich Rudolf Hertz (b. 1857 Hamburg, d. 1894 Bonn), important Hamburger theoretical and exper-
imental physicist. The unit of frequency is named after him. Despite his early death, Hertz was a central
figure in the development of electromagnetism, in the explanation of Maxwell’s theory and in the unfolding
of radio communication technology. More about him on page 155.
what is light? 571
F I G U R E 263 The primary and secondary rainbow, and the supernumerary bows below the primary
bow (© Antonio Martos and Wolfgang Hinz)
It is easy to confirm the wave properties of light; indeed they were known already long
before Maxwell. In fact, the first to suggest that light is a (kind of) wave was, around the
Maybe the most beautiful is an experiment carried out by a team of Dutch physicists in
Ref. 537 1990. They simply measured the light transmitted through a slit in a metal plate. It turns
out that the transmitted intensity depends on the width of the slit. Their surprising result
is shown in Figure 264. Can you explain the origin of the unexpected intensity steps in
Challenge 1031 ny the curve?
Numerous other experiments on the cre-
ation, detection and measurement of elec- transmitted light power
tromagnetic waves were performed in the
nineteenth and twentieth centuries. For
example, in 1800, William Herschel dis-
covered infrared light using a prism and a
Challenge 1032 n thermometer. (Can you guess how?) In 1801,
Johann Wilhelm Ritter (1776–1810) a more
than colourful figure of natural Romanti-
(preliminary figure)
cism, discovered ultraviolet light using silver
is easily achieved, e.g. by shining it through a stretched plastic film. When the polariz-
ation of light was discovered in 1808 by the French physicist Louis Malus (1775–1812),
it definitively established the wave nature of light. Malus discovered it when he looked
at the strange double images produced by calcite, a transparent crystal found in many
minerals. Calcite (CaCO 3 ) splits light beams into two – it is birefringent – and polarizes
them differently. That is the reason that calcite is part of every crystal collection. Feldspar
(KAlSi3 O8 ) shows the same effect. If you ever get hold of a piece of transparent calcite,
do look through it at some written text.
By the way, the human eye is almost unable to detect polarization, in contrast to the
eyes of many insects, spiders and certain birds. Honey bees use polarization to deduce
the position of the Sun, even when it is hidden behind clouds, and use the effect for
what is light? 573
Haidinger’s brush
(color intensity human eye
exagerated)
polarized light
E
macula
cornea
and lens
2° to 4°
arm of the cross points to the Sun, if you look about 90° away from it, high in the sky.
To see it really clearly, hold a polaroid (or polaroid sunglasses) up to look through, and
rotate it about the line of sight.
Note that all possible polarizations of light form a continuous set. However, a general
plane wave can be seen as the superposition of two orthogonal, linearly polarized waves
with different amplitudes and different phases. Most books show pictures of plane, linear-
ized electrodynamic waves. Essentially, electric fields look like water waves generalized
to three dimensions, the same for magnetic fields, and the two are perpendicular to each
Challenge 1034 ny other. Can you confirm this?
Interestingly, a generally polarized plane wave can also be seen as the superposition of
right and left circularly polarized waves. However, no illustrations of circularly polarized
Challenge 1035 n waves are found in any textbook. Can you explain why?
574 iv cl assical electrodynamics • 14. what is light?
So far it is clear that light is a wave. To confirm that light waves are indeed electromag-
netic is more difficult. The first argument was given by Bernhard Riemann in 1858;* he
deduced that any electromagnetic wave must propagate with a speed c given by
c=º
1
. (437)
ε0 µ0
Already ten years before him, in 1848, Kirchoff had noted that the measured values on
both sides agreed within measurement errors. A few years later, Maxwell gave a beautiful
confirmation by deducing the expression from equation (436). You should be able to re-
Challenge 1036 ny peat the feat. Note that the right-hand side contains electric and magnetic quantities, and
the left-hand side is an optical entity. Riemann’s expression thus unifies electromagnetism
and optics.
Of course, people were not yet completely convinced. They looked for more ways to
* Bernhard Riemann (b. 1826 Breselenz, d. 1866 Selasca), important German mathematician. He studied
curved space, providing several of the mathematical and conceptual foundations of general relativity, but
then died at an early age.
** John Kerr (1824–1907), Scottish physicist, friend and collaborator of William Thomson.
what is light? 575
bicycle speedometers
microwaves
1.5 – 20 cm–2 cm SHF idem, absorbed night sky, emitted radio astronomy,
15 GHz by water by hydrogen atoms used for cooking
(2.45 GHz),
telecommunications,
radar
15 – 20 mm– EHF idem, absorbed
150 GHz 2 mm by water
infrared allows night emitted by every satellite photography
vision warm object of Earth, astronomy
576 iv cl assical electrodynamics • 14. what is light?
for the speed of light is so strange that one should be astonished when one sees it. Some-
thing essential is missing.
Indeed, the speed is independent of the proper motion of the observer measuring the
electromagnetic field. In other words, the speed of light is independent of the speed of
the lamp and independent of the speed of the observer. All this is contained in expression
578 iv cl assical electrodynamics • 14. what is light?
wall
(438). Incredibly, for five decades, nobody explored this strange result. In this way, the
theory of relativity remained undiscovered from 1848 to 1905. As in so many other cases,
the progress of physics was much slower than necessary.
The constancy of the speed of light is the essential point that distinguishes special
relativity from Galilean physics. In this sense, any electromagnetic device, making use of
expression (438), is a working proof of special relativity. Copyright © Christoph Schiller November 1997–January 2007
simplest possible accelerated motion: a bounce on a wall. In fact, the last diagram is suf-
Challenge 1038 d ficient to show that the transverse field decays as 1~r. Can you deduce the dependence?
If we perform the construction for a charge that undergoes repeated bounces, we get a
diagram with regular kinks in the field lines that move out from the source. For a charge
undergoing harmonic motion, we get the motion shown in Figure 267. It shows the mech-
anism of the simplest antenna (or light source) one can imagine.
The magnitude of the transverse electric field can be used to deduce the relation
between the acceleration a of a charge q and the radiated electromagnetic power P. First,
the transverse electric field (calculated in the last exercise) has to be squared, to give the
local electric energy density. Then it has to be doubled, to include magnetic energy. Fi-
nally, one has to integrate over all angles; this gives a factor of 2/3. In total one gets
q2 a2
P= . (439)
6πε 0 c 3
Copyright © Christoph Schiller November 1997–January 2007
The total radiated power P depends on the square of the acceleration and on the square
of the charge that is being accelerated. This is the so-called Larmor formula. It shows why
radio transmitters need power supplies.
* This was the book series in twenty volumes by Aaron Bernstein, Naturwissenschaftliche Volksbücher,
Duncker, 1873-1874. The young Einstein read them, between 1892 and 1894, with ‘breathless attention’, as
580 iv cl assical electrodynamics • 14. what is light?
air
light
beam
sugar and water
what kind of electromagnetic field he would observe in that case. Einstein later explained
that this Gedanken experiment convinced him already at that young age that nothing
could travel at the speed of light, since the field observed would have a property not found
Challenge 1039 n in nature. Can you find out which one he meant?
Riding on a light beam situation would have strange consequences.
Refraction is a consequence of the change of light speed from material to material. Are
Challenge 1041 n you able to explain refraction, and thus explain the syrup effect?
Refraction is chiefly used in the design of lenses. Using glass instead of water, one can
produce curved surfaces, so that light can be focused. Focusing devices can be used to
produce images. The two main types of lenses, with their focal points and the images
they produce, are shown in Figure 269. When an object is put between a converging lens
and its focus, the lens works as a magnifying glass. The lens then produces a real image, i.e.,
an image that can be projected onto a screen. In all other cases lenses produce so-called
virtual images: such images can be seen with the eye but not be projected onto a screen.
focus
do di
do
final, common
enlarged focus
virtual of both
image lenses
object
intermediate,
real image (if
ocular missing)
ocular
objective lens to human eye
Copyright © Christoph Schiller November 1997–January 2007
lens
F I G U R E 270 Refraction as the basis of the telescope – shown here in the original Dutch design
Figure 269 also allows one to deduce the thin lens formula that connects the lengths d o ,
Challenge 1042 n d o and f . What is it?
Even though glasses and lenses have been known since antiquity, the Middle Ages had
to pass by before two lenses were combined to make more elaborate optical instruments.
The telescope was invented in, or just before, 1608 by the German–Dutch lens grinder
Johannes Lipperhey (c. 1570–1619), who made a fortune by selling it to the Dutch mil-
itary. When Galileo heard about the discovery, he quickly took it over and improved it.
582 iv cl assical electrodynamics • 14. what is light?
In 1609, Galileo performed the first astronomical observations; they made him world-
famous. The Dutch telescope design has a short tube yielding a bright and upright image,
Challenge 1043 e and its magnification is the ratio of the focal distances of the two lenses. It is still used
today in opera glasses. Many other ways of building telescopes have been developed over
* A fascinating overview about what people have achieved in this domain up to now is given by Peter
Manly, Unusual Telescopes, Cambridge University Press, 1991, and Henry C. King, The History of the
Telescope, Dover, 2003.
Images can also be made with mirrors. Since mirrors are cheaper and easier to fabricate for high precision
imaging, most large telescopes have a mirror instead of the first lens.
By the way, telescopes also exist in nature. Many spiders have two types of eyes. The large ones, made to
see far away, have two lenses arranged in the same way as in the telescope.
** If not, read the beautiful text by Elizabeth M. Slater & Henry S. Slater, Light and Electron
Microscopy, Cambridge University Press, 1993.
what is light? 583
α
b
M
preliminary drawing
printed on a black background, as shown in Figure 271. One gets the impression that the
red letters float in front of the blue letters. Can you show how dispersion leads to the
Challenge 1046 n floating effect?
A second important observation is that light goes around corners. This effect was
called diffraction by Francesco Grimaldi, in his text Physico-mathesis de lumine, published
Challenge 1049 n can distinguish the two headlights of a car. Can you estimate it?
Resolution limits also make it impossible to see the Great Wall in northern China from
the Moon, contrary to what is often claimed. In the few parts that are not yet in ruins, the
wall is about 6 metres wide, and even if it casts a wide shadow during the morning or
the evening, the angle it subtends is way below a second of arc, so that it is completely
invisible to the human eye. In fact, three different cosmonauts who travelled to the Moon
Ref. 548 performed careful searches and confirmed that the claim is absurd. The story is one of the
Challenge 1050 ny most tenacious urban legends. (Is it possible to see the Wall from the space shuttle?) The
largest human-made objects are the polders of reclaimed land in the Netherlands; they
are visible from outer space. So are most large cities as well as the highways in Belgium
at night; their bright illumination makes them stand out clearly from the dark side of the
Earth.
584 iv cl assical electrodynamics • 14. what is light?
hot air
cold air
F I G U R E 274 An effective reflection (due to refraction in a hot air layer) is the basis
of mirages; for an example, see http://virtual.finland.fi/netcomm/news/showarticle.
asp?intNWSAID=25722 and http://www.polarimage.fi/mirages/mirages.htm
Diffraction also means that behind a small disc illuminated along its axis, the centre
of the shadow shows, against all expectations, a bright spot. This ‘hole’ in the shadow
was predicted in 1819 by Denis Poisson (1781–1840) in order to show to what absurd con-
sequences the wave theory of light would lead. He had just read the mathematical descrip-
* Augustin Jean Fresnel (1788–1827), engineer and part time physicist; he published in 1818 his great paper
on wave theory for which he got the prize of the French Academy of Sciences in 1819. (The ‘s’ in his name
is silent.) To improve his finances, he worked in the commission responsible for lighthouses, for which he
developed the well-known Fresnel lens. He died prematurely, partly of exhaustion due to overwork.
what is light? 585
high temperature. Essentially, the mirror provides a cheap way to fire an oven in its focus.
(And ‘focus’ is the Latin word for ‘hearth’).
Kids find out quite rapidly that large
lenses allow them to burn things more eas-
ily than small ones. It is obvious that the
Spanish site is the record holder in this
game. However, building a larger mirror
does not make sense. Whatever its size may
be, such a set-up cannot reach a higher
temperature than that of the original light
Ref. 588 source. The surface temperature of the Sun
is about 5800 K; indeed, the highest temper-
ature reached so far is about 4000 K. Are
you able to show that this limitation follows F I G U R E 275 The last mirror of the solar furnace
at Odeillo, in the French Pyrenees (© Gerhard
P= .
E
(441) F I G U R E 276 Levitating
c a small glass bead with
a laser
Challenge 1053 e As a result, the pressure p exerted by light on a body is given by
p= (1 + r)
T
(442)
c
where for black bodies we have that a reflectivity r = 0 and for mirrors r = 1; other bodies
have values in between. What is your guess for the amount of pressure due to sunlight on
* The heaviest object that has been levitated with a laser had a mass of 20 g; the laser used was enormous,
and the method also made use of a few additional effects, such as shock waves, to keep the object in the air.
586 iv cl assical electrodynamics • 14. what is light?
Challenge 1054 n a black surface of one square metre? Is this the reason that we feel more pressure during
the day than during the night?
In fact, rather delicate equipment is needed
to detect the momentum of light, in other words,
its radiation pressure. Already in 1619, Johannes
Kepler had suggested in De cometis that the tails
of comets exist only because the light of the Sun
hits the small dust particles that detach from it.
For this reason, the tail always points away from light
the Sun, as you might want to check at the next
Challenge 1055 e opportunity. Today, we know that Kepler was
right; but proving the hypothesis is not easy.
In 1873, William Crookes* invented the light
mill radiometer. He had the intention of demon-
is possible to do this through a microscope, so that one can also observe at the same time
what is happening. This technique is now routinely used in biological research around
the world, and has been used, for example, to measure the force of single muscle fibres,
by chemically attaching their ends to glass or Teflon spheres and then pulling them apart
with such optical tweezers.
But that is not all. In the last decade of the twentieth century, several groups even
Ref. 553 managed to rotate objects, thus realizing actual optical spanners. They are able to rotate
particles at will in one direction or the other, by changing the optical properties of the
laser beam used to trap the particle.
* William Crookes (b. 1832 London, d. 1919 London), English chemist and physicist, president of the Royal
Society, discoverer of thallium, and believer in spiritualism.
what is light? 587
suspension
wire
circularly
polarized
light beam
Challenge 1057 e Equivalently, the angular momentum of a wave is λ~2π times its linear momentum. For
Ref. 554 light, this result was already confirmed in the early twentieth century: a light beam can
Challenge 1058 ny put certain materials (which ones?) into rotation, as shown in Figure 278. Of course, the
whole thing works even better with a laser beam. In the 1960s, a beautiful demonstration
was performed with microwaves. A circularly polarized microwave beam from a maser
– the microwave equivalent of a laser – can put a metal piece absorbing it into rotation.
Indeed, for a beam with cylindrical symmetry, depending on the sense of rotation, the
angular momentum is either parallel or antiparallel to the direction of propagation. All
these experiments confirm that light also carries angular momentum, an effect which will
play an important role in the second part of our mountain ascent.
We note that not for all waves angular momentum is energy per angular frequency.
This is only the case for waves made of what in quantum theory will be called spin 1
Copyright © Christoph Schiller November 1997–January 2007
particles. For example, for gravity waves the angular momentum is twice this value, and
they are therefore expected to be made of spin 2 particles.
In summary, light can touch and be touched. Obviously, if light can rotate bodies, it
Challenge 1059 n can also be rotated. Could you imagine how this can be achieved?
fiction novels managed to persuade the military – who also indulge in this habit – that
lasers could be used to shoot down missiles, and that a lot of tax money should be spent
on developing such lasers. Using the definition of the Poynting vector and a hitting time
of about 0.1 s, are you able to estimate the weight and size of the battery necessary for
Challenge 1060 ny such a device to work? What would happen in cloudy or rainy weather?
Other people tried to persuade NASA to study the possibility of propelling a rocket
Challenge 1061 ny using emitted light instead of ejected gas. Are you able to estimate whether this is feasible?
What is colour?
We saw that radio waves of certain frequencies are visible. Within that range, different
frequencies correspond to different colours. (Are you able to convince a friend about
Challenge 1062 n this?) But the story does not finish here. Numerous colours can be produced either by a
single wavelength, i.e. by monochromatic light, or by a mixture of several different colours.
For example, standard yellow can be, if it is pure, an electromagnetic beam of 575 nm
Ref. 555 1304 to 1310 and, at the same time, the Persian mathematician Kamal al-Din al-Farisi. To
check the explanation, they did something smart and simple; anybody can repeat this at
Challenge 1065 e home. They built an enlarged water droplet by filling a thin spherical (or cylindrical) glass
container with water; then they shone a beam of white light through it. Theodoricus and
al-Farisi found exactly what is shown in Figure 280. With this experiment, each of them
was able to reproduce the angle of the main or primary rainbow, its colour sequence, as
well as the existence of a secondary rainbow, its observed angle and its inverted colour
Page 571 sequence.* All these bows are visible in Figure 263. Theodoricus’s beautiful experiment is
Challenge 1066 ny * Can you guess where the tertiary and quaternary rainbows are to be seen? There are rare reported sightings
of them. The hunt to observe the fifth-order rainbow is still open. (In the laboratory, bows around droplets
Ref. 556 up to the thirteenth order have been observed.) For more details, see the beautiful website at http://www.
what is light? 589
white red
glass green
violet
violet
green
red
50.3° 53.6°
red
green
violet
F I G U R E 280 Proving that white light is a mixture of colours (angles much exaggerated!)
sometimes called the most important contribution of natural science in the Middle Ages.
Incidentally, the explanation of Figure 280 is not complete. It is drawn with the light
ray hitting the water droplet at a specific spot. If the light ray hits the droplet at other
spots, the rainbows appear at other angles, but all those rainbows wash out. Only the
visible rainbow remains, because its deflection angles are extremal.
Incidentally, at sunset the atmosphere itself also acts as a prism, or more precisely, Copyright © Christoph Schiller November 1997–January 2007
as a cylindrical lens affected by spherochromatism. That means that the Sun is split into
different images, one for each colour, which are slightly shifted with respect to each other;
Ref. 557 the total shift is about 1% of the diameter. As a result, the rim of the Sun is coloured. If the
weather is favourable, if the air is clear up to and beyond the horizon, and if the correct
temperature profile is present in the atmosphere, a colour-dependent mirage will appear.
As a result, for about a second it will be possible to see, after or near the red, orange and
yellow images of the setting Sun, the green–blue image, sometimes even detached. This
Ref. 558 is the famous green flash described by Jules Verne in his novel Le Rayon-vert. It is often
seen on tropical beaches, for example in Hawaii, and from the decks of ships in warm
atoptics.co.uk. There are several formulae for the angles of the various orders of rainbows; they follow from
straightforward geometric considerations, but are too involved to be given here.
590 iv cl assical electrodynamics • 14. what is light?
v ph
v gr
v So
v fr
where k 0 is the central wavelength of the wave packet. We observe that ω = c(k)k =
592 iv cl assical electrodynamics • 14. what is light?
dv ph
v gr = V = v ph − λ
dω
. (446)
dk k 0 dλ
This means that the sign of the last term determines whether the group velocity is larger
or smaller than the phase velocity. For a travelling group, as shown by the dashed line in
Figure 283, this means that new maxima appear either at the end or at the front of the
group. Experiments show that this is only the case for light passing through matter; for
light in vacuum, the group velocity has the same value v gr = c for all values of the wave
vector k.
You should be warned that many publications are still propagating the incorrect state-
ment that the group velocity in a material is never greater than c, the speed of light in
Challenge 1068 ny vacuum. Actually, the group velocity in a material can be zero, infinite or even negative;
* In quantum mechanics, Schrödinger proved that the velocity of an electron is given by the group velocity
of its wave function. Therefore the same discussion reappeared in quantum theory, as we will find out in the
second part of the mountain ascent.
** Arnold Sommerfeld (b. 1868 Königsberg, d. 1951 München) was a central figure in the spread of special
and general relativity, of quantum theory, and of their applications. A professor in Munich, an excellent
teacher and text book writer, he worked on atomic theory, on the theory of metals and on electrodynamics,
and was the first to understand the importance and the mystery around ‘Sommerfeld’s famous fine structure
constant.’
what is light? 593
However, as in the case of the front velocity, in the case of the energy velocity we have
to specify if we mean the energy transported by the main pulse or by the front of it. In
vacuum, neither is ever greater than the speed of light.* (In general, the velocity of energy
Ref. 561 in matter has a value slightly different from Sommerfeld’s signal velocity.)
In recent years, the progress in light detector technology, allowing one to detect even
the tiniest energies, has forced scientists to take the fastest of all these energy velocities to
describe signal velocity. Using detectors with the highest possible sensitivity we can use as
signal the first point of the wave train whose amplitude is different from zero, i.e. the first
tiny amount of energy arriving. This point’s velocity, conceptually similar to Sommerfeld’s
signal velocity, is commonly called the front velocity or, to distinguish it even more clearly
Challenge 1069 n from Sommerfeld’s case, the forerunner velocity. It is simply given by
v fr = lim
ω
. (448)
ω ª k
either phase velocity or group velocity, but not for the energy or true signal velocity. The
conceptual problems would arise only because in some physical systems the refraction
angle for phase motion and for energy motion differ.
Ref. 566 Today, the consensus is the following: a positive index of refraction less than one is im-
possible, as it implies an energy speed of greater than one. A negative index of refraction,
however, is possible if it is smaller than −1. Negative values have indeed been frequently
observed; the corresponding systems are being extensively explored all over the world.
* Signals not only carry energy, they also carry negative entropy (‘information’). The entropy of a transmitter
increases during transmission. The receiver decreases in entropy (but less than the increase at the transmitter,
Ref. 563 of course).
Note that the negative group velocity implies energy transport against the propagation velocity of light.
Ref. 564 This is possible only in energy loaded materials.
594 iv cl assical electrodynamics • 14. what is light?
α air α air
water left-handed
(n>0) material
(n<0)
β β
The materials showing this property are called left-handed. The reason is that the vectors
of the electric field, the magnetic field and the wave vector form a left-handed triplet,
tion is always less than the speed of light. But if the neighbour already knows the text, he
can recite it without having heard the readers’ voice. To the third observer such a situation
appears to imply motion that is faster than light. Prediction can thus mimic communic-
ation and, in particular, it can mimic faster-than-light (superluminal) communication.
Ref. 569 Such a situation was demonstrated most spectacularly in 1994 by Günter Nimtz, who
seemingly transported music – all music is predictable for short time scales – through a
‘faster-than-light’ system. To distinguish between the two situations, we note that in the
case of prediction, no transport of energy takes place, in contrast to the case of commu-
nication. In other words, the definition of a signal as a transporter of information is not
as useful and clear-cut as the definition of a signal as a transporter of energy. In the above-
mentioned experiment, no energy was transported faster than light. The same distinction
between prediction on the one hand and signal or energy propagation on the other will
what is light? 595
Physical propert y E x p e r i m e n t a l va l u e
Sometimes one hears that certain experiments or even the theory of relativity show
that the aether does not exist. This is not strictly correct. In fact, experiments show some-
thing more important. All the data show that the aether is indistinguishable from the
vacuum. Of course, if we use the change of curvature as the definition for motion of the
vacuum, the vacuum can move, as we found out in the section on general relativity; but
the aether still remains indistinguishable from it.*
Ref. 571 * In fact, the term ‘aether’ has been used as an expression for several different ideas, depending on the
author. First of all it was used for the idea that a vacuum is not empty, but full; secondly, that this fullness
can be described by mechanical models, such as gears, little spheres, vortices, etc.; thirdly, it was imagined
that a vacuum is similar to matter, being made of the same substrate. Interestingly, some of these issues will
reappear in the third part of our mountain ascent.
596 iv cl assical electrodynamics • 14. what is light?
It is well known that the glowing material in light bulbs is tungsten wire in an inert gas.
This was the result of a series of experiments that began with the grandmother of all
lamps, namely the cucumber. The older generation knows that a pickled cucumber, when
attached to the 230 V of the mains, glows with a bright green light. (Be careful; the exper-
iment is dirty and somewhat dangerous.)
**
If the light emitted by the headlights of cars were polarized from the bottom left to the
upper right (as seen from the car’s driver) one could vastly improve the quality of driving
at night: one could add a polarizer to the wind shield oriented in the same direction. As
a result, a driver would see the reflection of his own light, but the light from cars coming
Challenge 1076 ny towards him would be considerably dampened. Why is this not done in modern cars?
images and the eye 597
Sometimes we see less than there is. Close your left eye, look at the white spot in
Figure 286, bring the page slowly towards your eye, and pay attention to the middle lines.
Challenge 1078 n At a distance of about 15 to 20 cm the middle line will seem uninterrupted. Why?
On the other hand, sometimes we see more than there is, as Figures 287 and 288 show.
The first shows that parallel lines can look skewed, and the second show a so-called Her-
mann lattice, named after its discoverer.* The Hermann lattice of Figure 288, discovered
by Elke Lingelbach in 1995, is especially striking. Variations of these lattices are now used
Ref. 574 to understand the mechanisms at the basis of human vision. For example, they can be
* Ludimar Herrmann (1838–1914), Swiss physiologist. The lattices are often falsely called ‘Hering lattices’
after the man who made Hermann’s discovery famous.
598 iv cl assical electrodynamics • 14. what is light?
F I G U R E 288 The Lingelbach lattice: do you see white, grey, or black dots? Copyright © Christoph Schiller November 1997–January 2007
used to determine how many light sensitive cells in the retina are united to one signal
pathway towards the brain. The illusions are angle dependent because this number is
also angle dependent.
Our eyes also ‘see’ things differently: the retina sees an inverted image of the world.
There is a simple method to show this, due to Helmholtz.* You need only a needle and
a piece of paper, e.g. this page of text. Use the needle to make two holes inside the two
* See Hermann von Helmholtz, Handbuch der physiologischen Optik, 1867. This famous classic is
available in English as Handbook of Physiological Optics, Dover, 1962. The Prussian physician, physicist and
science politician born as Hermann Helmholtz (b. 1821 Potsdam, d. 1894 Charlottenburg) was famous for
his works on optics, acoustics, electrodynamics, thermodynamics, epistemology and geometry. He founded
several physics institutions across Germany. He was one of the first to propagate the idea of conservation
images and the eye 599
letters ‘oo’. Then keep the page as close to your eye as possible, look through the two holes
towards the wall, keeping the needle vertical, a few centimetres behind the paper. You
will see two images of the needle. If you now cover the left hole with your finger, the right
needle will disappear, and vice versa. This shows that the image inside the eye, on the
Challenge 1079 ny retina, is inverted. Are you able to complete the proof?
Two other experiments can show the same result. If you push very lightly on the inside
of your eye (careful!), you will see a dark spot appear on the outside of your vision field.
And if you stand in a dark room and ask a friend to look at a burning candle, explore his
eye: you will see three reflections: two upright ones, reflected from the cornea and from
the lens, and a dim third one, upside-down, reflected from the retina.
Another reason that we do not see the complete image of nature is that our eyes have
a limited sensitivity. This sensitivity peaks around 560 nm; outside the red and the violet,
the eye does not detect radiation. We thus see only part of nature. For example, infrared
photographs of nature, such as the one shown in Figure 289, are interesting because they
show us something different from what we see usually.
Every expert of motion should also know that the sensitivity of the eye does not cor-
respond to the brightest part of sunlight. This myth has been spread around the world
Copyright © Christoph Schiller November 1997–January 2007
Ref. 575
by the numerous textbooks that have copied from each other. Depending on whether
frequency or wavelength or wavelength logarithm is used, the solar spectrum peaks at
500 nm, 880 nm or 720 nm. The human eye’s spectral sensitivity, like the completely dif-
ferent sensitivity of birds or frogs, is due to the chemicals used for detection. In short,
the human eye can only be understood by a careful analysis of its particular evolutionary
history.
An urban legend says that newborn babies see everything upside down. Can you ex-
Challenge 1080 n plain why this idea is wrong?
of energy. His other important book, Die Lehre von den Tonempfindungen, published in 1863, describes the
basis of acoustics and, like the Handbook, is still worth reading.
600 iv cl assical electrodynamics • 14. what is light?
F I G U R E 290 A high quality photograph of a live human retina, including a measured (false colour)
ated in front of the cones. Why don’t they appear in the picture? And why don’t they
Challenge 1081 n disturb us in everyday life? (The picture does not show the other type of sensitive light
cells, the rods, because the subject was in ambient light; rods come to the front of the
retina only in the dark, and then produce black and white pictures.
Of all the mammals, only primates can see colours. Bulls for example, don’t; they can-
not distinguish red from blue. On the other hand, the best colour seers overall are the
birds. They have cone receptors for red, blue, green, UV and, depending on the bird, for
up to three more sets of colours. A number of birds (but not many) also have a better eye
* Nature uses another trick to get maximum resolution: the eye continuously performs small movements,
called micronystagmus. The eye continuously oscillates around the direction of vision with around 40 to
50 Hz. In addition, this motion is also used to allow the cells in the retina to recharge.
images and the eye 601
reference reconstruction
beam beam
depth.
But obviously the most spectacular effect is obtained whenever position dependent
images can be created. Some virtual reality systems mimic this effect using a sensor at-
tached to the head, and creating computer–generated images that depend on this pos-
ition. However, such systems are not able to reproduce actual situations and thus pale
when compared with the impression produced by holograms.
Holograms reproduce all that is seen from any point of a region of space. A hologram
is thus a stored set of position dependent pictures of an object. It is produced by storing
amplitude and phase of the light emitted by an object. To achieve this, the object is illu-
minated by a coherent light source, such as a laser, and the interference pattern is stored.
Illuminating the developed photographic film by a coherent light source then allows one
to see a full three-dimensional image. In particular, due to the reproduction of the situ-
602 iv cl assical electrodynamics • 14. what is light?
ation, the image appears to float in free space. Holograms were developed in 1947 by the
Hungarian physicist Dennis Gabor (1900–1979), who received the 1971 Nobel Prize for
physics for this work.
Holograms can be made to work in reflection or transmission. The simplest holograms
use only one wavelength. Most coloured holograms are rainbow holograms, showing false
colours that are unrelated to the original objects. Real colour holograms, made with three
different lasers, are rare but possible.
Is it possible to make moving holograms? Yes; however, the technical set-ups are still
extremely expensive. So far, they exist only in a few laboratories and cost millions of euro.
By the way, can you describe how you would distinguish a moving hologram from a real
Challenge 1083 n body, if you ever came across one, without touching it?
Imaging
Producing images is an important part of modern society. The quality of images depends
dE
λ
, (449)
2n sin α
where λ is the wavelength, n the index of refraction and α is the angle of observation.
The new technique, a special type of fluorescence microscopy developed by Stefan Hell,
modifies this expression to
dE »
λ
, (450)
2n sin α; I~I sat
so that a properly chosen saturation intensity allows one to reduce the diffraction limit
to arbitrary low values. So far, light microscopy with a resolution of 16 nm has been
images and the eye 603
Ref. 577 performed, as shown in Figure 292. This and similar techniques should become com-
monplace in the near future.
* Most bodies are not black, because colour is not only determined by emission, but also by absorption of
light.
604 iv cl assical electrodynamics • 14. what is light?
The precise expression for the emitted energy density u per frequency ν can be de-
duced from the radiation law for black bodies discovered by Max Planck*
ν3
u(ν, T) =
8πh
. (451)
c 3 e hν~k T − 1
He made this important discovery, which we will discuss in more detail in the second
part of our mountain ascent, simply by comparing this curve with experiment. The new
constant h, quantum of action or Planck’s constant, turns out to have the value 6.6 ë 10−34 Js,
Page 721 and is central to all quantum theory, as we will see. The other constant Planck introduced,
the Boltzmann constant k, appears as a prefactor of temperature all over thermodynamics,
as it acts as a conversion unit from temperature to energy.
Challenge 1085 ny The radiation law gives for the total emitted energy density the expression
Challenge 1086 ny from which equation (460) is deduced using I = uc~4. (Why?)
The second property of black body radiation is the value of the peak wavelength, i.e. the
wavelength emitted with the highest intensity. This wavelength determines their colour;
Challenge 1087 ny it is deduced from equation (451) to be
Above, we predicted that any material made of charges emits radiation. Are you able
to find a simple argument showing whether heat radiation is or is not this classically
* Max Planck (1858–1947), professor of physics in Berlin, was a central figure in thermostatics. He dis-
covered and named Boltzmann’s constant k and the quantum of action h, often called Planck’s constant. His
introduction of the quantum hypothesis gave birth to quantum theory. He also made the works of Einstein
known in the physical community, and later organized a job for him in Berlin. He received the Nobel Prize
for physics in 1918. He was an important figure in the German scientific establishment; he also was one of
the very few who had the courage to tell Adolf Hitler face to face that it was a bad idea to fire Jewish profess-
ors. (He got an outburst of anger as answer.) Famously modest, with many tragedies in his personal life, he
was esteemed by everybody who knew him.
** Wilhelm Wien (b. 1864 Gaffken, d. 1928 München), East-Prussian physicist; he received the Nobel Prize
for physics in 1911 for the discovery of this relation.
images and the eye 605
Figure to be inserted
F I G U R E 293 Bodies inside an oven at room temperature (left) and red hot (right)
Light as weapon?
In many countries, there is more money to study assault weapons than to increase the edu-
cation and wealth of their citizen. Several types of assault weapons using electromagnetic
radiation are being researched. Two are particularly advanced. Copyright © Christoph Schiller November 1997–January 2007
The first weapon is a truck with a movable parabolic antenna on its roof, about 1 m in
size, that emits a high power (a few kW) microwave (95 GHz) beam. The beam, like all
microwave beams, is invisible; depending on power and beam shape, it is painful or lethal,
up to a distance of a 100 m. This terrible device, with which the operator can make many
many victims without even noticing, was ready in 2006. (Who expects that a parabolic
antenna is dangerous?) Efforts to ban it across the world are slowly gathering momentum.
The second weapon under development is the so-called pulsed impulse kill laser. The
idea is to take a laser that emits radiation that is not absorbed by air, steam or similar
obstacles. An example is a pulsed deuterium fluoride laser that emits at 3.5 µm. This laser
burns every material it hits; in addition, the evaporation of the plasma produced by the
burn produces a strong hit, so that people hit by such a laser are hurt and hit at the same
time. Fortunately, it is still difficult to make such a device rugged enough for practical
606 iv cl assical electrodynamics • 14. what is light?
use. But experts expect battle lasers to appear soon. In short, it is probable that radiation
weapons will appear in the coming years.*
**
Colour blindness was discovered by the great English scientist John Dalton (1766–1844)
Challenge 1095 ny – on himself. Can you imagine how he found out? It affects, in all its forms, one in 20 men.
In many languages, a man who is colour blind is called daltonic. Women are almost never
Ref. 587 daltonic, as the property is linked to defects on the X chromosome. If you are colour blind,
you can check to which type you belong with the help of Figure 294.
**
Light beams, such as those emitted from lasers, are usually thought of as lines. However,
* What a man working on such developments tells his children when he comes home in the evening is not
clear.
charges are discrete – the limits of cl assical electrodynamics 607
light beams can also be tubes. Tubular laser beams, or Bessel beams of high order, are
used in modern research to guide plasma channels.
**
Is it really possible to see stars from the bottom of a deep pit or of a well, even during the
Challenge 1096 n day, as is often stated?
**
Imaging is an important part of modern industry. Without laser printers, photocopying
machines, CD players, DVD players, microscopes, digital photographic cameras, film and
video cameras, lithography machines for integrated circuit production, telescopes, film
projectors, our world would look much more boring. And there are not enough experts
Ref. 588 in the world on this fascinating field.
In short, like all flows in nature, the flow of electricity is due to a flow of discrete particles.
Copyright © Christoph Schiller November 1997–January 2007
In other systems, charges move both as electrons and as ions. Examples are neon lamps,
fire, plasmas and the Sun. Inside atoms, electrons behave even more strangely. One tends
to think that they orbit the nucleus (as we will see later) at a rather high speed, as the
orbital radius is so small. However, it turns out that in most atoms many electrons do not
orbit the nucleus at all. The strange story behind atoms and their structure will be told in
Page 773 the second part of our mountain ascent.
The discreteness of charge implies that one can estimate the size of atoms by observing
Challenge 1100 ny galvanic deposition. How?
**
Cosmic radiation consists of charged particles hitting the Earth. (We will discuss this
Page 919 in more detail later.) Astrophysicists explain that these particles are accelerated by the
Ref. 578 magnetic fields around the Galaxy. However, the expression of the Lorentz acceleration
shows that magnetic fields can only change the direction of the velocity of a charge, not
Challenge 1101 ny its magnitude. How can nature get acceleration nevertheless?
**
What would be the potential of the Earth in volt if we could take away all the electrons of
Challenge 1102 n a drop of water?
ρ=
2m
, (456)
τe 2 n
with n being the electron number density. Inserting numbers for copper (n =
10.3 ë 1028 ~m−3 and ρ = 0.16 ë 10−7 Ωm), one gets a time τ = 42 ps. This time is so
short that the switch-on process can usually be neglected.
**
Does it make sense to write Maxwell’s equations in vacuum? Both electrical and magnetic
fields require charges in order to be measured. But in vacuum there are no charges at all.
Worse, fields are defined by using infinitesimally small test charges. But such charges do Copyright © Christoph Schiller November 1997–January 2007
not exist. In fact, only quantum theory solves this apparent contradiction. Are you able
Challenge 1104 d to imagine how?
610 iv cl assical electrodynamics • 16. electromagnetic effects
Incidentally, you have a 75 % chance of survival after being hit by lightning, especially
if you are completely wet, as in that case the current will flow outside the skin. Usually, wet
people who are hit loose all their clothes, as the evaporating water tears them off. Rapid
resuscitation is essential to help somebody to recover after a hit.*
As a note, you might know how to measure the distance of a lightning by counting
the seconds between the lightning and the thunder and multiplying this by the speed of
sound, 330 m~s; it is less well known that one can estimate the length of the lightning bolt
by measuring the duration of the thunder, and multiplying it by the same factor.
In the 1990s more electrical details about thunderstorms became known. Airline pilots
and passengers sometime see weak and coloured light emissions spreading from the top
of thunderclouds. There are various types of such emissions: blue jets and mostly red
sprites and elves, which are somehow due to electric fields between the cloud top and
the ionosphere. The details are still under investigation, and the mechanisms are not yet
clear.**
Ref. 593 down towards the Earth at a given moment. Present research also aims at measuring the
activity of the related electrical sprites and elves in this way.
The ions in air play a role in the charging of thunderclouds via the charging of ice
crystals and rain drops. In general, all small particles in the air are electrically charged.
When aeroplanes and helicopters fly, they usually hit more particles of one charge than
of the other. As a result, aeroplanes and helicopters are charged up during flight. When
* If you are ever hit by lightning and survive, go to the hospital! Many people died three days later having
failed to do so. A lightning strike often leads to coagulation effects in the blood. These substances block the
kidneys, and one can die three days later because of kidney failure. The remedy is to have dialysis treatment.
** For images, have a look at the interesting http://sprite.gi.alaska.edu/html/sprites.htm, http://www.
fma-research.com/spriteres.htm and http://paesko.ee.psu.edu/Nature websites.
Challenge 1106 ny *** The Earth is thus charged to about −1 MC. Can you confirm this?
612 iv cl assical electrodynamics • 16. electromagnetic effects
a helicopter is used to rescue people from a raft in high seas, the rope pulling the people
upwards must first be earthed by hanging it in the water; if this is not done, the people
on the raft could die from an electrical shock when they touch the rope, as has happened
a few times in the past.
The charges in the atmosphere have many other effects. Recent experiments have con-
Ref. 594 firmed what was predicted back in the early twentieth century: lightning emits X-rays.
The confirmation is not easy though; it is necessary to put a detector near the lightning
flash. To achieve this, the lightning has to be directed into a given region. This is possible
using a missile pulling a metal wire, the other end of which is attached to the ground.
These experimental results are now being collated into a new description of lightning
which also explains the red-blue sprites above thunderclouds. In particular, the processes
Ref. 595 also imply that inside clouds, electrons can be accelerated up to energies of a few MeV.
Why are sparks and lightning blue? This turns out to be a material property: the colour
comes from the material that happens to be excited by the energy of the discharge, usu-
they were noticed during thunderstorms, often after a lighting had struck. With a few
Ref. 596 exceptions, nobody took these reports seriously, because no reproducible data existed.
When microwave ovens become popular, several methods to produce ball-shaped dis-
charges became known. To observe one, just stick a toothpick into a candle, light the
toothpick, and put it into (somebody else’s) microwave oven at maximum power. This set-
up produces a beautiful ball-like discharge. However, humans do not live in a microwave
oven.
The experimental situation changed completely in the years 1999 to 2001. In those
Ref. 597 years the Russian physicists Anton Egorov and Gennady Shabanov discovered a way to
produce plasma clouds, or plasmoids, floating in air, using three main ingredients: water,
metal and high voltage. If high voltage is applied to submerged metal electrodes of the
electromagnetic effects and challenges 613
proper shape and make, plasma clouds emerge from the water, about 10 to 20 cm in size,
float above the surface, and disappear after about half a second. Two examples can be
seen in Figure 296.
The phenomenon of floating plasmoids is still being studied. There are variations in
shape, colour, size and lifetime. The spectrum of observations and techniques will surely
evolve in the coming years. In any case, it seems that the phenomenon could explain a
number of ball lighting observations.
since all accelerated charges radiate. However, the world around us is full of charges at
fixed heights, and there is no such radiation. How is this possible?
Ref. 598 The question has been a pet topic for many years. Generally speaking, the concept of
radiation is not observer invariant: If one observer detects radiation, a second one does
not necessarily do so as well. The exact way a radiation field changes from one observer
to the other depends on the type of relative motion and on the field itself.
A precise solution of the problem shows that for a uniformly accelerated charge, an ob-
server undergoing the same acceleration only detects an electrostatic field. In contrast, an
Ref. 599 inertial observer detects a radiation field. Since gravity is (to a high precision) equivalent
to uniform acceleration, we get a simple result: gravity does not make electrical charges
radiate for an observer at rest with respect to the charge, as is observed. The results holds
true also in the quantum theoretical description.
614 iv cl assical electrodynamics • 16. electromagnetic effects
Research questions
The classical description of electrodynamics is coherent and complete; nevertheless there
are still many subjects of research. Here are a few of them.
The origin of the magnetic field of the Earth, the
other planets, the Sun and even of the galaxy is a fas- ocean
cinating topic. The way that the convection of fluids in- crust
side the planets generates magnetic fields, an intrinsic-
ally three-dimensional problem, the influence of turbu- mantle
lence, of nonlinearities and of chaos makes it a surpris-
ingly complex question. liquid core
The details of the generation of the magnetic field solid core
of the Earth, usually called the geodynamo, began to
appear only in the second half of the twentieth century,
when the knowledge of the Earth’s interior reached a
thousands and a few million years, is one of the central questions. The answers are dif-
ficult; experiments are not yet possible, 150 years of measurements is a short time when
compared with the last transition – about 730 000 years ago – and computer simulations
are extremely involved. Since the field measurements started, the dipole moment of the
magnetic field has steadily diminished, presently by 5% a year, and the quadrupole mo-
ment has steadily increased. Maybe we are heading towards a surprise.* (By the way, the
study of galactic magnetic fields is even more complex, and still in its infancy.)
Another important puzzle about electricity results from the equivalence of mass and
energy. It is known from experiments that the size d of electrons is surely smaller than
* In 2005, it has been reported that the inner core of the Earth seems to rotate faster than the Earth’s crust
Ref. 600 by up to half a degree per year.
electromagnetic effects and challenges 615
Ref. 601 10−22 m. This means that the electric field surrounding it has an energy content E given
Challenge 1108 ny by at least
On the other hand, the mass of an electron, usually given as 511 keV~c2 , corresponds to
an energy of only 82 fJ, ten million times less than the value just calculated. In other
words, classical electrodynamics has considerable difficulty describing electrons. In fact,
a consistent description of charged point particles within classical electrodynamics is
Ref. 602 impossible. This pretty topic receives only a rare – but then often passionate – interest
nowadays, because the puzzle is solved in a different way in the upcoming, second part
Challenge 1109 r placed outside the head? One could start with a simpler challenge: Would it be possible
to distinguish the thought ‘yes’ from the thought ‘no’ by measuring electrical or magnetic
fields around the head? In other words, is mind-reading possible? Maybe the twenty-first
century will come up with a positive answer. If so, the team performing the feat will be
instantly famous.
Levitation
We have seen that it is possible to move certain objects without touching them, using a
magnetic or electric field or, of course, using gravity. Is it also possible, without touching
an object, to keep it fixed, floating in mid-air? Does this type of rest exist?
It turns out that there are several methods of levitating objects. These are commonly
616 iv cl assical electrodynamics • 16. electromagnetic effects
Ref. 604 divided into two groups: those that consume energy and those who do not. Among the
methods that consume energy is the floating of objects on a jet of air or of water, the float-
ing of objects through sound waves, e.g. on top of a siren, or through a laser beam com-
ing from below, and the floating of conducting material, even of liquids, in strong radio-
frequency fields. Levitation of liquids or solids by strong ultrasound waves is presently
Ref. 605 becoming popular in laboratories. All these methods give stationary levitation. Another
group of energy consuming methods sense the way a body is falling and kick it up again in
the right way via a feedback loop; these methods are non-stationary and usually use mag-
netic fields to keep the objects from falling. The magnetic train being built in Shanghai
Ref. 606 by a German consortium is levitated this way. The whole train, including the passengers,
is levitated and then moved forward using electromagnets. It is thus possible, using mag-
nets, to levitate many tens of tonnes of material.
For levitation methods that do not consume energy – all such methods are necessar-
ily stationary – a well-known limitation can be found by studying Coulomb’s ‘law’ of
= − (ε − ε 0 )E 2 .
U 1
(458)
V 2
Challenge 1110 ny Since the electric field E never has a maximum in the absence of space charge, and since
for all materials ε A ε 0 , there cannot be a minimum of potential energy in free space for
a neutral body.**
To sum up, using static electric or static gravitational fields it is impossible to keep an
Copyright © Christoph Schiller November 1997–January 2007
object from falling; neither quantum mechanics, which incorporates phenomena such as
antimatter, nor general relativity, including phenomena such as black holes, change this
basic result.
For static magnetic fields, the argument is analogous to electrical fields: the potential
energy U of a magnetizable body of volume V and permeability µ in a medium with
Page 81 * To the disappointment of many science-fiction addicts, this would even be true if a negative mass existed.
And even though gravity is not really due to a field, but to space-time curvature, the result still holds in
general relativity.
Ref. 608 ** It is possible, however, to ‘levitate’ gas bubbles in liquids – ‘trap’ them to prevent them from rising would
be a better expression – because in such a case the dielectric constant of the environment is higher than that
Challenge 1111 ny of the gas. Can you find a liquid–gas combination where bubbles fall instead of rise?
electromagnetic effects and challenges 617
= − ( − )B 2
U 1 1 1
(459)
V 2 µ µ0
and due to the inequality ∆B 2 E 0, isolated maxima of a static magnetic field are not
possible, only isolated minima. Therefore, it is impossible to levitate paramagnetic (µ A
µ o ) or ferromagnetic (µ Q µ 0 ) materials such as steel, including bar magnets, which are
Challenge 1113 ny all attracted, and not repelled to magnetic field maxima.
There are thus two ways to get magnetic levitation: levitating a diamagnet or using a
Ref. 609 time dependent field. Diamagnetic materials (µ < µ o ) can be levitated by static magnetic
fields because they are attracted to magnetic field minima; the best-known example is the
levitation of superconductors, which are, at least those of type I, perfects diamagnets (µ =
0). Strong forces can be generated, and this method is also being tested for the levitation
Ref. 606 of passenger trains in Japan. In some cases, superconductors can even be suspended in
mid-air, below a magnet. Single atoms with a magnetic moment are also diamagnets; they
Ref. 610 are routinely levitated this way and have also been photographed in this state.
Also single neutrons, which have a magnetic dipole moment, have been kept in mag- Copyright © Christoph Schiller November 1997–January 2007
netic bottles in this way, until they decay. Recently, scientists have levitated pieces of wood,
plastic, strawberries, water droplets, liquid helium droplets as large as 2 cm, grasshoppers,
fish and frogs (all alive and without any harm) in this way. They are, like humans, all made
Ref. 611 of diamagnetic material. Humans themselves have not yet been levitated, but the feat is
being planned and worked on.
Diamagnets levitate if ∇B 2 A 2µ 0 ρд~χ, where ρ is the mass density of the object and
Challenge 1114 ny χ = 1 − µ~µ 0 its magnetic susceptibility. Since χ is typically about 10−5 and ρ of order
1000 kg~m3 , field gradients of about 1000 T2 ~m are needed. In other words, levitation re-
quires fields changes of 10 T over 10 cm, which is nowadays common for high field labor-
atory magnets.
Finally, time dependent electrical or magnetic fields, e.g. periodic fields, can lead to
Ref. 604 levitation in many different ways without any consumption of energy. This is one of the
618 iv cl assical electrodynamics • 16. electromagnetic effects
methods used in the magnetic bearings of turbomolecular vacuum pumps. Also single
charged particles, such as ions and electrons, are now regularly levitated with Paul traps
Ref. 604 and Penning traps. The mechanical analogy is shown in Figure 298.
Ref. 612 Figure 299 shows a toy that allows you to personally levitate a spinning top in mid-air
above a ring magnet, a quite impressive demonstration of levitation for anybody looking
Ref. 613 at it. It is not hard to build such a device yourself.
Even free electrons can be levitated, letting them float above the surface of fluid helium.
Ref. 614 In the most recent twist of the science of levitation, in 1995 Stephen Haley predicted that
the suspension height of small magnetic particles above a superconducting ring should
be quantized. However, the prediction has not been verified by experiment yet.
For the sake of completeness we mention that nuclear forces cannot be used for levita-
tion in everyday life, as their range is limited to a few femtometres. However, we will see
Page 944 later that the surface matter of the Sun is prevented from falling into the centre by these
interactions; we could thus say that it is indeed levitated by nuclear interactions.
Challenge 1116 r a new one? For example, can electric charge change the colour of objects?
TA B L E 50 Selected matter properties related to electromagnetism, showing among other things the
role it plays in the constitution of matter; at the same time a short overview of atomic, solid state, fluid
and business physics
Propert y Example Definition
Challenge 1115 ny * The issue is far from simple: which one of the levitation methods described above is used by tables or
chairs?
electromagnetic effects and challenges 619
All matter properties in the list can be influenced by electric or magnetic fields or
directly depend on them. This shows that the nature of all these material properties is
Copyright © Christoph Schiller November 1997–January 2007
electromagnetic. In other words, charges and their interactions are an essential and fun-
damental part of the structure of objects. The table shows so many different electromag-
netic properties that the motion of charges inside each material must be complex indeed.
Most effects are the topic of solid state physics,* fluid and plasma physics.
Solid state physics is by far the most important part of physics, when measured by the
impact it has on society. Almost all effects have applications in technical products, and
give employment to many people. Can you name a product or business application for
Challenge 1119 e any randomly chosen effect from the table?
* Probably the best and surely the most entertaining introductory English language book on the topic is the
one by Neil Ashcroft & David Mermin, Solid State Physics, Holt Rinehart & Winston, 1976.
electromagnetic effects and challenges 625
In our mountain ascent however, we look at only one example from the above list:
thermal radiation, the emission of light by hot bodies.
Earnshaw’s theorem about the impossibility of a stable equilibrium for charged
particles at rest implies that the charges inside matter must be moving. For any charged
particle in motion, Maxwell’s equations for the electromagnetic field show that it radiates
energy by emitting electromagnetic waves. In short, classical mechanics thus predicts that
matter must radiate electromagnetic energy.
Interestingly, everybody knows from experience that this is indeed the case. Hot bod-
ies light up depending on their temperature; the working of light bulbs thus proves that
metals are made of charged particles. Incandescence, as it is called, requires charges. Ac-
tually, every body emits radiation, even at room temperature. This radiation is called
thermal radiation; at room temperature it lies in the infrared. Its intensity is rather weak
Ref. 626 in everyday life; it is given by the general expression
body does not influence fields and charges inside it. A closed metal surface thus forms a
Challenge 1123 n shield against an electric field. Can you give an explanation? In fact, a tight metal layer is
not required to get the effect; a cage is sufficient. One speaks of a Faraday cage.
The detailed mechanism allows you to answer the following question: do Faraday cages
for gravity exist? Why?
For moving external fields or charges, the issue is more complex. Fields due to accel-
erated charges – radiation fields – decay exponentially through a shield. Fields due to
charges moving at constant speed are strongly reduced, but do not disappear. The reduc-
tion depends on the thickness and the resistivity of the metal enclosure used. For sheet
metal, the field suppression is very high; it is not necessarily high for metal sprayed plastic.
Ref. 583 Such a device will not necessarily survive a close lightning stroke.
626 iv cl assical electrodynamics • 16. electromagnetic effects
In practice, there is no danger if lightning hits an aeroplane or a car, as long they are
made of metal. (There is one film on the internet of a car hit by lightning; the driver does
not even notice.) However, if your car is hit by lightning in dry weather, you should wait
a few minutes before getting out of it. Can you imagine why?
Faraday cages also work the other way round. (Slowly) changing electric fields that
are inside a Faraday cage are not felt outside. For this reason, radios, mobile phones and
computers are surrounded by boxes made of metal or metal-sprayed plastics. The metal
keeps the so-called electromagnetic smog to a minimum.
There are thus three reasons to surround electric appliances by a grounded shield:
to protect the appliance from outside fields, to protect people and other machines from
electromagnetic smog, and to protect people against the mains voltage accidentally being
fed into the box (for example, when the insulation fails). In high precision experiments,
these three functions can be realized by three separate cages.
For purely magnetic fields, the situation is more complex. It is quite difficult to shield
**
Ref. 584 Researchers are trying to detect tooth decay with the help of electric currents, using the
observation that healthy teeth are bad conductors, in contrast to teeth with decay. How
Challenge 1125 ny would you make use of this effect in this case? (By the way, it might be that the totally
unrelated technique of imaging with terahertz waves could yield similar results.)
**
A team of camera men in the middle of the Sahara were using battery-driven electrical
equipment to make sound recordings. Whenever the microphone cable was a few tens
of metres long, they also heard a 50 Hz power supply noise, even though the next power
supply was thousands of kilometres away. An investigation revealed that the high voltage
lines in Europe lose a considerable amount of power by irradiation; these 50 Hz waves are
reflected by the ionosphere around the Earth and thus can disturb recording in the middle
of the desert. Can you estimate whether this observation implies that living directly near
Challenge 1126 ny a high voltage line is dangerous?
** Copyright © Christoph Schiller November 1997–January 2007
When solar plasma storms are seen on the Sun, astronomers first phone the electricity
company. They know that about 24 to 48 hours later, the charged particles ejected by
the storms will arrive on Earth, making the magnetic field on the surface fluctuate. Since
power grids often have closed loops of several thousands of kilometres, additional electric
currents are induced, which can make transformers in the grid overheat and then switch
off. Other transformers then have to take over the additional power, which can lead to
their overheating, etc. On several occasions in the past, millions of people have been left
without electrical power due to solar storms. Today, the electricity companies avoid the
problems by disconnecting the various grid sections, by avoiding large loops, by reducing
the supply voltage to avoid saturation of the transformers and by disallowing load transfer
from failed circuits to others.
electromagnetic effects and challenges 627
**
If the electric field is described as a sum of components of different frequencies, its so-
Ref. 586 called Fourier components, the amplitudes are given by
Ê(k, t) =
1
(2π)3 ~2 ∫ E(x, t)e −ikx
d3 x (461)
and similarly for the magnetic field. It then turns out that a Lorentz invariant quantity N,
describing the energy per circular frequency ω, can be defined:
Challenge 1127 n Can you guess what N is physically? (Hint: think about quantum theory.)
**
A capacitor of capacity C is charged with a voltage U. The stored electrostatic energy is
E = CU 2 ~2. The capacitor is then detached from the power supply and branched on to an
empty capacitor of the same capacity. After a while, the voltage obviously drops to U~2. Copyright © Christoph Schiller November 1997–January 2007
However, the stored energy now is C(U~2)2 , which is half the original value. Where did
Challenge 1130 n the energy go?
**
The maximum force in nature limits the maximum charge that a black hole can carry.
Challenge 1131 ny Can you find the relation?
**
Challenge 1132 n How can you give somebody an electrical shock using a 4.5 V battery and some wire?
628 iv cl assical electrodynamics • 17. cl assical physics in a nu tshell
found out. In nature, we learned to distinguish and to characterize objects, radiation and
space-time. All these three can move. In all motion we distinguish the fixed, intrinsic
properties from the varying state. All motion happens in such a way as to minimize
change.
Looking for all the fixed, intrinsic aspects of objects, we find that all sufficiently small
objects or particles are described completely by their mass and their electric charge. There
is no magnetic charge. Mass and electric charge are thus the only localized intrinsic prop-
erties of classical, everyday objects. Both mass and electric charge are defined by the ac-
celerations they produce around them. Both quantities are conserved; thus they can be
added. Mass, in contrast to charge, is always positive. Mass describes the interaction of
objects with their environment, charge the interaction with radiation.
cl assical physics in a nu tshell – one and a half steps ou t of three 629
All varying aspects of objects, i.e. their state, can be described using momentum and
position, as well as angular momentum and orientation. All can vary continuously in
amount and direction. Therefore the set of all possible states forms a space, the so-called
phase space. The state of extended objects is given by the states of all its constituent
particles. These particles make up all objects and somehow interact electromagnetically.
The state of a particle depends on the observer. The state is useful to calculate the
change that occurs in motion. For a given particle, the change is independent of the ob-
server, but the states are not. The states found by different observers are related: the re-
lations are called the ‘laws’ of motion. For example, for different times they are called
evolution equations, for different places and orientations they are called transformation
relations, and for different gauges they are called gauge transformations. All can be con-
densed in the principle of least action.
We also observe the motion of a massless entity: radiation. Everyday types of radiation,
such as light, radio waves and their related forms, are travelling electromagnetic waves.
Energy and mass speed is bound from above by a universal constant c, and energy
change per time is bound from above by a universal constant c 5 ~4G. The speed value c is
realized for the motion of massless particles. It also relates space to time. The power value
c 5 ~4G is realized by horizons. They are found around black holes and at the border of the
universe. The value also relates space-time curvature to energy flow and thus describes
the elasticity of space-time.
No two objects can be at the same spot at the same time. This is the first statement that
humans encounter about electromagnetism. More detailed investigation shows that elec-
tric charge accelerates other charges, that charge is necessary to define length and time
intervals, and that charges are the source of electromagnetic fields. Also light is such a
field. Light travels at the maximum possible velocity. In contrast to objects, light can in-
terpenetrate. In summary, we learned that of the two naive types of object motion, namely
630 iv cl assical electrodynamics • 17. cl assical physics in a nu tshell
motion due to gravity – or space-time curvature – and motion due to the electromagnetic
field, only the latter is genuine.
Above all, classical physics showed us that motion, be it linear or rotational, be it that
of matter, radiation or space-time, is conserved. Motion is continuous. More than that,
motion is similar to a continuous substance: it is never destroyed, never created, but al-
ways redistributed. Owing to conservation, all motion, that of objects, images and empty
space, is predictable and reversible. Owing to conservation of motion, time and space
can be defined. In addition, we found that classical motion is also right–left symmetric.
Classical physics showed us that motion is predictable: there are no surprises in nature.
C r i t i c a l si t uat i o n Ye a r s f r o m n o w
Several magnetic north and south poles appear, allowing solar c. 800
storms to disturb radio and telecommunications, to interrupt
electricity supplies, to increase animal mutations and to disori-
ent migrating animals such as wales, birds and tortoises
Our interstellar gas cloud detaches from the solar systems, chan- c. 3 000
ging the size of the heliosphere, and thus expose us more to au-
rorae and solar magnetic fields
Reversal of Earth’s magnetic field, implying a time with almost no unknown
magnetic field, with increased cosmic radiation levels and thus
more skin cancers and miscarriages
C r i t i c a l si t uat i o n Ye a r s f r o m n o w
Despite the fascination of the predictions, we leave aside these literally tremendous issues
and continue on our adventure.
632 iv cl assical electrodynamics • 17. cl assical physics in a nu tshell
We might think that we know nature now, as did Albert Michelson at the end of the
nineteenth century. He claimed that electrodynamics and Galilean physics implied that
the major laws of physics were well known. The statement is often quoted as an example
”
of flawed predictions, since it reflects an incredible mental closure to the world around
him. General relativity was still unknown, and so was quantum theory.
At the end of the nineteenth century, the progress in technology due to the use of
electricity, chemistry and vacuum technology had allowed better and better machines
and apparatuses to be built. All were built with classical physics in mind. In the years
Copyright © Christoph Schiller November 1997–January 2007
between 1890 and 1920, these classical machines completely destroyed the foundations
of classical physics. Experiments with these apparatuses showed that matter is made of
atoms, that electrical charge comes in the smallest amounts and that nature behaves ran-
domly. Nature does produce surprises – through in a restricted sense, as we will see. Like
the British Empire, the reign of classical physics collapsed. Speaking simply, classical phys-
ics does not describe nature at small scales.
* No surprises also imply no miracles. Classical physics is thus in opposition to many religions. Indeed, many
religions argue that infinity is the necessary ingredient to perform miracles. Classical physics shows that this
is not the case.
** From his 1894 address at the dedication ceremony for the Ryerson Physical Laboratory at the University
of Chicago.
cl assical physics in a nu tshell – one and a half steps ou t of three 633
But even without machines, the Victorian physicist could have predicted the situ-
ation. (In fact, many more progressive minds did so.) He had overlooked a contradiction
between electrodynamics and nature, for which he had no excuse. In our walk so far we
found that clocks and metre bars are necessarily made of matter and based on electro-
magnetism. But as we just saw, classical electrodynamics does not explain the stability of
matter. Matter is made of small particles, but the relation between these particles, electri-
city and the smallest charges is not clear. If we do not understand matter, we do not yet
understand space and time, since they are defined using measurement devices made of
matter.
Worse, the Victorian physicist overlooked a simple fact: the classical description of
nature does not allow one to understand life. The abilities of living beings – growing,
seeing, hearing, feeling, thinking, being healthy or sick, reproducing and dying – are all
unexplained by classical physics. In fact, all these abilities contradict classical physics.
Understanding matter and its interactions, including life itself, is therefore the aim of the
Biblio graphy
502 Julian Schwinger, L.L. DeR aad, K.A. Milton & W.Y. Tsai, Classical Electrody-
namics, Perseus, 1998. An excellent text on the topic by one of its greatest masters.
See also the beautiful problem book by André Bu toli & Jean-Marc Lév y-
Leblond, La physique en questions – électricité et magnétisme, Vuibert, 1999. Cited on
pages 525 and 559.
503 K. Hagiwara & al., Review of particle physics, Physical Review D 66, p. 010001, 2002,
or Hans V. Kl apd or-Kleingrothaus & Kai Zuber, Particle Astrophysics, second
edition, The Institute of Physics, 2000. See also H. Jeon & M. Longo, Search for magnetic
monopoles trapped in matter, Physical Review Letters 75, pp. 1443–1447, 1995. See also A.S.
Goldhaber & W.P. Trower, Resource letter MM-1: magnetic monopoles, American
Journal of Physics 58, pp. 429–439, 1990. Cited on page 527.
504 R. Edwards, Filling station fires spark cars’ recall, New Scientist, pp. 4–5, 4 March 1995.
510 See for example C. Schiller, A.A. Koomans, T.L. van Rooy, C. Schönenberger
& H.B. Elswijk, Decapitation of tungsten field emitter tips during sputter sharpening,
Surface Science Letters 339, pp. L925–L930, 1996. Cited on page 535.
511 L.I. Schiff & M.V. Barnhill, Gravitational-induced electric field near a metal, Physical
Review 151, pp. 1067–1071, 1966. F.C. Witteborn & W.M. Fairbank, Experimental
comparison of the gravitational force on freely falling electrons and metallic electrons, Phys-
ical Review Letters 19, pp. 1049–1052, 1967. Cited on page 536.
512 J. Lepak & M. Crescimanno, Speed of light measurement using ping, electronic pre-
print available at http://arxiv.org/abs/physics/0201053. Cited on page 536.
513 Pierre de Maricourt, Tractatus de magnete, 1269. Cited on page 537.
514 R. Wiltschko & W. Wiltschko, Magnetic Orientation in Animals, Springer, Berlin,
1995. Cited on page 538.
bibliography 635
=
L 2m 1
ë , (463)
M e д
where e is the electron charge and m its mass. Both L and M are measurable. The first meas-
urements were published with a д-value of 1, most probably because the authors expected
the value. In later experiments, de Haas found other values. Measurements by other research-
ers gave values nearer to 2 than to 1, a fact that was only understood with the discovery of
spin. The original publications are A. Einstein & W.J. de Haas, Proefondervinderlijk
bewijs voor het bestaan der moleculaire stroomen van Ampère, Konninklijke Akademie der
Wetenschappen te Amsterdam, Verslagen 23, p. 1449, 1915, and A. Einstein & W.J. de
Haas, Experimental proof of the existence of Ampère’s molecular currents, Konninklijke
Akademie der Wetenschappen te Amsterdam, Proceedings 18, p. 696, 1916. Cited on page 540.
516 S.J. Barnett, Magnetization by rotation, Physical Review 6, pp. 171–172, 1915, and S.J.
Barnett, Magnetization by rotation, Physical Review 6, pp. 239–270, 1915. Cited on page
527 Mark D. Semon & John R. Taylor, Thoughts on the magnetic vector potential, Amer-
ican Journal of Physics 64, pp. 1361–1369, 1996. Cited on pages 558, 559, and 560.
528 Jean Sivardière, Simple derivation of magnetic vector potentials, European Journal of
Physics 14, pp. 251–254, 1993. Cited on page 558.
529 T.T. Wu & C.N. Yang, 1975, Concept of nonintegrable phase factors and global formula-
tion of gauge fields, Physical Review D 12, pp. 3845–3857, Cited on page 561.
530 An electrodynamics text completely written with (mathematical) forms is Kurt Meetz
& Walter L. Engl, Elektromagnetische Felder – mathematische und physikalische
Grundlagen, Springer, 1980. Cited on page 562.
531 J. Travis, Twirl those organs into place – getting to the heart of how a heart knows left
from right, Science News 156, 21 August, 1999. A good book on asymmetries in nature is H.
Brunner, Rechts oder links, Wiley–Vch, 1999. Cited on page 564.
532 See for example the discussion by M.C. Corballis & I.L. Beale, On telling left from
right, Scientific American 224, pp. 96–104, March 1971. Cited on page 565.
538 A recent measurement of the frequency of light is presented in Th. Udem, A. Huber, B.
Gross, J. R eichert, M. Prevedelli, M. Weitz & T.W. Hausch, Phase-coherent
measurement of the hydrogen 1S–2S transition frequency with an optical frequency inter-
val divider chain, Physical Review Letters 79, pp. 2646–2649, 6 October 1997. Another is
C. Schwob, L. Jozefowski, B. de Beauvoir, L. Hilico, F. Nez, L. Julien, F.
Biraben, O. Acef & A. Cl airon, Optical frequency measurement of the 2S-12D trans-
itions in hydrogen and deuterium: Rydberg constant and Lamb shift determinations, Phys-
ical Review Letters 82, pp. 4960–4963, 21 June 1999. Cited on page 572.
539 The discoverors of two such methods were awarded the 2005 Nobel Prize for physics. Cited
on page 572.
540 See for example G. Horváth, J. Gál & R. Wehner, Why are water-seeking insects not
attracted by mirages? The polarization pattern of mirages, Naturwissenschaften 83, pp. 300–
bibliography 637
change of direction of the tails of comets when they circle around the Sun. Cited on page
586.
552 P. Gal ajda & P. Ormos, Applied Physics Letters 78, p. 249, 2001. Cited on page 586.
553 A short overview is given by Miles Pad gett & Les Allen, Optical tweezers and span-
ners, Physics World pp. 35–38, September 1997. The original papers by Ashkin’s group are
A. Ashkin, J.M. Dziedzic, J.E. Bjorkholm & S. Chu, Observation of a gradient
force optical trap for dielectric particles, Optics Letters 11, p. 288, 1986, and A. Askin,
J.M. Dziedzic & T. Yamane, Optical trapping and manipulation of single cells using in-
frared laser beams, Nature 330, p. 769, 1987. A pedagogical explanation on optical spanners,
together with a way to build one, can be found in D.N. Moothoo, J. Arlt, R.S. Con-
roy, F. Akerboom, A. Voit & K. Dhol akia, Beth’s experiment using optical tweezers,
American Journal of Physics 69, pp. 271–276, 2001, and in S.P. Smith, S.R. Bhalotra,
A.L. Brody, B.L. Brown, E.K. B oyda & M. Prentiss, Inexpensive optical tweezers
for undergraduate laboratories, American Journal of Physics 67, pp. 26–35, 1999. Cited on
page 586.
and the numerous physical effects involved. Detailed simulations and extensive material is
available. See also his paper A.T. Young, Sunset science – III. Visual adaptation and green
flashes,Journal of the Optical Society of America A 17, pp. 2129–2139, 2000. Cited on pages
589 and 590.
558 There are also other ways to see the green ray, for longer times, namely when a mirage
appears at sunset. An explanation with colour photograph is contained in M. Vollmer,
Gespiegelt in besonderen Düften ...– Oasen, Seeungeheuer und weitere Spielereien der Fata
Morgana, Physikalische Blätter 54, pp. 903–909, 1998. Cited on page 589.
559 See the website by Les Cowley on atmospheric optics, http://www.atoptics.co.uk. Or the
book David K. Lynch & William Livingston,Color and Light in Nature, second
edition, Cambridge University Press, 2001. They updated and expanded the fascination for
colours in nature, such as, for example, the halos around the Moon and the Sun, or the
bibliography 639
colour of shadows,, that was started by the beautiful and classic book by Marcel Minnaert
mentioned on page 76. Cited on page 590.
560 This famous discovery is by Brent Berlin & Paul Kay, Basic Color Terms: Their Uni-
versality and Evolution, University of California Press, 1969. Their ongoing world colour sur-
vey is eagerly awaited. Of course there are also ongoing studies to find possible exceptions;
the basic structure is solid, as shown in the conference proceedings C.L. Hardin & Lu-
isa Maffi, Colour Categories in Thought and Language, Cambridge University Press, 1997.
Cited on page 590.
561 For a thorough discussion of the various velocities connected to wave trains, see the classic
text by Louis Brillouin, Wave Propagation and Group Velocity, Academic Press, New
York, 1960. It expands in detail the theme discussed by Arnold Sommerfeld, Über die
Fortpflanzung des Lichtes in dispergierenden Medien, Annalen der Physik, 4th series, 44,
pp. 177–202, 1914. See also Arnold Sommerfeld, Optik, Dietrichssche Verlagsbuchand-
lung, Wiesbaden 1950, section 22. An English translation Arnold Sommerfeld, Lec-
tures on Theoretical Physics: Optics, 1954, is also available. Cited on pages 591, 592, and 593.
565
substances with simultaneously negative values of ε and µ, Soviet Physics Uspekhi 10, p. 509,
1968. The explanation with different refraction directions was published by P.M. Val anju,
R.M. Walser & A.P. Val anju, Wave refraction in negative-index media: always positive
and very inhomogeneous, Physical Review Letters 88, p. 187401, 8 May 2002. Also Fermat’s
principle is corrected, as explained in V.G. Vesel ago, About the wording of Fermat’s
principle for light propagation in media with negative refraction index, http://arxiv.org/
abs/cond-mat/0203451. Cited on page 593.
566 The first example of material system with a negative refraction index were presented by
David Smith and his team. R.A. Schelby, D.R. Smith & S. Schultz, Experimental
verification of a negative index of refraction, Science 292, p. 77-79, 2001. More recent ex-
amples are A.A. Houck, J.B. Brock & I.L. Chuang, Experimental observations of a
left-handed material that obeys Snell’s law, Physical Review Letters 90, p. 137401, 2003, C.G.
640 iv cl assical electrodynamics
572 This happened to Giovanni Bellini (c. 1430–1516) the great Venetian Renaissance painter,
who even put this experience into writing, thus producing one of the greatest ‘gaffes’ ever. If
you take a photograph of the effect with a remotely controlled camera, you can prove that
your camera is holy as well. Cited on page 597.
573 S.R. Wilk, How retroreflectors really work, Optics & Photonics News, pp. 6–7, December
1993. Cited on page 597.
574 W.H. Ehrenstein & B. Lingelbach, Das Hermann–Gitter, Physik in unserer Zeit 6,
pp. 263–268, 2002. The journal also shows a colour variation of these lattices. Cited on page
597.
575 The eye sensitivity myth is debunked in detail by B.H. Soffer & D.K. Lynch, Some
paradoxes, errors, and resolutions concerning the spectral optimization of human vision,
American Journal of Physics 67, pp. 946–953, 1999. Cited on page 599.
bibliography 641
576 David R. Williams, Supernormal Vision, Science News 152, pp. 312–313, 15 November
1997. See also http://www.cvs.rochester.edu/people/d_williams/d_williams.html as well as
the photographs at http://www.cvs.rochester.edu/people/~aroorda/ao_research.html of the
interior of living human eyes. Their last publication is A. Roorda, A. Metha, P. Lennie
& D.R. Williams, Packing arrangement of the three cone classes in the primate retina,
Vision Research 41, pp. 1291–1306, 2001. Cited on page 600.
577 S.W. Hell, Strategy for far-field optical imaging and writing without diffraction limit, Phys-
ics Letters A 326, pp. 140–145, 2004, see also V. Westphal & S.W. Hell, Nanoscale resol-
ution in the focal plane of an optical microscope, Physical Review Letters 94, p. 143903, 2005,
and V. Westphal, J. Seeger, T. Salditt & S.W. Hell, Stimulated emission depletion
microscopy on lithographic microstructures, Journal of Physics B 38, pp. S695–S705, 2005.
Cited on page 603.
578 A.D. Erlykin & A.W. Wolfendale, The origin of cosmic rays, European Journal of
Physics 20, pp. 409–418, 1999, Cited on page 609.
579 D. Singleton, Electromagnetic angular momentum and quantum mechanics, American
586 J.E. Avron, E. Berg, D. Goldsmith & A. Gord on, Is the number of photons a clas-
sical invariant?, European Journal of Physics 20, pp. 153–159, 1999. Cited on page 627.
587 If you want to see more on how the world looks for the different types of colour blind, have
a look at the http://webexhibits.org/causesofcolor/2.html or the http://www.vischeck.com/
examples web pages. Cited on page 606.
588 If you want to read more on the topic, have a look at the classic text by Warren J. Smith,
Modern Optical Engineering : the Design of Optical Systems, 3rd edition, McGraw-Hill, 2000.
Cited on pages 585 and 607.
589 About the life-long passion that drove Luke Howard, see the book by R ichard Hamblyn,
The Invention of Clouds, Macmillan 2001. Cited on page 610.
590 J. L atham, The electrification of thunderstorms, Quartely Journal of the Royal Meteorolo-
gical Society 107, pp. 277–289, 1981. For a more recent and wider review, see Earle R.
642 iv cl assical electrodynamics
53, pp. 1203–1211, 1985. See also A. Kovetz & G.E. Tauber, Radiation from an acceler-
ated charge and the principle of equivalence, American Journal of Physics 37, pp. 382–385,
1969. Cited on page 613.
599 C. de Almeida & A. Saa, The radiation of a uniformly accelerated charge is beyond the
horizon: a simple derivation, American Journal of Physics 74, pp. 154–158, 2006. Cited on
page 613.
600 J. Zhang, X.D. Song, Y.C. Li, P.G. R ichards, X.L. Sun & F. Waldhauser, Inner
core differential motion confirmed by earthquake doublet waveform doublets, Science 309,
pp. 1357–1360, 2005. Cited on page 614.
601 This is deduced from the д − 2 measurements, as explained in his Nobel Prize talk by Hans
Dehmelt, Experiments with an isolated subatomic particle at rest, Reviews of Modern Phys-
ics 62, pp. 525–530, 1990, and in Hans Dehmelt, Is the electron a composite particle?,
bibliography 643
magnetism takes off, Physics World April 1997, page 28. The experiments by Andre Geim,
Jan Kees Maan, Humberto Carmona and Peter Main were made public by P. Rod gers,
Physics World 10, p. 28, 1997. Some of the results can be found in M.V. Berry & A.K.
Geim, Of flying frogs and levitrons, European Journal of Physics 18, pp. 307–313, 1997. See
also their http://www-hfml.sci.kun.nl/hfml/levitate.html website. Cited on page 617.
612 The well-known toy allows levitation without the use of any energy source and is called the
‘Levitron’. It was not invented by Bill Hones of Fascination Toys & Gifts in Seattle, as the
http://www.levitron.com website explains. The toy is discussed by Ron Ed ge, Levitation
using only permanent magnets, Physics Teacher 33, p. 252, April 1995. It is also discussed
in M.V. Berry, The LevitronT M : an adiabatic trap for spins, Proceedings of the Royal So-
ciety A 452, pp. 1207–1220, 1996, (of Berry’s phase fame) as well as by M.D. Simon, L.O.
Heflinger & S.L. R id geway, Spin stabilized magnetic levitation, American Journal of
644 iv cl assical electrodynamics
Physics 65, pp. 286–92, 1997, and by T.B. Jones, M. Washizu & R. Gans, Simple theory
for the Levitron, Journal of Applied Physics 82, pp. 883–889, 1997. Cited on page 618.
613 The drill trick and the building of a Levitron are described in the beautiful lecture script
by Josef Zweck, Physik im Alltag, Skript zur Vorlesung im WS 1999/2000 der Universität
Regensburg. Cited on page 618.
614 The prediction about quantized levitation is by Stephen B. Haley, Length quantization
in levitation of magnetic microparticles by a mesoscopic superconducting ring, Physical Re-
view Letters 74, pp. 3261–3264, 1995. The topic is discussed in more detail in Stephen B.
Haley, Magnetic levitation, suspension, and superconductivity: macroscopic and meso-
scopic, Physical Review B 53, p. 3506, 1996, reversed in order with Stephen B. Haley,
Quantized levitation of superconducting multiple-ring systems, Physical Review B 53,
p. 3497, 1996, as well as Stephen B. Haley, Quantized levitation by multiply-connected
superconductors, LT-21 Proceedings, in Czechoslovak Journal of Physics 46, p. 2331, 1996. In
1998, there was not yet an experimental confirmation (Stephen Haley, private communica-
tion). Cited on page 618.
Ohtani, Electric-field control of ferromagnetism, Nature 408, pp. 944–946, 21-28 Decem-
ber 2000. Cited on page 619.
620 This effect was discovered by Geert R ikken, Bart van Tiggelen & Anja Sparen-
berg, Lichtverstrooiing in een magneetveld, Nederlands tijdschrift voor natuurkunde 63,
pp. 67–70, maart 1998. Cited on page 621.
621 Vitalij Pecharsky & Karl A. Gschneidner, Giant magnetocaloric effect in
Gd5(Si2Ge2), Physical Review Letters 78, pp. 4494–4497, 1995, and, from the same authors,
Tunable magnetic regenerator alloys with a giant magnetocaloric effect for magnetic refri-
geration from 20 to 2990 K, Applied Physics Letters 70, p. 3299, 1997. Cited on page 621.
622 J. Weissmüller, R.N. Viswanath, D. Kramer, P. Zimmer, R. Würschum & H.
Gleiter, Charge-induced reversible strain in a metal, Science 300, pp. 312–315, 11 April
2003. Cited on page 621.
bibliography 645
”
I
n our quest for increased precision in the description of all motion around us, it
physical statements with particular care, using all criteria imaginable. Physics is detailed
prattle by curious people about moving things. The criteria for precision appear once we
Challenge 1135 e ask: which abilities does this prattle require? You might want to fill in the list yourself.
The abilities necessary for talking are a topic of research even today. The way that
the human species acquired the ability to chat about motion is studied by evolutionary
biologists. Child psychologists study how the ability develops in a single human being.
Physiologists, neurologists and computer scientists are concerned with the way the brain
and the senses make this possible; linguists focus on the properties of the language we use,
* ‘Everything that can be thought at all can be thought clearly.’ This and other quotes of Ludwig Wittgenstein
are from the equally short and famous Tractatus logico-philosophicus, written in 1918, first published in 1921;
it has now been translated into many other languages.
648 intermezzo: the brain, l anguage and the human condition
while logicians, mathematicians and philosophers of science study the general properties
of statements about nature. All these fields investigate tools that are essential for the devel-
opment of physics, the understanding of motion and the specification of the undefined
concepts listed above. The fields structure this intermezzo.
Evolution
A hen is only an egg’s way of making another
“ egg.
Samuel Butler, Life and Habit.
The evolution of the human species is the result of a long story that has been told in ”
Ref. 632 many excellent books. A summarizing table on the history of the universe is given in the
Page 457 chapter on general relativity. The almost incredible chain of events that has lead to one’s
own existence includes the formation of atoms, of the galaxies, the stars, the planets, the
Moon, the atmosphere, the first cells, the water animals, the land animals, the mammals,
Ref. 634
During childhood, everybody is a physicist. When we follow our own memories back-
wards in time as far as we can, we reach a certain stage, situated before birth, which forms
the starting point of human experience. In that magic moment, we sensed somehow that
” Copyright © Christoph Schiller November 1997–January 2007
apart from ourselves, there is something else. The first observation we make about the
world, during the time in the womb, is thus the recognition that we can distinguish two
parts: ourselves and the rest of the world. This distinction is an example – perhaps the
first – of a large number of ‘laws of nature’ that we stumble upon in our lifetime. By dis-
covering more and more distinctions we bring structure in the chaos of experience. We
quickly find out that the world is made of related parts, such as mama, papa, milk, Earth,
toys, etc.
Later, when we learn to speak, we enjoy using more difficult words and we call the
surroundings the environment. Depending on the context, we call the whole formed by
oneself and the environment together the (physical) world, the (physical) universe, nature,
children, l aws and physics 649
or the cosmos. These concepts are not distinguished from each other in this walk;* they
are all taken to designate the sum of all parts and their relations. They are simply taken
here to designate the whole.
The discovery of the first distinction starts a chain of similar discoveries. We extract
the numerous distinctions that are possible in the environment, in our own body and in
the various types of interactions between them. The ability to distinguish is the central
ability that allows us to change our view from that of the world as chaos, i.e. as a big
mess, to that of the world as a system, i.e. a structured set, in which parts are related in
Challenge 1137 n specific ways. (If you like precision, you may ponder whether the two choices of ‘chaos’
and ‘system’ are the only possible ones. We will return to this issue in the third part of our
mountain ascent.)
In particular, the observation of the differences between oneself and the environment
goes hand in hand with the recognition that not only are we not independent of the en-
vironment, but we are firmly tied to it in various inescapable ways: we can fall, get hurt,
describe nature.
During childhood we learn to distinguish between interactions with the environment
(or perceptions): some are shared with others and called observations, others are uniquely
personal and are called sensations.** A still stricter criterion of ‘sharedness’ is used to
divide the world into ‘reality’ and ‘imagination’ (or ‘dreams’). Our walk will show that this
* The differences in usage can be deduced from their linguistic origins. ‘World’ is derived from old Germanic
‘wer’ – person – and ‘ald’ – old – and originally means ‘lifetime’. ‘Universe’ is from the Latin, and designates
the one – ‘unum’ – which one sees turning – ‘vertere’, and refers to the starred sky at night which turns
around the polar star. ‘Nature’ comes also from the Latin, and means ‘what is born’. ‘Cosmos’ is from Greek
κόσµος and originally means ‘order’.
** A child that is unable to make this distinction among perceptions – and who is thus unable to lie – almost
Ref. 635 surely develops or already suffers from autism, as recent psychological research has shown.
650 intermezzo: the brain, l anguage and the human condition
distinction is not essential, provided that we stay faithful to the quest for ever increasing
precision: we will find that the description of motion that we are looking for does not
depend on whether the world is ‘real’ or ‘imagined’, ‘personal’ or ‘public’.
Humans enjoy their ability to distinguish parts, which in other contexts they also call
details, aspects or entities, and enjoy their ability to associate them or to observe the re-
Ref. 636 lations between them. Humans call this activity classification. Colours, shapes, objects,
mother, places, people and ideas are some of the entities that humans discover first.
Our anatomy provides a handy tool to make efficient use of these discoveries: memory.
It stores a large amount of input that is called experience afterwards. Memory is a tool used
by both young and old children to organize their world and to achieve a certain security
in the chaos of life.
Memorized classifications are called concepts. Jean Piaget was the first researcher to
describe the influence of the environment on the concepts that a child forms. Step by
step, children learn that objects are localized in space, that space has three dimensions,
* An overview of the origin of developmental psychology is given by J.H. Flavell, The Developmental
Psychology of Jean Piaget, 1963. This work summarizes the observations by the French speaking Swiss Jean
Piaget (1896–1980), the central figure in the field. He was one of the first researchers to look at child devel-
opment in the same way that a physicist looks at nature: carefully observing, taking notes, making exper-
iments, extracting hypotheses, testing them, deducing theories. His astonishingly numerous publications,
based on his extensive observations, cover almost all stages of child development. His central contribution
is the detailed description of the stages of development of the cognitive abilities of humans. He showed that
all cognitive abilities of children, the formation of basic concepts, their way of thinking, their ability to talk,
etc., result from the continuous interaction between the child and the environment.
In particular, Piaget described the way in which children first learn that they are different from the ex-
ternal environment, and how they then learn about the physical properties of the world. Of his many books
related to physical concepts, two especially related to the topic of this walk are J. Piaget, Les notions de
Copyright © Christoph Schiller November 1997–January 2007
mouvement et de vitesse chez l’enfant, Presses Universitaires de France, 1972 and Le developpement de la
notion de temps chez l’enfant, Presses Universitaires de France, 1981, this last book being born from a sugges-
tion by Albert Einstein. These texts should be part of the reading of every physicist and science philosopher
interested in these questions.
Piaget also describes how in children the mathematical and verbal intelligence derives from sensomo-
torial, practical intelligence, which itself stems from habits and acquired associations to construct new con-
cepts. Practical intelligence requires the system of reflexes provided by the anatomical and morphological
structure of our organism. Thus his work shows in detail that our faculty for mathematical description of
the world is based, albeit indirectly, on the physical interaction of our organism with the world.
Some of his opinions on the importance of language in development are now being revised, notably
Ref. 637 through the rediscovery of the work of Lev Vigotsky, who argues that all higher mental abilities, emotions,
recollective memory, rational thought, voluntary attention and self-awareness, are not innate, but learned.
This learning takes place through language and culture, and in particular through the process of talking to
oneself.
children, l aws and physics 651
Even though everyone has been a physicist in their youth, most people remain classical
physicists. In this adventure we continue, using all the possibilities of a toy with which
nature provides us: the brain.
Experience is the name everyone gives to their
“ mistakes.
Oscar Wilde, Lady Windermere’s Fan.
”
Why a brain?
Denken ist bereits Plastik.*
Ref. 639
Numerous observations show that sense input is processed, i.e. classified, stored and re- ”
trieved in the brain. Notably, lesions of the brain can lead to the loss of part or all of these
functions. Among the important consequences of these basic abilities of the brain are
ger the number of these entities, often called ‘neurons’ by analogy to the brain, the more
sophisticated classifications can be produced by the classifier.*** Classifiers thus work by
applying more or less sophisticated combinations of ‘same’ and ‘different’. The distinction
by a child of red and blue objects is such a classification; the distinction of compact and
What is information?
These thoughts did not come in any verbal
We started by stating that studying physics means to talk about motion. To talk is to
transmit information. Can information be measured? Can we measure the progress of
physics in this way? Is the universe made of information?
”
Information is the result of classification. A classification is the answer to one or to
several yes–no questions. Such yes–no questions are the simplest classifications possible;
they provide the basic units of classification, from which all others can be built. The
simplest way to measure information is therefore to count the implied yes–no questions,
the bits, leading to it. Are you able to say how many bits are necessary to define the place
where you live? Obviously, the number of bits depends on the set of questions with which
we start; that could be the names of all streets in a city, the set of all coordinates on the
Copyright © Christoph Schiller November 1997–January 2007
surface of the Earth, the names of all galaxies in the universe, the set of all letter combin-
Challenge 1140 n ations in the address. What is the most efficient method you can think of? A variation
of the combination method is used in computers. For example, the story of this walk re-
quired about a thousand million bits. But since the amount of information in a normal
letter depends on the set of questions with which we start, it is impossible to define a
precise measure for information in this way.
The only way to measure information precisely is to take the largest possible set of
questions that can be asked about a system, and to compare it with what is known about
the system. In this case, the amount of unknown information is called entropy, a concept
Page 244 that we have already encountered. With this approach you should able to deduce yourself
Challenge 1141 n whether it is really possible to measure the advance of physics.
children, l aws and physics 653
Since categorization is an activity of the brain and other, similar classifiers, informa-
tion as defined here is a concept that applies to the result of activities by people and by
other classifiers. In short, information is produced when talking about the universe – the
universe itself is not the same as information. There is a growing number of publications
based on the opposite of this view; however, this is a conceptual short circuit. Any trans-
mission of information implies an interaction; physically speaking, this means that any
information needs energy for transmission and matter for storage. Without either of these,
there is no information. In other words, the universe, with its matter and energy, has to
exist before transmission of information is possible. Saying that the universe is made of
information is as meaningful as saying that it is made of toothpaste.
The aim of physics is to give a complete classification of all types and examples of mo-
tion, in other words, to know everything about motion. Is this possible? Or are you able
Challenge 1142 n to find an argument against this endeavour?
“ Woody Allen
Can we estimate the probability for a record to appear or disappear by chance? Yes, we
can. Every record is made of a characteristic number N of small entities, for example the
number of the possible ink dots on paper, the number of iron crystals in a cassette tape,
the electrons in a bit of computer memory, the silver iodide grains in a photographic
negative, etc. The chance disturbances in any memory are due to internal fluctuations,
also called noise. Noise makes the record unreadable; it can be dirt on a signature, thermal
magnetization changes in iron crystals, electromagnetic noise inside a solid state memory,
etc. Noise is found in all classifiers, since it is inherent in all interactions and thus in all
information processing.
It is a general property that internal fluctuations due to noise decrease when the size,
i.e. the number of components of the record is increased. In fact, the probability p mis for
654 intermezzo: the brain, l anguage and the human condition
where N is the number of particles or subsystems used for storing it. This relation appears
because, for large numbers, the so-called normal distribution is a good approximation of
almost any process; the width of the normal distribution, which determines the probab-
ility of record errors, grows less rapidly than its integral when the number of entities is
Challenge 1144 ny increased. (Are you able to confirm this?)
We conclude that any good record must be made from a large number of entities. The
larger the number, the less sensitive the memory is to fluctuations. Now, a system of large
size with small fluctuations is called a (physical) bath. Only baths make memories possible.
In other words, every record contains a bath. We conclude that any observation of a system
is the interaction of that system with a bath. This connection will be used several times in
and the number ln 2 0.69 is the natural logarithm of 2. Erasing thus on one hand re-
duces the disorder of the data – the local entropy–, but on the other hand increases the Copyright © Christoph Schiller November 1997–January 2007
total entropy. As is well known, energy is needed to reduce the entropy of a local system.
In short, any system that erases memory requires energy. For example, a logical AND gate
effectively erases one bit per operation. Logical thinking thus requires energy. It is also
known that dreaming is connected with the erasing and reorganization of information.
Could that be the reason that, when we are very tired, without any energy left, we do not
Challenge 1147 n dream as much as usual?
Entropy is thus necessarily created when we forget. This is evident when we remind
Ref. 645 ourselves that forgetting is similar to the deterioration of an ancient manuscript. Entropy
increases when the manuscript is not readable any more, since the process is irreversible
and dissipative.* Another way to see this is to recognize that to clear a memory, e.g. a
Ref. 646 * As Wojciech Zurek clearly explains, the entropy created inside the memory is the main reason that even
children, l aws and physics 655
magnetic tape, we have to put energy into it, and thus increase its entropy. Conversely,
writing into a memory can often reduce entropy; we remember that signals, the entities
that write memories, carry negative entropy. For example, the writing of magnetic tapes
usually reduces their entropy.
“ answers.
(Wrongly) attributed to Pablo Picasso
The human brain is built in such a way that its fluctuations cannot destroy its contents. ”
The brain is well protected by the skull for exactly this reason. In addition, the brain
literally grows connections, called synapses, between its various neurons, which are the
cells doing the signal processing. The neuron is the basic processing element of the brain,
performing the basic classification. It can only do two things: to fire and not to fire. (It
(One byte, abbreviated B, is the usual name for eight bits of information.) Note that evolu-
Copyright © Christoph Schiller November 1997–January 2007
tion has managed to put as many neurons in the brain as there are stars in the galaxy, and
that if we add all the synapse lengths, we get a total length of about 1011 m, which cor-
responds to the distance to from the Earth to the Sun. Our brain truly is astronomically
Maxwell’s demon cannot reduce the entropy of two volumes of gases by opening a door between them in
such a way that fast molecules accumulate on one side and slow molecules accumulate on the other. (Maxwell
had introduced the ‘demon’ in 1871, to clarify the limits posed by nature to the gods.) This is just another
way to rephrase the old result of Leo Szilard, who showed that the measurements by the demon create more
Ref. 647 entropy than they can save. And every measurement apparatus contains a memory.
To play being Maxwell’s demon, click on the http://www.wolfenet.com/~zeppelin/maxwell.htm website.
* The number of neurons seems to be constant, and fixed at birth. The growth of interconnections is highest
between age one and three, when it is said to reach up to 107 new connections per second.
656 intermezzo: the brain, l anguage and the human condition
complex.
In practice, the capacity of the brain seems almost without limit, since the brain frees
memory every time it needs some new space, by forgetting older data, e.g. during sleep.
Note that this standard estimate of 1014 bits is not really correct! It assumes that the only
component storing information in the brain is the synapse strength. Therefore it only
measures the erasable storage capacity of the brain. In fact, information is also stored in
the structure of the brain, i.e. in the exact configuration in which every cell is connected
to other cells. Most of this structure is fixed at the age of about two years, but it continues
to develop at a lower level for the rest of human life. Assuming that for each of the N cells
»
with n connections there are f n connection possibilities, this write once capacity of the
Challenge 1148 ny brain can be estimated as roughly N f n f n log f n bits. For N = 1011 , n = 102 , f = 6, this
gives
M writeonce 1016 bit 106 GB . (467)
the number of bits in human DNA and those in the brain nicely shows that almost all
information stored in the brain is taken from the environment; it cannot be of genetic
origin, even allowing for smart decompression of stored information.
In total, all these tricks used by nature result in the most powerful classifier yet
known.* Are there any limits to the brain’s capacity to memorize and to classify? With
the tools that humans have developed to expand the possibilities of the brain, such as
paper, writing and printing to help memory, and the numerous tools available to simplify
and to abbreviate classifications explored by mathematicians, brain classification is only
Ref. 651 limited by the time spent practising it. Without tools, there are strict limits, of course.
* Also the power consumption of the brain is important: even though it contains only about 2% of the body’s
mass, is uses 25% of the energy taken in by food.
what is l anguage? 657
The two-millimetre thick cerebral cortex of humans has a surface of about four sheets of
A4 paper, a chimpanzee’s yields one sheet and a monkey’s is the size of a postcard. It is
estimated that the total intellectually accessible memory is of the order of
M intellectual 1 GB , (468)
What is l anguage?
Reserve your right to think, for even to think
Language possibly is the most wonderful gift of human nature. Using the ability to pro- ”
duce sounds and to put ink on paper, people attach certain symbols,** also called words or
terms in this context, to the many partitions they specify with the help of their thinking.
Such a categorization is then said to define a concept or notion, and is set in italic typeface
in this text. A standard set of concepts forms a language.*** In other words, a (human)
language is a standard way of symbolic interaction between people.**** There are human
languages based on facial expressions, on gestures, on spoken words, on whistles, on writ-
ten words, and more. The use of spoken language is considerably younger than the human
* Propositions can only say how things are, not what they are.
Copyright © Christoph Schiller November 1997–January 2007
** A symbol is a type of sign, i.e. an entity associated by some convention to the object it refers. Following
Charles Peirce (1839–1914) – see http://www.peirce.org – the most original philosopher born in the United
States, a symbol differs from an icon (or image) and from an index, which are also attached to objects by
convention, in that it does not resemble the object, as does an icon, and in that it has no contact with the
object, as is the case for an index.
*** The recognition that language is based on a partition of ideas, using the various differences between
them to distinguish them from each other, goes back to the Swiss thinker Ferdinand de Saussure (1857–
1913), who is regarded as the founder of linguistics. His textbook Cours de linguistique générale, Editions
Payot, 1985, has been the reference work of the field for over half a century. Note that Saussure, in contrast
to Peirce, prefers the term ‘sign’ to ‘symbol’, and that his definition of the term ‘sign’ includes also the object
to which it refers.
**** For slightly different definitions and a wealth of other interesting information about language, see the
beautiful book by David Crystal, The Cambridge Encyclopedia of Language, Cambridge University Press,
1987.
658 intermezzo: the brain, l anguage and the human condition
species; it seems that it appeared only about two hundred thousand years ago. Written lan-
guage is even younger, namely only about six thousand years old. But the set of concepts
used, the vocabulary, is still expanding. For humans, the understanding of language be-
gins soon after birth (perhaps even before), the active use begins at around a year of age,
the ability to read can start as early as two, and personal vocabulary continues to grow as
long as curiosity is alive.
Physics being a lazy way to chat about motion, it needs language as an essential tool.
Of the many aspects of language, from literature to poetry, from jokes to military orders,
from expressions of encouragement, dreams, love and emotions, physics uses only a small
and rather special segment. This segment is defined by the inherent restriction to talk
about motion. Since motion is an observation, i.e. an interaction with the environment
that several people experience in the same way, this choice puts a number of restrictions
on the contents – the vocabulary – and on the form – the grammar – of such discussions.
For example, from the definition that observations are shared by others, we get the
entities subject, verb and object, though not always in this order. Just check the languages
Challenge 1149 e you know.
On the semantic aspect, the long list of lexical universals, i.e. words that appear in all
* A comprehensive list with 6 800 languages (and with 41 000 language and dialect names) can be found on
the world wide website by Barbara Grimes, Ethnologue – Languages of the World, to be found at the address
http://www.ethnologue.com or in the printed book of the same name.
It is estimated that 15 000 5 000 languages have existed in the past.
Nevertheless, in today’s world, and surely in the sciences, it is often sufficient to know one’s own language
plus English. Since English is the language with the largest number of words, learning it well is a greater
Ref. 652 challenge than learning most other languages.
** Studies explore topics such as the observation that in many languages the word for ‘little’ contains an ‘i’
(or high pitched ‘e’) sound: petit, piccolo, klein, tiny, pequeño, chiisai; exceptions are: small, parvus.
what is l anguage? 659
languages, such as ‘mother’ or ‘Sun’, has recently been given a structure. The linguist Anna
Wierzbicka performed a search for the building blocks from which all concepts can be
built. She looked for the definition of every concept with the help of simpler ones, and
continued doing so until a fundamental level was reached that cannot be further reduced.
The set of concepts that are left over are the primitives. By repeating this exercise in many
languages, Wierzbicka found that the list is the same in all cases. She thus had discovered
universal semantic primitives. In November 1992, the list contained the terms given in
Table 52.
Following the life-long research of Anna Wierzbicka and her research school, all these
concepts exist in all languages of the world studied so far.* They have defined the mean-
ing of each primitive in detail, performed consistency checks and eliminated alternative
approaches. They have checked this list in languages from all language groups, in lan-
guages from all continents, thus showing that the result is valid everywhere. In every
Ref. 653 language all other concepts can be defined with the help of the semantic primitives.
Simply stated, learning to speak means learning these basic terms, learning how to
combine them and learning the names of these composites. The definition of language
given above, namely as a means of communication that allows one to express everything
Copyright © Christoph Schiller November 1997–January 2007
one wants to say, can thus be refined: a human language is any set of concepts that includes
the universal semantic primitives.
For physicists – who aim to talk in as few words as possible – the list of semantic
primitives has three facets. First, the approach is appealing, as it is similar to physics’
own aim: the idea of primitives gives a structured summary of everything that can be
said, just as the atomic elements structure all objects that can be observed. Second, the
* It is easy to imagine that this research steps on the toes of many people. A list that maintains that ‘true’,
‘good’, ‘creation’, ‘life’, ‘mother’ or ‘god’ are composite will elicit violent reactions, despite the correctness
of the statements. Indeed, some of these terms were added in the 1996 list, which is somewhat longer. In
addition, a list that maintains that we only have about thirty basic concepts in our heads is taken by many
to be offensive.
660 intermezzo: the brain, l anguage and the human condition
list of primitives can be structured. In fact, the list of primitives can be divided into two
groups: one group contains all terms describing motion (do, happen, when, where, feel,
small, etc. – probably a term from the semantic field around light or colour should be
added) and the other group contains all terms necessary to talk about abstract sets and
relations (this, all, kind of, no, if, etc.). Even for linguistics, aspects of motion and logical
concepts are the basic entities of human experience and human thinking. To bring the
issue to a point, the semantic primitives contain the basic elements of physics and the
basic elements of mathematics. All humans are thus both physicists and mathematicians.
The third point is that the list of primitives is too long. The division of the list into two
groups directly suggests shorter lists; we just have to ask physicists and mathematicians
for concise summaries of their respective fields. To appreciate this aim, try to define what
Challenge 1150 d ‘if ’ means, or what an ‘opposite’ is – and explore your own ways of reducing the list.
Reducing the list of primitives is also one of our aims in this adventure. We will explore
the mathematical group of primitives in this intermezzo; the physical group will occupy
What is a concept?
“ Ralph Waldo Emerson*
There is a group of people that has taken the strict view on translation and on precision
to the extreme. They build all concepts from an even smaller set of primitives, namely
only two: ‘set’ and ‘relation’, and explore the various possible combinations of these two
”
concepts, studying their classifications. Step by step, this radical group, commonly called
mathematicians, came to define with full precision concepts such as numbers, points,
curves, equations, symmetry groups and more. The construction of these concepts is sum-
marized partly in the following and partly in Appendix D.
However, despite their precision, in fact precisely because of it, no mathematical
concept talks about nature or about observations.* Therefore the study of motion needs
other, more useful concepts. What properties must a useful concept have? For example,
what is ‘passion’ and what is a ‘cotton bud’? Obviously, a useful concept implies a list of
its parts, its aspects and their internal relations, as well as their relation to the exterior
world. Thinkers in various fields, from philosophy to politics, agree that the definition of
any concept requires:
— explicit and fixed content,
— explicit and fixed limits,
— explicit and fixed domain of application.
adopted, in the same way that children reject or adopt a new toy. Children do this un-
consciously, scientists do it consciously, using language.*** For this reason, concepts are
universally intelligible.
* Insofar as one can say that mathematics is based on the concepts of ‘set’ and ‘relation’, which are based on
experience, one can say that mathematics explores a section of reality, and that its concepts are derived from
experience. This and similar views of mathematics are called platonism. More concretely, platonism is the
view that the concepts of mathematics exist independently of people, and that they are discovered, and not
created, by mathematicians.
In short, since mathematics makes use of the brain, which is a physical system, actually mathematics is
applied physics.
** We see that every physical concept is an example of a (mathematical) category, i.e. a combination of objects
and mappings. For more details about categories, with a precise definition of the term, see page 666.
*** Concepts formed unconsciously in our early youth are the most difficult to define precisely, i.e. with
662 intermezzo: the brain, l anguage and the human condition
Note that the concept ‘concept’ itself is not definable independently of experience; a
concept is something that helps us to act and react to the world in which we live. Moreover,
concepts do not live in a world separate from the physical one: every concept requires
memory from its user, since the user has to remember the way in which it was formed;
therefore every concept needs a material support for its use and application. Thus all
thinking and all science is fundamentally based on experience.
In conclusion, all concepts are based on the idea that nature is made of related parts.
This idea leads to complementing couples such as ‘noun–verb’ in linguistics, ‘set–relation’
or ‘definition–theorem’ in mathematics, and ‘aspect of nature–pattern of nature’ in phys-
ics. These couples constantly guide human thinking, from childhood onwards, as devel-
opmental psychology can testify.
Defining sets and defining relations are the two fundamental acts of our thinking. This
can be seen most clearly in any book about mathematics; such a book is usually divided
into paragraphs labelled ‘definition’, ‘theorem’, ‘lemma’ and ‘corollary’. The first type of
”
paragraph defines concepts, i.e. defines sets, and the other three types of paragraphs ex-
press relations, i.e. connections between these sets. Mathematics is thus the exploration
of the possible symbolic concepts and their relations. Mathematics is the science of sym-
bolic necessities.
Sets and relations are tools of classification; that is why they are also the tools of any
bureaucrat. (See Figure 300.) This class of humans is characterized by heavy use of pa-
per clips, files, metal closets, archives – which all define various types of sets – and by
the extensive use of numbers, such as reference numbers, customer numbers, passport
numbers, account numbers, law article numbers – which define various types of relations
between the items, i.e. between the elements of the sets.
Both the concepts of set and of relation express, in different ways, the fact that nature
can be described, i.e. that it can be classified into parts that form a whole. The act of group-
ing together aspects of experience, i.e. the act of classifying them, is expressed in formal
Copyright © Christoph Schiller November 1997–January 2007
language by saying that a set is defined. In other words, a set is a collection of elements
of our thinking. Every set distinguishes the elements from each other and from the set
itself. This definition of ‘set’ is called the naive definition. For physics, the definition is
sufficient, but you won’t find many who will admit this. In fact, mathematicians have re-
fined the definition of the concept ‘set’ several times, because the naive definition does
language. Some who were unable to define them, such as the Prussian philosopher Immanuel Kant (1724–
1804) used to call them ‘a priori’ concepts (such as ‘space’ and ‘time’) to contrast them with the more clearly
defined ‘a posteriori’ concepts. Today, this distinction has been shown to be unfounded both by the study
of child psychology (see the footnote on page 650) and by physics itself, so that these qualifiers are therefore
not used in our walk.
* Whatever we see could be other than it is. Whatever we can describe at all could be other than it is. There
is no a priori order of things.
what is l anguage? 663
a. The more common formulation (though equivalent to the above) is: The entity
g, g, g, g, g, g, g, g, ... is a set.
not work well for infinite sets. A famous example is the story about sets which do not con-
tain themselves. Obviously, any set is of two sorts: either it contains itself or it does not.
Challenge 1153 n If we take the set of all sets that do not contain themselves, to which sort does it belong?
To avoid problems with the concept of ‘set’,
mathematics requires a precise definition. The
first such definition was given by the German
mathematician Ernst Zermelo (b. 1871 Berlin,
d. 1951 Freiburg i.B.) and the German–Israeli
mathematician Adolf/Abraham Fraenkel (b. 1891
München, d. 1965 Jerusalem). Later, the so-called F I G U R E 300 Devices for the definition
axiom of choice was added, in order to make it of sets (left) and of relations (right)
possible to manipulate a wider class of infinite sets.
The result of these efforts is called the ZFC definition.* From this basic definition we
can construct all mathematical concepts used in physics. From a practical point of view,
Copyright © Christoph Schiller November 1997–January 2007
it is sufficient to keep in mind that for the whole of physics, the naive definition of a
set is equivalent to the precise ZFC definition, actually even to the simper ZF definition.
Subtleties appear only for some special types of infinite sets, but these are not used in
physics. In short, from the basic, naive set definition we can construct all concepts used
* A global overview of axiomatic set theory is given by Paul J. Cohen & Reuben Hersch, Non-
Cantorian set theory, Scientific American 217, pp. 104–116, 1967. Those were the times when Scientific Amer-
ican was a quality magazine.
Ref. 654 Other types of entities, more general than standard sets, obeying other properties, can also be defined,
and are also subject of (comparatively little) mathematical research. To find an example, see the section
Page 665 on cardinals later on. Such more general entities are called classes whenever they contain at least one set.
Challenge 1154 n Can you give an example? In the third part of our mountain ascent we will meet physical concepts that are
described neither by sets nor by classes, containing no set at all. That is were the real fun starts.
664 intermezzo: the brain, l anguage and the human condition
in physics.
Ref. 655 The naive set definition is far from boring. To satisfy two people when dividing a cake,
we follow the rule: I cut, you choose. The method has two properties: it is just, as every-
body thinks that they have the share that they deserve, and it is fully satisfying, as every-
body has the feeling that they have at least as much as the other. What rule is needed for
Challenge 1155 d three people? And for four?
Apart from defining sets, every child and every brain creates links between the dif-
ferent aspects of experience. For example, when it hears a voice, it automatically makes
the connection that a human is present. In formal language, connections of this type are
called relations. Relations connect and differentiate elements along other lines than sets:
the two form a complementing couple. Defining a set unifies many objects and at the
same time divides them into two: those belonging to the set and those that do not; defin-
ing a (binary) relation unifies elements two by two and divides them into many, namely
into the many couples it defines.
Infinity
Mathematicians soon discovered that the concept of ‘set’ is only useful if one can also
Copyright © Christoph Schiller November 1997–January 2007
call collections such as 0, 1, 2, 3..., i.e. of the number 0 and all its successors, a ‘set’. To
achieve this, one property in the Zermelo–Fraenkel list defining the term ‘set’ explicitly
specifies that this collection can be called a set. (In fact, also the axiom of replacement
states that sets may be infinite.) Infinity is thus put into mathematics and into the tools
of our thought right at the very beginning, in the definition of the term ‘set’. When de-
scribing nature, with or without mathematics, we should never forget this fact. A few
additional points about infinity should be of general knowledge to any expert on motion.
Only sets can be infinite. And sets have parts, namely their elements. When a thing or
a concept is called ‘infinite’ one can always ask and specify what its parts are: for space
the parts are the points, for time the instants, for the set of integers the integers, etc. An
what is l anguage? 665
the continuum hypothesis come out either right or wrong, as you prefer.
Another result of research into transfinites is important: for every definition of a type
of infinite cardinal, it seems to be possible to find a larger one. In everyday life, the idea
of infinity is often used to stop discussions about size: ‘My big brother is stronger than
yours.’ ‘But mine is infinitely stronger than yours!’ Mathematics has shown that questions
on size do continue afterwards: ‘The strength of my brother is the power set of that of
* Therefore, most gods, being concepts and thus sets, are either finite or, in the case where they are infinite,
they are divisible. It seems that only polytheistic world views are not disturbed by this conclusion.
** In fact, there is such a huge number of types of infinities that none of these infinities itself actually de-
scribes this number. Technically speaking, there are as many infinities as there are ordinals.
*** Many results are summarized in the excellent and delightful paperback by Rudy Rucker, Infinity and
the Mind – the Science and Philosophy of the Infinite, Bantam, Toronto, 1983.
666 intermezzo: the brain, l anguage and the human condition
Ref. 657 yours!’ Rucker reports that mathematicians conjecture that there is no possible nor any
conceivable end to these discussions.
For physicists, a simple question appears directly. Do infinite quantities exist in nature?
Or better, is it necessary to use infinite quantities to describe nature? You might want to
Challenge 1159 e clarify your own opinion on the issue. It will be settled during the rest of our adventure.
where the type of arrow – with initial bar or not – shows whether we are speaking about
sets or about elements.
We note that it is also possible to use the couple ‘set’ and ‘mapping’ to define all math-
ematical concepts; in this case a relation is defined with the help of mappings. A modern
school of mathematical thought formalized this approach by the use of (mathematical)
categories, a concept that includes both sets and mappings on an equal footing in its defin-
ition.*
To think and talk more clearly about nature, we need to define more specialized con-
cepts than sets, relations and functions, because these basic terms are too general. The
most important concepts derived from them are operations, algebraic structures and
Copyright © Christoph Schiller November 1997–January 2007
numbers.
A (binary) operation is a function that maps the Cartesian product of two copies of a
set X into itself. In other words, an operation w takes an ordered couple of arguments
Numbers
Which numbers are multiplied by six when
Challenge 1161 n
“ ”their last digit is taken away and transferred to
the front?
Numbers are the oldest mathematical concept and are found in all cultures. The notion
of number, in Greek ἀριθµός, has been changed several times. Each time the aim was to
include wider classes of objects, but always retaining the general idea that numbers are
entities that can be added, subtracted, multiplied and divided.
The modern way to write numbers, as e.g. in 12 345 679 ë 45 = 666 666 666, is essential
for science.* It can be argued that the lack of a good system for writing down and for
calculating with numbers delayed the progress of science by several centuries. (By the
way, the same can be said for the affordable mass reproduction of written texts.)
The simplest numbers, 0, 1, 2, 3, 4, ..., are usually seen as being taken directly from ex- Copyright © Christoph Schiller November 1997–January 2007
perience. However, they can also be constructed from the notions of ‘relation’ and ‘set’.
Challenge 1162 n One of the many possible ways to do this (can you find another?) is by identifying a nat-
ural number with the set of its predecessors. With the relation ‘successor of ’, abbreviated
S, this definition can be written as
0 = g , 1 = S 0 = 0 = g ,
2 = S 1 = 0, 1 = g, g and n + 1 = S n = 0, ..., n . (471)
This set, together with the binary operations ‘addition’ and ‘multiplication,’ constitutes
* However, there is no need for written numbers for doing mathematics, as shown by Marcia Ascher,
Ethnomathematics – A Multicultural View of Mathematical Ideas, Brooks/Cole, 1991.
668 intermezzo: the brain, l anguage and the human condition
the algebraic system N = (N, +, ë, 1) of the natural numbers. For all number systems
the algebraic system and the set are often sloppily designated by the same symbol. The
Page 1210 algebraic system N is a so-called semi-ring, as explained in Appendix D. (Some authors
prefer not to count the number zero as a natural number.) Natural numbers are fairly
useful.
TA B L E 54 Some large numbers
Number E x a m p l e i n n at u r e
Around us
1 number of angels that can be in one place at the same time, following
Thomas Aquinas Ref. 659
8 number of times a newspaper can be folded in alternate perpendicular dir-
ections
12 largest number of times a paper strip has been folded in the same direction
Information
51 record number of languages spoken by one person
c. 5000 words spoken on an average day by a man
c. 7000 words spoken on an average day by a woman
c. 7000 number of languages on Earth
c. 350 000 words of the English language (more than any other language, with the pos-
sible exception of German)
c. 2 000 000 number of scientists on Earth around the year 2000
3 ë 108 words spoken during a lifetime (2/3 time awake, 30 words per minute)
4 ë 109 pulses exchanged between both brain halves every second
what is l anguage? 669
Number E x a m p l e i n n at u r e
109 words heard and read during a lifetime
1017 image pixels seen in a lifetime (3 ë 109 s ë (1~15 ms) ë 2~3 (awake) ë106 (nerves
to the brain) Ref. 662
1019 bits of information processed in a lifetime (the above times 32)
c. 5 ë 1012 printed words available in (different) books around the world (c. 100 ë 106
books consisting of 50 000 words)
210 ë 37 ë 8! ë 12!
= 4.3 ë 1019 possible positions of the 3 3 3 Rubik’s Cube Ref. 663
5.8 ë 1078 possible positions of the 4 4 4 Rubik-like cube
5.6 ë 10117 possible positions of the 5 5 5 Rubik-like cube
c. 10200 possible games of chess
c. 10800 possible games of go
The system of integers Z = (..., −2, −1, 0, 1, 2, ..., +, ë, 0, 1) is the minimal ring that is an
Copyright © Christoph Schiller November 1997–January 2007
extension of the natural numbers. The system of rational numbers Q = (Q, +, ë, 0, 1) is the
minimal field that is an extension of the ring of the integers. (The terms ‘ring’ and ‘field’ are
Page 1210 explained in Appendix D.) The system of real numbers R = (R, +, ë, 0, 1, A) is the minimal
extension of the rationals that is continuous and totally ordered. (For the definition of
continuity, see page 1230 and 1211.) Equivalently, the reals are the minimal extension of
the rationals forming a complete, totally strictly-Archimedean ordered field. This is the
historical construction – or definition – of the integer, rational and real numbers from
the natural numbers. However, it is not the only one construction possible. The most
beautiful definition of all these types of numbers is the one discovered in 1969 by John
Ref. 664 Conway, and popularized by him, Donald Knuth and Martin Kruskal.
— A number is a sequence of bits. The two bits are usually called ‘up’ and ‘down’. Examples
670 intermezzo: the brain, l anguage and the human condition
.
.
.
of numbers and the way to write them are given in Figure 301.
— The empty sequence is zero.
— A finite sequence of n ups is the integer number n, and a finite sequence of n downs
is the integer −n. Finite sequences of mixed ups and downs give the dyadic rational
numbers. Examples are 1, 2, 3, −7, 19/4, 37~256, etc. They all have denominators with
a power of 2. The other rational numbers are those that end in an infinitely repeating
Copyright © Christoph Schiller November 1997–January 2007
string of ups and downs, such as the reals, the infinitesimals and simple infinite num-
bers. Longer countably infinite series give even more crazy numbers. The complete
class is called the class of surreal numbers.*
There is a second way to write surreal numbers. The first is the just mentioned se-
quence of bits. But in order to define addition and multiplication, another notation is
usually used, deduced from Figure 301. A surreal α is defined as the earliest number of
* The surreal numbers do not form a set since they contain all ordinal numbers, which themselves do not
form a set, even though they of course contain sets. In short, ordinals and surreals are classes which are
larger than sets.
what is l anguage? 671
all those between two series of earlier surreals, the left and the right series:
showing that the finite surreals are the dyadic numbers m~2n (n and m being integers).
Given two surreals α = ..., a, ...S..., A, ... with a < α < A and β = ..., b, ...S..., B, ...
with b < β < B, addition is defined recursively, using earlier, already defined numbers, as
ternions H, are presented in Appendix D. A few more elaborate number systems are also
presented there.
To conclude, in physics it is usual to call numbers the elements of any set that is a semi-
ring (e.g. N), a ring (e.g. Z) or a field (Q, R, C or H). All these concepts are defined in
Appendix D. Since numbers allow one to compare magnitudes and thus to measure, they
play a central role in the description of observations.
672 intermezzo: the brain, l anguage and the human condition
”
Challenge 1165 n
Ref. 666
Several well-known physicists have repeatedly asked why mathematics is so important.
For example, Niels Bohr is quoted as having said: ‘We do not know why the language of
”
start is built on sand. The assumption that observations in nature can be counted, and thus
that nature is separable, is an approximation. The quoted ‘incomprehensibility’ becomes
amazement at the precision of this approximation. Nevertheless, Pythagoras’ sect, which
was based on the thought that ‘everything in nature is numbers’, was wrong. Like so many
beliefs, observation will show that it was wrong.
Die Physik ist für Physiker viel zu schwer.**
“ David Hilbert
”
* The requirement that simple signs be possible is the requirement that sense be determinate.
** Physics is much too difficult for physicists.
what is l anguage? 673
Is mathematics a language?
Die Sätze der Mathematik sind Gleichungen,
Surely, mathematics is a vocabulary that helps us to talk with precision. Mathematics can
be seen as the exploration of all possible concepts that can be constructed from the two
fundamental bricks ‘set’ and ‘relation’ (or some alternative, but equivalent pair). Mathem-
”
atics is the science of symbolic necessities. Rephrased again, mathematics is the explor-
ation of all possible types of classifications. This explains its usefulness in all situations
where complex, yet precise classifications of observations are necessary, such as in phys-
ics.
However, mathematics cannot express everything that humans want to communicate,
such as wishes, ideas or feelings. Just try to express the fun of swimming using mathem-
Curiosities and fun challenges about mathematics Copyright © Christoph Schiller November 1997–January 2007
Challenge 1166 n What is the largest number that can be written with four digits of 2 and no other sign?
* The propositions of mathematics are equations, and therefore pseudo-propositions. A proposition of math-
ematics does not express a thought.
** David Hilbert (1862 Königsberg–1943 Göttingen), professor of mathematics in Göttingen, greatest math-
ematician of his time. He was a central figure to many parts of mathematics, and also played an important
role both in the birth of general relativity and of quantum theory. His textbooks are still in print. His famous
personal credo was: ‘Wir müssen wissen, wir werden wissen.’ (We must know, we will know.) His famous
Paris lecture is published e.g. in Die Hilbertschen Probleme, Akademische Verlagsgesellschaft Geest & Portig,
1983. The lecture galvanized all of mathematics. (Despite efforts and promises of similar fame, nobody in the
world had a similar overview of mathematics that allowed him or her to repeat the feat in the year 2000.) In
his last decade he suffered the persecution of the Nazi regime; the persecution eliminated Göttingen from
the list of important science universities, without recovering its place up to this day.
674 intermezzo: the brain, l anguage and the human condition
**
Pythagorean triplets are integers that obey a 2 + b 2 = c 2 . Give at least ten examples. Then
Challenge 1167 e show the following three properties: at least one number in a triplet is a multiple of 3; at
least one number in a triplet is a multiple of 4; at least one number in a triplet is a multiple
of 5.
**
The number 1~n, when written in decimal notation, has a periodic sequence of digits.
The period is at most n − 1 digits long, as for 1~7 = 0.142857 142857 1428.... Which other
Challenge 1168 d numbers 1~n have periods of length n − 1?
**
**
Leonhard Euler in his notebooks sometimes wrote down equations like
1 + 22 + 24 + 26 + 28 + ... = − .
1
(476)
3
Challenge 1172 d Can this make sense?
**
In the history of recreational mathematics, several people have found the well-known ma-
gic hexagon shown in Figure 302. The first discoverer was, in 1887, Ernst von Hasselberg.
The hexagon is called magic because all lines add up to the same number, 38. Incredibly,
physical concepts, lies and patterns of nature 675
15
14 13
9 8 10
6 4
11 5 12
1 2
18 7 16
17 19
3
Ref. 670 In contrast to mathematics, physics does aim at being a language. Through the descrip-
tion of motion it aims to express everything observed and, in particular, all examples and
possibilities of change.*** Like any language, physics consists of concepts and sentences.
”
In order to be able to express everything, it must aim to use few words for a lot of facts.****
Physicists are essentially lazy people: they try to minimize the effort in everything they do.
The concepts in use today have been optimized by the combined effort of many people to
be as practical, i.e. as powerful as possible. A concept is called powerful when it allows one
to express in a compact way a large amount of information, meaning that it can rapidly
convey a large number of details about observations.
*** All observations are about change or variation. The various types of change are studied by the various
sciences; they are usually grouped in the three categories of human sciences, formal sciences and natural
sciences. Among the latter, the oldest are astronomy and metallurgy. Then, with the increase of curiosity
in early antiquity, came the natural science concerned with the topic of motion: physics. In the course of
our walk it will become clear that this seemingly restrictive definition indeed covers the whole set of topics
studied in physics. In particular it includes the more common definition of physics as the study of matter,
its properties, its components and their interactions.
**** A particular, specific observation, i.e. a specific example of input shared by others, is called a fact, or
in other contexts, an event. A striking and regularly observed fact is called a phenomenon, and a general
observation made in many different situations is called a (physical) principle. (Often, when a concept is
introduced that is used with other meaning in other fields, in this walk it is preceded by the qualifier ‘physical’
or ‘mathematical’ in parentheses.) Actions performed towards the aim of collecting observations are called
experiments. The concept of experiment became established in the sixteenth century; in the evolution of a
child, it can best be compared to that activity that has the same aim of collecting experiences: play.
676 intermezzo: the brain, l anguage and the human condition
General statements about many examples of motion are called rules or patterns. In the
past, it was often said that ‘laws govern nature’, using an old and inappropriate ideology.
A physical ‘law’ is only a way of saying as much as possible with as few words as possible.
When saying ‘laws govern nature’ we actually mean to say ‘being lazy, we describe obser-
vations with patterns’. Laws are the epitome of laziness. Formulating laws is pure sloth. In
fact, the correct expression is patterns describe nature.
Physicists have defined the laziness necessary for their field in much detail. In order
to become a master of laziness, we need to distinguish lazy patterns from those which are
not, such as lies, beliefs, statements that are not about observations, and statements that
Page 679 are not about motion. We do this below.
The principle of extreme laziness is the origin, among others, of the use of numbers in
physics. Observables are often best described with the help of numbers, because numbers
allow easy and precise communication and classification. Length, velocity, angles, tem-
perature, voltage or field strength are of this type. The notion of ‘number’, used in every
The title question is often rephrased as: are physical concepts free of beliefs, taste or per- ”
sonal choices? The question has been discussed so much that it even appears in Holly-
wood films. We give a short summary that can help you to distinguish honest from dis-
honest teachers.
Creation of concepts, in contrast to their discovery, would imply free choice between
Copyright © Christoph Schiller November 1997–January 2007
many alternative possibilities. The chosen alternative would then be due to the beliefs or
tastes used. In physics (in obvious contrast to other, more ideological fields of enquiry),
we know that different physical descriptions of observations are either equivalent or, in
the opposite case, imprecise or even wrong. A description of observations is thus essen-
tially unique: any choices of concepts are only apparent. There is no real freedom in the
definition of physical concepts. In this property, physics is in strong contrast to artistic
activity.
* ‘It is better that people do not know how laws and sausages are made. Otherwise they would not sleep well
at night.’ Otto von Bismarck (1815–1898), Prussian chancellor.
** A logical picture of facts is a thought.
physical concepts, lies and patterns of nature 677
If two different concepts can be used to describe the same aspect of observations, they
must be equivalent, even if the relation that leads to the equivalence is not immediately
clear. In fact, the requirement that people with different standpoints and observing the
same event deduce equivalent descriptions lies at the very basis of physics. It expresses
the requirement that observations are observer independent. In short, the strong require-
ment of viewpoint independence makes the free choice of concepts a logical impossibility.
The conclusion that concepts describing observations are discovered rather than cre-
ated is also reached independently in the field of linguistics by the above-mentioned re-
search on semantic primitives,* in the field of psychology by the observations on the
formation of the concepts in the development of young children, and in the field of eth-
ology by the observations of animal development, especially in the case of mammals. In
all three fields detailed observations have been made of how the interactions between an
individual and its environment lead to concepts, of which the most basic ones, such as
space, time, object or interaction, are common across the sexes, cultures, races and across
gies – not meant to be taken too seriously – physical concepts are not created; they are
discovered. If a teacher tells you the opposite, he is lying.
Having handled the case of physical concepts, let us now turn to physical statements.
The situation is somewhat similar: physical statements must be lazy, arrogant and boring.
Let us see why.
Wo der Glaube anfängt, hört die Wissenschaft
“ auf.**
Ernst Haeckel, Natürliche Schöpfungsgeschichte,
1879.
Ref. 653 particular that they are deduced from the fundamentals of human experience, and not invented.
** Where belief starts, science ends.
”
* Anna Wierzbicka concludes that her research clearly indicates that semantic primitives are discovered, in
678 intermezzo: the brain, l anguage and the human condition
Progressing through the study of motion reflects a young child’s attitude towards life. The
progress follows the simple programme on the left of Table 55.
Adult scientists do not have much more to add, except the more fashionable terms
”
on the right, plus several specialized professions to make money from them. The experts
of step 7 are variously called lobbyists or fund raisers; instead of calling this program
‘curiosity’, they call it the ‘scientific method.’ They mostly talk. Physics being the talk about
motion,** and motion being a vast topic, many people specialize in this step.
The experts of step 6 are called experimental physicists or simply experimentalists, a
Copyright © Christoph Schiller November 1997–January 2007
term derived from the Latin ‘experiri’, meaning ‘to try out’. Most of them are part of the
category ‘graduate students’. The experts of steps 5 and 4 are called theoretical physicists
* ‘Grey, dear friend, is all theory, and green the golden tree of life.’ Johann Wolfgang von Goethe (1749–1832),
the most influential German poet.
** Several sciences have the term ‘talk’ as part of their name, namely all those whose name finishes in ‘-logy’,
such as e.g. biology. The ending stems from ancient Greek and is deduced from λήγηιν meaning ‘to say, to
talk’. Physics as the science of motion could thus be called ‘kinesiology’ from κίνησις, meaning ‘motion’; but
for historical reasons this term has a different meaning, namely the study of human muscular activity. The
term ‘physics’ is either derived from the Greek φύσικη (τέχνη is understood) meaning ‘(the art of) nature’,
or from the title of Aristotle’ works τά φυσικά meaning ‘natural things’. Both expressions are derived from
φύσις, meaning ‘nature’.
physical concepts, lies and patterns of nature 679
or simply theoreticians.* This is a rather modern term; for example, the first professors of
theoretical physics were appointed around the start of the twentieth century. The term is
derived from the Greek θεωρία meaning ‘observation, contemplation’. Finally, there are
the people who focus on steps 1 to 3, and who induce others to work on steps 4 to 6; they
are called geniuses.
Obviously an important point is hidden in step 6: how do all these people know
whether their checks fail? How do they recognize truth?
All professions are conspiracies against laymen.
What is a lie?
“ George Bernard Shaw
”
Get your facts straight, and then you can distort
“
Lies are useful statements, as everybody learns during their youth. One reason that they
Bert Hellinger
”
are useful is because we can draw any imaginable conclusion from them. A well-known
discussion between two Cambridge professors early in the twentieth century makes the
point. McTaggart asked: ‘If 2 + 2 = 5, how can you prove that I am the pope?’ Godfrey
Hardy: ‘If 2+2 = 5, then 4 = 5; subtract 3; then 1 = 2; but McTaggart and the pope are two;
therefore McTaggart and the pope are one.’ As noted long ago, ex falso quodlibet. From
what is wrong, anything imaginable can be deduced. It is true that in our mountain ascent
we need to build on previously deduced results and that our trip could not be completed
if we had a false statement somewhere in our chain of arguments. But lying is such an
important activity that one should learn to perform it well.
There are various stages in the art of lying. Many animals have been shown to de-
Ref. 635 ceive their kin. Children start lying just before their third birthday, by hiding experiences.
Adults cheat on taxes. And many intellectuals or politicians even claim that truth does
not exist.
However, in most countries, everybody must know what ‘truth’ is, since in a law court
for example, telling an untruth can lead to a prison sentence. The courts are full of experts
in lie detection. If you lie in court, you better do it well; experience shows that you might
Copyright © Christoph Schiller November 1997–January 2007
get away with many criminal activities. In court, a lie is a statement that knowingly con-
trasts with observations.** The truth of a statement is thus checked by observation. The
check itself is sometimes called the proof of the statement. For law courts, as for physics,
truth is thus the correspondence with facts, and facts are shared observations. A ‘good’ lie
is thus a lie whose contrast with shared observations is hard to discover.
* If you like theoretical physics, have a look at the refreshingly candid web page by Nobel Prize winner
Gerard ‘t Hooft with the title How to become a good theoretical physicist. It can be found at http://www.phys.
uu.nl/~thooft/theorist.html.
** Statements not yet checked are variously called speculations, conjectures, hypotheses, or – wrongly – simply
theses. Statements that are in correspondence with observations are called correct or true; statements that
contrast with observations are called wrong or false.
680 intermezzo: the brain, l anguage and the human condition
The first way of lying is to put an emphasis on the sharedness only. Populists and po-
lemics do this regularly. (‘Every foreigner is a danger for the values of our country.’) Since
almost any imaginable opinion, however weird, is held by some group – and thus shared
– one can always claim it as true.* Unfortunately, it is no secret that ideas also get shared
because they are fashionable, imposed or opposed to somebody who is generally disliked.
Often a sibling in a family has this role – remember Cassandra.** For a good lie we thus
need more than sharedness, more than intersubjectivity alone.
A good lie should be, like a true statement, really independent of the listener and the
observer and, in particular, independent of their age, their sex, their education, their civil-
ization or the group to which they belong. For example, it is especially hard – but not im-
possible – to lie with mathematics. The reason is that the basic concepts of mathematics,
be they ‘set’, ‘relation’ or ‘number’, are taken from observation and are intersubjective, so
that statements about them are easily checked. Usually, lies thus avoid mathematics.***
Secondly, a ‘good’ lie should avoid statements about observations and use interpreta-
usually unsuccessful. If somebody insists on doing so, the lie becomes a superstition, a
belief, a prejudice or a doctrine. These are the low points in the art of lying. A famous case
* The work of the French sociologist Gabiel Tarde (1843–1903), especially his concepts of imitation and
group mind, already connects to this fact.
** The implications of birth order on creativity in science and on acceptance of new ideas has been studied in
the fascinating book by Frank J. Sulloway, Born to Rebel – Birth Order, Family Dynamics and Creative
Lives, Panthon Books, 1996. This exceptional book tells the result of a life-long study correlating the personal
situations in the families of thousands of people and their receptivity to about twenty revolutions in the
recent history. The book also includes a test in which the reader can deduce their own propensity to rebel,
on a scale from 0 to 100 %. Darwin scores 96 % on this scale.
*** In mathematics, ‘true’ is usually specified as ‘deducible’ or ‘provable’; this is in fact a special case of the
usual definition of truth, namely ‘correspondence with facts’, if one remembers that mathematics studies the
properties of classifications.
physical concepts, lies and patterns of nature 681
of insistence on a lie is that of the colleagues of Galileo, who are said to have refused to
look through his telescope to be convinced that Jupiter has moons, an observation that
would have shaken their belief that everything turns around the Earth. Obviously these
astronomers were amateurs in the art of lying. A good universal lie is one whose counter-
example is not so easily spotted.
There should be no insistence on lies in physics. Unfortunately, classical physics is full
of lies. We will dispel them during the rest of our walk.
Lying by giving specific instead of universal statements is much easier. (‘I can’t remem-
ber.’) Even a specific statement such as ‘yesterday the Moon was green, cubic and smelled
of cheese’ can never be completely falsified: there is no way to show with absolute cer-
tainty that this is wrong. The only thing that we can do is to check whether the statement
is compatible with other observations, such as whether the different shape affected the
tides as expected, whether the smell can be found in air collected that day, etc. A good
specific lie is thus not in contrast with other observations.*
This is what makes them so popular. Children learn specific lies first. (‘I haven’t eaten the
* It is often difficult or tedious to verify statements concerning the past, and the difficulty increases with
the distance in time. That is why people can insist on the occurrence of events which are supposed to be
exceptions to the patterns of nature (‘miracles’). Since the advent of rapid means of communication these
checks are becoming increasingly easy, and no miracles are left over. This can be seen in Lourdes in France,
where even though today the number of visitors is much higher than in the past, no miracles have been seen
in decades.
In fact, all modern ‘miracles’ are kept alive only by consciously eschewing checks, such as the supposed
yearly liquefaction of blood in Napoli, the milk supposedly drunk by statues, the supposed healers in tele-
Ref. 672 vision evangelism, etc. Most miracles only remain because many organizations make money out of the dif-
ficulty of falsifying specific statements. For example, when the British princess Diana died in a car crash
in 1997, even though the events were investigated in extreme detail, the scandal press could go on almost
without end about the ‘mysteries’ of the accident.
682 intermezzo: the brain, l anguage and the human condition
jam.’) General lies, like all general statements, can always be corroborated by examples.
This is the reason for the success of ideologies. But the criteria for recognizing lies, even
general lies, have become so commonplace that beliefs and lies try to keep up with them. It
became fashionable to use expressions such as ‘scientific fact’ – there are no non-scientific
facts –, or ‘scientifically proven’ – observations cannot be proven otherwise – and similar
empty phrases. These are not ‘good’ lies; whenever we encounter sentences beginning
with ‘science says ...’ or ‘science and religion do ...’, replacing ‘science’ by ‘knowledge’ or
‘experience’ is an efficient way of checking whether such statements are to be taken seri-
ously or not.*
Lies differ from true statements in their emotional aspect. Specific statements are usu-
ally boring and fragile, whereas specific lies are often sensational and violent. In contrast,
general statements are often daring and fragile whereas general lies are usually boring
and violent. The truth is fragile. True statements require the author to stick his neck out
to criticism. Researchers know that if one doesn’t stick the neck out, it can’t be an observa-
* To clarify the vocabulary usage of this text: religion is spirituality plus a varying degree of power abuse. The
mixture depends on each person’s history, background and environment. Spirituality is the open participa-
tion in the whole of nature. Most, maybe all, people with a passion for physics are spiritual. Most are not
religious.
** In other words, a set of not yet falsified patterns of observations on the same topic is called a (physical)
theory. The term ‘theory’ will always be used in this sense in this walk, i.e. with the meaning ‘set of correct
general statements’. This use results from its Greek origin: ‘theoria’ means ‘observation’; its original meaning,
‘passionate and emphatic contemplation’, summarizes the whole of physics in a single word. (‘Theory’, like
‘theatre’, is formed from the root θέ, meaning ‘the act of contemplating’.) Sometimes, however, the term
‘theory’ is used – being confused with ‘hypothesis’ – with the meaning of ‘conjecture’, as in ‘your theory
is wrong’, sometimes with the meaning of ‘model’, as in ‘Chern–Simons’ theory and sometimes with the
meaning of ‘standard procedure’, as in ‘perturbation theory’. These incorrect uses are avoided here. To bring
the issue to a point: the theory of evolution is not a conjecture, but a set of correct statements based on
observation.
physical concepts, lies and patterns of nature 683
An important aspect of any ‘good’ lie is to make as few public statements as possible, so
that critics can check as little as possible. (For anybody sending corrections of mistakes in
this text, the author provides a small reward.) To detect lies, public scrutiny is important,
though not always reliable. Sometimes, even scientists make statements which are not
based on observations. However, a ‘good’ lie is always well prepared and told on purpose;
accidental lies are frowned upon by experts. Examples of good lies in science are ‘aether’,
‘UFOs’, ‘creation science’, or ‘cold fusion’. Sometimes it took many decades to detect the
lies in these domains.
To sum up, the central point of the art of lying without being caught is simple: do
not divulge details. Be vague. All the methods used to verify a statement ask for details,
for precision. For any statement, its degree of precision allows one to gauge the degree
to which the author is sticking his neck out. The more precision that is demanded, the
weaker a statement becomes, and the more likely a fault will be found, if there is one. This
is the main reason that we chose an increase in precision as a guide for our mountain
“ Democritus
”
Truth is a rhetorical concept.
“ Paul Feyerabend
Not all statements can be categorized as true or false. Statements can simply make no ”
sense. There are even such statements in mathematics, where they are called undecid-
Copyright © Christoph Schiller November 1997–January 2007
computer science, have captured a large public.* Similarly undecidable statements can be
Ref. 675 constructed with terms such as ‘calculable’, ‘provable’ and ‘deducible’.
In fact, self-referential statements are undecidable because they are meaningless. If the
usual definition of ‘true’, namely corresponding to facts, is substituted into the sentence
‘This statement is not true’, we quickly see that it has no meaningful content. The most
famous meaningless sentence of them all was constructed by the linguist Noam Chom-
sky:
Colorless green ideas sleep furiously.
Ref. 641 It is often used as an example for the language processing properties of the brain, but
nobody sensible elevates it to the status of a paradox and writes philosophical discussions
about it. To do that with the title of this section is a similar waste of energy.
The main reason for the popular success of self-reference is the difficulty in perceiving
the lack of meaning.** A good example is the statement:
”
* A general introduction is given in the beautiful books by R aymond Smullyan: Satan, Cantor and
Copyright © Christoph Schiller November 1997–January 2007
Infinity and Other Mind-boggling Puzzles, Knopf, 1992; What is the Name of This Book? The Riddle of Dracula
and Other Logical Puzzles, Touchstone, 1986, and The Lady or the Tiger? And Other Puzzles, Times Books,
1982. Also definitions can have no content, such as David Hilbert’s ‘smallest number that has not been
mentioned this century’ or ‘the smallest sequence of numbers that is described by more signs than this
sentence’.
** A well-known victim of this difficulty is Paulus of Tarsus. The paradox of the Cretan poet Epimenedes
(6th century bce) who said ‘All Cretans lie’ is too difficult for the notoriously humour-impaired Paulus,
who in his letter to Titus (chapter 1, verses 12 and 13, in the christian bible) calls Epimenedes a ‘prophet’,
Ref. 676 adds some racist comments, and states that this ‘testimony’ is true. But wait; there is a final twist to this story.
The statement ‘All Cretans lie’ is not a paradox at all; a truth value can actually be ascribed to it, because the
Challenge 1173 n statement is not really self-referential. Can you confirm this? The only genuine paradox is ‘I am lying’, to
which it is indeed impossible to ascribe a truth value.
Challenge 1175 ny *** Why are circular statements, like those of Galilean physics, not self-referential?
**** It is quite impossible for a proposition to state that it itself is true.
physical concepts, lies and patterns of nature 685
**
A famous provocation: the world has been created last Saturday. Can you decide whether
Challenge 1180 ny this is wrong?
**
Hundreds of hoaxes are found on the http://www.museumofhoaxes.com website. It gives
an excellent introduction into the art of lying; of course it exposes only those who have
been caught. Enjoy the science stories, especially those about archaeology. (Several other
sites with similar content can be found on the internet.)
**
Copyright © Christoph Schiller November 1997–January 2007
In the 1990s, many so-called ‘healers’ in the Philippines made millions by suggesting
patients that they were able to extract objects from their bodies without operating. Why
Challenge 1181 e is this not possible? (For more information on health lies, see the http://www.quackwatch.
com website.)
**
Since the 1980s, people have claimed that it is possible to acquire knowledge simply from
somebody 1000 km away, without any communication between the two people. However,
Challenge 1182 e the assumed ‘morphogenetic fields’ cannot exist. Why not?
**
686 intermezzo: the brain, l anguage and the human condition
It is claimed that a Fire Brigade building in a city in the US hosts a light bulb that has
been burning without interruption since 1901 (at least it was so in 2005). Can this be
Challenge 1183 n true? Hundreds of such stories, often called ‘urban legends,’ can be found on the http://
www.snopes.com website. However, some of the stories are not urban legends, but true,
as the site shows.
**
‘This statement has been translated from French into English’. Is the statement true, false
or neither?
**
Aeroplanes have no row 13. Many tall hotels have no floor 13. What is the lie behind this
Challenge 1184 ny habit? What is the truth behind it?
Observations
Knowledge is a sophisticated statement of
“ ignorance.
Attributed to Karl Popper
The collection of a large number of true statements about a type of observations, i.e. of ”
a large number of facts, is called knowledge. Where the domain of observations is suffi-
Ref. 677 ciently extended, one speaks of a science. A scientist is thus somebody who collects know-
ledge.* We found above that an observation is classified input into the memory of several
people. Since there is motion all around, to describe all these observations is a mammoth
task. As for every large task, to a large extent the use of appropriate tools determines the
degree of success that can be achieved. These tools, in physics and in all other sciences,
fall in three groups: tools for the collection of observations, tools to communicate obser-
vations and tools to communicate relations between observations. The latter group has
been already discussed in the section on language and on mathematics. We just touch on
Copyright © Christoph Schiller November 1997–January 2007
* The term ‘scientist’ is a misnomer peculiar to the English language. Properly speaking, a ‘scientist’ is a
follower of scientism, an extremist philosophical school that tried to resolve all problems through science.
For this reason, some religious sects have the term in their name. Since the English language did not have a
shorter term to designate ‘scientific persons’, as they used to be called, the term ‘scientist’ started to appear in
the United States, from the eighteenth century onwards. Nowadays the term is used in all English-speaking
countries – but not outside them, fortunately.
observations 687
so frequent that scepticism has become a common response. Although in many domains
of science, including physics, discoveries are still expected, on the fundamentals of mo-
tion the arguments just presented seem to show that new observations are only a remote
possibility. The task of collecting observations on the foundations of motion (though not
on other topics of physics) seems to be complete. Indeed, most observations described
* Julian Seymour Schwinger (1918–1994), US-American infant prodigy. He was famous for his clear think-
ing and his excellent lectures. He worked on waveguides and synchroton radiation, made contributions to
nuclear physics and developed quantum electrodynamics. For the latter he received the 1965 Nobel Prize
in physics together with Tomonaga and Feynman. He was a thesis advisor to many famous physicists and
Ref. 678 wrote several excellent and influential textbooks. Nevertheless, at the end of his life, he became strangely
interested in a hoax turned sour: cold fusion.
688 intermezzo: the brain, l anguage and the human condition
here were obtained before the end of the twentieth century. We are not too early with our
walk.
Measure what is measurable; make measurable
Ref. 682
“ what is not.
Wrongly attributed to Galileo.
”
Are all physical observables known?
Scientists have odious manners, except when
The most practical way to communicate observations was developed a long time ago:
by measurements. A measurement allows effective communication of an observation to
other times and places. This is not always as trivial as it sounds; for example, in the Middle
”
the state of an object are also called variables. Variables on which other observables de-
pend are often called parameters. (Remember: a parameter is a variable constant.) For
example, the speed of light is a constant, the position a variable, the temperature is often
a parameter, on which the length of an object, for example, can depend. Note that not all
observables are quantities; in particular, parities are not multiples of any unit.
* All mathematical symbols used in this walk, together with the alphabets from which they are taken, are
listed in Appendix A on notation. They follow international standards whenever they are defined. The stand-
ard symbols of the physical quantities, as defined by the International Standards Organization (ISO), the
International Union of Pure and Applied Physics (IUPAP) and the International Union of Pure and Applied
Chemistry (IUPAC), can be found for example in the bible, i.e. the CRC Handbook of Chemistry and Physics,
CRC Press, Boca Raton, 1992.
observations 689
Today the task of defining tools for the communication of observations can be con-
sidered complete. (For quantities, this is surely correct; for parity-type observables there
could be a few examples to be discovered.) This is a simple and strong statement. Even
the BIPM, the Bureau International des Poids et Mesures, has stopped adding new units.*
As a note, the greatness of a physicist can be ranked by the number of observables
he has introduced. Even a great scientist such as Einstein, who discovered many ‘laws’ of
nature, only introduced one new observable, namely the metric tensor for the description
of gravity. Following this criterion – as well as several others – Maxwell is the most import-
ant physicist, having introduced electric and magnetic fields, the vector potential, and
several other material dependent observables. For Heisenberg, Dirac and Schrödinger,
the wave function describing electron motion could be counted as half an observable (as
it is a quantity necessary to calculate measurement results, but not itself an observable).
Incidentally, even the introduction of any term that is taken up by others is a rare event;
‘gas’, ‘entropy’ and only a few others are such examples. It has always been much more
process applied to a support. The necessary irreversible interaction process is often called
writing the record. Obviously, writing takes a certain amount of time; zero interaction
time would give no record at all. Therefore any recording device, including our brain,
always records some time average of the observation, however short it may be.
What we call a fixed image, be it a mental image or a photograph, is always the time
average of a moving situation. Without time averaging, we would have no fixed memories.
* The last, the katal or mol~s, was introduced in 1999. Physical units are presented in Appendix B.
** Is it possible to talk about observations at all? It is many a philosopher’s hobby to discuss whether there
actually is an example for an ‘Elementarsatz’ mentioned by Wittgenstein in his Tractatus. There seems to
be at least one that fits: Differences exist. It is a simple sentence; in the third part of our walk, it will play a
central role.
690 intermezzo: the brain, l anguage and the human condition
On the other hand, any time averaging introduces a blur that hides certain details; and in
our quest for precision, at a certain moment, these details are bound to become important.
The discovery of these details will begin in the second part of the walk, the one centred
on quantum theory. In the third part of our mountain ascent we will discover that there
is a shortest possible averaging time. Observations of that short duration show so many
details that even the distinction between particles and empty space is lost. In contrast,
our concepts of everyday life appear only after relatively long time averages. The search
for an average-free description of nature is one of the big challenges of our adventure.
“ denkbar.*
Ludwig Wittgenstein, Tractatus, 6.361
There is a tradition of opposition between
”
“
Induction is the usual term used for the act of making, from a small and finite number
of experiments, general conclusions about the outcome of all possible experiments per-
formed in other places, or at other times. In a sense, it is the technical term for sticking
”
out one’s neck, which is necessary in every scientific statement. Induction has been a ma-
jor topic of discussion for science commentators. Frequently one finds the remark that
knowledge in general, and physics in particular, relies on induction for its statements. Ac-
cording to some, induction is a type of hidden belief that underlies all sciences but at the
same time contrasts with them.
To avoid wasting energy, we make only a few remarks. The first can be deduced from
a simple experiment. Try to convince a critic of induction to put their hand into a fire.
Nobody who honestly calls induction a belief should conclude from a few unfortunate
experiences in the past that such an act would also be dangerous in the future... In short,
somehow induction works.
A second point is that physical universal statements are always openly stated; they are
never hidden. The refusal to put one’s hand into a fire is a consequence of the invariance
of observations under time and space translations. Indeed, general statements of this type
Copyright © Christoph Schiller November 1997–January 2007
form the very basis of physics. However, no physical statement is a belief only because
it is universal; it always remains open to experimental checks. Physical induction is not
a hidden method of argumentation, it is an explicit part of experimental statements. In
fact, the complete list of ‘inductive’ statements used in physics is given in the table on page
202. These statements are so important that they have been given a special name: they are
called symmetries. The table lists all known symmetries of nature; in other words, it lists
all inductive statements used in physics.
Perhaps the best argument for the use of induction is that there is no way to avoid it
when one is thinking. There is no way to think, to talk or to remember without using con-
cepts, i.e. without assuming that most objects or entities have the same properties over
* Only connexions that are subject to law are thinkable.
the quest for precision and its implications 691
time. There is also no way to communicate with others without assuming that the obser-
vations made from the other’s viewpoint are similar to one’s own. There is no way to think
without symmetry and induction. Indeed, the concepts related to symmetry and induc-
Page 659 tion, such as space and time, belong to the fundamental concepts of language. The only
Ref. 677 sentences which do not use induction, the sentences of logic, do not have any content
Challenge 1185 n (Tractatus, 6.11). Indeed, without induction, we cannot classify observations at all! Evolu-
tion has given us memory and a brain because induction works. To criticize induction is
not to criticize natural sciences, it is to criticize the use of thought in general. We should
never take too seriously people who themselves do what they criticize in others; sporadic-
ally pointing out the ridicule of this endeavour is just the right amount of attention they
deserve.
The topic could be concluded here, were it not for some interesting developments in
modern physics that put two additional nails in the coffin of arguments against induction.
First, in physics whenever we make statements about all experiments, all times or all
“
Copyright © Christoph Schiller November 1997–January 2007
To talk well about motion means to talk precisely. Precision requires avoiding ”
hree common mistakes in the description of nature.
First, concepts should never have a contradiction built into their definition. For ex-
ample, any phenomenon occurring in nature evidently is a ‘natural’ phenomenon; there-
fore, to talk about either ‘supernatural’ phenomena or ‘unnatural’ phenomena is a mis-
take that nobody interested in motion should let go unchallenged; such terms contain
a logical contradiction. Naturally, all observations are natural. Incidentally, there is a re-
Ref. 683 ward of more than a million dollars for anybody proving the opposite. In over twenty
years, nobody has yet been able to collect it.
Second, concepts should not have unclear or constantly changing definitions. Their
content and their limits must be kept constant and explicit. The opposite of this is often
Ref. 684 encountered in crackpots or populist politicians; it distinguishes them from more reliable
thinkers. Physicists can also fall into the trap; for example, there is, of course, only one
single (physical) universe, as even the name says. To talk about more than one universe
is an increasingly frequent error.
Third, concepts should not be used outside their domain of application. It is easy to suc-
“ unimaginative.
Oscar Wilde
”
What are interactions? – No emergence
The whole is always more than the sum of its
“ parts.
Aristotle, Metaphysica, 10f–1045a.
In the physical description of nature, the whole is always more than the sum of its parts. ”
Actually, the difference between the whole and the sum of its parts is so important that Copyright © Christoph Schiller November 1997–January 2007
it has a special name: the interaction between the parts. For example, the energy of the
whole minus the sum of the energies of its parts is called the energy of interaction. In fact,
the study of interactions is the main topic of physics. In other words, physics is concerned
primarily with the difference between the parts and the whole, contrary to what is often
suggested by bad journalists or other sloppy thinkers.
Note that the term ‘interaction’ is based on the general observation that anything that
affects anything else is, in turn, affected by it; interactions are reciprocal. For example, if
one body changes the momentum of another, then the second changes the momentum
of the first by the same (negative) amount. The reciprocity of interactions is a result of
conservation ‘laws’. The reciprocity is also the reason that somebody who uses the term
‘interaction’ is considered a heretic by monotheistic religions, as theologians regularly
point out. They repeatedly stress that such a reciprocity implicitly denies the immutability
the quest for precision and its implications 693
What is existence?
You know what I like most?
Ref. 686
“ Rhetorical questions.
”
Assume a friend tells you ‘I have seen a grampus today!’ You would naturally ask what it
looks like. What answer do we expect? We expect something like ‘It’s an animal with a
certain number of heads similar to a X, attached to a body like a Y, with wings like a Z, it
make noises like a U and it felt like a V ’ – the letters denoting some other animal or object.
Generally speaking, in the case of an object, this scene from Darwin’s voyage to South
America shows that in order to talk to each other, we first need certain basic, common
concepts (‘animal’, ‘head’, ‘wing’, etc.). In addition, for the definition of a new entity we
Copyright © Christoph Schiller November 1997–January 2007
need a characterization of its parts (‘size’, ‘colour’), of the way these parts relate to each
other, and of the way that the whole interacts with the outside world (‘feel’, ‘sound’). In
other words, for an object to exist, we must be able to give a list of relations with the
outside world. An object exists if we can interact with it. (Is observation sufficient to
Challenge 1188 n determine existence?)
For an abstract concept, such as ‘time’ or ‘superstring’, the definition of existence has to
be refined only marginally: (physical) existence is the effectiveness to describe interactions
accurately. This definition applies to trees, time, virtual particles, imaginary numbers,
entropy and so on. It is thus pointless to discuss whether a physical concept ‘exists’ or
whether it is ‘only’ an abstraction used as a tool for descriptions of observations. The two
possibilities coincide. The point of dispute can only be whether the description provided
694 intermezzo: the brain, l anguage and the human condition
Do things exist?
Wer Wissenschaft und Kunst besitzt,
Using the above definition of existence, the question becomes either trivial or imprecise.
It is trivial in the sense that things necessarily exist if they describe observations, since
they were defined that way. But perhaps the questioner meant to ask: Does reality exist
”
independently of the observer?
Using the above, this question can be rephrased: ‘Do the things we observe exist in-
dependently of observation?’ After thousands of years of extensive discussion by profes-
sional philosophers, logicians, sophists and amateurs the answer is the same: it is ‘Yes’,
Copyright © Christoph Schiller November 1997–January 2007
because the world did not change after great-grandmother died. The disappearance of
observers does not seem to change the universe. These experimental findings can be cor-
roborated by inserting the definition of ‘existence’ into the question, which then becomes:
‘Do the things we observe interact with other aspects of nature when they do not interact
with people?’ The answer is evident. Recent popular books on quantum mechanics fan-
tasize about the importance of the ‘mind’ of observers – whatever this term may mean;
they provide pretty examples of authors who see themselves as irreplaceable, seemingly
having lost the ability to see themselves as part of a larger entity.
Of course there are other opinions about the existence of things. The most famous is
that of the Irishman George Berkeley (1685–1753) who rightly understood that thoughts
* He who possesses science and art, also has religion; he who does not possess the two, better have religion.
the quest for precision and its implications 695
based on observation alone, if spread, would undermine the basis of the religious organ-
ization of which he was one of the top managers. To counteract this tendency, in 1710
he published A Treatise Concerning the Principles of Human Knowledge, a book denying
the existence of the material world. This reactionary book became widely known in like-
minded circles (it was a time when few books were written) even though it is based on
a fundamentally flawed idea: it assumes that the concept of ‘existence’ and that of ‘world’
Challenge 1191 e can be defined independently. (You may be curious to try the feat.)
Berkeley had two aims when he wrote his book. First, he tried to deny the capacity of
people to arrive at judgements on nature or on any other matter from their own experience.
Second, he also tried to deny the ontological reach of science, i.e. the conclusions one can
draw from experience on the questions about human existence. Even though Berkeley is
generally despised nowadays, he actually achieved his main aim: he was the originator of
the statement that science and religion do not contradict, but complement each other. By
religion, Berkeley did not mean either morality or spirituality; every scientists is a friend
past, people have been killed for giving an answer that was unacceptable to authorities.
It is not obvious, but it is nevertheless important, that the modern physical concepts
of ‘vacuum’ and ‘empty space’ are not the same as the philosophical concept of ‘void’.
‘Vacuum’ is not defined as ‘non-existence’; on the contrary, it is defined as the absence
of matter and radiation. Vacuum is an entity with specific observable properties, such
as its number of dimensions, its electromagnetic constants, its curvature, its vanishing
mass, its interaction with matter through curvature and through its influence on decay,
etc. (A table of the properties of a physical vacuum is given on page 595.) Historically, it
took a long time to clarify the distinction between a physical vacuum and a philosophical
void. People confused the two concepts and debated the existence of the vacuum for more
than two thousand years. The first to state that it existed, with the courage to try to look
696 intermezzo: the brain, l anguage and the human condition
through the logical contradiction at the underlying physical reality, were Leucippus and
Democritus, the most daring thinkers of antiquity. Their speculations in turn elicited the
reactionary response of Aristotle, who rejected the concept of vacuum. Aristotle and his
disciples propagated the belief about nature’s horror of the vacuum.
The discussion changed completely in the seventeenth century, when the first experi-
mental method to realize a vacuum was devised by Torricelli.* Using mercury in a glass
Challenge 1192 n tube, he produced the first laboratory vacuum. Can you guess how? Arguments against
the existence of the vacuum again appeared around 1900, when it was argued that light
needed ‘aether’ for its propagation, using almost the same arguments that had been used
two hundred years earlier, but in different words. However, experiments failed to detect
any of the supposed properties of this unclearly defined concept. Experiments in the field
of general relativity showed that a vacuum can move – though in a completely different
way from the way in which the aether was expected to move – that the vacuum can be
bent, but it then tends to return to its shape. Then, in the late twentieth century, quantum
Is nature infinite?
It is certain and evident to our senses, that in
Theologiae, I, q. 2.
Most of the modern discussions about set theory centre on ways to defining the term ‘set’
for various types of infinite collections. For the description of motion this leads to two
questions: Is the universe infinite? Is it a set? We begin with the first one. Illuminating
”
the question from various viewpoints, we will quickly discover that it is both simple and
imprecise.
Do we need infinite quantities to describe nature? Certainly, in classical and quantum
physics we do, e.g. in the case of space-time. Is this necessary? We can say already a few
* Evangelista Torricelli (b. 1608 Faenza, d. 1647 Florence), Italian physicist, pupil and successor to Galileo.
The (non-SI) pressure unit ‘torr’ is named after him.
the quest for precision and its implications 697
things.
Any set can be finite in one aspect and infinite in another. For example, it is possible
to proceed along a finite mathematical distance in an infinite amount of time. It is also
possible to travel along any distance whatsoever in a given amount of mathematical time,
making infinite speed an option, even if relativity is taken into account, as was explained
Page 305 earlier.
Despite the use of infinities, scientists are still limited. We saw above that many types of
Page 664 infinities exist. However, no infinity larger than the cardinality of the real numbers plays
a role in physics. No space of functions or phase space in classical physics and no Hilbert
Ref. 673 space in quantum theory has higher cardinality. Despite the ability of mathematicians to
define much larger kinds of infinities, the description of nature does not need them. Even
the most elaborate descriptions of motion use only the infinity of the real numbers.
But is it possible at all to say of nature or of one of its aspects that it is indeed infin-
Challenge 1193 n ite? Can such a statement be compatible with observations? No. It is evident that every
Eddington was ridiculed over and over again for this statement and for his beliefs that
lead up to it. His arguments were indeed based on his personal preferences for certain
pet numbers. However, we should not laugh too loudly. In fact, for 2500 years almost all
scientists have thought along the same line, the only difference being that they have left
the precise number unspecified! In fact, any other number put into the above sentence
would be equally ridiculous. Avoiding specifying it is just a cowards’ way of avoiding
698 intermezzo: the brain, l anguage and the human condition
way that separability is thus necessary for covariant descriptions, the unity of nature is
necessary for invariant descriptions. In other words, the so-called ‘laws’ of nature are
based on the experience that nature is both separable and unifiable – that it is a set.
These arguments seem overwhelmingly to prove that the universe is a set. However,
these arguments apply only to everyday experience, everyday dimensions and everyday
energies. Is nature a set also outside the domains of daily life? Are objects different at all
energies, even when they are looked at with the highest precision possible? We have three
open issues left: the issue of the number of particles in the universe; the circular defini-
* In quantum mechanics also other, less clear definitions of locality are used. We will mention them in the
second part of this text. The issue mentioned here is a different, more fundamental one, and not connected
with that of quantum theory.
the quest for precision and its implications 699
tion of space, time and matter; and the issue as to whether describing nature as made of
particles and void is an overdescription, an underdescription, or neither. These three is-
sues make us doubt whether objects are countable at all energies. We will discover in the
Page 1067 third part of our mountain ascent that this is not the case in nature. The consequences will
be extensive and fascinating. As an example, try to answer the following: if the universe
Challenge 1194 n is not a set, what does that mean for space and time?
Following the definition above, existence of a concept means its usefulness to describe
”
interactions. There are two common definitions of the concept of ‘universe’. The first is
the totality of all matter, energy and space-time. But this usage results in a strange con-
What is creation?
(Gigni) De nihilo nihilum, in nihilum nil posse
Ref. 689
“ reverti.*
Persius, Satira, III, v. 83-84.
”
Ref. 690
The term ‘creation’ is often heard when talking about nature. It is used in various contexts
with different meanings.
One speaks of creation as the characterization of human actions, such as observed
in an artist painting or a secretary typing. Obviously, this is one type of change. In the
classification of change introduced at the beginning of our walk, the changes cited are
movements of objects, such as the electrons in the brain, the molecules in the muscles,
the material of the paint, or the electrons inside the computer. This type of creation is
* Nothing (can appear) from nothing, nothing can disappear into nothing.
700 intermezzo: the brain, l anguage and the human condition
direct consequence of the negation of creation. It took humanity over 2000 years before
Page 225 it stopped locking people in jail for talking about atoms, as had happened to Galileo.
However, there is one exception in which the naive concept of creation does apply:
it describes what magicians do on stage. When a magician makes a rabbit appear from
nowhere, we indeed experience ‘creation’ from nothing. At its best such magic is a form of
entertainment, at its worst, a misuse of gullibility. The idea that the universe results from
either of these two does not seem appealing; on second thought though, maybe looking
at the universe as the ultimate entertainment could open up a fresh and more productive
approach to life.
Voltaire (1694–1778) popularized an argument against creation often used in the past:
we do not know whether creation has taken place or not. Today the situation is different:
we do know that it has not taken place, because creation is a type of motion and, as we
the quest for precision and its implications 701
will see in the third part of our mountain ascent, motion did not exist near the big bang.
Have you ever heard the expression ‘creation of the laws of nature’? It is one of the
most common examples of disinformation. First of all, this expression confuses the ‘laws’
with nature itself. A description is not the same as the thing itself; everybody knows that
giving their beloved a description of a rose is different from giving an actual rose. Second,
the expression implies that nature is the way it is because it is somehow ‘forced’ to follow
the ‘laws’ – a rather childish and, what is more, incorrect view. And third, the expression
assumes that it is possible to ‘create’ descriptions of nature. But a ‘law’ is a description,
and a description by definition cannot be created: so the expression makes no sense at all.
The expression ‘creation of the laws of nature’ is the epitome of confused thinking.
It may well be that calling a great artist ‘creative’ or ‘divine’, as was common during
the Renaissance, is not blasphemy, but simply an encouragement to the gods to try to
do as well. In fact, whenever one uses the term ‘creation’ to mean anything other than
some form of motion, one is discarding both observations and human reason. It is one
“ destruction.
Pablo Picasso
”
Is nature designed?
In the beginning the universe was created. This
The tendency to infer the creation of an object from its simple existence is widespread.
Some people jump to this conclusion every time they see a beautiful landscape. This habit
stems from the triple prejudice that a beautiful scene implies a complex description, in
”
turn implying complex building instructions, and therefore pointing to an underlying
design.
This chain of thought contains several mistakes. First, in general, beauty is not a con- Copyright © Christoph Schiller November 1997–January 2007
sequence of complexity. Usually it is the opposite: indeed, the study of chaos and of self-
Ref. 691 organization demonstrated how beautifully complex shapes and patterns can be gener-
ated with extremely simple descriptions. True, for most human artefacts, complex descrip-
tions indeed imply complex building processes; a personal computer is a good example
of a complex object with a complex production process. But in nature, this connection
does not apply. We have seen above that even the amount of information needed to con-
struct a human body is about a million times smaller than the information stored in the
brain alone. Similar results have been found for plant architecture and for many other ex-
amples of patterns in nature. The simple descriptions behind the apparent complexities
of nature have been and are still being uncovered by the study of self-organization, chaos,
turbulence and fractal shapes. In nature, complex structures derive from simple processes.
Beware of anyone who says that nature has ‘infinite complexity’: first of all, complexity
702 intermezzo: the brain, l anguage and the human condition
is not a measurable entity, despite many attempts to quantify it. In addition, all known
complex system can be described by (relatively) few parameters and simple equations.
Finally, nothing in nature is infinite.
The second mistake in the argument for design is to link a description with an ‘instruc-
tion’, and maybe even to imagine that some unknown ‘intelligence’ is somehow pulling
the strings of the world’s stage. The study of nature has consistently shown that there is
no hidden intelligence and no instruction behind the processes of nature. An instruction
is a list of orders to an executioner. But there are no orders in nature, and no executioners.
There are no ‘laws’ of nature, only descriptions of processes. Nobody is building a tree;
the tree is an outcome of the motion of molecules making it up. The genes in the tree do
contain information; but no molecule is given any instructions. What seem to be instruc-
tions to us are just natural movements of molecules and energy, described by the same
patterns taking place in non-living systems. The whole idea of instruction – like that of
‘law’ of nature – is an ideology, born from an analogy with monarchy or even tyranny,
What is a description?
In theory, there is no difference between theory
A description can be compared to the ‘you are here’ sign on a city tourist map. Out
of a set of possible positions, the ‘you are here’ sign gives the actual one. Similarly, a
description highlights the given situation in comparison with all other possibilities. For
example, the formula a = GM~r 2 is a description of the observations relating motion
to gravity, because it classifies the observed accelerations a according to distance to the
central body r and to its mass M; indeed such a description sees each specific case as
an example of a general pattern. The habit of generalizing is one reason for the often
disturbing dismissiveness of scientists: when they observe something, their professional
training usually makes them classify it as a special case of a known phenomenon and thus
keeps them from being surprised or from being exited about it.
A description is thus the opposite of a metaphor; the latter is an analogy relating an
observation with another special case; a description relates an observation with a general
case, such as a physical theory.
Felix qui potuit rerum cognoscere causas,
— Why are the leaves of most trees green? Because they absorb red and blue light. Why
do they absorb those colours? Because they contain chlorophyll. Why is chlorophyll
green? Because all chlorophyll types contain magnesium between four pyrrole groups,
”
and this chemical combination gives the green colour, as a result of its quantum mech-
anical energy levels. Why do plants contain chlorophyll? Because this is what land
plants can synthesize. Why only this? Because all land plants originally evolved from
the green algae, who are only able to synthesize this compound, and not the com-
pounds found in the blue or in the red algae, which are also found in the sea.
— Why do children climb trees, and why do some people climb mountains? Because of
the sensations they experience during their activity: the feelings of achievement, the
Copyright © Christoph Schiller November 1997–January 2007
symbolic act to go upwards, the wish to get a wider view of the world are part of this
type of adventure.
The ‘why’-questions in the last two paragraphs show the general difference between
reasons and purposes (although the details of these two terms are not defined in the
same way by everybody). A purpose or intention is a classification applied to the actions
* ‘Happy he who can know the causes of things and who, free of all fears, can lay the inexorable fate and
the noise of Acheron to his feet.’ Georgica, book II, verses 490 ss.) Publius Vergilius Maro (70–19 bce ), the
great roman poet, is author of the Aeneid. Acheron was the river crossed by those who had just died and
were on their way to the Hades.
** The whole modern conception of the world is founded on the illusion that the so-called laws of nature
are the explanations of natural phenomena.
704 intermezzo: the brain, l anguage and the human condition
of humans or animals; strictly speaking, it specifies the quest for a feeling, namely for
achieving some type of satisfaction after completion of the action. On the other hand, a
reason is a specific relation of a fact with the rest of the universe, usually its past. What
we call a reason always rests outside the observation itself, whereas a purpose is always
internal to it.
Reasons and purposes are the two possibilities of explanations, i.e. the two possible
answers to questions starting with ‘why’. Usually, physics is not concerned with purpose or
with people’s feeling, mainly because its original aim, to talk about motion with precision,
does not seem to be achievable in this domain. Therefore, physical explanations of facts
are never purposes, but are always reasons. A physical explanation of an observation is
Ref. 692 always the description of its relation with the rest of nature.*
This means that – contrary to common opinion – a question starting with ‘why’ is ac-
cessible to physical investigation as long as it asks for a reason and not for a purpose. In
particular, questions such as ‘why do stones fall downwards and not upwards?’ or ‘why do
Studying the properties of motion, constantly paying attention to increase the accuracy ”
of description, we find that explanations are generally of two types:***
— ‘It is like all such cases; also this one is described by ...’ The situation is recognized as a
special case of a general behaviour.
— ‘If the situation were different, we would have a conclusion in contrast with observa-
Copyright © Christoph Schiller November 1997–January 2007
— ‘It is like the case of ...’ A similarity with another single case is not an explanation.
— ‘If it were different, it would contradict the idea that ...’ A contradiction with an idea or with a theory is
not an explanation.
the quest for precision and its implications 705
In other words, the first approach is to formulate rules or ‘laws’ that describe larger and
larger numbers of observations, and compare the observation with them. This endeavour
is called the unification of physics – by those who like it; those who don’t like it, call it ‘re-
ductionism’. For example, the same rule describes the flight of a tennis ball, the motion of
the tides at the sea shore, the timing of ice ages, and the time at which the planet Venus
ceases to be the evening star and starts to be the morning star. These processes are all con-
sequences of universal gravitation. Similarly, it is not evident that the same rule describes
the origin of the colour of the eyes, the formation of lightning, the digestion of food and
the working of the brain. These processes are described by quantum electrodynamics.
Unification has its most impressive successes when it predicts an observation that has
not been made before. A famous example is the existence of antimatter, predicted by Dirac
when he investigated the solutions of an equation that describes the precise behaviour of
common matter.
The second procedure in the search for explanations is the elimination of all other
”
The wish to achieve demarcation of the patterns of nature is most interesting when we
Copyright © Christoph Schiller November 1997–January 2007
follow the consequences of different rules of nature until we find them in contradiction
with the most striking observation: our own human existence. In this special case the pro-
gram of demarcation is often called the anthropic principle – from the Greek ἄνθρωπος,
meaning ‘man’.
For example, if the Sun–Earth distance were different from what it is, the resulting tem-
perature change on the Earth would have made impossible the appearance of life, which
needs liquid water. Similarly, our brain would not work if the Moon did not circle the
Earth. It is also well-known that if there were fewer large planets in the solar system, the
evolution of humans would have been impossible. The large planets divert large numbers
* The most important instrument of a scientist is the waste paper basket.
706 intermezzo: the brain, l anguage and the human condition
of comets, preventing them from hitting the Earth. The spectacular collision of comet
Shoemaker–Levy-9 with Jupiter, the astronomical event of July 1994, was an example of
this diversion of a comet.*
Also the anthropic principle has its most impressive successes when it predicts un-
known observations. The most famous example stems from the study of stars. Carbon
atoms, like all other atoms except most hydrogen, helium or lithium atoms, are formed
in stars through fusion. While studying the mechanisms of fusion in 1953, the well-known
British astrophysicist Fred Hoyle** found that carbon nuclei could not be formed from
the alpha particles present inside stars at reasonable temperatures, unless they had an ex-
cited state with an increased cross-section. From the fact of our existence, which is based
on carbon, Hoyle thus predicted the existence of a previously unknown excited state of
Ref. 693 the carbon nucleus. And, indeed, the excited state was found a few months later by Willy
Fowler.***
In its serious form, the anthropic principle is therefore the quest to deduce the descrip-
* For a collection of pictures of this event, see e.g. the http://garbo.uwasa.fi/pc/gifslevy.html website.
** Fred Hoyle (b. 1915 Bingley, Yorkshire, d. 2001), important British astronomer and astrophysicist. He
was the first and maybe only physicist who ever made a specific prediction – namely the existence of an
excited state of the carbon nucleus – from the simple fact that humans exist. A permanent maverick, he
coined the term ‘big bang’ even though he did not accept the evidence for it, and proposed another model,
the ‘steady state’. His most important and well-known research was on the formation of atoms inside stars.
He also propagated the belief that life was brought to Earth from extraterrestrial microbes.
*** William A. Fowler (1911–1995) shared the 1983 Nobel Prize in physics with Subramanyan
Chandrasekhar for this and related discoveries.
**** Though apes do not seem to be good physicists, as described in the text by D.J. Povinelli, Folk
Physics for Apes: the Chimpanzee’s Theory of How the World Works, Oxford University Press, 2000.
the quest for precision and its implications 707
Ref. 695
Historically, the two terms ‘cause’ and ‘effect’ have played an important role in philosoph-
ical discussions. In particular, during the birth of modern mechanics, it was important
to point out that every effect has a cause, in order to distinguish precise thought from
”
thought based on beliefs, such as ‘miracles’, ‘divine surprises’ or ‘evolution from noth-
ing’. It was equally essential to stress that effects are different from causes; this distinction
avoids pseudo-explanations such as the famous example by Molière where the doctor ex-
plains to his patient in elaborate terms that sleeping pills work because they contain a
dormitive virtue.
But in physics, the concepts of cause and effect are not used at all. That miracles do not
appear is expressed every time we use symmetries and conservation theorems. The obser-
vation that cause and effect differ from each other is inherent in any evolution equation.
Moreover, the concepts of cause and effect are not clearly defined; for example, it is espe-
cially difficult to define what is meant by one cause as opposed to several of them, and the
same for one or several effects. Both terms are impossible to quantify and to measure. In
other words, useful as ‘cause’ and ‘effect’ may be in personal life for distinction between
events that regularly succeed each other, they are not necessary in physics. In physical
explanations, they play no special roles.
Ὰγαθον καὶ ξαξόν ë ἔν καὶ ταὐτό.**
“ Heraclitus
* ‘He was amazed that cats had holes cut into their fur precisely in those places where they had eyes.’ ”
Georg Christoph Lichtenberg (1742–1799), German physicist and intellectual, professor in Göttingen, still
famous today for his extremely numerous and witty aphorisms and satires. Among others of his time,
Lichtenberg made fun of all those who maintained that the universe was made exactly to the measure of
man, a frequently encountered idea in the foggy world of the anthropic principle.
** ‘Good and bad – one and the same.’
*** ‘When a doctor walks behind the coffin of his patient, indeed the cause sometimes follows the effect.’
708 intermezzo: the brain, l anguage and the human condition
Is consciousness required?
Variatio delectat.*
Ref. 696
“
A lot of mediocre discussions are going on about this topic, and we will skip them here.
What is consciousness? Most simply and concretely, consciousness means the possession
Cicero
”
of a small part of oneself that is watching what the rest of oneself is perceiving, feeling,
thinking and doing. In short, consciousness is the ability to observe oneself, and in par-
ticular one’s inner mechanisms and motivations. Consciousness is the ability of introspec-
tion. For this reason, consciousness is not a prerequisite for studying motion. Indeed,
animals, plants or machines are also able to observe motion. For the same reason, con-
sciousness is not necessary to observe quantum mechanical motion. On the other hand,
both the study of motion and that of oneself have a lot in common: the need to observe
carefully, to overcome preconceptions, to overcome fear and the fun of doing so.
”
Curiosity
Precision is the child of curiosity.
“
Like the history of every person, also the history of mankind charts a long struggle to
avoid the pitfalls of accepting the statements of authorities as truth, without checking the
”
facts. Indeed, whenever curiosity leads us to formulate a question, there are always two
general ways to proceed. One is to check the facts personally, the other is to ask somebody.
However, the last way is dangerous: it means to give up a part of oneself. Healthy people,
children whose curiosity is still alive, as well as scientists, choose the first way. After all,
science is adult curiosity.
Curiosity, also called the exploratory drive, plays strange games with people. Starting
with the original experience of the world as a big ‘soup’ of interacting parts, curiosity can
drive one to find all the parts and all the interactions. It drives not only people. It has been
Copyright © Christoph Schiller November 1997–January 2007
observed that when rats show curious behaviour, certain brain cells in the hypothalamus
get active and secrete hormones that produce positive feelings and emotions. If a rat has
the possibility, via some implanted electrodes, to excite these same cells by pressing a
Ref. 697 switch, it does so voluntarily: rats get addicted to the feelings connected with curiosity.
Like rats, humans are curious because they enjoy it. They do so in at least four ways:
because they are artists, because they are fond of pleasure, because they are adventurers
and because they are dreamers. Let us see how.
Originally, curiosity stems from the desire to interact in a positive way with the envir-
onment. Young children provide good examples: curiosity is a natural ingredient of their
* ‘Change pleases.’ Marcus Tullius Cicero (106–43 bce ), important lawyer, orator and politician at the end
of the Roman republic.
the quest for precision and its implications 709
life, in the same way that it is for other mammals and a few bird species; incidentally, the
same taxonomic distribution is found for play behaviour. In short, all animals that play
are curious, and vice versa. Curiosity provides the basis for learning, for creativity and
thus for every human activity that leaves a legacy, such as art or science. The artist and art
theoretician Joseph Beuys (1920–1986) had as his own guiding principle that every creat-
Ref. 698 ive act is a form of art. Humans, and especially children, enjoy curiosity because they feel
its importance for creativity, and for growth in general.
Curiosity regularly leads one to exclaim: ‘Oh!’, an experience that leads to the second
reason to be curious: relishing feelings of wonder and surprise. Epicurus (Epikuros) (341–
271 bce ) maintained that this experience, θαυµάζειν, is the origin of philosophy. These
feelings, which nowadays are variously called religious, spiritual, numinous, etc., are the
same as those to which rats can become addicted. Among these feelings, Rudolf Otto
has introduced the now classical distinction into the fascinating and the frightening.
He named the corresponding experiences ‘mysterium fascinans’ and ‘mysterium tremen-
an innate desire, most obvious in children, to reduce uncertainty and fear. This drive is
the father of all adventures. It has a well-known parallel in ancient Greece, where the first
men studying observations, such as Epicurus, stated explicitly that their aim was to free
people from unnecessary fear by deepening knowledge and transforming people from
* This distinction is the basis of Rudolf Otto, Das Heilige – Über das Irrationale in der Idee des Göttlichen
und sein Verhältnis zum Rationalen, Beck, München, 1991. This is a new edition of the epoch-making work
originally published at the beginning of the twentieth century. Rudolf Otto (1869–1937) was one of the most
important theologians of his time.
** Several researchers have studied the situations leading to these magic moments in more detail, notably
the Prussian physician and physicist Hermann von Helmholtz (1821–1894) and the French mathematician
Henri Poincaré (1854–1912). They distinguish four stages in the conception of an idea at the basis of such
Ref. 700 a magic moment: saturation, incubation, illumination and verification.
710 intermezzo: the brain, l anguage and the human condition
frightened passive victims into fascinated, active and responsible beings. Those ancient
thinkers started to popularize the idea that, like the common events in our life, the rarer
events also follow rules. For example, Epicurus underlined that lightning is a natural phe-
nomenon caused by interactions between clouds, and stressed that it was a natural pro-
cess, i.e. a process that followed rules, in the same way as the falling of a stone or any other
familiar process of everyday life.
Investigating the phenomena around them, philosophers and later scientists suc-
ceeded in freeing people from most of their fears caused by uncertainty and a lack of
knowledge about nature. This liberation played an important role in the history of hu-
man culture and still pervades in the personal history of many scientists. The aim to ar-
rive at stable, rock-bottom truths has inspired (but also hindered) many of them; Albert
Einstein is a well-known example for this, discovering relativity, helping to start up but
then denying quantum mechanics.
Interestingly, in the experience and in the development of every human being, curios-
Courage
“ Bert Hellinger
”
It is dangerous to be right in matters on which
Wahl läßt.**
Bert Hellinger
Ref. 701
Most of the material in this intermezzo is necessary in the adventure to get to the top of
Motion Mountain. But we need more. Like any enterprise, curiosity also requires courage,
and complete curiosity, as aimed for in our quest, requires complete courage. In fact, it is
”
easy to get discouraged on this trip. The journey is often dismissed by others as useless,
uninteresting, childish, confusing, damaging or, most often, evil. For example, between
* ‘The unveiled secret takes revenge.’
** ‘Some look for security where courage is required and look for freedom where the right way doesn’t leave
any choice.’ This is from the beautiful booklet by Bert Hellinger, Verdichtetes, Carl-Auer Systeme Verlag,
1996.
the quest for precision and its implications 711
the death of Socrates in 399 bce and Paul Thierry, Baron d’Holbach, in the eighteenth
century, no book was published with the statement ‘gods do not exist’, because of the
threats to the life of anyone who dared to make the point. Even today, this type of attitude
still abounds, as the newspapers show.
Through the constant elimination of uncertainty, both curiosity and scientific activity
are implicitly opposed to any idea, person or organization that tries to avoid the com-
parison of statements with observations. These ‘avoiders’ demand to live with supersti-
tions and beliefs. But superstitions and beliefs produce unnecessary fear. And fear is the
basis of all unjust authorities. One gets into a vicious circle: avoiding comparison with
observation produces fear – fear keeps unjust authority in place – unjust authority avoids
comparison with observation – etc.
As a consequence, curiosity and science are fundamentally opposed to unjust author-
ity, a connection that made life difficult for people such as Anaxagoras (500–428 bce )
in ancient Greece, Hypatia in the Christian Roman empire, Galileo Galilei in the church
disinformation and fear with all one’s courage, one achieves freedom from all beliefs. In
exchange, you come to savour the fullest pleasures and the deepest satisfaction that life
has to offer.
We thus continue our hike. At this point, the trail towards the top of Motion Mountain
is leading us towards the next adventure: discovering the origin of sizes and shapes in
nature.
And the gods said to man: ‘Take what you want,
”
712 intermezzo: the brain, l anguage and the human condition
Biblio graphy
632 See for example the beautiful textbook by Stephen C. Stearns & Rolf F. Hoekstra,
Evolution: An Introduction, Oxford University Press, 2000. For fascinating story of evolu-
tion for non-specialists, see R ichard Fortey, Life – An Unauthorized Biography, Harper
Collins, 1997, or also Menno Schilthuizen, Frogs, Flies & Dandelions – the Making of
Species, Oxford University Press/ 2001. See also Stephen J. Gould, The Panda’s thumb,
W.W. Norton & Co., 1980, one of the several interesting and informative books on evolution-
ary biology by the best writer in the field. An informative overview over the results of evolu-
tion, with the many-branched family tree that it produced, is given on the http://phylogeny.
arizona.edu/tree website. About the results of evolution for human beings, see the informat-
ive text by K. Kusch & S. Kusch, Der Mensch in Zahlen, Spektrum Akademischer Verlag,
2nd edn., 2000. The epochal work by Charles Darwin, On the Origin of Species, can be
found on the web, e.g. on the http://entisoft.earthlink.net/origspec.htm website. Cited on
page 648.
properties of the elements of the system, and this approach prevents a full understanding of
the system under discussion. Cited on page 650.
637 For a view of the mental abilities different from that of Piaget (described on page 650),
a presently much discussed author is the Soviet experimental psychologist Lev Vigotsky,
whose path-breaking ideas and complicated life are described, e.g., in Lev Vigotsky,
Mind in Society, Harvard University Press, 1978, or in R ené van der Veer & Jaan
Valsiner, Understanding Vigotsky: a Quest for Synthesis, Blackwell Publishers, 1994. More
extensive material can be found in the extensive work by R ené van der Veer & Jaan
Valsinger, The Vigotsky Reader, Blackwell, 1994. Cited on page 650.
638 A somewhat unconventional source for more details is the beautiful text by Bruno
Bettelheim, The Uses of Enchantment: the Meaning and Importance of Fairy Tales,
Knopf, 1976. Cited on page 650.
714 intermezzo: the brain, l anguage and the human condition
648 J.J. Hopfield/ Nature 376, pp. 33–36, 1995. This paper by one of the fathers of the field
presents one possibility by which the timing of nerve signals, instead of the usually assumed
firing frequency, could also carry information. Cited on page 655.
649 The details of the properties of the firing patterns of neurons are nicely described in the
article by M. Mahowald & R. Dougl as, A silicon neuron, Nature 354, pp. 515–518,
19/26 December 1991, in which they show how to simulate a neuron’s electrical behaviour
using a silicon circuit. Cited on page 655.
650 A. Mechelli, J.T. Crinion, U. Noppeney, J. O’Doberty, J. Ashburner, R.S.
Frackowiak & C.J. Price, Neurolinguistics: structural plasticity in the bilingual brain,
Nature 431, p. 757, 2004. Cited on page 656.
651 The discussion whether the brain is or is not superior to a computer is nicely summarised
by G. Vollmer, Algorithmen, Gehirne, Computer – Was sie können und was sie nicht
bibliography 715
können, Teil I und Teil II, Naturwissenschaften 78, p. 481, 1991, and 78, pp. 533–542, 1991.
Cited on page 656.
652 However, the language with the largest available dictionary is Dutch, with the 40 volumes
of the Wordenboek der Nederlandsche Taal, which appeared between 1864 and 1998. It has
almost 400 000 entries. Cited on page 658.
653 The list and the remark on discovery on concepts is due to a personal communication from
Anna Wierzbicka. A longer list is published in her book Semantics, Primes and Universals,
Oxford University Press, 1996. Cited on pages 659 and 677.
654 W.S. Hatcher, Foundations of Mathematics, W.B. Saunders Co., Philadelphia, 1968. There
is also the article by P. J. Cohen & R. Hersch, Non-Cantorian set theory, Scientific
American 217, pp. 104–116, 1967. Cohen was the mathematician who in 1963 proved that
the negation of the continuum hypothesis could be added to the axioms of set theory and
still produce a consistent theory; he calls such sets non-Cantorian. Cited on page 663.
655 See the beautiful article by I. Stewart, Fair shares for all, New Scientist, pp. 42–46, 17 June
1995. Cited on page 664.
whose number of petals is from the Fibonacci series 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, etc.
(The figure on page 177 is an example.) Deviations from this ideal case are also explained.
The original work are two articles by S. Douady & Y. Couder, La physique des spirales
végétales, La Recherche 24, pp. 26–36, 1993, and Phyllotaxis as a self-organized growth pro-
cess, in Growth Patterns in Physical Sciences and Biology, edited by J.M. Garcia-Ruiz &
al., Plenum Press, 1993. Cited on page 668.
662 H. Davson, The Eye, Academic Press, 1962. Cited on page 669.
663 See the http://akbar.marlboro.edu/~mahoney/cube/NxN.txt website. Cited on page 669.
664 An introduction to the surreal numbers is given by the article by Polly Shulman, Infinity
plus one, and other surreal numbers, Discover, pp. 99–105, December 1995. There is also the
text by D. Knu th, Surreal Numbers: How two ex-Students Turned on to Pure Mathematics
and Found Total Happiness, Addison Wesley, 1974, or http://www-cs-faculty.stanford.edu/
716 intermezzo: the brain, l anguage and the human condition
~knuth/sn.html. The usually quoted references on the topic include John H. Conway,
On Numbers and Games, Academic Press, 1976, E.R. Berlekamp, J.H. Conway & R.K.
Guy, Winning Ways for Your Mathematical Plays, Volume I: Games in General, Academic
Press, 1982, and H. Gonshor, An Introduction to Surreal Numbers, Cambridge University
Press, 1986. Cited on pages 669 and 671.
665 This beautiful problem is discussed by Ian Stewart, A bundling fool beats the wrap, Sci-
entific American, pp. 109–111, June 1993. In four dimensions, the answer is known to lie some-
where between 50 000 and 100 000, whereas the five-dimensional answer is conjectured to
be ‘never’. Cited on page 672.
666 A. Pais, Niels Bohr’s Times: in Physics, Philosophy and Polity, Oxford University Press, 1991,
page 176. Cited on page 672.
667 Eugene Wigner, Symmetries and Reflections, Indiana University Press, 1962. Cited on
page 672.
668 Alfred Tarski, Introduction to Modern Logic, Dover, 1946. See also the famous chil-
and the exquisite book by Hans Conrad Z ander, Warum ich Jesus nicht leiden kann,
Rowohlt, 1994. Cited on page 681.
673 Roger Penrose, The Road to Reality: A Complete Guide to the Laws of the Universe,
Jonathan Cape, 2004, page 378. Cited on page 697.
674 John Horgan, The End of Science – Facing the Limits of Knowledge in the Twilight of the
Scientific Age, Broadway Books, 1997, pp. 31–32, and chapter 2, note 2. Cited on pages 683
and 697.
675 For an opinion completely contrary to the one described here, see the book by Gregory J.
Chaitin, The Limits of Mathematics, Springer Verlag, 1997, which can also be found on
the author’s website http://www.cs.auckland.ac.nz/CDMTCS/chaitin/lm.html, along with
his other works. Chaitin has devoted most of his life to the questions discussed in the section,
especially on computability. Cited on page 684.
bibliography 717
676 See the book by J. Barwise & J. Etchemendy, The Liar, Oxford University Press, New
York, 1987. Cited on page 684.
677 This definition (statement 4.11) and many other statements about science are in the beau-
tiful and rightly famous text by Ludwig Wittgenstein, Tractatus logico-philosophicus,
Edition Suhrkamp, 1963. It gives a condensed summary of the basis of science, thought and
language in a collection of highly structured and numbered sentences. Cited on pages 686
and 691.
678 See M. Dresden, The Klopsteg memorial lecture, American Journal of Physics 66, pp. 468–
482, 1998. Cited on page 687.
679 Well-known books are e.g. Friedrich Kohlrausch, Praktische Physik, Teubner, 24. Au-
flage, 1996. Cited on page 687.
680 Results are described e.g. in L. Bergmann & C. Schäfer, Lehrbuch der Experimental-
physik, Band I, II, III und IV, W. de Gruyter. Cited on page 687.
681 L and olt-B örnstein, edited by K.-H. Hellwege & O. Madelung, Zahlenwerte
of Greek Philosophy, Dover, 1980, a reprint of a book published in 1928. Among others, it
gives a clear exposition of the philosophy of Democritus and the other presocratics. Cited
on page 697.
688 The famous quote is found at the beginning of chapter XI, ‘The Physical Universe’, in Ar-
thur Eddington, The Philosophy of Physical Science, Cambridge, 1939. Cited on page
697.
689 Giuseppe Fumagalli, Chi l’ha detto?, Hoepli, Milano, 1983. Cited on page 699.
690 See Jean-Paul Dumont, Les écoles présocratiques, Folio Essais, Gallimard, p. 653, 1991.
Cited on page 699.
691 For a beautiful text on fractals, see the footnote on page 53. Cited on page 701.
692 As has been pointed out by René Descartes. Cited on page 704.
718 iv intermezzo: the brain, l anguage and the human condition
12
693 The famous carbon C resonance was found by Willy Fowler, as described in E. Mar-
garet Burbrid ge, G.R. Burbrid ge, W.A. Fowler & F. Hoyle, Synthesis of the
elements in stars, Reviews of Modern Physics 29, pp. 547–560, 1957. Cited on page 706.
694 An extensive overview of the topic is given in the thick book by John D. Barrow &
Frank J. Tipler, The Anthropic Cosmological Principle, Oxford University Press, 1986.
The term itself is due to Brandon Carter, who coined it in 1973 and presented it in a sym-
posium devoted to the 500th anniversary of Nicolaus Copernicus. For more literature, see
Yuri I. Bal ashov, Resource Letter AP-1: the anthropic principle, American Journal of
Physics 59, pp. 1069–1076, 1991. Cited on page 706.
695 Voltaire, Candide ou l’optimisme, 1759. See the footnote on page 183. The book is so good
that it was still being seized by the US customs in 1930, and the US post office refused to
transport it as late as 1944. For more details, search for ‘banned books online’ on the world-
wide web. Cited on page 707.
696 The number of books on consciousness is large and the contents not always interesting, and
often not based on fact, as shown by Karl R. Popper & John Eccles, The Self and its
701 For example, one needs the courage to face envy. About this topic see the classic text by
Helmu t Schoeck, Der Neid, 1966, published in English as Envy: A Theory of Social Be-
havior, 1969. It is the standard work in the field. Cited on page 710.
Second Part
Quantum Theory:
What Is Matter?
What Are Interactions?
“ 15th century
ing sky, a waterfall, a caress or a happy child. Classical quantum system (© Ata Masafumi)
Challenge 1199 n physics is not able to explain it: the involved colours,
shapes and sizes remain mysterious.
In the beginning of physics this limitation was not seen as a shortcoming: in those
times neither senses nor material properties were imagined to be related to motion. And
of course, in older times the study of pleasure was not deemed a serious topic of invest-
Page 261 igation for a respectable researcher. However, in the meantime we know that the senses
of touch, smell and sight are first of all detectors of motion. Without motion, no senses!
In addition, all detectors are built of matter. In the chapter on electromagnetism we star-
ted to understand that all properties of matter are due to motion of charged constituents.
Density, stiffness, colour and all other material properties result from the electromagnetic
behaviour of the Lego bricks of matter, namely the molecules, the atoms and the electrons.
Thus, also matter properties are consequences of motion. In addition, we saw that these
tiny constituents are not correctly described by classical electrodynamics. We even found
Page 603 that light itself behaves unclassically. Therefore the inability of classical physics to describe
matter and the senses is indeed due to its intrinsic limitations.
In fact, every failure of classical physics can be traced back to a single, fundamental
Ref. 703 discovery made in 1899 by Max Planck:*
In nature, actions smaller than the value ħ~2 = 0.53 ë 10−34 Js are not observed.
All experiments trying to do so invariably fail.** In other words, in nature there is al-
ways some action – like in a good film. This existence of a minimal action, the so-called
Challenge 1200 n quantum principle, is in full contrast with classical physics. (Why?) However, it has passed
surprising consequences.
* About Max Planck and his accomplishments, see the footnote on page 603.
** In fact, the cited quantum principle is a simplification; the constant originally introduced by Planck was
the (unreduced) constant h = 2πħ. The factors 2π and 1/2 leading to the final quantum principle were found
somewhat later, by other researchers.
This somewhat unconventional, but useful didactic approach is due to Niels Bohr. Nowadays, the ap-
proach is almost never found in the literature; it might be used in a teaching text for the first time here since
Ref. 704 many decades.
Niels Bohr (b. 1885 Copenhagen, d. 1962 Copenhagen) made Copenhagen University into one of the
centres of quantum theory, overshadowing Göttingen. He developed the description of the atom with
quantum theory, for which he received the 1922 Nobel Prize in physics. He had to flee Denmark in 1943
after the German invasion, because of his Jewish background, but returned there after the war.
722 v quanta of light and matter • 18. minimum action
of all measurable quantities can be defined, including time and thus frequency. (Can you
Challenge 1202 n find a combination of c, G and ħ giving a time? It only will take a few minutes.) When
Planck found all such combinations, he was happy like a child; he knew straight away
that he had made a fundamental discovery, even though in 1899 quantum theory did not
exist yet. He even told his seven-year-old son Erwin about it, while walking with him
through the forests around Berlin. Planck explained to his son that he had made a discov-
ery as important as universal gravity. Indeed, Planck knew that he had found the key to
understanding most of the effects which were unexplained so far. In particular, without
the quantum of action, colours would not exist: every colour is a quantum effect.*
Planck also realized that the quantum of action allows to understand the size of all
Challenge 1203 n * It is also possible to define all units using c, G and e, the electron charge. Why is this not satisfactory?
minimum action – quantum theory for poets and l aw yers 723
Challenge 1204 e things. (Can you find the combination of c, G and ħ that yields a length?) With the
quantum of action, it was finally possible to answer the question on the maximum size of
Page 226 mountains, of trees and of humans. Planck knew that the quantum of action confirmed
the answer Galileo had deduced already long before him: sizes are due to fundamental,
minimal scales in nature.
The size of objects is related to the size of atoms. In turn, the size of atoms is a direct
consequence of the quantum of action. Can you deduce an approximation for the size
of atoms, knowing that it is given by the motion of electrons of mass m e and charge e,
Challenge 1205 n constrained by the quantum of action? This connection, a simple formula, was discovered
in 1910 by Arthur Erich Haas, 15 years before quantum theory was formulated; at the time,
everybody made fun of him. Nowadays, the expression is found in all textbooks.*
By determining the size of atoms, the quantum of action has an important con-
sequence: Gulliver’s travels are impossible. There are no tiny people and no giant ones.
Classically, nothing speaks against the idea; but the quantum of action does. Can you
properties of objects, from stones to whipped cream. All measurements are only possible
due to the existence of the quantum of action.
Why ‘quantum’?
Quantum effects surround us from all sides. However, since the minimum action is so
small, its effects on motion appear mostly, but not exclusively, in microscopic systems. The
study of such systems has been called quantum mechanics by Max Born, one of the main
* Before the discovery of ħ, the only simple length scale for the electron was the combination
e 2 ~(4πε 0 m e c 2 ) 3 fm; this value is ten thousand times smaller than an atom.
724 v quanta of light and matter • 18. minimum action
TA B L E 56 Some small systems in motion and the observed action values for their changes
Light
Smallest amount of light absorbed by a coloured surface 1ħ quantum
Smallest hit when light reflects from mirror 2ħ quantum
Smallest visible amount of light c. 5 ħ quantum
Smallest amount of light absorbed in flower petal c. 1 ħ quantum
Blackening of photographic film c. 3 ħ quantum
Photographic flash c. 1017 ħ classical
Electricity
Electron ejected from atom c. 1 − 2 ħ quantum
Electron added to molecule c. 1 − 2 ħ quantum
figures of the field.* Later on, the term quantum theory became more popular.
Quantum theory results from the existence of minimum measurable values in nature,
precisely in the way that Galileo already speculated about in the seventeenth century. As
Page 225 mentioned in detail earlier on, it was Galileo’s insistence on these ‘piccolissimi quanti’
that got him accused and into trouble. Of course, we will soon discover that only the idea
of a smallest change leads to a precise and accurate description of nature. Born took up
Galileo’s term and translated it from an italian plural back to a latin singular.
The term ‘quantum’ theory does not mean that all measurement values are multiples
of a smallest one; this is correct only in a few rare cases.
Quantum physics is the description of microscopic motion. But when is quantum the-
ory necessary? Table 56 shows that all processes on atomic and molecular scale, including
biological and chemical ones, involve action values near the quantum of action. So do pro-
cesses of light emission and absorption. All these phenomena can be described only with
quantum theory.
* Max Born (b. 1882 Breslau, d. 1970 Göttingen) first studied mathematics, then turned to physics. Professor
in Göttingen, he made the city one of the world centres of physics. He developed quantum mechanics with
his assistants Werner Heisenberg and Pascual Jordan, then applied it to scattering, to solid state physics, to
Ref. 705 optics and to liquids. He was the first to understood that the state function describes a probability amplitude.
He is one of the authors of the famous Born & Wolf textbook on optics; it still remains the main book of
the field. Born attracted to Göttingen the most brilliant talents of the time, receiving as visitors Hund, Pauli,
Nordheim, Oppenheimer, Goeppert–Mayer, Condon, Pauling, Fock, Frenkel, Tamm, Dirac, Mott, Klein,
Heitler, London, von Neumann, Teller, Wigner and dozens of others. Being Jewish, Max Born lost his job
in 1933; he emigrated and became professor in Edinburgh, where he stayed for twenty years. Physics at
Göttingen University never recovered from this loss. For his elucidation of the meaning of the wave function
he received the 1954 Nobel Prize in physics.
726 v quanta of light and matter • 18. minimum action
∆E∆t E
ħ
, (478)
2
where E is the energy of the system and t its age. In other words, ∆E is the change of
energy and ∆t the time between two successive observations.
Challenge 1208 e By a similar reasoning we find that for any system the pos-
ition and momentum values are constrained by
∆x∆p E
ħ
, (479)
2
where ∆p is the indeterminacy in momentum and ∆x the
indeterminacy in position. These two famous relations were
the one for momentum and position. That is wrong; it is a myth propagated by the older generation of
physicists. This myth survived through many textbooks for over 70 years; just forget it, as it is incorrect. It
is essential to remember that all four quantities appearing in the inequalities are quantities describing the
internal properties of the system. In particular, it means that t is some time variable deduced from changes
observed inside the system and not the external time coordinate measured by an outside clock, in the same
Ref. 706 way that the position x is not the external space coordinate, but the position characterizing the system.
Werner Heisenberg (1901–1976) was an important German theoretical physicist and an excellent table
tennis and tennis player. In 1925, as a young man, he developed, with some help by Max Born and Pas-
cual Jordan, the first version of quantum theory; from it he deduced the indeterminacy relations. For these
achievements he received the Nobel Prize for physics in 1932. He also worked on nuclear physics and on tur-
bulence. During the second world war, he worked in the German nuclear fission program. After the war, he
published several successful books on philosophical questions in physics and he unsuccessfully tried, with
some half-hearted help by Wolfgang Pauli, to find a unified description of nature based on quantum theory,
the ‘world formula’.
minimum action – quantum theory for poets and l aw yers 727
infinite precision, the very concept of motion needs to be used with great care, as it can-
not be applied in certain situations. In a sense, the rest of our quest is an exploration of
the implications of this result. In fact, as long as space-time is flat, it turns out that we
can keep motion as a concept describing observations, provided we remain aware of the
limitations of the quantum principle.
Challenge 1209 n as you might want to check. However, the quantum of action allows that a microscopic
particle, such as an electron, disappears for a short time, provided it reappears afterwards.
The quantum of action also implies that the vacuum is not empty. If one looks at empty
space twice in a row, the two observations being spaced by a tiny time interval, some
energy will be observed the second time. If the time interval is short enough, due to
the quantum of action, matter particles will be observed. Indeed, particles can appear
anywhere from nowhere, and disappear just afterwards, as the action limit requires it.
In summary, nature shows short time appearance and disappearance of matter. In other
words, classical physics’ idea of an empty vacuum is correct only when observed over long
time scales.
The quantum of action implies that compass needles cannot work. If we look twice in
728 v quanta of light and matter • 18. minimum action
E
m p
0 ∆x
a row at a compass needle or even at a house, we usually observe that they stay oriented
Challenge 1210 e in the same direction. But since physical action has the same unit as angular momentum,
a minimum value for action also means a minimum value for angular momentum. Even
every macroscopic object has a minimum value for its rotation. In other words, quantum
theory predicts that in everyday life, everything rotates. Lack of rotation exists only ap-
In everyday life, thus in classical physics, the particle will never be observed on the
other side of the hill, if its kinetic energy p2 ~2m is smaller than the height E of the hill.
In everyday lifeºthis is impossible. But imagine that the missing momentum to overcome
the hill, ∆p = 2mE − p, satisfies ∆x∆p D ħ~2. The particle will have the possibility to
overcome the hill, despite its insufficient energy. The quantum of action thus implies that
a hill of width
∆x D º
ħ~2
(480)
2mE − p
is not an obstacle to a particle of mass m. But this is not all. Since the value of the particle
momentum p is itself uncertain, a particle can overcome the hill even if the hill is wider
than value (480), though the broader it is the smaller the probability will be. As a res-
minimum action – quantum theory for poets and l aw yers 729
E1
m E2
ult, any particle can overcome any obstacle. This effect is called the tunnelling effect, for
obvious reasons.
In short, the minimum action principle implies that there are no tight boxes in nature.
Due to tunnelling, matter is not impenetrable, in contrast to everyday, classical observa-
Challenge 1211 n tion. Can you explain why lion cages work despite the quantum of action?
By the way, the quantum of action also implies that a particle with a kinetic energy
larger than the energy height of a hill can get reflected by the hill. Classically this is im-
Challenge 1212 ny possible. Can you explain the observation?
The minimum action principle also implies that book shelves are dangerous. Shelves
are obstacles to motion. A book in a shelf is in the same situation as the mass in Figure 306;
the mass is surrounded by energy hills hindering its escape to the outer, lower energy
world. Now, due to the tunnelling effect, escape is always possible. The same picture ap-
plies to a branch of a tree, a nail in a wall, or to anything attached to anything else. Fixing
things to each other is never for ever. In particular, we will find out that every example
of light emission, even radioactivity, results from this effect. The quantum of action thus
implies that decay is part of nature. Note that decay often appears in everyday life, where it Copyright © Christoph Schiller November 1997–January 2007
just has a different name: breaking. In fact, all cases in which something breaks require the
Ref. 707 quantum of action for their description. Obviously, the cause of breaking is often classical,
but the mechanism of breaking is always quantum. Only objects that follow quantum the-
ory can break. In short, there are no stable excited systems in nature. For the same reason
Challenge 1213 ny by the way, no memory can be perfect. (Can you confirm the deduction?)
Taking a more general view, also ageing and death result from the quantum of action.
Death, like ageing, is a composition of breaking processes. Breaking is a form of decay,
and is due to tunnelling. Death is thus a quantum process. Classically, death does not
Challenge 1214 n exist. Might this be the reason that so many believe in immortality or eternal youth?
We will also discover that the quantum of action is the origin for the importance of
the action observable in classical physics. In fact, the existence of a minimal action is the
reason for the least action principle of classical physics.
730 v quanta of light and matter • 18. minimum action
M
m1
m2
m
m3
A minimum action also implies that matter cannot be continuous, but must be com-
posed of smallest entities. Indeed, the flow of a truly continuous material would contra-
Challenge 1215 n dict the quantum principle. Can you give the precise argument? Of course, at this point
that if the approach distance gets small, things get fuzzy. Now, if something happens in
that small domain in such a way that the resulting outgoing products have the same total
momentum and energy as the incoming ones, the minimum action principle makes such
processes possible. Indeed, ruling out such processes would imply that arbitrary small ac-
tions could be observed, thus eliminating nature’s fuzziness, as you might want to check
Challenge 1218 e by yourself. In short, a minimum action allows transformation of matter. One also says that
the quantum of action allows particle reactions. In fact, we will discover that all kinds of
reactions in nature, including breathing, digestion and all other chemical and nuclear
reactions, are only due to the existence of the quantum of action.
An example of transformation is especially dear to us: growth. The quantum of ac-
tion implies that all growth happens in smallest steps. This is indeed the case. All growth
processes in nature are quantum processes.
minimum action – quantum theory for poets and l aw yers 731
R andomness
But the quantum of action has more interesting consequences. Due to the indeterminacy
relations, it is impossible to give a definite value to both the momentum and the position
of a particle. Obviously, this is also impossible for all the components of a measurement
set-up or an observer. This implies that initial conditions – both for a system and for
the measurement set-up – cannot be exactly duplicated. A minimum action thus implies
that whenever an experiment on a microscopic system is performed twice, the outcome
will be different. The result would be the same only if both the system and the observer
would be in exactly the same condition in both situations. This turns out to be impossible,
both due to the second principle of thermodynamics and due to the quantum principle.
Therefore, microscopic systems behave randomly. Obviously, there will be some average
outcome; nevertheless, microscopic observations are probabilistic. Albert Einstein found Copyright © Christoph Schiller November 1997–January 2007
this conclusion of quantum theory the most difficult to swallow, as this randomness im-
plies that the behaviour of quantum systems is strikingly different from that of classical
systems. But the conclusion is unavoidable: nature behaves randomly.
Every window proves that nature behaves randomly. Everybody knows that one can
use a train window to look either at the outside landscape or, by concentrating on the re-
flected image, to observe some interesting person inside the carriage. In other words, glass
reflects some of the light particles and lets some others pass through. More precisely, glass
reflects a random selection of light particles, yet with constant average. Partial reflection
is thus similar to the tunnelling effect. Indeed, the partial reflection of glass for photons is
a result of the quantum of action. Again, the average situation can be described by classical
physics, but the precise amount of partial reflection cannot be explained without quantum
theory. Without the quantum of action, train trips would be much more boring.
732 v quanta of light and matter • 18. minimum action
Waves
The quantum of action implies a central result about the path of particles. If a particle
travels from a point to another, there is no way to say which path it has taken in between.
Indeed, in order to distinguish between the two possible, but only slightly different paths,
actions smaller than ħ~2 would have to be measured. In particular, if a particle is sent
through a screen with two sufficiently close slits, it is impossible to say through which slit
the particle passed to the other side. The impossibility is fundamental. Matter is predicted
to show interference.
We know already moving phenomena for which it is not possible to say with precision
which path is taken when two slits are crossed: waves behave in this way. All waves follow
Page 214 the indeterminacy relations
∆ω∆t E ∆k∆x E
1 1
and . (481)
2 2
∆E∆t E ∆p∆x E
ħ ħ
and . (482)
2 2
As a result, we are lead to ascribe a frequency and a wavelength to quantum systems:
E = ħω p = ħk = ħ
2π
and (483)
λ
The energy–frequency relation was deduced by Albert Einstein in 1905; it is found to be
valid in particular for all examples of light emission and, surprisingly, also for all quantum
matter particles. The more spectacular momentum–wavelength relation for matter was
first predicted by Louis de Broglie* in 1923 and 1924. (This is thus another example of a
discovery that was made about twenty years too late.) In short, the quantum of action
implies that matter particles behave like waves.
Quantum information
In computer science, a smallest change is called a ‘bit’. The existence of a smallest change
Copyright © Christoph Schiller November 1997–January 2007
in nature implies that computer language or information science can be used to describe
nature, and in particular quantum theory. However, computer language can describe only
the software side; the hardware side of nature is also required. The hardware of nature
enters the description whenever the actual value ħ of the quantum of action must be
introduced.
* Louis de Broglie (b. 1892 Dieppe, d. 1987 Paris) French physicist and professor at the Sorbonne. The
energy–frequency relation for light had earned Albert Einstein his Nobel prize already in 1921. De Broglie
expanded it to the prediction of the wave nature of the electron (and of all other quantum matter particles);
this was the essential part of his PhD. The prediction was confirmed experimentally a few years later, in 1927.
For the prediction of the wave nature of matter, de Broglie received the Nobel Prize in physics in 1929. Being
an aristocrat, de Broglie never did anything else in research after that. For example, it was Schrödinger who
then wrote down the wave equation, even though de Broglie could equally have done it.
minimum action – quantum theory for poets and l aw yers 733
Exploring the similarities and differences between nature and information science, we
will discover that the quantum of action implies that macroscopic physical systems can-
not be copied or ‘cloned’, as quantum theorists like to say. Nature does not allow to copy
objects. Copying machines do not exist. The quantum of action makes it impossible to
gather and use all information in a way that allows to produce a perfect copy. As a result,
we will deduce that the precise order in which measurements are performed does play a
role in experiments. When the order is important, physicists speak of lack of ‘commuta-
tion’. The quantum of action implies that physical observables do not commute.
We will also find out that the quantum of action implies that systems are not always in-
Page 820 dependent, but can be ‘entangled’. This term, introduced by Erwin Schrödinger, describes
the most absurd consequences of quantum theory. Entanglement makes everything in
nature connected to everything else. Entanglement produces effects which look (but are
not) faster than light. Entanglement produces a (fake) form of non-locality. Entanglement
Ref. 708 also implies that trustworthy communication does not exist.
”
Ref. 709 * ‘Never and nowhere has matter existed, or can it exist without motion.’ Friedrich Engels (1820–1895) was
one of the theoreticians of Marxism, often also called Communism.
734 v quanta of light and matter • 18. minimum action
**
(Adult) plants stop growing in the dark. Plants need light to grow. Without light, the
reactions necessary for growth stop. Can you deduce that this is a quantum effect, not
Challenge 1223 ny explainable by classical physics?
Copyright © Christoph Schiller November 1997–January 2007
**
Most quantum processes in everyday life are electromagnetic. Can you show that the
Challenge 1224 n quantum of action must also hold for nuclear processes?
**
Challenge 1225 n Is the quantum of action independent of the observer, even near the speed of light? This
question was the reason that Planck contacted the young Einstein, inviting him to Berlin,
thus introducing him to the world-wide physics community.
light – the strange consequences of the quantum of action 735
”
What is the faintest lamp?
If all the colours of materials are quantum effects, as just argued, it becomes especially
Ref. 711 interesting to study the properties of light itself. If in nature there is a minimum change,
there should also be a minimum illumination. This had been already predicted in ancient
Greece, for example by Epicurus (341–271 bce ). He stated that light is a stream of little
particles, so that the smallest illumination would be that of a single light particle.
become infinitely small, as that would contradict the quantum of action. To check this
prediction, we only need some black and white photographic film. Everybody knows that
film is blackened by daylight of any colour; at medium light intensities it becomes dark
grey and at lower intensities light grey. Looking at an extremely light grey film under the
microscope, we discover that even under uniform illumination the grey shade actually
is a more or less dense collections of black spots. Exposed film, as shown in Figure 312,
does not show a homogeneous colour; on the contrary, it reacts as if light is made of small
particles.
This is a general observation: whenever sensitive light detectors are constructed with
the aim to ‘see’ as accurately as possible, thus in environments as dark as possible, one
* ‘All beings live of light, every happy creature.’ Friedrich Schiller (b. 1759 Marbach, d. 1805 Weimar), Ger-
man poet, playwright and historian.
736 v quanta of light and matter • 19. light
photographic
glass film
white red
violet
Figure to be inserted
always finds that light manifests itself as a stream of light quanta. Nowadays they are
usually called photons, a term that appeared in 1926. A low or high light intensity is simply
a small or high number of photons.
An obvious low-intensity source of light are single atoms. Atoms are tiny spheres;
when they radiate light or X-rays, the radiation should be emitted as spherical waves.
But in all experiments it is found that each atom emits only one ‘blob’ of light. One never Copyright © Christoph Schiller November 1997–January 2007
finds that the light emitted by atoms forms a spherical wave, as is suggested by every-
day physics. If a radiating atom is surrounded by detectors, there is always only a single
detector that is triggered.
Experiments in dim light thus show that the continuum description of light is not
correct. More precise measurements confirm the role of the quantum of action: every
photon leads to the same amount of change in the film. This amount of change is the
minimal amount of change that light can produce. Indeed, if a minimum action would
not exist, light could be packaged into arbitrary small amounts. In other words, the clas-
sical description of light by a continuous vector potential A(t, x) or electromagnetic field
Page 562 F(t, x), whose evolution is described by a principle of least action, is wrong. Continuous
functions do not describe the observed particle effects. Another, modified description is
required. The modification has to be important only at low light intensities, since at high
light – the strange consequences of the quantum of action 737
E = ħ 2π f = ħ ω . (484)
This shows that for light, the smallest measurable action is given by the quantum of action
ħ. This is twice the smallest action observable in nature; the reasons and implications
will unfold during the rest of our walk. In summary, colour is a property of photons. A
coloured light beam is a hailstorm of the corresponding photons.
The value of Planck’s constant can be determined from measurements of black bodies
Page 603 or other light sources. The result,
Challenge 1227 ny consciously, under ideal conditions, flashes of about half a dozen detected photons.
Let us explore the remaining properties of photons. To start, photons have no measur-
Challenge 1228 ny able (rest) mass or electric charge. Can you confirm this? In fact, experiments can only
give an upper limit for both quantities. The present experimental upper limit for the (rest)
Ref. 713 mass of a photon is 10−52 kg, and for the charge, 5 ë 10−30 of the electron charge.
We know that light can hit objects. Since the energy and the speed of photons is known,
Challenge 1229 ny we guess that the photon momentum obeys
p= =ħ or p = ħ k .
E 2π
(486)
c λ
* This transition from the classical case to the quantum case used to be called quantization. The concept and
the ideas behind it are only of historical interest today.
738 v quanta of light and matter • 19. light
In other words, if light is made of particles, we should be able to play billiard with them.
Ref. 714 This is indeed possible, as Arthur Compton showed in a famous experiment in 1923. He
directed X-rays, which are high energy photons, onto graphite, a material in which elec-
trons move almost freely. He found that whenever the electrons in the material get hit by
the X-ray photons, the deflected X-rays change colour. As expected, the strength of the hit
depends on the deflection angle of the photon. From the colour change and the reflection
angle, Compton confirmed that the photon momentum indeed obeys the above expres-
sion. All other experiments agree that photons have momentum. For example, when an
atom emits light, the atom feels a recoil; the momentum again turns out to be given by the
same value (486). In short, every photon has momentum.
The value of photon momentum respects the indeterminacy principle. In the same way
that it is impossible to measure exactly both the wavelength of a wave and the position
of its crest, it becomes impossible to measure both the momentum and the position of a
Challenge 1230 ny photon. Can you confirm this? In other words, the value of the photon momentum is a
“ corps lumineux.
Blaise Pascal*
In the seventeenth century, Blaise Pascal used this sentence to make fun about certain
”
Copyright © Christoph Schiller November 1997–January 2007
physicists. He ridiculed (rightly so) the blatant use of a circular definition. Of course,
he was right; in his time, the definition was indeed circular, as no meaning could be
given to any of the terms. But as usual, whenever an observation is studied with care
by physicists, they give philosophers a beating. All those originally undefined terms now
have a definite meaning. Light is indeed a type of motion; this motion can rightly be called
luminary because in opposition to motion of material bodies, it has the unique property
v = c; the luminous bodies, today called photons, are characterized and differentiated
* ‘Light is the luminary movement of luminous bodies.’ Blaise Pascal (b. 1623 Clermont, d. 1662 Paris)
important French mathematician and physicist up to the age of twenty-six; he then turned theologian and
philosopher.
light – the strange consequences of the quantum of action 739
from all other particles by their dispersion relation E = c p, their energy E = ħω, their
spin L = ħ, the vanishing value of all other quantum numbers, and by being the quanta
of the electromagnetic field.
In short, light is a stream of photons. It is indeed a luminary movement of luminous
bodies. The existence of photons is the first example of a general property of the world
on small scales: all waves and all flows in nature are made of quantum particles. Large
numbers of (coherent) quantum particles – or quantons – do indeed behave as waves. We
will see shortly that this is the case even for matter. The fundamental constituents of all
waves are quantons. There is no exception. The everyday, continuum description of light
is thus similar in many aspects to the description of water as a continuous fluid; photons
are the atoms of light, and continuity is an approximation valid for large particle numbers.
Small numbers of quantons often behave like classical particles. In the old days of physics,
books used to discuss at length a so-called wave–particle duality. Let us be clear from the
start: quantons, or quantum particles, are neither classical waves nor classical particles. In
appear only later on. At the moment we simply have to accept the situation as it is. We
therefore turn to an easier issue.
“ Max Planck*
”
* ‘Thus they do exist after all.’ Max Planck, in later years, said this after standing silently, for a long time, in
front of an apparatus which counted single photons by producing a click for each photon it detected. It is
not a secret that for a large part of his life, Planck was not a friend of the photon concept, even though his
own results were the starting point for its introduction.
740 v quanta of light and matter • 19. light
Above we saw that in order to count photons, the simplest way is to distribute them across
a large screen and then to absorb them. But this method is not fully satisfying, as it des-
troys the photons. How can one count photons without destroying them?
One way is to reflect photons on a mirror and to measure the recoil of the mirror.
This seems almost unbelievable, but nowadays the effect is becoming measurable even
for small number of photons. For example, this effect has to be taken into account in the
mirrors used in gravitational wave detectors, where the position of laser mirrors has to
be measured to high precision.
Another way of counting photons without destroying them uses special high quality
laser cavities. Using smartly placed atoms inside such a cavity, it is possible to count the
number of photons by the effect they have on these atoms.
In other words, light intensity can indeed be measured without absorption. However,
the next difficulty appears straight away. Measurements show that even the best light
beams, from the most sophisticated lasers, fluctuate in intensity. There are no steady
* A large photon number is assumed in the expression; this is obvious, as ∆φ cannot grow beyond all bounds.
The exact relations are
time
Intensity I(t)
time
Probability P(I)
Sub-
Bose-Einstein Poisson Poisson
2 2 2
1 1 1
coherence time
time
Amplitude-
phase
diagram
source detectors
mirrors
beam beam
splitter splitter
possible
two identical light
photons paths
F I G U R E 314 An interferometer
In summary, unlike little stones, photons are not countable. And this it not the last
difference between photons and stones.
To the shame of physicists, the study of this question was started by two astronomers,
Ref. 715 Robert Hanbury Brown and Richard Twiss, in 1956. They used a simple method to meas-
ure the probability that a second photon in a light beam arrives at a given time after a
first one. They simply split the beam, put one detector in the first branch and varied the
position of a second detector in the other branch.
Hanbury Brown and Twiss found that for coherent light the clicks in the two counters,
and thus the photons, are correlated. This result is in complete contrast with classical
electrodynamics. Photons are indeed necessary to describe light. In more detail, their
experiment showed that whenever a photon hits, the probability that a second one hits
* This conclusion cannot be avoided by saying that photons are split at the beam splitter: if one puts a detector
into each arm, one finds that they never detect a photon at the same time. Photons cannot be divided.
light – the strange consequences of the quantum of action 743
D1
coincidence
counter
D2
threshold
metal plate
frequency of lamp light ω
in vacuum
given frequency always has a minimum intensity, namely one photon. Splitting such a one
photon beam into two beams, e.g. using a half-silvered mirror, does produce two beams.
However, there is no way to find more than one photon in those two beams together.
Another interesting experiment requiring photons is the observation of ‘molecules of
Ref. 721 photons’. In 1995, Jacobson et al. predicted that the de Broglie wavelength of a packet of
photons could be observed. Following quantum theory it is given by the wavelength of
a single photon divided by the number of photons in the packet. The team argued that
the packet wavelength could be observable if one would be able to split and recombine
such packets without destroying the cohesion within the packet. In 1999, this effect was
indeed observed by de Pádua and his brazilian research group. They used a careful set-up
with a nonlinear crystal to create what they call a biphoton, and observed its interference
light – the strange consequences of the quantum of action 745
properties, finding a reduction of the effective wavelength by the predicted factor of two.
In the meantime, packages with three and even four entangled photons have been created
Ref. 722 and observed.
Still another argument for the necessity of photons is the mentioned recoil felt by
atoms emitting light. The recoil measured in these cases is best explained by the emis-
sion of a photon in a particular direction. In contrast, classical electrodynamics predicts
the emission of a spherical wave, with no preferred direction.
Obviously, the observation of non-classical light, also called squeezed light, also argues
for the existence of photons, as squeezed light proves that photons indeed are an intrinsic
Ref. 723 aspect of light, necessary even when no interactions with matter play a role. The same is
true for the Hanbury Brown–Twiss effect.
Finally, the spontaneous decay of excited atomic states also requires the existence of
photons. A continuum description of light does not explain the observation.
In summary, the concept of photon is indeed necessary for a precise description of
If a light wave is made of particles, one must be able to explain each and every wave prop-
erties with the help of photons. The experiments mentioned above already hinted that
this is possible only because photons are quantum particles. Let us take a more detailed
” Copyright © Christoph Schiller November 1997–January 2007
* ‘Fifty years of intense reflection have not brought me nearer to the answer of the question ‘What are light
quanta?’ Of course nowadays every little mind thinks he knows the answer. But he is wrong.’ He said this a
few years before his death.
746 v quanta of light and matter • 19. light
lasers or other
pocket lamps coherent sources
* The model gives a correct description of light with the exception that it neglects polarization.
light – the strange consequences of the quantum of action 747
screen
source image
mirror
behaves as expected: the averages then simply add up, as in the common region in the
left case of Figure 317.
You might want to calculate the distance of the lines when the source distance, the
Challenge 1236 ny colour and the distance to the screen is given.
Obviously, the photon model implies that interference patterns are built up as the sum
of a large number of one-photon hits. Using low intensity beams, we should therefore be
able to see how these little spots slowly build up an interference pattern by accumulat-
ing at the bright spots and never hitting the dark regions. That is indeed the case. All
experiments have confirmed this description.
It is important to stress that interference of two light beams is not the result of two
different photons cancelling out or adding each other up. The cancelling would contra-
dict energy and momentum conservation. Interference is an effect valid for each photon
separately, because each photon is spread out over the whole set-up; each photon takes
Ref. 724 all possible paths and interferes. As Paul Dirac said, each photon interferes only with itself.
Interference only works because photons are quantons, and not at all classical particles. Copyright © Christoph Schiller November 1997–January 2007
Dirac’s widely cited statement leads to a famous paradox: if a photon can interfere only
with itself, how can two laser beams from two different lasers show interference? The
answer of quantum physics is simple but strange: in the region where the beams interfere,
there is no way to say from which source a photon is arriving. Photons are quantons; the
photons in the crossing region cannot be said to come from a specific source. Photons in
the interference region are quantons on their own right, which indeed interfere only with
themselves. In that region, one cannot honestly say that light is a flow of photons. Despite
regular claims of the contrary, Dirac’s statement is correct. That is the strange result of the
quantum of action.
Waves also show diffraction. To understand this phenomenon with photons, let us
start with a simple mirror and study reflection first. Photons (like any quantum particle)
move from source to detector in all ways possible. As the discoverer of this explanation,
748 v quanta of light and matter • 19. light
Richard Feynman,* likes to stress, the term ‘all’ has to be taken literally. This was not a
big deal in the explanation of interference. But in order to understand a mirror we have
to include all possibilities, as crazy as they seem, as shown in Figure 319.
As said above a source emits
rotating arrows. To determine the source point
probability that light arrives at a
certain image location, we have to
add up all the arrows arriving at
the same time at that location. For
arrow sum
each path, the arrow orientation at at point
the image is shown – for conveni- usual vanishes
ence only – below the correspond- mirror
ing segment of the mirror. Depend-
ing on the length of the path, the
the left. The arrow addition now shows that such a specialized mirror, usually called a
grating, allows light to be reflected in unusual directions. And indeed, this behaviour is
* Richard (‘Dick’) Phillips Feynman (b. 1918 New York City, d. 1988), US American physicist. One of the
founders of quantum electrodynamics, he discovered the ‘sum-over-histories’ reformulation of quantum
theory, made important contributions to the theory of the weak interaction and of quantum gravity, and
co-authored a famous physics textbook, the Feynman Lectures on Physics. He is one of those theoretical
physicists who made career mainly by performing complex calculations, a fact he tried to counter at the end
of his life. Though he tried to surpass the genius of Wolfgang Pauli throughout his whole life, he failed in this
endeavour. He was famously arrogant, disrespectful of authorities, as well as deeply dedicated to physics and
to enlarging knowledge in his domain. He also was a well known collector of surprising physical explanations
and an author of several popular texts on his work and his life. He shared the 1965 Nobel Prize in physics
for his work on quantum electrodynamics.
light – the strange consequences of the quantum of action 749
standard for waves: it is called diffraction. In short, the arrow model for photons does al-
low to describe this wave property of light, provided that photons follow the mentioned
crazy probability scheme. Do not get upset; as said before, quantum theory is the theory
of crazy people.
You may want to check that the arrow model, with the approximations it generates
by summing over all possible paths, automatically ensures that the quantum of action is
Challenge 1237 ny indeed the smallest action that can be observed.
All waves have a signal velocity. As a consequence, waves show
refraction when they move from one medium into another with
different signal velocity. Interestingly, the naive particle picture of
photons as little stones would imply that light is faster in materi- air
als with high indices of refraction, the so-called dense materials. water
Challenge 1238 ny Just try it. However, experiments show that light in dense mater-
ials moves slowly. The wave picture has no difficulties explaining
tion for energy and time is respected; every emitted wave will possess a certain spectral
width. Purely monochromatic light does not exist. Similarly, no system which emits a
wave at random can produce a monochromatic wave. All experiments confirm these res-
ults.
Waves can be polarized. So far, we disregarded this property. In the photon picture,
polarization is the result of carefully superposing beams of photons spinning clockwise
and anticlockwise. Indeed, we know that linear polarization can be seen as a result of
superposing circularly polarized light of both signs, using the proper phase. What seemed
a curiosity in classical optics turns out to be the fundamental explanation of quantum
theory.
Photons are indistinguishable. When two photons of the same colour cross, there is no
way to say, after the crossing, which of the two is which. The quantum of action makes
750 v quanta of light and matter • 19. light
wavelength λ obeys
λQ
ħc
, (489)
kT
where k = 1.4 ë 10−23 J~K is Boltzmann’s constant. If the wavelength is much smaller than
the right hand side, the particle description is most appropriate. If the two sides are of
the same order of magnitude, both effects play a role.
flash can differ a bit from the official value, though only a tiny bit. Can you estimate the
Challenge 1241 ny allowed difference in arrival time for a light flash from the dawn of times?
The little arrow explanation gives the same result. If one takes into account the crazy
possibility that photons can move with any speed, one finds that all speeds very different
from c cancel out. The only variation that remains, translated in distances, is the inde-
Challenge 1242 ny terminacy of about one wavelength in the longitudinal direction which we mentioned
already above.
However, the most absurd results of the quantum of action appear when one stud-
ies static electric fields, such as the field around a charged metal sphere. Obviously, such
a field must also be made of photons. How do they move? It turns out that static elec-
tric fields are built of virtual photons. In the case of static electric fields, virtual photons
are longitudinally polarized, do not carry energy away, and cannot be observed as free
particles. Virtual photons are photons who do not appear as free particles; they only have
extremely short-lived appearances before they disappear again. In other words, photons,
Virtual particles do not obey the indeterminacy relation, but its opposite; the opposite
relation expresses their short-lived appearance. Despite their intrinsically short life, and
the impossibility to detect them directly, they have important effects. We will explore
Page 779 virtual particles in detail shortly.
In fact, the vector potential A allows four polarizations, corresponding to the four
coordinates (t, x, y, z). It turns out that for the photons one usually talks about, the free
or real photons, the polarizations in t and z direction cancel out, so that one observes
only the x and y polarizations in actual experiments.
For bound or virtual photons, the situation is different.
– CS – more to be written – CS –
In short, static electric and magnetic fields are continuous flows of virtual photons.
Virtual photons can have mass, can have spin directions not pointing along the motion
path, and can have momentum opposite to their direction of motion. All these proper-
ties are different from real photons. In this way, exchange of virtual photons leads to the Copyright © Christoph Schiller November 1997–January 2007
attraction of bodies of different charge. In fact, virtual photons necessarily appear in any
description of electromagnetic interactions; more about their effects, such as the famous
attraction of neutral bodies, will be discussed later on.
In summary, light can indeed move faster than light, though only in amounts allowed
by the quantum of action. For everyday situations, i.e. for cases with a high value of the
action, all quantum effects average out, including light velocities different from c.
A different topic also belongs into this section. Not only the position, but also the en-
Ref. 727 ergy of a single photon can be undefined. For example, certain materials split one photon
of energy ħω into two photons, whose two energies sum up to the original one. Quantum
mechanics implies that the energy partitioning is known only when the energy of one of
the two photons is measured. Only at that very instant the energy of the second photon is
known. Before that, both photons have undefined energies. The process of energy fixing
752 v quanta of light and matter • 19. light
p1
α1 air
α
water
p2
α2
axis
ma = q(E + v B) . (491)
∆E =
ħ
, (492)
q ∆x T Copyright © Christoph Schiller November 1997–January 2007
where T is the measurement time and ∆x is the position uncertainty. Every value of an
electric field, and similarly that of every magnetic field, is thus affected with an indeterm-
inacy. The physical state of the electromagnetic field behaves like the state of matter in
this aspect. This is the topic we explore now.
n=
cp
. (493)
E
As a result, the ‘law’ of refraction follows.
There is an important issue here. The velocity of a photon v = δE~δp in a light ray is
not the same as the phase velocity u = E~p that enters in the calculation.
**
If en electromagnetic wave has an amplitude A, the photon density d is
A2
d= . (494)
ħω
The existence of a smallest action has numerous effects on the motion of matter. We start ”
with a few experimental results that show that the quantum of action is indeed the smal-
lest measurable action. Copyright © Christoph Schiller November 1997–January 2007
“ Cicero, De oratore.
absolute zero temperature. Experiments show that this is impossible. (Indeed, this im-
possibility, the so-called third ‘law’ of thermodynamics, is equivalent to the existence of a
minimum action.) There is no rest in nature. In other words, the quantum of action proves
the old truth that a glass of wine is always partially empty and partially full.
The absence of microscopic rest, predicted by the quantum of action, is observed in
many experiments. For example, a pencil on its tip cannot stay vertical, even if it is isolated
from all disturbances, such as vibrations, air molecules and thermal motion. This follows
from the indeterminacy relation. In fact, it is even possible to calculate the time after
which a pencil must have fallen over.*
The absence of rest for the electrons inside atoms also prevents them from falling into
the nuclei. The size of atoms, and thus of all matter, is thus a direct consequence of the
absence of microscopic rest.
Cool gas
No infinite precision
The quantum of action prevents the observation of rest in many ways. Whenever the
position of a system is determined to high precision, we need a high energy probe. Indeed,
only a high energy probe has a wavelength small enough to allow a high precision for
position measurements. As a result of this high energy however, the system is disturbed.
Worse, the disturbance itself is also found to be imprecisely measurable. There is thus no Copyright © Christoph Schiller November 1997–January 2007
way to determine the original position even by taking the disturbance itself into account.
In short, perfect rest cannot be observed even in principle.
Indeed, all systems that have ever been observed with high precision confirm that
perfect rest does not exist. Among others, this result has been confirmed for electrons,
Ref. 729 * That is not easy, but neither too difficult. For an initial orientation close to the vertical, the fall time T turns
out to be
1 8
T= T0 ln (495)
2π α
where α is the starting angle, and a fall by π is assumed. Here T0 is the oscillation time of the pencil for small
Challenge 1247 ny angles. (Can you determine it?)
The indeterminacy relation for the tip of the pencil yields a minimum starting angle, because the mo-
Challenge 1248 ny mentum indeterminacy cannot be made as large as wanted. You should be able to provide an upper limit.
Once the angle is known, you can calculate the maximum time.
motion of matter – beyond cl assical physics 755
Current
preliminary
graph
Voltage
Not only is rest made impossible by the quantum of action; the same impossibility ap- ”
plies to any situation that does not change in time. The most important examples of (ap-
parently) stationary situations are flows. The quantum of action implies that no flow can
be stationary. More precisely, a smallest action implies that no flow can be continuous.
All flows are made of smallest entities. Flows are made of quantum particles. Apart from
light, two flows from everyday life ask for direct confirmation: flows of electricity and
flows of liquids.
If electrical current would be a continuous flow, it would be possible to observe action
values as small as desired. The simplest confirmation of the discontinuity of current flow
was discovered only in the 1990s: take two gold wires on a table and lay them across each
other. It is not hard to let a current flow from one wire to the other, via the crossover,
and to measure the voltage. Hit the table and measure the current–voltage characteristic
of the crossing contact. A curve like the one shown in Figure 324 is found: the current
Copyright © Christoph Schiller November 1997–January 2007
given by
λ= and ω = .
2π ħ E
(496)
p ħ
where p and E are the momentum and the energy of the single particles. Soon after the
prediction, experiments started to provide the confirmation of the idea. It is indeed ob-
served that matter streams can diffract, refract and interfere, with the values predicted by
de Broglie. Due to the small value of the wavelength, one needs careful experiments to de-
tect the effects. Nevertheless, one after the other, all experiments which proved the wave
properties of light have been repeated for matter beams. For example, in the same way
that light diffracts when passing around an edge or through a slit, matter has been found
to diffract in these situations. This is true even for electrons, the simplest matter particle,
motion of matter – beyond cl assical physics 757
Ref. 734 as shown in Figure 325. (The experiment with electrons, performed by Claus Jönsson in
Tübingen in 1961, was voted the most beautiful experiment of all of physics in 2002.)
Researchers inspired by light interferometers started to build matter interferomet-
ers; today they work with beams of electrons, nucleons, nuclei, atoms and even large
Ref. 735 molecules. In the same way that the observations of interference of light proves the wave
Page 570 property of light, the interference patterns observed with these instruments proves the
wave properties of matter.
Like light, matter is also made of particles; like light, matter behaves as a wave when
large numbers of particles with the same momentum are involved. Even though beams
of large molecules behave as waves, for everyday objects, such as cars on a highway, one
never makes such observations. There are two main reasons. First, for cars on highways
the involved wavelength is extremely small. Second, the speeds of cars vary too much;
streams of objects with the same speed for all objects – only such streams have a chance
to be coherent – are extremely rare in everyday life.
∆λ∆X =
1
. (497)
2
Similarly, for every wave the frequency ω and the instant T of its peak amplitude cannot
both be sharply defined. Their indeterminacies are related by
∆ω∆T =
1
. (498)
2
Using the wave properties of matter we get
∆p∆X E ∆E∆T E
ħ ħ
and . (499)
2 2
These famous relations are called Heisenberg’s indeterminacy relations. They were dis-
covered by the German physicist Werner Heisenberg in 1925. They state that there is no
way to ascribe a precise momentum and position to a material system, nor a precise en-
Copyright © Christoph Schiller November 1997–January 2007
ergy and age. The more accurately one quantity is known, the less accurately the other is.*
Matter quantons – somewhat like stones, but in contrast to photons – can be localized,
but only approximately.
Both indeterminacy relations have been checked experimentally in great detail. The in-
determinacy limitations are easily experienced in experiments. In fact, every experiment
proving that matter behaves like a wave is a confirmation of the indeterminacy relation,
and vice versa.
Incidentally, Niels Bohr called the relation between two variables linked in this way
complementarity. He then explored systematically all such possible pairs. You can search
* A policeman stops the car being driven by Werner Heisenberg. ‘Do you know how fast you were driving?’
‘No, but I know exactly where I am!’
758 v quanta of light and matter • 20. motion of matter
Challenge 1251 ny for such observable pairs yourself. Bohr was deeply fascinated by the existence of a com-
plementarity principle. Bohr later extended it also to philosophical aspects. In a fam-
ous story, somebody asked him what was the quantity complementary to precision. He
answered: ‘Clarity’.
In short, the quantum of action prevents position and momentum values to be exactly
defined for quantons. The values are fuzzy – position and momentum are undetermined.
2mc 3
aD
Velocity of quantons
Usual, or real, matter quantons move slower than the speed of light. However, the
quantum of action allows exceptions to this limit, the so-called virtual particles, which
Page 779 we will meet later on.
are only of historical interest today. Even Bohr knew that they were not correct, because
the calculations assumed that hydrogen atoms are flat. However, this is not observed;
Challenge 1254 e moreover, it is in contrast with the quantum of action. Since the motion of quantum
constituents is fuzzy, all composed quantons must be clouds of constituents.
* We note that the acceleration limit is different from the acceleration limit due to general relativity:
c4
aD . (501)
4Gm
In particular, the quantum limit (500) applies to microscopic particles, whereas the general relativistic limit
applies to macroscopic systems. Can you confirm that in each domain the respective limit is the smaller of
Challenge 1253 e the two?
motion of matter – beyond cl assical physics 759
F I G U R E 326 Probability clouds: a hydrogen atom in its spherical ground state (left) and in a
Atoms are spherical, molecules have more complex shapes. Experiment and theory
showed that the shape of all atoms and molecules is due to the cloud, or probability dis-
tribution, of their electrons. Quantum theory states that atoms or molecules are not hard
balls, as Democritus or Dalton believed, but that they are clouds. Atomic electron clouds
are not infinitely hard, but to a certain degree can mix and interpenetrate. This mixing
leads to molecules, liquids, solids, flowers and people.
Obviously, the quantum of action also implies that shapes fluctuate. For example, if a
long molecule is held fixed at its two ends, the molecule cannot remain at rest in between.
Such experiments are common today; they again confirm that perfect rest does not exist,
as it would contradict the existence of a minimum action in nature.
“ maestro.
Leonardo da Vinci*
Stones can rotate. The quantum of action also has important consequences for rotational
” Copyright © Christoph Schiller November 1997–January 2007
motion. Action and angular momentum have the same physical dimensions. It only takes
a little thought to show that if matter or radiation has a momentum and wavelength
related by the quantum of action, then angular momentum is fixed in multiples of the
quantum of action.
Ref. 744 The argument is due to Dicke and Wittke. Just imagine a source at the centre of a
circular fence, made of N steel bars spaced by a distance a = 2πR~N, as shown in
Figure 327. In the centre of the fence we imagine a source of matter or radiation that
emits particles towards the fence in any chosen direction. The linear momentum of the
particle is p = ħk = 2πħ~λ. Outside the fence, the direction of the particle is given by the
* ‘Sad is that disciple who does not surpass his master.’ The statement is painted in large letters in the Aula
Magna of the University of Rome.
760 v quanta of light and matter • 20. motion of matter
source
condition of positive interference. In other words, the angle θ is given by a sin θ = Mλ,
where M is an integer. In this process, the fence receives a linear momentum p sin θ, or an
angular momentum L = pR sin θ. Inserting all expressions one finds that the transferred
angular momentum is
L = N Mħ . (502)
In other words, the angular momentum of the fence is an integer multiple of ħ. Of course,
this is only a hint, not a proof. Nevertheless, the argument is correct. The intrinsic angular
momentum of bodies is always a multiple of ħ. Quantum theory thus states that every
object’s angular momentum increases in steps. Angular momentum is quantized.
But rotation has more interesting aspects. Due to the quantum of action, in the same
way that linear momentum is fuzzy, angular momentum is so as well. There is an inde-
Ref. 736 terminacy relation for angular momentum L. The complementary variable is the phase
Ref. 737 angle φ of the rotation. The indeterminacy relation can be expressed in several ways. The
Page 740 simplest, but also the less precise is Copyright © Christoph Schiller November 1997–January 2007
∆L ∆φ E
ħ
. (503)
2
(This is obviously an approximation: the relation is only valid for large angular momenta;
the relation cannot be valid for small values, as ∆φ by definition cannot grow beyond 2π.
In particular, angular momentum eigenstates have ∆L = 0.*)
S1 − 2πP(π)S ,
ħ
∆L ∆φ E (504)
2
º
where P(π) is the normalized probability that the angular position has the value π. For an angular mo-
mentum eigenstate, one has ∆φ = π~ 3 and P(π) = 1~2π. This expression has been tested and confirmed
motion of matter – beyond cl assical physics 761
entities: all matter, including electrons, can be polarized. This was shown most clearly by
the famous Stern–Gerlach experiment.
observation
classical
prediction
silver
z beam
N
∂B
∂z
silver
oven beam
Thus they let a beam of silver atoms, extracted from an oven by evaporation, pass an in-
Ref. 730 homogeneous magnetic field. They found that the beam splits into two separate beams, as
shown in Figure 328. No atoms leave the magnetic field region at intermediate directions,
in contrast to what is expected from classical physics.
The splitting into two beams is an intrinsic property of silver atoms; it is due to the
spin value of the atoms. Silver atoms have spin ħ~2, and depending on their orientation
in space, they are deflected either in direction of the field inhomogeneity or against it.
The splitting of the beam is a pure quantum effect: there are no intermediate options.
Indeed, the Stern–Gerlach experiment is one of the clearest observations that classical
physics does not work well in the microscopic domain. This result is so peculiar that it
was studied in great detail all over the world.
When one of the two beams is selected – say the ‘up’ beam – and passed through a Copyright © Christoph Schiller November 1997–January 2007
second set-up, all atoms end up in the ‘up’ beam. The other exit, for the ‘down’ beam,
remains unused in this case. The up and down beams, in contrast to the original beam,
cannot be split further. This is not surprising.
But if the second set-up is rotated by π~2 with respect to the first, again two beams –
‘right’ and ‘left’ – are formed; it plays no role whether the incoming beam was from the
oven or an ‘up’ beam. A partially rotated set-up yields a partial, uneven split. The number
ratio depends on the angle.
We note directly that if a beam from the oven is split first vertically and then hori-
zontally, the result differs from the opposite order. Splitting processes do not commute.
(Whenever the order of two operations is important, physicists speak of ‘lack of commut-
ation’.) Since all measurements are processes as well, we deduce that measurements in
quantum systems do not commute in general.
motion of matter – beyond cl assical physics 763
Beam splitting is direction dependent. Matter beams behave almost in the same way as
polarized light beams. The inhomogeneous magnetic field acts somewhat like a polarizer.
The up and down beams, taken together, behave like a fully polarized light beam. In fact,
the polarization direction can be rotated (with the help of a homogeneous magnetic field).
Indeed, a rotated beam behaves in a unrotated magnet like an unrotated beam in a rotated
magnet.
The ‘digital’ split forces us to rethink the description of motion. In special relativity,
the existence of a maximum speed forced us to introduce the concept of space-time, and
then to refine the description of motion. In general relativity, the maximum force obliged
us to introduce the concepts of horizon and curvature, and then to refine the description
of motion. At this point, the existence of the quantum of action forces us to take two
similar steps. We will introduce the new concept of Hilbert space, and then we will refine
the description of motion.
might be −1) showing that a general atom has no intrinsic direction. It was also found that
feeding ‘up’ into the measurement apparatus gives ‘up’ states; thus certain special states
(‘eigenstates’) do remain unaffected by measurement. Finally, the experiment shows that
states can be rotated by applied fields; they have an abstract direction.
These details can be formulated in a straightforward way. Since measurements are
operations that take a state as input and produce as output a measurement result and an
output state, we can say:
— States are described by small rotating arrows; in other words they are complex vectors
in an abstract space. The space of all possible states is called a Hilbert space.
— Measurements are operations on the state vectors. Measurements are said to be de-
764 v quanta of light and matter • 20. motion of matter
Tψ = λψ (505)
are called eigenvectors, and the multiplication factor λ is called the associated eigenvalue.
Experiments show that the state of the system after the measurement is given by the ei-
genvector of the measured eigenvalue. In the Stern–Gerlach experiment, the eigenstates
are the ‘up’ and the ‘down’ states.
In summary, the quantum of action obliges us to distinguish between three concepts
that are all mixed up in classical physics: the state of a system, the measurement on a sys-
tem and the measurement result. The quantum of action forces us to change the vocabu-
lary with which we describe nature and obliges to use more differentiated concepts. Now
follows the main step: we describe motion with these concepts. This is the description
that is usually called quantum theory.
Page 181 In classical physics, motion is given by the path that minimizes the action. Motion
takes place in such a way that the action variation δS vanishes when the actual path is
compared to other paths with the same fixed end points. For quantum systems, we need
to redefine the concept of action and to find a description of its variation that is not based
on paths, as the concept of ‘path’ does not exist for quantum systems.
Copyright © Christoph Schiller November 1997–January 2007
In classical physics, the action is the integral of the Lagrangian between the initial and
final points in time. In quantum theory, the action* is defined using the initial and final
state, ψ i and ψ f . In detail, the action S between an initial and a final state is defined as
S = `ψ i S ∫ LdtSψ e ,
f (506)
where L is the Lagrangian (operator). The Lagrangian operator is defined in the same way
as in classical physics, except that the momentum and position variables are replaced by
the corresponding operators.* The angled brackets represent the multiplication of states
and operators as defined in quantum theory.
In classical physics, the principle of least action states that the actual motion happens
along that particular path, out of all paths with the same end points, for which the vari-
ation of the action is zero. In the quantum domain, the variation of the action for the
actual motion must be as near to zero as possible when when compared to similar mo-
tions, while at the same time it must be impossible to observe actions below ħ~2. This
double condition is realized by the so-called quantum action principle
`ψ i Sδ ∫ LdtSψ e = −iħδ`ψ Sψ e .
f i f (507)
This principle describes all quantum motion. Classically, the right hand side is zero – since
ħ can then be neglected – and we then recover the minimum action principle of classical
physics. In quantum theory however, the variation of the action is proportional to the
Sψe = HSψe .
∂
iħ (508)
∂t
Ref. 739 This famous equation is Schrödinger’s equation of motion.** In fact, Erwin Schrödinger
had found his equation in two slightly different ways. In his first paper, he used a vari-
Ref. 740 ational principle slightly different from the one just given. In the second paper he simply Copyright © Christoph Schiller November 1997–January 2007
asked: how does the state evolve? He knew that the state of a quanton behaves like a wave
and like a particle at the same time. A wave is described by a field, which he called ψ(t, x).
If the state ψ behaves like a wave, it must be described by a function (hence he called it
* More precisely, there is also a condition for ordering of operators in mixed products, so that the lack of
commutation between operators is taken into account. We do not explore this issue here.
** Erwin Schrödinger (b. 1887 Vienna, d. 1961 Vienna) was famous for being a physicien bohémien, always
living in a household with two women. In 1925 he discovered the equation which brought him international
fame and the Nobel prize for physics in 1933. He was also the first to show that the radiation discovered by
Victor Hess in Vienna was indeed coming from the cosmos. He left Germany and then again Austria out of
dislike of national socialism, and was for many years professor in Dublin. There he published the famous
and influential book What is life?. In it, he comes close to predicting the then unknown nuclear acid DNA
from theoretical insight alone.
766 v quanta of light and matter • 20. motion of matter
‘wave function’) with amplitude W multiplied by a phase factor e ikx−ωt . The state can thus
be written as
Sψe = ψ(t, x) = W(t, x)e ikx−ωt . (509)
At the same time, the state must also behave like a a particle. In particular, the non-
relativistic particle relation between energy and momentum E = p2 ~2m + V (x) must
remain valid for these waves. Using the two relations for matter wavelength and frequency,
we thus must have
= + V (x)ψ = Hψ .
∂ψ ∆ψ
iħ (510)
∂t 2m
This wave equation for the complex field ψ became instantly famous in 1926 when
Schrödinger used it, by inserting the potential felt by an electron near a proton, to cal-
culate the energy levels of the hydrogen atom. Light is emitted by electrons; a precise
description of electron motion in atoms thus allows to describe the light frequencies they
we will soon discover. But for the other, more well-behaved cases, we find that the wave
function thus moves in the same way as a classical particle does.
As an example, the left corner pictures in these pages show the evolution of a wave
function – actually its modulus SΨS2 – for the case of an exploding two-particle system.
The modulus SΨS2 is the probability distribution. In atoms, the probability distribution
of electrons determines the shapes of the electron clouds. That is why it is said that the
Schrödinger equation contains not only most of physics, but all of chemistry and biology.
Page 857 We come back to these sciences later on.
The Schrödinger equation makes another point: velocity and position of matter are
not independent variables and cannot be chosen at leisure. Indeed, the initial condition
of a system is given by the initial value of the wave function alone. No derivatives have
motion of matter – beyond cl assical physics 767
to or can be specified. In other words, quantum systems are described by a first order
evolution equation, in strong contrast to classical physics.
We note for completeness that in the Schrödinger equation the wave function is in-
deed a vector, despite the apparent differences. The scalar product of two wave func-
tions/vectors is the spatial integral of the product between complex conjugate of the first
function and the second function. In this way, all concepts of vectors, such as unit vectors,
null vectors, basis vectors, etc. can be reproduced also with wave functions.
it is made of 1028 atoms of room temperature, and assuming that a garage wall has a
thickness of 0.1 m and a potential height of 1 keV= 160 aJ for the passage of an atom, one
gets a probability of finding the car outside the garage of
P 10−(10 ) 10−(10
12 (10 28 ) 40
)
. (512)
Challenge 1256 ny This rather small value – just try to write it down to be convinced – is the reason why it
is never taken into account by the police when a car is missing. (Actually, the probability
is considerably smaller; can you name at least one effect that has been forgotten in this
Challenge 1257 ny simple calculation?) Obviously, tunnelling can be important only for small systems, made
768 v quanta of light and matter • 20. motion of matter
»
of a few particles, and for thin barriers, with a thickness of the order of ħ 2 ~2m(V − T) .
Tunnelling of single atoms is observed in solids at high temperature, but is not of import-
ance in daily life. For electrons the effect is larger; the formula gives
º º
w 0.5 nm aJ V −T . (513)
At room temperature, kinetic energies are of the order of 6 zJ; increasing temperature
obviously increases tunnelling. As a result, electrons or other light particles tunnel quite
easily. Indeed, every tv tube uses tunnelling at high temperature to generate the electron
beam producing the picture. The heating is the reason that TV tubes take time to switch
on.
For example, the tunnelling of electrons limits the ability to reduce the size of computer
memories, and thus makes it impossible to produce silicon integrated circuits with one
Challenge 1258 ny terabyte (TB) of random access memory (RAM). Are you able to imagine why? In fact,
“
Spin describes how a particle behaves under rotations. The full details of spin of electrons
Anonymous
”
were deduced from experiments by two Dutch students, George Uhleneck and Samuel
Ref. 747 Goudsmit. They had the guts to publish what also Ralph Kronig had suspected: that elec-
trons rotate around an axis with an angular momentum of ħ~2. In fact, this value is correct
for all elementary matter particles. (In contrast, radiation particles have spin values that
are integer multiples of ħ.) In particular, Uhlenbeck and Goudsmit proposed a g-value
of 2 for the electron in order to explain the optical spectra. The factor was explained by
Ref. 748 Llewellyn Thomas as a relativistic effect a few months afterwards.
Copyright © Christoph Schiller November 1997–January 2007
one needs a state function with two components. Nowadays, Pauli’s equation is mainly of
conceptual interest, because like the one by Schrödinger, it does not comply with special
relativity. However, the idea to double the necessary components was taken up by Dirac
when he introduced the relativistic description of the electron, and the idea is used for all
other particle equations.
one might ask: what is the corresponding Hamilton operator? A simple answer was given,
Ref. 755 only in 1950, by L.L. Foldy and S.A. Wouthuysen. The operator is almost the same one:
1 0 0 0
»
H = β c 4 m 2 + c 2 p2 β=
0 1 0 0
with (515)
0 0 −1 0
0 0 0 −1
The signs of the operator β distinguishes particles and antiparticles; it has two 1s and
two -1s to take care of the two possible spin directions. With this Hamiltonian operator,
a wave function for a particle has vanishing antiparticle components, and vice versa. The
Hamilton operator yields the velocity operator v through the same relation that is valid
in classical physics:
v = x = β»
d p
. (516)
dt
Copyright © Christoph Schiller November 1997–January 2007
c 4 m 2 + c 2 p2
l=xp . (517)
Ref. 749 credit for himself. He was seen by many, including Einstein, as the greatest and sharpest mind of twentieth
century physics. He was also famous for the ‘Pauli effect’, i.e. his ability to trigger disasters in laboratories,
machines and his surroundings by his mere presence. As we will see shortly, one can argue that Pauli got
the Nobel Prize in physics in 1945 (officially ‘for the discovery of the exclusion principle’) for finally settling
the question on the number of angels that can dance on the tip of a pin.
770 v quanta of light and matter • 20. motion of matter
Ref. 757 Both the orbital angular momentum l and the spin σ are separate constants of motion. A
particle (or antiparticle) with positive (or negative) component has a wave function with
only one non-vanishing component; the other three components vanish.
But alas, the representation of relativistic motion given by Foldy and Wouthuysen is
not the most simple for a generalization to particles when electromagnetic interactions
are present. The simple identity between classical and quantum-mechanical description
Page 776 is lost when electromagnetism is included. We solve the problem below.
The quantum of action implies that there are no fractals in nature. Fractals have complex
shapes; but quantum theory requires that all shapes in nature are clouds. Quantum theory
requires all shapes to be fuzzy.
”
**
Can atoms rotate? Can an atom that falls on the floor roll under the table? Can atoms
be put into high speed rotation? The answer is no to all questions, because angular mo-
Ref. 760 mentum is quantized; moreover, atoms are not solid objects, but clouds. The macroscopic
case of an object turning slower and slower until it stops does not exist in the microscopic
world. The quantum of action does not allow it.
**
Do hydrogen atoms exist? Most types of atoms have been imaged with microscopes, pho-
tographed under illumination, levitated one by one, and even moved with needles, one by
one, as the picture on page 229 shows. Researchers have even moved single atoms using
Ref. 763 laser beams to push them. However, not a single of these experiments measured hydro-
gen atoms. Is that a reason to doubt the existence of hydrogen atoms? Taking seriously
Challenge 1259 n this not-so-serious discussion can be a lot of fun. Copyright © Christoph Schiller November 1997–January 2007
**
Light is refracted when entering dense matter. Do matter waves behave similarly? Yes,
they do. In 1995, David Pritchard showed this for sodium waves entering helium and
Ref. 762 xenon gas.
**
Two observables can commute for two different reasons: either they are very similar, such
as the coordinate x and x 2 , or they are very different, such as the coordinate x and the
Challenge 1260 ny momentum p y . Can you give an explanation?
**
colours and other interactions between light and matter 771
Space and time translations commute. Why then do the momentum operator and the
Challenge 1261 ny Hamiltonian not commute in general?
**
Ref. 764 With two mirrors and a few photons, it is possible to capture an atom and keep it floating
between the two mirrors. This feat, one of the several ways to isolate single atoms, is now
Challenge 1262 ny standard practice in laboratories. Can you imagine how it is realized?
**
For a bound system in a non-relativistic state with no angular momentum, one has the
Ref. 765 relation
9ħ 2
`rre `Te E , (518)
8m
where m is the reduced mass and T the kinetic energy of the components, and r the size
**
Most examples of quantum motion given so far where due to electromagnetic effects. Can
Challenge 1265 ny you argue that the quantum of action must also apply to nuclear motion?
**
Often one reads that the universe might have been born from a quantum fluctuation.
Challenge 1266 ny Does this statement make sense? Copyright © Christoph Schiller November 1997–January 2007
Stones have colours. So do quantons. After the separate description of the motion of mat- ”
ter and that of radiation, the next step is the description of their interactions. In other
words, how do charged quantons react to electromagnetic fields and vice versa? Interac-
tions lead to surprising effects, in particular when special relativity is taken into account.
772 v quanta of light and matter • 21. colours and other interactions
* Born as Joseph Fraunhofer (b. 1787 Straubing, d. 1826 München). Bavarian, orphan at 11, he learned lens
polishing at that age; autodidact, he studied optics from books. He entered an optical company at age 19,
ensuring the success of the business, by producing the best available lenses, telescopes, micrometers, optical
gratings and optical systems of his time. He invented the spectroscope and the heliometer. He discovered
and counted 476 lines in the spectrum of the Sun, today named after him. (Up to this day, Fraunhofer lines
are used as measurement standards; the second and the metre are defined with a Fraunhofer line.) Physicists
across the world would buy their equipment from him, visit him and ask for copies of his publications. Even
after his death, his instruments remain unsurpassed for generations. With his telescopes, in 1837 Bessel
was able to measure the first parallax of a star and in 1846 Johann Gottfried Galle discovered Neptune.
Fraunhofer became professor in 1819; he died young, from the consequences of the years spent working
with lead and glass powder.
colours and other interactions between light and matter 773
Understanding the colour lines produced by each element had started to become of
interest already before the discovery of helium; the interest rose even more afterwards,
due to the increasing applications of colours in chemistry, physics, technology, crystallo-
graphy, biology and lasers. It is obvious that classical electrodynamics cannot explain the
sharp lines. Indeed, only quantum theory can explain colours.
= R − 2 .
1 1 1
(519)
λ mn 4 m
= R 2 − 2 ,
1 1 1
(520)
λ mn n m
where n and m A n are positive integers, and the so-called Rydberg constant R has the
value 10.97 µm−1 or 91.16 nm. All colour lines emitted by hydrogen fall into this simple
formula. Thus quantum theory has a clearly defined challenge here: to explain the formula
and the value of R.
Incidentally, the transition λ 21 for hydrogen is called the Lyman-alpha line. Its
wavelength, 121.6 nm, lies in the ultraviolet. It is easily observed with telescopes, since
most of the visible stars consist of excited hydrogen. The Lyman-alpha line is regularly
used to determine the speed of distant stars or galaxies, since the Doppler effect changes
Ref. 750 the wavelength when the speed is large. The record so far is a galaxy found in 2004, with
a Lyman-alpha line shifted to 1337 nm. Can you calculate the speed with which it moves
Challenge 1267 ny away from the Earth?
There are many ways to deduce Balmer’s formula from the minimum action. In
1926, Schrödinger solved his equation of motion for the electrostatic potential V (r) =
e 2 ~4πε 0 r of a point-like proton; this famous calculation however, is long and complex.
Copyright © Christoph Schiller November 1997–January 2007
S= y dq i p i = (n i + )ħ
1 µi
2π ∫ 4
(521)
for every coordinate q i and its conjugate momentum p i . Here, n i can be 0 or any positive
integer and µ i is the so-called Maslov index (an even integer) that in the case of atoms
has the value 2 for the radial and azimuthal coordinates r and θ, and 0 for the rotation
774 v quanta of light and matter • 21. colours and other interactions
¾
r
p θ = L2 −
2
Lz
sin2 θ
pφ = Lz . (522)
Using the idea that a hydrogen atom emits a single photon when its electron changes
from state E n to E m , one gets exactly the formula deduced by Balmer and Rydberg from
Challenge 1270 e observations. The quantum of action thus explains the colours of hydrogen. (This whole
discussion assumes that the electrons in hydrogen atoms are in eigenstates. Can you argue
Challenge 1271 ny why this is the case?)
The lowest energy level, for n = 1, is called the ground state. The energy value 2.19 aJ
is the ionization energy of hydrogen; if if that energy is added to the ground state, the
electron is not bound to the nucleus any more. The ionization energy for electron in atoms
thus plays the same role for electron around atoms as the escape velocity for satellites
around planets.
The calculation also yields the effective radius of the electron orbit in hydrogen. It is
given by
ħ 2 4πε 0
rn = n2 = = n 2 a 0 n 2 52.918 937 pm .
ħ
(526)
me e2 m e cα
In contrast to classical physics, quantum theory allows only selected orbits around the
Page 775, page 782 nucleus. (About the fine structure constant α, see below.) The smallest value 53 pm for
n = 1 is called the Bohr radius and is abbreviated a 0 . To be more precise, these radii are
Copyright © Christoph Schiller November 1997–January 2007
the average sizes of the electron clouds surrounding the nucleus. Quantum theory thus
implies that a hydrogen atom excited to the level n = 500 is about 12 µm in size, larger
* The calculation is straightforward. After insertion of V (r) = e~4πε 0 r into equation (522) one needs to
Challenge 1269 ny perform the (tricky) integration. Using the general result
dz º 2 º
Az + 2Bz − C = − C + º
1 B
2π y∫ (523)
z −A
¾
one gets
e2
(n r + )ħ + L = nħ =
1 m
. (524)
2 4πε 0 −2E
This leads to the energy formula (525).
colours and other interactions between light and matter 775
Figure to be included
F I G U R E 331 The non-relativistic energy levels and the visible spectrum of hydrogen (NASA)
than many bacteria! This feat has indeed been achieved, even though such blown-up
Ref. 753 atoms, usually called Rydberg atoms, are extremely sensitive to perturbations.
The orbital frequency of electrons in hydrogen is
1 e4 me 1 m e c 2 α 2 6.7 PHz
fn = = (527)
n 3 4ε 20 h 3 n 3 h n3
1 e2
vn = =
αc 2.2 Mm~s 0.007 c
(528)
n 4πε 0 ħ n n n Copyright © Christoph Schiller November 1997–January 2007
As expected, the further electrons orbit the nucleus, the slower they move. This result can
also be checked by experiment. Exchanging the electron by a muon allows to measure the
Ref. 752 time dilation of its lifetime. The measurements coincide with the formula.
We note that the speeds of the electron in hydrogen are slightly relativistic. However,
this calculation did not take into account relativistic effects. However, high precision
measurements show slight differences between the calculated energy levels and the meas-
ured ones.
Relativistic hydrogen
Also in the relativistic case, the EBK action has to be a multiple of ħ~2. From the relativistic
Ref. 751 expression of energy
776 v quanta of light and matter • 21. colours and other interactions
Figure to be included
» e2
E + mc 2 = p2 c 2 + m 2 c 4 − (529)
4πε 0 r
mc 2
E n l + mc 2 = ½ 2
. (531)
1+ ¼α
(n−l − 21 + (l + 21 )2 −α 2 )2
This result, first found by Arnold Sommerfeld, is correct for point-like electrons. In reality,
the electron has spin 1/2; the correct relativistic energy levels thus appear when we set
l = j 1~2 in the above formula. In this way, the formula explains the so-called fine
structre of the hydrogen spectra. The result can be approximated by
α2 n
En j = − (1 ( − ) + ...)
R 3
+ (532)
n2 n 2 j + 21 4
It reproduces the hydrogen spectrum to an extremely high accuracy. Only the introduc- Copyright © Christoph Schiller November 1997–January 2007
tion of virtual particle effects yields an even better result. We will present this issue later
on.
H Dirac = βm + α ë p (533)
Its position operator x is not the position of a particle, but has ad-
ditional terms; its velocity operator has only the eigenvalues plus Paul Dirac
or minus the velocity of light; the velocity operator is not simply
related to the momentum operator; the equation of motion contains the famous ‘Zitterbe-
wegung’ term; orbital angular momentum and spin are not separate constants of motion.
So why use this horrible Hamiltonian? It is the only Hamiltonian that can be easily
p p − qA . (534)
that treats electromagnetic momentum like particle momentum. With this prescription,
Dirac’s Hamiltonian describes the motion of charged particles interacting with an electro-
magnetic field A. This substitution is not possible in the Foldy–Wouthuysen Hamiltonian.
In the Dirac representation, particles are pure, point-like, structureless electric charges;
in the Foldy–Wouthuysen representation they acquire a charge radius and a magnetic
Ref. 756 moment interaction. (We come back to the reasons below, in the section on QED.)
In more detail, the simplest description of an electron (or any other elementary, stable,
* Paul Adrien Maurice Dirac (b. 1902 Bristol, d. 1984 Tallahassee), British physicist, born as son of a French-
speaking Swiss immigrant. He studied electrotechnics in Bristol, then went to Cambridge, where he later
became professor on the chair Newton had held before. In the years from 1925 to 1933 he published a stream
of papers, of which several were worth a Nobel Prize, which he received in 1933. He unified special relativ-
ity and quantum theory, he predicted antimatter, he worked on spin and statistics, he predicted magnetic
monopoles, he speculated on the law of large numbers etc. His introversion, friendliness and shyness, his
deep insights into nature, combined with a dedication to beauty in theoretical physics, made him a legend
all over the world already during his lifetime. For the latter half of his life he tried, unsuccessfully, to find an
alternative to quantum electrodynamics, of which he was the founder, as he was repelled by the problems of
infinities. He died in Florida, where he lived and worked after his retirement from Cambridge.
778 v quanta of light and matter • 21. colours and other interactions
= [H, ρ]
dρ
dt
H Dirac = βmc 2 + α ë (p − qA(x, t))c + qφ(x, t) with
0 0 0 1 0 0 0 −i 0 0 1 0
0 0 i 0 0 0 0 −1
α1 = α2 = α3 =
0 0 1 0
0 1 0 0 0 −i 0 0 1 0 0 0
1 0 0 0 i 0 0 0 0 −1 0 0
1 0 0 0
β=
0 1 0 0
0 0 −1 0
0 0 0 −1
H Maxwell =
The first Hamiltonian describes how charged particles are moved by electromagnetic
fields, and the second describes how fields are moved by charged particles. Together, they
form what is usually called quantum electrodynamics or QED for short.
As far as is known today, the relativistic description of the motion of charged mat-
ter and electromagnetic fields given by equation (535) is perfect: no differences between
theory and experiment have ever been found, despite intensive searches and despite a
high reward for anybody who would find one. All known predictions completely corres-
pond with the measurement results. In the most spectacular cases, the correspondence
between theory and measurement is more than fourteen digits. But the precision of QED
is less interesting than those of its features that are missing in classical electrodynamics.
Let’s have a quick tour.
Antimatter
Antimatter is now a household term. Interestingly, the concept was formed before any ex-
perimental evidence for it was known. Indeed, the antimatter companion of the electron
Copyright © Christoph Schiller November 1997–January 2007
was predicted in 1926 by Paul Dirac from his equation. Without knowing this prediction,
Carl Anderson discovered it in 1932 and called it positron, even though ‘positon’, without
the ‘r’, would have been the correct name. (Indeed, the correct name is used in the French
language.) Anderson was studying cosmic rays and noticed that some ‘electrons’ were
turning the wrong way in the magnetic field he had applied to his apparatus. He checked
everything in his machine and finally deduced that he found a particle with the same
mass as the electron, but with positive electric charge.
The existence of positrons has many strange implications. Already in 1928, before their
discovery, the swedish theorist Oskar Klein had pointed out that Dirac’s equation for elec-
trons makes a strange prediction: when an electron hits a sufficiently steep potential wall,
the reflection coefficient is larger than unity. Such a wall will reflect more than what is
colours and other interactions between light and matter 779
thrown at it. In 1935, after the discovery of the positron, Werner Heisenberg and Hans
Ref. 758 Euler explained the paradox. They found that the Dirac equation predicts that if an elec-
tric field exceeds the critical value of
m e c 2 m e2 c 3
Ec = = = 1.3 EV~m , (536)
eλ e eħ
the vacuum will spontaneously generate electron–positron pairs, which then are separ-
ated by the field. As a result, the original field is reduced. This so-called vacuum polariza-
tion is also the reason for the reflection coefficient greater than unity found by Klein, as
steep potentials correspond to high electric fields.
Truly gigantic examples of vacuum polarization, namely around charged black holes,
Page 897 will be described later on.
We note that such effects show that the number of particles is not a constant in the
microscopic domain, in contrast to everyday life. Only the difference between particle
Virtual particles
Contrary to what was said so far, there is a case where actions smaller than the minimal
one do play a role. We already encountered an example: in the collision between two
Page 750 electrons, there is an exchange of virtual photons. We know that the exchanged virtual
photon cannot be observed. Indeed, the action value S for this exchange obeys
SD
ħ
. (537)
2
In short, virtual particles are those particles that appear only as mediators in interactions;
they cannot be observed. Virtual particles are intrinsically short-lived; they are the oppos-
ite of free or real particles. In a certain sense, virtual particles are particles bound both in Copyright © Christoph Schiller November 1997–January 2007
– CS – more to come – CS –
In summary, all virtual matter and radiation particle-antiparticles pairs together form
what we call the vacuum; in addition, virtual radiation particles form static fields. Virtual
particles are needed for a full description of all interactions. In particular, virtual particles
are responsible for every decay process.
Page 836, page 876 We will describe a few more successes of quantum theory shortly. Before we do that,
we settle an important question.
780 v quanta of light and matter • 21. colours and other interactions
Compositeness
When is an object composite, and thus not elementary? Quantum theory gives several
pragmatic answers. The first one is somewhat strange: an object is composite when its
gyromagnetic ratio is different than the one predicted by QED. The gyromagnetic ratio γ
is defined as the ratio between the magnetic moment M and the angular momentum L.
In other terms,
M = γL . (538)
The gyromagnetic ratio γ is measured in s−1 T−1 =C~kg and determines the energy levels
of magnetic spinning particles in magnetic fields; it will reappear later in the context of
Page 919 magnetic resonance imaging. All candidates for elementary particles have spin 1~2. The
gyromagnetic ratio for spin 1~2 particles of mass m can be written as
γ= =д
M e
The criterion of elementarity thus can be reduced to a criterion on the value of the di-
mensionless number д, the so-called д-factor. (The expression eħ~2m is often called the
magneton of the particle. The dimensionless factor g/2 is often called the gyromagnetic
ratio as well; unfortunately, this often leads to confusion.) If the д-factor differs from the
value predicted by QED for point particles, about 2.0, the object is composite. For example,
Challenge 1274 ny a 4 He+ helium ion has a spin 1~2 and a д value of 14.7 ë 103 . Indeed, the radius of the he-
lium ion is 3 ë 10−11 m, obviously a finite value, and the ion is a composite entity. For the
proton, one measures a д-factor of about 5.6. Indeed, experiments yield a finite proton
radius of about 0.9 fm and show that it contains several constituents.
Also the neutron, which has a magnetic moment despite being neutral, must therefore
be composite. Indeed, its radius is approximately that of the proton. Similarly, molecules,
mountains, stars and people must be composite. Following this first criterion, the only
elementary particles are leptons – i.e. electrons, muons, tauons and neutrinos –, quarks,
and intermediate bosons – i.e. photons, W-bosons, Z-bosons and gluons. More details on
Page 919 these particles will be uncovered in the chapter on the nucleus.
Another simple criterion for compositeness has just been mentioned: any object with a
measurable size is composite. This criterion produces the same list of elementary particles
as the first. Indeed, this criterion is related to the previous one. The simplest models for
Copyright © Christoph Schiller November 1997–January 2007
д−2=
R
(540)
λC
where R is the radius and λ C = h~mc the Compton wavelength of the system. The expres-
sion is surprisingly precise for helium 4 ions, helium 3, tritium ions and protons, as you
Challenge 1275 e might want to check. The tables in Appendix C make the same point.
A third criterion for compositeness is more general: any object larger than its Compton
length is composite. The argument is simple. An object is composite if one can detect
internal motion, i.e. motion of some components. Now the action of any part with mass
colours and other interactions between light and matter 781
where m is the mass of the composite object. On the other hand, following the principle
of quantum theory, this action, to be observable, must be larger than ħ~2. Inserting this
condition, we find that for any composite object*
rA
ħ
. (542)
2π m c
The right hand side differs only by a factor 4π 2 from the so-called Compton (wave)length
λ=
h
. (543)
**
Light is diffracted by material gratings. Can matter be diffracted by light gratings? Surpris-
ingly, it actually can, as predicted by Dirac and Kapitza in 1937. This was accomplished for
Ref. 761 teh first time in 1986, using atoms. For free electrons the feat is more difficult; the clearest
confirmation came in 2001, when new laser technology was used to perform a beautiful
Copyright © Christoph Schiller November 1997–January 2007
measurement of the typical diffraction maxima for electrons diffracted by a light grating.
**
Light is totally reflected when it is directed to a dense material under an angle so large that
it cannot enter it any more. Interestingly, in the case that the dense material is excited, the
Ref. 752 totally reflected beam can be amplified. This has been shown by several Russian physicists.
**
Where is the sea bluest? Sea water is blue because it absorbs red and green light. Sea water
Challenge 1276 ny * Can you find the missing factor of 2? And is the assumption valid that the components must always be
lighter than the composite?
782 v quanta of light and matter • 21. colours and other interactions
can also be of bright colour if the sea floor reflects light. Often, sea water is often also green,
because it can contains small particles that scatter or absorb blue light. Most frequently,
this is soil or plankton. (Satellites determine plankton content from the ‘greenness’ of the
sea.) The sea is thus especially blue if it is deep, quiet and cold; in that case, the ground is
distant, soil is not mixed into the water, and the plankton content is low. There is a place
where the sea is deep, quiet and cold for most parts of the year: the Sargasso sea. It is often
called the bluest spot of the oceans.
This number first appeared in explanations for the fine structure of certain atomic colour
spectra, hence its strange name. Sommerfeld was the first to understand its general im-
portance. The number is central to quantum electrodynamics for several reasons. First
of all, it describes the strength of electromagnetism. Since all charges are multiples of the
electron charge, a higher value would mean a stronger attraction or repulsion between
charged bodies. The value of α thus determines the size of atoms, the size of all things, as
well as all colours in nature.
Secondly, only because this number is quite a bit smaller than unity are we able to
talk about particles at all. The argument is somewhat involved; it will be detailed later
on. In any case, only the small value of the fine structure constant makes it possible
to distinguish particles from each other. If the number were near or larger than one,
particles would interact so strongly that it would not be possible to observe or to talk
about particles at all.
This leads to the third reason for the importance of the fine structure constant. Since it
is a dimensionless number, it implies some yet unknown mechanism that fixes its value.
Uncovering this mechanism is one of the challenges remaining in our adventure. As long
Copyright © Christoph Schiller November 1997–January 2007
as the mechanism remains unknown – as is the case in 2007– we do not understand the
colour and size of a single thing around us.
Small changes in the strength of electromagnetic attraction between electrons and pro-
tons would have numerous important consequences. Can you describe what would hap-
pen to the size of people, to the colour of objects, to the colour of the Sun or to the work-
ings of computers if the strength would double? And if it would drop to half the usual
Challenge 1278 ny value over time?
Explaining the value of α is the most famous and the toughest challenge of modern
physics since the issue appeared in the 1920s. That is the reason for Pauli’s request to the
devil. In 1946, during his Nobel Prize lecture, he repeated the statement that a theory that
Ref. 767 does not determine this number cannot be complete. The challenge is so tough that since
colours and other interactions between light and matter 783
that time there seem to be only two classes of physicists: those who did not even dare to
take on the challenge, and those who had no clue. This fascinating story still awaits us.
The topic of the fine structure constant is so deep that it leads many astray. For example,
it is sometimes heard that in physics it is impossible to change physical units in such a
way that ħ, c and e are equal to 1 at the same time; these voices suggest that doing so
would not allow that the number 1~137.036... would keep its value. Can you show that
Challenge 1279 n the argument is wrong, and that doing so does not affect the fine structure constant?
To continue with the highest efficiency on our path across quantum theory, we first
look at two important topics: the issue of indistinguishability and the issue of interpreta-
tion of its probabilities.
Biblio graphy
702 Giuseppe Fumagalli, Chi l’ha detto?, Hoepli, Milano, 1983. Cited on page 720.
703 The quantum of action is introduced in Max Pl anck, Über irreversible Strahlungsvor-
gänge, Sitzungsberichte der Preußischen Akademie der Wissenschaften, Berlin pp. 440–480,
1899. In the paper, Planck used the letter b for what nowadays is called h. Cited on page 721.
704 Bohr explained the indivisibilty of the quantum of action in his famous Como lecture. N.
B ohr, Atomtheorie und Naturbeschreibung, Springer, Berlin, 1931. On page 16 he writes:
“No more is it likely that the fundamental concepts of the classical theories will ever become
superfluous for the description of physical experience. The recognition of the indivisibility
of the quantum of action, and the determination of its magnitude, not only depend on an
analysis of measurements based on classical concepts, but it continues to be the application
of these concepts alone that makes it possible to relate the symbolism of the quantum theory
to the data of experience.” He also writes: “...the fundamental postulate of the indivisibility
Other textbooks regularly appear around the world. Cited on page 721.
705 Max B orn, Zur Quantenmechanik der Stoßvorgänge (vorläufige Mitteilung), Zeitschrift
für Physik 37, pp. 863–867, 1926, Max B orn, Quantenmechanik der Stoßvorgänge, Zeits-
chrift für Physik 38, pp. 803–827, 1926. Cited on page 725.
706 See for example the papers by Jan Hilgevoord, The uncertainty principle for energy
and time, American Journal of Physics 64, pp. 1451–1456, 1996, and by Paul Busch, On the
time–energy uncertainty reaction, part 1 & 2, Foundations of Physics 20, pp. 1–43, 1990. A
classic is the paper by Eugene P. Wigner, On the time–energy uncertainty reaction, in
Abdus Sal am & Eugene P. Wigner, editors, Aspects of Quantum Theory, Cambridge
University Press, 1972. Cited on page 726.
707 See also the booklet by Cl aus Mattheck, Warum alles kaputt geht - Form und Versagen
in Natur und Technik, Forschungszentrum Karlsruhe, 2003. Cited on page 729.
708 R. Clifton, J. Bub & H. Halvorson, Characterizing quantum theory in terms of
information-theoretic constraints, http://arxiv.org/abs/quant-ph/0211089. Cited on page
method with radiowaves: R. Hanbury Brown & R.Q. Twiss, Nature 177, p. 27, 1956,
Cited on page 742.
716 Science 8 September 1995 Cited on page 743.
717 L. Mandel, Configuration-space photon number operators in quantum optics, Physical
Review 144, pp. 1071–1077, 1966. No citations.
718 A. Einstein, Über einen die Erzeugung und Umwandlung des Lichtes betreffenden heur-
istischen Standpunkt, Annalen der Physik 17, pp. 132–184, 1905. Cited on page 743.
719 See the summary by P.W. Milonni, Answer to question 45: What (if anything) does the
photoelectric effect teach us?, American Journal of Physics 65, pp. 11–12, 1997. Cited on page
744.
720 For a detailed account, See Jeffrey J. Prentis, Poincaré’s proof of the quantum discon-
tinuity of nature, American Journal of Physics 63, pp. 339–350, 1995. The original papers are
786 v quanta of light and matter
Henri Poincaré, Sur la théorie des quanta, Comptes Rendus de l’Académie des Sciences
(Paris) 153, pp. 1103–1108, 1911, as well as Henri Poincaré, Sur la théorie des quanta,
Journal de Physique (Paris) 2, pp. 5–34, 1912. Cited on page 744.
721 J. Jacobson, G. Björk, I. Chang & Y. Yamamoto, Photonic de Broglie waves, Phys-
ical Review Letters 74, pp. 4835–4838, 1995. The first measurement was published by E.J.S.
Fonseca, C.H. Monken & S. de Pádua, Measurement of the de Broglie wavelength of
a multiphoton wave packet, Physical Review Letters 82, pp. 2868–2671, 1995. Cited on page
744.
722 For the three photon state, see M.W. Mitchell, J.S. Lundeen & A.M. Steinberg,
Super-resolving phase measurements with a multiphoton entangled state, Nature 429,
pp. 161–164, 2004, and for the four-photon state see, in the same edition, P. Walther,
J.-W. Pan, M. Aspelmeyer, R. Ursin, S. Gasparoni & A. Zeilinger, De Broglie
wavelength of a non-local four-photon state, Nature 429, p. 158-161, 2004. Cited on page
745.
723 For an introduction to spueezed light, see L. Mandel, Non-classical states of the electro-
502, On the constitution of atoms and molecules: Part III, ibid., pp. 857–875. Cited on page
758.
733 L. de Broglie, Comptes rendus de l’Académie des Sciences 177, pp. 507–510, 1923. Cited
on page 756.
734 C. Jönsson, Interferenz von Elektronen am Doppelspalt, Zeitschrift für Physik 161,
pp. 454–474, 1961, C. Jönsson, Electron diffraction at multiple slits, American Journal of
Physics 42, pp. 4–11, 1974. Due to the charge of electons, this experiment is not easy to per-
form; any parts of the set-up that are insulating get charged and distort the picture. That
is the reason that the experient was performed much later with electrons than with atoms,
neutrons and molecules. Cited on page 757.
735 M. Arndt, O. Nairz, J. Vos–Andreae, C. Keller, G. van der Z ouw & A.
Zeilinger, Wave–particle duality of C60 molecules, Nature 401, pp. 680–682, 14 Octo-
ber 1999. See also the observation for tetraphenyleprophyrin and C 60 F48 by the same team,
published as L. Hackermüller & al., Wave nature of biomolecules and fluorofullerenes,
Physical Review Letters 91, p. 090408, 2003.
201, 1995, C.C. Bradley, C.A. Sackett, J.J. Tollett & R.G. Hulet, Evidence of
bose-einstein condensation in an atomic gas with attractive interactions, Physical Review
Letters 75, pp. 1687–1690, 1995, K.B. Davis, M.-O. Mewes, M.R. Andrews, N.J. van
Dru ten, D.S. Durfee, D.M. Kurn & W. Ketterle, Bose-Einstein Condensation in
a Gas of Sodium Atoms, Physical Review Letters 75, pp. 3969–3973, 1995. For a simple intro-
duction, see W. Ketterle, Experimental studies of Bose–Einstein condensation, Physics
Today pp. 30–35, December 1999. Cited on page 754.
742 W.M. Itano, D.J. Heinzen, J.J. B ollinger & D.J. Winel and, Quantum Zeno effect,
Physical Review A 41, pp. 2295–2300, 1990. M.C. Fischer, B. Gu tiérrez-Medina &
M.G. R aizen, Observation of the Quantum Zeno and Anti-Zeno effects in an unstable sys-
tem, Physical Review Letters 87, p. 040402, 2001, also http://www-arxiv.org/abs/quant-ph/
0104035. Cited on page 854.
788 v quanta of light and matter
743 See P. Facchi, Z. Hradil, G. Krenn, S. Pascazio & J. Řeháček Quantum Zeno
tomography, Physical Review A, 66, p. 012110, 2002. Cited on page 854.
744 Robert H. Dicke & James P. Wittke, Introduction to quantum theory, Addison–
Wesley, Reading, Massachusetts, 1960. See also Stephen Gasiorowicz, Quantum Phys-
ics, Jojn Wiley & Sons, 1974. Cited on page 759.
745 G. Papini, Shadows of a maximal acceleration, http://arxiv.org/abs/gr-qc/0211011. Cited on
page 758.
746 Y. Aharonov & D. B ohm, Significance of electromagnetic potentials in the quantum
theory, Physical Review 115, pp. 485–491, 1959. No citations.
747 G.E. Uhlenbeck & S. Gousmit, Ersetzung der Hypothese vom unmechanischen
Zwang durch eine Forderung bezüglich des inneren Verhaltens jedes einzelnen Elektrons,
Naturwissenschaften 13, pp. 953–954, 1925. Cited on page 768.
748 L. Thomas, The motion of the spinning electron, Nature 117, p. 514, 1926. Cited on page
768.
1951. L.L. Foldy, Electron–neutron interaction, Review of Modern Physics 30, pp. 471–481,
1952. Cited on page 777.
757 John P. Costell a & Bruce H.J. Mc Kell ar, The Foldy–Wouthuysen transforma-
tion, American Journal of Physics 63, pp. 1119–1121, 1995. Cited on page 770.
758 H. Euler & B. Kockel, Naturwissenschaften 23, p. 246, 1935, H. Euler, Annalen der
Physik 26, p. 398, 1936, W. Heisenberg & H. Euler, Folgerung aus der Diracschen The-
orie des Electrons, Zeitschrift für Physik 98, pp. 714–722, 1936. Cited on page 779.
759 The g-factors for composite nuclei are explained by ... See also Hans Dehmelt, Is the
electron a composite particle?, Hyperfine Interactions 81, pp. 1–3, 1993. Cited on page 780.
760 J.P. Woerdman, G. Nienhuis & I. Kuščer, Is it possible to rotate an atom?, Optics
Communications 93, pp. 135–144, 1992. We are talking about atoms rotating around their
centre of mass; atoms can of course rotate around other bodies, as discussed by M.P. Sil-
bibliography 789
W
hy are we able to distinguish twins from each other? Why can we distinguish
hat looks alike, such as a copy from an original? Most of us are convinced that
henever we compare an original with a copy, we can find a difference. This con-
viction turns out to be correct, even though it is a quantum effect that is in contrast with
How many surgical gloves (for the right hand) are necessary if m doctors
need to operate w patients in a hygienical way, so that nobody gets in contact
with the body fluids of anybody else?
The same problem also appears in other settings. For example, it also applies to condoms,
men and women – and is then (officially) called the condom problem – or to computers,
Copyright © Christoph Schiller November 1997–January 2007
Ref. 768 interfaces and computer viruses. In fact, ‘condom problem’ is the term used in most books
that discuss it. Obviously, the optimal number of gloves is not the product wm. In fact,
the problem has three subcases.
Challenge 1281 e — The simple case m = w = 2 already provides the most important ideas needed. Are
you able to find the optimal solution and procedure?
— In the case w = 1 and m odd or the case m = 1 and w odd, the solution is (m + 1)~2
Challenge 1282 e gloves. This is the optimal solution, as you can easily check yourself.
Ref. 769 — A solution with a simple procedure for all other cases is given by 2w~3 + m~2 gloves,
where x means the smallest integer greater than or equal to x. For example, for two
doctor and three patients this gives only three gloves. (However, this formula does not
Challenge 1283 e always give the optimal solution; better values exist in certain subcases.)
792 vi permu tation of particles • 22. are particles like gloves?
Two basic properties of gloves determine the solution to the puzzle. First, gloves have
two sides, an interior and an exterior one. Secondly, gloves can be distinguished from
each other. Do these two properties also apply to particles? We will discuss the issue of
Page 1073 double-sidedness in the third part of the mountain ascent. In fact, the question whether
particles can be turned inside out will be of great importance for their description and
their motion. In the present chapter we concentrate on the second issue, namely whether
objects and particles can always be distinguished. We will find that elementary particles
do not behave like gloves in these but in an even more surprising manner. (In fact, they
do behave like gloves in the sense that one can distinguish right-handed from left-handed
ones.)
In everyday life, distinction of objects can be achieved in two ways. We are able to
distinguish objects – or people – from each other because they differ in their intrinsic
properties, such as their mass, colour, size or shape. In addition, we are also able to distin-
guish objects if they have the same intrinsic properties. Any game of billiard suggests that
= ln 3 + +
S V 3 ln α
(545)
kN Λ 2 N
where»k is the Boltzmann constant, T the temperature, ln the natural logarithm, and
Λ = 2πħ 2 ~mkT is the thermal wavelength (approximately the de Broglie wavelength
of the particles making up the gas). In this formula, the pure number α is equal to 1 if the Copyright © Christoph Schiller November 1997–January 2007
particles are distinguishable, and equal to 1~N! if they are not. Measuring the entropy thus
allows us to determine α and therefore whether particles are distinguishable. It turns out
Challenge 1284 e that only the second case describes nature. This can be checked with a simple test: only in
the second case does the entropy of two volumes of identical gas add up.* The result, often
Challenge 1285 d * Indeed, the entropy values observed by experiment, for a monoatomic gas, are given by the so-called
Sackur–Tetrode formula
= ln +
S V 5
(546)
kN NΛ 3 2
which follows when α = 1~N! is inserted above. It was deduced independently by the German physicist Otto
Sackur (1880–1914) and the Dutch physicist Hugo Tetrode (1895–1931). Note that the essential parameter
is the ratio between V ~N, the volume per particle, and Λ 3 , the de Broglie volume of a particle.
are particles like gloves? 793
called Gibbs’ paradox,* thus proves that the microscopic components of matter are indis-
Ref. 770 tinguishable: in a system of microscopic particles, there is no way to say which particle is
which. Indistinguishability is an experimental property of nature.**
The properties of matter would be completely different without indistinguishability.
* Josiah Willard Gibbs (1839–1903), US-American physicist who was, with Maxwell and Planck, one of the
three founders of statistical mechanics and thermodynamics; he introduced the concepts of ensemble and
of phase.
** When radioactivity was discovered, people thought that it contradicted the indistinguishability of atoms,
as decay seems to single out certain atoms compared to others. But quantum theory then showed that this
is not the case and that atoms do remain indistinguishable.
794 vi permu tation of particles • 22. are particles like gloves?
change symmetry, is the property of nature that observations are unchanged under ex-
change of identical particles. Together with space-time symmetry, gauge symmetry and
the not yet encountered renormalization symmetry, permutation symmetry forms one
of the four pillars of quantum theory. Permutation symmetry is a property of composed
systems, i.e. of systems made of many (identical) subsystems. Only for such systems does
indistinguishability play a role.
In other words, ‘indistinguishable’ is not the same as ‘identical’. Two particles are not
the same; they are more like copies of each other. On the other hand, everyday life ex-
* In everyday life, the weight or mass is commonly used as observable. However, it cannot be used in the
quantum domain, except for simple cases. Can you give at least two reasons, one from special relativity and
Challenge 1286 n one from general relativity?
are particles like gloves? 795
perience shows us that two copies can always be distinguished under close inspection,
so that the term is not fully appropriate either. In the microscopic domain, particles are
countable and completely indistinguishable.* Particles are perfect copies of each other.
We will discover shortly that permutation is partial rotation. Permutation symmetry
Challenge 1287 e thus is a symmetry under partial rotations. Can you find out why?
for some unknown angle α. Applying this expression twice, by exchanging the same
couple of indices again, allows us to conclude that e2i α = 1. This implies that
* The word ‘indistinguishable’ is so long that many physicists sloppily speak of ‘identical’ particles neverthe-
Copyright © Christoph Schiller November 1997–January 2007
detectors
mirrors
beam
source splitter
possible
two identical light
photons paths
bosons (at any energy) – turns out to be a boson; a composite of an odd number of fer-
mions is (always) a fermion. For example, almost all of the known molecules are bosons
(electronically speaking). Fermionic molecules are rather special and even have a special
If experiments force us to conclude that nobody, not even nature, can distinguish any
two particles of the same type, we deduce that they do not form two separate entities, but
some sort of unity. Our naive, classical sense of particle as a separate entity from the rest
of the world is thus an incorrect description of the phenomenon of ‘particle’. Indeed, no
experiment can track particles with identical intrinsic properties in such a way that they
can be distinguished with certainty. This impossibility has been checked experimentally
with all elementary particles, with nuclei, with atoms and with numerous molecules.
How does this fit with everyday life, i.e. with classical physics? Photons do not worry
us much here. Let us focus the discussion on matter particles. We know to be able to
distinguish electrons by pointing to the wire in which they flow, and we can distinguish
our fridge from that of our neighbour. While the quantum of action makes distinction
are particles like gloves? 797
impossible, everyday life allows it. The simplest explanation is to imagine a microscopic
particle, especially an elementary one, as a bulge, i.e. as a localized excitation of the va-
cuum. Figure 335 shows two such bulges representing two particles. It is evident that if
particles are too near to each other, it makes no sense to distinguish them; we cannot say
any more which is which.
The bulge image shows that
either for large distances or for high
potential walls separating them, dis-
tinction of identical particles does
become possible. In such situations,
measurements allowing to track F I G U R E 335 Particles as localized excitations
them independently do exist. In
other words, we can specify a limit energy at which permutation symmetry of objects
or particles separated by a distance d becomes important. It is given by
theory describes all particles of a given type as excitations of a single fundamental field.
Particles are indistinguishable because each particle is an excitation of the same basic
substrate and each excitation has the same properties. A situation with one particle is
then described by a vacuum state acted upon by a creation operator. Adding a second
particle is described by adding a second creation operator, and subtracting a particle by
adding a annihilation operator; the latter turns out to be the adjunct of the former.
Quantum field theory then studies how these operators must behave to describe ob-
servations.* It arrives at the following conclusions:
— Fields with half-integer spin are fermions and imply (local) anticommutation.
— Fields with integer spin are bosons and imply (local) commutation.
— For all fields at spacelike separations, the commutator – respectively anticommutator
– vanishes.
— Antiparticles of fermions are fermions, and antiparticles of bosons are bosons.
— Virtual particles behave like their real counterparts.
These connections are at the basis of quantum field theory. They describe how particles
are identical. But why are they? Why are all electrons identical? Quantum field theory
describes electrons as identical excitations of the vacuum, and as such as identical by
construction. Of course, this answer is only partially satisfying. We will find a better one
only in the third part of our mountain ascent.
2 ë 10−26 (552)
of the total state. Electrons are always in an antisymmetric state: thus they are fermions.
The reasoning behind this elegant experiment is the following. If electrons would not
always be fermions, every now and then an electron could fall into the lowest energy
level of a copper atom, leading to X-ray emission. The lack of such X-rays implies that
electrons are fermions to a very high accuracy. X-rays could be emitted only if they were
bosons, at least part of the time. Indeed, two electrons, being fermions, cannot be in the
same quantum state: this restriction is called the Pauli exclusion principle. It applies to
all fermions and is our next topic.
d, d † = dd † + d † d = 1 (551)
they describe a fermion. The so defined bracket is called the anticommutator bracket.
are particles like gloves? 799
A copying machine is a machine that takes an original, reads out its properties and
produces a copy, leaving the original unchanged. This seems definition seems straight-
forward. However, we know that if we extract information from an original, we have to
interact with it. As a result, the system will change at least by the quantum of action. We
thus expect that due to quantum theory, copies and originals can never be identical.
Quantum theory proves this in detail. A copying machine is described by an operator
that maps the state of an original system to the state of the copy. In other words, a copying
machine is linear. This linearity leads to a problem. Simply stated, if a copying machine
were able to copy originals either in state SAe or in state SBe, it could not decide what to do
if the state of the original were SAe+SBe. On one hand, the copy should be SAe+SBe; on the
other hand, the linearity of the copier forbids this. Indeed, a copier is a device described
by an operator U that changes the starting state Ssec of the copy in the following way:
— If the original is in state SAe, a copier acts as
In other words, a copy machine cannot copy a state completely.* This is the no-cloning
theorem.
The impossibility of copying is implicit in quantum theory. If we were able to clone
systems, we could to measure a variable of a system and a second variable on its copy. We
would be thus able to beat the indeterminacy relation. This is impossible. Copies are and
always must be imperfect.
Copyright © Christoph Schiller November 1997–January 2007
Other researchers then explored how near to perfection a copy can be, especially in the
Ref. 776 case of classical systems. To make a long story short, these investigations show that also
the copying or cloning of macroscopic systems is impossible. In simple words, copying
machines do not exist. Copies can always be distinguished from originals if observations
are made with sufficient care. In particular, this is the case for biological clones; biological
clones are identical twins born following separate pregnancies. They differ in their finger
* The no-cloning theorem puts severe limitations on quantum computers, as computations often need cop-
ies of intermediate results. It also shows that faster-than-light communication is impossible in EPR experi-
ments. In compensation, quantum cryptography becomes possible – at least in the laboratory. Indeed, the
no-cloning theorem shows that nobody can copy a quantum message without being noticed. The specific
ways to use this result in cryptography are the 1984 Bennett–Brassard protocol and the 1991 Ekert protocol.
800 vi permu tation of particles • rotations and statistics
prints, iris scans, physical and emotional memories, brain structures, and in many other
Challenge 1292 n aspects. (Can you specify a few more?) In short, biological clones, like identical twins, are
not copies of each other.
The lack of quantum mechanical copying machines is disappointing. Such machines,
or teleportation devices, could be fed with two different inputs, such as a lion and a goat,
and produce a superposition: a chimaera. Quantum theory shows that all these imaginary
beings cannot be realized.
In summary, everyday life objects such as photocopies, billiard balls or twins are always
distinguishable. There are two reasons: first, quantum effects play no role in everyday life,
so that there is no danger of unobservable exchange; secondly, perfect clones of classical
systems do not exist anyway, so that there always are tiny differences between any two
objects, even if they look identical at first sight. Gloves can always be distinguished.
lation, i.e. under position and time shift of viewpoints. The spin value describes how an
object behaves under rotations in three dimensions, i.e. under orientation change of view-
points.** As is well known, the magnitude of four-momentum is an invariant property,
given by the mass, whereas its orientation in space-time is free. Similarly, the magnitude
of spin is an invariant property, and its orientation has various possibilities with respect
to the direction of motion. In particular, the spin of massive particles behaves differently
from that of massless particles.
* Eugene Wigner (b. 1902 Budapest, d. 1995 Princeton), Hungarian–US-American theoretical physicist, re-
ceived the Nobel Prize for physics in 1993. He wrote over 500 papers, many about symmetry in physics. He
was also famous for being the most polite physicist in the world.
** The group of physical rotations is also called SO(3), since mathematically it is described by the group of
Special Orthogonal 3 by 3 matrices.
rotations and statistics – visualizing spin 801
The symmetry investigations lead to the classification of particles by their mass, their
momentum and their spin. To complete the list, the remaining symmetries must be in-
cluded. These are motion inversion parity, spatial parity and charge inversion parity. Since
these symmetries are parities, each elementary particle has to be described by three addi-
tional numbers, called T, P and C, each of which can take values of either +1 or −1. Being
parities, they must be multiplied to yield the value for a composed system.
Page 1190 A list of the values observed for all elementary particles in nature is given in
Appendix C. Spin and parities together are called quantum numbers. As we will discover
later on, additional interaction symmetries will lead to additional quantum numbers. But
let us return to spin.
The main result is that spin 1/2 is a possibility in nature, even though it does not appear
in everyday life. Spin 1/2 means that only a rotation of 720 degrees is equivalent to one of
802 vi permu tation of particles • rotations and statistics
F I G U R E 338 A belt visualizing two spin F I G U R E 339 Another model for two spin
1/2 particles 1/2 particles
into the belt. Again, a second swap will undo the twist.
In other words, if we take each end to represent a particle and a twist to mean a factor
−1, the belt exactly describes the phase behaviour of spin 1~2 wave functions under ex-
change and under rotations. In particular, we see that spin and exchange behaviour are
related.
The human body has such a belt built in: the arm. Just take your hand, put an object
on it for clarity such as a cup, and turn the hand and object by 2π by twisting the arm.
After a second rotation the whole system will be untangled again.
The trick is even more impressive when many arms are used. You can put your two
Challenge 1294 e hands (if you chose the correct starting position) under the cup or you can take a friend
or two who each keep a hand attached to the cup. The feat can still be performed: the
Challenge 1295 e whole system untangles after two full turns.
804 vi permu tation of particles • rotations and statistics
α=0 α = 2π α = 4π
Another demonstration is to connect two buckles with many bands or threads, like in
Figure 339. Both a rotation by 2π of one end or an exchange of the two ends produces
quite a tangle, even if one takes paths that ‘in between’ the bands; nevertheless, in both
cases a second rotation leads back to the original situation.
There is still another way to show the connection between rotation and exchange. Just
glue any number of threads or bands, say half a metre long, to an asymmetric object.
Like the arm of a human being, the bands are supposed to go to infinity and be attached
there. If any of the objects, which represent the particles, is rotated by 2π, twists appear Copyright © Christoph Schiller November 1997–January 2007
in its strings. If the object is rotated by an additional turn, to a total of 4π, as shown in
Figure 341, all twists and tangles can be made to disappear, without moving or turning
the object. You really have to experience this in order to believe it. And the trick really
works with any number of bands glued to the object.
Even more astonishing is the exchange part of the experiment. Take two particles of
the type shown on the left side of Figure 341. If you exchange the positions of two such
spin 1/2 particles, always keeping the ends at infinity fixed, a tangled mess is created. But
incredibly, if you exchange the objects a second time, everything untangles neatly, inde-
Challenge 1296 e pendently of the number of attached strings. You might want to test yourself that the
behaviour is also valid with sets of three or more particles.
All these observations together form the spin statistics theorem for spin 1/2 particles:
spin and exchange behaviour are related. Indeed, these almost ‘experimental’ arguments
rotations and statistics – visualizing spin 805
Ref. 781 can be put into exact mathematical language by studying the behaviour of the configura-
tion space of particles. These investigations result in the following statements:
Note that all these arguments require three dimensions, because there are no tangles (or
knots) in fewer dimensions.** And indeed, spin exists only in three or more spatial di-
mensions.
Here is a challenge. A spin 1/2 object can be modelled with one belt attached to it. If
you want to model the spin behaviour with attached one-dimensional strings instead of
Challenge 1298 n attached bands, what is the minimum number required?
* A mathematical observable behaving like a spin 1/2 particle is neither a vector nor a tensor, as you may
Challenge 1297 e want to check. An additional concept is necessary; such an observable is called a spinor. We will introduce
Page 776 it later on.
** Of course, knots and tangles do exist in higher dimensions. Instead of considering knotted one-
dimensional lines, one can consider knotted planes or knotted higher-dimensional hyperplanes. For ex-
ample, deformable planes can be knotted in four dimensions and deformable 3-spaces in five dimensions.
806 vi permu tation of particles • rotations and statistics
Integer spin
Under rotations, integer spin particles behave differently from half-integer particles. In-
teger spin particles do not show the strange sign changes under rotations by 2π. In the
belt imagery, integer spin particles need no attached strings. The spin 0 particle obviously
corresponds to a sphere. Models for other spin values are shown in Figure 342. Exploring
their properties in the same way as above, we arrive at the so-called spin-statistics theorem:
Objects of half-integer spin are fermions. They obey the Pauli exclusion prin-
ciple.
Objects of integer spin are bosons.
Electrons do have spin 1~2 and are charged. Electrons and all other charged particles
with spin 1~2 do have a magnetic moment.** A magnetic moment is expected for any
rotating charge. In other words, spin 1~2 does behave like rotation. However, assuming
that a particle consists of a continuous charge distribution in rotational motion gives the
wrong value for the magnetic moment. In the early days of the twentieth century, when
physicists were still thinking in classical terms, they concluded that spin 1~2 particles thus
* This sentence implies that spin 1 and higher can also be achieved with tails; can you find such a represent-
Challenge 1302 ny ation?
Note that composite fermions can be bosons only up to that energy at which the composition breaks
down. Otherwise, by packing fermions into bosons, we could have fermions in the same state.
Challenge 1303 ny ** This can easily be measured in a an experiment; however, not one of the Stern–Gerlach type. Why?
808 vi permu tation of particles • rotations and statistics
cannot be rotating. This myth has survived through many textbooks. The correct deduc-
tion is that the assumption of continuous charge distribution is wrong. Indeed, charge
is quantized; nobody today expects that elementary charge is continuously spread over
space, as that would contradict its quantization.
Let us remember what rotation is. Both the belt trick for spin 1/2 as well as the integer
spin case remind us: a rotation of one body around another is a fraction or a multiple of an
exchange. What we call a rotating body in everyday life is a body continuously exchanging
the positions of its parts. Rotation and exchange are the same.
Above we found that spin is exchange behaviour. Since rotation is exchange and spin
is exchange, it follows that spin is rotation. Since we deduced, like Wigner, spin from
rotation invariance, this consequence is not a surprise.
The belt model of a spin 1/2 particle tells us that such a particle can rotate continuously
Page 802 without any hindrance. In short, we are allowed to maintain that spin is rotation about an
axis, without any contradiction to observations, even for spin 1/2. The belt model helps
sion from the experiments showing the existence of spin. So far, we have seen that rota-
tion requires the existence of spin, that spin appears when relativity is introduced into
Page 323 quantum theory, and that relativity requires antimatter. Taking these three statements
together, the conclusion of the title is not surprising any more. Interestingly, there is a
simple argument making the same point directly, without any help of quantum theory,
when the belt model is extended from space alone to full space-time.
To learn how to think in space-time, let us take a particle spin 1, i.e. a particle looking
like a detached belt buckle in three dimensions. When moving in a 2+1 dimensional space-
* Obviously, the detailed structure of the electron still remains unclear at this point. Any angular momentum
S is given classically by S = Θω; however, neither the moment of inertia Θ, connected to the rotation radius
and electron mass, nor the angular velocity ω are known at this point. We have to wait quite a while, until
the third part of our adventure, to find out more.
rotations and statistics – visualizing spin 809
t t t t t
Challenge 1304 ny time, it is described by a ribbon. Playing around with ribbons in space-time, instead of
belts in space, provides many interesting conclusions. For example, Figure 343 shows that
wrapping a rubber ribbon around the fingers can show that a rotation of a body by 2π
in presence of a second one is the same as exchanging the positions of the two bodies.*
Both sides of the hand transform the same initial condition, at one border of the hand, to
the same final condition at the other border. We have thus successfully extended a known
result from space to space-time. Interestingly, we can also find a smooth sequence of steps
realizing this equivalence.
If you think that Figure 343 is not a satisfying explanation, you are right. A more com-
plete (yet equivalent) explanation is given by Figure 344. We assume that each particle is
Copyright © Christoph Schiller November 1997–January 2007
described by a segment; in the figure, they lie horizontally. The leftmost diagram shows
two particles: one at rest and one being rotated by 2π. The deformation of the ribbons
shows that this process is equivalent to the exchange in position of two particles, which is
shown in the rightmost diagram. Again, one notes that the sequence that shows the equi-
valence between rotation and exchange requires the use of a particle–antiparticle pair.
Without antiparticles, the equivalence of rotation and exchange would not hold. Rota-
tion in space-time indeed requires antiparticles.
* Obviously, the next step would be to check the full spin 1/2 model of Figure 341 in four-dimensional
Challenge 1305 ny space-time. But this is not an easy task; there is no generally accepted solution yet.
810 vi permu tation of particles • rotations and statistics
Biblio graphy
768 See the paper by Martin Gardner, Science fiction puzzle tales, Clarkson Potter, 67,
pp. 104–105, 1981, or his book Aha! Insight, Scientific American & W.H. Freeman, 1978. The
rabbit story is from A. Hajnal & P. Lovász, An algorithm to prevent the propagation
of certain diseases at minimum cost, in Interfaces Between Computer Science and Opera-
tions Research, edited by J.K. Lenstra, A.H.G. R innooy Kan & P. Van Emde B oas,
Mathematisch Centrum, Amsterdam 1978, whereas the computer euphemism is used by A.
Orlitzky & L. Shepp, On curbing virus propagation, Technical memorandum, Bell Labs
1989. Cited on page 791.
769 A complete discussion of the problem can be found in chapter 10 of Il an Vardi, Computa-
tional Recreations in Mathematica, Addison Wesley, Redwood City, California, 1991. Cited
on page 791.
770 On Gibbs’ paradox, see your favourite text on thermodynamics or statistical mechanics. See
ics Letters A 92, pp. 271–272, 1982, and by W.K. Wootters & W.H. Zurek, A single
quantum cannot be cloned, Nature 299, pp. 802–803, 1982. For a discussion of photon and
multiparticle cloning, see N. Gisin & S. Massar, Optimal quantum cloning machines,
Physics Review Letters 79, pp. 2153–2156, 1997. The whole topic has been presented in detail
by V. Buzek & M. Hillery, Quantum cloning, Physics World 14, pp. 25–29, November
2001. Cited on page 798.
776 The most recent experimental and theoretical results on physical cloning are described in
A. L amas-Linares, C. Simon, J.C. Howell & D. B ouwmeester, Experimental
quantum cloning of single photons, Science 296, pp. 712 – 714, 2002, D. Collins & S.
Popescu, A classical analogue of entanglement, preprint http://arxiv.org/abs/quant-ph/
0107082, 2001, and A. Daffertshofer, A.R. Pl astino & A. Pl astino, Classical
no-cloning theorem, Physical Review Letters 88, p. 210601, 2002. Cited on page 799.
812 vi permu tation of particles
Ref. 785
“ presentation has been so long delayed is no
doubt caused by the fact that Niels Bohr
brainwashed a whole generation of theorists
into thinking that the job was done fifty years
”
W
hy is this famous physical issue arousing such strong emotions? In particular,
ho is brainwashed, Gell-Mann, the discoverer of the quarks, or most of the
orld’s physicists working on quantum theory who follow Niels Bohr’sopinion?
In the twentieth century, quantum mechanics has thrown many in disarray. Indeed, it
radically changed the two most basic concepts of classical physics: state and system. The
state is not described any more by the specific values taken by position and momentum,
but by the specific wave function ‘taken’ by the position and momentum operators.* In ad-
dition, in classical physics a system was described as a set of permanent aspects of nature;
permanence was defined as negligible interaction with the environment. Quantum mech-
anics shows that this definition has to be modified as well.
* It is equivalent, but maybe conceptually clearer, to say that the state is described by a complete set of
commuting operators. In fact, the discussion is somewhat simplified in the Heisenberg picture. However,
here we study the issue in the Schrödinger picture, using wave functions.
814 vii details of quantum theory • superpositions and probabilities
(Why?)
How would it evolve in time? We can ask the same questions about a superposition of a
state where a car is inside a closed garage with a state where the car is outside.
Such strange situations are not usually observed in everyday life. The reason for this
We can then say that whenever the system is in a pure state, its density matrix, or density
functional, contains off-diagonal terms of the same order of magnitude as the diagonal
* Most what can be said about this topic has been said by two people: John von Neumann, who in the
Ref. 786 nineteen-thirties stressed the differences between evolution and decoherence, and by Hans Dieter Zeh, who
Ref. 787 in the nineteen seventies stressed the importance of baths and the environment in the decoherence process.
why are people either dead or alive? 815
ones.* Such a density matrix corresponds to the above-mentioned strange situations that
we do not observe in daily life.
We now have a look at the opposite situation. In contrast to the case just mentioned, a
density matrix for macroscopic distinct states with vanishing off-diagonal elements, such
as the two state example
ρ = SaS2 ψ a a ψ †a + SbS2 ψ b a ψ †b
SaS2 0
= (ψ a , ψ b ) 2
ψ †a
0 SbS
(559)
ψ †b
describes a system which possesses no phase coherence at all. (Here, a denotes the non-
commutative dyadic product or tensor product which produces a tensor or matrix start-
ing from two vectors.) Such a diagonal density matrix cannot be that of a pure state; it
describes a system which is in the state ψ a with probability SaS2 and which is in the state
where tr denotes the trace, i.e. the sum of all diagonal elements. We also remind ourselves
that a system with a large and constant entropy is called a bath. In simple physical terms,
a bath is a system to which we can ascribe a temperature. More precisely, a (physical) bath,
or (thermodynamic) reservoir, is any large system for which the concept of equilibrium can
Copyright © Christoph Schiller November 1997–January 2007
be defined. Experiments show that in practice, this is equivalent to the condition that a
bath consists of many interacting subsystems. For this reason, all macroscopic quantities
describing the state of a bath show small, irregular fluctuations, a fact that will be of central
importance shortly.
* Using the density matrix, we can rewrite the evolution equation of a quantum system:
dρ
= − [H, ρ] .
i
ψ̇ = −iHψ becomes (558)
dt ħ
Both are completely equivalent. (The new expression is sometimes also called the von Neumann equation.)
We won’t actually do any calculations here. The expressions are given so that you recognize them when you
encounter them elsewhere.
** In many settings, decoherence is called disentanglement, as we will see below.
816 vii details of quantum theory • superpositions and probabilities
Obviously, an everyday bath is also a bath in the physical sense: a thermodynamic bath
is similar to an extremely large warm water bath, one for which the temperature does not
change even if one adds some cold or warm water to it. Examples of physical baths are an
intense magnetic field, a large amount of gas, or a large solid. (The meanings of ‘intense’
and ‘large’ of course depend on the system under study.) The physical concept of bath is
thus an abstraction and a generalization of the everyday concept of bath.
It is easy to see from the definition (560) of entropy that the loss of off-diagonal ele-
Challenge 1308 n ments corresponds to an increase in entropy. And it is known that any increase in entropy
of a reversible system, such as the quantum mechanical system in question, is due to an
interaction with a bath.
Where is the bath interacting with the system? It obviously must be outside the system
one is talking about, i.e. in its environment. Indeed, we know experimentally that any
environment is large and characterized by a temperature. Some examples are listed in
Table 58. Any environment therefore contains a bath. We can even go further: for every
If the bath is described as a set of particles randomly hitting the microscopic system,
it is best characterized by the effective wavelength λ eff of the particles and by the average
Challenge 1309 ny interval t hit between two hits. A straightforward calculation shows that the decoherence
time t d is in any case smaller than this time interval, so that Copyright © Christoph Schiller November 1997–January 2007
t d D t hit =
1
, (561)
φσ
where φ is the flux of particles and σ the cross-section for the hit.* Typical values are given
* The decoherence time is derived by studying the evolution of the density matrix ρ(x, x ′ ) of ob-
jects localized at two points x and x ′ . One finds that the off-diagonal elements follow ρ(x, x ′ , t) =
′ 2
ρ(x, x ′ , 0)e−Λt(x−x ) , where the localization rate Λ is given by
2
Λ = k φσeff (562)
where k is the wave number, φ the flux and σeff the cross-section of the collisions, i.e. usually the size of the
Ref. 789 macroscopic object.
why are people either dead or alive? 817
matter baths
solid, liquid 300 K 10 pm 1031 ~m2 s 10−12 s 10−25 s
air 300 K 10 pm 1028 ~m2 s 10−9 s 10−22 s
laboratory vacuum 50 mK 10 µm 1018 ~m2 s 10 s 10−12 s
photon baths
sunlight 5800 K 900 nm 1023 ~m2 s 10−4 s 10−17 s
‘darkness’ 300 K 20 µm 1021 ~m2 s 10−2 s 10−15 s
1017 ~m2 s
a. The cross-section σ in the case of matter and photon baths was assumed to be 10−19 m2 for
atoms; for the macroscopic object a size of 1 mm was used as example. For neutrino baths, ...
in Table 58. We easily note that for macroscopic objects, decoherence times are extremely
short. Scattering leads to fast decoherence. However, for atoms or smaller systems, the
situation is different, as expected.
Copyright © Christoph Schiller November 1997–January 2007
A second method to estimate the decoherence time is also common. Any interaction
of a system with a bath is described by a relaxation time t r . The term relaxation designates
any process which leads to the return to the equilibrium state. The terms damping and
friction are also used. In the present case, the relaxation time describes the return to equi-
librium of the combination bath and system. Relaxation is an example of an irreversible
evolution. A process is called irreversible if the reversed process, in which every compon-
ent moves in opposite direction, is of very low probability.* For example, it is usual that
One also finds the surprising result that a system hit by a particle of energy E hit collapses the density
Ref. 790 matrix roughly down to the de Broglie (or thermal de Broglie) wavelength of the hitting particle. Both results
together give the formula above.
* Beware of other definitions which try to make something deeper out of the concept of irreversibility, such as
818 vii details of quantum theory • superpositions and probabilities
a glass of wine poured into a bowl of water colours the whole water; it is very rarely ob-
served that the wine and the water separate again, since the probability of all water and
wine molecules to change directions together at the same time is rather low, a state of
affairs making the happiness of wine producers and the despair of wine consumers.
Now let us simplify the description of the bath. We approximate it by a single, un-
specified, scalar field which interacts with the quantum system. Due to the continuity
of space, such a field has an infinity of degrees of freedom. They are taken to model the
many degrees of freedom of the bath. The field is assumed to be in an initial state where
its degrees of freedom are excited in a way described by a temperature T. The interac-
tion of the system with the bath, which is at the origin of the relaxation process, can be
described by the repeated transfer of small amounts of energy E hit until the relaxation
process is completed.
The objects of interest in this discussion, like the mentioned cat, person or car, are
described by a mass m. Their main characteristic is the maximum energy E r which can
E hit eE hit ~k T − 1
td = tr (565)
E r eE hit ~k T + 1
where k is the Boltzmann constant and like above, E r is the maximum energy which can
be transferred from the system to the environment. Note that one always has t d D t r .
After the decoherence time t d is elapsed, the system has evolved from the coherent to
the incoherent superposition of states, or, in other words, the density matrix has lost its
off-diagonal terms. One also says that the phase coherence of this system has been des-
troyed. Thus, after a time t d , the system is found either in the state ψ a or in the state ψ b ,
respectively with the probability SaS2 or SbS2 , and not any more in a coherent superpos-
ition which is so much in contradiction with our daily experience. Which final state is
selected depends on the precise state of the bath, whose details were eliminated from the
calculation by taking an average over the states of its microscopic constituents.
claims that ‘irreversible’ means that the reversed process is not at all possible. Many so-called ‘contradictions’
Copyright © Christoph Schiller November 1997–January 2007
between the irreversibility of processes and the reversibility of evolution equations are due to this mistaken
interpretation of the term ‘irreversible’.
* This result is derived as in the above case. A system interacting with a bath always has an evolution given
Ref. 792 by the general form
dρ
= − [H, ρ] − Q[Vj ρ, Vj ] + [Vj , ρVj ] ,
i 1 † †
(563)
dt ħ 2t o j
where ρ is the density matrix, H the Hamiltonian, V the interaction, and t o the characteristic time of the
Challenge 1310 ny interaction. Are you able to see why? Solving this equation, one finds for the elements far from the diagonal
ρ(t) = ρ 0 e−t~t 0 . In other words, they disappear with a characteristic time t o . In most situations one has a
relation of the form
E hit
t0 = tr = t hit (564)
Er
or some variations of it, as in the example above.
why are people either dead or alive? 819
The important result is that for all macroscopic objects, the decoherence time t d is
extremely small. In order to see this more clearly, we can study a special simplified case.
A macroscopic object of mass m, like the mentioned cat or car, is assumed to be at the
same time in two locations separated by a distance l, i.e. in a superposition of the two
corresponding states. We further assume that the superposition is due to the object mov-
ing as a quantum mechanical oscillator with frequency ω between the two locations; this
is the simplest possible system that shows superpositions of an object located in two dif-
ferent positions. The energy of the object is then given by E r = mω 2 l 2 , and the smallest
transfer energy E hit = ħω is the difference between the oscillator levels. In a macroscopic
situation, this last energy is much smaller than kT, so that from the preceding expression
Ref. 793 we get
2
ħ2 λ 2T
td = tr = tr =
E hit
t r (566)
2E r kT 2mkT l 2 l2
º
in which the frequency ω has disappeared. The quantity λ T = ħ~ 2mkT is called the
system is subject to an extremely low friction, leading to a very long relaxation time, its
decoherence time is still vanishingly short.
Our everyday environment is full of baths. Therefore, coherent superpositions of mac-
roscopically distinct states never appear in everyday life. In short, we cannot be dead and
alive at the same time.
We also arrive at a second conclusion: decoherence results from coupling to a bath in
the environment. Decoherence is a statistical or thermodynamic effect. We will return to
this issue below.
described by a wave function. For example, it can happen that a particle forms neither a
macroscopic nor a microscopic system! In these situations, when the interaction is coher-
ent, one speaks of entanglement; such a particle or set of particles is said to be entangled
with its environment.
Entangled, coherently interacting systems are separable, but not divisible. In quantum
theory, nature is not found to be made of isolated entities, but is still made of separable
entities. The criterion of separability is the incoherence of interaction. Coherent super-
positions imply the surprising consequence that there are systems which, even though
they look divisible, are not. Entanglement poses a limit to divisibility. All surprising prop-
erties of quantum mechanics, such as Schrödinger’s cat, are consequences of the classical
prejudice that a system made of two or more parts must necessarily be divisible into two
what is a system? what is an object? 821
space
collapse
t1 t2 t3 t4
Page 502
“ body ...
James Joyce, A Painful Case
passing a slit. Following the description just deduced, the process proceeds schematically
as depicted in Figure 346. A film of the same process can be seen in the lower left corners
on the pages following page 719. The situation is surprising: due to the short decoher-
ence time, in a wave function collapse the maximum of the function changes position at
extremely high speed. In fact, the maximum moves faster than light. But is it a problem?
A situation is called acausal or non-local if energy is transported faster than light. Using
Challenge 1311 n Figure 346 you can determine the energy velocity involved, using the results on signal
Page 591 propagation. The result is a value smaller than c. A wave function maximum moving
faster than light does not imply energy moving faster than light.
In other words, quantum theory has speeds greater than light, but no energy speeds
Ref. 799 greater than light. In classical electrodynamics, the same happens with the scalar and the
vector potentials if the Coulomb gauge is used. We have also encountered speeds faster
822 vii details of quantum theory • superpositions and probabilities
space
detector 2
detector 1
collapse
Page 314 than that of light in the motion of shadows and in many other observations. Any physicist
now has two choices: he can be straight, and say that there is no non-locality in nature;
or he can be less straight, and claim there is. In the latter case, he has to claim that even
classical physics is non-local. However, this never happens. On the other hand, there is a
danger in this more provoking usage: a small percentage of those who say that the world
is non-local after a while start to believe that there really are faster-than-light effects in
nature. These people become prisoners of their muddled thinking; on the other hands,
muddled thinking helps to get more easily into newspapers. In short, even though the
definition of non-locality is not unanimous, here we stick to the stricter one, and define
non-locality as energy transport faster than light.
An often cited Gedanken experiment that shows the pitfalls of non-locality was
proposed by Bohm* in the discussion around the so-called Einstein–Podolsky–Rosen
Ref. 800, Ref. 801 paradox. In the famous EPR paper the three authors try to find a contradiction between
quantum mechanics and common sense. Bohm translated their rather confused paper
into a clear Gedanken experiment. When two particles in a spin 0 state move apart,
Copyright © Christoph Schiller November 1997–January 2007
measuring one particle’s spin orientation implies an immediate collapse also of the other
particle’s spin, namely in the exactly opposite direction. This happens instantaneously
over the whole separation distance; no speed limit is obeyed. In other words, entangle-
ment seems to lead to faster-than-light communication.
Again, in Bohm’s experiment, no energy is transported faster than light. No non-
locality is present, despite numerous claims of the contrary by certain authors. The two
entangled electrons belong to one system: assuming that they are separate only because
the wave function has two distant maxima is a conceptual mistake. In fact, no signal
* David Joseph Bohm (1917–1992) American–British physicist. He codiscovered the Aharonov–Bohm ef-
fect; he spent a large part of his later life investigating the connections between quantum physics and philo-
sophy.
what is a system? what is an object? 823
can be transmitted with this method; the decoherence is a case of prediction which
looks like a signal without being one. We already discussed such cases in the section
Page 594 on electrodynamics.
Bohm’s experiment has actually been performed. The first and most famous realization
Ref. 802 was realized in 1982 by Alain Aspect; he used photons instead of electrons. Like all latter
tests, it has fully confirmed quantum mechanics.
In fact, experiments such as the one by Aspect confirm that it is impossible to treat
either of the two particles as a system by itself; it is impossible to ascribe any physical
property, such as a spin orientation, to either of them alone. (The Heisenberg picture
would express this restriction even more clearly.)
The mentioned two examples of apparent non-locality can be dismissed with the re-
mark that since obviously no energy flux faster than light is involved, no problems with
causality appear. Therefore the following example is more interesting. Take two identical
atoms, one in an excited state, one in the ground state, and call l the distance that separ-
current of the order of 1 pA flows in clockwise direction with one where it flows in counter-
Ref. 811 clockwise direction have been produced.
**
Ref. 805 Superpositions of magnetization in up and down direction at the same time have also be
observed for several materials.
**
Some people wrongly state that an atom that is in a superposition of states centred at
different positions has been photographed. (This lie is even used by some sects to attract
Challenge 1312 n believers.) Why is this not true?
824 vii details of quantum theory • superpositions and probabilities
**
Since the 1990s, the sport of finding and playing with new systems in coherent macro-
Ref. 807 scopic superpositions has taken off across the world. The challenges lie in the clean exper-
iments necessary. Experiments with single atoms in superpositions of states are among
Ref. 808 the most popular ones.
**
Ref. 809 In 1997, coherent atom waves were extracted from a cloud of sodium atoms.
**
Macroscopic objects usually are in incoherent states. This is the same situation as for
light. The world is full of ‘macroscopic’, i.e. incoherent light: daylight, and all light from
lamps, from fire and from glow-worms is incoherent. Only very special and carefully
constructed sources, such as lasers or small point sources, emit coherent light. Only these
Challenge 1313 ny Can you find a method to measure the degree of entanglement? Can you do so for a system
made of many particles?
**
The study of entanglement leads to a simple conclusion: teleportation contradicts correla-
Challenge 1314 ny tion. Can you confirm the statement?
**
Challenge 1315 n Are ghost images in TV sets, often due to spurious reflections, examples of interference?
**
Challenge 1316 r What happens when two monochromatic electrons overlap?
what is all the fuss abou t measurements in quantum theory? 825
**
Some people say that quantum theory could be used for quantum computing, by using
Ref. 813 coherent superpositions of wave functions. Can you give a general reason that makes this
Challenge 1317 n aim very difficult, even without knowing how such a quantum computer might work?
for a system in a state ψ gives as result one of the eigenvalues a n , and the probability Pn
to get the result a n is given by
Pn = Sφ†n ψS2 , (567)
Page 1224 where φ n is the eigenfunction of the operator A corresponding to the eigenvalue a n .
Experiments also show a second property of quantum measurements: after the meas-
urement, the observed quantum system is in the state φ n corresponding to the measured
eigenvalue a n . One also says that during the measurement, the wave function has col-
Ref. 810 lapsed from ψ to φ n . By the way, both properties can also be generalized to the more
general cases with degenerate and continuous eigenvalues.
At first sight, the sort of probabilities encountered in quantum theory are different
826 vii details of quantum theory • superpositions and probabilities
from the probabilities we encounter in everyday life. Roulette, dice, pachinko machines,
the direction in which a pencil on its tip falls, have been measured experimentally to be
random (assuming no cheating by the designer or operators) to a high degree of accuracy.
These systems do not puzzle us. We unconsciously assume that the random outcome is
due to the small, but uncontrollable variations of the starting conditions every time the
experiment is repeated.*
But microscopic systems seem to be different. The two
measurement properties just mentioned express what phys-
icists observe in every experiment, even if the initial con- gravity
ditions are taken to be exactly the same every time. But
why then is the position for a single electron, or most other
observables of quantum systems, not predictable? In other
words, what happens during the collapse of the wave func-
tion? How long does it take? In the beginning of quantum
small, it is difficult to read if the paper gets brittle; if the magnetic tracks on tapes are
too small, they demagnetize and loose the stored information. In other words, a record is
rendered stable against internal fluctuations by making it of sufficient size. Every record
thus consists of many components and shows small fluctuations.
Therefore, every system with memory, i.e. every system capable of producing a record,
contains a bath. In summary, the statement that any observation is the production of a
record can be expressed more precisely as: Any observation of a system is the result of an
interaction between that system and a bath in the recording apparatus.**
* To get a feeling for the limitations of these unconscious assumptions, you may want to read the already
mentioned story of those physicists who built a machine that could predict the outcome of a roulette ball
Ref. 85 from the initial velocity imparted by the croupier.
** Since baths imply friction, we can also say: memory needs friction.
what is all the fuss abou t measurements in quantum theory? 827
H H int tr
the particle, i.e. those not described – spanned, in mathematical language – by the oper-
ator A corresponding to the observable we want to measure. The numbers c n = Sφ†n ψ p S
give the expansion of the state ψ p , which is taken to be normalized, in terms of the basis
φ n . For example, in a typical position measurement, the functions φ n would be the pos-
ition eigenfunctions and ψ other would contain the information about the momentum, the
spin and all other properties of the particle.
How does the system–detector interaction look like? Let us call the state of the appar-
atus before the measurement χ start ; the measurement apparatus itself, by definition, is
a device which, when it is hit by a particle in the state φ n ψ other , changes from the state
χ start to the state χ n . One then says that the apparatus has measured the eigenvalue a n
corresponding to the eigenfunction φ n of the operator A. The index n is thus the record
of the measurement; it is called the pointer index or variable. This index tells us in which
828 vii details of quantum theory • superpositions and probabilities
state the microscopic system was before the interaction. The important point, taken from
our previous discussion, is that the states χ n , being records, are macroscopically distinct,
precisely in the sense of the previous section. Otherwise they would not be records, and
the interaction with the detector would not be a measurement.
Of course, during measurement, the apparatus sensitive to φ n changes the part ψ other
of the particle state to some other situation ψ other,n , which depends on the measurement
and on the apparatus; we do not need to specify it in the following discussion.* Let us have
an intermediate check of our reasoning. Do apparatuses as described here exist? Yes, they
do. For example, any photographic plate is a detector for the position of ionizing particles.
A plate, and in general any apparatus measuring position, does this by changing its mo-
mentum in a way depending on the measured position: the electron on a photographic
plate is stopped. In this case, χ start is a white plate, φ n would be a particle localized at
spot n, χ n is the function describing a plate blackened at spot n and ψ other,n describes the
momentum and spin of the particle after it has hit the photographic plate at the spot n.
After the interaction, using the just mentioned characteristics of the apparatus, the com-
bined state ψ a is
ψ a = Q c n φ n ψ other,n χ n . (571)
n
This evolution from ψ i to ψ a follows from the evolution equation applied to the particle
detector combination. Now the state ψ a is a superposition of macroscopically distinct
states, as it is a superposition of distinct macroscopic states of the detector. In our example
ψ a could correspond to a superposition of a state where a spot on the left upper corner
is blackened on an otherwise white plate with one where a spot on the right lower corner
of the otherwise white plate is blackened. Such a situation is never observed. Let us see
why. The density matrix ρ a of this situation, given by
n,m
* How does the interaction look like mathematically? From the description we just gave, we specified the
final state for every initial state. Since the two density matrices are related by
ρf = T ρi T † (568)
Challenge 1319 ny we can deduce the Hamiltonian from the matrix T. Are you able to see how?
By the way, one can say in general that an apparatus measuring an observable A has a system interaction
Hamiltonian depending on the pointer variable A, and for which one has
[H + H int , A] = 0 . (569)
what is all the fuss abou t measurements in quantum theory? 829
contains non-diagonal terms, i.e. terms for n x m, whose numerical coefficients are dif-
ferent from zero. Now let’s take the bath back in.
From the previous section we know the effect of a bath on such a macroscopic super-
position. We found that a density matrix such as ρ a decoheres extremely rapidly. We as-
sume here that the decoherence time is negligibly small, in practice thus instantaneous,*
so that the off-diagonal terms vanish, and only the final, diagonal density matrix ρ f , given
by
ρ f = Q Sc n S2 (φ n ψ other,n χ n ) a (φ n ψ other,n χ n )† (573)
n
has experimental relevance. As explained above, such a density matrix describes a mixed
state and the numbers Pn = Sc n S2 = Sφ†n ψ p S2 give the probability of measuring the value
a n and of finding the particle in the state φ n ψ other,n as well as the detector in the state χ n .
But this is precisely what the two properties of quantum measurements state.
We therefore find that describing a measurement as an evolution of a quantum system
We thus have a formula for the time the wave function takes to collapse. The first experi-
Ref. 812 mental measurements of the time of collapse are appearing and confirm these results.
Hidden variables
Obviously a large number of people are not satisfied with the arguments just presented.
They long for more mystery in quantum theory. The most famous approach is the idea
that the probabilities are due to some hidden aspect of nature which is still unknown to
humans. But the beautiful thing about quantum mechanics is that it allows both concep-
tual and experimental tests on whether such hidden variables exist without the need of
knowing them.
Clearly, hidden variables controlling the evolution of microscopic system would con-
tradict the result that action values below ħ~2 cannot be detected. This minimum observ-
Copyright © Christoph Schiller November 1997–January 2007
able action is the reason for the random behaviour of microscopic systems.
Historically, the first argument against hidden variables was given by John von Neu-
mann.**
* Note however, that an exactly vanishing decoherence time, which would mean a strictly infinite number
of degrees of freedom of the environment, is in contradiction with the evolution equation, and in particular
with unitarity, locality and causality. It is essential in the whole argument not to confuse the logical con-
sequences of a extremely small decoherence time with those of an exactly vanishing decoherence time.
** János von Neumann (b. 1903 Budapest, d. 1957 Washington DC) Hungarian mathematician. One of the
greatest and clearest minds of the twentieth century, he settled already many questions, especially in applied
mathematics and quantum theory, that others still struggle with today. He worked on the atomic and the
hydrogen bomb, on ballistic missiles, and on general defence problems. In another famous project, he build
the first US-American computer, building on his extension of the ideas of Konrad Zuse.
830 vii details of quantum theory • superpositions and probabilities
– CS – to be written – CS –
An additional no-go theorem for hidden variables was published by Kochen and
Ref. 814 Specker in 1967, (and independently by Bell in 1969). It states that non-contextual hid-
den variables are impossible, if the Hilbert space has a dimension equal or larger than
three. The theorem is about non-contextual variables, i.e. about hidden variables inside
the quantum mechanical system. The Kochen–Specker theorem thus states that there is
no non-contextual hidden variables model, because mathematics forbids it. This result
essentially eliminates all possibilities, because usual quantum mechanical systems have
dimensions much larger than three.
But also common sense eliminates hidden variables, without any recourse to math-
ematics, with an argument often overlooked. If a quantum mechanical system had in-
ternal hidden variables, the measurement apparatus would have zillions of them.* And
(575)
where the expressions in brackets are the averages of the measurement products over a
large number of samples. This result holds independently of the directions of the involved
polarizers.
On the other hand, if the polarizers 1 and 2 at position A and the corresponding ones
at position B are chosen with angles of π~4, quantum theory predicts that the result is
º
S(a 1 b 1 ) + (a 2 b 1 ) + (a 2 b 2 ) − (a 1 b 2 )S = 2 2 A 2 (576)
From the arguments presented here we draw a number of conclusions which we need
Giambattista Vico*
”
for the rest of our mountain ascent. Note that these conclusions are not yet shared by all
physicists! The whole topic is still touchy.
— Probabilities do not appear in measurements because the state of the quantum system
is unknown or fuzzy, but because the detailed state of the bath in the environment
is unknown. Quantum mechanical probabilities are of statistical origin and are due to
baths in the environment (or the measurement apparatus). The probabilities are due to
the large number of degrees of freedom contained in any bath. These large numbers
make the outcome of experiments unpredictable. If the state of the bath were known,
the outcome of an experiment could be predicted. The probabilities of quantum theory
are ‘thermodynamic’ in origin.
In other words, there are no fundamental probabilities in nature. All probabilities
in nature are due to decoherence; in particular, all probabilities are due to the statistics
of the many particles – some of which may be virtual – that are part of the baths in the
environment. Modifying well-known words by Albert Einstein, ‘nature really does not
Copyright © Christoph Schiller November 1997–January 2007
play dice.’ We therefore called ψ the wave function instead of ‘probability amplitude’, as
is often done. State function would be an even better name.
— Any observation in everyday life is a special case of decoherence. What is usually called
the ‘collapse of the wave function’ is a decoherence process due to the interaction with
the baths present in the environment or in the measuring apparatus. Because humans
are warm-blooded and have memory, humans themselves are thus measurement ap-
paratuses. The fact that our body temperature is 37°C is thus the reason that we see
* ‘We are able to demonstrate geometrical matters because we make them; if we could prove physical mat-
ters we would be able to make them.’ Giovanni Battista Vico (b. 1668 Napoli, d. 1744 Napoli) important
Italian philosopher and thinker. In this famous statement he points out a fundamental distinction between
mathematics and physics.
832 vii details of quantum theory • superpositions and probabilities
Challenge 1321 n only a single world, and no superpositions. (Actually, there are more reasons; can you
name a few?)
— A measurement is complete when the microscopic system has interacted with the bath
in the measuring apparatus. Quantum theory as a description of nature does not re-
quire detectors; the evolution equation describes all examples of motion. However,
measurements do require the existence of detectors. Detectors, being machines that re-
cord observations, have to include a bath, i.e. have to be classical, macroscopic objects.
In this context one speaks also of a classical apparatus. This necessity of the measure-
ment apparatus to be classical had been already stressed in the very early stages of
quantum theory.
— All measurements, being decoherence processes that involve interactions with baths,
are irreversible processes and increase entropy.
— A measurement is a special case of quantum mechanical evolution, namely the evolu-
tion for the combination of a quantum system, a macroscopic detector and the environ-
and remains the basic property of evolution. There are no non-unitary processes in
quantum mechanics.
— The description of the collapse of the wave function as a decoherence process is an
explanation exactly in the sense in which the term ‘explanation’ was defined in the
Page 703 intermezzo; it describes the relation between an observation and all the other aspects
of reality, in this case the bath in the detector or the environment. The collapse of the
wave function has been both calculated and explained. The collapse is not a question
of ‘interpretation’, i.e. of opinion, as unfortunately often is suggested.**
* The opposite view is sometimes falsely attributed to Niels Bohr. The Moon is obviously in contact with
Challenge 1322 n many radiation baths. Can you list a few?
** This implies that the so-called ‘many worlds’ interpretation is wishful thinking. The conclusion is con-
conclusions on probabilities and determinism 833
— It is not useful to speculate whether the evolution for a single quantum measurement
could be determined if the state of the environment around the system were known.
Measurements need baths. But a bath, being irreversible, cannot be described by a
wave function, which behaves reversibly.* Quantum mechanics is deterministic. Baths
are probabilistic.
— In summary, there is no irrationality in quantum theory. Whoever uses quantum the-
ory as argument for superstitions, irrational behaviour, new age beliefs or ideologies is
guilty of disinformation. The statement by Gell-Mann at the beginning of this chapter
is thus such an example. Another is the following well-known but incorrect statement
by Richard Feynman:
Ref. 817 ...nobody understands quantum mechanics.
Nobel Prizes in physics obviously do not prevent infection by ideology. On the other
hand, the process of decoherence allows a clear look on various interesting issues.
Are you able to find an interaction which is momentum-dependent? What is the con-
Challenge 1323 n sequence for macroscopic systems?
In other words, in contrast to general relativity, quantum theory produces a distinction
between space and time. In fact, we can define position as the observable that commutes
with interaction Hamiltonians. This distinction between space and time is due to the
properties of matter and its interactions. We could not have deduced this distinction in
general relativity.
Ref. 816 firmed when studying the details of this religious approach. It is a belief system, not based on facts.
* This very strong type of determinism will be very much challenged in the last part of this text, in which
it will be shown that time is not a fundamental concept, and therefore that the debate around determinism
looses most of its interest.
834 vii details of quantum theory • superpositions and probabilities
communication is based on a secret key that is used to decrypt the message. Even if the
communication channel is of the highest security – like entangled photons – one still has
to find a way to send the communication partner the secret key necessary for the decryp-
tion of the messages. Finding such methods is the main aspect of quantum cryptology, a
large research field. However, close investigation shows that all key exchange methods are
limited in their security. In short, due to the quantum of action, nature provides limits on
* Cryptology consists of the field of cryptography, the art of coding messages, and the field of cryptoana-
lysis, the art of deciphering encrypted messages. For a good introduction to cryptology, see the text by
Albrecht Beu telspacher, Jörg S chwenk & Klaus-Dieter Wolfenstätter, Moderne Ver-
fahren der Kryptographie, Vieweg 1995.
conclusions on probabilities and determinism 835
the possibility of sending encrypted messages. The statement of these limits is (almost)
equivalent to the statement that change in nature is limited by the quantum of action.
The quantum of action provides a limit to secure information exchange. This connec-
tion also allows to brush aside several incorrect statements often found in the media.
Page 1068 Stating that ‘the universe is information’ or that ‘the universe is a computer’ is as devoid
of reason as saying that the universe is an observation, a measurement apparatus, a clock-
work or a chewing-gum dispenser. Any expert of motion should beware of these and sim-
ilarly fishy statements; people who use them either deceive themselves or try to deceive
others.
“
Now that we can look at quantum effects without ideological baggage, let us have some
serious fun in the world of quantum theory. The quantum of action has important con-
Terence
”
sequences for biology, chemistry, material science, technology, and science fiction. We
will only explore a cross-section of these topics, but it will be worth it.
Biolo gy
A special form of electromagnetic motion is of importance to humans: life. We mentioned
at the start of quantum theory that life cannot be described by classical physics. Life is a
quantum effect. Let us see why.
Living beings can be described as objects showing metabolism, information pro-
cessing, information exchange, reproduction and motion. Obviously, all these properties
follow from a single one, to which the others are enabling means:
Living beings are objects able to reproduce.**
From your biology lessons in secondary school you might remember the main proper-
ties of heredity. Reproduction is characterized by random changes from one generation
to the next. The statistics of mutations, for example Mendel’s ‘laws’ of heredity, and the
Copyright © Christoph Schiller November 1997–January 2007
lack of intermediate states, are direct consequences of quantum theory. In other words,
reproduction and growth are quantum effects.
In order to reproduce, living beings must be able to move in self-directed ways. An ob-
ject able to perform self-directed motion is called a machine. All self-reproducing beings
are machines.
Since reproduction and growth is simpler the smaller the system is, most living be-
ings are extremely small machines for the tasks they perform, especially when compared
* ‘I am a man and nothing human is alien to me.’ Terence is Publius Terentius Afer (c. 190–159 bce ), the
important roman poet. He writes this in his play Heauton Timorumenos, verse 77.
** However, there are examples of objects which reproduce and which nobody would call living. Can you
Challenge 1324 n find some examples, together with a sharper definition?
biology 837
to human made machines. This is the case even though the design of human machines
has considerably fewer requirements: human-built machines do not need to be able to
reproduce; as a result, they do not need to be made of a single piece of matter, as all living
beings have to. Indeed, despite all the restrictions nature has to live with, living beings
hold many miniaturization world records:
— The brain has the highest processing power per volume of any calculating device so
far. Just look at the size of chess champion Gary Kasparov and the size of the computer
against which he played. Or look at the size of any computer that attempts to speak.
— The brain has the densest and fastest memory of any device so far. The set of com-
pact discs (CDs) or digital versatile discs (DVDs) that compare with the brain is many
Page 655 thousand times larger.
— Motors in living beings are many orders of magnitude smaller than human-built ones.
Just think about the muscles in the legs of an ant.
— The motion of living beings beats the acceleration of any human-built machine by
Reproduction
Life is a sexually transmitted disease.
“ Anonymous
All the astonishing complexity of life is geared towards reproduction. Reproduction is the ”
ability of an object to build other objects similar to itself. Quantum theory told us that
Page 798 only a similar object is possible, as an exact copy would contradict the quantum of action,
as we found out above.
Since reproduction requires mass increase, reproducing objects show both metabol-
ism and growth. In order that growth leads to an object similar to the original, a con-
838 vii details of quantum theory • applied quantum mechanics
Living beings are machines. How do these machines work? From a physical point of view,
we need only a few sections of our walk so far to describe them: universal gravity and QED.
Simply stated, life is an electromagnetic process taking place in weak gravity.* But the
* In fact, also the nuclear interactions play some role for life: cosmic radiation is one source for random
mutations, which are so important in evolution. Plant growers often use radioactive sources to increase
Ref. 824 mutation rates. But obviously, radioactivity can also terminate life.
The nuclear interactions are also implicitly involved in several other ways. They were necessary to form
the materials – carbon, oxygen, etc. – required for life. Nuclear interactions are behind the main mechanism
for the burning of the Sun, which provides the energy for plants, for humans and for all other living beings
(except a few bacteria in inaccessible places).
Summing up, the nuclear interactions play a role in the appearance and in the in destruction of life; but
they play no (known) role for the actions of particular living beings.
biology 839
details of this statement are tricky and interesting. Table 59 gives an overview of motion
processes in living beings. Interestingly, all motion in living beings can be summarized
in a few classes by asking for the motor driving it.
Nature only needs few small but powerful devices to realize all motion types used by
living beings. Given the long time that living systems have been around, these devices are
extremely efficient. In fact, ion pumps, chemical pumps, rotational and linear molecular
motors are all specialized molecular machines. Ion and chemical pumps are found in
membranes and transport matter. Rotational and linear motor move structures against
membranes. In short, all motion in living beings is due to molecular motors. Even though
840 vii details of quantum theory • applied quantum mechanics
there is still a lot to be learned about them, what is known already is spectacular enough.
molecular motor, the power of the environmental noise is eight to nine orders of mag-
nitude higher than the power consumed by the motor. The ratio shows what a fantastic
piece of machinery such a motor is.
Page 75 We encountered rotational motors already above; nature uses them to rotate the cilia
Ref. 827 of many bacteria as well as sperm tails. Researchers have also discovered that evolution
produced molecular motors which turn around DNA helices like a motorized bolt would
turn around a screw. Such motors are attached at the end of some viruses and insert the
DNA into virus bodies when they are being built by infected cells, or extract the DNA from
Ref. 825 the virus after it has infected a cell. Another rotational motor, the smallest known so far
– 10 nm across and 8 nm high – is ATP synthase, a protein that synthesizes most ATP in
cells.
Molecular pumps are essential to life. They pump chemicals, such as ..., into every cell
biology 841
or out of it, using energy, even if the concentration gradient tries to do the opposite. Mo-
lecular pumps are thus essential in ensuring that life is a process far from equilibrium.
In addition, disturbed molecular pumps are responsible for the water loss in cholera, for
example.
In the following, we concentrate on linear motors. The ways molecules produce move-
Ref. 826 ment in linear motors was uncovered during the 1990s. The results then started a wave
of research on all other molecular motors found in nature. All molecular motors share a
number of characteristic properties. There are no temperature gradients involved, as in
car engines, no electrical currents, as in electrical motors, and no concentration gradients,
as found in chemically induced motion, such as the rising of a cake.
The central part of the most important linear molecular motor is a combination of two
protein molecules, namely myosin and actin. Myosin changes between two shapes and
literally walks along actin. It moves in regular small steps, as shown in Figure 351. The
motion step size has been measured with beautiful experiments to always be an integer
On average, the myosin molecule will thus move in one direction. Nowadays the motion
of single molecules can be followed in special experimental set-ups. These experiments
confirm that muscles use such a ratchet mechanism. The ATP molecule adds energy to
the system and triggers the potential variation through the shape change it induces in the
myosin molecule. That is how our muscles work.
Another well-studied linear molecular motor is the kinesin–microtubule system that
carries organelles from one place to the other within a cell. As in the previous example,
also in this case chemical energy is converted into unidirectional motion. Researchers
were able to attach small silica beads to single molecules and to follow their motion. Using
laser beams, they could even apply forces to these single molecules. Kinesin was found
* It was named by Walt Disney after by Ratchet Gearloose, the famous inventor from Duckburg.
842 vii details of quantum theory • applied quantum mechanics
Fixed position
U(t 1 )
U(t 2 )
Brownian motion
can take place
U(t 3 )
to move with around 800 nm~s, in steps lengths which are multiples of 8 nm, using one
ATP molecule at a time, and exerting a force of about 6 pN.
Quantum ratchets also exist as human-built systems, such as electrical ratchets for elec-
tron motion or optical ratchets that drive small particles. Extensive experimental research
is going on in these field.
human twins have different finger prints, iris scans, blood vessel networks, intrauterine
experiences, among others.
**
Many molecules found in living beings, such as sugar, have mirror molecules. However,
in all living beings only one of the two sorts is found. Life is intrinsically asymmetric.
Challenge 1329 n How can this be?
**
How is it possible that the genetic difference between man and chimpanzee is regularly
given as about 1 %, whereas the difference between man and woman is one chromosome
Challenge 1330 n in 46, in other words, about 2.2 %?
biology 843
**
What is the longest time a single bacterium has survived? Not 5000 years as the bacteria
found in Egyptian mummies, not even 25 million years as the bacteria resurrected from
the intestines in insects enclosed in amber. It is much longer, namely an astonishing 250
million years. This is the time that bacteria discovered in the 1960s by Hans Pflug in
(low-radioactivity) salt deposits in Fulda (Germany) have hibernated there before being
brought back to life in the laboratory. The result has been recently confirmed by the dis-
covery of another bacterium in a North-American salt deposit in the Salado formation.
**
Molecular motors are quite performant. Those in a bird, the sooty shearwater (Paffinus
griseus) allow it fly 74 000 km in a year, with a measured record of 1094 km a day. Copyright © Christoph Schiller November 1997–January 2007
**
In 1967, a TV camera was deposited on the Moon. Unknown to everybody, it contained
a small patch of Streptococcus mitis. Three years later, the camera was brought back to
Ref. 428 Earth. The bacteria were still alive. They had survived for three years without food, water
or air. Life can be resilient indeed.
**
In biology, classifications are extremely useful. (This is similar to the situation in astro-
physics, but in full contrast to the situation in physics.) Table 60 gives an overview of the
Ref. 829 magnitude of the task. This wealth of material can be summarized in one graph, shown
in Figure 353. Newer research seems to suggest some slight changes to the picture. So far
844 vii details of quantum theory • applied quantum mechanics
Thermotogales
Humans have many living beings on board. For example, humans need bacteria to live.
It is estimated that 90% of the bacteria in the human mouth alone are not known yet; only
500 species have been isolated so far. These useful bacteria help us as a defence against
the more dangerous species.
**
Mammals have a narrow operating temperature. In contrast to machines, humans func-
Challenge 1332 d tion only if the internal temperature is within a narrow range. Why? Does this require-
ment also apply to extraterrestrials – provided they exist?
**
How did the first cell arise? This question is still open. However, researchers have found
the physics of pleasure 845
several substances that spontaneously form closed membranes in water. Such substances
also form foams. It might well be that life formed in foam. Other options discussed are
that life formed underwater, at the places where magma rises into the ocean. Elucidating
Challenge 1333 r the question is one of the great riddles of biology.
**
Challenge 1334 n Could life have arrived to Earth from outer space?
“ sphere?
Oscar Wilde (1854–1900) The Critic as Artist.
Pleasure is a quantum effect. The reason is simple. Pleasure comes from the senses. All ”
senses measure. And all measurements rely on quantum theory. The human body, like an
Page 600
distribution we have seen earlier on. The different chemicals in the three cone types (red,
green, blue) lead to different sensor speeds; this can be checked with the simple test shown
Ref. 830 in Figure 354. The images of the eye are only sharp if the eye constantly moves in small
random motions. If this motion is stopped, for example with chemicals, the images pro-
duced by the eye become unsharp.
The eye also contains 120 million highly sensitive general light intensity detectors, the
rods. This sensitivity difference is the reason that at night all cats are grey. Until recently,
human built light sensors with the same sensitivity as rods had to be helium cooled, be-
cause technology was not able to build sensors at room temperature as sensitive as the
human eye.
The touch sensors are distributed over the skin, with a surface density which varies
846 vii details of quantum theory • applied quantum mechanics
F I G U R E 354 The different speed of the eye’s colour sensors, the cones, lead
to a strange effect when this picture (in colour version) is shaken right to left
in weak light
* Taste sensitivity is not separated on the tongue into distinct regions; this is an incorrect idea that has been
copied from book to book for over a hundred years. You can perform a falsification by yourself, using sugar
Challenge 1335 n or salt grains.
the physics of pleasure 847
In summary, the sensors with which nature provides us are state of the art; their sens-
itivity and ease of use is the highest possible. Since all sensors trigger pleasure or help to
Ref. 833 avoid pain, nature obviously wants us to enjoy life with the most intense pleasure possible.
Studying physics is one way to do this.
There are two things that make life worth living:
”
The nerves and the brain
There is no such thing as perpetual tranquillity
* Victor Friedrich Weisskopf (b. 1908 Vienna, d. 2002 Cambridge), acclaimed theoretical physicist who
worked with Einstein, Born, Bohr, Schrödinger and Pauli. He catalysed the development of quantum elec-
trodynamics and nuclear physics. He worked on the Manhattan project but later in life intensely campaigned
against the use of nuclear weapons. During the cold war he accepted the membership in the Soviet Academy
of Sciences. He was professor at MIT and for many years director of CERN, in Geneva. He wrote several suc-
cessful physics textbooks. The author heard him making the above statement in Geneva, in 1981, during
one of his lectures.
** This is not in contrast with the fact that one or two whale species have brains with a slightly larger mass.
The larger mass is due to the protection these brains require against the high pressures which appear when
whales dive (some dive to depths of 1 km). The number of neurons in whale brains is considerably smaller
than in human brains.
848 vii details of quantum theory • applied quantum mechanics
Living clocks
L’horologe fait de la réclame pour le temps.*
“ Georges Perros
When we sing a musical note we are able to reproduce the original frequency with high ”
accuracy. We also know from everyday experience that humans are able to keep the beat
Ref. 834 to within a few per cent for a long time. When doing sport or when dancing, we are able
to keep the timing to high accuracy. (For shorter or longer times, the internal clocks are
not so precise.) All these clocks are located in the brain.
Brains process information. Also computers do this, and like computers, all brains
need a clock to work well. Every clock is made up of the same components. It needs an
oscillator determining the rhythm and a mechanism to feed the oscillator with energy.
In addition, every clock needs an oscillation counter, i.e. a mechanism that reads out
the clock signal, and a means of signal distribution throughout the system is required,
the human chromosomes. At every division, one molecule is lost. When the molecules
are all lost, the cell dies. Research into the mechanisms and the exceptions to this process,
such as cancer and sexual cells, is ongoing.
The basis of the monthly period in women is equally interesting and complex.
The most fascinating clocks are those at the basis of conscious time. Of these, the
brain’s stopwatch or interval timer, has been most intensely studied. Only recently was
its mechanism uncovered by combining data on human illnesses, human lesions, mag-
Ref. 840 netic resonance studies, and effects of specific drugs. The basic mechanism takes place in
the striatum in the basal ganglia of the brain. The striatum contains thousands of timer
cells with different periods. They can be triggered by a ‘start’ signal. Due to their large
* Clocks are ads for time.
the physics of pleasure 849
number, for small times of the order of one second, every time interval has a different
pattern across these cells. The brain can read these patterns and learn them. In this way
we can time music or specific tasks to be performed, for example, one second after a
signal.
Even though not all the clock mechanisms in humans are known, natural clocks share
a property with human-built clocks: they are limited by quantum mechanics. Even the
simple pendulum is limited by quantum theory. Let us explore the topic.
“ Karl Valentin.
In general relativity, we found that clocks do not exist, because there is no unit of time
that can be formed using the constants c and G. Clocks, like any measurement standard,
”
much as possible?
High precision implies high sensitivity to fluctuations. Now, all clocks have a motor
inside that makes them work. A high precision clock thus needs a high precision motor.
In all clocks, the position of the motor is read out and shown on the dial. The quantum
of action implies that a precise clock motor has a position indeterminacy. The clock pre-
cision is thus limited. Worse, like any quantum system, the motor has a small, but finite
probability to stop or to run backwards for a while.
You can check this prediction yourself. Just have a look at a clock when its battery is
almost empty, or when the weight driving the pendulum has almost reached the bottom
* ‘Also the future used to be better in the past.’ Karl Valentin (b. 1882 Munich, d. 1948 Planegg), German
author and comedian.
850 vii details of quantum theory • applied quantum mechanics
position. It will start doing funny things, like going backwards a bit or jumping back and
forward. When the clock works normally, this behaviour is strongly suppressed; however,
Challenge 1338 ny it is still possible, though with low probability. This is true even for a sundial.
In other words, clocks necessarily have to be macroscopic in order to work properly. A
clock must be as large as possible, in order to average out its fluctuations. Astronomical
systems are good examples. A good clock must also be well-isolated from the environ-
ment, such as a freely flying object whose coordinate is used as time variable, as is done
in certain optical clocks.
MA
ħ T
(578)
c2 τ τ
where T is the time to be measured. You might check that this condition directly requires
Challenge 1341 e that any clock must be macroscopic.
Let us play with the formula by Salecker and Wigner. It can be rephrased in the fol-
lowing way. For a clock that can measure a time t, the size l is connected to the mass m
by ¾
lA
ħt
. (579)
m
Ref. 837 How close can this limit be achieved? It turns out that the smallest clocks known, as well
as the clocks with most closely approach this limit, are bacteria. The smallest bacteria,
the mycoplasmas, have a mass of about 8 ë 10−17 kg, and reproduce every 100 min, with a
Copyright © Christoph Schiller November 1997–January 2007
precision of about 1 min. The size predicted from expression (579) is between 0.09 µm and
0.009 µm. The observed size of the smallest mycoplasmas is 0.3 µm. The fact that bacteria
can come so close to the clock limit shows us again what a good engineer evolution has
been.
Note that the requirement by Salecker and Wigner is not in contrast with the possibil-
ity to make the oscillator of the clock very small; researchers have built oscillators made
of a single atom. In fact, such oscillations promise to be the most precise human built
Ref. 836 clocks. But the oscillator is only one part of any clock, as explained above.
Page 848
In the real world, the clock limit can be tightened even more. The whole mass M can-
not be used in the above limit. For clocks made of atoms, only the binding energy between
atoms can be used. This leads to the so-called standard quantum limit for clocks; it limits
the physics of pleasure 851
their frequency ν by ¾
=
δν ∆E
(580)
ν E tot
where ∆E = ħ~T is the energy indeterminacy stemming from the finite measuring time
T and E tot = NE bind is the total binding energy of the atoms in the metre bar. So far,
the quantum limit has not yet been achieved for any clock, even though experiments are
getting close to it.
In summary, clocks exist only in the limit of ħ being negligible. In practice, the errors
made by using clocks and metre bars can be made as small as required; it suffices to make
the clocks large enough. Clock built into human brains comply with this requirement.
We can thus continue our investigation into the details of matter without much worry, at
least for a while. Only in the third part of our mountain ascent, where the requirements
for precision will be higher and general relativity will limit the size of physical systems,
We have seen that predictions of the future are made difficult by nonlinearities and the
divergence of from similar conditions; we have seen that many particles make it difficult
to predict the future due to the statistical nature of their initial conditions; we have seen
”
that quantum theory makes it often hard to fully determine initial states; we have seen
that not-trivial space-time topology can limit predictability; finally, we will discover that
black hole and similar horizons can limit predictability due to their one-way transmission
of energy, mass and signals.
If due to the quantum of action perfect clocks do not exist, is determinism still the
correct description of nature? Yes and no. We learned that all the mentioned limitations
of clocks can be overcome for limited time intervals; in practice, these time intervals can
be made so large that the limitations do not play a role in everyday life. As a result, in
quantum systems both determinism and time remain applicable, as long as we do not
Copyright © Christoph Schiller November 1997–January 2007
extend it to infinite space and time. However, when extremely large dimensions and in-
tervals need to be taken into account, quantum theory cannot be applied alone; in those
Page 1018 cases, general relativity needs to be taken into account.
“ Salvador Dalì
counterpart. It is true that decay, including the ageing of humans, can be observed clas-
sically; however, its origin is a pure quantum effect.
Experiments show that the prediction of decay, like that of scattering of particles, is
only possible on average, for a large number of particles or systems, never for a single
one. These results confirm the quantum origin of the process. In every decay process, the
superposition of macroscopically distinct states – in this case are those of a decayed and
an undecayed particle – is made to decohere rapidly by the interaction with the envir-
onment. Usually the ‘environment’ vacuum is sufficient to induce decoherence. As usual,
the details of the involved environment states are unknown for a single system and make
any prediction for a specific system impossible.
Decay, including that of radioactive nuclei, is influenced by the environment, even
in the case that it is ‘only’ the vacuum. The statement can be confirmed by experiment.
By enclosing a part of space between two conducting plates, one can change the degrees
of freedom of the vacuum contained between them. Putting an electromagnetically un-
Ṅ = −
N
where
τ
= S`ψ final SH int Sψ final eS .
1 2π 2
(581)
τ ħ
The decay is thus essentially an exponential one, independently of the details of the
physical process. In addition, the decay time τ depends on the interaction and on the
square modulus of the transition matrix element. This result was named the golden rule
by Fermi,* because it works so well despite being an approximation whose domain of
applicability is not easy to specify.
The golden rule leads to
Copyright © Christoph Schiller November 1997–January 2007
N(t) = N 0 e− τ .
t
(582)
Decay is thus predicted to follow an exponential law. For over half a century, all experi-
ments confirmed this result. On the other hand, when quantum theory is used to derive
Ref. 842 the golden rule, it is found that decay is exponential only in certain special systems. A
calculation that takes into account higher order terms predicts deviations from exponen-
tial decay for completely isolated systems: for short times, the decay rate should vanish;
* Originally, the golden rule is a statement from the christian bible, namely the sentence ‘Do to others what
you want them to do to you’.
the physics of pleasure 853
for long times, the decay rate should follow an algebraic – not an exponential – depend-
Ref. 843 ence on time, in some cases even with superimposed oscillations. After an intense exper-
imental search have deviations for short times been observed. The observation of devi-
ations at long times are rendered impossible by the ubiquity of thermal noise. It turns
out that decay is exponential only when the environment is noisy, the system made of
many weakly interacting particles, or both. Since this is usually the case, the exceptional
exponential decay becomes the (golden) rule in experiments.
“
Many thinkers advise to enjoy the present. As shown by perception research, what hu-
Ovidius
”
mans call ‘present’ has a duration of a few tenths of a second. This leads us to ask whether
given time. Zeno’s discussion was based on an extrapolation of classical physics into do-
mains where it is not valid any more. Every observation, like every photograph, implies
a time average: observations average interactions over a given time.** For a photograph,
the duration is given by the shutter time; for a measurement, the average is defined by
the details of the set-up. Whatever this set-up might be, the averaging time is never zero.
There is no ‘point-like’ instant of time that describes the present. The observed present is
always an average over a non-vanishing interval of time. The present has a duration in
nature.
– CS – more to be told – CS –
Copyright © Christoph Schiller November 1997–January 2007
The quantum Zeno effect was partially observed by Wayno Itano and his group in 1990,
Ref. 742 and definitively observed by Mark Raizen and his group, in 2001.
Ref. 743 In a fascinating prediction, Saverio Pascazio and his team have predicted that the
quantum Zeno effect can be used to realize X-ray tomography of objects with the low-
est radiation levels imaginable.
brain moves with over one million kilometres per hour through the cosmic background
radiation; we also observe that consciousness moves along with it.
The brain is a quantum system; it is based on molecules and electrical currents. The
changes in consciousness that appear when matter is taken away from the brain – in
operations or accidents – or when currents are injected into the brain – in accidents,
experiments or misguided treatments – have been described in great detail by the medical
profession. Also the observed influence of chemicals on the brain – from alcohol to hard
drugs – makes the same point. The brain is a quantum system.
Magnetic resonance imaging can detect which parts of the brain work when sensing,
remembering or thinking. Not only is sight, noise and thought processed in the brain;
we can follow the processing on computer screens. The other, more questionable experi-
mental method, positron tomography, works by letting people swallow radioactive sugar.
It confirms the findings on the location of thought and on its dependence on chemical
fuel. In addition, we already know that memory depends on the particle nature of matter.
a finite but moderate temperature, only because we are a mixture of liquids and solids,
only because we are electrically neutral, only because we are large compared to a black
hole of our same mass, only because we are large compared to our quantum mechanical
wavelength, only because we have a limited memory, only because our brain forces us to
approximate space and time by continuous entities, and only because our brain cannot
avoid describing nature as made of different parts. If any of these conditions were not
fulfilled we would not observe motion; we would have no fun studying physics.
In addition, we saw that we have these abilities only because our forefathers lived on
Earth, only because life evolved here, only because we live in a relatively quiet region of
our galaxy, and only because the human species evolved long after than the big bang.
If any of these conditions were not fulfilled, or if we were not animals, motion would
856 vii details of quantum theory • applied quantum mechanics
not exist. In many ways motion is thus an illusion, as Zeno of Elea had claimed. To say
the least, the observation of motion is due to the limitations of the human condition.
A complete description of motion and nature must take this connection into account.
Before we do that, we explore a few details of this connection.
**
Even at perfect darkness, the eye does not yield a black impression, but a slightly brighter
one, called eigengrau. This is a result of noise created inside the eye, probably triggered by
spontaneous decay of rhodopsin, or alternatively, by spontaneous release of neurotrans-
mitters.
Copyright © Christoph Schiller November 1997–January 2007
**
The high sensitivity of the ear can be used to hear light. To do this, take an empty 750 ml
jam glass. Keeping its axis horizontal, blacken the upper half of the inside with a candle.
The lower half should remain transparent. After doing this, close the jam glass with its
lid, and drill a 2 to 3 mm hole into it. If you now hold the closed jam glass with the hole
to your ear, keeping the black side up, and shining into it from below with a 50 W light
Challenge 1347 n bulb, something strange happens: you hear a 100 Hz sound. Why?
**
Most senses work already before birth. It is well-known that playing the violin to a preg-
chemistry – from atoms to dna 857
Figure to be included
nant mother every day during the pregnancy has an interesting effect. Even if nothing is
told about it to the child, it will become a violin player later on.
**
**
We saw that nature has indeed invented pleasure as a guide for a human behaviour. All of
biology builds on chemistry and on materials science. We have a short overview of both
fields.
main result is that in atoms with more than one electron, the various electron clouds form
spherical layers around the nucleus. The layers can be grouped into groups of related
clouds, called shells. For electrons outside a given shell, the nucleus and the inner shells,
the atomic core, can often be approximated as a single charged entity.
Shells are numbered from the inside out. This principal quantum number, usually writ-
ten n, is deduced and related to the quantum number that numbers the states in the
hydrogen atom.
Quantum theory shows that the first shell has room for two electrons, the second for
Ref. 847 * The correct statement is: the Dirac equation contains all of chemistry. The relativistic effects that distinguish
the two equations are necessary, for example, to understand why gold is yellow and does not like to react or
why mercury is liquid.
858 vii details of quantum theory • applied quantum mechanics
Figure to be included
8, the third for 18, and the general n-th shell for 2n 2 electrons. A way to picture this con-
Ref. 857 nection is shown in Figure 357. It is called the periodic table of the elements. The standard
Page 1197 way to show the table is shown in Appendix C.
B onds
When atoms approach each other, their electron clouds are deformed and mixed. The
reason for these changes is the combined influence of the two nuclei. The effect for these
changes is highest for the outermost electrons: they form chemical bonds.
The cloud overlap lead to a gain in energy. That is the reason that fire is hot. After
the energy has been released, a certain distance between the atoms appears, as shown in
Figure 358. A lower distance would lead to electrostatic repulsion between the atomic
cores.
Many atoms can bind to several neighbours. In this case, energy minimization leads
to specific bond angles, as shown in Figure 359. Do you remember those funny pictures
Ref. 858 of school chemistry about orbitals and dangling bonds? Well, dangling bonds can now be
seen. Several groups were able to image them using scanning force or scanning tunnelling
Ref. 859 microscopes.
The repulsion between the ºclouds of each bond explains why angle values near that of
Challenge 1348 e tetrahedral skeletons (2arctan 2 = 109.47°) are so common in molecules. For example,
the H-O-H angle in water molecules is 107°.
Copyright © Christoph Schiller November 1997–January 2007
n=1 1H 2 He
7N 8O
n=2 6C 3 Li 4 Be 9F
5B 10 Ne
25 Mn 26 Fe
24 Cr 15 P 16 S 27 Co
n=3 23 V 14 Si 11 Na 12 Mg 17 Cl 28 Ni
22 Ti 13 Al 18 Ar 29 Cu
21 Sc 30 Zn
64 Gd 65 Tb
n=4 63 Eu 43 Tc 44 Ru 66 Dy
62 Sm 42 Mo 33 As 34 Se 45 Rh 67 Ho
61 Pm 41 Nb 32 Ge 19 K 20 Ca 35 Br 46 Pd 68 Er
60 Nd 40 Zr 31 Ga 36 Kr 47 Ag 69 Tm
59 Pr 39 Y 48 Cd 70 Yb
58 Ce 71 Lu
107 Bh 108 Hs
106 Sg 83 Bi 84 Po 109 Mt
n=6 105 Db 82 Pb 55 Cs 56 Ba 85 At 110 Ds
104 Rf 81 Tl 86 Rn 111 Rg
89 Ac 112 Uub
s p d f
shell electron number
shell of last electron increases clockwise
Copyright © Christoph Schiller November 1997–January 2007
Figure to be included
human machine we are as adults. For example, DNA determines the hair colour, predis-
poses for certain illnesses, determines the maximum size one can grow to, and much
more. Of all known molecules, human DNA is thus most intimately related to human ex-
istence. The large size of the molecules is the reason that understanding its full structure
and its full contents is a task that will occupy scientists for several generations to come.
activated with electrical current or with chemicals. They are lightweight, quiet and simple
to manufacture. However, the first arm wrestling contest between human and artificial
muscles held in 2005 was won by a teenage girl. The race to do better is ongoing.
**
A cube of sugar does not burn. However, if you put some cigarette ash on top of it, it
burns. Why?
**
Challenge 1349 ny Why do organic materials burn at much lower temperature than inorganic materials?
**
materials science 861
An important aspect of life is death. When we die, conserved quantities like our energy,
momentum, angular momentum and several other quantum numbers are redistributed.
They are redistributed because conservation means that nothing is lost. What does all
Challenge 1350 ny this imply for what happens after death?
**
Chemical reactions can be slow but still dangerous. Spilling mercury on aluminium will
Copyright © Christoph Schiller November 1997–January 2007
lead to an amalgam that reduces the strength of the aluminium part after some time. That
is the reason that bringing mercury thermometers on aeroplanes is strictly forbidden.
Materials science
Did you know that one cannot use a boiled egg
“ as a toothpick?
Karl Valentin
It was mentioned several times that the quantum of action explains all properties of mat-
”
ter. Many researchers in physics, chemistry, metallurgy, engineering, mathematics and
biology have cooperated in the proof of this statement. In our mountain ascent we have
only a little time to explore this vast topic. Let us walk through a selection.
862 vii details of quantum theory • applied quantum mechanics
If a geologist takes a stone in his hands, he is usually able to give, within an error of a few
percent, the age of the stone. The full story forms a large part of geology, but the general
lines should be known to every physicist.
Every stone arrives in your hand through the rock cycle. The rock cycle is a process that
transforms magma from the interior of the Earth into igneous rocks, through cooling and
crystallization. Igneous rocks, such as basalt, can transform through erosion, transport
and deposition into sedimentary rocks. Either of these two rock types can be transformed
through high pressures or temperatures into metamorphic rocks, such as marble. Finally,
most rocks are generally – but not always – transformed back into magma.
The full rock cycle takes around 110 to 170 million years. For this reason, rocks that are
older than this age are much less common on Earth. Any stone is the product of erosion
materials science 863
of one of the rock types. A geologist can usually tell, simply by looking at it, the type of
rock it belongs to; if he sees the original environment, he can also give the age, without
any laboratory.
We see that many signals are able to penetrate a mountain. However, only sound or
radio waves provide the possibility to distinguish different materials, or to distinguish
solids from liquids and from air. In addition, any useful method requires a large number
of signal sources and of signal receptors, and thus a large amount of cash. Will there ever
be a simple method allowing to look into mountains as precisely as X-rays allow to study
human bodies? For example, will it ever be possible to map the interior of the pyramids?
Challenge 1351 n A motion expert like the reader should be able to give a definite answer.
One of the high points of twentieth century physics was the development of the best
method so far to look into matter with dimensions of about a metre or less: magnetic
Page 919 resonance imaging. We will discuss it later on.
The other modern imaging technique, ultrasound imaging, is getting more and more
criticized. It is much used for prenatal diagnostics of embryos. However, studies have
864 vii details of quantum theory • applied quantum mechanics
found that ultrasound produces extremely high levels of audible sound to the baby, espe-
cially when the ultrasound is switched on or off, and that babies react negatively to this
loud noise.
netic properties of matter at low temperatures. We know already that matter can not be
paramagnetic at lowest temperatures. It seems that matter is either ferromagnetic, dia-
magnetic or antiferromagnetic at lowest temperatures.
Page 868 More about superfluidity and superconductivity will be told below.
Phase Ty pe L ow t e m p e r at u r e b e - Example
h av i o u r
Solid conductor superconductivity lead
antiferromagnet chromium, MnO
ferromagnet iron
insulator diamagnet
Liquid bosonic Bose–Einstein condensation, i.e. su- 4 He
perfluidity
3
fermionic pairing, then BEC, i.e. superfluidity He
87
Gas bosonic Bose–Einstein condensation Rb, 7 Li, 23 Na, H,
4
He, 41 K
fermionic pairing, then Bose–Einstein con- 40 K, 6 Li
mountains can be higher. Indeed the highest mountain on Mars, Olympus mons, is 80 km
Challenge 1353 n high. Can you find a few other effects limiting mountain height?
**
Do you want to become rich? Just invent something that can be produced in the factory,
is cheap and can substitute duck feathers in bed covers, sleeping bags or in badminton
Challenge 1354 r shuttlecocks. Another industrial challenge is to find an artificial substitute for latex, and a
third one is to find a substitute for a material that is rapidly disappearing due to pollution:
cork.
**
Challenge 1355 ny What is the difference between solids, liquids and gases?
**
What is the difference between the makers of bronze age knifes and the builders of the Copyright © Christoph Schiller November 1997–January 2007
**
Robert Full has shown that van der Waals forces are responsible for the way that geckos
Ref. 851 walk on walls and ceilings. The gecko, a small reptile with a mass of about 100 g, uses an
elaborate structure on its feet to perform the trick. Each foot has 500 000 hairs each split in
up to 1000 small spatulae, and each spatula uses the van der Waals force (or alternatively,
capillary forces) to stick to the surface. As a result, the gecko can walk on vertical glass
walls or even on glass ceilings; the sticking force can be as high as 100 N per foot.
The same mechanism is used by jumping spiders (Salticidae). For example, Evarcha
arcuata have hairs at their feet which are covered by hundred of thousands of setules.
Ref. 852 Again, the van der Waals force in each setule helps the spider to stick on surfaces.
Researchers have copied these mechanisms for the first time in 2003, using microlitho-
graphy on polyimide, and hope to make durable sticky materials in the future.
**
**
Does the melting point of water depend on the magnetic field? This surprising claim was
Ref. 853 made in 2004 by Inaba Hideaki and colleagues. They found a change of 0.9 mK~T. It is
known that the refractive index and the near infrared spectrum of water is affected by
magnetic fields. Indeed, not everything about water might be known yet.
**
Plasmas, or ionized gases, are useful for many applications. Not only can they be used
for heating or cooking and generated by chemical means (such plasmas are variously
called fire or flames) but they can also be generated electrically and used for lighting Copyright © Christoph Schiller November 1997–January 2007
or deposition of materials. Electrically generated plasmas are even being studied for the
disinfection of dental cavities.
**
It is known that the concentration of CO 2 in the atmosphere between 1800 and 2005
Challenge 1359 ny has increased from 280 to 380 parts per million. (How would you measure this?) It is
known without doubt that this increase is due to human burning of fossil fuels, and not
to natural sources such as the oceans or volcanoes. There are three arguments. First of
all, there was a parallel decline of the 14 C~12 C ratio. Second, there was a parallel decline
of the 13 C~12 C ratio. Finally, there was a parallel decline of the oxygen concentration. All
three measurements independently imply that the CO 2 increase is due to the burning of
fuels, which are low in 14 C and in 13 C, and at the same time decrease the oxygen ratio.
868 vii details of quantum theory • applied quantum mechanics
Natural sources do not have these three effects. Since CO 2 is a major greenhouse gas, the
data implies that humans are also responsible for a large part of the temperature increase
during the same period.
**
The technologies to produce perfect crystals, without grain boundaries or dislocations,
are an important part of modern industry. Perfectly regular crystals are at the basis of the
integrated circuits used in electronic appliances, are central to many laser and telecom-
munication systems and are used to produce synthetic jewels.
Synthetic diamonds have now displaced natural diamonds in almost all applications.
In the last years, methods to produce large, white, jewel-quality diamonds of ten carats
Ref. 854 and more are being developed. These advances will lead to a big change in all the domains
that depend on these stones, such as the production of the special surgical knives used in
eye lens operation.
Quantum technolo gy
I were better to be eaten to death with a rust
Quantum effects do not appear only in microscopic systems. Several quantum effects are
important in modern life; transistors, lasers, superconductivity and a few other effects
and systems have shaped modern life in many ways.
” Copyright © Christoph Schiller November 1997–January 2007
play an essential role. That is the reason for the requirement of low temperature in such
experiments. Nevertheless, it took over 50 years to reach a full understanding of the ef-
Ref. 866 fect. This happened in 1957, when Bardeen, Cooper and Schrieffer published their results.
At low temperatures, electron behaviour is dominated by an attractive interaction that
makes them form pairs – today called Cooper pairs – that are effective bosons. And bo-
sons can all be in the same state, thus effectively moving without friction.
For superconductivity, the attractive interaction between electrons is due to the de-
formation of the lattice. Two electrons attract each other in the same way as two masses
attract each other due to deformation of the space-time mattress. However, in the case of
solids, the deformations are quantized. With this approach, Bardeen, Cooper and Schrief-
fer explained the lack of electric resistance of superconducting materials, their complete
diamagnetism (µ r = 0), the existence of an energy gap, the second-order transition to
normal conductivity at a specific temperature, and the dependence of this temperature
on the mass of the isotopes. Last but not least, they received the Nobel Prize in 1972.*
perfluid 3 He-B phase, vortices are quantized: vortices only exist in integer multiples of the
Ref. 860 elementary circulation h~2m3 He . Present research is studying how these vortices behave
and how they induce the transition form laminar to turbulent flows.
* For John Bardeen (1908–1991), this was his second, after he had got the first Nobel Prize in 1956, shared
with William Shockley and Walter Brattain, for the discovery of the transistor. The first Nobel Prize was a
problem for Bardeen, as he needed time to work on superconductivity. In an example to many, he reduced
the tam-tam around himself to a minimum, so that he could work as much as possible on the problem of
superconductivity. By the way, Bardeen is topped by Frederick Sanger and by Marie Curie. Sanger first won a
Nobel Prize in chemistry in 1958 by himself and then won a second one shared with Walter Gilbert in 1980;
Marie Curie first won one with her husband and a second one by herself, though in two different fields.
** They received the Nobel Prize in 1996 for this discovery.
*** Aage Bohr and Ben Mottelson received the Nobel Prize in 1975, Anthony Leggett in 2003.
870 vii details of quantum theory • applied quantum mechanics
Figure missing
In recent years, studying the behaviour of gases at lowest temperatures has become
very popular. When the temperature is so low that the de Broglie wavelength is compar-
Quantized conductivity
Ref. 861 In 1996, the Spanish physicist J.L. Costa–Krämer and his colleagues performed a simple
experiment. They put two metal wires on top of each other on a kitchen table and attached
a battery, a 10 kΩ resistor and a storage oscilloscope to them. Then they measured the elec-
trical current while knocking on the table. In the last millisecond before the wires detach,
the conductivity and thus the electrical current diminished in regular steps of a 7 µA, as
can easily be seen on the oscilloscope. This simple experiment could have beaten, if it
had been performed a few years earlier, a number of enormously expensive experiments
which discovered this quantization at costs of several million euro each, using complex
set-ups and extremely low temperatures.
In fact, quantization of conductivity appears in any electrical contact with a small cross-
section. In such situations the quantum of action implies that the conductivity can only
Challenge 1361 ny be a multiple of 2e 2 ~ħ 1~12 906 1~Ω. Can you confirm this result?
Copyright © Christoph Schiller November 1997–January 2007
Note that electrical conductivity can be as small as required; only quantized electrical
conductivity has the minimum value of 2e 2 ~ħ.
quantum of conductance
e2
S=n . (583)
ħ
The explanation is straightforward: it is the quantum analogue of the classical Hall ef-
fect, which describes how conductance varies with applied magnetic field. Von Klitzing
received the Nobel Prize for physics for the discovery, since the effect was completely
unexpected, allows a highly precise measurement of the fine structure constant and also
allows one to build detectors for the smallest voltage variations measurable so far.
Two years later, it was found that in extremely strong magnetic fields the conductance
Ref. 864 could vary in steps one third that size. Shortly afterwards, even stranger numerical frac-
Ref. 863 tions were also found. In a landmark paper, Robert Laughlin explained all these results by
assuming that the electron gas could form collective states showing quasiparticle excita-
tions with a charge e~3. This was confirmed 15 years later and earned him a Nobel price as
– CS – explanation to be inserted – CS –
What do we learn from this result? Systems in two dimensions have states which follow
different rules than systems in three dimensions. Can we infer something about quarks
from this result? Quarks are the constituents of protons and neutrons, and have charges
e~3 and 2e~3. At this point we need to stand the suspense, as no answer is possible; we
come back to this issue later on.
is a good estimate for the energy needed for production, can you estimate which lamp is
Challenge 1362 n the most friendly to the environment?
Nobody thought much about lamps, until Albert Einstein and a few other great phys-
icists came along, such as Theodore Maiman, Hermann Haken and several others that
got the Nobel Prize with their help. In 1916, Einstein showed that there are two types of
sources of light – or of electromagnetic radiation in general – both of which actually
‘create’ light. He showed that every lamp whose brightness is turned up high enough
will change behaviour when a certain intensity threshold is passed. The main mechan-
ism of light emission then changes from spontaneous emission to stimulated emission.
Nowadays such a special lamp is called a laser. (The letters ‘se’ in laser are an abbreviation
of ‘stimulated emission’.) After a passionate worldwide research race, in 1960 Maiman
872 vii details of quantum theory • applied quantum mechanics
TA B L E 64 A selection of lamps
Incandescent lamps
tungsten wire light bulbs, halogen lamps 25 lm~W 0.1 cent~lm 700 h
Gas discharge lamps
oil lamps, candle
neon lamps
mercury lamps 100 lm~W ... cent~lm ... h
metal halogenide lamps (ScI3 or ‘xenon’, NaI, 110 lm~W ... cent~lm ... h
DyI3 , HoI3 , TmI5 )
was the first to build a laser emitting visible light. (So-called masers emitting microwaves
were already known for several decades.) In summary, Einstein and the other physicists
showed that lasers are lamps which are sufficiently turned up. Lasers consist of some light
producing and amplifying material together with a mechanism to pump energy into it.
The material can be a gas, a liquid or a solid; the pumping process can use electrical
current or light. Usually, the material is put between two mirrors, in order to improve
the efficiency of the light production. Common lasers are semiconductor lasers (essen- Copyright © Christoph Schiller November 1997–January 2007
tially highly pumped LEDs or light emitting diodes), He–Ne lasers (highly pumped neon
lamps), liquid lasers (essentially highly pumped fire flies) and ruby lasers (highly pumped
luminescent crystals).
Lasers produce radiation in the range from microwaves and extreme ultraviolet. They
have the special property of emitting coherent light, usually in a collimated beam. There-
fore lasers achieve much higher light intensities than lamps, allowing their use as tools.
In modern lasers, the coherence length, i.e. the length over which interference can be ob-
served, can be thousands of kilometres. Such high quality light is used e.g. in gravitational
wave detectors.
People have become pretty good at building lasers. Lasers are used to cut metal sheets
up to 10 cm thickness, others are used instead of knives in surgery, others increase surface
hardness of metals or clean stones from car exhaust pollution. Other lasers drill holes in
quantum technology 873
teeth, measure distances, image biological tissue or grab living cells. Most materials can
be used to make lasers, including water, beer and whiskey.
Some materials amplify light so much that end mirrors are not necessary. This is the
case for nitrogen lasers, in which nitrogen, or simply air, is used to produce a UV beam.
Even a laser made of a single atom (and two mirrors) has been built; in this example,
Ref. 889 only eleven photons on average were moving between the two mirrors. Quite a small
lamp. Also lasers emitting light in two dimensions have been built. They produce a light
plane instead of a light beam.
Lasers have endless applications. Lasers read out data from compact discs (CDs), are
used in the production of silicon integrated circuits, and transport telephone signals; we
Page 259 already encountered lasers that work as loudspeakers. The biggest advances in recent
years came from the applications of femtosecond laser pulses. Femtosecond pulses gen-
erate high-temperature plasmas in the materials they propagate, even in air. Such short
pulses can be used to cut material without heating it, for example to cut bones in heart
Each photon interferes only with itself. Interference between two different photons
never occurs.
Often this statement is misinterpreted as implying that two separate photon sources can-
not interfere. It is almost unbelievable how this false interpretation has spread through
Ref. 884 the literature. Everybody can check that this statement is incorrect with a radio: two dis-
tant radio stations transmitting on the same frequency lead to beats in amplitude, i.e. to
wave interference. (This should not to be confused with the more common radio interfer-
ence, with usually is simply a superposition of intensities.) Radio transmitters are coher-
ent sources of photons, and any radio receiver shows that two such sources can indeed
interfere.
Copyright © Christoph Schiller November 1997–January 2007
In 1949, interference of two different sources has been demonstrated with microwave
beams. Numerous experiments with two lasers and even with two thermal light sources
Ref. 884 have shown light interference from the fifties onwards. Most cited is the 1963 experiment
by Magyar and Mandel; they used two ruby lasers emitting light pulses and a rapid shutter
camera to produce spatial interference fringes.
However, all these experimental results do not contradict the statement by Dirac. In-
deed, two photons cannot interfere for several reasons.
— Interference is a result of space-time propagation of waves; photons appear only when
* See the famous, beautiful but difficult textbook P.A.M. Dirac, The Principles of Quantum Mechanics,
Clarendon Press, Oxford, 1930, page 9.
874 vii details of quantum theory • applied quantum mechanics
the energy–momentum picture is used, mainly when interaction with matter takes
place. The description of space-time propagation and the particle picture are mutually
exclusive – this is one aspect of the complementary principle. Why does Dirac seem to
Ref. 885 * In 2002, the first holograms have been produced that made use of neutron beams.
quantum technology 875
**
Challenge 1364 r Will there ever be desktop laser engravers for 2000 euro?
**
Challenge 1365 r Will there ever be room-temperature superconductivity?
**
Challenge 1366 n Will there ever be teleportation of everyday objects?
**
One process that quantum physics does not allow is telephathy. An unnamed space
Copyright © Christoph Schiller November 1997–January 2007
agency found out in the Apollo 14 mission, when, during the flight to the moon, cosmo-
naut Edgar Mitchell tested telepathy as communication means. Unsurprisingly, he found
Ref. 867 that it was useless. It is not clear why NASA spent so much money for an experiment that
could have been performed for 1000 times less cost also in other ways.
**
Challenge 1367 d Will there ever be applied quantum cryptology?
**
Will there ever be printable polymer electronic circuits, instead of lithographically pat-
Challenge 1368 d terned silicon electronics as is common now?
876 vii details of quantum theory • quantum electrodynamics
T
he central concept that quantum theory introduces in the description of nature is
he idea of virtual particles. Virtual particles are short-lived particles; they owe
heir existence exclusively to the quantum of action. Because of the quantum of ac-
tion, they do not need to follow the energy-mass relation that special relativity requires of
normal, real particles. Virtual particles can move faster than light and can move backward
in time. Despite these strange properties, they have many observable effects.
Waves induce oscillations of ships because a ship absorbs energy from the waves.
When oscillating, the ship also emits waves. This happens mainly towards the two sides
of the ship. As a result, for a single ship, the wave emission has no net effect on its position.
Now imagine that two parallel ships oscillate in a long swell, with a wavelength much lar-
ger than the distance between the ships. Due to the long wavelength, the two ships will
oscillate in phase. The ships will thus not be able to absorb energy from each other. As a
result, the energy they radiate towards the outside will push them towards each other.
The effect is not difficult to calculate. The energy of a rolling ship is
E = mдh α 2 ~2 (584)
where α is the roll angle amplitude, m the mass of the ship and д = 9, 8 m~s2 the accelera-
quantum electrodynamics – the origin of virtual reality 877
tion due to gravity. The metacentric height h is the main parameter characterizing a ship,
especially a sailing ship; it tells with what torque the ship returns to the vertical when
inclined by an angle α. Typically, one has h =1.5 m.
When a ship is inclined, it will return to the vertical by a damped oscillation. A damped
oscillation is characterized by a period T and a quality factor Q. The quality factor is the
number of oscillations the system takes to reduce its amplitude by a factor e = 2.718. If the
quality factor Q of an oscillating ship and its oscillation period T are given, the radiated
power W is
W = 2π
E
. (585)
QT
We saw above that radiation pressure is W~c, where c is the wave propagation velocity.
For water waves, we have the famous relation
c=
дT
hα 2
ma = m2π 2 . (587)
QT 2
Inserting typical values such as Q = 2.5, T =10 s, α =0.14 rad and a ship mass of 700 tons,
we get about 1.9 kN. Long swells thus make ships attract each other. The intensity of the
attraction is comparatively small and can indeed be overcome with a rowing boat. On the
other hand, even the slightest wind will damp the oscillation amplitude and have other
effects that will avoid the observation of the attraction.
Sound waves or noise in air can have the same effect. It is sufficient to suspend two
Ref. 871 metal plates in air and surround them by loudspeakers. The sound will induce attraction
(or repulsion) of the plates, depending on whether the sound wavelength cannot (or can)
be taken up by the other plate.
In 1948, the Dutch physicist Hendrik Casimir made one of the most spectacular predic-
tions of quantum theory: he predicted a similar effect for metal plates in vacuum. Casimir,
who worked at the Dutch Electronics company Philips, wanted to understand why it was
Copyright © Christoph Schiller November 1997–January 2007
so difficult to build television tubes. Television screens are made by deposing small neut-
ral particles on glass, but Casimir observed that the particles somehow attracted each
other. Casimir got interested in understanding how neutral particles interact. During
these theoretical studies he discovered that two neutral mirrors (or metal plates) would
attract each other even in complete vacuum. This is the famous Casimir effect. Casimir
also determined the attraction strength between a sphere and a plate, and between two
spheres. In fact, all conducting bodies attract each other in vacuum, with a force depend-
ing on their geometry.
In all these situations, the role of the sea is taken by the zero-point fluctuations of the
electromagnetic field, the role of the ships by the mirrors. Casimir understood that the
space between two parallel mirrors, due to the geometrical constraints, had different zero-
878 vii details of quantum theory • quantum electrodynamics
point fluctuations than the free vacuum. Like two ships, the result would be the attraction
of the mirrors.
Casimir predicted that the attraction for two mirrors of mass m and surface A is given
by
ma π 3 ħc
= . (588)
A 120 d 4
The effect is a pure quantum effect; in classical electrodynamics, two neutral bodies do
not attract. The effect is small; it takes some dexterity to detect it. The first experimental
Ref. 872 confirmation was by Derjaguin, Abrikosova and Lifshitz in 1956; the second experimental
Ref. 873 confirmation was by Marcus Sparnaay, Casimir’s colleague at Philips, in 1958. Two beauti-
ful high-precision measurements of the Casimir effect were performed in 1997 by Lamor-
Ref. 874 eaux and in 1998 by Mohideen and Roy; they confirmed Casimir’s prediction with a pre-
cision of 5 % and 1 % respectively. (Note that at very small distances, the dependence is
The Banach–Tarski paradox for vacuum Copyright © Christoph Schiller November 1997–January 2007
The Casimir effect, as well as other experiments, imply that there is a specific and finite
energy density that can be ascribed to the vacuum. The statement has a dramatic con-
sequence: space-time cannot be continuous.
The reasoning is simple. If the vacuum were continuous, we could make use of the
Page 55 Banach–Tarski paradox and split, without any problem, a ball of vacuum into two balls
of vacuum, each with the same volume. In other words, one ball with energy E could
not be distinguished from two balls of energy 2E. However, this mathematical trick is
impossible in nature.
Indeed, a finite vacuum energy density implies the existence of an energy cut-off, and
thus a length cut-off. In short, in the same way that we used the Banach–Tarski paradox
Page 224 to show that chocolate (or any other matter) possesses an intrinsic length, we can now
quantum electrodynamics – the origin of virtual reality 879
deduce that the vacuum does so as well. And if vacuum has an intrinsic scale, it is not
continuous in some sense. Therefore, in the same way that matter turned out not to be
continuous, vacuum is not either. A continuous vacuum would have no length scale and
no finite energy density. We will have to wait for the third part of the text to find out more
Page 1020 details. There, the structure of the vacuum will turn out to be even more interesting than
that of matter.
trons from each other. However, this is not correct. The momentum of virtual photons
does not have to be in the direction of its energy flow; it can also be in opposite direc-
tion.* Obviously, this is only possible within the limits provided by the indeterminacy
principle.
Vacuum energy
The strangest result of quantum field theory is the energy density of the vacuum.
* One of the most beautiful booklets on quantum electrodynamics which makes this point remains the text
by Richard Feynman, QED: the Strange Theory of Light and Matter, Penguin Books, 1990.
880 vii details of quantum theory • quantum electrodynamics
– CS – More to be written – CS –
Moving mirrors
Mirrors also work when in motion; in contrast, walls that produce echoes do not work at
all speeds. Walls do not produce echoes if one moves faster than sound. However, mirrors
always produce an image. This observation shows that the speed of light is the same for
Challenge 1372 n any observer. Can you detail the argument?
Mirrors also differ from tennis rackets. We saw that mirrors cannot be used to change
the speed of the light they hit, in contrast to what tennis rackets can do with balls. This
observation shows that the speed of light is also a limit velocity. In short, the simple ex-
istence of mirrors is sufficient to derive special relativity.
But there are more interesting things to be learned from mirrors. We only have to
ask whether mirrors work when they undergo accelerated motion. This issue yields a
surprising result.
A α ( ) ( )
973 4 ħ 2 ħω 6
(589)
10 125π me c me c 2
Copyright © Christoph Schiller November 1997–January 2007
for the case that the energy ħω of the photon is much smaller than the rest energy m e c 2
of the electron. This value is about 18 orders of magnitude smaller than what was meas-
urable in 1999; the future will show whether the effect can be observed for visible light.
However, for high energy photons these effects are observed daily in particle accelerators.
In these cases one observes not only interaction through virtual electron–antielectron
pairs, but also through virtual muon–antimuon pairs, virtual quark–antiquark pairs, and
much more.
Everybody who consumes science fiction knows that matter and antimatter annihilate
and transform into pure light. In more detail, a matter particle and an antimatter particle
annihilate into two or more photons. More interestingly, quantum theory predicts that
the opposite process is also possible: photons hitting photons can produce matter!
Ref. 878 In 1997, this was also confirmed experimentally. At the Stanford particle accelerator,
quantum electrodynamics – the origin of virtual reality 881
photons from a high energy laser pulse were bounced off very fast electrons. In this way,
the reflected photons acquired a large energy, when seen in the inertial frame of the ex-
perimenter. The original pulse, of 527 nm or 2.4 eV green light, had a peak power density
of 1022 W~m2 , about the highest achievable so far. That is a photon density of 1034 ~m3
and an electric field of 1012 V~m, both of which were record values at the time. When
this laser pulse was reflected off a 46.6 GeV electron beam, the returning photons had an
Challenge 1374 ny energy of 29.2 GeV and thus had become intense gamma rays. These gamma rays then
collided with still incoming green photons and produced electron–positron pairs by the
reaction
γ 29.2 + n γ green e+ + e− (590)
for which both final particles were detected by special apparatuses. The experiment thus
showed that light can hit light in nature, and above all, that doing so can produce matter.
This is the nearest one can get to the science fiction idea of light swords or of laser swords
`d 2 e = ħ~mt (591)
increasing linearly with time; however, the extremely small displacements produced this
way seem out of experimental reach so far. But the result is not a surprise. Are you able
Copyright © Christoph Schiller November 1997–January 2007
namely
v= = lim = lim ∆x
dx ∆x 1
, (592)
dt ∆t 0 ∆t ∆t 0 ∆t
one gets an infinity as intermediate step. Indeed, dt being vanishingly small, one could
argue that one is dividing by zero. Both arguments show the difficulty to accept that the
result of a limit process can be a finite quantity even if infinite quantities appear in it. The
parallel with the definition of the velocity is closer than it seems; both ‘infinities’ stem
from the assumption that space-time is continuous, i.e. infinitely divisible. The infinities
necessary in limit processes for the definition of differentiation, of integration or for the
renormalization scheme appear only when space-time is approximated as a complete set,
or as physicists say, as a ‘continuous’ set.
On the other hand, the conviction that the appearance of an infinity might be a sign
of incompleteness of a theory was an interesting development in physics. It shows how
uncomfortable many physicists had become with the use of infinity in our description of
**
Quantum electrodynamics explains why there are only a finite number of different atom
Ref. 886 types. In fact, it takes only two lines to prove that pair production of electron–antielectron
pairs make it impossible that a nucleus has more than about 137 protons. Can you show
Challenge 1377 n this? The effect at the basis of this limit, the polarization of the vacuum, also plays a role
Page 897 in much larger systems, such as charged black holes, as we will see shortly.
* Not long after his death, his wish has been fulfilled, although in a different manner that he envisaged. The
third part of this mountain ascent will show the way out of the issue.
curiosities and fun challenges of quantum electrodynamics 883
cards
or
bricks l
table
**
Is there a critical magnetic field in nature, like there is a critical electric field, limited by
Challenge 1378 ny spontaneous pair production?
**
In classical physics, the field energy of a point-like charged particle, and hence its mass,
Page 614 was predicted to be infinite. QED effectively smears out the charge of the electron over its
Compton wavelength, so that in the end the field energy contributes only a small correc-
Challenge 1379 ny tion to its total mass. Can you confirm this?
** Copyright © Christoph Schiller November 1997–January 2007
Microscopic evolution can be pretty slow. Light, especially when emitted by single atoms,
is always emitted by some metastable state. Usually, the decay times, being induced by
the vacuum fluctuations, are much shorter than a microsecond. However, there are meta-
stable atomic states with a lifetime of ten years: for example, an ytterbium ion in the 2 F7~2
state achieves this value, because the emission of light requires an octupole transition, in
Ref. 888 which the angular momentum changes by 3ħ; this is an extremely unlikely process.
**
Microscopic evolution can be pretty fast. Can you imagine how to deduce or to measure
Challenge 1380 n the speed of electrons inside atoms? And inside metals?
**
884 vii details of quantum theory • quantum electrodynamics
Take a horseshoe. The distance between the two ends is not fixed, since otherwise their
position and velocity would be known at the same time, contradicting the indeterminacy
relation. Of course, this reasoning is also valid for any other solid object. In short, both
quantum mechanics and special relativity show that rigid bodies do not exist, albeit for
different reasons.
**
Have you ever admired a quartz crystal or some other crystalline material? The beautiful
shape and atomic arrangement has formed spontaneously, as a result of the motion of
atoms under high temperature and pressure, during the time that the material was deep
under the Earth’s surface. The details of crystal formation are complex and interesting.
For example, are regular crystal lattices energetically optimal? This simple question
leads to a wealth of problems. We might start with the much simpler question º whether a
Challenge 1381 n regular dense packing of spheres is the most dense possible. Its density is π~ 18 , i.e. a bit
Ref. 893 Fink in 1984. He studied rhenium atoms on tungsten surfaces and showed, as observed,
that they cannot form dimers – two atoms moving closeby – but readily form trimers.
This is an example contradicting classical physics; the effect is impossible if one pictures
atoms as immutable spheres, but becomes possible when one remembers that the electron
clouds around the atoms rearrange depending on their environment.
For an exact study of crystal energy, the interactions between all atoms have to be
included. The simplest question is to determine whether a regular array of alternatively
charged spheres has lower energy than some irregular collection. Already such simple
questions are still topic of research; the answer is still open.
Another question is the mechanism of face formation in crystals. Can you confirm
Challenge 1382 n that crystal faces are those planes with the slowest growth speed, because all fast growing
planes are eliminated? The finer details of the process form a complete research field in
curiosities and fun challenges of quantum electrodynamics 885
**
In 1997, a Czech group built a quantum version of the Foucault pendulum, using the
Ref. 897 superfluidity of helium. In this beautiful piece of research, they cooled a small ring of fluid
helium below the temperature of 0.28 K, below which the helium moves without friction.
In such situations it thus can behave like a Foucault pendulum. With a clever arrangement,
it was possible to measure the rotation of the helium in the ring using phonon signals, and
to show the rotation of the Earth.
**
If an electrical wire is sufficiently narrow, its electrical conductance is quantized in steps
of 2e 2 ~ħ. The wider the wire, the more such steps are added to its conductance. Can you
Challenge 1383 n explain the effect? By the way, quantized conductance has also been observed for light
886 vii details of quantum theory • quantum electrodynamics
**
An example of modern research is the study of hollow atoms, i.e. atoms missing a number
of inner electrons. They have been discovered in 1990 by J.P. Briand and his group. They
appear when a completely ionized atom, i.e. one without any electrons, is brought in
contact with a metal. The acquired electrons then orbit on the outside, leaving the inner
shells empty, in stark contrast with usual atoms. Such hollow atoms can also be formed
Ref. 899 by intense laser irradiation.
**
In the past, the description of motion with formulae was taken rather seriously. Before
computers appeared, only those examples of motion were studied which could be de-
scribed with simple formulae. It turned out that Galilean mechanics cannot solve the
**
Photons not travelling parallel to each other attract each other through gravitation and
thus deflect each other. Could two such photons form a bound state, a sort of atom of
light, in which they would circle each other, provided there were enough empty space for
Challenge 1385 n this to happen?
**
Challenge 1386 n Can the universe ever have been smaller than its own Compton wavelength?
In fact, quantum electrodynamics, or QED, provides a vast number of curiosities and
every year there is at least one interesting new discovery. We now conclude the theme
with a more general approach. Copyright © Christoph Schiller November 1997–January 2007
How can one move on perfect ice? – The ultimate physics test
In our quest, we have encountered motion of many sorts. Therefore, the following test –
not to be taken too seriously – is the ultimate physics test, allowing to check your under-
standing and to compare it with that of others.
Imagine that you are on a perfectly frictionless surface and that you want to move to
its border. How many methods can you find to achieve this? Any method, so tiny its effect
may be, is allowed.
Classical physics provided quite a number of methods. We saw that for rotating
ourselves, we just need to turn our arm above the head. For translation motion, throwing
a shoe or inhaling vertically and exhaling horizontally are the simplest possibilities. Can
summary of quantum electrodynamics 887
Challenge 1387 n you list at least six additional methods, maybe some making use of the location of the
surface on Earth? What would you do in space?
Electrodynamics and thermodynamics taught us that in vacuum, heating one side of
the body more than the other will work as motor; the imbalance of heat radiation will
push you, albeit rather slowly. Are you able to find at least four other methods from these
Challenge 1388 n two domains?
General relativity showed that turning one arm will emit gravitational radiation un-
Challenge 1389 n symmetrically, leading to motion as well. Can you find at least two better methods?
Quantum theory offers a wealth of methods. Of course, quantum mechanics shows
that we actually are always moving, since the indeterminacy relation makes rest an im-
possibility. However, the average motion can be zero even if the spread increases with
time. Are you able to find at least four methods of moving on perfect ice due to quantum
Challenge 1390 n effects?
Materials science, geophysics, atmospheric physics and astrophysics also provide ways to
starts with elementary particles, characterized by their mass, their spin and their charge,
and with the vacuum, essentially a sea of virtual particle–antiparticle pairs. Interactions
between charged particles are described as the exchange of virtual photons, and decay is
described as the interaction with the virtual photons of the vacuum.
All physical results of QED can be calculated by using the single diagram of Figure 366.
It contains the Lagrangian and the Hamiltonian of QED, as well as a method to calculate
results for actual systems. As QED is a perturbative theory, the diagram directly describes
the first order effects and its composites describe effects of higher order. QED is a perturb-
ative theory.
QED describes all everyday properties of matter and radiation. It describes the divis-
ibility down to the smallest constituents, the isolability from the environment and the
888 vii details of quantum theory • quantum electrodynamics
radiation:
q=0
matter: s=1
q=e
s = 1/2
interaction:
α = 1/137.0359...
Σ E=0
Σ p=0
Σ s=0
Σ q=0
F I G U R E 366 QED as perturbation theory
impenetrability of matter. It also describes the penetrability of radiation. All these proper-
ties are due to electromagnetic interactions of constituents and follow from Figure 366.
Matter is divisible because the interactions are of finite strength, matter is separable be-
cause the interactions are of finite range, and matter is impenetrable because interactions
among the constituents increase in intensity when they approach each other, in particu-
lar because matter constituents are fermions. Radiation is divisible into photons, and is
penetrable because photons are bosons and first order photon-photon interactions do
not exist.
Both matter and radiation are made of elementary constituents. These elementary
constituents, whether bosons or fermions, are indivisible, isolable, indistinguishable, and
point-like.
To describe observations, it is necessary to use quantum electrodynamics in all those
situations for which the characteristic dimensions d are of the order of the Compton
wavelength
d λC =
h
. (593)
mc
In situations where the dimensions are of the order of the de Broglie wavelength, or equi-
Copyright © Christoph Schiller November 1997–January 2007
valently, where the action is of the order of the Planck value, simple quantum mechanics
is sufficient:
d λ dB =
h
. (594)
mv
For larger dimensions, classical physics will do.
Together with gravity, quantum electrodynamics explains almost all observations of
motion on Earth; QED unifies the description of matter and radiation in daily life. All
objects and all images are described by it, including their properties, their shape, their
transformations and their other changes. This includes self-organization and chemical
or biological. In other words, QED gives us full grasp of the effects and the variety of
motion due to electromagnetism.
summary of quantum electrodynamics 889
based on the idea that measurable quantities can be multiplied and added. This always
happens at one space-time point. In mathematical jargon, observables form a local al-
gebra. Thus the structure of an algebra contains, implies and follows from the idea that
local properties lead to local properties. We will discover later on that this basic assump-
tion is wrong at high energies.
We defined special relativity using v D c, general relativity using L~M E 4G~c 2 and
Challenge 1394 r quantum theory using S E ħ~2. How can we define electromagnetism in one statement?
This is not known yet.
* On the other hand, there is beautiful work going on how humans move their limbs; it seems that humans
Ref. 900 move by combining a small set of fundamental motions.
890 vii details of quantum theory • 27. including gravitation
length l
height h
single crystal
neutron beam silicon mirrors
Many other open issues of more practical nature have not been mentioned. Indeed,
by far the largest numbers of physicists get paid for some form of applied QED. However,
our quest is the description of the fundamentals of motion. So far, we have not achieved
δφ =
mдhl
(595)
ħv
where l is the distance of the two climbs and v and m are the speed and mass of the
Ref. 907 neutrons. These beautifully simple experiments have confirmed the formula within ex-
perimental errors.*
Ref. 909 In the 1990s, similar experiments have even been performed with complete atoms.
* Due to the influence of gravity on phases of wave functions, some people who do not believe in bath in-
duced decoherence have even studied the influence of gravity on the decoherence process of usual quantum
Ref. 908 systems in flat space-time. Predictably, the calculated results do not reproduce experiments.
quantum mechanics with gravitation – the first approach 891
These set-ups allow to build interferometers so sensitive that local gravity д can be meas-
ured with a precision of more than eight significant digits.
R0 4 t0 4
(N b ) = 10244
3
(596)
l Pl t Pl
in which N b = 1081 and t 0 = 1.2 ë 1010 a were used. There is no known reason why the
number of baryons and the horizon size R 0 should be related in this way. This coincidence
is equivalent to the one originally stated by Dirac,* namely
ħ2
m p3 . (598)
Gct 0
where m p is the proton mass. This approximate equality seems to suggest that certain
Copyright © Christoph Schiller November 1997–January 2007
microscopic properties, namely the mass of the proton, is connected to some general
properties of the universe as a whole. This has lead to numerous speculations, especially
since the time dependence of the two sides differs. Some people even speculate whether
Ref. 933 * The equivalence can be deduced using Gn b m p = 1~t 02 , which, as Weinberg explains, is required by several
Page 1065 cosmological models. Indeed, this can be rewritten simply as
m 0 ~R 0 m Pl ~R Pl = c ~G .
2 2 2 2 4 2
(597)
Together with the definition of the baryon density n b = N b ~R 03 one gets Dirac’s large number hypothesis,
¼
»
substituting protons for pions. Note that the Planck time and length are defined as ħG~c 5 and ħG~c 3
and are the natural units of length and time. We will study them in detail in the third part of the mountain
ascent.
892 vii details of quantum theory • 27. including gravitation
relations (596) or (598) express some long-sought relation between local and global to-
Ref. 934 pological properties of nature. Up to this day, the only correct statement seems to be that
they are coincidences connected to the time at which we happen to live, and that they
should not be taken too seriously.
Limits to disorder
Die Energie der Welt ist constant.
Page 479
We have already encountered the famous statement by Clausius, the father of the term
‘entropy’. Strangely, for over hundred years nobody asked whether there actually exists a
theoretical maximum for entropy. This changed in 1973, when Jakob Bekenstein found
”
Ref. 911 the answer while investigating the consequences gravity has for quantum physics. He
found that the entropy of an object of energy E and size L is bound by
S D EL
kπ
(599)
ħc
Copyright © Christoph Schiller November 1997–January 2007
for all physical systems. In particular, he deduced that (nonrotating) black holes saturate
Challenge 1397 n the bound, with an entropy given by
kc 3 A kG
S= = 4πM 2 (600)
Għ 4 ħc
where A is now the area of the horizon of the black hole. It is given by A = 4πR 2 =
4π(2GM~c 2 )2 . In particular, the result implies that every black hole has an entropy. Black
holes are thus disordered systems described by thermostatics. Black holes are the most
* The energy of the universe is constant. Its entropy tends towards a maximum.
quantum mechanics with gravitation – the first approach 893
T=a
ħ
. (601)
2πkc
The result means that there is no vacuum on Earth, because any observer on its surface
can maintain that he is accelerated with 9.8 m~s2 , thus leading to T = 40 zK! We can thus
measure gravity, at least in principle, using a thermometer. However, even for the largest
practical accelerations the temperature values are so small that it is questionable whether
Ref. 915 the effect will ever be confirmed experimentally. But if it will, it will be a great experiment.
When this effect was predicted, people studied the argument from all sides. For ex-
ample, it was then found that the acceleration of a mirror leads to radiation emission!
Mirrors are thus harder to accelerate than other bodies of the same mass.
When the acceleration is high enough, also matter particles can be detected. If a Copyright © Christoph Schiller November 1997–January 2007
particle counter is accelerated sufficiently strongly across the vacuum, it will start count-
ing particles! We see that the difference between vacuum and matter becomes fuzzy at
large energies.
* The precise discussion that black holes are the most disordered systems in nature is quite subtle. It is
Ref. 912 summarized by Bousso. Bousso claims that the area appearing in the maximum entropy formula cannot be
taken naively as the area at a given time, and gives four arguments why this should be not allowed. However,
all four arguments are wrong in some way, in particular because they assume that lengths smaller than the
Planck length or larger than the universe’s size can be measured. Ironically, he brushes aside some of the
arguments himself later in the paper, and then deduces an improved formula, which is exactly the same as
the one he criticizes first, just with a different interpretation of the area A. In short, the expression of black
hole entropy is the maximum entropy for a physical system with surface A.
894 vii details of quantum theory • 27. including gravitation
For completeness, we mention that also an observer in rotational motion detects radi-
ation following expression (601).
In the 1980s, Unruh and Wald showed that black hole radiation is precisely the for-
gotten effect that puts everything right. Because of black hole radiation, the box feels
buoyancy, so that it cannot be lowered down to the horizon. It floats somewhat above
it, so that the heat radiation inside the box has not yet zero energy when it falls out of
the opened box. As a result, the black hole does increase in mass and thus in entropy. In
summary, when the empty box is pulled up again, the final situation is thus the following:
only part of the energy of the heat radiation has been converted into mechanical energy,
part of the energy went into the increase of mass and thus of entropy of the black hole.
The second principle of thermodynamics is saved.
Well, it is only saved if the heat radiation has precisely the right energy density at
the horizon and above. Let us have a look. The centre of the box can only be lowered
up to a hovering distance d above the horizon, where the acceleration due to gravity is
д = c 2 ~4GM. The energy E gained by lowering the box is
The efficiency of the process is η = E~mc 2 . To be consistent with the second law of ther-
modynamics, this efficiency must obey
η= =1−
E TBH
2
e (603)
mc T
We thus find a black hole temperature TBH given by the hovering distance d. That hov-
ering distance d is roughly given by the size of the box. The box size in turn must be at
least the wavelength of the thermal radiation; in first approximation, Wien’s relation gives
d ħc~kT. A precise calculation, first performed by Hawking, gives the result
ħc 3 c4
TBH = = = дsurf =
ħc 1 ħ
дsurf with (604)
8πkGM 4πk R 2πkc 4GM
where R and M are the radius and the mass of the black hole. It is either called the black-
hole temperature or Bekenstein-Hawking temperature. As an example, a black hole with
the mass of the Sun would have the rather small temperature of 62 nK, whereas a smaller
black hole with the mass of a mountain, say 1012 kg, would have a temperature of 123 GK.
Copyright © Christoph Schiller November 1997–January 2007
That would make quite a good oven. All known black hole candidates have masses in the
range from a few to a few million solar masses. The radiation is thus extremely weak, the
reason being that the emitted wavelength is of the order of the black hole radius, as you
Challenge 1400 ny might want to check. The radiation emitted by black holes is often also called Bekenstein-
Ref. 917 Hawking radiation.
Black hole radiation is thus so weak that we can speak of an academic effect. It leads
Challenge 1401 ny to a luminosity that increases with decreasing mass or size as
c 6 ħ π2
L 2 L = nAσ T 4 = n
1 1
or (605)
M 2 R G 2 M 2 15 ë 27
896 vii details of quantum theory • 27. including gravitation
Φdq
Second principle entropy never decreases: surface area never de-
dS E 0 creases: dA E 0
Third principle T = 0 cannot be achieved a = 0 cannot be achieved
20 480 π G 2
t = M3 M 3 3.4 ë 10−16 s~kg3 (606)
ħc 4
as function of their initial mass M. For example, a black hole with mass of 1 g would have
a lifetime of 3.4 ë 10−25 s, whereas a black hole of the mass of the Sun, 2.0 ë 1030 kg, would Copyright © Christoph Schiller November 1997–January 2007
have a lifetime of about 1068 years. Obviously, these numbers are purely academic. In any
case, black holes evaporate. However, this extremely slow process for usual black holes
determines their lifetime only if no other, faster process comes into play. We will present
a few such processes shortly. Hawking radiation is the weakest of all known effects. It is
not masked by stronger effects only if the black hole is non-rotating, electrically neutral
and with no matter falling into it from the surroundings.
So far, none of these quantum gravity effects has been confirmed experimentally, as the
values are much too small to be detected. However, the deduction of a Hawking temper-
Ref. 919 ature has been beautifully confirmed by a theoretical discovery of Unruh, who found that
there are configurations of fluids in which sound waves cannot escape, so-called ‘silent
holes’. Consequently, these silent holes radiate sound waves with a temperature satisfying
the same formula as real black holes. A second type of analogue system, namely optical
bl ack holes aren’t bl ack 897
c5
L < L Pl = = 0.9 ë 1052 W ,
Copyright © Christoph Schiller November 1997–January 2007
(607)
4G
Challenge 1404 e as you might want to check yourself. In short, the sources of gamma ray bursts are the
Ref. 924 biggest bombs found in the universe. In fact, more detailed investigations of experimental
data confirm that gamma ray bursts are ‘primal screams’ of black holes in formation.
With all this new data, Ruffini took up his 1975 model again in 1997 and with his col-
laborators showed that the gamma ray bursts generated by the annihilation of electron-
positrons pairs created by vacuum polarization, in the region they called the dyadosphere,
have a luminosity and a duration exactly as measured, if a black hole of about a few up to
30 solar masses is assumed. Charged black holes therefore reduce their charge and mass
* For more about this fascinating topic, see the http://www.aip.de/~jcg/grb.html website by Jochen Greiner.
898 vii details of quantum theory • 27. including gravitation
+90
+180 -180
-90
F I G U R E 369 The location of the 2704 gamma ray bursts observed in the
sky between 1991 and 2000 by the BATSE satellite; the galactic plane is
horizontal (NASA)
3c 6
ρ=
1
(608)
M 2 32πG 3
and can be quite low for large black holes. For the highest black holes known, with 1000
Challenge 1405 e million solar masses or more, the density is of the order of the density of air. Nevertheless,
even in this case, the density is the highest possible in nature for that mass.
By the way, the gravitational acceleration at the horizon is still appreciable, as it is given
by
Challenge 1406 ny which is still 15 km~s2 for an air density black hole.
Obviously, the black hole temperature is related to the entropy S by its usual definition
= V = W
1 ∂S ∂S
T ∂E ρ ∂(Mc 2 ) ρ
(610)
All other thermal properties can be deduced by the standard relations from thermostatics.
In particular, it looks as if black holes are the matter states with the largest possible
Challenge 1407 ny entropy. Can you confirm this statement?
It also turns out that black holes have a negative heat capacity: when heat is added, they
cool down. In other words, black holes cannot achieve equilibrium with a bath. This is
not a real surprise, since any gravitationally bound material system has negative specific
heat. Indeed, it takes only a bit of thinking to see that any gas or matter system collapsing
Challenge 1408 ny under gravity follows dE~dR A 0 and dS~dR A 0. That means that while collapsing, the
Challenge 1409 n energy and the entropy of the system shrink. (Can you find out where they go?) Since
temperature is defined as 1~T = dS~dE, temperature is always positive; from the temper-
ature increase dT~dR < 0 during collapse one deduces that the specific heat dE~dT is
Copyright © Christoph Schiller November 1997–January 2007
low entropy – object is thrown into the Sun, the information contained is radiated away.
The information is contained in some slight deviations from black hole radiation, namely
in slight correlations between the emitted radiation emitted over the burning time of the
Sun. A short calculation, comparing the entropy of a room temperature book and the
information contained in it, shows that these effects are extremely small and difficult to
measure.
Ref. 930 A clear exposition of the topic was given by Don Page. He calculated what information
would be measured in the radiation if the system of black hole and radiation together
would be in a pure state, i.e. a state containing specific information. The result is simple.
Even if a system is large – consisting of many degrees of freedom – and in pure state, any
smaller subsystem nevertheless looks almost perfectly thermal. More specifically, if a total
system has a Hilbert space dimension N = nm, where n and m D n are the dimensions
bl ack holes aren’t bl ack 901
of two subsystems, and if the total system is in a pure state, the subsystem m would have
Challenge 1411 ny an entropy S m given by
mn
Sm =
1−m 1
+ Q (611)
2n k=n+1 k
S m = ln m − mQ1.
m
for (612)
2n
To discuss the result, let us think of n and m as counting degrees of freedom, instead of
Hilbert space dimensions. The first term in equation (612) is the usual entropy of a mixed
state. The second term is a small deviation and describes the amount of specific informa-
tion contained in the original pure state; inserting numbers, one finds that it is extremely
small compared to the first. In other words, the subsystem m is almost indistinguishable
More paradoxes
A black hole is a macroscopic object, similar to a star. Like all objects, it can interact with
its environment. It has the special property to swallow everything that falls into them.
This immediately leads us to ask if we can use this property to cheat around the usual
everyday ‘laws’ of nature. Some attempts have been studied in the section on general
Page 493 relativity and above; here we explore a few additional ones.
Copyright © Christoph Schiller November 1997–January 2007
**
Apart from the questions of entropy, we can look for methods to cheat around conserva-
Challenge 1412 ny tion of energy, angular momentum, or charge. Every Gedanken experiment comes to the
same conclusions. No cheats are possible; in addition, the maximum number of degrees
of freedom in a region is proportional to the surface area of the region, and not to its
volume. This intriguing result will keep us busy for quite some time.
**
A black hole transforms matter into antimatter with a certain efficiency. Thus one might
Challenge 1413 ny look for departures from particle number conservation. Are you able to find an example?
902 vii details of quantum theory • 27. including gravitation
**
Black holes deflect light. Is the effect polarization dependent? Gravity itself makes no
difference of polarization; however, if virtual particle effects of QED are included, the story
Ref. 935 might change. First calculations seem to show that such an effect exists, so that gravitation
might produce rainbows. Stay tuned.
**
If lightweight boxes made of mirrors can float in radiation, one gets a strange con-
sequence: such a box might self-accelerate in free space. In a sense, an accelerated box
could float on the Fulling–Davies–Unruh radiation it creates by its own acceleration.
Are you able to show the following: one reason why this is impossible is a small but
Challenge 1414 ny difference between gravity and acceleration, namely the absence of tidal effects. (Other
reasons, such as the lack of perfect mirrors, also make the effect impossible.)
ħ2
r gr.B. = = 1.1 ë 1029 m (613)
G m e2 m p
which is about a thousand times the distance to the cosmic horizon. In fact, even in the
normal hydrogen atom there is not a single way to measure gravitational effects. (Are you
Challenge 1416 ny able to confirm this?) But why is gravity so weak? Or equivalently, why are the universe
and normal atoms so much smaller than a gravitational Bohr atom? At the present point
of our quest these questions cannot be answered. Worse, the weakness of gravity even
* Modern approaches take another direction, as explained in the third part of the mountain ascent.
quantum mechanics of gravitation 903
means that with high probability, future experiments will provide little additional data
helping to decide among competing answers. The only help is careful thought.
Decoherence of space-time
If the gravitational field evolves like a quantum system, we encounter all issues found in
other quantum systems. General relativity taught us that the gravitational field and space-
time are the same. As a result, we may ask why no superpositions of different macroscopic
space-times are observed.
Ref. 937 The discussion is simplified for the simplest case of all, namely the superposition, in a
vacuum region of size l, of a homogeneous gravitational field with value д and one with
Page 816 value д′ . As in the case of a superposition of macroscopic distinct wave functions, such
a superposition decays. In particular, it decays when particles cross the volume. A short
Challenge 1417 ny calculation yields a decay time given by
where n is the particle number density, kT their kinetic energy and m their mass. In-
serting typical numbers, we find that the variations in gravitational field strength are ex-
Challenge 1418 ny tremely small. In fact, the numbers are so small that we can deduce that the gravitational
field is the first variable which behaves classically in the history of the universe. Quantum
gravity effects for space-time will thus be extremely hard to detect.
In short, matter not only tells space-time how to curve, it also tells it to behave with
class. This result calls for the following question.
Do gravitons exist?
Quantum theory says that everything that moves is made of particles. What kind of
particles are gravitational waves made of? If the gravitational field is to be treated
quantum mechanically like the electromagnetic field, its waves should be quantized. Most
properties of these quanta can be derived in a straightforward way.
The 1~r 2 dependence of universal gravity, like that of electricity, implies that the
particles have vanishing mass and move at light speed. The independence of gravity from
electromagnetic effects implies a vanishing electric charge.
Copyright © Christoph Schiller November 1997–January 2007
The observation that gravity is always attractive, never repulsive, means that the field
quanta have integer and even spin. Vanishing spin is ruled out, since it implies no coupling
Ref. 938 to energy. To comply with the property that ‘all energy has gravity’, S = 2 is needed. In fact,
it can be shown that only the exchange of a massless spin 2 particle leads, in the classical
limit, to general relativity.
The coupling strength of gravity, corresponding to the fine structure constant of elec-
tromagnetism, is given either by
m 2 E 2
α G1 = = 2.2 ë 10−15 kg−2 α G2 = = =
G Gmm
or by (615)
ħc ħc m Pl E Pl
904 vii details of quantum theory • 27. including gravitation
However, the first expression is not a pure number; the second expression is, but depends
on the mass one inserts. These difficulties reflect the fact that gravity is not properly speak-
ing an interaction, as became clear in the section on general relativity. It is often argued
that m should be taken as the value corresponding to the energy of the system in ques-
tion. For everyday life, typical energies are 1 eV, leading to a value α G2 1~1056 . Gravity
is indeed weak compared to electromagnetism, for which α em = 1~137.04.
If all this is correct, virtual field quanta would also have to exist, to explain static grav-
itational fields.
However, up to this day, the so-called graviton has not yet been detected, and there is
in fact little hope that it ever will. On the experimental side, nobody knows yet how to
Challenge 1419 n build a graviton detector. Just try! On the theoretical side, the problems with the coupling
constant probably make it impossible to construct a renormalizable theory of gravity; the
lack of renormalization means the impossibility to define a perturbation expansion, and
thus to define particles, including the graviton. It might thus be that relations such as
E = ħω or p = ħ~2πλ are not applicable to gravitational waves. In short, it may be that
Space-time foam
The indeterminacy relation for momentum and position also applies to the gravitational
field. As a result, it leads to an expression for the indeterminacy of the metric tensor д in
a region of size L, which is given by
l Pl 2
∆д 2 , (616)
L2
¼
Challenge 1420 ny where l Pl = ħG~c 3 is the Planck length. Can you deduce the result? Quantum theory
thus shows that like the momentum or the position of a particle, also the metric tensor д
is a fuzzy observable.
But that is not all. Quantum theory is based on the principle that actions below ħ~2
cannot be observed. This implies that the observable values for the metric д in a region
of size L are bound by
дE 3 2 .
2ħG 1
(617)
Copyright © Christoph Schiller November 1997–January 2007
c L
Can you confirm this? The result has far-reaching consequences. A minimum value for
the metric depending inversely on the region size implies that it is impossible to say what
happens to the shape of space-time at extremely small dimensions. In other words, at
extremely high energies, the concept of space-time itself becomes fuzzy. John Wheeler
introduced the term space-time foam to describe this situation. The term makes clear
that space-time is not continuous nor a manifold in those domains. But this was the
basis on which we built our description of nature so far! We are forced to deduce that
our description of nature is built on sand. This issue will form the start of the third part
Page 1020 of our mountain ascent.
quantum mechanics of gravitation 905
No particles
Gravity has another important consequence for quantum theory. To count and define
Page 794 particles, quantum theory needs a defined vacuum state. However, the vacuum state can-
not be defined when the curvature radius of space-time, instead of being larger than
the Compton wavelength, becomes comparable to it. In such highly curved space-times,
particles cannot be defined. The reason is the impossibility to distinguish the environ-
ment from the particle in these situations: in the presence of strong curvatures, the va-
cuum is full of spontaneously generated matter, as black holes show. Now we just saw that
at small dimensions, space-time fluctuates wildly; in other words, space-time is highly
curved at small dimensions or high energies. In other words, strictly speaking particles
cannot be defined; the particle concept is only a low energy approximation! We will ex-
plore this strange conclusion in more detail in the third part of our mountain ascent.
No science fiction
ity. The issues are so dramatic, changing everything from the basis of classical physics to
the results of quantum theory, that we devote the beginning of the third part only to the
exploration of the contradictions. There will be surprising consequences on the nature
of space-time, particles and motion. But before we study these issues, we complete the
theme of the present, second part of the mountain ascent, namely the essence of matter
and interactions.
906 vii details of quantum theory
Biblio graphy
Gedanken sind nicht stets parat. Man schreibt
”
785 Gell-Mann wrote this for the 1976 Nobel Conference. M. Gell-Mann, What are the Build-
ing Blocks of Matter?, in D. Huff & O. Prewitt, editors, The Nature of the Physical Uni-
verse, New York, Wiley, 1979, p. 29. Cited on page 813.
786 See e.g. the reprints of his papers in the standard collection by John A. Wheeler &
Wojciech H. Zurek, Quantum Theory and Measurement, Princeton University Press,
1983. Cited on page 814.
787 H.D. Zeh, On the interpretation of measurement in quantum theory, Foundations of Phys-
ics 1, pp. 69–76, 1970. Cited on page 814.
reservoirs, Nature 403, pp. 269–273, 2000. See also the summary by W.T. Strunz, G. Al-
ber & F. Haake, Dekohärenz in offenen Quantensystemen, Physik Journal 1, pp. 47–52,
November 2002. Cited on page 819.
797 L. Hackermüller, K. Hornberger, B. Brezger, A. Zeilinger & M. Arndt,
Decoherence of matter waves by thermal emission of radiation, Nature 427, pp. 711–714,
2004. Cited on page 819.
798 K. Baumann, Quantenmechanik und Objektivierbarkeit, Zeitschrift für Naturforschung
25a, pp. 1954–1956, 1970. Cited on page 820.
799 See for example D. Styer, Physics Today p. 11, September 2000. Cited on page 821.
800 David B ohm, Quantum Theory, Prentice-Hall, 1951, pp. 614–622. Cited on page 822.
801 A. Einstein, B. Pod olsky & N. Rosen, Can quantum-mechanical description of real-
ity be considered complete?, Physical Review 48, pp. 696–702, 1935. Cited on page 822.
802 A. Aspect, J. Dalibard & G. Roger, Experimental tests of Bell’s inequalities using
time-varying analyzers, Physical Review Letters 49, pp. 1804–1807, 1982, Cited on page 823.
Today 51, pp. 36–42, July 1998. An experiment putting atom at two places at once, distant
about 80 nm, was published by C. Monroe & al., Science 272, p. 1131, 1996. Cited on page
824.
809 M.R. Andres & al., Observations of interference between two Bose condensates, Science
275, pp. 637–641, 31 January 1997. See also the www.aip.org/physnews/special.htm website.
Cited on page 824.
810 The most famous reference on the wave function collapse is chapter IV of the book by Kurt
Gottfried, Quantum Mechanics, Benjamin, New York, 1966, It is the favorite reference
by Victor Weisskopf, and cited by him on every occasion he talks about the topic. Cited on
page 825.
811 An experimental measurement of superpositions of left and right flowing currents with 1010
electrons was J.E. Mooij, T.P. Orl and o, L. Levitov, L. Tian, C.H. van der Wal
908 vii details of quantum theory
& S. Lloyd, Josephson persistent-current qubit, Science 285, pp. 1036–1039, 1999. In the
year 2000, superpositions of 1 µA clockwise and anticlockwise have been detected; for more
details, see J.R. Friedman & al., Quantum superposition of distinct macroscopic states,
Nature 406, p. 43, 2000. Cited on page 823.
812 Collapse times have been measured ... Cited on page 829.
813 A clear discussion can be found in S. Haroche & J.-M. R aimond, Quantum computing:
dream or nightmare?, Physics Today 49, pp. 51–52, 1996, as well as the comments in Physics
Today 49, pp. 107–108, 1996. Cited on page 825.
814 S. Kochen & E.P. Specker, The problem of hidden variables in quantum mechanics, 17,
pp. 59–87, 1967. Cited on page 830.
815 J.F. Cl auser, M.A. Horne, A. Shimony & R.A. Holt, Physical Review Letters 23,
p. 880, 1969. The more general and original result is found in J.S. Bell, Physics 1, p. 195,
1964, Cited on page 830.
816 Bryce de Witt & Neill Graham, eds., The Many–Worlds Interpretation of Quantum
Mechanics, Princeton University Press, 1973. This interpretation talks about entities which
823 The beautiful analogy between optics and two-dimensional electron systems, in GaAs at he-
lium temperature, has been studied in detail in C.W.J. Beenakker & H. van Hou ten,
Solid State Physics, 44, pp. 1–228, 1991, No citations.
824 The use of radioactivity for breeding is described by ... Cited on page 838.
825 S.M. Block, Real engines of creation, Nature 386, pp. 217–219, 1997. Cited on page 840.
826 Early results and ideas on molecular motors are summarised by Barbara Goss Levi,
Measured steps advance the understanding of molecular motors, Physics Today pp. 17–19,
April 1995. Newer results are described in R.D. Astumian, Making molecules into motors,
Scientific American pp. 57–64, July 2001. Cited on page 841.
827 The motorized screw used by viruses was described by A.A. Simpson & al., Structure of
the bacteriophage phi29 DNA packaging motor, Nature 408, pp. 745–750, 2000. Cited on
page 840.
bibliography 909
828 R. Bartussek & P. Hänggi, Brownsche Motoren, Physikalische Blätter 51, pp. 506–507,
1995. Cited on page 841.
829 The table and the evolutionary tree are taken from J.O. McInerney, M. Mull arkey,
M.E. Wernecke & R. Powell, Bacteria and Archaea: Molecular techniques reveal as-
tonishing diversity, Biodiversity 3, pp. 3–10, 2002. The tree might still change a little in the
coming years. Cited on page 843.
830 This is taken from the delightful children text Hans J. Press, Spiel das Wissen schafft,
Ravensburger Buchverlag 1964, 2004. Cited on page 845.
831 The discovery of a specific taste for fat was published by F. L augerette, P. Passilly-
Degrace, B. Patris, I. Niot, M. Febbraio, J.P. Montmayeur & P. Besnard,
CD36 involvement in orosensory detection of dietary lipids, spontaneous fat preference,
and digestive secretions, Journal of Clinical Investigation 115, pp. 3177–3184, 2005. Cited on
page 846.
832 Linda Buck and Richard Axel received the 2004 Nobel Prize for medicine and physiology
beautiful textbook on the topic is Hans Peter Willberg & Friedrich Forssman,
Lesetypographie, Verlag Hermann Schmidt, Mainz, 1997. See also Robert Bringhurst,
The Elements of Typographic Style, Hartley & Marks, 2004.
Many scientists passionately enjoyed their occupation. Any biography of Charles Darwin
will purvey his fascination for biology, of Friedrich Bessel for astronomy, of Albert Einstein
for physics and of Linus Pauling for chemistry.
834 The group of John Wearden in Manchester has shown by experiments with persons that the
accuracy of a few per cent is possible for any action with a duration between a tenth of a
second and a minute. See J. McCrone, When a second lasts forever, New Scientist pp. 53–
56, 1 November 1997. Cited on page 848.
835 E.J. Zimmerman, The macroscopic nature of space-time, American Journal of Physics 30,
pp. 97–105, 1962. Cited on page 850.
836 The potential of single-ion clocks is explained in ... Cited on page 850.
837 P.D. Peş ić, The smallest clock, European Journal of Physics 14, pp. 90–92, 1993. Cited on
S.R. Wilkinson, C.F. Bharucha, M.C. Fischer, K.W. Madison, P.R. Morrow,
Q. Niu, B. Sundaram & M.G. R aizen, Nature 387, p. 575, 1997. Cited on page 853.
844 See ... Cited on page 853.
845 H. Kobayashi & S. Kohshima, Unique morphology of the human eye, Nature 387,
pp. 767–768, 1997. Cited on page 856.
846 James W. Prescott, Body pleasure and the origins of violence, The Futurist Bethseda,
1975, also available at http://www.violence.de/prescott/bullettin/article.html. Cited on page
857.
847 See for example P. Py ykkö, Relativity, gold, closed-shell interactions, and CsAu.NH3, An-
gewandte Chemie, International Edition 41, pp. 3573–3578, 2002, or L.J. Norrby, Why is
mercury liquid? Or, why do relativistic effects not get into chemistry textbooks?, Journal of
Chemical Education 68, pp. 110–113, 1991. Cited on page 857.
bibliography 911
848 The switchable mirror effect was discovered by J.N. Huiberts, R. Griessen, J.H.
R ector, R.J. Wijngarden, J.P. Dekker, D.G. de Groot & N.J. Koeman, Yt-
trium and lanthanum hydride films with switchable optical properties, Nature 380, pp. 231–
234, 1996. A good introduction is R. Griessen, Schaltbare Spiegel aus Metallhydriden,
Physikalische Blätter 53, pp. 1207–1209, 1997. Cited on page 865.
849 The present attempts to build cheap THz wave sensors – which allow to see through clothes
– is described by D. Clery, Brainstorming their way to an imaging revolution, Science 297,
pp. 761–763, 2002. Cited on page 864.
850 V.F. Weisskopf, Of atoms, mountains and stars: a study in qualitative physics, Science
187, p. 605, 1975. Cited on page 865.
851 K. Au tumn, Y. Liang, T. Hsich, W. Zwaxh, W.P. Chan, T. Kenny, R. Fearing &
R.J. Full, Adhesive force of a single gecko foot-hair, Nature 405, pp. 681–685, 2000. Cited
on page 867.
852 A.B. Kesel, A. Martin & T. Seidl, Getting a grip on spider attachment: an AFM
859 For experimentally determined pictures of the orbitals of dangling bonds, see for example
F. Giessibl & al., Subatomic features on the silicon (111)-(7x7) surface observed by atomic
force microscopy, Science 289, pp. 422–425, 2000. Cited on page 858.
860 A.P. Finne, T. Araki, R. Bl aauwgeers, V.B. Eltsov, N.B. Kopnin, M. Krusius,
L. Skrbek, M. Tsubota & G.E. Volovik, An intrinsic velocity-independent criterion
for superfluid turbulence, Nature 424, pp. 1022–1025, 2003. Cited on page 869.
861 J.L. Costa–Krämer, N. Garcia, P. Garcia–Molchales & P.A. Serena, Con-
ductance in disordered nanowires: foward and backscattering, Physical Review B 53,
p. 102658, 1996. Cited on page 870.
862 The first experimental evidence is by V.J. Goldman & B. Su, Resonance tunnelling in the
fractional quantum Hall regime: measurement of fractional charge, Science 267, pp. 1010–
1012, 1995. The first unambiguous measurements are by R. de Picciotto & al., Direct
912 vii details of quantum theory
observation of a fractional charge, Nature 389, pp. 162–164, 1997, and L. Saminadayar,
D.C. Gl attli, Y. Jin & B. Etienne, Observation of the e,3 fractionally charged Laugh-
lin quasiparticle/ Physical Review Letters 79, pp. 2526–2529, 1997. or http://arxiv.org/abs/
cond-mat/9706307. Cited on page 870.
863 The original prediction, which earned him a Nobel Prize for physics, is in R.B. L aughlin,
Anomalous quantum Hall effect: an incompressible quantum fluid with fractionally charged
excitations, Physical Review Letters 50, pp. 1395–1398, 1983. Cited on pages 870 and 871.
864 The fractional quantum Hall effect was discovered by D.C. Tsui, H.L. Stormer & A.C.
Gossard, Two-dimensional magnetotransport in the extreme quantum limit, Physical Re-
view Letters 48, pp. 1559–1562, 1982. Cited on page 871.
865 The discovery of the original quantum Hall effect, which earned von Klitzing a Nobel Prize,
was published as K. von Klitzing, G. Dorda & M. Pepper, New method for high-
accuracy determination of the fine-structure constant based on quantised Hall resistance,
Physical Review Letters 45, pp. 494–497, 1980. Cited on page 870.
873 M.J. Sparnaay, Nature 180, p. 334, 1957. M.J. Sparnaay, Measurements of attractive
forces between flat plates, Physica (Utrecht) 24, pp. 751–764, 1958. Cited on page 878.
874 S.K. L amoreaux, Demonstration of the Casimir force in the 0.6 to 6 µm range, Physical
Review Letters 78, pp. 5–8, 1997. U. Mohideen & A. Roy, Precision measurement of the
Casimir force from 0.1 to 0.9 µm, Physical Review Letters 81, pp. 4549–4552, 1998. Cited on
page 878.
875 A. L ambrecht, Das Vakuum kommt zu Kräften, Physik in unserer Zeit 36, pp. 85–91, 2005.
Cited on page 878.
876 T.H. B oyer, Van der Waals forces and zero-point energy for dielectric and permeable ma-
terials, Physical Review A 9, pp. 2078–2084, 1974. This was taken up again in O. Kenneth,
I. Klich, A. Mann & M. R evzen, Repulsive Casimir forces, Physical Review Letters 89,
p. 033001, 2002. However, none of these effects has been verified yet. Cited on page 878.
bibliography 913
877 The effect was discovered by K. Scharnhorst, On propagation of light in the vacuum
between plates, Physics Letters B 236, pp. 354–359, 1990. See also G. Barton, Faster-than-
c light between parallel mirrors: the Scharnhorst effect rederived, Physics Letters B 237,
pp. 559–562, 1990, and P.W. Milonni & K. Svozil, Impossibility of measuring faster-
than-c signalling by the Scharnhorst effect, Physics Letters B 248, pp. 437–438, 1990. The
latter corrects an error in the first paper. Cited on page 878.
878 D.L. Burke & al., Positron production in multiphoton light-by-light scattering, Phys-
ical Review Letters 79, pp. 1626–1629, 1997. A simple summary is given in Bertram
Schwarzschild, Gamma rays create matter just by plowing into laser light, Physics
Today 51, pp. 17–18, February 1998. Cited on page 880.
879 M. Kardar & R. Golestanian, The ‘friction’ of the vacuum, and other fluctuation-
induced forces, Reviews of Modern Physics 71, pp. 1233–1245, 1999. Cited on page 881.
880 G. Gour & L. Sriramkumar, Will small particles exhibit Brownian motion in the
quantum vacuum?, http://arxiv.org/abs/quant/phys/9808032 Cited on page 881.
P.A.M. Dirac, ..., 1985. This article transcribes a speech by Paul Dirac just before his death.
888 M. Roberts & al., Observation of an electric octupole transition in a single ion, Physical
Review Letters 78, pp. 1876–1879, 1997. A lifetime of 3700 days was determined. Cited on
page 883.
889 K. An, J.J. Childs, R.R. Desari & M.S. Field, Microlaser: a laser with one atom in
an optical resonator, Physical Review Letters 73, pp. 3375–3378, 19 December 1994. Cited on
page 873.
890 A. Takita, H. Yamamoto, Y. Hayasaki, N. Nishida & H. Misawa, Three-
dimensional optical memory using a human fingernail, Optics Express 13, pp. 4560–4567,
2005. They used a Ti:sapphire oscillator and a Ti:sapphire amplifier. Cited on page 873.
891 See the website by Tom Hales on http://www.math.lsa.umich.edu/~hales/countdown. An
earlier book on the issue is the delightful text by Max Leppmaier, Kugelpackungen von
Kepler bis heute, Vieweg Verlag, 1997. Cited on page 884.
914 vii details of quantum theory
892 For a short overview, also covering binary systems, see C. Bechinger, H.-H. von
Grünberg & P. Leiderer, Entropische Kräfte - warum sich repulsiv wechselwirkende
Teilchen anziehen können, Physikalische Blätter 55, pp. 53–56, 1999. Cited on page 884.
893 Hans–Werner Fink, Direct observation of three-body interactions in adsorbed layers:
Re on W(111), Physical Review Letters 52, pp. 1532–1534, 1984. Cited on page 884.
894 See for example the textbook by A. Pimpinelli & J. Vill ain, Physics of Crystal Growth,
Cambridge University Press, 1998. Cited on page 885.
895 Y. Furukawa, Faszination der Schneekristalle - wie ihre bezaubernden Formen entstehen,
Chemie in unserer Zeit 31, pp. 58–65, 1997. His http://www.lowtem.hokudai.ac.jp/~frkw/
index_e.html website gives more information. Cited on page 885.
896 P.-G. de Gennes, Soft matter, Reviews of Modern Physics 63, p. 645, 1992. Cited on page
885.
897 The fluid helium based quantum gyroscope was first built by the Czech group ... and the
Berkeley group, Nature 10. April 1997. Cited on page 885.
duced quantum mechanics, Physical Review Letters 34, pp. 1472–1475, 1975. This experiment
is usually called the COW experiment. Cited on page 890.
908 G.C. Ghirardi, A. R imini & T. Weber, Unified dynamics for microscopic and macro-
scopic systems, Physical Review D 34, pp. 470–491, 1986. Cited on page 890.
909 Ch.J. B ordé & C. L ämmerzahl, Atominterferometrie und Gravitation, Physikalische
Blätter 52, pp. 238–240, 1996. See also A. Peters, K.Y. Chung & S. Chu, Observation
of gravitational acceleration by dropping atoms, Nature 400, pp. 849–852, 26 August 1999.
Cited on page 890.
910 V.V. Nesvizhevsky & al., Quantum states of neutrons in the Earth’s gravitational field,
Nature 415, p. 297, 17 January 2002. Cited on page 891.
911 J.D. Bekenstein, Black holes and entropy, Physical Review D 7, pp. 2333–2346, 1973.
Cited on page 892.
bibliography 915
912 R. B ousso, The holographic principle, Review of Modern Physics 74, pp. 825–874, 2002,
also available as http://arxiv.org/abs/hep-th/0203101. The paper is an excellent review; how-
ever, it has some argumentation errors, as explained on page 893. Cited on page 893.
913 This is clearly argued by S. Carlip, Black hole entropy from horizon conformal field the-
ory, Nuclear Physics B Proceedings Supplement 88, pp. 10–16, 2000. Cited on page 893.
914 S.A. Fulling, Nonuniqueness of canonical field quantization in Riemannian space-time,
Physical Review D 7, pp. 2850–2862, 1973, P. Davies/ Scalar particle production in Schwar-
zchild and Rindler metrics, Journal of Physics A: General Physics 8, pp. 609–616, 1975, W.G.
Unruh, Notes on black hole evaporation, Physical Review D 14, pp. 870–892, 1976. Cited
on page 893.
915 About the possibility to measure Fulling–Davies–Unruh radiation, see for example the pa-
per by H. Rosu, On the estimates to measure Hawking effect und Unruh effect in the labor-
atory, International Journal of Modern Physics D3 p. 545, 1994. http://arxiv.org/abs/gr-qc/
9605032 or the paper P. Chen & T. Tajima, Testing Unruh radiation with ultra-intense
lasers, Physical Review Letters 83, pp. 256–259, 1999. Cited on page 893.
Teil I und II, Sterne und Weltraum March and April 2001. Cited on page 897.
925 When the gamma ray burst encounters the matter around the black hole, it is broadened.
The larger the amount of matter, the broader the pulse is. See G. Preparata, R. Ruffini
& S.-S. Xue, The dyadosphere of black holes and gamma-ray bursts, Astronomy and As-
trophysics 338, pp. L87–L90, 1998, R. Ruffini, J.D. Salmonson, J.R. Wilson & S.-S.
Xue, On the pair electromagnetic pulse of a black hole with electromagnetic structure, As-
tronomy and Astrophysics 350, pp. 334–343, 1999, R. Ruffini, J.D. Salmonson, J.R.
Wilson & S.-S. Xue, On the pair electromagnetic pulse from an electromagnetic black
hole surrounded by a baryonic remnant, Astronomy and Astrophysics 359, pp. 855–864,
2000, and C.L. Bianco, R. Ruffini & S.-S. Xue, The elementary spike produced by
a pure e+ e− pair-electromagnetic pulse from a black hole: the PEM pulse, Astronomy and
Astrophysics 368, pp. 377–390, 2001. For a very personal account by Ruffini on his involve-
916 vii details of quantum theory
ment in gamma ray bursts, see his paper Black hole formation and gamma ray bursts, http://
arxiv.org/abs/astro-ph/0001425. Cited on page 898.
926 About the membrane despription of black holes, see ... Cited on page 899.
927 http://arxiv.org/abs/astro-ph/0301262, http://arxiv.org/abs/astro-ph/0303539, D.W. Fox
& al., Early optical emission from the γ-ray burst of 4 October 2002, Nature 422, pp. 284–
286, 2003. Cited on page 898.
928 Negative heat capacity has also been found in atom clusters and in nuclei. See e.g. M.
Schmidt & al., Negative heat capacity for a cluster of 147 sodium atoms, Physical Review
Letters 86, pp. 1191–1194, 2001. Cited on page 899.
929 H.-P. Nollert, Quasinormal modes: the characteristic ‘sound’ of black holes and neutron
stars, Classical and Quantum Gravity 16, pp. R159–R216, 1999. Cited on page 899.
930 He wrote a series of papers on the topic; a beautiful summary is Don N. Page, How fast
does a black hole radiate information?, International Journal of Modern Physics 3, pp. 93–
106, 1994, based on his earlier papers, such as Information in black hole radiation, Physical
938 This point was made repeatedly by Steven Weinberg, namely in Derivation of gauge
invariance and the equivalence principle from Lorentz invariance of the S-matrix , Phys-
ics Letters 9, pp. 357–359, 1964, in Photons and gravitons in S-matrix theory: derivation of
charge conservation and equality of gravitational and inertial mass, Physical Review B 135,
pp. 1049–1056, 1964, and in Photons and gravitons in perturbation theory: derivation of
Maxwell’s and Einstein’s equations, Physical Review B 138, pp. 988–1002, 1965. Cited on
page 903.
939 L.H. Ford & T.A. Roman, Quantum field theory constrains traversable wormhole geo-
metries, http://arxiv.org/abs/gr-qc/9510071 or Physical Review D 53, pp. 5496–5507, 1996.
Cited on page 905.
940 M.J. Pfenning & L.H. Ford, The unphysical nature of ‘warp drive’, Classical and
Quantum Gravity 14, pp. 1743–1751, 1997. Cited on page 905.
bibliography 917
941 B.S. Kay, M. R adzikowski & R.M. Wald, Quantum field theory on spacetimes with
a compactly generated Cauchy horizon, http://arxiv.org/abs/gr-qc/9603012 or Communica-
tions in Mathematical Physics 183 pp. 533–556, 1997. Cited on page 905.
942 An overview on squeezed light is given in ... No citations.
943 An introduction into the history of the concept of precision is given by N.W. Wise, The
values of precision, Princeton University Press, 1994. No citations.
944 E.H. Kennard, Zur Quantenmechanik einfacher Bewegungstypen, Zeitschrift für Physik
44, pp. 326–352, 1927. The priority of the result is often wrongly attributed to the much
later paper by H.P. Robertson, The uncertainty principle, Physical Review 34, pp. 163–
164, 1929. The complete result is by Erwin Schrödinger, Zum Heisenbergschem Un-
schärfeprinzip, Sitzungsberichte der Preussischen Akademie der Wissenschaften, Physikalisch-
Mathematische Klasse Berlin, pp. 296–303, 1930. No citations.
945 W. Heisenberg, Über den anschaulichen Inhalt der quantentheoretischen Kinematik
und Mechanik, Zeitschrift für Physik 43, pp. 172–198, 1927. No citations.
N
uclear physics was born in 1896 in France, but is now a small activity.
“
A physical wonder – magnetic resonance imaging
”
Arguably, the most spectacular tool that physical research has produced in the twentieth
century was magnetic resonance imaging, or MRI for short. This technique allows to im-
age human bodies with a high resolution and with (almost) no damage, in strong contrast
to X-ray imaging. Though the machines are still expensive – costing up to several million
euro – there is hope that they will become cheaper in the future. Such a machine consists
essentially of a large magnetic coil, a radio transmitter and a computer. Some results of
putting part of a person into the coil are shown in Figure 370.
In these machines, a radio transmitter emits radio waves that are absorbed because
hydrogen nuclei are small spinning magnets. The magnets can be parallel or antiparallel Copyright © Christoph Schiller November 1997–January 2007
to the magnetic field produced by the coil. The transition energy E can be absorbed from
a radio wave whose frequency ω is tuned to the magnetic field B. The energy absorbed
by a single hydrogen nucleus is given by
E = ħω = ħγB (618)
The material constant γ~2π has a value of 42.6 MHz~T for hydrogen nuclei; it results
from the non-vanishing spin of the proton. This is a quantum effect, as stressed by the
appearance of the quantum of action ħ. Using some cleverly applied magnetic fields, typ-
ically with a strength between 0.3 and 7 T for commercial and up to 20 T for experimental
machines, the machines are able to measure the absorption for each volume element sep-
arately. Interestingly, the precise absorption level depends on the chemical compound
920 viii inside the nucleus • the structure of the nucleus
the nucleus is built into. Thus the absorption value will depend on the chemical environ-
ment. When the intensity of the absorption is plotted as grey scale, an image is formed
that retraces the different chemical composition. Two examples are shown in Figure 370.
Using additional tricks, modern machines can picture blood flow in the heart or air flow
in lungs; they can even make films of the heart beat. Other techniques show how the loc-
ation of sugar metabolism in the brain depends on what you are thinking about.* In fact,
also what you are thinking about all the time has been imaged: the first image of people
Ref. 953 making love has been taken by Willibrord Weijmar Schultz and his group in 1999. It is
shown in Figure 371.
Each magnetic resonance image thus proves that atoms have spinning nuclei. Like for
any other object, nuclei have size, colour, composition and interactions that ask to be
explored.
The magnetic resonance signal shows that hydrogen nuclei are quite sensitive to magnetic
fields. The д-factor of protons, defined using the magnetic moment µ, their mass and
Page 780 charge as д = µ4m~eħ, is about 5.6. Using expression (540) that relates the д-factor and
the radius of a composite object, we deduce that the radius of the proton is about 0.9 fm;
this value is confirmed by experiment. Protons are thus much smaller than hydrogen
atoms, the smallest of atoms, whose radius is about 30 pm. In turn, the proton is the
smallest of all nuclei; the largest nuclei have radii 7 times the proton value.
The small size of nuclei is no news. It is known since the beginning of the twenti-
eth century. The story starts on the first of March in 1896, when Henri Becquerel* dis-
covered a puzzling phenomenon: minerals of uranium potassium sulphate blacken pho-
tographic plates. Becquerel had heard that the material is strongly fluorescent; he conjec-
tured that fluorescence might have some connection to the X-rays discovered by Conrad
Röngten the year before. His conjecture was wrong; nevertheless it led him to an import-
ant new discovery. Investigating the reason for the effect of uranium on photographic
plates, Becquerel found that these minerals emit an undiscovered type of radiation, dif-
ferent from anything known at that time; in addition, the radiation is emitted by any
substance containing uranium. In 1898, Bémont named the property of these minerals
radioactivity.
Radioactive rays are also emitted from many elements other than
uranium. The radiation can be ‘seen’: it can be detected by the tiny
flashes of light that are emitted when the rays hit a scintillation
* Henri Becquerel (b. 1852 Paris, d. 1908 Le Croisic), important French physicist; his primary topic was the
study of radioactivity. He was the thesis adviser of Marie Curie, the wife of Pierre Curie, and was central to
bringing her to fame. The SI unit for radioactivity is named after him. For his discovery of radioactivity he
received the 1903 Nobel Prize for physics; he shared it with the Curies.
** Ernest Rutherford (1871–1937), important New Zealand physicist. He emigrated to Britain and became
professor at the University of Manchester. He coined the terms alpha particle, beta particle, proton and
neutron. A gifted experimentalist, he discovered that radioactivity transmutes the elements, explained the
nature of alpha rays, discovered the nucleus, measured its size and performed the first nuclear reactions.
Ironically, in 1908 he received the Nobel price for chemistry, much to the amusement of himself and of
the world-wide physics community; this was necessary as it was impossible to give enough physics prizes
to the numerous discoverers of the time. He founded a successful research school of nuclear physics and
many famous physicists spent some time at his institute. Ever an experimentalist, Rutherford deeply disliked
quantum theory, even though it was and is the only possible explanation for his discoveries.
922 viii inside the nucleus • the structure of the nucleus
research group followed the path of the particles by using scintillation screens; later on
they used an invention by Charles Wilson: the cloud chamber. A cloud chamber, like its
successor, the bubble chamber, produces white traces along the path of charged particles;
the mechanism is the same as the one than leads to the white lines in the sky when an
aeroplane flies by.
The radiation detectors gave a strange result: most alpha
particles pass through the metal foil undisturbed, whereas
a few are reflected. In addition, those few which are reflec-
ted are not reflected by the surface, but in the inside of
Challenge 1421 n the foil. (Can you imagine how they showed this?) Ruther-
ford deduced from this scattering experiment that first of all,
atoms are mainly transparent. Only transparency explains
why most alpha particles pass the foil without disturbance, Copyright © Christoph Schiller November 1997–January 2007
even though it was over 2000 atoms thick. But some particles
were scattered by large angles or even reflected. Rutherford
showed that the reflections must be due to a single scattering
point. By counting the particles that were reflected (about 1
in 20000 for his 0.4 µm gold foil), Rutherford was also able Marie Curie
to deduce the size of the reflecting entity and to estimate its
mass. He found that it contains almost all of the mass of the atom in a diameter of around
1 fm. He thus named it the nucleus. Using the knowledge that atoms contain electrons,
Rutherford then deduced from this experiment that atoms consist of an electron cloud
that determines the size of atoms – of the order of 0.1 nm – and of a tiny but heavy nucleus
at the centre. If an atom had the size of a basketball, its nucleus would have the size of a
dust particle, yet contain 99.9 % of the basketball’s mass. Atoms resemble thus candy floss
the structure of the nucleus – the densest clouds 923
around a heavy dust particle. Even though the candy floss – the electron cloud – around
the nucleus is extremely thin and light, it is strong enough to avoid that two atoms in-
terpenetrate; thus it keeps the neighbouring nuclei at constant distance. For the tiny and
massive alpha particles however, the candy floss is essentially empty space, so that they
simply fly through the electron clouds until they exit on the other side or hit a nucleus.
The density of the nucleus is impressive: about 5.5 ë 1017 kg~m3 . At that density, the
mass of the Earth would fit in a sphere of 137 m radius and a grain of sand would have a
Challenge 1422 e mass larger than the largest existing oil tanker. (True?) Now we know that oil tankers are
complex structures. What then is the structure of a nucleus?
“ Ernest Rutherford
”
924 viii inside the nucleus • the structure of the nucleus
be levitated. Obviously, a trap only makes sense if the trapped particle can be observed.
In case of neutrons, this is achieved by the radio waves absorbed when the magnetic
moment switches direction with respect to an applied magnetic field. The result of these
experiments is simple: the lifetime of free neutrons is around 888(1) s. Nevertheless, in-
side most nuclei we are made of, neutrons do not decay, as the result does not lead to a
Challenge 1423 n state of lower energy. (Why not?)
Magnetic resonance images also show that some elements have different types of
atoms. These elements have atoms that with the same number of protons, but with dif-
ferent numbers of neutrons. One says that these elements have several isotopes.* This
* The name is derived from the Greek words for ‘same’ and ‘spot’, as the atoms are on the same spot in the
periodic table of the elements.
the structure of the nucleus – the densest clouds 925
Half-life
stable
el. capture (beta+)
beta - emission
alpha emission
proton emission
neutron emission
spontaneous fission
unknown
F I G U R E 372 All known nuclides with their lifetimes (above) and main decay modes (BELOW, data from
http://www.nndc.bnl.gov/nudat2)
also explains why some elements radiate with a mixture of different decay times. Though
chemically they are (almost) indistinguishable, isotopes can differ strongly in their nuc-
Copyright © Christoph Schiller November 1997–January 2007
lear properties. Some elements, such as tin, caesium, or polonium, have over thirty iso-
topes each. Together, the about 100 known elements have over 2000 nuclides.*
The motion of protons and neutrons inside nuclei allows to understand the spin and
the magnetic moment of nuclei. Since nuclei are so extremely dense despite containing nu-
merous positively charged protons, there must be a force that keeps everything together
against the electrostatic repulsion. We saw that the force is not influenced by electromag-
netic or gravitational fields; it must be something different. The force must be short range;
otherwise nuclei would not decay by emitting high energy alpha rays. The new force is
called the strong nuclear interaction. We shall study it in detail shortly.
* Nuclides is the standard expression for a nucleus with a given number of neutrons and protons.
926 viii inside the nucleus • the structure of the nucleus
metal wire
(e.g. paper clip)
thin
aluminium
foils
* Viktor Franz Heß, (1883–1964), Austrian nuclear physicist, received the Nobel Prize for physics in 1936 for
his discovery of cosmic radiation. Heß was one of the pioneers of research into radioactivity. Heß’ discovery
also explained why the atmosphere is always somewhat charged, a result important for the formation and
behaviour of clouds. Twenty years after the discovery of cosmic radiation, in 1932 Carl Anderson discovered
the first antiparticle, the positron, in cosmic radiation; in 1937 Seth Neddermeyer and Carl Anderson dis-
covered the muon; in 1947 a team led by Cecil Powell discovered the pion; in 1951, the Λ 0 and the kaon K 0
are discovered. All discoveries used cosmic rays and most of these discoveries led to Nobel Prizes.
the structure of the nucleus – the densest clouds 927
flew, the stronger it became. Risking his health and life, he continued upwards to more
than 5000 m; there the discharge effect was several times stronger than on the surface
of the Earth. This result is exactly what is expected from a radiation coming from outer
space and absorbed by the atmosphere. In one of his most important flights, performed
Copyright © Christoph Schiller November 1997–January 2007
during an (almost total) solar eclipse, Heß showed that most of the ‘height radiation’ did
not come from the Sun, but from further away. He – and Millikan – thus called the ra-
diation cosmic rays. During the last few centuries, many people have drunk from a glass
and eaten chocolate; but only Heß combined these activities with such careful observa-
tion and deduction that he earned a Nobel Prize.*
Today, the most impressive detectors for cosmic rays are Geiger–Müller counters and
spark chambers. Both share the same idea; a high voltage is applied between two metal
parts kept in a thin and suitably chosen gas (a wire and a cylindrical mesh for the Geiger-
Müller counter, two plates or wire meshes in the spark chambers). When a high energy
ionizing particle crosses the counter, a spark is generated, which can either be observed
* In fact, Hess gold foils in his electrometer.
928 viii inside the nucleus • the structure of the nucleus
through the generated spark (as you can do yourself in the entrance hall of the CERN
main building), or detected by the sudden current flow. Historically, the current was first
amplified and sent to a loudspeaker, so that the particles can be heard by a ‘click’ noise.
With a Geiger counter, one cannot see atoms or particles, but one can hear them. Finally,
ionized atoms could be counted. Finding the right gas mixture is tricky; it is the reason
that the counter has a double name. One needs a gas that extinguishes the spark after a
while, to make the detector ready for the next particle. Müller was Geiger’s assistant; he
made the best counters by adding the right mixture of alcohol to the gas in the chamber.
Nasty rumours maintained that this was discovered when another assistant tried, without
success, to build counters while Müller was absent. When Müller, supposedly a heavy
drinker, came back, everything worked again. However, the story is apocryphal. Today,
Geiger–Müller counters are used around the world to detect radioactivity; the smallest
fit in mobile phones and inside wrist watches.
The particle energy in cosmic rays spans a large range between 103 eV and at least
20
10 eV; the latter is the same energy as a tennis ball after serve. Understanding the origin Copyright © Christoph Schiller November 1997–January 2007
of cosmic rays is a science by its own. Some are galactic in origin, some are extragalactic.
For most energies, supernova remnants – pulsars and the like – seem the best candidates.
However, the source of the highest energy particles is still unknown.
In other words, cosmic rays are probably the only type of radiation discovered without
the help of shadows. But shadows have been found later on. In a beautiful experiment
performed in 1994, the shadow thrown by the Moon on high energy cosmic rays (about
10 TeV) was studied. When the position of the shadow is compared with the actual pos-
ition of the Moon, a shift is found. Indeed, due to the magnetic field of the Earth, the
cosmic ray Moon shadow is expected to be shifted westwards for protons and eastwards
Ref. 974 for antiprotons. The data are consistent with a ratio of antiprotons between 0 % and 30 %.
By studying the shadow’s position, the experiment thus showed that high energy cosmic
rays are mainly positively charged and thus consist mainly of matter, and only in small
the structure of the nucleus – the densest clouds 929
Cosmic rays have several effects on everyday life. Through the charges they produce in
the atmosphere, they are probably responsible for the jagged, non-straight propagation
of lightning. (Lightning advances in pulses, alternating fast propagation for about 30 m
with slow propagation, until they hit connect. The direction they take at the slow spots
depends on the wind and the charge distribution in the atmosphere.) Cosmic rays are also
important in the creation of rain drops and ice particles inside clouds, and thus indirectly
in the charging of the clouds. Cosmic rays, together with ambient radioactivity, also start
Page 528 the Kelvin generator.
If the Moon would not exist, we would die from cosmic rays. The Moon helps to give
the Earth a high magnetic field via a dynamo effect, which then diverts most rays towards
the magnetic poles. Also the upper atmosphere helps animal life to survive, by shielding
life from the harmful effects of cosmic rays. Indeed, aeroplane pilots and airline employ-
930 viii inside the nucleus • the structure of the nucleus
F I G U R E 376 Two aurorae australes on Earth, seen from space (a composed image with superimposed
UV intensity, and a view in the X-ray domain) and a double aurora on Saturn (all NASA)
ees have a strong radiation exposure that is not favourable to their health. Cosmic rays
are one of several reasons that long space travel, such as a trip to mars, is not an option
without electron clouds also in other situations. In fact, the vast majority of nuclei in the
universe have no electron clouds at all: in the inside of stars no nucleus is surrounded
by bound electrons; similarly, a large part of intergalactic matter is made of protons. It is
known today that most of the matter in the universe is found as protons or alpha particles
inside stars and as thin gas between the galaxies. In other words, in contrast to what the
Greeks said, matter is not usually made of atoms; it is mostly made of nuclei. Our everyday
environment is an exception when seen on cosmic scales. In nature, atoms are rare.
By the way, nuclei are in no way forced to move; nuclei can also be stored with almost
no motion. There are methods – now commonly used in research groups – to superpose
* In the solar system, aurorae due to core magnetic fields have been observed on Jupiter, Saturn, Uranus,
Neptune, Earth, Io and Ganymede. Aurorae due to other mechanisms have been seen on Venus and Mars.
the structure of the nucleus – the densest clouds 931
electric and magnetic fields in such a way that a single nucleus can be kept floating in
Page 615 mid-air; we discussed this possibility in the section on levitation earlier on.
Nuclei decay
Not all nuclei are stable over time. The first measurement that provided a hint was the
way radioactivity changes with time. The number N of atoms decreases with time. More
precisely, radioactivity follows an exponential decay:
The parameter τ, the so-called life time, depends on the type of nucleus emitting the rays.
It can vary from much less than a microsecond to millions of millions of years. The ex-
pression has been checked for as long as 34 multiples of the duration τ; its validity and
precision is well-established by experiments. Radioactivity is the decay of unstable nuclei.
radioactivity.
By the way, in every human body about nine thousand radioactive decays take place
every second, mainly 4.5 kBq (0.2 mSv~a) from 40 K and 4 kBq from 14 C (0.01 mSv~a).
Challenge 1425 n Why is this not dangerous?
All radioactivity is accompanied by emission of energy. The energy emitted by an atom
trough radioactive decay or reactions is regularly a million time large than that emitted
by a chemical process. That is the reason for the danger of nuclear weapons. More than a
decay, a radioactive process is thus an explosion.
What distinguishes those atoms that decay from those which do not? An exponential
decay law implies that the probability of decay is independent of the age of the atom. Age
Page 792 or time plays no role. We also know from thermodynamics, that all atoms are exactly
identical. So how is the decaying atom singled out? It took around 40 years to discover
that decays are triggered by the statistical fluctuations of the vacuum, as described by
quantum theory. Indeed, radioactivity is one of the clearest observations that classical
Ref. 955 over hundred years have passed since these results became known.
With the advent of radiometric dating, for the first time it became possible to reliably
date the age of rocks, to compare it with the age of meteorites and, when space travel
became fashionable, with the age of the Moon. The result of the field of radiometric dating
was beyond all estimates and expectations: the oldest rocks and the oldest meteorites
studied independently using different dating methods, are 4570(10) million years old. But
if the Earth is that old, why did the Earth not cool down in its core in the meantime?
the structure of the nucleus – the densest clouds 933
* In 1960, the developer of the radiocarbon dating technique, Willard Libby, received the Nobel Prize for
chemistry.
934 viii inside the nucleus • the structure of the nucleus
F I G U R E 378 Various nuclear shapes – fixed (left) and oscillating (right), shown
realistically as clouds (above) and simplified as geometric shapes (below)
about 2.4 fm, where their density decreases. Nuclei are thus small clouds, as shown in
Figure 378.
We know that molecules can be of extremely involved shape. In contrast, nuclei are
mostly spheres, ellipsoids or small variations of these. The reason is the short range, or
better, the fast spatial decay of nuclear interactions. To get interesting shapes like in mo-
lecules, one needs, apart from nearest neighbour interactions, also next neighbour inter-
actions and next next neighbour interactions. The strong nuclear interaction is too short
ranged to make this possible. Or does it? It might be that future studies will discover that
some nuclei are of more unusual shape, such as smoothed pyramids. Some predictions
Ref. 969 have been made in this direction; however, the experiments have not been performed yet.
The shape of nuclei does not have to be fixed; nuclei can also oscillate in shape. Such
oscillations have been studied in great detail. The two simplest cases, the quadrupole and
octupole oscillations, are shown in Figure 378. Obviously, nuclei can also rotate. Rapidly
spinning nuclei, with a spin of up to 60ħ and more, exist. They usually slow down step by
of quarks inside hadrons. Bound motion determines shape and shape changes of com-
pounds.
Decay is obviously the basis of radioactivity. Decay can be due to the electromagnetic,
the strong or the weak nuclear interaction. Decay allows to study the conserved quantities
of nuclear interactions.
Nuclear reactions are combinations of scattering, decay and possibly bound motion.
Nuclear reactions are for nuclei what the touching of objects is in everyday life. Touching
an object we can take it apart, break it, solder two objects together, throw it away, and
much more. The same can be done with nuclei. In particular, nuclear reactions are re-
sponsible for the burning of the Sun and the other stars; they also tell the history of the
nuclei inside our bodies.
936 viii inside the nucleus • the structure of the nucleus
Quantum theory showed that all four types of motion can be described in the same
way. Each type of motion is due to the exchange of virtual particles. For example, scat-
tering due to charge repulsion is due to exchange of virtual photons, the bound motion
inside nuclei due to the strong nuclear interaction is due to exchange of virtual gluons,
beta decay is due to the exchange of virtual W bosons, and neutrino reactions are due to
the exchange of virtual Z bosons. The rest of this chapter explains these mechanisms in
more details.
Nuclei react
The first man who thought to have made transuranic elements, the Italian genius Enrico
Page 795 Fermi, received the Nobel Prize for the discovery. Shortly afterwards, Otto Hahn and his
collaborators Lise Meitner and Fritz Strassman showed that Fermi was wrong, and that
his prize was based on a mistake. Fermi was allowed to keep his prize, the Nobel com-
mittee gave Hahn the Nobel Prize as well, and to make the matter unclear to everybody
Meitner called the splitting process nuclear fission. A large amount of energy is liberated
in fission. In addition, several neutrons are emitted; they can thus start a chain reaction.
Later, and (of course) against the will of the team, the discovery would be used to make
nuclear bombs.
Reactions and decays are transformations. In each transformation, already the Greek
taught us to search, first of all, for conserved quantities. Besides the well-known cases
of energy, momentum, electric charge and angular momentum conservation, the results
Copyright © Christoph Schiller November 1997–January 2007
of nuclear physics lead to several new conserved quantities. The behaviour is quite con-
strained. Quantum field theory implies that particles and antiparticles (commonly de-
noted by a bar) must behave in compatible ways. Both experiment and quantum field
theory show for example that every reaction of the type A + B C + D implies that the
reactions A + C B + D or C + D A + B or, if energy is sufficient, A C + D + B, are
also possible. Particles thus behave like conserved mathematical entities.
Experiments show that antineutrinos differ from neutrinos. In fact, all reactions con-
firm that the so-called lepton number is conserved in nature. The lepton number L is zero
for nucleons or quarks, is 1 for the electron and the neutrino, and is −1 for the positron
and the antineutrino.
In addition, all reactions conserve the so-called baryon number. The baryon number B
the structure of the nucleus – the densest clouds 937
is 1 for protons and neutrons (and 1/3 for quarks), and −1 for antiprotons and antineutrons
(and thus −1~3 for antiquarks). So far, no process with baryon number violation has ever
been observed. Baryon conservation is one reason for the danger of radioactivity, fission
and fusion.
water, 14 C and 90 Sr through food, and 137 Cs in both ways. In the meantime, all countries
have agreed to perform their nuclear tests underground.
But even peaceful nuclear reactors are dangerous. The reason was discovered in 1934
by Frédéric Joliot and his wife Irène, the daughter of Pierre and Marie Curie: artificial
radioactivity. The Joliot–Curies discovered that materials irradiated by alpha rays become
radioactive in turn. They found that alpha rays transformed aluminium into radioactive
phosphorous:
27 4 4
13 Al + 2 α 15 P . (621)
In fact, almost all materials become radioactive when irradiated with alpha particles, neut-
rons or gamma rays. As a result, radioactivity itself can only be contained with difficulty.
938 viii inside the nucleus • the structure of the nucleus
M at e r i a l Activity in
B q/kg
air c. 10−2
sea water 101
human body c. 102
cow milk max. 103
pure 238 U metal c. 107
highly radioactive α emitters A 107
radiocarbon: 14 C (β emitter) 108
highly radioactive β and γ emitters A9
main nuclear fallout: 137 Cs, 90 Sr (α emitter) 2 ë 109
polonium, one of the most radioactive materials (α) 1024
Nuclear physics is the most violent part of physics. But despite this bad image, nuclear
physics has something to offer which is deeply fascinating: the understanding of the Sun,
the stars and the early universe.
The Sun emits 385 YW of light. Where does this energy come from? If it came by burn-
ing coal, the Sun would stop burning after a few thousands of years. When radioactivity
was discovered, researchers tested the possibility that this process was at the heart of the
Sun’s shining. However, even though radioactivity can produce more energy than chem-
ical burning, the composition of the Sun – mostly hydrogen and helium – makes this
impossible. In fact, the study of nuclei showed that the Sun burns by hydrogen fusion.
Fusion is the composition of a large nucleus from smaller ones. In the Sun, the fusion
reaction
4 1 H 4 He + 2 e + + 2 ν + 4.4 pJ (622)
the structure of the nucleus – the densest clouds 939
In total, four protons are thus fused to one helium nucleus; if we include the electrons,
four hydrogen atoms are fused to one helium atom with the emission of neutrinos and
light with a total energy of 4.4 pJ (26.7 MeV). Most of the energy is emitted as light;
around 10 % is carried away by neutrinos. The first of the three reaction of equation 623
is due to the weak nuclear interaction; this avoids that it happens too rapidly and ensures
that the Sun will shine still for some time. Indeed, in the Sun, with a luminosity of 385 YW,
Ref. 971 there are thus about 1038 fusions per second. This allows to deduce that the Sun will last
another handful of Ga (Gigayears) before it runs out of fuel.
Chain reactions are quite common in nature. Fire is a chemical chain reaction, as are
exploding fireworks. In both cases, material needs heat to burn; this heat is supplied by a
neighbouring region that is already burning.
**
Radioactivity can be extremely dangerous to humans. The best example is plutonium.
Only 1 µg of this alpha emitter inside the human body are sufficient to cause lung cancer.
**
Lead is slightly radioactive, because it contains the 210 Pb isotope, a beta emitter. This
lead isotope is produced by the uranium and thorium contained in the rock from where
940 viii inside the nucleus • the structure of the nucleus
Exposure Dose
the lead is extracted. For sensitive experiments, such as for neutrino experiments, one
needs radioactivity shields. The best shield material is lead, but obviously it has to be low
radioactivity lead. Since the isotope 210 Pb has a half-life of 22 years, one way to do it is to
use old lead. In a precision neutrino experiment in the Gran Sasso in Italy, the research Copyright © Christoph Schiller November 1997–January 2007
team uses lead mined during Roman times in order to reduce spurious signals.
**
Not all reactors are human made. Natural reactors have been predicted in 1956 by Paul
Kuroda. In 1972 the first example was found. In Oklo, in the African country of Gabon,
there is a now famous geological formation where uranium is so common that two thou-
sand million years ago a natural nuclear reactor has formed spontaneously – albeit a small
one, with an estimated power generation of 100 kW. It has been burning for over 150 000
years, during the time when the uranium 235 percentage was 3% or more, as required for
chain reaction. (Nowadays, the uranium 235 content on Earth is 0.7%.) The water of a
nearby river was periodically heated to steam during an estimated 30 minutes; then the
reactor cooled down again for an estimated 2.5 hours, since water is necessary to mod-
the structure of the nucleus – the densest clouds 941
erate the neutrons and sustain the chain reaction. The system has been studied in great
detail, from its geological history up to the statements it makes about the constancy of
the ‘laws’ of nature. The studies showed that 2000 million years ago the mechanisms were
the same as those used today.
**
High energy radiation is dangerous to humans. In the 1950s, when nuclear tests were still
made above ground by the large armies in the world, the generals overruled the orders
of the medical doctors. They positioned many soldiers nearby to watch the explosion,
and worse, even ordered them to walk to the explosion site as soon as possible after the
explosion. One does not need to comment on the orders of these generals. Several of these
unlucky soldiers made a strange observation: during the flash of the explosion, they were
Challenge 1429 ny able to see the bones in their own hand and arms. How can this be?
time scales. As a result, radiometric dating methods can be be surprisingly precise. But
Challenge 1430 n how does one measure half-lives of thousands of millions of years to high precision?
**
The beta decay of the radioactive carbon isotope 14 C has a decay time of 5568 a. This
isotope is continually created in the atmosphere through the influence of cosmic rays.
This happens through the reaction 14 N + n p + 14 C. As a result, the concentration of
radiocarbon in air is relatively constant over time. Inside living plants, the metabolism
thus (unknowingly) maintains the same concentration. In dead plants, the decay sets in.
The decay time of a few thousand years is particularly useful to date historic material. The
method, called radiocarbon dating, has been used to determine the age of mummies, the
age of prehistoric tools and the age of religious relics. The original version of the technique
942 viii inside the nucleus • the structure of the nucleus
210
Pb 22 a gamma-ray counting dating wine
3
H 12.3 a gamma-ray counting dating wine
measured the radiocarbon content through its radioactive decay and the scintillations
it produced. A quality jump was achieved when accelerator mass spectroscopy became
commonplace. It was not necessary any more to wait for decays: it is now possible to
determine the 14 C content directly. As a result, only a tiny amount of carbon, as low as
0.2 mg, is necessary for a precise dating. This technique showed that numerous religious
relics are forgeries, such as a cloth in Turin, and several of their wardens turned out to be
crooks.
the strong nuclear interaction and the birth of matter 943
**
Researchers have developed an additional method to date stones using radioactivity.
Whenever an alpha ray is emitted, the emitting atom gets a recoil. If the atom is part
of a crystal, the crystal is damaged by the recoil. The damage can be seen under the mi-
croscope. By counting the damaged regions it is possible to date the time at which rocks
have been crystallized. In this way it has been possible to determine when material from
volcanic eruptions has become rock.
**
Several methods to date wine are used, and more are in development. A few are given in
Table 70.
**
In the 1960s, a bunch of criminals once exploded a nuclear bomb in the stratosphere. As
**
Selected radioactive decay times can be changed by external influence. Electron capture,
as observed in beryllium-7, is one of the rare examples were the decay time can change, by
up to 1.5%, depending on the chemical environment. The decay time for the same isotope
has also been found to change by a fraction of a percent under pressures of 27 GPa. On
the other hand, these effects are predicted (and measured) to be negligible for nuclei of
larger mass.
“ Peter Sloterdijk
Since protons are positively charged, inside nuclei they must be bound by a force strong ”
enough to keep them together against their electromagnetic repulsion. This is the strong Copyright © Christoph Schiller November 1997–January 2007
nuclear interaction. Most of all, the strong interaction tells a good story about the stuff
we are made of.
All stars shine because of fusion. When two light nuclei are fused to a heavier one, some
”
energy is set free, as the average nucleon is bound more strongly. This energy gain is
possible until the nuclei of iron 56 Fe are made. For nuclei beyond this nucleus, the binding
energies per nucleon then decrease again; thus fusion is not energetically possible.*
The different stars observed in the sky** can be distinguished by the type of fusion nuc-
lear reaction that dominates. Most stars, in particular young or light stars run hydrogen
fusion. In fact, there are at least two main types of hydrogen fusion: the direct hydrogen-
hydrogen (p-p) cycle and the CNO cycle(s).
Page 938 The hydrogen cycle described above is the main energy source of the Sun. The simple
description does not fully purvey the fascination of the process. On average, protons in
the Sun’s centre move with 600 km~s. Only if they hit each other precisely head-on can
a nuclear reaction occur; in all other cases, the electrostatic repulsion between the pro-
tons keeps them apart. For an average proton, a head-on collision happens once every 7
thousand million years. Nevertheless, there are so many proton collisions in the Sun that
every second four million tons of hydrogen are burned to helium.
Fortunately for us, the photons generated in the Sun’s centre are ‘slowed’ down by the
The end result of the cycle is the same as that of the hydrogen cycle, both in nuclei and in
energy. The CNO cycle is faster than hydrogen fusion, but requires higher temperatures,
as the protons must overcome a higher energy barrier before reacting with carbon or
Challenge 1431 n nitrogen than when they react with another proton. (Why?) Due to the comparatively low
temperature of a few tens of million kelvin inside the Sun, the CNO cycle is less important
Copyright © Christoph Schiller November 1997–January 2007
than the hydrogen cycle. (This is also the case for the other CNO cycles that exist.) These
studies also explain why the Sun does not collapse. The Sun is a ball of hot gas, and the
high temperature of its constituents prevents their concentration into a small volume. For
some stars, the radiation pressure of the emitted photons prevents collapse; for others it is
the Pauli pressure; for the Sun, like for the majority of stars, it is the usual thermal motion
of the gas.
The nuclear reaction rates at the interior of a star are extremely sensitive to temperat-
ure. The carbon cycle reaction rate is proportional to between T 13 for hot massive O stars
and T 20 for stars like the Sun. In red giants and supergiants, the triple alpha reaction rate
is proportional to T 40 ; these strong dependencies imply that stars shine with constancy
over medium times, since any change in temperature would be damped by a very effi-
cient feedback mechanism. (Of course, there are exceptions: variable stars get brighter
and darker with periods of a few days; and the Sun shows small oscillations in the minute
range.)
How can the Sun’s surface have a temperature of 6000 K, whereas the corona around
it, the thin gas emanating from the Sun, reaches two million Kelvin? In the latter part of
the twentieth century it was shown, using satellites, that the magnetic field of the Sun is
the cause; through the violent flows in the Sun’s matter, magnetic energy is transferred to
the corona in those places were flux tubes form knots, above the bright spots in the left
of Figure 379 or above the dark spots in the right photograph. As a result, the particles of
the corona are accelerated and heat the whole corona.
When the Sun erupts, as shown in the lower left corner in Figure 379, matter is ejected
far into space. When this matter reaches the Earth,* after being diluted by the journey,
it affects the environment. Solar storms can deplete the higher atmosphere and can thus
Copyright © Christoph Schiller November 1997–January 2007
possibly trigger usual Earth storms. Other effects of the Sun are the formation of auroras
and the loss of orientation of birds during their migration; this happens during exception-
ally strong solar storms, as the magnetic field of the Earth is disturbed in these situations.
The most famous effect of a solar storm was the loss of electricity in large parts of Canada
in March of 1989. The flow of charged solar particles triggered large induced currents in
the power lines, blew fuses and destroyed parts of the network, shutting down the power
system. Millions of Canadians had no electricity, and in the most remote places it took
two weeks to restore the electricity supply. Due to the coldness of the winter and a train
accident resulting from the power loss, over 80 people died. In the meantime the network
has been redesigned to withstand such events.
* It might even be that the planets affect the solar wind; the issue is not settled and is still under study.
946 viii inside the nucleus • the strong nuclear interaction
The proton cycle and the CNO cycles are not the only options. Heavier and older stars
than the Sun can also shine through other fusion reactions. In particular, when hydrogen
is consumed, such stars run helium burning:
3 4 He 12
C. (625)
This fusion reaction is of low probability, since it depends on three particles being at the
same point in space at the same time. In addition, small amounts of carbon disappear
rapidly via the reaction α + 12 C 16 O. Nevertheless, since 8 Be is unstable, the reaction
with 3 alpha particles is the only way for the universe to produce carbon. All these neg-
ative odds are countered only by one feature: carbon has an excited state at 7.65 MeV,
which is 0.3 MeV above the sum of the alpha particle masses; the excited state reson-
antly enhances the low probability of the three particle reaction. Only in this way the
universe is able to produce the atoms necessary for pigs, apes and people. The prediction
Because tritium is radioactive, most research experiments are performed with the much
less efficient deuterium–deuterium reactions, which have a lower cross section and a
lower energy gain:
Copyright © Christoph Schiller November 1997–January 2007
D+D T + H + 4 MeV
3
D+D He + n + 3.3 MeV . (627)
Fusion takes place when deuterium and tritium (or deuterium) collide at high energy.
The high energy is necessary to overcome the electrostatic repulsion of the nuclei. In
other words, the material has to be hot. To release energy from deuterium and tritium,
one therefore first needs energy to heat it up. This is akin to the ignition of wood: in order
to use wood as a fuel, one first has to heat it with a match.
* See www.jet.edfa.org.
the strong nuclear interaction and the birth of matter 947
Following the so-called Lawson criterium, published in 1957 by the English engineer
Ref. 958 John Lawson, (but already known to Russian researchers) a fusion reaction releases en-
ergy only if the triple product of density n, reaction (or containment) time τ and temper-
ature T exceeds a certain value. Nowadays this criterium is written as
In order to realize the Lawson criterium, JET uses temperatures of 100 to 200 MK, particle
densities of 2 to 3 ë 1020 m−3 , and confinement times of 1 s. The temperature is much
higher than the 20 MK at the centre of the Sun, because the densities and the confine- Copyright © Christoph Schiller November 1997–January 2007
manages to keep the plasma from touching the walls for about a second; then the situ-
ation becomes unstable: the plasma touches the wall and is absorbed there. After such
a disruption, the cycle consisting of gas injection, plasma heating and fusion has to be
restarted. As mentioned, JET has already achieved ignition, that is the state were more
energy is released than is added for plasma heating. However, so far, no sustained com-
mercial energy production is planned or possible, because JET has no attached electrical
power generator.
The successor project, ITER, an international tokamak built with European, Japanese,
US-American and Russian funding, aims to pave the way for commercial energy gen-
eration. Its linear reactor size will be twice that of JET; more importantly, ITER plans
to achieve 30 s containment time. ITER will use superconducting magnets, so that it will
have extremely cold matter at 4 K only a few metres from extremely hot matter at 100 MK.
In other words, ITER will be a high point of engineering. The facility will be located in
Cadarache in France and is planned to start operation in the year 2016.
Where do our atoms come from? Copyright © Christoph Schiller November 1997–January 2007
People consist of electrons and various nuclei. Electrons, hydrogen and helium nuclei
Page 470 are formed during the big bang. All other nuclei are formed in stars. Young stars run
hydrogen burning or helium burning; heavier and older stars run neon-burning or even
silicon-burning. These latter processes require high temperatures and pressures, which
are found only in stars with a mass at least eight times that of the Sun. However, all fusion
processes are limited by photodissociation and will not lead to nuclei heavier than 56 Fe.
Heavier nuclei can only be made by neutron capture. There are two main processes;
the s-process (for ‘slow’) runs inside stars, and gradually builds up heavy elements until
the most heavy, lead, from neutron flying around. The rapid r-process occurs in stellar
explosions. Many stars die this violent death. Such an explosion has two main effects: on
one hand it distributes most of the matter of the star, such carbon, nitrogen or oxygen,
the strong nuclear interaction and the birth of matter 949
into space in the form of neutral atoms. On the other hand, new elements are synthesized
during the explosion. The abundances of the elements in the solar system can be precisely
measured. These several hundred data points correspond exactly with what is expected
from the material ejected by a (type II) supernova explosion. In other words, the solar
system formed from the remnants of a supernova, as did, somewhat later, life on Earth.*
We all are recycled stardust.
figure to be inserted
F I G U R E 381 A selection of mesons and baryons and their classification as bound states of
quarks
* By chance, the composition ratios between carbon, nitrogen and oxygen inside the Sun are the same as
Copyright © Christoph Schiller November 1997–January 2007
world.
The proton and the neutron are seen as combinations of two quarks, called up (u) and
down (d). Later, other particles lead to the addition of four additional types of quarks.
Their names are somewhat confusing: they are called strange (s), charm (c), bottom (b) –
also called ‘beauty’ in the old days – and top (t) – called ‘truth’ in the past.
All quarks have spin one half; their electric charges are multiples of 1~3 of the electron
charge. In addition, quarks carry a strong charge, which in modern terminology is called
colour. In contrast to electromagnetism, which has only positive, negative, and neutral
charges, the strong interaction has red, blue, green charges on one side, and anti-red,
anti-blue and anti-green on the other. The neutral state is called ‘white’. All baryons and
mesons are white, in the same way that all atoms are neutral.
– CS – details to be added – CS –
Here, the value of the coupling constant α unif is taken at the unifying energy, a factor of
Page 964 1000 below the Planck energy. (See the section of unification below.) In other words, a
general understanding of masses of bound states of the strong interaction, such as the
proton, requires almost purely a knowledge of the unification energy and the coupling
constant at that energy. The approximate value α unif = 1~25 is an extrapolation from the
low energy value, using experimental data.
The proportionality factor in expression (629) is still missing. Indeed, it is not easy to
calculate. Many calculations are now done on computers. The most promising calcula-
tion simplify space-time to a lattice and then reduce QCD to lattice QCD. Using the most
powerful computers available, these calculations have given predictions of the mass of
Ref. 963 the proton and other baryons within a few per cent.
But the mass is not the only property of the proton. Being a cloud of quarks and gluons,
it also has a shape. Surprisingly, it took a long time before people started to become inter-
Copyright © Christoph Schiller November 1997–January 2007
ested in this aspect. The proton is made of two u quarks and one d quark. It thus resembles
a ionized H 2+ molecule, where one electron forms a cloud around two protons. Obviously,
the H 2+ molecule is elongated. Is that also the case for the proton? First results from 2003
seem to point into this direction.
The shape of a molecule will depend on whether other molecules surround it. Recent
research showed that both the size and the shape of the proton in nuclei is slightly variable;
Ref. 964 both seem to depend on the nucleus in which the proton is built-in.
Apart from shapes, molecules also have a colour. The colour of a molecule, like that
of any object, is due to the energy absorbed when it is irradiated. For example, the H 2+
molecule can absorb certain light frequencies by changing to an excited state. Protons
Ref. 965 and neutrons can also be excited; in fact, their excited states have been studied in detail; a
952 viii inside the nucleus • the strong nuclear interaction
figure to be inserted
This means that mu = m d = 330 MeV, a bit more than a third of the nucleon, whose Copyright © Christoph Schiller November 1997–January 2007
mass is c. 940 MeV. If we assume that the quark magnetic moment is proportional to
their charge, we predict a ratio of the magnetic moments of the proton and the neut-
ron of µ p ~µ n = −1.5; this prediction differs from measurements only by 3 %. Using the
values for the magnetic moment of the quarks, magnetic moment values of over half
a dozen of other baryons can be predicted. The results typically deviate from measure-
ments by around 10 %; the sign is always correctly calculated.
— The quark model describes the quantum numbers of mesons and baryons. All texts
on the quark model are full of diagrams such as those shown in Figure 381. These
generalizations of the periodic table of the elements were filled in during the twentieth
century; they allow a complete classification of all mesons and baryons as bound states
of quarks.
— The quark model also explains the mass spectrum of baryons and mesons. The best
the strong nuclear interaction and the birth of matter 953
q
g g g
g g
g g
g
predictions are made by lattice calculations. After one year of computer time, research-
ers were able to reproduce the masses of proton and neutron to within a few per cent.
Even if one sets the u and d quark masses to zero, the resulting proton and neutron
LQ C D = − 41 F µν − c 2 Q m q ψ q ψ qk + i ħc c Q ψ q γ µ (D µ ) k l ψ ql
(a) (a)µν k k
F (631)
q q
where
(a)
F µν = ∂ µ Aaν − ∂ ν Aaµ + дs f abc Abµ Acν
(D µ ) k l = δ k l ∂ µ − i Q λ ak , l Aaµ .
дs
2 a Copyright © Christoph Schiller November 1997–January 2007
We remember from the section on the principle of least action that Lagrangians are always
Page 199 sums of scalar products; this is clearly seen in the formula. Furthermore, the index a =
1 . . . 8 numbers the eight types of gluons, k = 1, 2, 3 numbers the three colours and q =
1, . . . 6 numbers the six quark flavours. The fields Aaµ (x) are the eight gluon fields: they
are the coiled lines in Figure 383. The field ψ qk (x) is the field of the quark of flavour q
and colour k: it is the straight line in the figure. The quark fields are 4-component Dirac
spinors.
The first term of the Lagrangian (631) represents the kinetic energy of the radiation
(gluons), the second term the kinetic energy of the matter particles (the quarks) and the
third term the interaction between the two. The third term in the Lagrangian, the inter-
action term, thus corresponds to the third diagram in Figure 383.
954 viii inside the nucleus • the strong nuclear interaction
Gluons are massless; therefore no gluon mass term appears in the Lagrangian. In op-
position to electromagnetism, where the gauge group U(1) is abelian, the gauge group
SU(3) of the strong interactions is non-abelian. As a consequence, the colour field itself
is charged, i.e. carries colour: the index a appears on A and F. As a result, gluons can
interact with each other, in contrast to photons, which pass each other undisturbed. The
first two diagrams of Figure 383 are thus reflected in the somewhat complicated defini-
(a)
tion of the function F µν . In contrast to electrodynamics, the definition has an extra term
that is quadratic in the fields A; it is described by the interaction strength дs and by the
so-called structure constants f abc . These are the structure constants of the SU(3) algebra.
The interaction between the quarks and the gluons, the third term of the Lagrangian,
is described by the matrices λ ak , l ; they are a fundamental, 3-dimensional representation
σi
γ0 = γn =
I 0 0
[λ a , λ b ] = 2i f abc λ c
λ a , λ b = 4~3δ ab I + 2d abc λ c (633)
where I is the unit matrix. The structure constants f abc , which are odd under permutation of any pair of
indices, and d abc , which are even, are
abc f abc abc d abc abc d abc
º
123 1 118 1~ 3 355 1~2
147 1~2 146 1~2 366 −1~2
156 −1~2 157 1~2º 377 −1~2 º
246 1~2 228 1~ 3 448 −1~(2º3 ) (634)
257 1~2 247 −1~2 558 −1~(2º3 )
345 1~2 256 1~2º 668 −1~(2º3 )
367 º−1~2 338 1~ 3 778 º 3
−1~(2 )
458 º3 ~2 344 1~2 888 −1~ 3
678 3 ~2
All other elements vanish. A fundamental 3-dimensional representation of the generators λ a is given for
Copyright © Christoph Schiller November 1997–January 2007
0 1 0 0 −i 0 1 0 0
λ1 = 1 0 0 λ2 = i 0 0 λ3 = 0 −1 0
0 0 0 0 0 0 0 0 0
0 0 1 0 0 −i 0 0 0
λ4 = 0 0 0 λ5 = 0 0 0 λ6 = 0 0 1
1 0 0 i 0 0 0 1 0
0 0 0
1
1 0 0
λ7 = 0 0 −i λ 8 = º 0 1 0 . (635)
0 i 0 3 0 0 −2
There are eight matrices, one for each gluon type, with 3 3 elements, corresponding to the three colours of
the strong interactions.
956 viii inside the nucleus • the strong nuclear interaction
and gauge invariance (SU(3) in this case) and by specifying the different particles types
(6 quarks in this case) with their masses and the coupling constant.
The Lagrangian is thus almost fixed by construction. We say ‘almost’, because it con-
tains a few parameters that remain unexplained:
— The number and the masses of the quarks are not explained by QCD.
— The coupling constant дs of the strong interaction is unexplained. Often also the equi-
valent quantity α s = дs2 ~4π is used to describe the coupling. Like for the case of the
electroweak interactions, α s and thus дs depend on the energy Q of the experiment.
This energy dependence is indeed observed in experiments; it is described by the renor-
malization procedure:
α s (Q 2 ) =
12π 1
+ ... (636)
33 − 2n f ln Q 22
Λ
where n f is the number of quarks with mass less than the energy scale Q and lies
But when the mass values for quarks are looked up in most tables, the masses of u and d
quarks are only of the order of a few MeV~c 2 , whereas the proton’s mass is 938 MeV~c2 .
What is the story here?
The definition of the masses for quarks is more involved than for other particles.
Quarks are never found as free particles, but only in bound states. Quarks behave almost
like free particles at high energies; this property is called asymptotic freedom. The mass of
such a free quark is called current quark mass; for the light quarks it is only a few MeV~c2 .
At low energy, for example inside a proton, quarks are not free, but must carry along a
* Asymptotic freedom was first discovered by Gerard ‘t Hooft; since he had the Nobel Prize already, the 2004
Prize was then given to the next people who found it: David Gross, David Politzer and Frank Wilczek.
** Only at energies much larger than Λ can a perturbation expansion be applied.
the strong nuclear interaction and the birth of matter 957
large amount of energy due to the confinement process. As a result, bound quarks have a
much larger constituent quark mass, which takes into account this confinement energy. To
give an idea of the values, take a proton; the indeterminacy relation for a particle inside a
º
sphere of radius 0.9 fm gives a momentum indeterminacy of around 190 MeV~c. In three
dimensions this gives an energy of 3 times that value, or a mass of about 330 MeV~c2 .
Three confined quarks are thus heavier than a proton, whose mass is 938 MeV~c2 ; we can
thus still say that a compound proton is less massive than its constituents. In short, the
mass of the proton and the neutron is (almost exclusively) the kinetic energy of the quarks
Ref. 977 inside them, as their own rest mass is negligible. As Frank Wilczek says, some people put
on weight even though they never eat anything heavy.
To complicate the picture, the distinction of the two mass types makes no sense for
the top quark; this quark decays so rapidly that the confinement process has no time to
set in. As a result, the top mass is again a mass of the type we are used to.
Nobody knows whether this is necessary: the race for a simple approximation method
for finding solutions is still open.
**
Even though gluons are massless, like photons and gravitons, there is no colour radiation.
Gluons carry colour and couple to themselves; as a result, free gluons were predicted
to directly decay into quark–antiquark pairs. This decay has indeed been observed in
experiments at particle accelerators.
Something similar to colour radiation, but still stranger might have been found
in 1997. First results seem to confirm the prediction of glueballs from numerical
Ref. 978 calculations.
958 viii inside the nucleus • the strong nuclear interaction
**
The latest fashion in high energy physics is the search for hybrid mesons, particles made
Ref. 978 of gluons and quarks. This fashion is not over yet; the coming years should settle whether
the candidates known so far really are hybrids.
**
Do particles made of five quarks, so-called pentaquarks, exist? So far, they seem to exist
only in a few laboratories in Japan, whereas other laboratories across the world fail to see
them. The issue is still open.
**
Whenever we look at a periodic table of the elements, we look at a manifestation of the
strong interaction. The Lagrangian of the strong interaction describes the origin and prop-
erties of the presently known 115 elements.
**
In the 1960s and 70s, it was discovered that the Sun pulsates with a frequency of 5 minutes.
The effect is small, only 3 kilometres out of 1.4 million; still it is measurable. In the mean-
time, helioseismologists have discovered numerous additional oscillations of the Sun, and Copyright © Christoph Schiller November 1997–January 2007
in 1993, even on other stars. Such oscillations allow to study what is happening inside
stars, even separately in each of the layers they consist of.
**
Historically, nuclear reactions also provided the first test of the relation E = γmc 2 . This
was achieved in 1932 by Cockcroft and Walton. They showed that by shooting protons
into lithium one gets the reaction
7 1 8 4 4
3 Li + 1 H 4 Be 2 He + 2 He + 17 MeV . (637)
The measured energy on the right is exactly the value that is derived from the differences
in total mass of the nuclei on both sides.
the weak nuclear interaction and the handedness of nature 959
**
Some stars shine like a police siren: their luminosity increases and decreases regularly.
Such stars, called Cepheids, are important because their period depends on their average
(absolute) brightness. Measuring their period and their brightness on Earth thus allows
astronomers to determine their distance.
**
Nucleosynthesis is mainly regulated by the strong interaction. However, if the electromag-
netic interaction would be much stronger or much weaker, stars would either produce too
little oxygen or too little carbon. This argument is due to Fred Hoyle. Can you fill in the
Challenge 1434 d details?
The next weird characteristic of the weak interaction is the nonconservation of parity
under spatial inversion. The weak interaction distinguishes between mirror systems, in
contrast to everyday life, gravitation, electromagnetism, and the strong interactions. Par-
ity non-conservation had been predicted by 1956 by Lee Tsung-Dao and Yang Chen Ning,
and was confirmed a few months later, earning them a Nobel Prize. (They had predicted
the effect in order to explain the ability of K mesons to decay either into 2 or into 3 pions.)
The most beautiful consequence of parity non-conservation property is its influence on
Ref. 982 the colour of certain atoms. This prediction was made in 1974 by Bouchiat and Bouchiat.
The weak interaction is triggered by the weak charge of electrons and nuclei. Therefore,
electrons in atoms do not exchange only virtual photons with the nucleus, but also virtual
Z particles. The chance for this latter process is extremely small, around 10−11 times smal-
960 viii inside the nucleus • the weak nuclear interaction
ler than virtual photon exchange. But since the weak interaction is not parity conserving,
this process allows electron transitions which are impossible by purely electromagnetic ef-
fects. In 1984, measurements confirmed that certain optical transitions of caesium atoms
that are impossible via the electromagnetic interaction, are allowed when the weak inter-
Ref. 983 action is taken into account. Several groups have improved these results and have been
able to confirm the prediction of the weak interaction, including the charge of the nucleus,
Ref. 988 to within a few per cent.
**
The weak interaction is required to have an excess of matter over antimatter. Without the
parity breaking of the weak interactions, there would be no matter at all in the universe.
**
Through the emitted neutrinos, the weak interaction helps to get the energy out of a
supernova. If that were not the case, black holes would form, heavier elements – of which
we are made – would not have been spread out into space, and we would not exist.
**
Ref. 981 The paper by Peter Higgs on the boson named after him is only 79 lines long, and has
only five equations.
**
The weak interaction is not parity invariant. In other words, when two electrons collide,
the fraction of the collisions that happens through the weak interaction should behave
differently than a mirror experiment. In 2004, polarized beams of electrons – either left-
handed or right-handed – were shot at a matter target and the reflected electrons were
Copyright © Christoph Schiller November 1997–January 2007
counted. The difference was 175 parts per billion – small, but measurable. The experiment
also confirmed the predicted weak charge of -0.046 of the electron.
**
The weak interaction is also responsible for the heat produced inside the Earth. This heat
keeps the magma liquid. As a result, the weak interaction, despite its weakness, is respons-
ible for all earthquakes, tsunamis and volcanic eruptions.
**
Beta decay, due to the weak interaction, separates electrons and protons. Only in 2005
people have managed to propose practical ways to use this effect to build long-life batter-
the weak nuclear interaction and the handedness of nature 961
ies that could be used in satellites. Future will tell whether the method will be successful.
Mass, the Higgs boson and a ten thousand million dollar lie
Difficile est satiram non scribere.
“
In the years 1993 and 1994 an intense marketing campaign was carried out across the
Juvenal*
”
United States of America by numerous particle physicists. They sought funding for the
‘superconducting supercollider’, a particle accelerator with a circumference of 80 km. This
should have been the largest human machine ever built, with a planned cost of more than
ten thousand million dollars, aiming at finding the Higgs boson before the Europeans
would do so, at a fraction of the cost. The central argument brought forward was the
following: since the Higgs boson was the basis of mass, it was central to science to know
about it. Apart from the discussion on the relevance of the argument, the worst is that it
— Solar neutrinos arrive on Earth at 6 ë 1014 ~m2 s, with an energy from 0 to 0.42 MeV;
they are due to the p-p reaction in the sun; a tiny is due to the 8 B reaction and has
energies up to 15 MeV.
— Atmospheric neutrinos are products of cosmic rays hitting the atmosphere, consist of
2/3 of muon neutrinos and one third of electron neutrinos, and have energies mainly
between 100 MeV and 5 GeV.
Page 933 — Earth neutrinos from the radioactivity that keeps the Earth warm form a flux of
6 ë 1010 ~m2 s.
— Fossil neutrinos from the big bang, with a temperature of 1.95 K are found in the uni-
verse with a density of 300 cm−3 , corresponding to a flux of 1015 ~m2 s.
* ‘It is hard not to be satirical.’ 1, 30
962 viii inside the nucleus • the weak nuclear interaction
— Man-made neutrinos are produced in nuclear reactors (at 4 MeV) and as neutrino
beams in accelerators, using pion and kaon decay. A standard nuclear plant produces
5ë1020 neutrinos per second. Neutrino beams are produced, for example, at the CERN
in Geneva. They are routinely sent 700 km across the Earth to central Italy, where they
are detected.
They are mainly created in the atmosphere by cosmic radiation, but also coming
directly from the background radiation and from the centre of the Sun. Nevertheless,
during our whole life – around 3 thousand million seconds – we have only a 10 %
chance that one of them interacts with one of the 3ë1027 atoms of our body. The reason
is that the weak interaction is felt only over distances less than 10−17 m, about 1/100th
of the diameter of a proton. The weak interaction is indeed weak.
**
The weak interaction is so weak that a neutrino–antineutrino annihilation – which is only
**
Only one type of particles interacts (almost) only weakly: neutrinos. Neutrinos carry no
electric charge, no colour charge and almost no gravitational charge (mass). To get an
impression of the weakness of the weak interaction, it is usually said that the probability
of a neutrino to be absorbed by a lead screen of the thickness of one light-year is less than
50 %. The universe is thus essentially empty for neutrinos. Is there room for bound states
of neutrinos circling masses? How large would such a bound state be? Can we imagine
bound states, which would be called neutrinium, of neutrinos and antineutrinos circling
each other? The answer depends on the mass of the neutrino. Bound states of massless
particles do not exist. They could and would decay into two free massless particles.*
Since neutrinos are massive, a neutrino–antineutrino bound state is possible in prin-
Challenge 1435 ny ciple. How large would it be? Does it have excited states? Can they ever be detected? These
issues are still open.
**
Do ruminating cows move their jaws equally often in clockwise and anticlockwise dir-
ection? In 1927, the theoretical physicists Pascual Jordan and Ralph de Laer Kronig pub- Copyright © Christoph Schiller November 1997–January 2007
Ref. 984 lished a study showing that in Denmark the two directions are almost equally distributed.
The rumination direction of cows is thus not related to the weak interaction.
**
The weak interaction plays an important part in daily life. First of all, the Sun is shining.
The fusion of two protons to deuterium, the first reaction of the hydrogen cycle, implies
Page 939 that one proton changes into a neutron. This transmutation and the normal beta decay
Challenge 1436 e have the same first-order Feynman diagram. The weak interaction is thus essential for
the burning of the Sun. The weakness of the process is one of the guarantees that the Sun
will continue burning for quite some time.
* In particular, this is valid for photons bound by gravitation; this state is not possible.
the standard model – as seen on television 963
**
Of course, the weak interaction is responsible for radioactive beta decay, and thus for part
of the radiation background that leads to mutations and thus to biological evolution.
**
What would happen if the Sun suddenly stopped shining? Obviously, temperatures would
fall by several tens of degrees within a few hours. It would rain, and then all water would
freeze. After four or five days, all animal life would stop. After a few weeks, the oceans
would freeze; after a few months, air would liquefy.
**
Not everything about the Sun is known. For example, the neutrino flux from the Sun
oscillates with a period of 28.4 days. That is the same period with which the magnetic
field of the Sun oscillates. The connections are still being studied.
British team led by Ken Ledingham observed laser induced uranium fission in 238 U nuc-
lei. In the meantime, this has even be achieved with table-top lasers. The latest feat, in
2003, was the transmutation of 129 I to 128 I with a laser. This was achieved by focussing
Ref. 985 a 360 J laser pulse onto a gold foil; the ensuing plasma accelerates electrons to relativ-
istic speed, which hit the gold and produce high energy γ rays that can be used for the
transmutation.
964 viii inside the nucleus • 32. grand unification – a simple dream
“ Albertus Magnus
Is there a common origin of the three particle interactions? We have seen in the preced- ” Copyright © Christoph Schiller November 1997–January 2007
ing sections that the Lagrangians of the electromagnetic, the weak and the strong nuclear
interactions are determined almost uniquely by two types of requirements: to possess a
certain symmetry and to possess mathematical consistency. The search for unification of
the interactions thus requires the identification of the unified symmetry of nature. In re-
cent decades, several candidate symmetries have fuelled the hope to achieve this program:
grand unification, supersymmetry, conformal invariance and coupling constant duality.
The first of them is conceptually the simplest.
At energies below 1000 GeV there are no contradictions between the Lagrangian of
the standard model and observation. The Lagrangian looks like a low energy approxima-
* ‘Matter is coagulated light.’ Albertus Magnus (b. c. 1192 Lauingen, d. 1280 Cologne), the most important
thinker of his time.
grand unification – a simple dream 965
tion. It should thus be possible (attention, this a belief) to find a unifying symmetry that
contains the symmetries of the electroweak and strong interactions as subgroups and
thus as different aspects of a single, unified interaction; we can then examine the physical
properties that follow and compare them with observation. This approach, called grand
unification, attempts the unified description of all types of matter. All known elementary
particles are seen as fields which appear in a Lagrangian determined by a single symmetry
group.
Like for each gauge theory described so far, also the grand unified Lagrangian is mainly
determined by the symmetry group, the representation assignments for each particle, and
the corresponding coupling constant. A general search for the symmetry group starts
Ref. 986 with all those (semisimple) Lie groups which contain U(1)SU(2)SU(3). The smallest
groups with these properties are SU(5), SO(10) and E(8); they are defined in Appendix D.
For each of these candidate groups, the experimental consequences of the model must be
studied and compared with experiment.
All grand unified models predict the existence of magnetic monopoles, as was shown
Ref. 989 by Gerard ’t Hooft. However, despite extensive searches, no such particles have been
found yet. Monopoles are important even if there is only one of them in the whole uni-
verse: the existence of a single monopole implies that electric charge is quantized. Grand
unification thus explains why electric charge appears in multiples of a smallest unit.
Copyright © Christoph Schiller November 1997–January 2007
Grand unification predicts the existence of heavy intermediate vector bosons, called
X bosons. Interactions involving these bosons do not conserve baryon or lepton number,
but only the difference B − L between baryon and lepton number. To be consistent with
experiment, the X bosons must have a mass of the order of 1016 GeV.
Most spectacularly, the X bosons grand unification implies that the proton decays.
This prediction was first made by Pati and Salam in 1974. If protons decay, means that
neither coal nor diamond* – nor any other material – is for ever. Depending on the pre-
cise symmetry group, grand unification predicts that protons decay into pions, electrons,
kaons or other particles. Obviously, we know ‘in our bones’ that the proton lifetime is
* As is well known, diamond is not stable, but metastable; thus diamonds are not for ever, but coal might be,
if protons do not decay.
966 viii inside the nucleus • 32. grand unification – a simple dream
rather high, otherwise we would die of leukaemia; in other words, the low level of cancer
already implies that the lifetime of the proton is larger than 1016 a.
Detailed calculations for the proton lifetime τ p using SU(5) yield the expression
M X4
τp 10311 a
1
αG2 (M X ) M p5
(640)
where the uncertainty is due to the uncertainty of the mass M X of the gauge bosons in-
volved and to the exact decay mechanism. Several large experiments aim to measure this
lifetime. So far, the result is simple but clear. Not a single proton decay has ever been
Ref. 990 observed. The experiments can be summed up by
τ(p e + π 0 ) A 5 ë 1033 a
τ(p K + ν̄) A 1.6 ë 1033 a
These values are higher than the prediction by SU(5). To settle the issue definitively, one
last prediction of grand unification remains to be checked: the unification of the coupling
constants.
particles that change the curves at intermediate energies, so that they all meet at a grand
unification energy of about 1016 GeV. (The line thicknesses in Figure 384 represent the
experimental errors.) The inclusion of supersymmetry also puts the proton lifetime pre-
diction back to a value higher (but not by much) than the present experimental bound
and predicts the correct value of the mixing angle. With supersymmetry, we can thus re-
tain all advantages of grand unification (charge quantization, fewer parameters) without
being in contradiction with experiments. The predicted particles, not yet found, are in
a region accessible to the LHC collider presently being built at CERN in Geneva. We will
explore supersymmetry later on.
* The graph shows the constants α 1 = 35 α em ~ cos2 θ W for the electromagnetic interaction, α 2 = α em ~ sin2 θ W
for the weak interaction and the strong coupling constant α 3 = α s .
grand unification – a simple dream 967
1/α i 1/α i
60 60
1/α1 SM 1/α1 MSSM
50 50
40 40
1/α2
1/α2
30 30
20 20
1/α3
1/α3
10 10
0 0
0 5 10 15 0 5 10 15
10 10 10 10 Q/GeV 10 10 10 10 Q/GeV
Eventually, some decay and particle data will become available. Even though these
experimental results will require time and effort, a little bit of thinking shows that they
probably will be only partially useful. Grand unification started out with the idea to unify
the description of matter. But this ambitious goal cannot been achieved in this way. Grand
unification does eliminate a certain number of parameters from the Lagrangians of QCD
and QFD; on the other hand, some parameters remain, even if supersymmetry is added.
Most of all, the symmetry group must be put in from the beginning, as grand unification
cannot deduce it from a general principle.
If we look at the open points of the standard model, grand unification reduces their
number. However, grand unification only shifts the open questions of high energy physics
to the next level, while keeping them unanswered. Grand unification remains a low en-
ergy effective theory. Grand unification does not tell us what elementary particles are; the
name ‘grand unification’ is ridiculous. In fact, the story of grand unification is a first hint
that looking at higher energies using only low-energy concepts is not the way to solve the
mystery of motion. We definitively need to continue our adventure. Copyright © Christoph Schiller November 1997–January 2007
968 viii inside the nucleus
Biblio graphy
952 A excellent technical introduction to nuclear physics is B ogdan Povh, Kl aus R ith,
Christoph Scholz & Frank Zetsche, Teilchen und Kerne, Springer, 5th edition,
1999. It is also available in English translation.
One of the best introductions into particle physics is Kurt Gottfried & Victor F.
Weisskopf, Concepts of Particle Physics, Clarendon Press, Oxford, 1984. Victor Weisskopf
was one of the heroes of the field, both in theoretical research and in the management of
CERN, the particle research institution. Cited on page 919.
953 W.C.M. Weijmar Schultz & al., Magnetic resonance imaging of male and female
genitals during coitus and female sexual arousal, British Medical Journal 319, pp. 1596–1600,
December 18, 1999, available online as http://www.bmj.com/cgi/content/full/319/7225/1596.
Cited on page 920.
954 A good overview is given by A.N. Halliday, Radioactivity, the discovery of time and the
proton and the observation of scaling, Review of Modern Physics 63, pp. 597–614, 1991, and
J.I. Friedman, Deep inelastic scattering: Comparisons with the quark model, Review of
Modern Physics 63, pp. 615–620, 1991. Cited on page 949.
960 About the strange genius of Gell-Mann, see the beautiful book by George Johnson, Mur-
ray Gell-Mann and the Revolution in Twentieth-Century Physics, Knopf, 1999. Cited on page
950.
961 G. Zweig, An SU3 model for strong interaction symmetry and its breaking II, CERN Re-
port No. 8419TH. 412, February 21, 1964. Cited on page 949.
962 F. Wilczek, Getting its from bits, Nature 397, pp. 303–306, 1999. Cited on page 951.
963 For an overview of lattice QCD calculations, see ... Cited on page 951.
964 S. Strauch & al., Polarization transfer in the 4 He (e,e’p) 3 H reaction up to Q 2 = 2.6(GeV/
c)2 , Physical Review Letters 91, p. 052301, 2003. Cited on page 951.
bibliography 969
965 The excited states of the proton and the neutron can be found in on the particle data group
website at http://pdg.web.cern.ch. Cited on page 951.
966 An older but fascinating summary of solar physics is R. Kippenhahn, Hundert Milliarden
Sonnen, Piper, 1980. which is available also in English translation. No citations.
967 M. Brunetti, S. Cecchini, M. Galli, G. Giovannini & A. Pagliarin, Gamma-
ray bursts of atmospheric origin in the MeV energy range, Geophysical Research Letters 27,
p. 1599, 1 June 2000. Cited on page 939.
968 A book with nuclear explosion photographs is Michael Light, 100 Suns, Jonathan Cape,
2003. Cited on page 941.
969 J. Dudek, A. God, N. Schunck & M. Mikiewicz, Nuclear tetrahedral symmetry:
possibly present throughout the periodic table, Physical Review Letters 88, p. 252502, 24
June 2002. Cited on page 935.
970 A good introduction is R. Cl ark & B. Wodsworth, A new spin on nuclei, Physics World
pp. 25–28, July 1998. Cited on page 935.
trophysik, Teubner Verlag, Stuttgart, 1989. Also available in English and in several other
languages. Cited on page 961.
980 D. Treille, Particle physics from the Earth and from teh sky: Part II, Europhysics News
35, no. 4, 2004. Cited on page 961.
981 P.W. Higgs, Broken symmetries, massless particles and gauge fields, Physics Letters 12,
pp. 132–133, 1964. He then expanded the story in P.W. Higgs, Spontaneous symmetry
breakdown without massless bosons, Physical Review 145, p. 1156-1163, 1966. Higgs gives
most credit to Anderson, instead of to himself; he also mentions Brout and Englert, Gural-
nik, Hagen, Kibble and ‘t Hooft. Cited on page 960.
982 M.A. B ouchiat & C.C. B ouchiat, Weak neutral currents in atomic physics, Physics
Letters B 48, pp. 111–114, 1974. U. Amaldi, A. B öhm, L.S. Durkin, P. L angacker,
A.K. Mann, W.J. Marciano, A. Sirlin & H.H. Williams, Comprehensive analysis
970 viii inside the nucleus
of data pertaining to the weak neutral current and the intermediate-vector-boson masses,
Physical Review D 36, pp. 1385–1407, 1987. Cited on page 959.
983 M.C. Noecker, B.P. Masterson & C.E. Wiemann, Precision measurement of parity
nonconservation in atomic cesium: a low-energy test of electroweak theory, Physical Review
Letters 61, pp. 310–313, 1988. See also D.M. Meekhof & al., High-precision measurement
of parity nonconserving optical rotation in atomic lead, Physical Review Letters 71, pp. 3442–
3445, 1993. Cited on page 960.
984 Rumination is studied in P. Jordan & R. de L aer Kronig, in Nature 120, p. 807, 1927.
Cited on page 962.
985 K.W.D. Ledingham & al., Photonuclear physics when a multiterawatt laser pulse inter-
acts with solid targets, Physical Review Letters 84, pp. 899–902, 2000. K.W.D. Ledingham
& al., Laser-driven photo-transmutation of Iodine-129 – a long lived nuclear waste product,
Journal of Physics D: Applied Physics 36, pp. L79–L82, 2003. R.P. Singhal, K.W.D. Led-
ingham & P. McKenna, Nuclear physics with ultra-intense lasers – present status and fu-
ture prospects, Recent Research Developments in Nuclear Physics 1, pp. 147–169, 2004. Cited
QUANTUM PHYSIC S
IN A NUTSHELL
Quantum theory’s essence – the l ack of the infinitely
small
C
ompared to classical physics, quantum theory is remarkably more
omplex. The basic idea however, is simple: in nature there is a minimum
Achievements in precision
Quantum theory improved the accuracy of predictions from the few – if any – digits
common in classical mechanics to the full number of digits – sometimes fourteen – that
can be measured today. The limited precision is usually not given by the inaccuracy of
theory, it is given by the measurement accuracy. In other words, the agreement is only
limited by the amount of money the experimenter is willing to spend. Table 71 shows this
in more detail.
TA B L E 71 Some comparisons between classical physics, quantum theory and experiment
si c a l q ua n t um ment e st i -
pre- theorya m at e b
dic-
tion
Simple motion of bodies
Indeterminacy 0 ∆x∆p E ħ~2 (1 10−2 ) ħ~2 10 k€
Wavelength of matter none λp = 2πħ (1 10−2 ) ħ 10 k€
beams
Tunnelling rate in alpha 0 1~τ is finite (1 10−2 ) τ 5 k€
decay
Compton wavelength none λ c = h~m e c (1 10−3 ) λ 20 k€
974 x quantum physics in a nu tshell
O b se r va b l e C l as- Prediction of M e a su r e - C o st b
si c a l q ua n t um ment e st i -
pre- theorya m at e
dic-
tion
Pair creation rate 0 ... agrees 20 M€
Radiative decay time in none τ 1~n 3 (1 10−2 ) 5 k€
hydrogen
Smallest action and 0 ħ~2 (1 10−6 ) ħ~2 10 k€
angular momentum
Casimir effect 0 ma~A = (π 2 ħc)~(240r 4 ) (1 10−3 ) ma 30 k€
Colours of objects
Lamb shift none ∆λ = 1057.86(1) MHz (1 10−6 ) ∆λ 50 k€
Rª = m e cα 2 ~2h (1 10−9 ) Rª
Page 981 a. All these predictions are calculated from the quantities of Table 72, and no other input. Their
Page 1170 most precise experimental values are given in Appendix B.
b. Sometimes the cost for the calculation of the prediction is higher than that of the experimental
Challenge 1437 n observation. (Can you spot the examples?) The sum of the two is given.
quantum theory’s essence – the l ack of the infinitely small 975
We notice that the predicted values are not noticeably different from the measured ones.
If we remember that classical physics does not allow to calculate any of the predicted
values we get an idea of the progress quantum physics has allowed. But despite this im-
pressive agreement, there still are unexplained observations. In fact, these unexplained
Page 981 observations provide the input for the calculations just cited; we list them in detail below,
in Table 72.
In summary, in the microscopic domain we are left with the impression that quantum
theory is in perfect correspondence with nature; despite prospects of fame and riches,
despite the largest number of researchers ever, no contradiction with observation has
been found yet.
In nature, actions smaller than ħ~2 = 0.53 ë 10−34 Js are not observed.
All intrinsic properties in nature – with the exception of mass – such as elec-
tric charge, spin, parities, etc., appear as integer numbers; in composed systems
they either add or multiply.
In fact, the second statement results from the first. The existence of a smallest action in
nature directly leads to the main lesson we learned about motion in the second part of
our adventure:
This statement applies to everything, thus to all objects and to all images, i.e. to matter
and to radiation. Moving stuff is made of quanta. Stones, water waves, light, sound waves,
earthquakes, gelatine and everything else we can interact with is made of particles. We
started the second part of our mountain ascent with the title question: what is matter and Copyright © Christoph Schiller November 1997–January 2007
what are interactions? Now we know: they are composites of elementary particles.
To be clear, an elementary particle is a countable entity, smaller than its own Compton
Page 1190 wavelength, described by energy, momentum, and the following complete list of intrinsic
properties: mass, spin, electric charge, parity, charge parity, colour, isospin, strangeness,
charm, topness, beauty, lepton number, baryon number and R-parity. Experiments so far
failed to detect a non-vanishing size for any elementary particle.
Moving entities are made of particles. To see how deep this result is, you can apply
it to all those moving entities for which it is usually forgotten, such as ghosts, spirits,
angels, nymphs, daemons, devils, gods, goddesses and souls. You can check yourself what
Challenge 1438 e happens when their particle nature is taken into account.
From the existence of a minimum action, quantum theory deduces all its statements
about particle motion. We go through the main ones.
976 x quantum physics in a nu tshell
There is no rest for microscopic particles. All objects obey the indeterminacy principle,
which states that the indeterminacies in position x and momentum p follow
and making rest an impossibility. The state of particles is defined by the same observables
as in classical physics, with the difference that observables do not commute. Classical
physics appears in the limit that the Planck constant ħ can effectively be set to zero.
Quantum theory introduces a probabilistic element into motion. It results from the
minimum action value through the interactions with the baths in the environment of
any system.
Large number of identical particles with the same momentum behave like waves. The
so-called de Broglie wavelength λ is given by the momentum p of a single particle through
both in the case of matter and of radiation. This relation is the origin of the wave be-
haviour of light. The light particles are called photons; their observation is now standard
practice. All waves interfere, refract and diffract. This applies to electrons, atoms, photons
and molecules. All waves being made of particles, all waves can be seen, touched and
moved. Light for example, can be ‘seen’ in photon-photon scattering, can be ‘touched’ us-
ing the Compton effect and it can be ‘moved’ by gravitational bending. Matter particles,
such as molecules or atoms, can be seen, e.g. in electron microscopes, as well as touched
and moved, e.g. with atomic force microscopes. The interference and diffraction of wave
particles is observed daily in the electron microscope.
Particles cannot be enclosed. Even though matter is impenetrable, quantum theory
shows that tight boxes or insurmountable obstacles do not exist. Waiting long enough
always allows to overcome boundaries, since there is a finite probability to overcome any
obstacle. This process is called tunnelling when seen from the spatial point of view and is
called decay when seen from the temporal point of view. Tunnelling explains the working
of television tubes as well as radioactive decay.
Particles are described by an angular momentum called spin, specifying their beha-
viour under rotations. Bosons have integer spin, fermions have half integer spin. An even Copyright © Christoph Schiller November 1997–January 2007
number of bound fermions or any number of bound bosons yield a composite boson; an
odd number of bound fermions or an infinite number of interacting bosons yield a low-
energy fermion. Solids are impenetrable because of the fermion character of its electrons
in the atoms.
Identical particles are indistinguishable. Radiation is made of bosons, matter of fermi-
ons. Under exchange, fermions commute at space-like separations, whereas bosons anti-
commute. All other properties of quantum particles are the same as for classical particles,
namely countability, interaction, mass, charge, angular momentum, energy, momentum,
position, as well as impenetrability for matter and penetrability for radiation.
In collisions, particles interact locally, through the exchange of other particles. When
matter particles collide, they interact through the exchange of virtual bosons, i.e. off-shell
bosons. Motion change is thus due to particle exchange. Exchange bosons of even spin
quantum theory’s essence – the l ack of the infinitely small 977
mediate only attractive interactions. Exchange bosons of odd spin mediate repulsive in-
teractions as well.
Quantum theory defines elementary particles as particles smaller than their own
Compton wavelength. Experiments so far failed to detect a non-vanishing size for any
elementary particle.
The properties of collisions imply the existence of antiparticles, as regularly observed
in experiments. Elementary fermions, in contrast to many elementary bosons, differ from
their antiparticles; they can be created and annihilated only in pairs. Apart from neutri-
nos, elementary fermions have non-vanishing mass and move slower than light.
Images, made of radiation, are described by the same properties as matter. Images can
only be localized with a precision of the wavelength λ of the radiation producing it.
The appearance of Planck’s constant ħ implies that length scales exist in nature.
Quantum theory introduces a fundamental jitter in every example of motion. Thus the
infinitely small is eliminated. In this way, lower limits to structural dimensions and to
accelerating observers, the Casimir effect, i.e. the attraction of neutral conducting bodies,
and the limit for the localization of particles. In fact, an object of mass m can be localized
only within intervals of the Compton wavelength
λC = =
h 2πħ
, (644)
mc mc
where c is the speed of light. At the latest at these distances we must abandon the classical
description and use quantum field theory. Quantum field theory introduces corrections
to classical electrodynamics; among others, the nonlinearities thus appearing produce
small departures from the superposition principle for electromagnetic fields, resulting in
978 x quantum physics in a nu tshell
photon-photon scattering.
Composite matter is separable because of the finite interaction energies of the con-
stituents. Atoms are made of a nucleus made of quarks, and of electrons. They provide an
effective minimal length scale to all everyday matter.
Elementary particles have the same properties as either objects or images, except di-
visibility. The elementary fermions (objects) are: the six leptons electron, muon, tau, each
with its corresponding neutrino, and the six quarks. The elementary bosons (images) are
Page 1190 the photon, the eight gluons and the two weak interaction bosons.
Quantum chromodynamics, the field theory of the strong interactions, explains the
masses of mesons and baryons through its descriptions as bound quark states. At funda-
mental scales, the strong interaction is mediated by the elementary gluons. At femtometer
scales, the strong interaction effectively acts through the exchange of spin 0 pions, and is
thus strongly attractive.
The theory of electroweak interactions describes the unification of electromagnetism
**
The world looks irreversible, even though it isn’t. We never remember the future. We are
fooled because we are macroscopic.
**
The world looks decoherent, even though it isn’t. We are fooled again because we are
macroscopic.
**
There are no clocks possible in nature. We are fooled because we are surrounded by a
huge number of particles.
**
Motion seems to disappear, even though it is eternal. We are fooled again, because our
**
The world seems dependent on the choice of the frame of reference, even though it is not.
We are fooled because we are used to live on the surface of the Earth.
**
Objects seem distinguishable, even though the statistical properties of their components
show that they are not. We are fooled because we live at low energies.
**
Matter looks continuous, even though it isn’t. We are fooled because of the limitations of
our senses.
In short, our human condition permanently fools us. The answer to the title question is
affirmative: quantum theory is magic. That is its main attraction.
Ref. 993 on every product package. It shows in detail how deeply our human condition fools us.
This product contains particles moving with speeds higher than one million kilometres per
hour.
Every kilogram of this product contains the same amount of energy as liberated by about
one hundred nuclear bombs.*
In case this product is brought in contact with antimatter, a catastrophic explosion will
occur.
In case this product is rotated, it will emit gravitational radiation.
Warning: care should be taken when transporting this product:
The force needed depends on its velocity, as does its weight.
This product will emit additional radiation when accelerated.
This product attracts, with a force that increases with decreasing distance, every other object
around, including its purchaser’s kids.
The constituents of this product are exactly the same as those of any other object in the
universe, including those of rotten fish.
The use could be disturbed by the (possibly) forthcoming collapse of the universe.
* A standard nuclear warhead has an explosive yield of about 0.2 megatons (implied is the standard explosive
Ref. 994 trinitrotoluene or TNT), about thirteen times the yield of the Hiroshima bomb, which was 15 kilotonne. A
megatonne is defined as 1 Pcal=4.2 PJ, even though TNT delivers about 5 % slightly less energy than this
value. In other words, a megaton is the energy content of about 47 g of matter. That is less than a handful for
most solids or liquids.
the essence and the limits of quantum theory 981
TA B L E 72 Everything quantum field theory and general relativity do not explain; in other words, a list of
the only experimental data and criteria available for tests of the unified description of motion
O b se r va b l e P r o p e r t y u n e x p l a i n e d so f a r
O b se r va b l e P r o p e r t y u n e x p l a i n e d so f a r
The table has several notable aspects.* First of all, neither quantum mechanics nor general
relativity explain any property unexplained in the other field. The two theories do not
help each other; the unexplained parts of both fields simply add up. Secondly, both in
quantum theory and in general relativity, motion still remains the change of position
with time. In short, in the first two parts of this walk we did not achieve our goal: we still
do not understand motion. Our basic questions remain: What is time and space? What is
mass? What is charge and what are the other properties of objects? What are fields? Why
are all the electrons the same?
We also note that Table 72 lists extremely different concepts. That means that at this
point of our walk there is a lot we do not understand. Finding the answers will not be
easy, but will require effort.
On the other hand, the list of unexplained properties of nature is also short. The de-
scription of nature our adventure has produced so far is concise and precise. No discrep-
ancies from experiments are known. In other words, we have a good description of mo-
Copyright © Christoph Schiller November 1997–January 2007
* Every now and then, researchers provide other lists of open questions. However, they all fall into the list
above. The elucidation of dark matter and of dark energy, the details of the big bang, the modifications of
general relativity by quantum theory, the mass of neutrinos, the quest for unknown elementary particles
such as the inflaton field, magnetic monopoles or others, the functioning of cosmic high-energy particle
accelerators, the stability or decay of protons, the origins of the heavy chemical elements, other interactions
between matter and radiation or the possibility of higher spatial dimensions are questions that all fall into
the table above.
the essence and the limits of quantum theory 983
Classical Quantum
gravitation gravity
1685 ca 1990
Classical ħ, e, k Theory
mechanics of motion
1650 Quantum General 2020?
theory relativity
1925 1915
F I G U R E 385 A simplified history of the description of motion in physics, by giving the limits
to motion included in each description
An even more suggestive summary of the progress and open issues of physics is shown
in Figure 385. From one corner of a cube, representing Galilean physics, three edges – la-
belled G, c and ħ, e, k – lead to classical gravity, special relativity and quantum theory.
Each constant implies a limit to motion; in the corresponding theory, this limit is taken
into account. From these first level theories, corresponding parallel edges lead to general
relativity, quantum field theory and quantum gravity, which take into account two of the
limits.* From the second level theories, all edges lead to the last missing corner; that is the
theory of motion. It takes onto account all limits found so far. Only this theory is a full
or unified description of motion. The important point is that we already know all limits
to motion. To arrive at the last point, no new experiments are necessary. No new know-
ledge is required. We only have to advance in the right direction, with careful thinking. Copyright © Christoph Schiller November 1997–January 2007
Reaching the final theory of motion is the topic of the third part of our adventure.
Finally, we note from Table 72 that all progress we can expect about the foundations
of motion will take place in two specific fields: cosmology and high energy physics.
* Of course, Figure 385 gives a simplified view of the history of physics. A more precise diagram would
use three different arrows for ħ, e and k, making the figure a five-dimensional cube. However, not all of its
Challenge 1439 ny corners would have dedicated theories (can you confirm this?), and moreover, the diagram would be much
less appealing.
984 x quantum physics in a nu tshell
How to delude oneself that one has reached the top of Motion
Mountain
Nowadays it is deemed chic to pretend that the adventure is over at the stage we have just
reached.* The reasoning is as follows. If we change the values of the unexplained constants
from Table 72 only ever so slightly, nature would look completely different from what it
Ref. 996 does. The consequences have been studied in great detail; Table 73 gives an overview of
the results.
TA B L E 73 A selection of the consequences of changing the properties of nature
O b se r va b l e C h a n g e R e su lt
O b se r va b l e C h a n g e R e su lt
symmetry
Lorentz symmetry none: no communication possible
U(1) different: no Huygens principle, no way to see anything
SU(2) different: no radioactivity, no Sun, no life
SU(3) different: no stable quarks and nuclei
Global structures, from general relativity
topology other: unknown; possibly correlated gamma ray bursts or star
images at the antipodes
Some even speculate that the table can be condensed into a single sentence: if any para-
Ref. 997 meter in nature is changed, the universe would either have too many or too few black
986 x quantum physics in a nu tshell
Challenge 1440 r holes. However, the proof of this condensed summary is not complete yet.
In fact, Table 73, on the effects of changing nature, is overwhelming. Obviously, even
the tiniest changes in the properties of nature are incompatible with our existence. What
does this mean? Answering this question too rapidly is dangerous. Many fall into a com-
mon trap, namely to refuse admitting that the unexplained numbers and other properties
need to be explained, i.e. deduced from more general principles. It is easier to throw in
some irrational belief. The three most fashionable beliefs are that the universe is created
or designed, that the universe is designed for people, or that the values are random, as our
universe happens to be one of many others.
All these beliefs have in common that they have no factual basis, that they discourage
further search and that they sell many books. Physicists call the issue of the first belief fine
tuning, and usually, but not always, steer clear from the logical errors contained in the so
Page 699 common belief in ‘creation’ discussed earlier on. However, many physicists subscribe to
the second belief, namely that the universe is designed for people, calling it the anthropic
soon find out. To provide a feeling of what awaits us, we rephrase the remaining issues in
six simple challenges.
What determines colours? In other words, what relations of nature fix the famous fine
structure constant? Like the hero of Douglas Adams’ books, physicists know the answer
to the greatest of questions: it is 137.036. But they do not know the question.
What fixes the contents of a teapot? It is given by its size to the third power. But why
are there only three dimensions? Why is the tea content limited in this way?
Was Democritus right? Our adventure has confirmed his statement up to this point;
nature is indeed well described by the concepts of particles and of vacuum. At large scales,
relativity has added a horizon, and at small scales, quantum theory added vacuum energy
and pair creation. Nevertheless, both theories assume the existence of particles and the
existence of space-time, and neither predicts them. Even worse, both theories completely
fail to predict the existence of any of the properties either of space-time – such as its
dimensionality – or of particles – such as their masses and other quantum numbers. A
from which the 1093 actual initial conditions have to be explained. There is a small prob-
lem that we know nothing whatsoever about f . Its value could be 0, if all data were in-
Page 835 terdependent, or 1, if none were. Worse, above we noted that initial conditions cannot
be defined for the universe at all; thus f should be undefined and not be a number at all!
Whatever the case, we need to understand how all the visible particles get their 1093 states Copyright © Christoph Schiller November 1997–January 2007
“
that the atoms move always in the void and the
unlimited, must say what movement is, and in
what their natural motion consists.
Aristotle, Treaty of the Heaven
”
Ref. 998
Biblio graphy
992 An informative account of the world of psychokinesis and the paranormal is given by the
famous professional magician James R andi, Flim-flam!, Prometheus Books, Buffalo 1987,
as well as in several of his other books. See also the http://www.randi.org website. Cited on
page 977.
993 This way to look at things goes back to the text by Susan Hewitt & Edward Subitzky,
A call for more scientific truth in product warning labels, Journal of Irreproducible Results
36, nr. 1, 1991. Cited on page 979.
994 J. Malik, The yields of the Hiroshima and Nagasaki nuclear explosions, Technical Report
LA-8819, Los Alamos National Laboratory, September 1985. Cited on page 980.
995 James Clerk Max well, Scientific Papers, 2, p. 244, October 1871. Cited on page 984.
996 A good introduction is C.J. Hogan, Why the universe is just so, Reviews of Modern Physics
72, pp. 1149–1161, 2000. Most of the material of Table 73 is from the mighty book by John D.
W ”
obbly entities, in particular jellyfish or amoebas, open up a fresh vision of the
orld of motion, if we allow to be led by the curiosity to study them in detail.
e have missed many delightful insights by leaving them aside. In particular,
wobbly entities yield surprising connections between shape change and motion which
will be of great use in the last part of our mountain ascent. Instead of continuing to look
at the smaller and smaller, we now take a look back, towards everyday motion and its
mathematical description.
To enjoy this intermezzo, we change a dear habit. So far, we always described any gen-
eral example of motion as composed of the motion of point particles. This worked well
in classical physics, in general relativity and in quantum theory; we based the approach
on the silent assumption that during motion, each point of a complex system can be fol-
lowed separately. We will soon discover that this assumption is not realized at smallest
scales. Therefore the most useful description of motion of extended bodies uses methods
that do not require that body parts be followed one by one. We explore this issue in this
intermezzo; doing so is a lot of fun in its own right.
If we imagine particles as extended entities – as we soon will have to – a particle mov-
ing through space is similar to a dolphin swimming through water or to a bee flying
through air. Let us explore how these animals do this.
Copyright © Christoph Schiller November 1997–January 2007
* ‘The primary and most beautiful of nature’s qualities is motion, which agitates her at all times; but this
motion is simply a perpetual consequence of crimes; she conserves it by means of crimes only.’ Donatien Al-
phonse François de Sade (1740–1814) is the intense French writer from whom the term ‘sadism’ was de-
duced.
992 intermezzo: bacteria, flies and knots
mд = f A v 2 ρ (647)
where A is the surface of the wing, v is the speed of the wing in the fluid of density ρ. The
factor f is a pure number, usually with a value between 0.2 and 0.4, that depends on the Copyright © Christoph Schiller November 1997–January 2007
Ref. 999 angle of the wing and its shape; here we use the average value 0.3. For a Boeing 747, the
surface is 511 m2 , the top speed is 250 m~s; at an altitude of 12 km the density of air is only
a quarter of that on the ground, thus only 0.31 kg~m3 . We deduce (correctly) that a Boeing
Challenge 1443 e 747 has a mass of about 300 ton. For bumblebees with a speed of 3 m~s and a wing surface
of 1 cm2 , we get a lifted mass of about 35 mg, much less than the weight of the bee, namely
about 1 g. The mismatch is even larger for fruit flies. In other words, an insect cannot fly
if it keeps its wings fixed. Therefore, insects and small birds must move their wings, in
contrast to aeroplanes, not only to take off or to gain height, but also to simply remain
airborne in horizontal flight. In contrast, aeroplanes generate enough lift with fixed wings.
Indeed, if you look at flying animals, such as the ones shown in Figure 388, you note that
the larger they are, the less they need to move their wings (at cruising speed).
Challenge 1444 n Can you deduce from equation (647) that birds or insects can fly but people cannot?
intermezzo: bacteria, flies and knots 993
F I G U R E 388 Examples of the three larger wing types in nature, all optimized for rapid flows: turkey
vulture (Cathartes aura), ruby-throated hummingbird (Archilochus colubris) and a dragonfly (© S.L.
Brown, Pennsylvania Game Comission/Joe Kosack and nobodythere)
The formula also (partly) explains why human powered aeroplanes must be so large.*
But how do insects, small birds, flying fish or bats have to move their wings? This is a
tricky question. In fact, the answer is just being uncovered by modern research. The main
wings, except during take-off and landing. Second, common size birds use flapping wings.
(Hummingbirds can have over 50 wing beats per second.) These first two types of wings
have a thickness of about 10 to 15 % of the wing depth. At smaller dimensions, a third
wing type appears, as seen in dragonflies and other insects. At these scales, at Reynolds
numbers of around 1000 and below, thin membrane wings are the most efficient. The
* The rest of the explanation requires some aerodynamics, which we will not study here. Aerodynamics
shows that the power consumption, and thus the resistance of a wing with given mass and given cruise
speed, is inversely proportional to the square of the wingspan. Large wingspans with long slender wings are
thus of advantage in (subsonic) flying, especially when energy is scarce.
** The website http://www.aniprop.de presents a typical research approach and the sites http://ovirc.free.fr
and http://www.ornithopter.org give introductions into the way to build such systems for hobbyists.
994 intermezzo: bacteria, flies and knots
Reynolds number measures the ratio between inertial and viscous effects in a fluid. It is
defined as
R=
lvρ
(648)
η
where l is a typical length of the system, v the speed, ρ the density and η the dynamic
viscosity of the fluid.* A Reynolds number much larger than one is typical for rapid air
flow and fast moving water. In fact, the Reynolds numbers specifies what is meant by a
‘rapid’ or ‘fluid’ flow on one hand, and a ‘slow’ or ‘viscous’ flow on the other. The first three
wing types, shown in Figure 388, are all for rapid flows.
The fourth type of wings is found at the smallest possible dimensions, for insects smal-
ler than one millimetre; their wings are not membranes at all. Typical are the cases of
thrips and of parasitic wasps, which can be as small as 0.3 mm. All these small insects
have wings which consist of a central stalk surrounded by hair. In fact, Figure 389 shows
* The viscosity is the resistance to flow a fluid poses. It is defined by the force F necessary to move a layer of
surface A with respect to a second, parallel one at distance d; in short, the (coefficient of) dynamic viscosity
is defined as η = d F~A v. The unit is 1 kg~s m or 1 Pa s or 1 N s~m2 , once also called 10 P or 10 poise. In other
words, given a horizontal tube, the viscosity determines how strong the pump needs to be to pump the fluid
through the tube at a given speed. The viscosity of air 20°C is 1.8 10−5 kg~s m or 18 µPa s and increases
Challenge 1445 ny with temperature. In contrast, the viscosity of liquids decreases with temperature. (Why?) The viscosity of
water at 0°C is 1.8 mPa s, at 20°C it is 1.0 mPa s (or 1 cP), and at 40°C is 0.66 mPa s. Hydrogen has a viscosity
smaller than 10 µPa s, whereas honey has 25 Pa s and pitch 30 MPa s.
Physicists also use a quantity ν called the kinematic viscosity. It is defined with the help of the mass density
of the fluid as ν = η~ρ and is measured in m2 ~s, once called 104 stokes. The kinematic viscosity of water at
20°C is 1 mm2 ~s (or 1 cSt). One of the smallest values is that of acetone, with 0.3 mm2 ~s; a larger one is
glycerine, with 2000 mm2 ~s. Gases range between 3 mm2 ~s and 100 mm2 ~s.
intermezzo: bacteria, flies and knots 995
Swimming
Swimming is a fascinating phenomenon. The Greeks argued that the ability of fish to
swim is a proof that water is made of atoms. If atoms would not exist, a fish could not
advance through it. Indeed, swimming is an activity that shows that matter cannot be
continuous. Studying swimming can thus be quite enlightening. But how exactly do fish
swim?
Whenever dolphins, jellyfish, submarines or humans swim, they take water with their
fins, body, propellers, hands or feet and push it backwards. Due to momentum conserva-
tion they then move forward.* In short, people swim in the same way that fireworks or
rockets fly: by throwing matter behind them. Does all swimming work in this way? In
particular, do small organisms advancing through the molecules of a liquid use the same
method? No.
It turns out that small organisms such as bacteria do not have the capacity to propel or
accelerate water against their surroundings. From far away, the swimming of microorgan-
between inertial and dissipative effects at different scales. This ratio is measured by the
Reynolds number. For the scallop the Reynolds number is about 100, which shows that
when it swims, inertial effects are much more important than dissipative, viscous effects.
For a bacterium the Reynolds number is much smaller than 1, so that inertial effects effect-
ively play no role. There is no way to accelerate water away from a bacterial-sized scallop,
and thus no way to glide. In fact one can even show the stronger result that no cell-sized
being can move if the shape change is the same in the two halves of the motion (opening
Ref. 1007 and closing). Such a shape change would simply make it move back and forward. Thus
Page 74 * Fish could use propellers, as the arguments against wheels we collected at the beginning of our walk do
not apply for swimming. But propellers with blood supply would be a weak point in the construction, and
thus in the defence of a fish.
996 intermezzo: bacteria, flies and knots
figure to be added
there is no way to move at cell dimensions with a method the scallop uses on centimetre
scale; in fact the so-called scallop theorem states that no microscopic system can swim if
it uses movable parts with only one degree of freedom.
Microorganisms thus need to use a more evolved, two-dimensional motion of their
shape to be able to swim. Indeed, biologists found that all microorganisms use one of the
following three swimming styles:
Ref. 1011 move their legs when they perform the breast stroke. Most cells can also change the
sense in which the flagellum is kicked, thus allowing them to move either forward
Ref. 1015 or backward. Through their twisted oar motion, bacterial flagella avoid retracing the
same path when going back and forward. As a result, the bacteria avoid the scallop
theorem and manage to swim despite their small dimensions. The flexible oar motion
they use is an example of a non-adiabatic mechanism; an important fraction of the
energy is dissipated.
— The smallest swimming organisms, bacteria with sizes between 0.2 µm and 5 µm, swim
Ref. 1012 using bacterial flagella. These flagella, also called prokaryote flagella, are different from
age or do not try to swim at all. Some microorganisms are specialized to move along
liquid–air interfaces. Others attach themselves to solid bodies they find in the liquid.
Some of them are able to move along these solids. The amoeba is an example for a mi-
croorganism moving in this way. Also the smallest active motion mechanisms known,
Page 840 namely the motion of molecules in muscles and in cell membranes, work this way.
Let us summarize these observations in a different way. All known active motion, or
self-propulsion, takes place in fluids – be it air or liquids. All active motion requires shape
change. In order that shape change leads to motion, the environment, e.g. the water, must
itself consist of moving components always pushing onto the swimming entity. The mo-
tion of the swimming entity can then be deduced from the particular shape change it
performs. To test your intuition, you may try the following puzzle: is microscopic swim-
Challenge 1446 ny ming possible in two spatial dimensions? In four?
998 intermezzo: bacteria, flies and knots
– CS – to be inserted – CS –
Page 92 In fact, cats confirm in three dimensions what we already knew for two dimensions: a
deformable body can change its own orientation in space without outside help.
Continuing the theme of motion of wobbly entities, a famous example cannot be avoided.
In 1957, the mathematician Stephen Smale proved that a sphere can be turned inside out.
The discovery brought him the Fields medal in 1966, the highest prize for discoveries in
mathematics. Mathematicians call his discovery the eversion of the sphere.
To understand the result, we need to describe more clearly the rules of mathematical
eversion. First of all, it is assumed that the sphere is made of a thin membrane which
has the ability to stretch and bend without limits. Secondly, the membrane is assumed
to be able to intersect itself. Of course, such a ghostly material does not exist in everyday
life; but in mathematics, it can be imagined. A third rule requires that the moves must
be performed in such a way that the membrane is not punctured, ripped nor creased; in
short, everything must happen smoothly (or differentiably, as mathematicians like to say).
Even though Smale proved that eversion is possible, the first way to actually perform it
was discovered by the blind topologist Bernard Morin in 1961, based on ideas of Arnold
Copyright © Christoph Schiller November 1997–January 2007
and at the same time were of spherical shape, eversion might be a symmetry of particles.
To make you think, we mention the effects of eversion on the whole surrounding space,
not only on the sphere itself. The final effect of eversion is the transformation
(x, y, −z) R 2
(x, y, z) (649)
r2
where R » is the radius of the sphere and r is the length of the coordinate vector (x, y, z),
thus r = x 2 + y 2 + z 2 . Due to the minus sign in the z-coordinate, eversion is thus dif-
ferent from inversion, but not by too much. As we will find out shortly, a transformation
Page 1073 similar to eversion, space-time duality, is a fundamental symmetry of nature.
1000 intermezzo: bacteria, flies and knots
“ knots!
(Surprisingly efficient child education
technique.)
Knots and their generalization are central to the study of wobbly entity motion. A (math-
ematical) knot is a closed piece of rubber string, i.e. a string whose ends have been glued
together, which cannot be deformed into a circle or a simple loop. The simple loop is
” Copyright © Christoph Schiller November 1997–January 2007
also called the trivial knot. If knots are ordered by their crossing numbers, as shown in
Figure 394, the trivial knot (01 ) is followed by the trefoil knot (31 ) and by the figure-eight
knot (41 ). The figure only shows prime knots, i.e., knots that cannot be decomposed into
two knots that are connected by two parallel strands. In addition, the figure only shows
one of two possible mirror images.
Knots are of importance in the context of this intermezzo as they visualize the limit-
ations of the motion of wobbly entities. In addition, we will find other reasons to study
knots later on. In this section, we just have a bit of fun.*
* Pretty pictures and other information about knots can be found on the KnotPlot site, i.e. at the http://www.
intermezzo: bacteria, flies and knots 1001
right-hand left-hand
crossing +1 crossing -1
Redemeister Redemeister Redemeister
move I move II move III
the flype
a nugatory crossing
How do we describe such a knot through the telephone? Mathematicians have spent
a lot of time to figure out smart ways to achieve it. The simplest way is to flatten the knot
onto a plane and to list the position and the type (below or above) of the crossings. Copyright © Christoph Schiller November 1997–January 2007
Mathematicians are studying the simplest way to describe knots by the telephone. The
task is not completely finished, but the end is in sight. Of course, the flat diagrams can be
characterized by the minimal number of crossings. The knots in Figure 394 are ordered
in this way. There is 1 knot with zero, 1 with three and 1 with four crossings (not counting
Ref. 1021 mirror knots); there are 2 knots with five and 3 with six crossings, 7 knots with seven,
21 knots with eight, 41 with nine, 165 with ten, 552 with eleven, 2176 with twelve, 9988
with thirteen, 46 972 with fourteen, 253 293 with fifteen and 1 388 705 knots with sixteen
crossings.
Mathematicians do not talk about ‘telephone messages’, they talk about invariants,
i.e. about quantities that do not depend on the precise shape of the knot. At present,
cs.ubc.ca/nest/imager/contributions/scharein/KnotPlot.html site.
1002 intermezzo: bacteria, flies and knots
Since knots are stable in time, a knotted line in three dimensions is equivalent to a
knotted surface in space-time. When thinking in higher dimensions, we need to be care-
ful. Every knot (or knotted line) can be untied in four or more dimensions; however, there
is no surface embedded in four dimensions which has as t = 0 slice a knot, and as t = 1
slice the circle. Such a surface embedding needs at least five dimensions.
In higher dimensions, knots are possible only n-spheres are tied instead of circles; for
example, as just said, 2-spheres can be tied into knots in 4 dimensions, 3-spheres in 5
dimensions and so forth.
Mathematicians also study more elaborate structures. Links are the generalization of
knots to several closed strands. Braids are the generalization of links to open strands.
Braids are especially interesting, as they form a group; can you state what the group op- Copyright © Christoph Schiller November 1997–January 2007
Figure to be included
**
This is the simplest unsolved knot problem: Imagine an ideally wobbly rope, that is, a
rope that has the same radius everywhere, but whose curvature can be changed as one
Challenge 1449 r prefers. Tie a trefoil knot into the rope. By how much do the ends of the rope get nearer?
In 2006, there are only numerical estimates for the answer: about 10.1 radiuses. There is
no formula yielding the number 10.1. Alternatively, solve the following problem: what
is the rope length of a closed trefoil knot? Also in this case, only numerical values are
known – about 16.33 radiuses – but no exact formula. The same is valid for any other
knot, of course.
**
A famous type of eel, the knot fish Myxine glutinosa, also called hagfish or slime eel, is
Ref. 1022 able to make a knot in his body and move this knot from head to tail. It uses this motion
to cover its body with a slime that prevents predators from grabbing it; it also uses this
motion to escape the grip of predators, to get rid of the slime after the danger is over, and
to push against a prey it is biting in order to extract a piece of meat. All studied knot fish
form only left handed trefoil knots, by the way; this is another example of chirality in
nature. Copyright © Christoph Schiller November 1997–January 2007
**
One of the most incredible discoveries of recent years is related to knots in DNA molecules.
The DNA molecules inside cell nuclei can be hundreds of millions of base pairs long; they
regularly need to be packed and unpacked. When this is done, often the same happens
as when a long piece of rope or a long cable is taken out of a closet.
It is well known that you can roll up a rope and put it into a closet in such a way that
it looks orderly stored, but when it is pulled out at one end, a large number of knots is
suddenly found. Figure 399 shows how to achieve this.
To make a long story short, this also happens to nature when it unpacks DNA in cell
nuclei. Life requires that DNA molecules move inside the cell nucleus without hindrance.
So what does nature do? Nature takes a simpler approach: when there are unwanted cross-
1004 intermezzo: bacteria, flies and knots
ings, it cuts the DNA, moves it over and puts the ends together again. In cell nuclei, there
are special enzymes, the so-called topoisomerases, which perform this process. The de-
tails of this fascinating process are still object of modern research.
**
The great mathematician Carl-Friedrich Gauß was the first person to ask what would hap-
pen when an electrical current I flows along a wire A linked with a wire B. He discovered
Ref. 1026 a beautiful result by calculating the effect of the magnetic field of one wire onto the other.
Gauss found the expression
(xA − xB )
ë BB = =n,
1 1
4πI ∫ dx
A
A
4π ∫ dx ë ∫ dx
A
A
B
B
SxA − xB S3
(650)
where the integrals are performed along the wires. Gauss found that the number n does
**
Knots appear rarely in nature. For example, tree roots do not seem to grow many knots
Challenge 1450 r during the lifetime of a plant. How do plants avoid this? In other words, why are there no
knotted bananas in nature?
**
If we move along the knot and count the crossings where we stay above and subtract the Copyright © Christoph Schiller November 1997–January 2007
number of crossings where we pass below, we get a number called the writhe of the knot.
It is not an invariant, but usually a tool in building them. The writhe is not necessarily
Challenge 1451 ny invariant under one of the three Reidemeister moves. Can you see which one? However,
the writhe is invariant under flypes.
Clouds
Clouds are another important class of extended entities. The lack of a definite boundary
makes them even more fascinating than amoebas, bacteria or falling cats. We can observe
the varieties of clouds from an aeroplane. We also have encountered clouds as the basic
structure determining the size of atoms. Comparing these two and other types of clouds
teaches us several interesting things about nature.
intermezzo: bacteria, flies and knots 1005
Galaxies are clouds of stars; stars are clouds of plasma; the atmosphere is a gas cloud.
Page 610 Obviously, the common cumulus or cumulonimbus in the sky are vapour and water
droplet clouds. Clouds of all types can be described by a shape and a size, even though in
theory they have no bound. An effective shape can be defined by that region in which the
cloud density is only, say, 1 % of the maximum density; slightly different procedures can
also be used. All clouds are described by probability densities of the components making
up the cloud. All clouds show conservation of their number of constituents.
Whenever we see a cloud, we can ask why it does not collapse. Every cloud is an aggreg-
Page 185 ate. All aggregates are kept from collapse in only three ways: through rotation, through
pressure or through the Pauli principle, i.e. the quantum of action. Galaxies are kept from
collapsing by rotation. Most stars and the atmosphere are kept from collapsing by gas pres-
sure. Neutron stars, the Earth, atomic nuclei, protons or the electron clouds of atoms are
kept apart by the quantum of action.
A rain cloud can contain several thousand tons of water; can you explain what keeps
Fluid dynamics is a topic with many interesting aspects. A beautiful result from the 1960s
Ref. 1005 is that a linear, deformable vortex in a rotating liquid is (almost) described by the one-
dimensional Schrödinger equation. A simple physical system of this type is the vortex that
can be observed in any emptying bath tub: it is extended in one dimension, and it wriggles
around. As we will see shortly, this wriggling motion is described by a Schrödinger-like
equation.
Any deformable linear vortex, as illustrated in Figure 400, is described by a continuous
set of position vectors r(t, s) that depend on time t and on a single parameter s. The
parameter s specifies the relative position along the vortex. At each point on the vortex,
there is a unit tangent vector e(t, s), a unit normal curvature vector n(t, s) and a unit
1006 intermezzo: bacteria, flies and knots
w v
e=
∂r
,
∂s
κn =
∂e
,
∂s
τw = −
∂(e n)
, (651)
∂s
where κ specifies the value of the curvature and τ specifies the value of the torsion. In
general, both numbers depend on time and on the position along the line.
We assume that the rotating environment induces a local velocity v for the vortex that
is proportional to the curvature κ, perpendicular to the tangent vector e and perpendic- Copyright © Christoph Schiller November 1997–January 2007
v = ηκ(e n) , (652)
Ref. 1005 where η is the so-called coefficient of local self-induction that describes the coupling
between the liquid and the vortex motion.
Ref. 1006 Any vortex described by the evolution equation (652) obeys the one-dimensional
Schrödinger equation. To repeat his argument, we assume that the filament is deformed
only slightly from the straight configuration. (Technically, we are thus in the linear re-
gime.) For such a filament, directed along the x-axis, we can write
e = (1, , ) (1, 0, 0) ,
∂y ∂z
∂x ∂x
∂2 y ∂2 z
κn (0, 2 , 2 ) , and
∂x ∂x
v = (0, , ) .
∂y ∂z
(654)
∂t ∂t
We can thus rewrite equation (652) as
∂2 z ∂2 y
(0, , ) = η (0, − 2 , 2 ) .
∂y ∂z
(655)
∂t ∂t ∂x ∂x
Copyright © Christoph Schiller November 1997–January 2007
This equation is well known; if we drop the first coordinate and introduce complex num-
bers by setting Φ = y + iz, we can rewrite it as
∂2 Φ
= iη 2 .
∂Φ
(656)
∂t ∂x
This is the one-dimensional Schrödinger equation for the evolution of a free wave func-
tion! The complex function Φ specifies the transverse deformation of the vortex. In other
words, we can say that the Schrödinger equation in one dimension describes the evolu-
tion of the deformation for an almost linear vortex surrounded by a rotating liquid. We
note that there is no constant ħ in the equation, as we are exploring a classical system.
1008 intermezzo: bacteria, flies and knots
The amplitude a and the wavelength or pitch b = 1~τ can be freely chosen, as long as
the approximation of small deviation is fulfilled; this condition translates as a b.* In
the present interpretation, the fundamental solution corresponds to a vortex line that is
deformed into a helix, as shown in Figure 401. The angular speed ω is the rotation speed
around the axis of the helix.
Challenge 1453 ny A helix moves along the axis with a speed given by
In other words, for extended entities following evolution equation (652), rotation and
p=τ=
1
. (659)
b
Momentum is thus inversely proportional to the helix wavelength or pitch, as expected.
The energy E is defined using ∂Φ~∂t, leading to
E = ητ 2 =
η
. (660)
b2
Energy and momentum are connected by
p2
E= µ=
1
where . (661)
2µ 2η
In other words, a vortex with a coefficient η – describing the coupling between environ-
Page 1006 ment and vortex – is thus described by a number µ that behaves like an effective mass. We
can also define the (real) quantity SΦS = a; it describes the amplitude of the deformation.
In the Schrödinger equation (656), the second derivative implies that the deformation
‘wave packet’ has tendency to spread out over space. Can you confirm that the wavelength–
Copyright © Christoph Schiller November 1997–January 2007
frequency relation for a vortex wave group leads to something like the indeterminacy
Challenge 1455 ny relation (however, without a ħ appearing explicitly)?
In summary, the complex amplitude Φ for a linear vortex in a rotating liquid behaves
like the one-dimensional wave function of a non-relativistic free particle. In addition,
we found a suggestion for the reason why complex numbers appear in the Schrödinger
equation of quantum theory : they could be due to the intrinsic rotation of an underlying
substrate. We will see later on whether this is correct.
* The curvature is given by κ = a~b 2 , the torsion by τ = 1~b. Instead of a b one can thus also write κ τ.
Challenge 1454 ny ** A wave packet moves along the axis with a speed given by v packet = 2ητ 0 , where τ 0 is the torsion of the
helix of central wavelength.
intermezzo: bacteria, flies and knots 1009
effective size
Burgers vector
Fluid space-time
So far, we have looked at the motion of wobbly entities in continuous space-time. But
that is an unnecessary restriction. Looking at space-time itself in this way is also interest-
Ref. 1027 ing. The most intriguing approach was published in 1995 by Ted Jacobson. He explored
what happens if space-time, instead of assumed to be continuous, is assumed to be the
statistical average of numerous components moving in a disordered fashion.
The standard description of general relativity describes space-time as an entity similar
Page 391 to a flexible mattress. Jacobson studied what happens if the mattress is assumed to be
made of a liquid. A liquid is a collection of (undefined) components moving randomly
and described by a temperature varying from place to place. He thus explored what hap-
pens if space-time is made of fluctuating entities.
Page 880 Jacobson started from the Fulling–Davies–Unruh effect and assumed that the local
temperature is given by the same multiple of the local gravitational acceleration. He also
used the proportionality – correct on horizons – between area and entropy. Since the
energy flowing through a horizon can be called heat, one can thus translate the expression
δQ = TδS into the expression δE = aδA(c 2 ~4G), which describes the behaviour of space-
Copyright © Christoph Schiller November 1997–January 2007
Page 1092 time at horizons. As we have seen, this expression is fully equivalent to general relativity.
In other words, imagining space-time as a liquid is a powerful analogy that allows to
deduce general relativity. Does this mean that space-time actually is similar to a liquid?
So far, the analogy is not sufficient to answer the question. In fact, just to confuse the
reader a bit more, there is an old argument for the opposite statement.
Solid space-time
The main reason to try to model empty space as a solid is a famous property of the mo-
tion of dislocations. To understand it, a few concepts need to be introduced. Dislocations
are one-dimensional construction faults in crystals, as shown in Figure 402. A general
dislocation is a mixture of the two pure dislocation types: edge dislocations and screw
1010 intermezzo: bacteria, flies and knots
dislocations. Both are shown in Figure 402. If one studies how the involved atoms can
rearrange themselves, one finds that edge dislocations can only move perpendicularly to
the added plane. In contrast, screw dislocations can move in all directions.* An import-
ant case of general, mixed dislocations, i.e. of mixtures of edge and screw dislocations,
are closed dislocation rings. On such a dislocation ring, the degree of mixture changes
continuously from place to place.
A dislocation is described by its strength and by its effective size; they are shown, re-
spectively, in red and blue in Figure 402. The strength of a dislocation is measured by
the so-called Burgers vector; it measures the misfits of the crystal around the disloca-
tion. More precisely, the Burgers vector specifies by how much a section of perfect crystal
needs to be displaced, after it has been cut open, to produce the dislocation. Obviously,
the strength of a dislocation is quantized in multiples of a minimal Burgers vector. In
fact, dislocations with large Burgers vectors can be seen as composed of dislocations of
minimal Burgers vector.
E=»
E0
. (663)
1 − v 2 ~c 2
A screw dislocation thus cannot move faster than the speed of sound in a crystal and its
width shows a speed-dependent contraction. (Edge dislocations have similar, but more
Copyright © Christoph Schiller November 1997–January 2007
complex behaviour.) The motion of screw dislocations in solids is described by the same
effects and formulae that describe the motion of bodies in special relativity; the speed of
sound is the limit speed for dislocations in the same way that the speed of light is the limit
speed for objects.
Does this mean that elementary particles are dislocations of space or even of space-
time, maybe even dislocation rings? The speculation is appealing, even though it supposes
that space-time is a solid, and thus contradicts the model of space or space-time as a fluid.
Page 1032 Worse, we will soon encounter good reasons to reject modelling space-time as a lattice;
Challenge 1456 n maybe you can find a few ones already by yourself. Still, expressions (662) and (663) for
y
φ
ψ θ
φ
Equator
θ
x
their separation is decreased by −∆φ, and they are rotated back by −∆θ. Due to to the
Challenge 1457 ny conservation of angular momentum, the two-disc system changes its longitude ∆ψ as
∆ψ = γ 2 ∆θ∆φ ,
1
(664)
2
where γ is the angular radius of the discs. This cycle can be repeated over and over. The
cycle it allows a body on the surface of the Earth, to swim along the surface. However, for
a body of metre size, the motion for each swimming cycle is only around 10−27 m.
Wisdom showed that the mechanism also works in curved space-time. The mechan-
ism thus allows a falling body to swim away from the path of free fall. Unfortunately, the
achievable distances for everyday objects are negligible. Nevertheless, the effect exists.
1012 intermezzo: bacteria, flies and knots
At this point, we are thoroughly confused. Space-time seems to be solid and liquid
at the same time. Despite this contrast, the situation gives the impression that extended,
Challenge 1458 n
Ref. 1030
What is the shape of raindrops? Try to picture it. However, use your reason, not your
prejudice! By the way, it turns out that there is a maximum size for raindrops, with a
value of about 4 mm. The shape of such a large raindrop is shown in Figure 404. Can you
”
imagine where the limit comes from?
For comparison, the drops in clouds, fog or mist are in the range of 1 to 100 µm, with
a peak at 10 to 15 µm. In those cases when all droplets are of similar size one and when
light is scattered only once by the droplets, one can observe coronae, glories or fogbows.
**
Challenge 1459 n What is the entity shown in Figure 405 – a knot, a braid or a link?
**
Copyright © Christoph Schiller November 1997–January 2007
**
Challenge 1461 n Are you able to find a way to classify the way shoe laces can be threaded?
**
A striking example of how wobbly entities can behave is given in Figure 406. There
is indeed a family of snakes that like to jump off a tree and sail through the air to a
neighbouring one. Both the jump and the sailing technique have been studied in recent
years; the website http://www.flyingsnake.org by Jake Socha provides more films and his
Ref. 1034 publications give more details about these intriguing reptiles.
intermezzo: bacteria, flies and knots 1013
Outlook
We have studied one example of motion of extended bodies already earlier on: solitons.
We can thus sum up the possible motions of extended entities in four key themes. We first
studied solitons and interpenetration, then knots and their rearrangement, continued
with duality and eversion and finally explored clouds and extension. The sum of it all
seems to be half liquid and half solid.
The motion of wobbly bodies probably is the most neglected topic in all textbooks on
motion. Research is progressing at full speed; it is expected that many beautiful analo-
gies will be discovered in the near future. For example, in this intermezzo we have not
described any good analogy for the motion of light; similarly, including quantum theory
into the description of wobbly bodies’ motion remains a fascinating issue for anybody
Challenge 1462 r aiming to publish in a new field.
The ideas introduced in this intermezzo were sufficient to prepare us for the third part
of our ascent of Motion Mountain. We can now tackle the final part of our adventure.
Copyright © Christoph Schiller November 1997–January 2007
1014 intermezzo: bacteria, flies and knots
Biblio graphy
999 Henk Tennekes, De wetten van de vliegkunst – over stijgen, dalen, vliegen en zweven,
Aramith Uitgevers, 1992. This clear and interesting text is also available in English. Cited
on page 992.
1000 In 1997, the smallest human-made flying object was the helicopter built by a group of the
Institut für Mikrotechnik in Mainz, in Germany. A picture is available at their web page,
to be found at http://www.imm-mainz.de/English/billboard/f_hubi.html. The helicopter
is 24 mm long, weighs 400 mg and flies (though not freely) using two built-in electric mo-
tors driving two rotors, running at between 40 000 and 100 000 revolutions per minute.
See also the helicopter from Stanford University at http://www-rpl.stanford.edu/RPL/
htmls/mesoscopic/mesicopter/mesicopter.html, with an explanation of its battery prob-
lems. Cited on page 992.
1001 The most recent computational models of lift still describe only two-dimensional wing mo-
1007 E. Purcell, Life at low Reynolds number, American Journal of Physics 45, p. 3, 1977.
Cited on page 995.
1008 Most bacteria are flattened, ellipsoidal sacks kept in shape by the membrane enclosing
the cytoplasma. But there are exceptions; in salt water, quadratic and triangular bacteria
have been found. More is told in the corresponding section in the interesting book by
Bernard Dixon, Power Unseen – How Microbes Rule the World, W.H. Freeman, New
York, 1994. Cited on page 996.
1009 S. Pitnick, G. Spicer & T.A. Markow, How long is a giant sperm?, Nature 375, p. 109,
1995. Cited on page 996.
1010 For an overview of the construction and the motion of coli bacteria, see H.C. Berg,
Motile behavior of bacteria, Physics Today 53, pp. 24–29, January 2000. Cited on page
997.
bibliography 1015
1022 Alexei Sossinsky, Nœuds – histoire d’une théorie mathématique, Editions du Seuil,
1999. D. Jensen, Le poisson noué, Pour la science, dossier hors série, pp. 14–15, April
1997. Cited on page 1003.
1023 I. Stewart, Game, Set and Math, Penguin Books, 1989, pp. 58–67. Cited on page 1002.
1024 For some modern knot research, see P. Holdin, R.B. Kusner & A. Stasiak, Quantiz-
ation of energy and writhe in self-repelling knots, New Journal of Physics 4, pp. 20.1–20.11,
2002. No citations.
1025 W.R. Taylor, A deeply knotted protein structure and how it might fold, Nature 406,
pp. 916–919, 2000. Cited on page 1002.
1026 A.C. Hirshfeld, Knots and physics: Old wine in new bottles, American Journal of Phys-
ics 66, pp. 1060–1066, 1998. Cited on page 1004.
1016 x intermezzo: bacteria, flies and knots
1027 T. Jacobson, Thermodynamics of spacetime: the Einstein equation of state, Physical Re-
view Letters 75, pp. 1260–1263, 1995, or http://arxiv.org/abs/gr-qc/9504004. Cited on page
1009.
1028 J. Frenkel & T. Kontorowa, Über die Theorie der plastischen Verformung, Physikalis-
che Zeitschrift der Sowietunion 13, pp. 1–10, 1938. F.C. Frank, On the equations of mo-
tion of crystal dislocations, Proceedings of the Physical Society A 62, pp. 131–134, 1949,
J. Eshelby, Uniformly moving dislocations, Proceedings of the Physical Society A 62,
pp. 307–314, 1949. See also G. Leibfried & H. Dietze, Zeitschrift für Physik 126, p. 790,
1949. A general introduction can be found in A. Seeger & P. Schiller, Kinks in dis-
locations lines and their effects in internal friction in crystals, Physical Acoustics 3A, W.P.
Mason, ed., Academic Press, 1966. See also the textbooks by Frank R.N. Nabarro,
Theory of Crystal Dislocations, Oxford University Press, 1967, or J.P. Hirth & J. Lothe,
Theory of Dislocations, McGraw Hills Book Company, 1968. Cited on page 1010.
1029 J. Wisd om, Swimming in spacetime: motion by cyclic changes in body shape, Science 299,
pp. 1865–1869, 21st of March, 2003. Cited on page 1011.
“ Denknotwendigkeiten ersetzen.*
Albert Einstein
”
T
he two stories told in the two parts of the path we have followed up to now, namely
The contradictions
In classical physics and in general relativity, the vacuum, or empty space-time, is a region
with no mass, no energy and no momentum. If matter or gravitational fields are present,
Copyright © Christoph Schiller November 1997–January 2007
space-time is curved. The best way to measure the mass or energy content of space-time
Ref. 1035 is to measure the average curvature of the universe. Cosmology tells us how we can do
Page 468 this. Measurements reveal an average energy density of the ‘vacuum’ of
Ref. 1036 However, quantum field theory tells a different story. A vacuum is a region with zero-
point fluctuations. The energy content of a vacuum is the sum of the zero-point energies
of all the fields it contains. Indeed, the Casimir effect ‘proves’ the reality of these zero-
Page 879 point energies. Their energy density is given, within one order of magnitude, by
* ‘One needs to replace habits of thought by necessities of thought.’
the contradictions 1019
= 3 ν 3 dν =
E 4πh ν max πh 4
V c ∫0
ν
c 3 max
. (666)
The approximation is valid for the case in which the cut-off frequency ν max is much lar-
ger than the rest mass m of the particles corresponding to the field under consideration.
Particle physicists argue that the cut-off energy has to be at least the energy of grand
unification, about 1016 GeV= 1.6 MJ. That would give a vacuum energy density of
1099 J~m3 ,
E
(667)
V
which is about 10108 times higher than the experimental limit deduced from spatial
curvature using general relativity estimates. In other words, something is slightly wrong
here.
General relativity and quantum theory contradict each other in other ways. Gravity
especially if gravitation is taken into account. What does ‘waiting 10 minutes’ mean, if
the clock goes into a quantum-mechanical superposition as a result of its coupling to
space-time geometry?
In addition, quantum theory associates mass with an inverse length via the Compton
wavelength; general relativity associates mass with length via the Schwarzschild radius.
Similarly, general relativity implies that space and time cannot be distinguished,
whereas quantum theory implies that matter does make a distinction between them.
Quantum theory is a theory of – admittedly weird – local observables. In general relativity,
Page 502 there are no local observables, as Einstein’s hole argument shows.
The contradiction between the two theories is shown most dramatically by the failure
of general relativity to describe the pair creation of particles with spin 1/2, a typical and
1020 xi general rel ativity versus quantum theory
Ref. 1038, Ref. 1039 essential quantum process. John Wheeler and others have shown that, in such a case, the
Ref. 1040, Ref. 1041 topology of space necessarily has to change; in general relativity, however, the topology
of space is fixed. In short, quantum theory says that matter is made of fermions, which
cannot be incorporated into general relativity.
No description of nature that contains contradictions can lead to a unified description
or to a completely correct description. In order to proceed, let us take the shortest and
fastest path: let us investigate the contradictions in more detail.
* The main results of this section are standard knowledge among specialists of unification; they are given
here using simple arguments. For another derivation, see the summary section on limit statements in nature,
on page 1088.
d oes matter differ from vacuum? 1021
drawing taken as the act of defining, the picture gives a description of standard twentieth-
century physics. The apparent contradiction is solved by recognizing that the two con-
cepts (the two hands) both originate from a third, hidden concept. In the picture, this
third entity is the hand of the artist.
In the case of space-time and matter, the search for the underlying common concept
Ref. 1044, Ref. 1045 is making renewed progress. The conceptual changes required for this are so striking that
Ref. 1046, Ref. 1047 they should interest anybody who has an interest in physics. The most effective way to
deduce the new concepts is to focus on that domain where the contradiction between
the two standard theories is most dramatic, and where both theories are necessary at the
same time.
Planck scales
Both general relativity and quantum mechanics are successful theories for the description Copyright © Christoph Schiller November 1997–January 2007
of nature. Each provides a criterion for determining when classical Galilean physics is no
longer applicable. (In the following, we use the terms ‘vacuum’ and ‘empty space-time’
interchangeably.)
General relativity shows that it is necessary to take into account the curvature of space-
time whenever we approach an object of mass m to within a distance of the order of the
Schwarzschild radius r S , given by
r S = 2Gm~c 2 . (668)
The gravitational constant G and the speed of light c act as conversion constants. Indeed,
as the Schwarzschild radius of an object is approached, the difference between general re-
lativity and the classical 1~r 2 description of gravity becomes larger and larger. For example,
1022 xi general rel ativity versus quantum theory
TA B L E 74 The size, Schwarzschild radius and Compton wavelength of some objects appearing in
nature. The lengths in quotation marks make no physical sense, as explained in the text.
the barely measurable gravitational deflection of light by the Sun is due to the light ap-
Ref. 1038, Ref. 1048 proaching the Sun to within 2.4ë105 times its Schwarzschild radius. Usually, we are forced
to stay away from objects at a distance that is an even larger multiple of the Schwarzschild
radius, as shown in Table 74. For this reason, general relativity is unnecessary in everyday
life. (An object smaller than its own Schwarzschild radius is called a black hole, because
according to general relativity, no signals from inside the Schwarzschild radius can reach
Ref. 1049 the outside world.)
Similarly, quantum mechanics shows that Galilean physics must be abandoned and
quantum effects must be taken into account whenever an object is approached to within
distances of the order of the (reduced) Compton wavelength λ C , given by
λC =
ħ
.
Copyright © Christoph Schiller November 1997–January 2007
(669)
mc
In this case, Planck’s constant ħ and the speed of light c act as conversion factors to trans-
form the mass m into a length scale. Of course, this length is only relevant if the object
is smaller than its own Compton wavelength. At these scales we get relativistic quantum
effects, such as particle–antiparticle creation or annihilation. Table 74 shows that the ap-
proach distance is near to or smaller than the Compton wavelength only in the micro-
scopic world, so that such effects are not observed in everyday life. We do not therefore
need quantum field theory to describe common observations.
The combined concepts of quantum field theory and general relativity are required in
situations where both conditions are satisfied simultaneously. The necessary approach
distance for such situations is calculated by setting r S = 2λ C (the factor 2 is introduced
d oes matter differ from vacuum? 1023
for simplicity). We find that this is the case when lengths or times are (of the order of)
¼
l Pl = = 1.6 ë 10−35 m, the Planck length,
¼
ħG~c 3
(670)
t Pl = ħG~c 5 = 5.4 ë 10−44 s , the Planck time.
Whenever we approach objects at these scales, both general relativity and quantum mech-
anics play a role, and effects of quantum gravity appear. Because the values of the Planck
dimensions are extremely small, this level of sophistication is unnecessary in everyday
life, in astronomy and even in particle physics.
However, to answer the questions posed at the beginning of the book – why do we
live in three dimensions, and why is the proton 1836.15 times heavier than the electron?
– we require a precise and complete description of nature. The contradictions between
quantum mechanics and general relativity appear to make the search for answers im-
“ are indivisible.
Moses Maimonides, twelfth century
The appearance of the quantum of action in the description of motion leads to quantum ”
limits to all measurements. These limits have important consequences at Planck scales.
Measurement limits appear most clearly when we investigate the properties of clocks
and metre rules. Is it possible to construct a clock that is able to measure time intervals
Ref. 1051, Ref. 1052 shorter than the Planck time? Surprisingly, the answer is no, even though the time–energy
indeterminacy relation ∆E∆t E ħ seems to indicate that by making ∆E large enough, we
Copyright © Christoph Schiller November 1997–January 2007
system. We need the combined knowledge of the non-commuting variables for each mov-
ing component of the clock. Let us focus on the component with the largest time inde-
terminacy ∆t. It is evident that the smallest time interval δt that can be measured by a
clock is always larger than the quantum limit, i.e. larger than the time indeterminacy ∆t
for the most ‘uncertain’ component. Thus we have
δt E ∆t E
ħ
, (671)
∆E
where ∆E is the energy indeterminacy of the moving component, and ∆E must be smaller
than the total energy E = mc 2 of the component itself.* Furthermore, a clock provides
information, so signals have to be able to leave it. Therefore the clock must not be a black
hole: its mass m must be smaller than the Schwarzschild mass for its size, i.e. m D c 2 l~G,
where l is the size of the clock (neglecting factors of order unity). Finally, for a sensible
measurement of the time interval δt, the size l of the clock must be smaller than c δt,
δt E
ħG
(672)
c 5 δt
or ¾
δt E = t Pl .
ħG
(673)
c5
In summary, from three simple properties of any clock – namely, that it is only a single
clock, that we can read its dial, and that it gives sensible read-outs – we conclude that
clocks cannot measure time intervals shorter than the Planck time. Note that this argument
is independent of the nature of the clock mechanism. Whether the clock operates by
gravitational, electrical, mechanical or even nuclear means, the limit still applies.***
Ref. 1059 The same conclusion can be reached in other ways. For example, any clock small
enough to measure small time intervals necessarily has a certain energy indeterminacy
due to the indeterminacy relation. Meanwhile, on the basis of general relativity, any en-
ergy density induces a deformation of space-time, and signals from the deformed region
Page 419 arrive with a certain delay due to that deformation. The energy indeterminacy of the
Copyright © Christoph Schiller November 1997–January 2007
source leads to an indeterminacy in the deformation, and thus in the delay. The expres-
* Physically, this condition means being sure that there is only one clock: if ∆E A E, it would be impossible
to distinguish between a single clock and a clock–anticlock pair created from the vacuum, or a component
together with two such pairs, and so on.
** It is amusing to explore how a clock larger than c δt would stop working, as a result of the loss of rigidity
Challenge 1464 n in its components.
*** Note that gravitation is essential here. The present argument differs from the well-known study on the lim-
Ref. 1056 itations of clocks due to their mass and their measuring time which was published by Salecker and Wigner
Ref. 1057 and summarized in pedagogical form by Zimmerman. Here, both quantum mechanics and gravity are in-
cluded, and therefore a different, lower, and more fundamental limit is found. Note also that the discovery
Ref. 1058 of black hole radiation does not change the argument: black hole radiation notwithstanding, measurement
devices cannot exist inside black holes.
d oes matter differ from vacuum? 1025
Ref. 1048 sion from general relativity for the deformation of the time part of the line element due to
a mass m is δt = mG~l c 3 . From the mass–energy relation, we see that an energy spread
∆E produces an indeterminacy ∆t in the delay:
∆t =
∆E G
. (674)
l c5
This determines the precision of the clock. Furthermore, the energy indeterminacy of the
clock is fixed by the indeterminacy relation for time and energy ∆E E ħ~∆t, which is in
turn fixed by the precision of the clock. Combining all this, we again find the relation
δt E t Pl for the minimum measurable time.
We are forced to conclude that in nature there is a minimum time interval. In other
words, at Planck scales the term ‘instant of time’ has no theoretical or experimental basis.
Another way to deduce this limit reverses the roles of general relativity and quantum the-
ory. To measure the distance between two objects, we have to localize the first object with
respect to the other within a certain interval ∆x. The corresponding energy indetermin-
acy obeys ∆E = c(c 2 m 2 + (∆p)2 )1~2 E cħ~∆x. However, general relativity shows that a
Ref. 1038, Ref. 1048 small volume filled with energy changes the curvature of space-time, and thus changes
Ref. 1060, Ref. 1061 the metric of the surrounding space. For the resulting distance change ∆l, compared with
1026 xi general rel ativity versus quantum theory
d2 =
L
, (677)
dm~dt
one sees directly that a minimum action and a maximum rate of change of mass imply
a minimum distance. In other words, quantum theory and general relativity, when put
together, imply a minimum distance. Copyright © Christoph Schiller November 1997–January 2007
Ref. 1066 These results are often expressed by the so-called generalized indeterminacy principle
where f is a numerical factor of order unity. A similar expression holds for the time–
energy indeterminacy relation. The first term on the right-hand side is the usual quantum-
mechanical indeterminacy. The second term is negligible for everyday energies, and is
d oes matter differ from vacuum? 1027
significant only near Planck energies; it is due to the changes in space-time induced by
Challenge 1465 e gravity at these high energies. You should be able to show that the generalized principle
(678) implies that ∆x can never be smaller than f 1~2 l Pl .
The generalized indeterminacy principle is derived in exactly the same way in which
Heisenberg derived the original indeterminacy principle ∆p∆x E ħ~2, namely by study-
Ref. 1066 ing the deflection of light by an object under a microscope. A careful re-evaluation of the
Ref. 1067 process, this time including gravity, yields equation (678). For this reason, all attempts to
Ref. 1068 unify quantum mechanics and gravity must yield this relation. Indeed, it appears in the
Ref. 1069, Ref. 1070 theory of canonical quantum gravity, in superstring theory, and in the quantum group
Ref. 1071 approach.
Remember that quantum mechanics begins with the realization that the classical
concept of action makes no sense below the value of ħ~2; similarly, unified theories be-
gin with the realization that the classical concepts of time and length make no sense be-
low Planck scales. However, the usual description of space-time does contain such small
which of course has no counterpart in standard quantum mechanics. This shows that
space-time events do not exist. A final way to convince oneself that points have no mean-
ing is to observe that a point is an entity with vanishing volume; however, the minimum
volume possible in nature is the Planck volume VPl = l Pl 3
.
Space-time points are idealizations of events – but this idealization is inadequate. The
use of the concept of ‘point’ is similar to the use of the concept of ‘ether’ a century ago: it
is impossible to detect, and it is only useful as a convenient notion until a way to describe
nature without it has been found. Like the ‘aether’, ‘points’ also lead reason astray.
In other words, the Planck units not only provide natural units; they also provide –
within a factor of order one – the limit values of space and time intervals.
Farewell to the space-time manifold Copyright © Christoph Schiller November 1997–January 2007
The consequences of the Planck limits for measurements of time and space can be taken
much further. It is often said that given any two points in space or any two instants of time,
there is always a third in between. Physicists sloppily call this property continuity, while
mathematicians call it denseness. However, at Planck scales this property cannot hold,
since there are no intervals smaller than the Planck time. Thus points and instants are
not dense, and between two points there is not always a third. This means that space and
time are not continuous. Of course, at large scales they are – approximately – continuous,
in the same way that a piece of rubber or a liquid seems continuous at everyday scales,
even though it is not at a small scale.
All paradoxes resulting from the infinite divisibility of space and time, such as Zeno’s
argument on the impossibility of distinguishing motion from rest, or the Banach–Tarski
1028 xi general rel ativity versus quantum theory
paradox, are now avoided. We can dismiss them straight away because of their incorrect
premises concerning the nature of space and time.
But let us go on. Special relativity, quantum mechanics and general relativity all rely
on the idea that time can be defined for all points of a given reference frame. However,
two clocks a distance l apart cannot be synchronized with arbitrary precision. Since the
distance between two clocks cannot be measured with an error smaller than the Planck
length l Pl , and transmission of signals is necessary for synchronization, it is not possible
to synchronize two clocks with a better precision than l Pl ~c = t Pl , the Planck time. So
use of a single time coordinate for a whole reference frame is only an approximation, and
cannot be maintained in any precise description of nature.
Moreover, since the time difference between events can only be measured within a
Planck time, for two events distant in time by this order of magnitude, it is not possible
Ref. 1073 to say with complete certainty which of the two precedes the other. This is an important
fact. If events cannot be ordered, then the very concept of time, which was introduced
Even the number of spatial dimensions makes no sense at Planck scales. Let us remind
ourselves how to determine this number experimentally. One possible way is to determ-
ine how many points we can choose in space such that all the distances between them are
equal. If we can find at most n such points, the space has n − 1 dimensions. But if reliable
length measurement at Planck scales is not possible, there is no way to determine reliably
the number of dimensions of space with this method.
Another way to check for three spatial dimensions is to make a knot in a shoe string
and glue the ends together: since it stays knotted, we know that space has three dimen-
sions, because there is a mathematical theorem that in spaces with greater or fewer than
three dimensions, knots do not exist. Again, at Planck scales, we cannot say whether a
string is knotted or not, because measurement limits at crossings make it impossible to
say which strand lies above the other.
d oes matter differ from vacuum? 1029
There are many other methods for determining the dimensionality of space.* In all
cases, the definition of dimensionality is based on a precise definition of the concept of
neighbourhood. At Planck scales, however, length measurements do not allow us to say
whether a given point is inside or outside a given region. In short, whatever method we
use, the lack of precise length measurements means that at Planck scales, the dimension-
ality of physical space is not defined. It should not, therefore, come as a surprise that when
we approach these scales, we may get a scale-dependent answer for the number of dimen-
sions, which may be different from three.
The reasons for the problems with space-time become most evident when we remem-
Ref. 1075 ber Euclid’s well-known definition: ‘A point is that which has no part.’ As Euclid clearly
understood, a physical point, as an idealization of position, cannot be defined without
some measurement method. Mathematical points, however, can be defined without ref-
erence to a metric. They are just elements of a set, usually called a ‘space’. (A ‘measurable’
or ‘metric’ space is a set of points equipped with a measure or a metric.)
* For example, we can determine the dimension using only the topological properties of space. If we draw a
Copyright © Christoph Schiller November 1997–January 2007
so-called covering of a topological space with open sets, there are always points that are elements of several
sets of the covering. Let p be the maximal number of sets of which a point can be an element in a given
covering. The minimum value of p over all possible coverings, minus one, gives the dimension of the space.
In fact, if physical space is not a manifold, the various methods for determining the dimensionality may
give different answers. Indeed, for linear spaces without norm, the dimensionality cannot be defined in a
unique way. Different definitions (fractal dimension, Lyapunov dimension, etc.) are possible.
** Where does the incorrect idea of continuous space-time have its roots? In everyday life, as well as in
physics, space-time is a book-keeping device introduced to describe observations. Its properties are extrac-
ted from the properties of observables. Since observables can be added and multiplied, like numbers, we
infer that they can take continuous values, and, in particular, arbitrarily small values. It is then possible to
define points and sets of points. A special field of mathematics, topology, shows how to start from a set of
points and construct, with the help of neighbourhood relations and separation properties, first a topological
space, then, with the help of a metric, a metric space. With the appropriate compactness and connectedness
relations, a manifold, characterized by its dimension, metric and topology, can be constructed.
1030 xi general rel ativity versus quantum theory
To sum up, physical space-time cannot be a set of mathematical points. But there are
more surprises. At Planck scales, since both temporal and spatial order break down, there
is no way to say if the distance between two nearby space-time regions is space-like or
time-like. At Planck scales, time and space cannot be distinguished from each other.
In addition, we cannot state that the topology of space-time is fixed, as general relativ-
ity implies. The topology changes mentioned above that are required for particle reactions
do become possible. In this way another of the contradictions between general relativity
and quantum theory is resolved.
In summary, space-time at Planck scales is not continuous, not ordered, not endowed
with a metric, not four-dimensional, and not made up of points. It satisfies none of the
defining properties of a manifold.* We conclude that the concept of a space-time manifold
has no justification at Planck scales. This is a strong result. Even though both general
relativity and quantum mechanics use continuous space-time, the combined theory does
not.
* A manifold is what looks locally like a Euclidean space. The exact definition can be found in Appendix D.
d oes matter differ from vacuum? 1031
those scales the superposition principle cannot be backed up by experiment, even the
famous Wheeler–DeWitt equation, often assumed to describe quantum gravity, cannot
be valid.
Similarly, permutation symmetry is based on the premise that we can distinguish two
points by their coordinates, and then exchange particles between those locations. As we
have just seen, this is not possible if the distance between the two particles is very small.
We conclude that permutation symmetry has no experimental basis at Planck scales.
Even discrete symmetries, like charge conjugation, space inversion and time reversal,
cannot be correct in this domain, because there is no way to verify them exactly by meas-
urement. CPT symmetry is not valid at Planck scales.
Finally we note that all types of scaling relations break down at small scales. As a result,
renormalization symmetry is also lost at Planck scales.
All these results are consistent: if there are no symmetries at Planck scales, there are
also no observables, since physical observables are representations of symmetry groups.
hints at this stage. The first step necessary to improve our description of motion is the
recognition that abandoning ‘points’ means abandoning the local description of nature.
Both quantum mechanics and general relativity assume that the phrase ‘observable at a
point’ has a precise meaning. Because it is impossible to describe space as a manifold, this
expression is no longer useful. The unification of general relativity and quantum physics
forces the adoption of a non-local description of nature at Planck scales.
The existence of a minimum length implies that there is no way to physically distin-
guish between locations that are even closer together. We are tempted to conclude that
no pair of locations can be distinguished, even if they are one metre apart, since on any
path joining two points, no two locations that are close together can be distinguished. The
problem is similar to the question about the size of a cloud or of an atom. If we measure
1032 xi general rel ativity versus quantum theory
water density or electron density, we find non-vanishing values at any distance from the
centre of the cloud or the atom; however, an effective size can still be defined, because it
is very unlikely that the effects of the presence of a cloud or of an atom can be seen at
distances much larger than this effective size. Similarly, we can guess that two points in
space-time at a macroscopic distance from each other can be distinguished because the
probability that they will be confused drops rapidly with increasing distance. In short,
we are thus led to a probabilistic description of space-time. Space-time becomes a macro-
scopic observable, a statistical or thermodynamic limit of some microscopic entities.
We note that a fluctuating structure for space-time would also avoid the problems of
fixed structures with Lorentz invariance. This property is of course compatible with a
statistical description. In summary, the experimental observations of special relativity –
Lorentz invariance, isotropy and homogeneity – together with the notion of a minimum
distance, point towards a description of space-time as fluctuating. Research efforts in
Ref. 1082 quantum gravity, superstring theory and quantum groups have independently confirmed
Farewell to particles
In every example of motion, some object is involved. One of the important discoveries
of the natural sciences was that all objects are composed of small constituents, called
elementary particles. Quantum theory shows that all composite, non-elementary objects
have a finite, non-vanishing size. This property allows us to determine whether a particle
is elementary or not. If it behaves like a point particle, it is elementary. At present, only the
leptons (electron, muon, tau and the neutrinos), the quarks and the radiation quanta of
the electromagnetic, weak and strong nuclear interactions (the photon, the W and Z bo-
Page 1190 sons, and the gluons) have been found to be elementary. A few more elementary particles
are predicted by various refinements of the standard model. Protons, atoms, molecules,
Page 1022 cheese, people, galaxies and so on are all composite, as shown in Table 74. Elementary
particles have vanishing size and are characterized by their spin and their mass.
Although the definition of ‘elementary particle’ as point particle is all we need in the
following argument, it is not complete. It seems to leave open the possibility that future
experiments could show that electrons or quarks are not elementary. This is not so! In fact,
any particle smaller than its own Compton wavelength is elementary. If it were composite,
Copyright © Christoph Schiller November 1997–January 2007
there would be a lighter particle inside it, which would have a larger Compton wavelength
than the composite particle. This is impossible, since the size of a composite particle must
be larger than the Compton wavelength of its components. (The alternative possibility
that all components are heavier than the composite does not lead to satisfying physical
Ref. 1083 properties: for example, it leads to intrinsically unstable components.)
The size of an object, such as those given in Table 74, is defined as the length at which
differences from point-like behaviour are observed. The size d of an object is determined
by measuring how it scatters a beam of probe particles. For example, the radius of the
atomic nucleus was determined for the first time in Rutherford’s experiment using alpha
particle scattering. In daily life as well, when we look at objects, we make use of scattered
photons. In general, in order for scattering to be useful, the effective wavelength λ = ħ~mv
d oes matter differ from vacuum? 1033
of the probe must be smaller than the object size d to be determined. We thus need d A
λ = ħ~mv E ħ~mc. In addition, in order for a scattering experiment to be possible, the
object must not be a black hole, since, if it were, it would simply swallow the approaching
particle. This means that its mass m must be smaller than that of a black hole of the same
size; in other words, from equation (668) we must have m < dc 2 ~G. Combining this with
the previous condition we get ¾
dA = l Pl .
ħG
(681)
c3
In other words, there is no way to observe that an object is smaller than the Planck length.
Thus, in principle there is thus no way to deduce from observations that a particle is point-
like. In fact, the term ‘point particle’ makes no sense at all! Of course, there is a relation
between the existence of a minimum length for empty space and the existence of a min-
imum length for objects. If the term ‘point of space’ is meaningless, then the term ‘point
* The existence of a minimum size for a particle has nothing to do with the impossibility, in quantum theory,
of localizing a particle to within less than its Compton wavelength.
** Andrei Dmitrievich Sakharov, Soviet nuclear physicist (1921–1989). One of the keenest thinkers in phys-
ics, Sakharov, among others, invented the Tokamak, directed the construction of nuclear bombs, and ex-
plained the matter–antimatter asymmetry of nature. Like many others, he later campaigned against nuclear
weapons, a cause for which he was put into jail and exile, together with his wife, Yelena Bonner. He received
the Nobel Peace Prize in 1975.
1034 xi general rel ativity versus quantum theory
otherwise the probe will be swallowed. Inserting the definitions of the two quantities and
neglecting the factor 2, we again get the limit m < m Pl . (In fact it is a general property
of descriptions of nature that a minimum space-time interval leads to an upper limit for
Ref. 1085 masses of elementary particles.) The importance of the Planck mass will become even
clearer shortly.
Another property connected with the size of a particle is its electric dipole moment.
This describes the deviation of its charge distribution from spherical. Some predictions
from the standard model of elementary particles give as an upper limit for the electron
Ref. 1086 dipole moment d e a value of
Sd e S
< 10−39 m , (683)
e
where e is the charge of the electron. This value is ten thousand times smaller than the
Planck length l Pl . Since the Planck length is the smallest possible length, we seem to have
a contradiction here. However, a more careful, recent prediction from the standard model
change sign under exchange of particles, whereas for two particles with spin 1/2 it does.
We see directly that both transformations are impossible to study at Planck scales.
Given the limit on position measurements, the position of a rotation axis cannot be well
defined, and rotations become impossible to distinguish from translations. Similarly, pos-
itional imprecision makes it impossible to determine of precise separate positions for ex-
change experiments. In short, at Planck scales, spin cannot be defined, and fermions cannot
be distinguished from bosons, or, phrased differently, matter cannot be distinguished from
radiation. We can thus easily see that supersymmetry, a unifying symmetry between bo-
sons and fermions, somehow becomes natural at Planck scales.
But let us now move on to the main property of elementary particles.
d oes matter differ from vacuum? 1035
Farewell to mass
The Planck mass divided by the Planck volume, i.e. the Planck density, is given by
c5
ρ Pl = = 5.2 ë 1096 kg~m3 (685)
G2ħ
and is a useful concept in the following. One way to measure the (gravitational) mass M
enclosed in a sphere of size R, and thus (roughly) of volume R 3 , is to put a test particle in
orbit around it at that same distance R. Universal gravitation then gives for the mass M
Challenge 1466 e the expression M = Rv 2 ~G, where v is the speed of the orbiting test particle. From v < c,
we deduce that M < c 2 R~G; since the minimum value for R is the Planck distance, we get
(again neglecting factors of order unity) a limit for the mass density ρ, namely
ρ < ρ Pl . (686)
∆M E
ħ
. (687)
cR
Note that for everyday situations, this error is extremely small, and other errors, such as
the technical limits of the balance, are much larger.
To check this result, we can explore another situation. We even use relativistic expres-
sions, in order to show that the result does not depend on the details of the situation or
the approximations. Imagine having a mass M in a box of size R, and weighing the box
with a scale. (It is assumed that either the box is massless or that its mass is subtracted by
the scale.) The mass error is given by ∆M = ∆E~c 2 , where ∆E is due to the indeterminacy
in the kinetic energy of the mass inside the box. Using the expression E 2 = m 2 c 4 + p2 c 2 ,
Copyright © Christoph Schiller November 1997–January 2007
we get that ∆M E ∆p~c, which again reduces to equation (687). Now that we are sure of
the result, let us continue.
From equation (687) we deduce that for a box of Planck dimensions, the mass meas-
urement error is given by the Planck mass. But from above we also know that the mass
that can be put inside such a box must not be larger than the Planck mass. Therefore, for a
box of Planck dimensions, the mass measurement error is larger than (or at best equal to)
the mass contained in it: ∆M E M Pl . In other words, if we build a balance with two boxes
of Planck size, one empty and the other full, as shown in Figure 408, nature cannot decide
which way the balance should hang! Note that even a repeated or a continuous measure-
ment will not resolve the situation: the balance will change inclination at random, staying
1036 xi general rel ativity versus quantum theory
R R
find the intermediate result that at Planck scales, inertial and gravitational mass cannot be
distinguished. Even the balance experiment shown in Figure 408 illustrates this: at Planck
scales, the two types of mass are always inextricably linked.) Now, in any scattering exper-
iment, for example in a Compton-type experiment, the mass measurement is performed
by measuring the wavelength change δλ of the probe before and after the scattering. The
mass indeterminacy is given by
=
∆M ∆δλ
. (689)
M δλ
In order to determine the mass in a Planck volume, the probe has to have a wavelength of
the Planck length. But we know from above that there is always a minimum wavelength
indeterminacy, given by the Planck length l Pl . In other words, for a Planck volume the
d oes matter differ from vacuum? 1037
mass error is always as large as the Planck mass itself: ∆M E M Pl . Again, this limit is a
direct consequence of the limit on length and space measurements.
This result has an astonishing consequence. In these examples, the measurement error
is independent of the mass of the scatterer: it is the same whether or not we start with a
situation in which there is a particle in the original volume. We thus find that in a volume
of Planck size, it is impossible to say whether or not there is something there when we
probe it with a beam!
In short, all arguments lead to the same conclusion: vacuum, i.e. empty space-time,
cannot be distinguished from matter at Planck scales. Another common way to express this
state of affairs is to say that when a particle of Planck energy travels through space it will
be scattered by the fluctuations of space-time itself, as well as by matter, and the two cases
are indistinguishable. These surprising results rely on a simple fact: whatever definition of
mass we use, it is always measured via combined length and time measurements. (This is
even the case for normal weighing scales: mass is measured by the displacement of some
Challenge 1467 n that accelerator have to be? But by the standards of everyday life, the Planck energy is
Challenge 1468 n rather small. Measured in litres of gasoline, how much fuel does it correspond to?
**
The usual concepts of matter and of radiation are not applicable at Planck scales. Usually, it
is assumed that matter and radiation are made up of interacting elementary particles. The
concept of an elementary particle implies an entity that is discrete, point-like, real and not
virtual, has a definite mass and a definite spin, is distinct from its antiparticle, and, most of
all, is distinct from vacuum, which is assumed to have zero mass. All these properties are
lost at Planck scales. At Planck scales, the concepts of ‘mass’, ‘vacuum’, ‘elementary particle’,
‘radiation’ and ‘matter’ do not make sense.
**
Do the large errors in mass measurements imply that mass can be negative at Planck
**
We now have a new answer to the old question: why is there something rather than noth-
ing? At Planck scales, there is no difference between something and nothing. We can now
honestly say about ourselves that we are made of nothing.
**
If vacuum cannot be distinguished from matter or radiation, then it is incorrect to claim
that the universe appeared from nothing. Thus, the naive idea of creation is a logical
impossibility, resulting from a lack of imagination.
**
Special relativity implies that no length or energy can be invariant. Since we have come
to the conclusion that the Planck energy and the Planck length are invariant, there must
be deviations from Lorentz invariance at high energy. Can you imagine what the effects
Challenge 1470 r would be? What kind of experiment could measure them? If you find an answer, publish
Ref. 1090 it! First attempts are appearing in the literature. We will return to the issue later on, with
some interesting insights.
Copyright © Christoph Schiller November 1997–January 2007
**
Quantum mechanics alone gives, via the Heisenberg indeterminacy relation, a lower limit
to the spread of measurements, but, strangely enough, not on their precision, i.e. not on the
Ref. 1089 number of significant digits. Jauch gives an example: atomic lattice constants are known
to a much higher precision than the positional indeterminacy of single atoms inside the
Ref. 1070 crystal.
It is sometimes claimed that measurement indeterminacies smaller than the Planck
values are possible for large enough numbers of particles. Can you show why this is in-
Challenge 1471 ny correct, at least for space and time?
**
Of course, the idea that vacuum is not empty is not new. More than two thousand
d oes matter differ from vacuum? 1039
years ago, Aristotle argued for a filled vacuum, although his arguments were incorrect
as seen from today’s perspective. In the fourteenth century there was much discussion
on whether empty space was composed of indivisible entities, but the debate died down
again later.
**
A Planck-energy particle falling in a gravitational field would gain energy. But the Planck
Challenge 1472 n energy is the highest energy in nature. What does this apparent contradiction imply?
**
One way to generalize the results presented here is to assume that, at Planck energy,
Ref. 1059 nature is event-symmetric, i.e., symmetric under exchange of any two events. This idea,
developed by Phil Gibbs, provides an additional formulation of the strange behaviour of
nature at extreme scales.
**
If vacuum and matter cannot be distinguished, we cannot distinguish between objects
Page 37 and their environment. However, this was one the starting points of our journey. Some Copyright © Christoph Schiller November 1997–January 2007
**
1040 xi general rel ativity versus quantum theory
Challenge 1473 d Is there a smallest possible momentum? And a smallest momentum error?
**
There is a maximum acceleration in nature. Can you deduce the value of this so-called
Challenge 1474 n Planck acceleration? Does it require quantum theory?
**
Given that time becomes an approximation at Planck scales, can we still ask whether
nature is deterministic?
Let us go back to the beginning. We can define time, because in nature change is not
random, but gradual. What is the situation now that we know that time is only approxim-
ate? Is non-gradual change possible? Is energy conserved? In other words, are surprises
possible?
It is correct to say that time is not defined at Planck scales, and that therefore that
Vacuum has zero mass density at large scales, but Planck mass density at Planck scales.
Cosmological measurements show that the universe is flat or almost flat at large scales:
in other words, its energy density is low. In contrast, quantum field theory maintains that
vacuum has a high energy density (or mass density) at small scales. Since mass is scale-
dependent, both viewpoints are right. This hints at a solution to what is usually called the
Ref. 1092 cosmological constant problem. The contradiction is only apparent. We will have more
to say about this issue later on.
**
Challenge 1476 n When can matter and vacuum be distinguished? At what energy?
**
d oes matter differ from vacuum? 1041
If matter and vacuum cannot be distinguished, there is a lack of information, which im-
plies an intrinsic basic entropy associated with any part of the universe. We will come
back to this topic shortly, when we discuss the entropy of black holes.
**
Can we distinguish between a liquid and a gas by looking at a single atom? No, only by
looking at many. Similarly, we cannot distinguish between matter and vacuum by look-
ing at one point, but only by looking at many. We must always average. However, even
averaging is not completely successful. Distinguishing matter from vacuum is like distin-
guishing clouds from the clear sky: like clouds, matter has no precise boundary.
**
We have found that space and time are neither continuous nor made up of points. In-
deed, the combination of relativity and quantum theory makes this impossible. In order
c3
K< = 0.39 ë 1070 m−2 (690)
Għ
as a limit for the surface curvature K in nature. In other words, the universe has never
been a point, never had zero age, never had infinite density, and never had infinite
curvature. It is not difficult to get a similar limit for temperature or any other physical
Challenge 1477 n quantity.
In short, since events do not exist, the big bang cannot have been an event. There never
was an initial singularity or a beginning of the universe.
In this rapid journey, we have destroyed all the experimental pillars of quantum the-
ory: the superposition principle, space-time symmetry, gauge symmetry, renormaliza-
tion symmetry and permutation symmetry. We also have destroyed the foundations of
general relativity, namely the concepts of the space-time manifold, fields, particles and
mass. We have even seen that matter and space-time cannot be distinguished. All these
conclusions can be drawn in a simpler manner, by using the minimum action of quantum
Page 1088 theory and the maximum force of general relativity.
It seems that we have lost every concept used for the description of motion, and thus
made its description impossible. We naturally ask whether we can save the situation.
First of all, since matter and radiation are not distinguishable from vacuum, the quest
for unification in the description of elementary particles is correct and necessary.
Moreover, since the concepts of ‘mass’, ‘time’ and ‘space’ cannot be distinguished from
each other, a new, single entity is necessary to define both particles and space-time. At
the beginning of the twenty-first century, three approaches are being pursued to find
out more about this new entity. The first, quantum gravity – especially the use of the loop
Ref. 1044 representation and Ashtekar’s new variables – starts by generalizing space-time symmetry.
Ref. 1045 The second, string theory, starts by generalizing gauge symmetries and interactions. The
Ref. 1046 third, using algebraic quantum groups, looks for generalized permutation symmetries. We
will describe these approaches in more detail later on.
Before we go on, however, we should check what we have deduced so far against ex-
periment.
ħ(ω 1 − ω 2 )d
E char = . (691)
c∆t
Ref. 1094 This energy value is 8 ë 1016 GeV for the best measurement to date. This is not far from
the Planck energy; in fact, it is even closer when the missing factors of order unity are
included. It is expected that the Planck scale will be reached in a few years, so that it
will become possible to test whether the quantum nature of space-time influences the
Ref. 1095, Ref. 1096 dispersion of light signals. Planck-scale effects should produce a dispersion. Detecting it
Copyright © Christoph Schiller November 1997–January 2007
This expression is deduced simply by combining the measurement limit of a ruler, from
quantum theory, with the requirement that the ruler not be a black hole. We will discuss
Page 1057 this result in more detail in the next section. The sensitivity of the detectors to noise might
reach the required level in the early twenty-first century. The noise induced by quantum
Ref. 1099 gravity effects is also predicted to lead to detectable quantum decoherence and vacuum
fluctuations.
A third candidate experiment for measuring quantum gravity is the detection of the
loss of CPT symmetry at high energies. Especially in the case of the decay of certain ele-
Ref. 1096 mentary particles, such as neutral kaons, the precision of experimental measurement is
approaching the detection of Planck-scale effects.
A fourth possibility is that quantum gravity effects may change the threshold energy
Ref. 1100 at which certain particle reactions become possible. It may be that extremely high-energy
photons or cosmic rays will make it possible to prove that Lorentz invariance is indeed
broken near the Planck scale.
A fifth possibility is that the phase of light that travels over long distances might get
Additional predictions
A few candidates for quantum gravity effects have also been predicted by the author. Let
us summarize and clarify the results found so far.* Special relativity starts with the dis-
covery that observable speeds are limited by the speed of light c. Quantum theory starts
with the result that observable actions are limited by ħ~2. The theory of gravitation shows
that for every system with length L and mass M, the observable ratio L~M is limited by
the constant 4G~c 2 . Combining these results, we deduce that all physical observables are
º by what are usually called the Planck values – modified by one or several factors
bound,
of 2 . We need to replace ħ by ħ~2 and G by 4G in all the defining expressions for Planck
º the corresponding measurement limits. In particular, the limit
quantities, in order to find
º
for lengths and times is 2 times the Planck value, and the limit for energy is the Planck
value divided by 8 .** Interestingly, the existence of bounds on all observables makes it
possible to make several testable predictions for the unification of quantum theory and
Copyright © Christoph Schiller November 1997–January 2007
general relativity. These predictions do not depend on the details of the final theory.
However, first we need to refine the argument that we have just presented. The (cor-
rected) Planck values do not seem to be the actual limits to measurements. The actual
measurement limits are stricter.
First of all, for any measurement, we need certain fundamental conditions to be real-
ized. Take the length measurement of an object. We need to be able to distinguish between
matter and radiation, since the object to be measured is made up of matter, and since ra-
* This subsection, in contrast to the ones so far, is speculative; it was added in February 2001.
º half the minimum area. Of
** The entropy of a black hole is thus given by the ratio between its horizon and
course, a detailed investigation also shows that the Planck mass (divided by 8 ) is the limit for elementary
particles from below and for black holes from above. For everyday systems, there is no limit.
1044 xi general rel ativity versus quantum theory
diation is the measurement tool that is used to read distances from the ruler. For a meas-
urement process, we need an interaction, which implies the use of radiation. Note that
this is so even with the use of particle scattering to determine lengths.
In addition, for the measurement of wavelengths we need to distinguish between mat-
ter and radiation, because matter is necessary if we wish to compare two wavelengths.
Page 1123 In fact, all length measurements require the distinction between matter and radiation.*
However, this distinction is impossible at the energy of grand unification, where the
Ref. 1102 electroweak and the strong nuclear interactions are unified. At and above this energy,
particles of matter and particles of radiation transform into each other; in practice they
cannot be distinguished.
If all matter and radiation particles were the same, or mixtures of each other, mass
could not be defined. Nor could spin, charge or any other quantum numbers. To sum up,
no measurement can be performed at energies above the GUT unification energy.
In other words, it is not at the Planck scale that the particle concept (and thus the
Page 1034 This result is in contradiction with one of the predictions of the standard model, but not
Ref. 1104 with others. More interestingly, it is within the reach of future experiments. This improved
limit may be the simplest possible measurement of yet-unpredicted quantum gravity ef-
fects. Measuring the dipole moment could thus be a way to determine the unification
energy (the factor 800) independently of high-energy physics experiments, and possibly
to a higher precision.
* In the modern language of high-energy physics: all measurements require broken supersymmetry.
some experimental predictions 1045
Inverse of
coupling
constant
electromagnetism
Interestingly, the bound on the measurability of observables also puts a bound on the
measurement precision for each observable. This bound is of no importance in everyday
life, but it is important at high energy. What is the precision with which a coupling con-
stant can be measured? Let us take the electromagnetic coupling constant as an example.
This constant α, also called the fine structure constant, is related to the charge q by
¼
q= 4πε 0 ħcα . (695)
Now, any electrical charge is defined and measured by comparing, in an electrical field,
Copyright © Christoph Schiller November 1997–January 2007
the acceleration to which the charged object is subjected with the acceleration of some
unit charge q unit . In other words, we have
=
q ma
. (696)
q unit m unit a unit
Therefore any error in mass and acceleration measurements implies errors in measure-
ments of charge and the coupling constant.
Page 881 We found in the part on quantum theory that the electromagnetic, weak and strong
interactions are characterized by coupling constants, the inverses of which depend lin-
early on the logarithm of the energy. It is usually assumed that these three lines meet at
Ref. 1103 the unification energy. Measurements put the unification coupling value at about 1/26.
1046 xi general rel ativity versus quantum theory
We know from the above discussions that the minimum error for any energy meas-
urement at high energies is given by the ratio between the energy to be measured and
the limit energy. If we plot a graph of the coupling constants against energy, we get the
result shown in Figure 409. The search for consequences of this fan-out effect is delightful.
One way to express the result is to say that coupling constants are by definition subject
to an error. However, all measurement devices, be they clocks, metre rules, scales or any
other devices, use electromagnetic effects at energies of around 1 eV plus the electron rest
energy. This is about 10−16 times the GUT energy. As a consequence, the measurement
precision of any observable should be limited to about 19 digits. The maximum precision
currently achieved is 15 digits, and, for the electromagnetic coupling constant, about 9
digits. It will thus be quite some time before this prediction can be tested.
The challenge is to find a system in which the spreading coupling constant value ap-
pears more clearly in the measurements. For example, it may be that high-precision meas-
urements of the д-factor of elementary particles or of high-energy cosmic ray reactions
Biblio graphy
1035 Steven Weinberg, The cosmological constant problem, Reviews of Modern Physics 61,
pp. 1–23, 1989. Cited on page 1018.
1036 Steven Weinberg, The Quantum Theory of Fields, Cambridge University Press,
volumes I, 1995, and II, 1996. Cited on page 1018.
1037 The difficulties are summarised by B.S. de Witt, Quantum field theory in curved space-
time, Physics Reports 19, pp. 295–357, 1975. Cited on page 1019.
1038 C.W. Misner, K.S. Thorne & J.A. Wheeler, Gravitation, Freeman, 1973. Cited on
pages 1020, 1022, and 1025.
1039 J.A. Wheeler, in Relativity, Groups and Topology, edited by C. DeWitt & B. DeWitt, Gor-
don and Breach, 1994. See also J.A. Wheeler, Physics at the Planck length, International
Journal of Modern Physics A 8, pp. 4013–4018, 1993. Cited on page 1020.
J.L. Friedman & R.D. Sorkin, Spin 1/2 from gravity, Physical Review Letters 44,
ics A 10, pp. 145–165, 1995, preprint at http://arxiv.org/abs/gr-qc/9403008. This paper also
includes an extensive bibliography. Cited on page 1026.
1065 G. Amelino-Camelia, Limits on the measurability of space-time distances in (the
semiclassical approximation of) quantum gravity, Modern Physics Letters A 9, pp. 3415–
3422, 1994, preprint at http://arxiv.org/abs/gr-qc/9603014. See also Y.J. Ng & H.
Van Dam, Limit to space-time measurement, Modern Physics Letters A 9, pp. 335–
340, 1994. Cited on page 1026.
1066 The generalized indeterminacy relation is implicit in Ref. 1060, especially on page 852,
but the issue is explained rather unclearly. Obviously the author considered the result too
simple to be mentioned explicitly. Cited on pages 1026 and 1027.
1067 C. Rovelli & L. Smolin, Discreteness of area and volume in quantum gravity, Nuclear
Physics B 442, pp. 593–619, 1995. R. Loll, The volume operator in discretized quantum
bibliography 1049
of space-time, Physical Review 74, pp. 209–210, 1948. A. Das, Cellular space-time and
quantum field theory, Il Nuovo Cimento 18, pp. 482–504, 1960. Cited on page 1031.
1078 D. Finkelstein, ‘Superconducting’ causal nets, International Journal of Theoretical Phys-
ics 27, pp. 473–519, 1985. Cited on page 1031.
1079 N.H. Christ, R. Friedberg & T.D. Lee, Random lattice field theory: general formu-
lation, Nuclear Physics B 202, pp. 89–125, 1982. G. ’t Hooft, Quantum field theory for
elementary particles – is quantum field theory a theory?, Physics Reports 104, pp. 129–142,
1984. Cited on page 1031.
1080 See for example: L. B ombelli, J. Lee, D. Meyer & R.D. Sorkin, Space-time as
a causal set, Physical Review Letters 59, pp. 521–524, 1987. G. Brightwell & R.
Gregory, Structure of random space-time, Physical Review Letters 66, pp. 260–263, 1991.
Cited on page 1031.
1050 xi general rel ativity versus quantum theory
1081 For a discussion, see R. Sorabji, Time, Creation and the Continuum: Theories in Antiquity
and the Early Middle Ages, Duckworth, 1983. Cited on page 1031.
1082 For quantum gravity, see for example A. Ashtekar, Quantum geometry in action: big
bang and black holes, http://arxiv.org/abs/math-ph/0202008. Cited on page 1032.
1083 The false belief that particles like quarks or electrons are composite is slow to die out. See
for example: S. Fredriksson, Preon prophecies by the standard model, http://arxiv.org/
abs/hep-ph/0309213. Preon models gained popularity in the 1970s and 1980s, in particular
through the papers by J.C. Pati & A. Sal am, Physical Review D 10, p. 275, 1974, H.
Harari, Physics Letters B 86, p. 83, 1979, M.A. Shupe, Physics Letters B 86, p. 87, 1979,
and H. Fritzsch & G. Mandelbaum, Physics Letters B 102, p. 319, 1981. Cited on page
1032.
1084 A.D. Sakharov, Vacuum quantum fluctuations in curved space and the theory of grav-
itation, Soviet Physics – Doklady 12, pp. 1040–1041, 1968. Cited on page 1033.
1085 C. Wolf, Upper limit for the mass of an elementary particle due to discrete time quantum
γmc 2
E= p=
γmv
γmc 2
and γmc 2
. (697)
1+ E Pl
1+ E Pl
Another, similar approach of recent years, with a different proposal, is called ‘doubly
special relativity’. A recent summary is G. Amelino-Camelia, Doubly-special relativ-
ity: first results and key open problems, http://arxiv.org/abs/gr-qc/0210063. The paper
shows how conceptual problems hinder the advance of the field, as does R. Aloisio, A.
Gal ante, A.F. Grillo, E. Luzio & F. Méndez, Approaching space-time through
velocity in doubly special relativity, preprint at http://arxiv.org/abs/gr-qc/0410020. Cited
on page 1038.
1091 H.D. Zeh, On the interpretation of measurement in quantum theory, Foundations of Phys-
ics 1, pp. 69–76, 1970. Cited on page 1039.
bibliography 1051
1092 Steven Weinberg, The cosmological constant problem, Reviews of Modern Physics 61,
pp. 1–23, 1989. Cited on page 1040.
1093 See Y.J. Ng, W.A. Christiansen & H. van Dam, Probing Planck-scale physics with
extragalactic sources?, Astrophysical Journal 591, pp. L87–L90, 2003, preprint at http://
arxiv.org/abs/astro-ph/0302372; D.H. Coule, Planck scale still safe from stellar images,
Classical and Quantum Gravity 20, pp. 3107–3112, 2003, preprint at http://arxiv.org/abs/
astro-ph/0302333. Negative experimental results (and not always correct calculations)
are found in R. Lieu & L. Hillman, The phase coherence of light from extragalactic
sources – direct evidence against first order Planck scale fluctuations in time and space,
Astrophysical Journal 585, pp. L77–L80, 2003, and R. R agazzoni, M. Turatto & W.
Gaessler, The lack of observational evidence for the quantum structure of spacetime at
Planck scales, Astrophysical Journal 587, pp. L1–L4, 2003. Cited on page 1042.
1094 B.E. Schaefer, Severe limits on variations of the speed of light with frequency, Physical
Review Letters 82, pp. 4964–4966, 21 June 1999. Cited on page 1042.
1095 G. Amelino-Camelia, J. Ellis, N.E. Mavromatos, D.V. Nanopoulos & S.
1103 See the paper by U. Amaldi, W. de B oer & H. Fürstenau, Comparison of grand
unified theories with electroweak and strong coupling constants measured at LEP, Physics
Letters 260, pp. 447–455, 1991. This is the standard reference on the topic. Cited on pages
1044 and 1045.
1104 M.V. Romalis, W.C. Griffith, J.P. Jacobs & E.N. Fortson, New limit on the
permanent electric dipole moment of 199 Hg, Physical Review Letters 86, pp. 2505–2508, 19
March 2001, or at http://arxiv.org/abs/hep-ex/0012001. Their upper limit on the dipole mo-
ment of mercury atoms is 2.1 ë 1030 e m. Research for better measurement methods is pro-
gressing; a modern proposal is by S.K. L amoreaux, Solid state systems for electron elec-
tric dipole moment and other fundamental measurements, http://arxiv.org/abs/nucl-ex/
0109014. Cited on page 1044.
“ Erkenntnis.*
Paul Tillich, Auf der Grenze.
”
T
his strange question is the topic of the current leg of our mountain ascent. In
Ref. 1105 he last section we explored nature in the vicinity of Planck scales; but
he other limit, namely the description of motion at large, cosmological scales,
is equally fascinating. As we proceed, many incredible results will appear, and at the end
we will discover a surprising answer to the question in the title.
This section is not standard textbook material; a large part of it is original** and thus
speculative and open to question. It aims to explain in simple terms the ongoing research
in the domains of quantum gravity and superstring theory. But be warned: with almost
Cosmological scales
Hic sunt leones.***
“
The description of motion requires the application of general relativity whenever the scale
Antiquity
”
d of the situation is of the order of the Schwarzschild radius, i.e. whenever Copyright © Christoph Schiller November 1997–January 2007
d r S = 2Gm~c 2 . (698)
It is straightforward to confirm that, with the usually quoted mass m and size d of
Challenge 1480 n everything visible in the universe, this condition is indeed fulfilled. We do need general
relativity, and thus curved space-time, when talking about the whole of nature.
Similarly, quantum theory is required for the description of the motion of an object
whenever we approach it within a distance d of the order of the Compton wavelength λ C ,
* ‘The frontier is the really productive place of understanding’. Paul Tillich (1886–1965), German theologian,
socialist and philosopher.
** Written between June and December 2000.
*** ‘Here are lions.’ Written across unknown and dangerous regions on ancient maps.
1054 xi general rel ativity versus quantum mechanics
i.e. whenever
d λC =
h
. (699)
mc
Obviously, for the total mass of the universe this condition is not fulfilled. However, we
are not interested in the motion of the universe itself; we are interested in the motion
of its components. In the description of these components, quantum theory is required
whenever pair production and annihilation play a role. This is especially the case in the
early history of the universe and near the horizon, i.e. for the most distant events that
we can observe in space and time. We are thus obliged to include quantum theory in any
precise description of the universe.
Since at cosmological scales we need both quantum theory and general relativity, we
start our investigation with the study of time, space and mass, by asking at large scales
the same questions that we asked before at Planck scales.
new idea, although looking at the issue in more detail does reveal some surprises.
”
* This implies that so-called ‘oscillating universe’ models, in which it is claimed that ‘before’ the big bang
there were other phenomena, cannot be justified on the basis of nature or observations. They are based on
beliefs.
nature at l arge scales – is the universe something or nothing? 1055
In light of the above discussion, it may seem silly to about the age of the universe. The
Page 1183 value is found in many books and tables, including that of Appendix B, and its precise
determination is one of the most important quests in modern astrophysics. But is this
quest reasonable?
In order to measure the duration of a movement or the age of a system, we need a clock
that is independent of that movement or system, and thus outside the system. However,
there are no clocks outside the universe, and no clock inside it can be independent. In
fact we have just seen that no clock inside the universe can run throughout its history.
Time can be defined only if it is possible to distinguish between matter and space-time.
Given this distinction, we can talk either about the age of space-time, by assuming that
matter provides suitable and independent clocks – as is done in general relativity – or
about the age of matter, such as stars or galaxies, by assuming that the extension of space-
time, or some other matter, provides a good clock. Both possibilities are being explored
experimentally in modern astrophysics – and both give the same result, of about fourteen
The first way to measure the age of the universe* is to look at clocks in the usual sense of
”
Copyright © Christoph Schiller November 1997–January 2007
the word, namely at clocks made of matter. As explained in the part on quantum theory,
Ref. 1107 Salecker and Wigner showed that a clock built to measure a total time T with a precision
* Note that the age t 0 is not the same as the Hubble time T = 1~H 0 . The Hubble time is only a computed
quantity and (almost) always larger than the age; the relation between the two depends on the values of
the cosmological constant, the density and other parameters of the universe. For example, for the standard
‘hot big bang’ scenario, i.e. for the matter-dominated Einstein–de Sitter model, we have the simple relation
Ref. 1106 T = (3~2) t 0 .
1056 xi general rel ativity versus quantum mechanics
mA
ħ T
.
c 2 (∆t)2
(700)
A simple way to incorporate general relativity into this result was suggested by Ng and
Ref. 1108 van Dam. Any clock of mass m has a minimum resolution ∆t due to the curvature of
space that it introduces, given by
∆t A 3 .
Gm
(701)
c
If m is eliminated, these two results imply that a clock with a precision ∆t can only meas-
ure times T up to a certain maximum value, namely
(∆t)3
T<
Page 44
“ Oscar Wilde
Ever since people began to study physics, the concept of ‘time’ has designated what is
measured by a clock. But the inequality (702) implies that perfect clocks do not exist, and
”
thus that time is only an approximate concept: perfect time does not exist. Thus there
is no ‘idea’ of time, in the Platonic sense. In fact, all the discussion in the previous and
present sections can be seen as proof that there are no perfect or ‘ideal’ examples of any
classical or everyday concept.
nature at l arge scales – is the universe something or nothing? 1057
Time does not exist. Yet it is obviously a useful concept in everyday life. The key to
understanding this is energy. Any clock – in fact, any system of nature – is characterized
by a simple number, namely the highest ratio of its kinetic energy to the rest energy of
Challenge 1484 e its components. In daily life, this ratio is about 1 eV~10 GeV = 10−10 . Such low-energy
systems are well suited for building clocks. The more precisely the motion of the main
moving part – the pointer of the clock – can be kept constant, and monitored, the higher
the precision of the clock. To achieve very high precision, the pointer must have very high
mass. Indeed, both the position and the speed of the pointer must be measured, and the
two measurement errors are related by the quantum-mechanical indeterminacy relation
∆v ∆x A ħ~m. High mass implies low intrinsic fluctuation. Furthermore, in order to
screen the pointer from outside influences, even more mass is needed. This connection
between mass and accuracy might explain why more accurate clocks are usually more
expensive.
The standard indeterminacy relation quoted above is valid only at everyday energies.
To clarify the issue, we can calculate the error in measurement as a function of the
observation energy E meas . There are two limit cases. For low energies, the error due to
quantum effects is given by
∆t 1
(704)
T E meas
which decreases with increasing measurement energy. For high energies, however, the
error due to gravitational effects is given by
∆t E meas
(705)
T E Pl
1058 xi general rel ativity versus quantum mechanics
Relative
measurement error
1
quantum
total
error
error
quantum
gravity
∆Emin error
E
T=
d
. (706)
v
In simple terms, this is the method used to determine the age of the universe from the
expansion of space-time, for galaxies with red-shifts below unity.* The (positive) meas-
* At higher red-shifts, the speed of light, as well as the details of the expansion, come into play. To continue
with the analogy of the trees, we find that the trees are not straight all the way up to the top and that they
nature at l arge scales – is the universe something or nothing? 1059
2~3
A 2~3
∆d l Pl
(708)
T d
thus giving the same indeterminacy in the age of the universe as the one we found above
Challenge 1487 e in the case of material clocks.
We can try to reduce this error in two ways: by choosing objects at either small or
large distances. Let us start with small distances. In order to get high precision at small
Challenge 1488 e distances, we need high observation energies. It is fairly obvious that at observation ener-
gies near the Planck value, ∆T~T approaches unity. In fact, both terms on the right-hand
side of equation (707) become of order one. At these energies, ∆v approaches c and the
maximum value for d approaches the Planck length, for the same reason that at Planck
energies the maximum measurable time is the Planck time. In short, at Planck scales it is
impossible to say whether the universe is old or young.
Let us consider the other extreme, namely objects extremely far away, say with a red-
shift of z Q 1. Relativistic cosmology requires the diagram of Figure 411 to be replaced by
the more realistic diagram of Figure 412. The ‘light onion’ replaces the familiar light cone
Ref. 1106 of special relativity: light converges near the big bang.
In this case the measurement error for the age of the universe also depends on the
Copyright © Christoph Schiller November 1997–January 2007
distance and velocity errors. At the largest possible distances, the signals an object sends
out must be of high energy, because the emitted wavelength must be smaller than the
universe itself. Thus, inevitably, we reach Planck energies. However, we have seen that
in such high-energy situations, the emitted radiation, as well as the object itself, is indis-
tinguishable from the space-time background. In other words, the red-shifted signal we
would observe today would have a wavelength as large as the size of the universe, with a
correspondingly small frequency.
There is another way to describe the situation. At Planck energies or near the horizon,
the original signal has an error of the same size as the signal itself. When measured at the
present time, the red-shifted signal still has an error of the same size as the signal. As a
result, the error in the horizon distance becomes as large as the value to be measured.
In short, even if space-time expansion and large
scales are used, the instant of the so-called begin- big bang
space
ning of the universe cannot be determined with an
error smaller than the age of the universe itself: a
result we also found at Planck distances. If we aim
for perfect precision, we just find that the universe 4ct0/9
is 13.7 13.7 thousand million years old! In other light cone:
words, in both extremal situations, it is impossible to what we
say whether the universe has a non-vanishing age. can see
time
We have to conclude that the anthropocentric
concept of ‘age’ does not make any sense for the other galaxies
universe as a whole. The usual textbook value is in the night sky
Maximum length
General relativity shows that in the standard cosmological model, for hyperbolic (open)
and parabolic (marginal) evolutions of the universe, the size of the universe is infinite. It is
only the horizon distance, i.e. the distance of objects with infinite red-shift, that is finite.*
For elliptical evolution, the total size is finite and depends on the curvature. However, in
Page 468 this case also the present measurement limit yields a minimum size for the universe many
times larger than the horizon distance.
Quantum field theory, on the other hand, is based on flat and infinite space-time. Let
us see what happens when the two theories are combined. What can we say about meas-
urements of length in this case? For example, would it be possible to construct and use a
metre rule to measure lengths larger than the distance to the horizon?
Copyright © Christoph Schiller November 1997–January 2007
It is true that we would have no time to push it out that far, since in the standard
Ref. 1106 Einstein–de Sitter big bang model the horizon moves away from us faster than the speed
of light. (We should have started using the metre rule right at the big bang.) But just
for fun, let us assume that we have actually managed to do this. How far away can we
read off distances? In fact, since the universe was smaller in the past, and since every
observation of the sky is an observation of the past, Figure 412 shows that the maximum
* In cosmology, we need to distinguish between the scale factor R, the Hubble radius c~H = cR~Ṙ, the
horizon distance h and the size d of the universe. The Hubble radius is a computed quantity giving the
distance at which objects move away with the speed of light. It is always smaller than the horizon distance,
at which in the standard Einstein–de Sitter model, for example, objects move away with twice the speed of
Ref. 1106 light. However, the horizon itself moves away with three times the speed of light.
nature at l arge scales – is the universe something or nothing? 1061
Ref. 1106 spatial distance away from us at which an object can be seen is only (4~9)ct 0 . Obviously,
for space-time intervals, the maximum remains ct 0 .
Thus, in all cases it turns out to be impossible to measure lengths larger than the hori-
zon distance, even though general relativity predicts such distances. This result is unsur-
prising, and in obvious agreement with the existence of a limit for measurements of time
intervals. The real surprises come next.
where the numerical constant C relates the commonly accepted horizon distance to the
commonly accepted age. Indeed, observation shows that the universe is large, and is get-
ting larger. But let us investigate what happens if we add some quantum theory to this
result from general relativity. Is it really possible to measure the distance to the horizon?
We look first at the situation at high energies. We saw above that space-time and matter
are not distinguishable at Planck scales. Therefore, at Planck energies we cannot state
whether or not objects are localized. At Planck scales, the distinction between matter and
vacuum – so basic to our thinking – disappears.
Another way to say this is that we cannot claim that space-time is extended at Planck
scales. Our concept of extension derives from the possibility of measuring distances and
time intervals, and from observations such as the ability to align several objects behind
one another. Such observations are not possible at Planck scales. In fact, all of the everyday
observations from which we deduce that space is extended are impossible at Planck scales,
where the basic distinction between vacuum and matter, namely between extension and
localization, disappears. As a consequence, at Planck energies the size of the universe
cannot be measured. It cannot even be called larger than a matchbox.
At cosmological distances, the situation is even simpler. All the arguments given above
for the errors in measurement of the age can be repeated for the distance to the horizon.
Copyright © Christoph Schiller November 1997–January 2007
Challenge 1490 ny
Essentially, at the largest distances and at Planck energies, the measurement errors are
of the same magnitude as the measured values. This is because length measurements
become impossible at nature’s limits. This is corroborated by the lack of any standard
with which to compare the size of the universe.
Studying the big bang also produces strange results. At Planck energies, whenever we
try to determine the size of the big bang, we cannot claim that the universe was smaller
than it is today: there is no way to distinguish length values. Somehow, Planck scales and
the size of the universe get confused.
There are other confirmations too. Let us go back to the example above. If we had a
metre rule spanning the whole universe, even beyond the horizon, with zero at the place
where we live, what measurement error would it produce for the horizon? It does not take
1062 xi general rel ativity versus quantum mechanics
Challenge 1491 ny long to work out that the expansion of space-time from Planck scales to the present size
implies an expansion in the indeterminacy in the Planck size to one of the order of the
distance to the horizon. The error is as large as the measurement result.
Since this also applies if we try to measure the diameter of the universe instead of its
radius, it is impossible to say whether the antipodes in the sky really are distant from each
other!
We can summarize the situation by noting that anything said about the size of the
universe is as limited as anything said about its age. The height of the sky depends on the
observation energy. At Planck energies, it cannot be distinguished from the Planck length.
If we start measuring the sky at standard observation energies, and try to increase the
precision of measurement of the distance to the horizon, the measurement error increases
beyond all bounds. At Planck energies, the volume of the universe is indistinguishable
from the Planck volume!
Worse still, quantum theory shows that space-time is not continuous at a horizon: this
Ref. 1105 can easily be deduced using the Planck-scale arguments from the previous section. Time
and space are not defined there.
Finally, there is no way to decide whether the various boundary points are different
from each other. The distance between two points in the night sky is undefined. Therefore
it is unclear what the diameter of the horizon is.
* The measurement errors also imply that we cannot say anything about translational symmetry at cosmo-
Challenge 1492 ny logical scales. Can you confirm this? In addition, at the horizon it is impossible to distinguish between
spacelike and timelike distances. Even worse, concepts such as ‘mass’ or ‘momentum’ become muddled at
the horizon. This means that, as at Planck energies, we are unable to distinguish between object and back-
ground, and between state and intrinsic properties. We will come back to this important point shortly.
nature at l arge scales – is the universe something or nothing? 1063
In summary, the horizon has no specific distance or shape. The horizon, and thus the
universe, cannot be shown to be manifolds. This leads us to the next question.
Does the universe contain particles and stars? Copyright © Christoph Schiller November 1997–January 2007
The total number of stars, about 10231 , is included in every book on cosmology, as it is
Page 1183 in the table of Appendix B. A smaller number can be counted on clear nights.
We can ask the same question about particles instead of stars. The commonly quoted
numbers are is 10811 baryons and 10901 photons. However, the issue is not settled.
Neither quantum theory nor general relativity alone makes predictions about the number
of particles, either inside or outside the horizon. What happens if we combine them?
In order to define the number of particles in a region, quantum theory first of all re-
Page 794 quires a vacuum state to be defined. The number of particles is defined by comparing the
system with the vacuum. If we neglect or omit general relativity by assuming flat space-
time, this procedure poses no problem. However, if we include general relativity, and
thus a curved space-time, especially one with a strangely behaved horizon, the answer is
1064 xi general rel ativity versus quantum mechanics
simple: there is no vacuum state with which we can compare the universe, for two reasons.
First, nobody can explain what an empty universe would look like. Secondly, and more
importantly, there is no way to define a state of the universe. The number of particles in
the universe thus becomes undefinable. Only at everyday energies and for finite dimen-
sions are we able to speak of an approximate number of particles.
Comparison between a system and the vacuum is also impossible, in the case of the
universe, for purely practical reasons. The particle counter would have to be outside the
Challenge 1494 ny system. (Can you confirm this?) In addition, it is impossible to remove particles from the
universe.
The impossibility of defining a vacuum state, and thus the number of particles in the
universe, is not surprising. It is an interesting exercise to investigate the measurement
errors that appear when we try to determine the number of particles despite this funda-
Challenge 1495 ny mental impossibility.
Can we count the stars? In principle, the same conclusion applies as for particles. How-
m min (R)
ρ min = 4 .
ħ
3
(710)
R cR
For the universe, if the standard horizon distance R 0 of 14 000 million light years is in-
serted, the value becomes about 10−142 kg~m3 . This describes the density of the vacuum.
In other words, the universe, with a density of about 10−26 kg~m3 , seems to be clearly
different from vacuum. But are we sure?
We have just deduced that the radius of the horizon is undefined: depending on the
observation energy, it can be as small as the Planck length. This implies that the density
nature at l arge scales – is the universe something or nothing? 1065
of the universe lies somewhere between the lowest possible value, given by the density
of vacuum just mentioned, and the highest possible one, namely the Planck density.* In
short, the relation (710) does not really provide a clear statement.
Another way to measure the mass of the universe would be to apply the original defini-
Page 318 tion of mass, as given by Mach and modified by special relativity. Thus, let us try to collide
a standard kilogram with the universe. It is not hard to see that whatever we do, using
either low or high energies for the standard kilogram, the mass of the universe cannot be
Challenge 1497 ny constrained by this method. We would need to produce or to measure a velocity change
∆v for the rest of the universe after the collision. To hit all the mass in the universe at the
same time, we need high energy; but then we are hindered by Planck energy effects. In
addition, a properly performed collision measurement would require a mass outside the
universe, which is rather difficult to achieve.
Yet another way to measure the mass would be to determine the gravitational mass
of the universe through straightforward weighing. But the lack of balances outside the
3 4πR 2 − S 15 4πR 3 ~3 − V
κ= = =
1
. (712)
2
r curvature 4π R4 4π R5
We have to insert the horizon radius R 0 and either its surface area S 0 or its volume V0 .
However, given the error margins on the radius and the volume, especially at Planck
Challenge 1498 ny energies, we again find no reliable result for the radius of curvature.
Ref. 1111 An equivalent method starts with the usual expression provided by Rosenfeld for the
indeterminacy ∆κ in the scalar curvature for a region of size R, namely
2
16πl Pl
∆κ A . (713)
R4 Copyright © Christoph Schiller November 1997–January 2007
However, this expression also shows that the error in the radius of curvature behaves like
the error in the distance to the horizon.
In summary, at Planck energies, the average radius of curvature of nature lies between
infinity and the Planck length. This implies that the density of matter lies between the
minimum value and the Planck value. There is thus no method to determine the mass
* In fact, at everyday energies the density of the universe lies midway between the two values, yielding the
Challenge 1496 ny strange relation
m 0 ~R 0 m Pl ~R Pl = c ~G .
2 2 2 2 4 2
(711)
Page 891 But this is nothing new. The approximate equality can be deduced from equation 16.4.3 (p. 620) of Steven
Weinberg, Gravitation and Cosmology, Wiley, 1972, namely Gn b m p = 1~t 02 . The relation is required by
several cosmological models.
1066 xi general rel ativity versus quantum mechanics
Challenge 1499 ny of the universe at Planck energies. (Can you find one?) The concept of mass cannot be
applied to the universe as a whole. Thus, the universe has no mass.
— ‘Nothing’ could mean ‘no matter’. But we have just seen that this distinction cannot be
made at Planck scales. So either the boundary will not exist at all or it will encompass
the horizon as well as the whole universe.
— ‘Nothing’ could mean ‘no space-time’. We then have to look for those domains where
space and time cease to exist. These occur at Planck scales and at the horizon. Again,
either the boundary will not exist or it will encompass the whole universe.
— ‘Nothing’ could mean ‘neither space-time nor matter.’ The only possibility is a bound-
ary that encloses domains beyond the Planck scale and beyond the horizon; but again,
such a boundary would also encompass all of nature.
This is puzzling. When combining quantum theory and relativity, we do not seem to
nature at l arge scales – is the universe something or nothing? 1067
be able to find a conceptual definition of the horizon that distinguishes it from what it
Challenge 1501 ny includes. (If you find one, publish it!) A distinction is possible in general relativity alone,
and in quantum theory alone; but as soon as we combine the two, the boundary becomes
indistinguishable from its content. The interior of the universe cannot be distinguished
from its horizon. There is no boundary.
This difficulty in distinguishing the horizon from its contents is interesting: it suggests
that nature may be symmetric under transformations that exchange interiors and bound-
aries. This connection is called holography, because it vaguely recalls the working of credit-
Ref. 1112 card holograms. It is a busy research field in high-energy physics. However, for the time
being, we shall continue with our original theme, which leads us to our next question.
ized to describe the universe. The universe must be described by a mathematical concept
that does not contain any set. The new concept must be more general than that of a set.
This is a powerful result: it means that no precise description of the universe can use
any concept that presupposes the use of sets. But all the concepts we have used so far to
describe nature, such as space-time, metric, phase space, Hilbert space and its generaliz-
ations (Fock space and particle space), are based on elements and sets. They must all be
abandoned at Planck energies, and in any precise description.
Furthermore, many speculations about unified descriptions do depend on sets. In
particular, all studies of quantum fluctuations, mathematical categories, posets, complex
mathematical spaces, computer programs, Turing machines, Gödel’s theorem, creation of
Ref. 1113 any sort, space-time lattices, quantum lattices and Bohm’s unbroken wholeness presup-
1068 xi general rel ativity versus quantum mechanics
pose sets. In addition, almost none of the speculations about the origin of the universe
can be correct. For example, you may wish to check the religious explanations you know
Challenge 1502 ny against this result. In fact, no approach used by theoretical physicists in the year 2000
satisfies the requirement that elements and sets must be abandoned. Perhaps a future ver-
sion of string or M theory will do so. The task is not easy: do you know of a single concept
Challenge 1503 n not based on elements or sets?
Note that this radical conclusion is deduced from only two statements: the necessity
of using quantum theory whenever the dimensions are of the order of the Compton
wavelength, of using general relativity whenever the dimensions are of the order of the
Schwarzschild radius. Together, they mean that no precise description of nature can con-
tain elements and sets. We have reached this result after a long and interesting, but in a
sense unnecessary, digression. The difficulties in complying with this result may explain
why the unification of the two theories has not so far been successful. Not only does unific-
ation require that we stop using space, time and mass for the description of nature; it also
* Some people knew this long before physicists. For example, the belief that the universe is or contains
information was ridiculed most thoroughly in the popular science-fiction parody by Douglas Adams,
The Hitchhiker’s Guide to the Galaxy, 1979, and its sequels.
** ‘In so far as mathematical statements describe reality, they are not certain, and as far as they are certain,
they are not a description of reality.’
nature at l arge scales – is the universe something or nothing? 1069
ideas: that the number of particles in the universe is a well-defined quantity, and that
the universe is a computer, i.e. a physical system. We now know that neither assumption
is correct. This shows the power of the criteria that we have deduced for any precise or
complete description of motion.
**
People take pictures of the cosmic background radiation and its variations. Is it possible
that these photographs will show that the spots in one direction of the sky are exactly the
Challenge 1505 ny same as those in the diametrically opposite direction?
**
In 1714, Leibniz published his Monadologie. In it he explores what he calls a simple sub-
stance, which he defined to be a substance that has no parts. He called it a monade and
Ref. 1115 describes some of its properties. However, mainly thanks to his incorrect deductions, the
**
We usually speak of the universe, implying that there is only one of them. Yet there is a
simple case to be made that ‘universe’ is an observer-dependent concept, since the idea
Challenge 1507 ny of ‘all’ is observer-dependent. Does this mean that there are many universes?
**
If all particles were removed (assuming one knew where to put them), there wouldn’t be
Challenge 1508 ny much of a universe left. True?
**
At Planck energies, interactions cannot be defined. Therefore, ‘existence’ cannot be
Challenge 1509 ny defined. In short, at Planck energies we cannot say whether particles exist. True?
modelled on the basis of mathematics. Physicists have always had to live with logical
problems. Mathematics helps physics, but mathematics is not the foundation of physics.
The situation is similar to a child’s description of the sky as ‘made of air and clouds’.
Looking closely, we discover that clouds are made up of water droplets. However, there
is air inside clouds, and there is also water vapour elsewhere in the air. When clouds and
air are viewed through a microscope, there is no clear boundary between the two. We
cannot define either of the terms ‘cloud’ and ‘air’ without the other.
Objects and vacuum are also indistinguishable in this sense. Virtual particles are found
in vacuum, and vacuum is found inside objects. At Planck scales there is no clear bound-
ary between the two; we cannot define either without the other. Despite the lack of a pre-
cise definition, and the logical problems that ensue, in both cases the description works
well at large, everyday scales.
The result does not mean that physics has to abandon mathematics. This is not neces-
sary. The way out will become clear as we continue in our ascent of motion mountain.
“ [...]
Daß ich erkenne, was die Welt
Im Innersten zusammenhält.*
Goethe, Faust
Is the universe really the sum of matter–energy and space-time? Or of particles and va-
cuum? We have heard these statements so often that we may forget to check them. We
do not need magic, as Faust thought: we only need to list what we have found so far, espe-
”
Page 647 cially in this section, in the section on Planck scales, and in the intermezzo on brain and
language. Table 75 shows the result.
Not only are we unable to state that the universe is made of space-time and matter; we
are unable to say anything about the universe at all!** It is not even possible to say that
it exists, since it is impossible to interact with it. The term ‘universe’ does not allow us to
Challenge 1511 r make a single sensible statement. (Can you find one?) We are only able to list properties
Copyright © Christoph Schiller November 1997–January 2007
it does not have. We are unable to find any property that the universe does have. Thus,
the universe has no properties! We cannot even say whether the universe is something
or nothing. The universe isn’t anything in particular. In other words, the term ‘universe’ is
not at all useful for the description of motion.
Page 660 We can obtain a confirmation of this strange conclusion from the first intermezzo.
There we found that any concept needs defined content, defined limits and a defined
domain of application. In this section, we have found that the term ‘universe’ has none of
these; there is thus no such concept. If somebody asks why the universe exists, the answer
* ‘Thus I have devoted myself to magic, [...] that I understand how the innermost world is held together.’
** There is another well-known, non-physical concept about which nothing can be said. Many scholars have
Challenge 1510 n explored it in detail. What is it?
nature at l arge scales – is the universe something or nothing? 1071
is: not only does the use of the word ‘why’ wrongly suggest that something may exist
outside the universe, providing a reason for it and thus contradicting the definition of the
term ‘universe’ itself; but more importantly, the universe does not exist. In summary, any
sentence containing the word ‘universe’ is meaningless. The word only seems to express
something.*
This conclusion may be interesting, even strangely beautiful, but does it help us to
understand motion more precisely? Yes, it does.
Nature cannot be made of separate, distinct and discrete entities. This conclusion will
help us on the way to a precise description of nature. We will see how later on.
By taking into account the limits on length, time, mass and all the other quantities we
have encountered, we have reached a number of almost painful conclusions about nature.
However, we have also received something in exchange: all the contradictions between
general relativity and quantum theory that we mentioned at the beginning of this chapter
Challenge 1512 e are now resolved. Although we have had to leave many cherished habits behind us, in
* Of course, the term ‘universe’ still makes sense if it is defined more restrictively: for example, as everything
interacting with a particular human or animal observer in everyday life. But such a definition is not useful
for our quest, as it lacks the precision required for a description of motion.
1072 xi general rel ativity versus quantum mechanics
P h ys i c a l p r o p e rt y o f n at u r e At h ori - At At P l a n c k
zon scale every - scale
day
scale
requires quantum theory and relativity true false true
intervals can be measured precisely false true false
length and time intervals limited unlimited limited
space-time is not continuous true false true
points and events cannot be distinguished true false true
space-time is not a manifold true false true
space is 3-dimensional false true false
space and time are indistinguishable true false true
Extremal scales and open questions in physics Copyright © Christoph Schiller November 1997–January 2007
Page 981 In the chapter Quantum physics in a nutshell we listed all the unexplained properties of
nature left open either by general relativity or by quantum theory. The present conclusions
provide us with new connections among them. Indeed, many of the cosmological results
of this section sound surprisingly familiar. Let us compare them systematically with those
of the section on Planck scales. Both sections explored topics from the list of unexplained
properties of nature, some in more detail than others.
First, Table 76 shows that none of the unexplained properties makes sense at both limits
of nature, the small and the large. All open questions are open at both extremes. Secondly,
and more importantly, nature behaves in the same way at horizon scales and at Planck
Challenge 1513 r scales. In fact, we have not found any difference between the two cases. (can you discover
one?) We are thus led to the hypothesis that nature does not distinguish between the large
and the small. Nature seems to be characterized by extremal identity.
nature at l arge scales – is the universe something or nothing? 1073
2 2
l Pl l Pl xµ
r or xµ (714)
r xµ x µ
relating distances r or coordinates x µ with their inverse values using the Planck length
l Pl . Can this mapping be a symmetry of nature? At every point of space? For example,
if the horizon distance is inserted, the relation (714) implies that lengths smaller than
Challenge 1515 ny l Pl ~1061 10−96 m never appear in physics. Is this the case? What would inversion imply
for the big bang?
Numerous fascinating questions are contained in the simple hypothesis of extremal
identity. They lead to two main directions for investigation.
First, we have to search for some stronger arguments for the validity of extremal iden-
tity. We will discover a number of simple arguments, all showing that extremal identity
is indeed a property of nature, and producing many beautiful insights.
The other quest then follows. We need to find the correct version of the relation (714).
Copyright © Christoph Schiller November 1997–January 2007
That oversimplified expression is neither sufficient nor correct. It is not sufficient because
it does not explain any of the issues left open by general relativity and quantum theory. It
only relates some of them, thus reducing their number, but it does not solve any of them.
Challenge 1516 ny (You may wish to check this for yourself.) In other words, we need to find the precise
description of quantum geometry and of elementary particles.
However, inversion is also simply wrong. Inversion is not the correct description of
extremal identity because it does not realize a central result discovered above: it does not
connect states and intrinsic properties, but keeps them distinct. This means that inversion
does not take interactions into account. And most open issues at this point of our moun-
tain ascent concern properties of interactions.
1074 xi general rel ativity versus quantum mechanics
Biblio graphy
1113 D. B ohm & B.J. Hiley, On the intuitive understanding of nonlocality as implied by
quantum theory, Foundations of Physics 5, pp. 93–109, 1975. Cited on page 1067.
1114 S. Lloyd, Computational capacity of the universe, Physical Review Letters 88, p. 237901,
2002. Cited on page 1068.
1115 Gottfried Wilhelm Leibniz, La Monadologie, 1714. Written in French, it is avail-
able freely at http://www.uqac.uquebec.ca/zone30/Classiques_des_sciences_sociales and
in various other languages on other websites. Cited on page 1069.
1116 See, for example, H. Wussing & P.S. Alexandrov, editors, Die Hilbertschen Probleme,
Akademische Verlagsgesellschaft Geest & Portig, 1983, or Ben H. Yandell, The Honours
Class: Hilbert’s Problems and their Solvers, A.K. Peters, 2002. Cited on page 1069.
1117 A large part of the study of dualities in string and M theory can be seen as investigations
into the detailed consequences of extremal identity. For a review of dualities, see ... A
bibliography 1075
classical version of duality is discussed by M.C.B. Abdall a, A.L. Gadelka & I.V.
Vancea, Duality between coordinates and the Dirac field, http://arxiv.org/abs/hep-th/
0002217. Cited on page 1073.
“ Graffito
”
M
aybe you have once met a physicist who has told you, in one of those
oments of confidentiality, that studying physics is even more beautiful than
aking love. At this statement, many will simply shake their head in disbelief
and probably disapproval. In this section we will argue that it is possible to learn so much
about physics while making love that discussions about their relative beauty can be put
aside altogether.
Imagine being with your partner on a beautiful tropical island, just after sunset, and
looking together at the evening sky. Imagine, as well, that you know little of what is taught
Let us see what these observations imply for the description of nature.
**
Copyright © Christoph Schiller November 1997–January 2007
* Here we deduce physics from love. We could also deduce physics from sexuality. The modern habit of
Ref. 1118 saying ‘sex’ instead of ‘sexuality’ confuses two completely different concepts. In fact, studying the influences
Ref. 1119 of sex on physics is almost a complete waste of time. Maybe one day we will understand why there do not
seem to be any female crackpots proposing pet physical theories.
the physics of love – a summary of the first two-and-a-half parts 1077
**
Love is an interaction between moving bodies. Together with the previous result, this im-
plies that we can and need to describe moving bodies with mass, energy and momentum.
Challenge 1517 n That is not a small feat. For example, it implies that the Sun will rise tomorrow.
**
Love is attractive. When feeling attracted to your partner, you may wonder if this attrac-
tion is the same as that which keeps the Moon going around the Earth. You make a quick
calculation and find that applying the expression for universal gravity
E pot = −
GMm
(715)
r
to both of you, the energy involved is about as much as the energy added by the leg of a
**
Love makes noise. That is not news. However, even after making love, even when every-
body and everything is quiet, in a completely silent environment, we do hear something.
The noises we hear are produced within the ear, partly by the blood flowing through the
head, partly by the electrical noise generated in the nerves. That is strange. If matter were
continuous, there would be no noise even for low signal levels. In fact, all proofs of the
discreteness of matter, of electric current, of energy, or of light are based on the increase
of fluctuations with the smallness of the system under consideration. The persistence of
noise thus makes us suspect that matter is made of smallest entities. Making love confirms
this suspicion in several ways.
**
Love is for reproduction. We owe our life to love, as we are all results of reproduction. But Copyright © Christoph Schiller November 1997–January 2007
**
Ref. 1120 Love needs memory. If you could not recognize your partner among all possible ones,
your love life would be quite complicated. A memory is a device which, in order to store
information, must have small internal fluctuations. Fluctuations in systems get smaller as
the number of components in them increases. Since our memory works so well, we must
be made of a large number of small particles.
In summary, love shows that we are made of some kind of Lego bricks: depending
on the level of magnification, these bricks are called molecules, atoms, or elementary
particles. It is possible to estimate their size using the sea around the tropical island and
Challenge 1518 n a bit of oil. Can you imagine how?
**
where m is the mass of the object. For example, this applies to electrons inside a television
tube.
**
Love is based on touch. When we touch our partner, we sometimes get small shocks.
The energies involved are larger than than those of touching fly legs. In short, people are
electric.
In the dark, we observe that discharges emit light. Light is thus related to electricity. In
addition, touching proves that light is a wave: simply observe the dark lines between two
Challenge 1519 n fingers near your eye in front of a bright background. The lines are due to interference
effects. Thus light does not move with infinite speed. In fact, it moves with the same speed
as mobile phone calls.
the physics of love – a summary of the first two-and-a-half parts 1079
**
Love is fun. People like to make love in different ways – for example, in a dark room. But
rooms get dark when the light is switched off only because we live in a space with an odd
number of dimensions. In even-dimensional space, a lamp would not turn off directly
Page 212 after the switch is flicked, but only dim slowly.
Love is also fun because with our legs, arms and bodies we can make knots. Knots
are possible only in three dimensions. In short, love is fun only because we live in three
dimensions.
**
Love is tiring. The reason is gravity. But what is gravity? A little thinking shows that since
there is a maximum speed, gravity is the curvature of space-time. Curved space also
means that a horizon – a largest possible visible distance – can appear. From equations
(715) and (716), we deduce that this happens when distances are of the order of
For example, it is only because of a horizon – albeit one appearing in a different way –
that the night sky is dark.
**
Love takes time. It is known that men and women have different opinions on durations. It
is also known that love happens between your ears. Indeed, biological research has shown
Ref. 1125 that we have a clock inside the brain, which depends on circulating electrical currents.
This clock provides our normal sense of time. Since this clock exists, there must be a
time standard in nature. Such a standard must be either a minimum or a maximum time
interval. We shall discover it later on.
**
Love is repulsive. And in love, size matters. These facts turn out to be two sides of the
same coin. Love is based on touch, and touch needs repulsion. Repulsion needs a length
scale, but neither gravity nor classical electrodynamics provides one. Classical physics
only allows for the measurement of speed. It cannot explain how the measurement of
length, time, or mass is possible.* Classically, matter cannot be hard: it should be possible Copyright © Christoph Schiller November 1997–January 2007
to compress it. But love shows us that this is not the case. Love shows us that lengths scales
do exist, and thus that classical physics is not sufficient for the description of nature.
**
Love can hurt. For example, it can lead to injuries. Atoms can get ripped apart. That hap-
pens when energies are concentrated on small volumes: a few attojoule per atom. Investig-
ating such situations more precisely, we finds that strange phenomena appear at distances
r if energies exceed the value
E
ħc
. (718)
r
* Note that the classical electron radius is not an exception: it depends on the elementary charge e, which
depends on a length scale, as shown on page 500.
1080 xi general rel ativity versus quantum mechanics
In particular, energy becomes chunky, things become fuzzy, boxes are not tight, and
particles get confused. These are called quantum phenomena. The new constant ħ =
10−34 Js is important: it determines the size of things, because it allows us to define dis-
tance and time units. In other words, objects tear and break because in nature there is a
minimum action, given roughly by ħ.
If even more energy is concentrated in small volumes – of the order of mc 2 per particle
– one even observes transformation of energy into matter, or pair production. From equa-
tions (716) and (718), we deduce that this happens at distances of
r pair production
ħ
. (719)
mc
At such small distances we cannot avoid using the quantum description of nature.
**
The second version is more honest, but which of the two is less sensible? Both statements
show that there are unexplained facts in the Greek description nature, in particular the Copyright © Christoph Schiller November 1997–January 2007
number of particles.
**
Love is animal. We have seen that we can learn a lot about nature from the existence of
love. We could be tempted to see this approach to understanding nature as a case of the
so-called anthropic principle. However, some care is required here. We could have learned
exactly the same if we had taken as our starting point the observation that apes or pigs
have love. There is no ‘law’ of nature which distinguishes between them and humans. In
fact, there is a simple way to determine whether any ‘anthropic’ statement makes sense:
the reasoning must be equally true for humans, apes, and pigs.
One famous anthropic deduction was stated by the British astrophysicist Fred Hoyle.
While studying stars, he predicted a resonance in the carbon-12 nucleus. If it did not exist,
the physics of love – a summary of the first two-and-a-half parts 1081
he argued, stars could not have produced the carbon which afterwards was spread out by
explosions into interstellar space and collected on Earth. Since apes or pigs might also
reason in this way, Hoyle’s statement does make sense.
On the other hand, claiming that the universe is made especially for people is not sens-
ible: pigs would say that it is made for pigs. The existence of either humans or pigs re-
quires all ‘laws’ of nature. In summary, the anthropic principle is valid only in so far as
its consequences are indistinguishable from those of the porcine or the simian principle.
In short, the animal side of love imposes limits on the philosophy of physics.
**
Love is holy. According to the famous definition by the theologian Rudolf Otto, holiness
Ref. 1123 results from a mixture of a mysterium tremendum and a mysterium fascinans. Tremendum
means that it makes one tremble. Indeed, love produces heat and is a dissipative process.
All systems in nature that produce heat have a finite lifetime. That is true for machines,
TA B L E 77 Everything quantum field theory and general relativity do not explain: in other words, a list of
all the experimental data and criteria available for tests of the unified description of motion
O b s e r va b l e P r o p e r t y u n e x p l a i n e d s o f a r
O b s e r va b l e P r o p e r t y u n e x p l a i n e d s o f a r
This ‘why’ list is fascinating because it is so short, and many researchers are still trying
to reduce it further. But it is equally interesting to study what the consequences would be
if any of the values from Table 77 were just a tiny bit different. Small changes in nature
would lead to completely different observations, as shown in Table 78.
the physics of love – a summary of the first two-and-a-half parts 1083
O b s e r va b l e C h a n g e R e su lt
Moon size smaller small Earth magnetic field; too much cosmic radiation;
widespread child cancers
Moon size larger large Earth magnetic field; too little cosmic radiation;
no evolution into humans
Jupiter smaller many comet impacts on Earth; extinction of animal life
Jupiter larger few comet impacts on Earth; no Moon; no dinosaur
extinction
Oort belt smaller no comets, no irregular asteroids, no Moon; still
dinosaurs
star distance smaller irregular planetary motion; supernova dangers
strong coupling smaller proton decay; leukaemia
The large number of coincidences of life forces our mind to realize that we are only a
tiny part of nature. We are a small droplet shaken around in the ocean of nature. Even the
tiniest changes in nature would prevent the existence of humans, apes and pigs. In other
words, making love tells us that the universe is much larger than we are, and how much
we are dependent on and connected to the rest of the universe.
**
We said above that love uses motion. This fact contains a remarkable mystery, worth a
second look.
Motion is the change of position with time of some bodies. Position is what we meas-
ure with a ruler. Time is what we measure with a clock. Both rulers and clocks are bodies.
A body is an entity distinct from its environment by its shape or its mass. Shape is
the extension of a body in space (and time). Mass is measured by measuring speed or
acceleration, i.e. by measuring space and time.
This means that we define space-time in terms of bodies – as in general relativity – and
we define bodies in terms of space-time – as in quantum theory. This circular reasoning
shows that making love is truly a mystery. It has not yet been eliminated yet: modern Copyright © Christoph Schiller November 1997–January 2007
theoretical physicists are busy with the attempt. In the year 2000, the most promising
approach seems to be M-theory, an extension of string theory. But any such attempt has
to overcome major difficulties – which can also be experienced while making love.
**
Love is private. But is it? The concept of privacy assumes that a person can separate from
the rest, without important interactions, at least for a given time, and come back later.
This is possible only if there is enough empty space between the person and others. In
other words, privacy is based on the idea that objects can be distinguished from vacuum.
Let us check whether this is always possible.
Ref. 1124 What is the smallest measurable distance? This question was almost answered by Max
Planck in 1899. The distance δl between two objects of mass m is surely larger than their
1084 xi general rel ativity versus quantum mechanics
R R
δl E ∆l E
ħ
. (720)
mc
In addition, measurement requires that signals leave the objects, so the two masses must
not be black holes. Their masses must be small enough to ensure that the Schwarzschild
radius is smaller than the distance to be measured. This means that r S Gm~c 2 < δl, or
¾
δl E = l Pl = 1.6 ë 10−35 m .
ħG
(721)
c3
This expression defines a minimum length in nature, the so-called Planck length. Every
other Gedanken experiment leads to this characteristic length as well. In fact, this min-
imum distance (and the corresponding minimum time interval) provides the measure- Copyright © Christoph Schiller November 1997–January 2007
ment standard we were looking for at the beginning of our musings about length and
time measurements.
A more detailed discussion shows that the smallest measurable distance is somewhat
larger, because measurements require a distinction to be made between matter and radi-
ation. This happens at scales of about 800 times the Planck length.
In other words, privacy has its limits. In fact, the issue becomes even more muddled
when we explore the consequences for bodies. A body (even a human one) is something
we can touch, throw, hit, carry or weigh. Physicists say that a body is something with
energy or momentum. Vacuum has none of these properties. In addition, vacuum is un-
bounded, whereas objects are bounded.
What happens if we try to weigh objects at Planck scales? Quantum theory makes a
simple prediction. If we put an object of mass M in a box of size R onto a scale – as in
the physics of love – a summary of the first two-and-a-half parts 1085
Figure 413 – equation (718) implies that there is a minimal mass error ∆M given by
∆M
ħ
. (722)
cR
If the box has Planck size, the mass error is the Planck mass
»
∆M = M Pl = ħc~G 22 µg . (723)
How large a mass can we put into a box of Planck size? Obviously one with the maximum
possible mass density. To determine this, imagine a planet with a satellite in orbit around
it, just skimming its surface. The density ρ of the planet with radius r is given by
v2
ρ =
M
. (724)
c5
ρ Pl = = 5.2 ë 1096 kg~m3 . (725)
G2ħ
Therefore the maximum mass that can be contained inside a Planck box is the Planck
mass. But that was also the measurement error for that situation. This implies that we
cannot say whether the original box we measured was empty or full: vacuum cannot be
distinguished from matter at Planck scales. This astonishing result is confirmed by every
Ref. 1124 other Gedanken experiment exploring this issue.
It is straightforward to deduce with similar arguments that objects are not bound in
size at Planck scales, i.e. that they are not localized, and that the vacuum is not necessarily
Challenge 1520 n extended at those scales. In addition, the concept of particle number cannot be defined
at Planck scales.
So, why is there something instead of nothing? Making love shows that there is no
difference between the two options!
**
Copyright © Christoph Schiller November 1997–January 2007
Love makes us dream. When we dream, especially at night, we often look at the sky. How
far away is it? How many atoms does it enclose? How old is it? These questions have an
answer for small distances and for large distances; but for the whole of the sky, or the
whole of nature, they cannot have an answer, as there is no way to be outside the sky in
order to measure it. In fact, the impossibility of measuring nature at the smallest scales
is found again at the largest scales. There seems to be a fundamental equivalence, or a
Ref. 1126 duality, between the very large and the very small.
Hopefully, the coming years will show how we can translate these results into an even
more precise description of motion and of nature. In particular, this description should
allow us to reduce the number of unexplained properties of nature.
Motion Mountain – The Adventure of Physics available free of charge at www.motionmountain.net Copyright © Christoph Schiller November 1997–January 2007
xi general rel ativity versus quantum mechanics
In summary, making love is a good physics lesson. Enjoy the rest of your day.
**
1086
bibliography 1087
Biblio graphy
1118 If you are more interested in sexuality than in love, the best book to read is the fascinating
text by Olivia Judson, Dr. Tatjana’s Sex Advice to all Creation, Metropolitan Books,
2002. Cited on page 1076.
1119 A provocative attempt to explain the lack of women in physics is made in Margaret
Wertheim, Pythagoras’ Trousers – God, Physics and the Gender Wars, Fourth Estate, 1997.
Cited on page 1076.
1120 The consequences of memory loss in this case are already told by Voltaire, Aventure de
la mémoire, 1775. Cited on page 1078.
1121 A picture of objects in a red-hot oven and at room temperature is shown in C.H. Ben-
nett, Demons, Engines and the Second Law, Scientific American 255, pp. 108–117, Novem-
ber 1987. Cited on page 1078.
1122 This famous statement is found at the beginning of chapter XI, ‘The Physical Universe’, in
vDc. (726)
* These limits were given for the first time in 2003, in this section of the present text.
** Stimulating discussions with Saverio Pascazio, Corrado Massa and Steve Carlip helped shape this section.
fundamental limits to all observables 1089
A few well-known facts set the framework for the discussion that follows. The speed v is
less than or equal to the speed of light c for all physical systems;* in particular, this limit
is valid both for composite systems and for elementary particles. The speed limit is also
valid for all observers. Indeed, only a maximum speed ensures that cause and effect can
be distinguished in nature, or that sequences of observations can be defined. The opposite
hypothesis, that speeds greater than c are possible, which implies the existence of (real)
tachyons, has been explored and tested in great detail; it leads to numerous conflicts with
observations.
The maximum speed forces us to use the concept of space-time to describe nature. The
existence of a maximum speed implies that space and time mix. It also implies observer-
Page 290 dependent time and space coordinates, length contraction, time dilation, mass–energy
equivalence, and all the other effects that characterize special relativity. Only this prin-
ciple leads to the principle of maximum ageing that governs special relativity; and only
this principle leads to the principle of least action at low speeds. In addition, only with
c2
lD . (727)
a
This size limit is induced by the speed of light c; it is also valid for the displacement d of
a system, if the acceleration measured by an external observer is used. Finally, the speed
limit implies a relativistic ‘indeterminacy relation’
∆l ∆a D c 2 (728)
for the length and acceleration indeterminacies. You may wish to take a minute to deduce
Challenge 1521 n this relation from the time–frequency indeterminacy. All this is standard knowledge. Copyright © Christoph Schiller November 1997–January 2007
SE
ħ
. (729)
2
* A physical system is a region of space-time containing mass–energy, the location of which can be followed
over time and which interacts incoherently with its environment. This definition excludes images, geomet-
rical points or incomplete entangled situations.
1090 xi general rel ativity versus quantum mechanics
Half the Planck constant ħ is the smallest observable action or angular momentum. This
statement is valid for both composite and elementary systems.
The action limit is used less frequently than the speed limit. It starts from the usual
definition of the action, S = ∫ (T − U)dt, and states that between two observations per-
formed at times t and t + ∆t, even if the evolution of a system is not known, the action is
at least ħ~2. Physical action measures the change in the state of a physical system. Thus
there is always a minimum change of state between two observations of a system. The
quantum of action expresses the fundamental fuzziness of nature at a microscopic scale.
It can easily be checked that no observation – whether of photons, electrons or mac-
roscopic systems – gives a smaller action than this. The minimum action has been veri-
fied for fermions, bosons, laser beams and matter systems, and for any combination of
these. The opposite hypothesis, implying the existence of arbitrary small change, has been
explored in detail: Einstein’s long discussion with Bohr, for example, can be seen as a
repeated attempt by Einstein to find experiments that would make it possible to meas-
dE
ħ
. (730)
2mc
In other words, (half) the (reduced) Compton wavelength of quantum theory appears as
the lower limit on the displacement of a system, whenever gravity plays no role. Since the
quantum displacement limit applies in particular to an elementary system, it also applies
to the size of a composite system. However, the limit is not valid for the sizes of elementary
particles.
The action limit of quantum theory also implies Heisenberg’s well-known indetermin-
Page 725 acy relation for the displacement d and momentum p of systems:
fundamental limits to all observables 1091
∆d ∆p E
ħ
. (731)
2
This is valid for both massless and massive systems. All this is textbook knowledge.
c4 c5
FD = 3.0 ë 1043 N or PD = 9.1 ë 1051 W . (732)
4G 4G
These limit statements contain both the speed of light c and the gravitational constant
* It may be that the present author was the first to point it out, in this very text. Gary Gibbons found the
limit independently.
1092 xi general rel ativity versus quantum mechanics
The force and power limits are achieved only at horizons. They are valid for all observ-
ers and for all interactions.
As an alternative to the maximum force and power limits, we can take as a basic prin-
ciple the statement: There is a maximum rate of mass change in nature. For all systems
and observers, one has
dm c 3
D = 1.0 ë 1035 kg~s . (733)
dt 4G
Equivalently, we can state: There is a maximum mass-to-length ratio in nature. For all
systems and observers, one has
dm c 2
D = 3.4 ë 1026 kg~m . (734)
dl 4G
Both the force and the power limits state that the flow of momentum or of energy through
dimensional surface, and that it has all the properties usually associated with it.
The connection between horizons and the maximum force or power allows us to de-
duce the field equations in a simple way. First, there is always a flow of energy at a horizon.
Horizons cannot be planes, since an infinitely extended plane would imply an infinite en-
ergy flow. To characterize the finite extension of a given horizon, we use its radius R and
its total area A.
The energy flow through a horizon is characterized by an energy E and a proper length
L of the energy pulse. When such an energy pulse flows perpendicularly through a hori-
F=
E
. (735)
L
Since we are at a horizon, we need to insert the maximum possible values. In terms of the
horizon area A and radius R, we can rewrite the limit case as
c4
= 4πR 2
E 1
(736)
4G A L
where we have introduced the maximum force and the maximum possible area 4πR 2 of
a horizon of (maximum local) radius R. The ratio E~A is the energy per unit area flowing
through the horizon.
Horizons are often characterized by the so-called surface gravity a instead of the radius
E=
1
a2 A L . (737)
4πG
Ref. 1131 Special relativity shows that at horizons the product aL of proper length and accelera-
tion is limited by the value c 2 ~2. This leads to the central relation for the energy flow at
horizons:
c2
E= aA. (738)
8πG
This equation makes three points. First, the energy flowing through a horizon is limited.
Secondly, this energy is proportional to the area of the horizon. Thirdly, the energy flow is
proportional to the surface gravity. These are fundamental, and characteristic, statements
of general relativity.
Taking differentials, the last relation can be rewritten as
c2
δE = a δA . (739)
8πG
In this form, the result can also be applied to general horizons, including those that are
Copyright © Christoph Schiller November 1997–January 2007
curved or time-dependent.*
Ref. 1132 In a well-known paper, Jacobson has given a beautiful proof of a simple connection: if
* Relation (739) is well known, though with different names for the observables. Since no communication
is possible across a horizon, the detailed fate of energy flowing through a horizon is also unknown. Energy
whose detailed fate is unknown is often called heat. Relation (739) therefore states that the heat flowing
through a horizon is proportional to the horizon area. When quantum theory is introduced into the discus-
sion, the area of a horizon can be called ‘entropy’ and its surface gravity can be called ‘temperature’; relation
(739) can then be rewritten as
δQ = T δS . (740)
However, this translation of relation (739), which requires the quantum of action, is unnecessary here. We
only cite it to show the relation between horizon behaviour and quantum gravity.
1094 xi general rel ativity versus quantum mechanics
energy flow is proportional to horizon area for all observers and all horizons, then general
relativity follows. To see the connection to general relativity, we generalize relation (739)
to general coordinate systems and general energy-flow directions.
The proof uses tensor notation. One introduces the general surface element dΣ and the
local boost Killing vector field k that generates the horizon (with suitable norm). Jacobson
rewrites the left-hand side of relation (739) as
δE = ∫T ab k
a
dΣ b , (741)
where Tab is the energy–momentum tensor. This is valid in arbitrary coordinate systems
and for arbitrary energy flow directions. Jacobson’s main result is that the essential part
of the right-hand side of relation (739) can be rewritten, using the (purely geometric)
Raychaudhuri equation, as
aδA = c 2 ∫R a
dΣ b ,
c4
Tab = R ab − ( R + Λ)дab ,
1
(744)
8πG 2
where Λ is a constant of integration whose value is not determined by the problem. These
are the full field equations of general relativity, including the cosmological constant Λ.
The field equations are thus shown to be valid at horizons. Since it is possible, by choos-
ing a suitable coordinate transformation, to position a horizon at any desired space-time
event, the field equations must be valid over the whole of space-time.
Since it is possible to have a horizon at every event in space-time, at every event in
nature there is the same maximum possible force (or power). This maximum force (or
Copyright © Christoph Schiller November 1997–January 2007
c4
D
∆E
. (746)
∆l 4G
Since experimental data are available only for composite systems,* we cannot yet say
whether this inequality also holds for elementary particles. This relation is not as well
known as the previous one. In fact, tests of it – for example with binary pulsars – may
distinguish general relativity from competing theories.
Copyright © Christoph Schiller November 1997–January 2007
Ref. 1135 * For example, the latest measurements for the Sun give Tb = GMb ~c 3 = 4.925 490 947 µs; the error in E
is thus much smaller than the (scaled) error in its radius.
1096 xi general rel ativity versus quantum mechanics
well as mattresses. In fact, for perfect crystals (without dislocations), these two material
constants are the same.
Empty space-time somehow behaves like a perfect crystal, or a perfect mattress: it has
a deformation-energy constant that is equal to the maximum force that can be applied to
it. The constant of gravitation thus determines the elasticity of space-time. Now, crystals
are not homogeneous, but are made up of atoms, while mattresses are made up of foam
bubbles. What is the corresponding structure of space-time? This is a central question
in the rest of our adventure. One thing is sure: unlike crystals, vacuum has no preferred
directions. In fact, all these analogies suggest that the appearance of matter might be
nature’s way of preventing space-time from ripping apart. We have to patient for a while,
before we can judge this opinion. By putting all the limits together, we take first step
towards answering the question of the structure of space-time.
S E
ħ
quantum theory on action:
2
special relativity on speed: v Dc
c4
general relativity on force: F D . (747)
4G
These limits are valid for all physical systems, whether composite or elementary, and for
all observers. Note that the limit quantities of special relativity, quantum theory and gen-
eral relativity can also be seen as the right-hand sides of the respective indeterminacy
relations. Indeed, the set (728, 731, 746) of indeterminacy relations or the set (727, 730,
Challenge 1525 e 745) of length limits is each fully equivalent to the three limit statements (747). Each set
of limits can be taken as a (somewhat selective) summary of twentieth-century physics.
By combining the three fundamental limits, we can obtain limits on a number of phys-
ical observables. The following limits are valid generally, for both composite and element-
ary systems:
¾
Copyright © Christoph Schiller November 1997–January 2007
tE = 7.6 ë 10−44 s
2Għ
time interval: (748)
c5
td E 4 = 1.7 ë 10−78 ms
2Għ
time-distance product: (749)
¾
c
ω D 2π = 8.2 ë 1043 ~s
c5
angular frequency: (751)
2Għ
units and limit values for all physical observables 1097
With the additional knowledge that space and time can mix, we get
¾
dE = 2.3 ë 10−35 m
2Għ
distance: (752)
c3
AE 3 = 5.2 ë 10−70 m2
2Għ
area: (753)
c
2Għ 3~2
volume: V E 3 = 1.2 ë 10−104 m3 (754)
c
c3
curvature: KD = 1.9 ë 1069 ~m2 (755)
2Għ
c5
mass density: ρD = 6.5 ë 1095 kg~m3 (756)
8G 2 ħ
ample, the founding fathers of quantum theory forgot the 1~2 in the definition of the
quantum of action. Similarly, the early relativists omitted the factor 4. With the present
framework, the question of the correct factors in the Planck units can be considered
settled.
Note that the dimensional independence of the three limits in nature means that
quantum effects cannot be used to overcome the force limit; similarly, the power limit
cannot be used to overcome the speed limit. The same is true for any other combination
of limits: they are mutually independent and consistent.
1098 xi general rel ativity versus quantum mechanics
The three limits of nature (747) result in a minimum distance and a minimum time in-
”
terval. These minimum intervals arise directly from the unification of quantum theory
and relativity: they do not appear if the theories are kept separate. In short, unification
implies that there is a smallest length in nature. This result is important: the formulation
of physics as a set of limit statements shows that the continuum model of space and time
is not correct. Continuity and manifolds are only approximations, valid for large actions,
low speeds and small forces. The reformulation of general relativity and quantum theory
with limit statements makes this especially clear. No other assumptions are needed.
Ref. 1139 In fact, the connection between minimum length and gravity is not new. In 1969, An-
drei Sakharov pointed out that a minimum length implies gravity. He showed that regular-
Ref. 1137
bound is sharpened.
lD
ħ
for elementary particles: . (757)
2mc
Using this new limit, valid only for elementary particles, we find the well-known mass,
energy and momentum limits:
¾
mD = 7.7 ë 10−9 kg = 0.42 ë 1019 GeV~c2
ħc
for elementary particles:
¾
8G
These single-particle limits, corresponding to the corrected Planck mass, energy and mo-
mentum, were discussed in 1968 by Andrei Sakharov, though with different numerical
Ref. 1143 factors. They are regularly cited in elementary particle theory. All known measurements
comply with them. It is also known that the unification of the electromagnetic and the two
nuclear interactions takes place at an energy near, but still clearly below, the maximum
particle energy.
Limits in thermodynamics
Copyright © Christoph Schiller November 1997–January 2007
The entropy S is limited by half the Boltzmann constant k. This result is almost 100 years
Ref. 1144 old; it was stated most clearly by Leo Szilard, though with a different numerical factor.
Like the other limit statements we have examined, this result can also be phrased as a
indeterminacy relation:
∆ ∆U E .
1 k
(760)
T 2
1100 xi general rel ativity versus quantum mechanics
Ref. 1145 This relation was given by Bohr and discussed by Heisenberg and many others (though
with k instead of k~2). It is mentioned here in order to complete the list of indeterminacy
relations and fundamental constants. With the single-particle limits, the entropy limit
leads to an upper limit for temperature:
¾
TD = 1.0 ë 1032 K .
ħc 5
(761)
2Gk 2
This corresponds to the temperature at which the energy per degree of freedom is given
by the (corrected) Planck energy. A more realistic value would have to take account of
the number of degrees of freedom of a particle at Planck energy. This would change the
numerical factor.
LE = = 4.4 ë 10−40 H
1 2ħG 1 ħ3 G
inductance: (766)
8πε o α c7 e2 2c 5
With the additional assumption that in nature at most one particle can occupy one Planck
volume, one gets
¾
Copyright © Christoph Schiller November 1997–January 2007
½
πε o α c 5
ρe D = = 1.3 ë 1085 C~m3
c9
charge density: e (767)
¾ ¾
2G 3 ħ 8G 3 ħ 3
RE = 2 = 2.1 kΩ
1 ħ
electric resistance: (769)
8πε 0 αc 2e
units and limit values for all physical observables 1101
2e 2
electric conductivity: G D 8πε 0 αc = = 0.49 mS (770)
¾ ¾
ħ
ID =e = 7.4 ë 1023 A
2πε 0 αc 6 c5
electric current: (771)
G 2ħG
Many of these limits have been studied already. The magnetic field limit is significant in
the study of extreme stars and black holes. The maximum electric field plays a role in the
Ref. 1146 theory of gamma ray bursters. For current, conductivity and resistance in single channels,
Ref. 1147 the limits and their effects were studied extensively in the 1980s and 1990s by researchers
who will probably win a Nobel Prize in the not too distant future.
The observation of quarks and of collective excitations in semiconductors with charge
e~3 does not necessarily invalidate the charge limit for physical systems. In neither case
is there is a physical system – defined as localized mass–energy interacting incoherently
with the environment – with charge e~3.
**
The general fact that to every limit value in nature there is a corresponding indeterminacy
relation is valid in particular for electricity. Indeed, there is an indeterminacy relation for
capacitors, of the form
∆C ∆U E e (772)
where e is the positron charge, C capacity and U potential difference; there is also an
indeterminacy relation between electric current I and time t
**
The gravitational attraction between two masses never yields force values high enough to
exceed the force limit. Why? First of all, masses m and M cannot come closer together
than the sum of their horizon radii. Using F = GmM~r 2 with the distance r given by the
(naive) sum of the two black hole radii as r = 2G(M + m)~c 2 , one gets
c4
FD
Mm
4G (M + m)2
, (774)
units and limit values for all physical observables 1103
which is never larger than the force limit. Thus even two attracting black holes cannot
exceed the force limit – in the inverse-square approximation of universal gravity. In short,
the minimum size of masses means that the maximum force cannot be exceeded.
**
It is well known that gravity bends space. Therefore, if they are to be fully convincing, our
calculation needs to be repeated taking into account the curvature of space. The simplest
way is to study the force generated by a black hole on a test mass hanging from a wire that
is lowered towards a black hole horizon. For an unrealistic point mass, the force would
Ref. 1150 diverge at the horizon. Indeed, for a point mass m lowered towards a black hole of mass
M at (conventionally defined radial) distance d, the force would be
F= ¼
GMm
. (775)
d 2 1 − 2G M
This diverges at d = 0, the location of the horizon. However, even a test mass cannot be
smaller than its own gravitational radius. If we want to reach the horizon with a realistic
test mass, we need to choose a small test mass m: only a small mass can get near the
horizon. For vanishingly small masses, however, the resulting force tends to zero. Indeed,
letting the distance tend to the smallest possible value by letting d = 2G(m + M)~c 2
2GM~c 2 requires m 0, which makes the force F(m, d) vanish. If on the other hand, we
remain away from the horizon and look for the maximum force by using a mass as large as
can possibly fit into the available distance (the calculation is straightforward), then again
the force limit is never exceeded. In other words, for realistic test masses, expression (775)
is never larger than c 4 ~4G. Taking into account the minimal size of test masses, we thus
see that the maximum force is never exceeded in gravitational systems.
**
An absolute power limit implies a limit on the energy that can be transported per unit
time through any imaginable surface. At first sight, it may seem that the combined power
emitted by two radiation sources that each emit 3~4 of the maximum value should give
3~2 times the maximum value. However, the combination forms a black hole, or at least
prevents part of the radiation from being emitted by swallowing it between the two
sources.
Copyright © Christoph Schiller November 1997–January 2007
**
One possible system that actually achieves the power limit is the final stage of black hole
evaporation. But even in this case, the power limit is not exceeded.
**
The maximum force limit states that the stress–energy tensor, when integrated over any
Ref. 1129 physical surface, does not exceed the limit value. No such integral, over any physical
surface, of any tensor component in any coordinate system, can exceed the force limit,
provided that it is measured by a nearby observer or a test body with a realistic proper
size. The maximum force limit thus applies to any component of any force vector, as well
1104 xi general rel ativity versus quantum mechanics
**
The force limit cannot be overcome with Lorentz boosts. A Lorentz boost of any non-
vanishing force value seems to allow the force limit to be exceeded at high speeds. How-
ever, such a transformation would create a horizon that makes any point with a potentially
**
The power limit is of interest if applied to the universe as a whole. Indeed, it can be used
to explain Olbers’ paradox. The sky is dark at night because the combined luminosity of
all light sources in the universe cannot be brighter than the maximum value.
**
The negative energy volume density −Λc ~4πG introduced by the positive cosmological
4
constant Λ corresponds to a negative pressure (both quantities have the same dimen-
sions). When multiplied by the minimum area it yields a force value
F= = 4.8 ë 10−79 N .
Λħc
(776)
2π
»
This is also the gravitational force between two corrected Planck masses located at the
cosmological distance π~4Λ . If we make the (wishful) assumption that this is the smal-
lest possible force in nature (the numerical prefactor is not yet known for sure), we arrive
at the fascinating conjecture that the full theory of general relativity, including the cos-
mological constant, is defined by the combination of a maximum and a minimum force Copyright © Christoph Schiller November 1997–January 2007
in nature.
Page 1088 ponding (corrected) Planck values. We explored the Planck limits in the last section. The
question now arises whether nature also imposes limits on physical observables at the op-
posite end of the measurement scale. For example, there is a highest force and a highest
power in nature; is there also a lowest force and a lowest power? Is there also a lowest
speed?
We will show that there are indeed such limits, for all observables. We give the general
method to generate such bounds, and explore several examples.* This exploration will
take us on an interesting survey of modern physics.
DD
1
LE . (777)
c
Indeed, there do not seem to be any exceptions to this limit in nature. In no known sys-
tem, from atoms to molecules, from ice skaters to galaxies, does the angular momentum
exceed this value. Even the most violently rotating objects, the so-called extremal black
Ref. 1151 holes, are limited in angular momentum by D D LE~c. (Actually, this limit is correct
only if the energy is taken as the irreducible mass times c 2 ; if the usual mass is used, the
limit is too large by a factor of 4.) The limit deduced from general relativity, given by
Copyright © Christoph Schiller November 1997–January 2007
D D L 2 c 3 ~4G, is not stricter than the one just given. No system-dependent lower limit
for angular momentum can be deduced.
The maximum value for angular momentum is also interesting when it is seen as an
action limit. Action is the time integral of the difference between kinetic and potential
energy. Since nature always seeks to minimize the action W, it seems strange to search for
systems that maximize it. You might check for yourself that the action limit W D LE~c is
Challenge 1527 ny not exceeded in any physical process.
Similarly, speed times mass times length is an action. Since action values in nature are
limited from below by ħ~2, we get
ħc 2 1
vE . (778)
2 LE
This is not a new result: it is just a form of the indeterminacy relation of quantum theory.
It gives a minimum speed for any system of energy E and diameter L. Even the extremely
Challenge 1528 ny slow radius change of a black hole by evaporation just realizes this minimal speed. Con-
tinuing with the same method, one also finds that the limit deduced from general relativ-
Challenge 1529 e ity, v D (c 2 ~4G)(L~E), gives no new information. Therefore, no system-dependent upper
speed limit exists.
Incidentally, the limits are not unique. Other limits can be found in a systematic way.
Upper limits can be multiplied, for example, by factors of (L~E)(c 4 ~4G) or (LE)(2~ħc),
FE
ħ 1
. (779)
2c T 2
Experiments do not reach this limit. The smallest forces measured in nature are those
in atomic force microscopes, where values as small as 1 aN are observed. But even these
values are above the lower force limit.
The power P emitted by a system of size L and mass M is limited by
E P E 2ħG 3 .
M M
c3 (780)
L L
The limit on the left is the upper limit for any engine or lamp, as deduced from relativity;
Copyright © Christoph Schiller November 1997–January 2007
not even the universe exceeds it. The limit on the right is the minimum power emitted by
any system through quantum gravity effects. Indeed, no system is completely tight. Even
black holes, the systems with the best ability to keep components inside their enclosure,
radiate. The power radiated by black holes should just meet this limit, provided the length
L is taken to be the circumference of the black hole. Thus the claim of the quantum gravity
limit is that the power emitted by a black hole is the smallest power that is emitted by any
composite system of the same surface gravity. (However, the numerical factors in the
black hole power appearing in the literature are not yet consistent.)
* The strictest upper limits are those with the smallest exponent for length, and the strictest lower limits are
those with the largest exponent of length.
upper and lower limits to observables 1107
Acceleration
The acceleration a of a system of a system of size L and mass M, as measured by a nearby
inertial observer, is limited by
E a E 4G 2 .
1 M
c2 (781)
L L
The lower limit (on the right) gives the acceleration due to universal gravity. Indeed, the
relative acceleration between a system and an observer has at least this value. The upper
limit (on the left) is due to special relativity. No exception to either of these limits has ever
been observed. Using the limit on the size of masses, the upper limit can be transformed
Ref. 1152 into the equivalent acceleration limit
2c 3
MEa
Momentum
The momentum p of a system of size L is limited by
c3
LEpE
ħ 1
. (783)
4G 2 L
The lower limit is due to quantum theory; experiments have confirmed it for all radiation
and matter. The upper limit is due to general relativity; it has never been exceeded.
ML 2 E t E 3 M .
2 4G
(784)
ħ c
The lower time limit is the gravitational time. No clock can measure a time shorter than
this. Similarly, no system can produce signals of shorter duration. Black holes, for ex-
ample, emit radiation with a frequency given by this minimum value. The upper time
limit is expected to be the exact lifetime of a black hole. (There is not yet a consensus in
the literature on the numerical factor.)
The upper limit to time measurements is due to quantum theory. It leads to an obvious
question: what happens to a single atom in space after the limit time has passed by? As
a composite system, an atom is not comparable with a black hole. The formula for the
1108 xi general rel ativity versus quantum mechanics
lifetime assumes that decay can take place in the tiniest energy steps. If there is no decay
mechanism, the formula does not apply. Thus the expression (784) does not apply to
atoms.
Distance limits are straightforward:
ML 2 E d E 2 M .
2c 4G
(785)
ħ c
Since curvature is an inverse square of distance, the curvature of space-time is also lim-
ited.
Mass change
The rate of mass change dM~dt of a system of size L and mass M is limited by
c2 1
EρE
ħ 1
.
Copyright © Christoph Schiller November 1997–January 2007
(788)
4G L 2 2c L 4
The upper limit, due to general relativity, is only achieved for black holes. The lower limit
is due to quantum theory. It also applies to the vacuum, if a piece of vacuum of size L
is taken as a physical system. Again, many other (less strict) limits can be formulated by
Page 1106 using the transformation rules mentioned above.
D
S A
(789)
S limit A limit
which gives
kc 3
SD A, (790)
4Għ
where A is the surface of the system. Equality is realized only for black holes. The old
question of the origin of the factor 4 in the entropy of black holes is thus answered here: it
is due to the factor 4 in the force or power bound in nature (provided that the factors from
Ref. 1154 the Planck entropy and the Planck action cancel). Time will tell whether this explanation
Page 252 will be generally accepted. Stay tuned.
We can also derive a more general relation by using a mysterious assumption, which
SD ML =
πkc 2πkc
MR . (791)
ħ ħ
In other words, we used
D
S M A L corr. Planck
. (792)
S corr. Planck M corr. Planck A corr. Planck L
Expression (791) is called Bekenstein’s entropy bound. No exception has ever been found
Ref. 1141 or constructed, despite many attempts. Again, the limit value itself is only realized for
black holes.
We need to explain the strange assumption used above. We are investigating the en-
tropy of a horizon. Horizons are not matter, but limits to empty space. The entropy of
horizons is due to the large number of virtual particles found at them. In order to de-
duce the maximum entropy of expression (792) one therefore has to use the properties of
the vacuum. In other words, either we use a mass-to-length ratio for vacuum above the
Planck limit, or we use the Planck entropy as the maximum value for vacuum.
Copyright © Christoph Schiller November 1997–January 2007
Other, equivalent limits for entropy can be found if other variables are introduced. For
Ref. 1155 example, since the ratio of the shear viscosity η to the volume density of entropy (times
k) has the dimensions of action, we can directly write
SD
k
ηV . (793)
ħ
Again, equality is only attained in the case of black holes. In time, no doubt, the list of
similar bounds will grow longer.
Is there also a smallest entropy? So far, there does not seem to be a system-dependent
minimum value for entropy: the present approach gives no expression that is larger than
Challenge 1531 ny k.
1110 xi general rel ativity versus quantum mechanics
The establishment of the entropy limit is an important step towards making our de-
scription of motion consistent. If space-time can move, as general relativity maintains, it
also has an entropy. How could entropy be limited if space-time were continuous? Clearly,
because of the existence of a minimum distance and minimum time in nature, space-time
cannot be continuous, but must have a finite number of degrees of freedom, and thus a
finite entropy.
The Bekenstein limit also leads to some interesting speculations. Let us speculate that
the universe itself, being surrounded by a horizon, meets the Bekenstein bound. The en-
tropy bound gives a bound to all degrees of freedom inside a system: it tells us that the
Challenge 1532 e number N d.o.f . of degrees of freedom of the universe is roughly
Compare this with the number N Pl. vol. of Planck volumes in the universe
We see that particles are only a tiny fraction of what moves around. Most motion must
be movement of space-time. But also, space-time moves much less than naively expected.
To find out how all this happens is the challenge of the unified description of motion.
Temperature
A lower limit for the temperature of a thermal system can be found using the idea that
the number of degrees of freedom of a system is limited by its surface, or more precisely,
by the ratio between the surface and the Planck surface. One gets the limit
TE
4Għ M
. (797)
πkc L 2
Alternatively, using the method given above, one can use the limit on the thermal energy
kT~2 E ħc~2πL (the thermal wavelength must be smaller than the size of the system)
Copyright © Christoph Schiller November 1997–January 2007
together with the limit on mass c 2 ~4G E M~L, and deduce the same result.
We have met this limit already: when the system is a black hole, it gives the temperat-
ure of the emitted radiation. In other words, the temperature of black holes is the lower
limit for all physical systems for which a temperature can be defined, provided they share
the same boundary gravity. The latter condition makes sense: boundary gravity is access-
ible from the outside and describes the full physical system, since it depends on both its
boundary and its content. So far, no exception to this claim is known. All systems from
everyday life comply with it, as do all stars. Even the coldest known systems in the uni-
verse, namely Bose–Einstein condensates and other cold matter produced in laboratories,
Challenge 1533 n are much hotter than the limit, and thus much hotter than black holes of the same sur-
upper and lower limits to observables 1111
Page 318 face gravity. (We saw earlier that a consistent Lorentz transformation for temperature is
not possible; so the minimum temperature limit is only valid for an observer at the same
gravitational potential as the system under consideration and stationary relative to it.)
There seems to be no consistent way to define an upper limit for a system-dependent
Challenge 1534 ny temperature. Limits for other thermodynamic quantities can be found, but we will not
discuss them here.
Electromagnetic observables
When electromagnetism plays a role in a system, the system also needs to be characterized
by a charge Q. Our method then gives the following lower limit for the electric field E:
M2
E E 4Ge . (798)
Q 2 L2
4Ge M 2
BE . (799)
c Q 2 L2
occur when one of the two aspects is somehow forgotten. We will study a few examples.
**
Several black hole limits are of importance to the universe itself. For example, the ob-
served average mass density of the universe is not far from the corresponding limit. Also,
the lifetime limit provides an upper limit for the age of the universe. However, the age of
the universe is far from that limit by a large factor. In fact, since the universe’s size and
age are increasing, the age limit is pushed further into the future with every second that
passes. The universe evolves so as to escape its own decay.
**
The content of a system is characterized not only by its mass and charge, but also by its
1112 xi general rel ativity versus quantum mechanics
strangeness, isospin, colour charge, charge and parity. Can you deduce the limits for these
Challenge 1537 r quantities?
**
In our discussion of black hole limits, we silently assumed that they interact, like any
thermal system, in an incoherent way with the environment. Which of the results of this
Challenge 1538 n section change when this condition is dropped, and how? Which limits can be overcome?
**
Challenge 1539 e Can you find a general method to deduce all limits?
**
Brushing aside some important details, we can give the following summary of our study
of nature. For Galilean physics, there is no difference between composite and elementary
antiparticles are not clearly distinguished at Planck scales. The minimum length implies
that there is no way to define the exact boundaries of objects or elementary particles. But
a boundary is what separates matter from vacuum. In short, a minimum measurement
error means that, at Planck scales, it is impossible to distinguish objects from vacuum
Ref. 1137 with complete precision. To put it bluntly, at Planck scales, time and space do not exist.
These conclusions are in complete agreement with the conclusions of modern research
on unified theories. The limit principles imply the same conceptual results as are found
in string theory and the various approaches to quantum gravity. To show the power of the
maximum force limit, we now explore a few conclusions that go beyond such approaches.
particles, a discrete set of points, or any other discrete set of physical observables – the
entities have to be separable. But the inevitable measurement errors contradict separab-
Ref. 1137 ility. Thus at the Planck energy it is impossible to count physical objects with precision.
Nature has no parts.
In short, at Planck energies, perfect separation is impossible in principle. We cannot
distinguish observations. At Planck scales it is impossible to split nature into separate en-
tities. In nature, elements of sets cannot be defined. Neither discrete nor continuous sets
Ref. 1140 can be constructed. Nature does not contain sets or elements.
Since sets and elements are only approximations, the concept of a ‘set’, which assumes
separable elements, is too specialized to describe nature. Nature cannot be described at
Planck scales – i.e., with full precision – if any of the concepts used for its description
1114 xi general rel ativity versus quantum mechanics
presupposes sets. However, all concepts used in the past 25 centuries to describe nature
– space, time, particles, phase space, Hilbert space, Fock space, particle space, loop space
or moduli space – are based on sets. They must all be abandoned at Planck energy.
Nature is one and has no parts. No correct mathematical model of nature can be based
on sets. But none of the approach used so far in theoretical physics – not even string theory
or the various quantum gravity approaches – has abandoned sets. This requirement must
guide us in the future search for the unification of relativity and quantum theory.
Es ist fast unmöglich, die Fackel der Wahrheit
Since nature does not contain sets, such an axiomatic description of nature cannot
Ref. 1140 exist. The reasoning is simple: we only have to look at the axiomatic systems found in
mathematics. Axiomatic systems define mathematical structures. These structures are of
three main types: algebraic systems, order systems or topological systems. Most math-
ematical structures – such as symmetry groups, vector spaces, manifolds or fields – are
combinations of all three. But all mathematical structures are based on elements and sets.
Elements and sets are common to all axiomatic systems in mathematics. The underlying
reason is that every mathematical concept contains at least one set.
Furthermore, all concepts used so far in physics depend on elements and sets. For hu-
mans, it is difficult even to think without first defining a set of possibilities. Yet nature does
* ‘It is almost impossible to carry the torch of truth through a crowd without scorching somebody’s beard.’
limits to measurement precision and their challenge to thought 1115
Outlook
Physics can be summarized in a few limit statements. They imply that in nature every
physical observable is limited by a value near the Planck value. The speed limit is equival-
ent to special relativity, the force limit to general relativity, and the action limit to quantum
theory. Although this summary could have been made (or at least conjectured) by Planck,
Einstein or the fathers of quantum theory, it is actually much more recent. The numer-
ical factors for most of the limit values are new. The limits suggest interesting Gedanken
gravity attempt this, though possibly not yet with the full radicalism necessary. To realize
this unification is our next task.
1116 xi general rel ativity versus quantum mechanics
Biblio graphy
1127 Bohr explained the indivisibilty of the quantum of action in his famous Como lecture. N.
B ohr, Atomtheorie und Naturbeschreibung, Springer, Berlin, 1931. More statements about
the indivisibility of the quantum of action can be found in N. B ohr, Atomic Physics and
Human Knowledge, Science Editions, New York, 1961. For summaries of Bohr’s ideas by
others see Max Jammer, The Philosophy of Quantum Mechanics, Wiley, first edition, 1974,
pp. 90–91, and John Honner, The Description of Nature – Niels Bohr and the Philosophy
of Quantum Physics, Clarendon Press, 1987, p. 104. Cited on page 1089.
1128 For an overview of the quantum of action as a basis of quantum theory, see the introduction
to quantum theory in this textbook, available as C. Schiller, An appetizer – quantum
theory for poets and lawyers, http://www.motionmountain.net/C-5-QEDA.pdf. Cited on
page 1090.
1129 The first published statements of the principle of maximum force were in the present text-
T. Jaekel & S. R enaud, Gravitational quantum limit for length measurement, Physics
Letters A 185, pp. 143–148, 1994. Cited on page 1095.
1135 M. Kramer & al., Tests of general relativity from timing the double pulsar, prerpint at
http://arxiv.org/abs/astro-ph/060941. Cited on page 1095.
1136 Minimal length and minimal time intervals are discussed, for example, by G. Amelino-
Camelia, Limits on the measurability of space-time distances in (the semiclassical ap-
proximation of) quantum gravity, Modern Physics Letters A 9, pp. 3415–3422, 1994, and by
Y.J. Ng & H. Van Dam, Limit to space-time measurement, Modern Physics Letters A 9,
pp. 335–340, 1994. Many other authors have explored the topic. Cited on page 1097.
1137 A pedagogical summary is C. Schiller, http://www.motionmountain.net/C10-QMGR.
pdf or C. Schiller, Le vide diffère-t-il de la matière? in E. Gunzig & S. Diner, editors,
Le Vide – Univers du tout et du rien – Des physiciens et des philosophes s’interrogent, Les
bibliography 1117
L90, 1998, and C.L. Bianco, R. Ruffini & S.-S. Xue, The elementary spike produced
by a pure e+ e− pair-electromagnetic pulse from a black hole: the PEM pulse, Astronomy
and Astrophysics 368, pp. 377–390, 2001. Cited on page 1101.
1147 See for example the review in C.W.J. Beenakker & al., Quantum transport in semicon-
ductor nanostructures, pp. 1–228, in H. Ehrenreich & D. Turnbull, editors, Solid
State Physics, volume 44, Academic Press, 1991. The prediction of a future Nobel Prize was
made and conveyed to the authors by C. Schiller in 1992. Cited on page 1101.
1148 Brian Greene, The Elegant Universe – Superstrings, Hidden Dimensions, and the Quest
for the Ultimate Theory, Vintage, 2000. Cited on page 1101.
1149 A discussion of a different electrical uncertainty relation, between current and charge, can
be found in Y-Q. Li & B. Chen, Quantum theory for mesoscopic electronic circuits and
its applications, http://arxiv.org/abs/cond-mat/9907171. Cited on page 1102.
1118 xi general rel ativity versus quantum mechanics
1150 Wolf gang R indler, Relativity – Special, General and Cosmological, Oxford University
Press, 2001, p. 230. Cited on page 1103.
1151 Hans C. Ohanian & R emo Ruffini, Gravitation and Spacetime, W.W. Norton & Co.,
1994. Cited on page 1105.
1152 A maximal acceleration for microscopic systems was stated by E.R. Caianiello, Lettere
al Nuovo Cimento 41, p. 370, 1984. It is discussed by G. Papini, Shadows of a maximal
acceleration, http://arxiv.org/abs/gr-qc/0211011. Cited on page 1107.
1153 The entropy limit for black holes is discussed by J.D. Bekenstein, Entropy bounds and
black hole remnants, Physical Review D 49, pp. 1912–1921, 1994. See also J.D. Beken-
stein, Universal upper bound on the entropy-to-energy ratio for bounded systems, Phys-
ical Review D 23, pp. 287–298, 1981. Cited on page 1108.
1154 Private communications with Jos Uffink and Bernard Lavenda. Cited on page 1109.
1155 P. Kovtun, D.T. Son & A.O. Starinets, A viscosity bound conjecture, preprint at
http://arxiv.org/abs/hep-th/0405231. Cited on page 1109.
“ possiet.
The expressions for the Compton wavelength λ = h~mc and for the Schwarzschild radius
Terence*
”
r s = 2Gm~c 2 imply a number of arguments which lead to the conclusion that at Planck
energies, what we call space-time ‘points’ and ‘point’ particles must actually be described
by extended entities. These arguments point towards a connection between microscopic
and macroscopic scales, confirming the results of modern string theory and quantum
gravity. At the same time, they provide a pedagogical summary of this aspect of present-
day theoretical physics. Here is a summary of what follows.**
1 – Any experiment trying to measure the size or the shape of an elementary particle
with high precision inevitably leads to the result that at least one dimension of the particle
research efforts is also given. To complete the discussion, experimental and theoretical
checks for the idea of extended building blocks of nature are presented.
* ‘Nothing is so difficult that it could not be investigated.’ Terence is Publius Terentius Afer (c. 190–159 bce ),
important roman poet. He writes this in his play Heauton Timorumenos, verse 675.
** This section describes a research topic, and is not a compendium of generally accepted results (yet). It
was written between December 2001 and May 2002.
1120 xi general rel ativity vs. quantum mechanics • the shape of points
We can only use continuous space-time and point particles as basic entities when our
description of nature is separated into general relativity and quantum theory. When the
two theories are united, what we usually call a ‘point’ turns out to have quite counter-
”
intuitive properties. We explore a few of them in the following.
We have already given a standard argument showing that points do not exist in nature.
The Compton wavelength and the Schwarzschild radius together determine a minimal
Ref. 1157, Ref. 1158 length and a minimal time interval in nature. They are given (within a factor of order one,
which we will usually omit) by the Planck length and the Planck time, with the values
The existence of minimal length and time intervals in nature implies that ‘points’ in space,
time or space-time do not really exist, and forces us to give up the traditional idea of
continuity. Even though, properly speaking, points do not exist, and thus space points,
events and point particles do not exist either, we can still study these entities in detail.
The results provide many fascinating surprises.
Page 1035 Using a simple Gedanken experiment, we found earlier that particles and space-time
cannot be distinguished from each other at Planck scales. We argued as follows. The
largest mass that can be put in a box of size R is a black hole with a Schwarzschild ra-
dius of the same value. That is also the largest possible mass measurement error. But any
piece of vacuum also has a smallest mass measurement error.
The existence of a smallest mass measurement error is so important that it merits spe-
cial attention. Mass measurement errors always prevent humans from stating that a re-
gion of space has zero mass. Similarly, nature cannot ‘know’ that the mass of a region is
zero, because of quantum indeterminacy. Energy and mass are always blurred in quantum
theory. We are not used to applying this fact to the vacuum, but at Planck scales we have
to. We know from quantum field theory that the vacuum, like any other system, has a
Copyright © Christoph Schiller November 1997–January 2007
mass; of course, its value when averaged over a long time is zero. For finite measuring
times, there will be an uncertainty in the mass value.
Indeterminacy for the vacuum arises from limitations in space as well as time. These in-
determinacies imply minimum mass errors for finite-sized regions of vacuum. Quantum
theory implies that nobody – not even nature – knows the exact mass value of a system
or of a region of empty space.
A box is empty if it does not contain anything. But emptiness is not well defined in
relation to photons whose wavelength is greater than or equal to the size R of the box. The
mass measurement error for an ‘empty’ box – corresponding to what we call a vacuum – is
due to the indeterminacy relation, and is equal to the mass whose Compton wavelength
Page 1035 matches the size of the box. As we saw earlier, the same mass value is found in every
the shape of points 1121
other Gedanken experiment – for example, trying to determine the gravitational mass by
Ref. 1157 weighing the ‘piece’ of vacuum or by measuring its kinetic energy.
Another – ultimately equivalent – way to show that a region of vacuum has a finite
mass error is to look at how the vacuum energy depends on the positional indeterminacy
of the border of the region. Any region is defined by its border. The positional indeterm-
inacy of the border will induce a mass error for the contents of the box, in the same way
as a time limit does. Again, the resulting mass error for a region of vacuum is equal to
the mass for which the box size is the Compton wavelength.
Summarizing, for a box of size R, nature allows only mass values and mass measure-
ment error values m between two limits:
c2 R
EmE
ħ
(full box) (empty box) . (801)
G cR
We see directly that for sizes R of the order of the Planck length, the two limits coincide:
In other words, for boxes of Planck size, we cannot distinguish a full box from an empty
one. This means that there is no difference between vacuum and matter at Planck scales.
Of course, a similar statement holds vacuum and radiation. At Planck scales, vacuum
cannot be distinguished from particles of any kind.
where the differences are taken between the values before and after the collision. The error
δM of such a measurement is simply given by
=
δM δ∆v δm δ∆V
+ + . (804)
M ∆v m ∆V
At Planck scales we have δ∆v~∆v 1, because the velocity error is always – like the velo-
cities themselves – of the order of the speed of light. In other words, at Planck scales the
mass measurement error is so large that we cannot determine whether a mass is different
from zero: vacuum is indistinguishable from matter.
We reach the same conclusion if we take light with wavelength λ as the probe particle.
1122 xi general rel ativity vs. quantum mechanics • the shape of points
object
comparison
standard
=
δM δ∆λ δ∆V
+ . (805)
M ∆λ ∆V
* ‘Our task is not to see what nobody has ever seen, but to think what nobody has ever thought about that
which everybody has seen already.’
argument 1: the size and shape of elementary particles 1123
What happens if Planck energy is approached, advancing step by step to higher en-
ergy? Every measurement requires comparison with a standard. A standard is made of
matter and comparison is performed using radiation (see Figure 414). Thus any measure-
ment requires distinction between matter, radiation and space-time. However, distinc-
tion between matter and radiation is possible only up to the (grand) unification energy,
which is about 1~800 of the Planck energy. Measurements do not allow us to prove that
particles are point-like.
Let us take a step back. Do measurements at least allow us to say whether particles can
be contained inside small spheres?
Do boxes exist?
The first and simplest way to determine the size of a compact particle such as a sphere, or
something akin to it, is to measure the size of a box it fits in. To be sure that the particle
fits inside, we must first must be sure that the box is tight: that is, whether anything (such
must stops when (or before) we reach Planck scales. Cutting is the insertion of a wall, and
knives are limited just as walls are. The limits of walls imply limits to determination of
size.
In particular, the limits to walls and knives imply that at Planck energies, a cut does
not necessarily lead to two separate parts. One cannot state that the two parts have been
really separated: the possibility of a thin connection between them can never be excluded.
In short, one cannot prove compactness by cutting objects at Planck scales.
point-likeness. At high energy, detecting this departure requires scattering. For example,
we can suspend the particle in a trap and then shoot a probe at it. What happens in a scat-
tering experiment at high energies? This question has been studied by Leonard Susskind
Ref. 1160 and his colleagues. When shooting at the particle with a high-energy probe, the scattering
process is characterized by an interaction time. Extremely short interaction times imply
sensitivity to the size and shape fluctuations due to the quantum of action. An extremely
short interaction time provides a cut-off for high-energy shape and size fluctuations, and
thus determines the measured size. As a result, the size measured for any microscopic, but
extended, object increases when the probe energy is increased towards the Planck value.
In summary, even though at experimentally achievable energies the size is always smal-
ler than a measurable amount, when one approaches the Planck energy, the size increases
above all bounds. So at high energies we cannot give an upper limit to the size of a particle.
In other words, since particles are not point-like at everyday energies, at Planck energies
they are enormous: particles are extended.
“
Καιρὸν γνῶθι.*
Pittacus
Humans – or any other types of observers – can only observe a part of the world with
finite resolution in time and in space. In this respect, humans resemble a film camera.
”
Ref. 1161 The highest possible resolution was (almost) discovered in 1899: the Planck time and the
Copyright © Christoph Schiller November 1997–January 2007
Planck length. No human, no film camera and no apparatus can measure space or time
Ref. 1157, Ref. 1162 intervals smaller than the Planck values. But what would happen if we took photographs
with shutter times approaching the Planck time?
Imagine that you have the world’s best shutter and that you are taking photographs at
shorter and shorter times. Table 79 gives a rough overview of the possibilities. At shorter
shutter times, photographs get darker. When the shutter time reaches the oscillation time
of light, strange things happen: light has no chance to pass undisturbed; signal and noise
become indistinguishable; and the moving shutter will produce colour shifts. In contrast
* ‘Recognize the right moment.’ Also rendered as: ‘Recognize thine opportunity.’ Pittacus (Πιττακος) of
Mytilene (c. 650–570 BCE ), was a Lesbian tyrant and lawmaker; he was also one of the ‘Seven Sages’ of
ancient Greece.
argument 1: the size and shape of elementary particles 1125
D u r at i o n B lu r O b s e r va t i o n p o s s i b i l i t i e s
to our everyday experience, the picture would get more blurred at extremely short shutter
times. Photography is impossible not only at long but also at short shutter times.
The difficulty of taking photographs is independent of the wavelength used. The limits
move, but do not disappear. With a shutter time of τ, photons of energy lower than ħ~τ
cannot pass undisturbed. The blur is small when shutter times are those of everyday life,
but increases when shutter times are shortened towards Planck times. As a result, there
is no way to detect or confirm the existence of point objects by taking pictures. Points in
space, as well as instants of time, are imagined concepts: they do not belong in a precise
description of nature.
At Planck shutter times, only signals with Planck energy can pass through the shutter.
Since at these energies matter cannot be distinguished from radiation or from empty
space, all objects, light, and vacuum look the same. It is impossible to say what nature
looks like at very short times.
But the situation is worse than this: a Planck shutter cannot exist at all, as it would
Copyright © Christoph Schiller November 1997–January 2007
need to be as small as a Planck length. A camera using it could not be built, as lenses do
not work at this energy. Not even a camera obscura – without any lens – would work, as
diffraction effects would make image production impossible.
In other words, the idea that at short shutter times a photograph of nature shows a
frozen image of everyday life, like a stopped film, is completely wrong.* Indeed, at a single
instant of time nature is not frozen at all. Zeno criticized this idea in his discussions of
motion, though not as clearly as we can do now. At short times, nature is blurred. In
particular, point particles do not exist.
In summary, whatever the intrinsic shape of what we call a ‘point’ might be, we know
* In fact, a shutter does not exist even at medium energy: shutters, like walls, stop existing at around 10 MeV.
1126 xi general rel ativity vs. quantum mechanics • the shape of points
that, being always blurred, it is first of all a cloud. Whatever method is used to photograph
of a point particle, it always shows an extended entity. Let us study its shape in more detail.
information.
Are these arguments watertight? We assumed three dimensions at all scales, and that
the shape of the particle itself is fixed. Maybe these assumptions are not valid at Planck
scales. Let us check the alternatives. We have already shown that because of the funda-
mental measurement limits, the dimensionality of space-time cannot be determined at
Ref. 1157 Planck scales. Even if we could build perfect three-dimensional boxes, holes could remain
in other dimensions. It does not take long to see that all the arguments against compact-
ness work even if space-time has additional dimensions.
argument 1: the size and shape of elementary particles 1127
makes up space-time.**
In summary, only the idea of points leads to problems at Planck scales. If space-time
and matter are imagined to be made, at Planck scales, of extended and fluctuating entities,
all problems disappear. We note directly that for extended entities the requirement of a
Ref. 1157 non-local description is satisfied; while for fluctuating entities the requirement of a statist-
ical description of the vacuum is satisfied. Finally, the argument forbidding composition
* Examples are the neutron, positronium, or the atoms. Note that the argument does not change when the
elementary particle itself is unstable, like the muon. The possibility that all components are heavier than the
composite, which would avoid this argument, does not seem to lead to satisfying physical properties: for
example, it leads to intrinsically unstable composites.
Ref. 1164 ** Thus at Planck scales there is no quantum Zeno effect.
1128 xi general rel ativity vs. quantum mechanics • the shape of points
”
Ref. 1165
We are used to the idea that empty space is made of spatial points. Let us check whether
this is true at high energy. At Planck scales, no measurement can give zero length, zero
are sets of points for which volume makes no sense. Physically speaking, we can say that
the concept of volume does not exist for continuous space; it is only definable if an in-
trinsic length exists. This is the case for matter, and it must also be the case for vacuum.
But any concept with an intrinsic length must be described by one or several extended
components.* In summary, we need extended entities to build up space-time.
In mathematics it is impossible to define the dimension of a set of points on the basis
of the set structure alone. Any compact one-dimensional set has as many points as any
compact three-dimensional set – indeed, as any compact set of any dimensionality greater
Ref. 1166 * Imagining the vacuum as a collection of entities, such as spheres, with Planck size in all directions would
avoid the Banach–Tarski paradox, but would not allow us to deduce the number of dimensions of space and
Challenge 1543 n time. It would also contradict all the other results of this section. Therefore we do not explore it further.
argument 2: the shape of points in vacuum 1129
than zero. To build up the physical three-dimensional vacuum, we need entities that or-
ganize their neighbourhood. The fundamental entities must possess some sort of ability
to form bonds, which will construct or fill precisely three dimensions. Bonds require ex-
tended entities. A collection of tangled entities extending to the maximum scale of the
region under consideration would work perfectly. Of course, the precise shape of the fun-
damental entities is not yet known. In any case, we again find that any constituents of
physical three-dimensional space must be extended.
In summary, we need extension to define dimensionality and to define volume. This
is not surprising. We deduced above that the constituents of particles are extended. Since
vacuum is not distinguishable from matter, we would expect the constituents of vacuum
to be extended as well. Stated simply, if elementary particles are not point-like, then space-
time points cannot be either.
The maximum mass that fits into a piece of vacuum is a black hole. But in this case too,
the maximum mass depends only on the surface of the given region of vacuum. The max-
imum mass increases less rapidly than the volume. In other words, the number of points
in a region is only proportional to the surface area of the region. We are forced to conclude
that vacuum must be made of extended entities crossing the whole region, independently
of its shape.
Page 228 More than two thousand years ago, the Greeks argued that matter must be made of
Ref. 1167 particles because salt can be dissolved in water and because fish can swim through water.
Now that we know more about Planck scales, we have to reconsider this argument. Like
fish swimming through water, particles can move through vacuum; but since vacuum
has no bounds, and cannot be distinguished from matter, vacuum cannot be made of
1130 xi general rel ativity vs. quantum mechanics • the shape of points
particles. However, there is another possibility that allows for motion of particles through
a vacuum: both vacuum and particles might be made of a web of extended entities. Let
us study this possibility in more detail.
it includes the gravitational constant and the speed of light. Let us study duality in more
detail.
* There is also an S-duality, which connects large and small coupling constants, and a U-duality, which is
the combination of S- and T-duality.
argument 3: the l arge, the small and their connection 1131
Is small large?
[Zeno of Elea maintained:] If the existing are
”
Ref. 1168
length, time, and mass cannot be distinguished from each other. Since every observable is
a combination of length, mass and time, space-time duality means that there is a symmetry
between all observables. We call it the total symmetry.*
Total symmetry implies that there are many specific types of duality, one for each pair
of quantities under investigation. Indeed, in string theory, the number of duality types dis-
Page 565 covered is increasing every year. It includes, among others, the famous electric–magnetic
Ref. 1169 duality we first encountered in the chapter on electrodynamics, coupling constant dual-
* A symmetry between size and Schwarzschild radius, i.e. a symmetry between length and mass, will lead to
general relativity. Additionally, at Planck energy there is a symmetry between size and Compton wavelength.
In other words, there is a symmetry between length and inverse mass. This means that there is a symmetry
between coordinates and wave functions. This is a symmetry between states and observables. It leads to
quantum theory.
1132 xi general rel ativity vs. quantum mechanics • the shape of points
ity, surface–volume duality, space-time duality, and many more. All this confirms that
there is an enormous amount of symmetry at Planck scales. Similar symmetries have
Ref. 1170 been known right from the beginning of string theory.
Most importantly, total symmetry implies that gravity can be seen as equivalent to all
other forces. Space-time duality shows that unification is possible. Physicists have always
dreamt about unification. Duality tells us that this dream can indeed be realized.
It may seem that total symmetry completely contradicts what was said in the previous
section, where we argued that all symmetries are lost at Planck scales. Which result is
correct? Obviously, both of them are.
At Planck scales, all low-energy symmetries are indeed lost. In fact, all symmetries
that imply a fixed energy are lost. However, duality and its generalizations combine both
small and large dimensions, or large and small energies. Most of the standard symmetries
of physics, such as gauge, permutation and space-time symmetries, are valid at each fixed
energy separately. But nature is not made this way. The precise description of nature re-
it. In particular, we need to understand exactly what happens to duality when we restrict
Challenge 1545 e ourselves to low energies, as we do in everyday life. This question is left for the next sec-
tion.
* Renormalization energy does connect different energies, but not in the correct way; in particular, it does
not include duality.
argument 4: d oes nature have parts? 1133
“ William of Occam
There is another argument, independent of those given above, that points towards a
model of nature based on extended entities. Let us take a broader view. Any concept
”
for which we can distinguish parts is described by a set. We usually describe nature as a
set of objects, positions, instants and so on. The most famous set-theoretic description
of nature is the oldest known, given by Democritus: ‘The world is made of indivisible
particles and void.’ This description was extremely successful in the past: there were no
discrepancies with observations. However, after 2500 years, the conceptual difficulties of
this approach are obvious.
We now know that Democritus was wrong, first of all, because vacuum and matter
cannot be distinguished at Planck scales. Thus the word ‘and’ in his sentence is already a
number of electrons.
In fact, practically all physicists share this belief, although they usually either pretend to
favour some other number, or worse, keep the number unspecified. We have seen dur-
ing our walk that in modern physics many specialized sets are used to describe nature.
* ‘Multitude should not be introduced without necessity.’ This famous principle is commonly called Occam’s
razor. William of Ockham (b. 1285/1295 Ockham, d. 1349/50 München), or Occam in the common Latin
spelling, was one of the great thinkers of his time. In his famous statement he expresses that only those
concepts which are strictly necessary should be introduced to explain observations. It can be seen as the
requirement to abandon beliefs when talking about nature. But at this stage of our mountain ascent it has
an even more direct interpretation.
1134 xi general rel ativity vs. quantum mechanics • the shape of points
We have used vector spaces, linear spaces, topological spaces and Hilbert spaces. But
we consistently refrained, like all physicists, from asking about the origin of their sizes
(mathematically speaking, of their dimensionality or cardinality). In fact, it is just as un-
satisfying to say that the universe contains some specific number of atoms as it is to say
that space-time is made of point-like events arranged in 3 + 1 dimensions. Both are state-
ments about set sizes, in the widest sense. In a complete, unified description of nature
the number of smallest particles and the number of space-time points must not be part
of the description, but must result from the description.
Any part of nature is by definition smaller than the whole of nature, and different from
other parts. As a result, no description of nature by a set can possibly yield the number
of particles or the dimensionality of space-time. As long as we insist on using space-time
or Hilbert spaces for the description of nature, we cannot understand the number of di-
mensions or the number of particles.
That is not too bad, as we know already that nature is not made of parts. We know that
* As a curiosity, practically the same discussion can already be found in Plato’s Parmenides,Parmenides
Ref. 1172 written in the fourth century bce. There, Plato musically ponders different arguments on whether nature
is or can be a unity or a multiplicity, i.e. a set. It seems that the text is based on the real visit to Athens by
Parmenides and Zeno. (Their home city, Elea, was near Naples.) Plato does not reach a conclusion. Modern
physics, however, does.
argument 4: d oes nature have parts? 1135
thing.
Let us go further. We need a description of nature that allows us to state that at Planck
energies nothing can be distinguished from anything else. For example, it must be im-
possible to distinguish particles from each other or from the vacuum. There is only one
solution: everything – or at least, what we call ‘everything’ in everyday life – must be made
of the same single entity. All particles are made of one ‘piece’. Every point in space, every
event, every particle and every instant of time must be made of the same single entity.
An amoeba
A theory of nothing describing everything is
We have found that parts are approximate concepts. The parts of nature are not strictly
smaller than nature itself. As a result, any ‘part’ must be extended. Let us try to extract
always larger than their constituents. On the other hand, composed entities must usually
appear to be larger than their constituents. For example, space ‘points’ or ‘point’ particles
are tiny, even though they are only approximations. Which concept without boundaries
can be at their origin? Using usual concepts, the world is everywhere at the same time; if
nature is to be described by a single constituent, this entity must be extended.
The entity has to be a single one, but it must seem to be multiple: it has to be multiple
approximately, as nature shows multiple aspects. The entity must be something folded.
It must be possible to count the folds, but only approximately. (An analogy is the ques-
tion of how many grooves there are on an LP or a CD: depending on the point of view,
local or global, one gets different answers.) Counting folds would correspond to a length
measurement.
1136 xi general rel ativity vs. quantum mechanics • the shape of points
The simplest model would be a single entity which is extended and fluctuating, goes
to infinity, and allows approximate localization, thus allowing approximate definition of
parts and points.* In more vivid imagery, nature could be described by some deformable,
folded and tangled entity: a giant, knotted amoeba. An amoeba slides between the fingers
whenever one tries to grab a part of it. A perfect amoeba flows around any knife trying
to cut it. The only way to hold it would be to grab it in its entirety. However, for someone
himself made of amoeba strands, this is impossible. He can only grab it approximately, by
catching part of it and approximately blocking it, for example using a small hole so that
the escape takes a long time.
To sum up, nature is modelled by an entity which is a single unity (to eliminate distin-
guishability), extended (to eliminate localizability) and fluctuating (to ensure approxim-
ate continuity): a far-reaching, fluctuating fold, like an amoeba. The tangled branches of
the amoeba allow a definition of length via counting of the folds. In this way, discreteness
of space, time, and particles could also be realized; the quantization of space-time, matter
dius
r S = 2Gm~c 2 (806)
is a black hole.
A black hole can be formed when a whole star collapses under its own weight. Such a
black hole is a macroscopic body, with a large number of constituents. Therefore it has an
entropy. The entropy S of a macroscopic black hole was determined by Bekenstein and
Ref. 1174, Ref. 1175 Hawking, and is given by
Challenge 1546 ny * Is this the only way to describe nature? Is it possible to find another description, particularly if space and
time are not used as background? The answers are unclear at present.
argument 5: the entropy of bl ack holes 1137
4πGm 2
S= S=k
k A
2 4
or (807)
l Pl ħc
where k is the Boltzmann constant and A = 4πr S2 is the surface of the black hole horizon.
Ref. 1176 This important result has been derived in many different ways. The various derivations
also confirm that space-time and matter are equivalent, by showing that the entropy value
can be interpreted as an entropy either of matter or of space-time. In the present context,
the two main points of interest are that the entropy is finite, and that it is proportional to
the area of the black hole horizon.
In view of the existence of minimum lengths and times, the finiteness of the entropy is
not surprising: it confirms the idea that matter is made of a finite number of discrete en-
tities per given volume. It also shows that these entities behave statistically: they fluctuate.
In fact, quantum gravity implies a finite entropy for any object, not only for black holes.
Ref. 1177 Bekenstein has shown that the entropy of an object is always smaller than the entropy of
3r 2 º
S(r) = k for Na Q Na Q r . (808)
2Na 2
This formula can be derived in a few lines from the properties of a random walk on a lat-
tice, using only two assumptions: the chains are extended; and they have a characteristic
internal length a given by the smallest straight segment. Expression (808) is only valid if
the polymersºare effectively infinite: in other words, if the length Na of the chain and the
elongation a N , are much larger than the radius r of the region of interest. If the chain
length is comparable to or smaller than the region of interest, one gets the usual extensive
entropy, satisfying S r 3 . Thus only flexible extended entities yield an S r 2 dependence.
º From the expression
However, there is a difficulty. º for the entropy of a black hole we
deduce that the elongation a N is given by a N l Pl ; thus it is much smaller than
Copyright © Christoph Schiller November 1997–January 2007
the radius of a general macroscopic black hole, which can have a diameter of several
kilometres. On the other hand, the formula for long entities is only valid when the chains
are longer than the distance r between the end points.
This difficulty disappears when we remember that space near a black hole is strongly
curved. All lengths have to be measured in the same coordinate system. It is well known
that for an outside observer, any object of finite size falling into a black hole seems to cover
the complete horizon for long times (whereas for an observer attached to the object it falls
Ref. 1160 into the hole in its original size). In short, an extended entity can have a proper length of
Ref. 1179 * The result can be derived from quantum statistics alone. However, this derivation does not yield the pro-
portionality coefficient.
1138 xi general rel ativity vs. quantum mechanics • the shape of points
Planck size but still, when seen by an outside observer, be as long as the horizon of the
black hole. We thus find that black holes are made of extended entities.
Another viewpoint can confirm this result. Entropy is (proportional to) the number
of yes-or-no questions needed to know the exact state of the system. This view of black
holes has been introduced by Gerard ’t Hooft. If a system is defined by its surface, as a
black hole is, its components must be extended.
Finally, imagining black holes as made of extended entities is also consistent with the
so-called no-hair theorem: black holes’ properties do not depend on what falls into them,
as all matter and radiation particles are made of the same extended components. The
final state only depends on the number of entities. In short, the entropy formula for a
black hole is consistent with the idea that it is made of a big tangle of extended entities,
fluctuating in shape.
[x, y] = x y − yx = 0 (809)
between any two points with coordinates x and y, making them bosons. But at Planck
scales, because of the existence of minimal distances and areas, this relation must at least
be changed to
[x, y] = l Pl
2
+ ... . (810)
This means that ‘points’ are neither bosons nor fermions.* They have more complex ex-
change properties. In fact, the term on the right-hand side will be energy-dependent, to
Ref. 1157 an increasing extent as we approach Planck scales. In particular, as we have seen, gravity
implies that double exchange does not lead back to the original situation at Planck scales.
Entities obeying this or similar relations have been studied in mathematics for many
Ref. 1182 decades: they are called braids. Thus at Planck scales space-time is not made of points,
Copyright © Christoph Schiller November 1997–January 2007
but of braids or their generalizations. Again, quantum theory and general relativity taken
Ref. 1182 together show that the vacuum must be made of extended entities.
Particles behave in a similar way. We know that at low, everyday energies, particles
of the same type are identical. Experiments sensitive to quantum effects show that there
is no way to distinguish them: any system of several identical particles has permutation
symmetry. On the other hand, we know that at Planck energy all low-energy symmetries
disappear. We also know that at Planck energy permutation cannot be carried out, as it
implies exchanging positions of two particles. At Planck energy, nothing can be distin-
guished from vacuum; thus no two entities can be shown to have identical properties.
* The same reasoning applies to the fermionic or Grassmann coordinates used in supersymmetry.
argument 7: the meaning of spin 1139
At Planck energies this cannot be correct. Quantum gravity effects modify the right-hand
side: they add an energy-dependent term, which is negligible at experimentally accessible
energies but which becomes important at Planck energy. We know from our experience
Ref. 1157 with Planck scales that, in contrast to everyday life, exchanging particles twice cannot
lead back to the original situation. A double exchange at Planck energy cannot have no
effect, because at Planck energy such statements are impossible. The simplest extension
of the commutation relation (811) for which the right-hand side does not vanish is braid
space under consideration – has the same properties as a spin 1/2 particle. It is a fam-
ous exercise to show that such a model, shown in Figure 415, is indeed invariant under
4π rotations but not under 2π rotations, and that two such particles get entangled when
exchanged, but get untangled when exchanged twice. This model has all the properties
of spin 1/2 particles, independently of the precise structure of the central region, which
remains unknown at this point. The tail model even has the same problems with highly
curved space as real spin 1~2 particles have. We will explore this idea in detail shortly.
* With a flat (or other) background, it is possible to define a local energy–momentum tensor. Thus particles
can be defined. Without a background, this is not possible, and only global quantities can be defined.
Without a background, even particles cannot be defined. Therefore, in this section we assume that we have
a slowly varying space-time background.
1140 xi general rel ativity vs. quantum mechanics • the shape of points
flexible bands
in unspecified
number
particle
reaching the
position
border
of space
J=1/2
Current research
To understand is to perceive patterns.
Ref. 1167
“
The Greeks deduced the existence of atoms from the fact that fish can swim through
Isaiah Berlin
water. They argued that only if water is made of atoms could a fish make its way through
”
it, by pushing the atoms aside. We can ask a similar question of a particle flying through a
vacuum: why is it able to do so? A vacuum cannot be a fluid or a solid composed of small
entities, as its dimensionality would not then be fixed. Only one possibility remains: both
vacuum and particles are made of a web of extended entities.
The idea of describing matter as composed of extended entities dates from the 1960s.
That of describing nature as composed of ‘infinitely’ extended entities dates from the Copyright © Christoph Schiller November 1997–January 2007
1980s. In addition to the arguments presented so far, current research provides several
other approaches that arrive at the same conclusion.
**
Bosonization, the construction of fermions using an infinite number of bosons, is a cent-
Ref. 1184 ral property of modern unification attempts. It also implies coupling duality, and thus the
extension of fundamental constituents.
**
String theory, and in particular its generalization to membranes, is explicitly based on
Ref. 1185 extended entities, as the name implies. The fundamental entities are indeed assumed to
reach the limits of space-time.
current research 1141
**
Ref. 1186 Research into quantum gravity – in particular the study of spin networks and spin foams
– has shown that the vacuum must be thought of as a collection of extended entities.
**
In the 1990s, Dirk Kreimer showed that high-order QED diagrams are related to knot
Ref. 1187 theory. He thus proved that extension arrives by the back door even when electromagnet-
ism is described in terms of point particles.
**
A recent discovery in particle physics, ‘holography’, connects the surface and volume of
Ref. 1188 physical systems at high energy. Even if it were not part of string theory, it would still
imply extended entities.
Eeveryday EPl
This section has already provided enough food for thought, but here is some more.
”
**
Challenge 1550 n Would a detector able to detect Planck-mass particles have to be be of infinite size? What
about a detector able to detect a particle moving with Planck energy?
**
Challenge 1551 e Can you provide an argument against the idea of extended entities? If so, publish it (and
email it to the author).
**
Does duality imply that the cosmic background fluctuations (at the origin of galaxies and Copyright © Christoph Schiller November 1997–January 2007
**
Duality also implies that in some sense large and small masses are equivalent. A mass
1144 xi general rel ativity vs. quantum mechanics • the shape of points
2
m in a radius r is equivalent to a mass m Pl ~m in a radius l Pl
2
~r. In other words, duality
transforms mass density from ρ to ρ Pl ~ρ. Vacuum and maximum density are equivalent.
2
**
Total symmetry and space-time duality together imply that there is a symmetry between
Challenge 1555 n all values an observable can take. Do you agree?
**
**
Any description is a mapping from nature to mathematics, i.e. from observed differences
(and relations) to thought differences (and relations). How can we do this accurately, if
Challenge 1557 n differences are only approximate? Is this the end of physics?
**
Challenge 1558 d What do extended entities imply for the big bang?
**
Can you show that going to high energies or selecting a Planck-size region of space-time
Challenge 1559 d is equivalent to visiting the big bang?
**
One needs a description for the expansion of the universe in terms of extended entities.
Ref. 1190, Ref. 1191 Some approaches are currently being explored; no final conclusions can be drawn yet.
Challenge 1560 ny Can you speculate about the solution?
Copyright © Christoph Schiller November 1997–January 2007
Many efforts towards unification advance by digging deeper and deeper into the details of ”
quantum field theory and general relativity. We have taken the opposite approach: we took
a step back and looked at the general picture. We found several arguments, all leading to
the same conclusion: space-time points and point particles are made of extended entities.
Somehow it seems that the universe is best described by a fluctuating, many-branched
entity, a cross between a giant amoeba and a heap of worms. Another analogy is a pot
* ‘We must know, we will know.’ This was Hilbert’s famous personal credo.
current research 1145
of boiling and branched spaghetti. Such a model of quantum geometry is beautiful and
simple, and these two criteria are often taken as an indication, before any experimental
tests, of the correctness of a description. We have toured topics such as the existence of
Planck limits, three-dimensionality, curvature, renormalization, spin, bosonization, the
cosmological constant problem, and the search for a ‘background-free’ description of
Ref. 1192 nature. We will study and test specific details of the model in the next section. Each of
these tests concerns one of only three possible topics: the construction of space-time, the
construction of particles, and the construction of the universe. These are the only issues
remaining on our ascent of Motion Mountain. We will discuss them in the next section.
Before we do so, we enjoy two small thoughts.
A physical aphorism
To ‘show’ that we are not far from the top of Motion Mountain, we give a less serious
Ref. 1194, Ref. 1195 argument as final curiosity. Salecker and Wigner, and then Zimmerman, formulated the
fundamental» limit for the measurement precision τ attainable by a clock of mass M. It is
given by τ = ħT~Mc 2 , where T is the time to be measured. We can then ask what time
T can be measured with a precision of a Planck time t Pl , given a clock of the mass of the
Copyright © Christoph Schiller November 1997–January 2007
String theory, superstring theory or membrane theory – all are names for related ap- ”
proaches – is the most investigated theory ever. String theory explores the idea of particles
as extended entities. The theory contains a maximum speed, a minimum action and a
maximum force. It thus incorporates special relativity, quantum theory and general re-
lativity.
In its attempt to achieve a unified description, string theory uses four ideas that go bey-
ond standard general relativity and quantum theory: extension, supersymmetry, higher
dimensions, and dualities.
— String theory describes particles as extended entities. In a further generalization, M-
electromagnetic field. It is thus an extension – and retains all the essential points – of
quantum field theory.
String theory has many large symmetries, which arise from its many dualities. These
symmetries connect many situations that seem intuitively to be radically different: this
makes the theory fascinating but also difficult to picture. Finally, in string theory there
are special cancellations of anomalies and inconsistencies. Historically, the first example
was the Green–Schwarz anomaly cancellation; but string theory also solves many other
mathematical problems of quantum field theory.
In short, string theorists have shown that general relativity and quantum field theory
can be unified using extended entities.
strings and membranes – why string theory is so difficult 1147
In other words, M-theory meets all our conceptual requirements for a unified theory. Let
us now turn to experiment.
One of the main results of QCD, the theory of strong interactions, is the explanation of
Ref. 1197 mass relations such as
* The history of string theory was characterized by short periods of excitement followed by long periods
of disappointment. The main reason for this ineffective evolution was that most researchers only studied
topics that everybody else was also studying. Young researchers’ fear of unemployment and the general fear
of missing something out meant that people were reluctant to research topics that nobody else was looking
at. Thus string theory had an extremely difficult birth.
1148 xi general rel ativity versus quantum mechanics • string theory
Here, the value of the coupling constant α unif is taken at the unifying energy. This energy
is known to be about a factor of 800 below the Planck energy. In other words, a general
understanding of masses of bound states of the strong interaction, such as the proton,
requires little more than a knowledge of the unification energy and the coupling constant
at that energy. The approximate value α unif = 1~25 is an extrapolation from the low-energy
value, based on experimental data.
Any unified theory must allow one to calculate the coupling constants as a function
of energy – including the value α unif at the unification energy itself. At present, most
researchers regard the search for the vacuum state as the main difficulty facing string
theory and M-theory. Without knowledge of the vacuum state, no calculations of the
masses and coupling constants are possible.
The vacuum state of string theory or M-theory is expected to be one of an extremely
involved set of topologically distinct manifolds. It is estimated that there are around
10750250 candidate vacuum states. (One notes that the error in this estimate alone is as
Outlook
It is estimated that 10 000 man-years have been invested in the search for string theory:
compare this with about 5 for Maxwell’s electrodynamics, 10 for general relativity, and a
similar number for the foundation of quantum theory.
Historically, the community of string theorists took over 10 years to understand that
strings were not the basic entities of string theory. The fundamental entities are mem-
branes. Then string theorists took another 10 years, and more, to understand that mem-
branes are not the most practical fundamental entities for calculation. However, the
search for the most practical entities is still ongoing. It is probable that knotted mem-
branes must be taken into consideration.
Copyright © Christoph Schiller November 1997–January 2007
Biblio graphy
1166 See for example the speculative model of vacuum as composed of Planck-size spheres pro-
posed by F. Winterberg, Zeitschrift für Naturforschung 52a, p. 183, 1997. Cited on page
1128.
1167 The Greek salt-and-water argument and the fish argument are given by Lucretius, in full
Titus Lucretius Carus, De natura rerum, c. 60 bce. Cited on pages 1129 and 1140.
1168 Cited in Jean-Paul Dumont, Les écoles présocratiques, Folio Essais, Gallimard, p. 379,
1991. Cited on page 1131.
1169 D. Olive & C. Montonen, Magnetic monopoles as gauge particles, Physics Letters 72B,
pp. 117–120, 1977. Cited on page 1131.
1170 J.H. Schwarz, The second superstring revolution,, Colloquium-level lecture presented
at the Sakharov Conference in Moscow, May 1996, http://arxiv.org/abs/hep-th/9607067.
Cited on pages 1130 and 1132.
bibliography 1151
1171 This famous quotation is found at the beginning of chapter XI, ‘The Physical Universe’, in
Arthur Eddington, The Philosophy of Physical Science, Cambridge, 1939. Cited on
page 1133.
1172 Pl ato, Parmenides, c. 370 bce. It has been translated into most languages. Reading it
aloud, like a song, is a beautiful experience. A pale reflection of these ideas is Bohm’s
concept of ‘unbroken wholeness’. Cited on page 1134.
1173 P. Gibbs, Event-symmetric physics, http://arxiv.org/abs/hep-th/9505089; see also his web-
site http://www.weburbia.com/pg/contents.htm. Cited on page 1134.
1174 J.D. Bekenstein, Black holes and entropy, Physical Review D 7, pp. 2333–2346, 1973.
Cited on page 1136.
1175 S.W. Hawking, Particle creation by black holes, Communications in Mathematical Phys-
ics 43, pp. 199–220, 1975; see also S.W. Hawking, Black hole thermodynamics, Physical
Review D 13, pp. 191–197, 1976. Cited on page 1136.
1176 J.B. Hartle & S.W. Hawking, Path integral derivation of black hole radiance, Physical
mer School: Unity from Duality: Gravity, Gauge Theory and Strings (Les Houches, France,
2001), Springer Verlag, 76, pp. 241–322, 2002. Cited on page 1146.
1197 F. Wilczek, Getting its from bits, Nature 397, pp. 303–306, 1999. Cited on page 1147.
1198 See the well-known blog by Peter Woit, Not even wrong, at http://www.math.columbia.
edu/~woit/blog. Cited on page 1148.
Motion Mountain – The Adventure of Physics available free of charge at www.motionmountain.net Copyright © Christoph Schiller November 1997–January 2007
– CS – this chapter will appear in the near future – CS –
(NOT YET AVAI L ABLE)
UNI FIC ATION
C h a p t e r XII
Motion Mountain – The Adventure of Physics available free of charge at www.motionmountain.net Copyright © Christoph Schiller November 1997–January 2007
– CS – this chapter will appear in the near future – CS –
THE TOP OF THE MOUNTAI N
(NOT YET AVAI L ABLE)
C h a p t e r XIII
Fourth Part
Appendices
N
ewly introduced and defined concepts in this text are indicated by italic typeface.
ew definitions are also referred to in the index by italic page numbers. We
naturally use SI units throughout the text; these are defined in Appendix B. Exper-
imental results are cited with limited precision, usually only two digits, as this is almost
“ without pleasure.
Samuel Johnson
Books are collections of symbols. Writing was probably invented between 3400 and 3300 ”
bce by the Sumerians in Mesopotamia (though other possibilities are also discussed). It
then took over a thousand years before people started using symbols to represent sounds
instead of concepts: this is the way in which the first alphabet was created. This happened
between 2000 and 1600 bce (possibly in Egypt) and led to the Semitic alphabet. The use
of an alphabet had so many advantages that it was quickly adopted in all neighbouring
cultures, though in different forms. As a result, the Semitic alphabet is the forefather of
all alphabets used in the world.
This text is written using the Latin alphabet. At first sight, this seems to imply that its
Copyright © Christoph Schiller November 1997–January 2007
The letter G was added in the third century bce by the first Roman to run a fee-paying
school, Spurius Carvilius Ruga. He added a horizontal bar to the letter C and substituted
the letter Z, which was not used in Latin any more, for this new letter. In the second
century bce, after the conquest of Greece, the Romans included the letters Y and Z from
the Greek alphabet at the end of their own (therefore effectively reintroducing the Z) in
”
Copyright © Christoph Schiller November 1997–January 2007
TA B L E 80 The ancient and classical Greek alphabets, and the correspondence with Latin and Indian
digits
The regional archaic letters yot, sha and san are not included in the table. The letter san was the ancestor of
sampi.
1. Only if before velars, i.e. before kappa, gamma, xi and chi.
2. ‘Digamma’ is the name used for the F-shaped form. It was mainly used as a letter (but also sometimes, in its
lower-case form, as a number), whereas the shape and name ‘stigma’ is used only for the number. Both names
were derived from the respective shapes; in fact, the stigma is a medieval, uncial version of the digamma.
The name ‘stigma’ is derived from the fact that the letter looks like a sigma with a tau attached under it –
though unfortunately not in all modern fonts. The original letter name, also giving its pronunciation, was
‘waw’.
3. The version of qoppa that looks like a reversed and rotated z is still in occasional use in modern Greek.
Unicode calls this version ‘koppa’.
4. The second variant of sigma is used only at the end of words.
5. Uspilon corresponds to ‘u’ only as the second letter in diphthongs.
6. In older times, the letter sampi was positioned between pi and qoppa.
bet. Still later, accents, subscripts and breathings were introduced. Table 80 also gives the
values signified by the letters took when they were used as numbers. For this special use,
the obsolete ancient letters were kept during the classical period; thus they also acquired
lower-case forms.
The Latin correspondence in the table is the standard classical one, used for writing
Greek words. The question of the correct pronunciation of Greek has been hotly debated
in specialist circles; the traditional Erasmian pronunciation does not correspond either
to the results of linguistic research, or to modern Greek. In classical Greek, the sound that
sheep make was βη–βη. (Erasmian pronunciation wrongly insists on a narrow η; modern
Greek pronunciation is different for β, which is now pronounced ‘v’, and for η, which is
now pronounced as ‘i’ – a long ‘i’.) Obviously, the pronunciation of Greek varied from
region to region and over time. For Attic Greek, the main dialect spoken in the classical
period, the question is now settled. Linguistic research has shown that chi, phi and theta
were less aspirated than usually pronounced in English and sounded more like the initial
² daleth d 4
etc.
* The Greek alphabet is also the origin of the Gothic alphabet, which was defined in the fourth century by
Wulfila for the Gothic language, using also a few signs from the Latin and futhorc scripts.
The Gothic alphabet is not to be confused with the so-called Gothic letters, a style of the Latin alphabet
used all over Europe from the eleventh century onwards. In Latin countries, Gothic letters were replaced in
the sixteenth century by the Antiqua, the ancestor of the type in which this text is set. In other countries,
Gothic letters remained in use for much longer. They were used in type and handwriting in Germany until
1941, when the National Socialist government suddenly abolished them, in order to comply with popular
demand. They remain in sporadic use across Europe. In many physics and mathematics books, Gothic letters
are used to denote vector quantities.
1162 a notation and conventions
Novem figure indorum he sunt 9 8 7 6 5 4 3 2 1. Cum his itaque novem fig- Copyright © Christoph Schiller November 1997–January 2007
* A well-designed website on the topic is http://www.omniglot.com. The main present and past writing
systems are encoded in the Unicode standard, which at present contains 52 writing systems. See http://
www.unicode.org.
** The story of the development of the numbers is told most interestingly by G. Ifrah, Histoire universelle
des chiffres, Seghers, 1981, which has been translated into several languages. He sums up the genealogy in ten
beautiful tables, one for each digit, at the end of the book. However, the book contains many factual errors,
as explained in the http://www.ams.org/notices/200201/rev-dauben.pdf and http://www.ams.org/notices/
200202/rev-dauben.pdf review.
It is not correct to call the digits 0 to 9 Arabic. Both the actual Arabic digits and the digits used in Latin
texts such as this one derive from the Indian digits. Only the digits 0, 2, 3 and 7 resemble those used in
Arabic writing, and then only if they are turned clockwise by 90°.
*** Leonardo di Pisa, called Fibonacci (b. c. 1175 Pisa, d. 1250 Pisa), Italian mathematician, and the most
important mathematician of his time.
a notation and conventions 1163
uris, et cum hoc signo 0, quod arabice zephirum appellatur, scribitur quilibet
numerus, ut inferius demonstratur.*
The Indian method of writing numbers consists of a large innovation, the positional sys-
tem, and a small one, the digit zero. The positional system, as described by Fibonacci, was
so much more efficient that it completely replaced the previous Roman number system,
which writes 1996 as IVMM or MCMIVC or MCMXCVI, as well as the Greek number system,
in which the Greek letters were used for numbers in the way shown in Table 80, thus
writing 1996 as ͵αϠϞϚʹ. Compared to these systems, the Indian numbers are a much bet-
ter technology. Indeed, the Indian system proved so practical that calculations done on
paper completely eliminated the need for the abacus, which therefore fell into disuse. The
abacus is still in use only in those countries which do not use a positional system to write
numbers. (The Indian system also eliminated the need fir systems to represent numbers
with fingers. Such systems, which could show numbers up to 10 000 and more, have left
Besides text and numbers, physics books contain other symbols. Most symbols have been
developed over hundreds of years, so that only the clearest and simplest are now in use.
In this mountain ascent, the symbols used as abbreviations for physical quantities are all
”
taken from the Latin or Greek alphabets and are always defined in the context where
they are used. The symbols designating units, constants and particles are defined in Ap-
pendices B and C. There is an international standard for them (ISO 31), but it is virtually
Ref. 1204 inaccessible; the symbols used in this text are those in common use.
The mathematical symbols used in this text, in particular those for operations and
relations, are given in the following list, together with their origins. The details of their
Copyright © Christoph Schiller November 1997–January 2007
* ‘The nine figures of the Indians are: 9 8 7 6 5 4 3 2 1. With these nine figures, and with this sign 0 which in
Arabic is called zephirum, any number can be written, as will be demonstrated below.’
** Currently, the shortest time for finding the thirteenth (integer) root of a hundred-digit number, a result
with 8 digits, is 11.8 seconds. For more about the stories and the methods of calculating prodigies, see the
fascinating book by Steven B. Smith, The Great Mental Calculators – The Psychology, Methods and Lives
of the Calculating Prodigies, Columbia University Press, 1983. The book also presents the techniques that
they use, and that anybody else can use to emulate them.
*** Robert Recorde (c. 1510–1558), English mathematician and physician; he died in prison, though not for
Ref. 1206 his false pretence to be the inventor of the ‘equal’ sign, which he took over from his Italian colleagues, but
for a smaller crime, namely debt. The quotation is from his The Whetstone of Witte, 1557. An image showing
the quote in manuscript can be found at the http://members.aol.com/jeff94100/witte.jpg website.
1164 a notation and conventions
Other signs used here have more complicated origins. The & sign is a contraction of Latin
Ref. 1207 et meaning ‘and’, as is often more clearly visible in its variations, such as &, the common
italic form.
Each of the punctuation signs used in sentences with modern Latin alphabets, such
as , . ; : ! ? ‘ ’ » « – ( ) ... has its own history. Many are from ancient Greece, but the
Ref. 1208 question mark is from the court of Charlemagne, and exclamation marks appear first in
the sixteenth century.* The @ or at-sign probably stems from a medieval abbreviation of
Ref. 1209 Latin ad, meaning ‘at’, similarly to how the & sign evolved from Latin et. In recent years,
the smiley :-) and its variations have become popular. The smiley is in fact a new version of
the ‘point of irony’ which had been formerly proposed, without success, by A. de Brahm
(1868–1942).
The section sign § dates from the thirteenth century in northern Italy, as was shown
by the German palaeographer Paul Lehmann. It was derived from ornamental versions
Calendars
The many ways to keep track of time differ greatly from civilization to civilization. The
most common calendar, and the one used in this text, is also one of the most absurd, as
it is a compromise between various political forces who tried to shape it.
In ancient times, independent localized entities, such as tribes or cities, preferred lunar
calendars, because lunar timekeeping is easily organized locally. This led to the use of the
month as a calendar unit. Centralized states imposed solar calendars, based on the year.
Solar calendars require astronomers, and thus a central authority to finance them. For
Copyright © Christoph Schiller November 1997–January 2007
various reasons, farmers, politicians, tax collectors, astronomers, and some, but not all,
religious groups wanted the calendar to follow the solar year as precisely as possible. The
compromises necessary between days and years are the origin of leap days. The com-
promises necessary between months and year led to the varying lengths are different in
different calendars. The most commonly used year–month structure was organized over
2000 years ago by Gaius Julius Ceasar, and is thus called the Julian calendar.
The system was destroyed only a few years later: August was lengthened to 31 days
when it was named after Augustus. Originally, the month was only 30 days long; but in
order to show that Augustus was as important as Caesar, after whom July is named, all
* On the parenthesis see the beautiful book by J. L ennard, But I Digress, Oxford University Press, 1991.
1166 a notation and conventions
month lengths in the second half of the year were changed, and February was shortened
by an additional day.
The week is an invention of Babylonia, from where it was spread through the world
by various religious groups. (The way astrology and astronomy cooperated to determine
Page 147 the order of the weekdays is explained in the section on gravitation.) Although it is about
three thousand years old, the week was fully included into the Julian calendar only around
the year 300, towards the end of the Western Roman Empire. The final change in the
Julian calendar took place between 1582 and 1917 (depending on the country), when more
precise measurements of the solar year were used to set a new method to determine leap
days, a method still in use today. Together with a reset of the date and the fixation of
the week rhythm, this standard is called the Gregorian calendar or simply the modern
calendar. It is used by a majority of the world’s population.
Despite its complexity, the modern calendar does allow you to determine the day of
the week of a given date in your head. Just execute the following six steps:
* Remembering the intermediate result for the current year can simplify things even more, especially since
the dates 4.4, 6.6, 8.8, 10.10, 12.12, 9.5, 5.9, 7.11, 11.7 and the last day of February all fall on the same day of
the week, namely on the year’s intermediate result plus 4.
** The present counting of years was defined in the Middle Ages by setting the date for the foundation of
Rome to the year 753 bce, or 753 before the Common Era, and then counting backwards, so that the bce
years behave almost like negative numbers. However, the year 1 follows directly after the year 1 bce: there
Copyright © Christoph Schiller November 1997–January 2007
was no year 0.
Some other standards set by the Roman Empire explain several abbreviations used in the text:
- c. is a Latin abbreviation for circa and means ‘roughly’;
- i.e. is a Latin abbreviation for id est and means ‘that is’;
- e.g. is a Latin abbreviation for exempli gratia and means ‘for the sake of example’;
- ibid. is a Latin abbreviation for ibidem and means ‘at that same place’;
- inf. is a Latin abbreviation for infra and means ‘(see) below’;
- op. cit. is a Latin abbreviation for opus citatum and means ‘the cited work’;
- et al. is a Latin abbreviation for et alii and means ‘and others’.
By the way, idem means ‘the same’ and passim means ‘here and there’ or ‘throughout’. Many terms used in
physics, like frequency, acceleration, velocity, mass, force, momentum, inertia, gravitation and temperature,
are derived from Latin. In fact, it is arguable that the language of science has been Latin for over two thousand
years. In Roman times it was Latin vocabulary with Latin grammar, in modern times it switched to Latin
vocabulary with French grammar, then for a short time to Latin vocabulary with German grammar, after
a notation and conventions 1167
The EPR paradox in the Bohm formulation can perhaps be resolved using the GRW
approach, using the WKB approximation of the Schrödinger equation.
Using such vocabulary is the best way to make language unintelligible to outsiders. First
of all, it uses abbreviations, which is a shame. On top of this, the sentence uses people’s
names to characterize concepts, i.e. it uses eponyms. Originally, eponyms were intended
as tributes to outstanding achievements. Today, when formulating radical new laws or
variables has become nearly impossible, the spread of eponyms intelligible to a steadily
decreasing number of people simply reflects an increasingly ineffective drive to fame.
Biblio graphy
1199 For a clear overview of the various sign conventions in general relativity, see the front cover
of Charles W. Misner, Kip S. Thorne & John A. Wheeler, Gravitation, Free-
man, 1973. We use the gravitational sign conventions of Hans C. Ohanian & R emo
Ruffini, Gravitazione e spazio-tempo, Zanichelli, 1997. Cited on page 1158.
1200 The first written record of the letter U seems to be Leon Battista Alberti, Grammat-
ica della lingua toscana, 1442, the first grammar of a modern (non-latin) language, written
by a genius that was intellectual, architect and the father of cryptology. The first written re-
cord of the letter J seems to be Antonio de Nebrija, Gramática castellana, 1492. Before
writing it, Nebrija lived for ten years in Italy, so that it is possible that the I/J distinction is
of Italian origin as well. Nebrija was one of the most important Spanish scholars. Cited on
page 1159.
1201 For more information about the letters thorn and eth, have a look at the extensive report
History of Mathematical Notations, 2 volumes, The Open Court Publishing Co., 1928–1929.
The square root sign is used in Christoff Rud olff, Die Coss, Vuolfius Cephaleus
Joanni Jung: Argentorati, 1525. (The full title was Behend vnnd Hubsch Rechnung durch
die kunstreichen regeln Algebre so gemeinlicklich die Coss genent werden. Darinnen alles so
treülich an tag gegeben, das auch allein auss vleissigem lesen on allen mündtliche vnterricht
mag begriffen werden, etc.) Cited on page 1163.
1207 J. Tschichold, Formenwamdlungen der et-Zeichen, Stempel AG, 1953. Cited on page
1165.
1208 Malcolm B. Parkes, Pause and Effect: An Introduction to the History of Punctuation
in the West, University of California Press, 1993. Cited on page 1165.
1209 This is explained by Berthold Louis Ullman, Ancient Writing and its Influence, 1932.
Cited on page 1165.
bibliography 1169
1210 Paul Lehmann, Erforschung des Mittelalters – Ausgewählte Abhandlungen und Aufsätze,
Anton Hiersemann, 1961, pp. 4–21. Cited on page 1165.
1211 Bernard Bischoff, Paläographie des römischen Altertums und des abendländischen
Mittelalters, Erich Schmidt Verlag, 1979, pp. 215–219. Cited on page 1165.
1212 The connections between Greek roots and many French words – and thus many English
ones – can be used to rapidly build up a vocabulary of ancient Greek without much study, as
shown by the practical collection by J. Chaineux, Quelques racines grecques, Wetteren
– De Meester, 1929. See also Donald M. Ayers, English Words from Latin and Greek
Elements, University of Arizona Press, 1986. Cited on page 1167.
1213 To write well, read William Strunk & E.B. White, The Elements of Style, Macmillan,
1935, 1979, or Wolf Schneider, Deutsch für Kenner – Die neue Stilkunde, Gruner und
Jahr, 1987. Cited on page 1167.
M
easurements are comparisons with standards. Standards are based on a unit.
any different systems of units have been used throughout the world.
ost standards confer power to the organization in charge of them. Such power
can be misused; this is the case today, for example in the computer industry, and was so
conductors of infinite length, of negligible circular cross-section, and placed 1 metre apart
in vacuum, would produce between these conductors a force equal to 2 ë 10−7 newton per
metre of length.’ (1948)
‘The kelvin, unit of thermodynamic temperature, is the fraction 1/273.16 of the ther-
modynamic temperature of the triple point of water.’ (1967)*
‘The mole is the amount of substance of a system which contains as many elementary
entities as there are atoms in 0.012 kilogram of carbon 12.’ (1971)*
‘The candela is the luminous intensity, in a given direction, of a source that emits
monochromatic radiation of frequency 540 ë 1012 hertz and has a radiant intensity in that
direction of (1/683) watt per steradian.’ (1979)*
* The respective symbols are s, m, kg, A, K, mol and cd. The international prototype of the kilogram is a
b units, measurements and constants 1171
Note that both time and length units are defined as certain properties of a standard
example of motion, namely light. This is an additional example making the point that the
observation of motion as the fundamental type of change is a prerequisite for the defin-
ition and construction of time and space. By the way, the use of light in the definitions
had been proposed already in 1827 by Jacques Babinet.*
From these basic units, all other units are defined by multiplication and division. Thus,
all SI units have the following properties:
SI units form a system with state-of-the-art precision: all units are defined with a pre-
cision that is higher than the precision of commonly used measurements. Moreover, the
precision of the definitions is regularly being improved. The present relative uncertainty
of the definition of the second is around 10−14 , for the metre about 10−10 , for the kilogram
about 10−9 , for the ampere 10−7 , for the mole less than 10−6 , for the kelvin 10−6 and for
the candela 10−3 .
SI units form an absolute system: all units are defined in such a way that they can
platinum–iridium cylinder kept at the BIPM in Sèvres, in France. For more details on the levels of the caesium
Ref. 1215 atom, consult a book on atomic physics. The Celsius scale of temperature θ is defined as: θ~°C = T~K−273.15;
note the small difference with the number appearing in the definition of the kelvin. SI also states: ‘When the
mole is used, the elementary entities must be specified and may be atoms, molecules, ions, electrons, other
particles, or specified groups of such particles.’ In the definition of the mole, it is understood that the carbon
12 atoms are unbound, at rest and in their ground state. In the definition of the candela, the frequency of the
light corresponds to 555.5 nm, i.e. green colour, around the wavelength to which the eye is most sensitive.
* Jacques Babinet (1794–1874), French physicist who published important work in optics.
1172 b units, measurements and constants
Name A b b r e v i at i o n Name A b b r e v i at i o n
We note that in all definitions of units, the kilogram only appears to the powers of 1, 0
SI units form a complete system: they cover in a systematic way the complete set of
observables of physics. Moreover, they fix the units of measurement for all other sciences
* Some of these names are invented (yocto to sound similar to Latin octo ‘eight’, zepto to sound similar
check the greek to Latin septem, yotta and zetta to resemble them, exa and peta to sound like the Greek words ἑξάκις and
check the greek πεντάκις for ‘six times’ and ‘five times’, the unofficial ones to sound similar to the Greek words for nine, ten,
eleven and twelve); some are from Danish/Norwegian (atto from atten ‘eighteen’, femto from femten ‘fifteen’);
some are from Latin (from mille ‘thousand’, from centum ‘hundred’, from decem ‘ten’, from nanus ‘dwarf ’);
some are from Italian (from piccolo ‘small’); some are Greek (micro is from µικρός ‘small’, deca/deka from
δέκα ‘ten’, hecto from ἑκατόν ‘hundred’, kilo from χίλιοι ‘thousand’, mega from µέγας ‘large’, giga from γίγας
‘giant’, tera from τέρας ‘monster’).
Translate: I was caught in such a traffic jam that I needed a microcentury for a picoparsec and that my
Challenge 1563 e car’s fuel consumption was two tenths of a square millimetre.
b units, measurements and constants 1173
as well.
SI units form a universal system: they can be used in trade, in industry, in commerce,
at home, in education and in research. They could even be used by extraterrestrial civil-
izations, if they existed.
SI units form a coherent system: the product or quotient of two SI units is also an SI
unit. This means that in principle, the same abbreviation, e.g. ‘SI’, could be used for every
unit.
The SI units are not the only possible set that could fulfil all these requirements, but
they are the only existing system that does so.*
Remember that since every measurement is a comparison with a standard, any meas-
urement requires matter to realize the standard (yes, even for the speed standard), and
radiation to achieve the comparison. The concept of measurement thus assumes that mat-
Page 1123 ter and radiation exist and can be clearly separated from each other.
Name Definition Va l u e
¼
Basic units
= = 1.616 0(12) ë 10−35 m
¼
the Planck length l Pl ħG~c 3
= = 5.390 6(40) ë 10−44 s
¼
the Planck time t Pl ħG~c 5
= =
¼
the Planck mass m Pl ħc~G 21.767(16) µg
= c 6 ~G = 3.479 3(22) ë 1025 A
¼
the Planck current I Pl 4πε 0
= ħc 5 ~Gk 2 =
Copyright © Christoph Schiller November 1997–January 2007
* Most non-SI units still in use in the world are of Roman origin. The mile comes from milia passum, which
used to be one thousand (double) strides of about 1480 mm each; today a nautical mile, once defined as
minute of arc on the Earth’s surface, is exactly 1852 m). The inch comes from uncia/onzia (a twelfth – now of
a foot). The pound (from pondere ‘to weigh’) is used as a translation of libra – balance – which is the origin
of its abbreviation lb. Even the habit of counting in dozens instead of tens is Roman in origin. These and all
other similarly funny units – like the system in which all units start with ‘f ’, and which uses furlong/fortnight
as its unit of velocity – are now officially defined as multiples of SI units.
** The natural units x Pl given here are those commonly used today, i.e. those defined using the constant
ħ, and not, as Planck originally did, by using the constant h = 2πħ. The electromagnetic units can also be
defined with other factors than 4πε 0 in the expressions: for example, using 4πε 0 α, with the fine structure
constant α, gives q Pl = e. For the explanation of the numbers between brackets, the standard deviations, see
page 1180.
1174 b units, measurements and constants
Name Definition Va l u e
Trivial units
the Planck velocity v Pl = c = 0.3 Gm~s
the Planck angular momentum L Pl = ħ = 1.1 ë 10−34 Js
the Planck action S aPl = ħ = 1.1 ë 10−34 Js
the Planck entropy S ePl = k = 13.8 yJ~K
Composed units
the Planck mass density ρ Pl = c 5 ~G 2 ħ
¼ = 5.2 ë 1096 kg~m3
= ħc 5 ~G = 2.0 GJ = 1.2 ë 1028 eV
¼
the Planck energy E Pl
the Planck momentum p Pl = ħc 3 ~G = 6.5 Ns
the Planck force FPl = c 4 ~G = 1.2 ë 1044 N
= c 5 ~G =
The natural units are important for another reason: whenever a quantity is sloppily called
‘infinitely small (or large)’, the correct expression is ‘as small (or as large) as the corres-
ponding corrected Planck unit’. As explained throughout the text, and especially in the
Page 1096 third part, this substitution is possible because almost all Planck units provide, within a
correction factor of order 1, the extremal value for the corresponding observable – some
Copyright © Christoph Schiller November 1997–January 2007
an upper and some a lower limit. Unfortunately, these correction factors are not yet widely
known. The exact extremal value for each observable in nature is obtained when G is sub-
stituted by 4G, ħ by ħ~2, k by k~2 and 4πε 0 by 8πε 0 α in all Planck quantities. These
extremal values, or corrected Planck units, are the true natural units. To exceeding the
extremal values is possible only for some extensive quantities. (Can you find out which
Challenge 1565 n ones?)
1 eV 1
6
aJ (815)
which is easily remembered. The simplification c = ħ = 1 yields G = 6.9 ë 10−57 eV−2 and
allows one to use the unit eV also for mass, momentum, temperature, frequency, time and
Challenge 1567 e length, with the respective correspondences 1 eV 1.8 ë 10−36 kg 5.4 ë 10−28 Ns 242 THz
11.6 kK and 1 eV−1 4.1 fs 1.2 µm.
To get some feeling for the unit eV, the following relations are useful. Room temper-
ature, usually taken as 20°C or 293 K, corresponds to a kinetic energy per particle of
0.025 eV or 4.0 zJ. The highest particle energy measured so far belongs to a cosmic ray
Ref. 1219 with an energy of 3 ë 1020 eV or 48 J. Down here on the Earth, an accelerator able to pro-
duce an energy of about 105 GeV or 17 nJ for electrons and antielectrons has been built,
Copyright © Christoph Schiller November 1997–January 2007
* Other definitions for the proportionality constants in electrodynamics lead to the Gaussian unit system
often used in theoretical calculations, the Heaviside–Lorentz unit system, the electrostatic unit system, and
Ref. 1217 the electromagnetic unit system, among others.
** In the list, l is length, E energy, F force, E electric the electric and B the magnetic field, m mass, p momentum,
a acceleration, f frequency, I electric current, U voltage, T temperature, v speed, q charge, R resistance, P
power, G the gravitational constant.
The web page http://www.chemie.fu-berlin.de/chemistry/general/units_en.html provides a tool to con-
vert various units into each other.
Researchers in general relativity often use another system, in which the Schwarzschild radius rs = 2Gm~c 2
is used to measure masses, by setting c = G = 1. In this case, mass and length have the same dimension, and
ħ has the dimension of an area.
» »
Ref. 1218 Already in the nineteenth century, George Stoney had proposed to use as length, time and mass units
Ge 2 ~(c 4 4πε 0 ) = 1.4 ë 10−36 m, t S = Ge 2 ~(c 6 4πε 0 ) = 4.6 ë 10−45 s and m S =
»
the quantities l S =
Challenge 1566 n e ~(G4πε 0 ) = 1.9 µg. How are these units related to the Planck units?
2
1176 b units, measurements and constants
and one able to produce an energy of 10 TeV or 1.6 µJ for protons will be finished soon.
Both are owned by CERN in Geneva and have a circumference of 27 km.
The lowest temperature measured up to now is 280 pK, in a system of rhodium nuclei
Ref. 1220 held inside a special cooling system. The interior of that cryostat may even be the coolest
point in the whole universe. The kinetic energy per particle corresponding to that temper-
ature is also the smallest ever measured: it corresponds to 24 feV or 3.8 vJ = 3.8 ë 10−33 J.
For isolated particles, the record seems to be for neutrons: kinetic energies as low as
10−7 eV have been achieved, corresponding to de Broglie wavelengths of 60 nm.
**
Are you confused by the candela? The definition simply says that 683 cd = 683 lm~sr
corresponds to 1 W~sr. The candela is thus a unit for light power per (solid) angle, except
that it is corrected for the eye’s sensitivity: the candela measures only visible power per
angle. Similarly, 683 lm = 683 cd sr corresponds to 1 W. So both the lumen and the watt
measure power, or energy flux, but the lumen measures only the visible part of the power.
This difference is expressed by inserting ‘radiant’ or ‘luminous’: thus, the Watt measures
radiant flux, whereas the lumen measures luminous flux.
The factor 683 is historical. An ordinary candle emits a luminous intensity of about a
Challenge 1568 e candela. Therefore, at night, a candle can be seen up to a distance of 10 or 20 kilometres. A
100 W incandescent light bulb produces 1700 lm, and the brightest light emitting diodes
Copyright © Christoph Schiller November 1997–January 2007
about 5 lm. Cinema projectors produce around 2 Mlm, and the brightest flashes, like light-
ning, 100 Mlm.
The irradiance of sunlight is about 1300 W~m2 on a sunny day; on the other hand,
the illuminance is only 120 klm~m2 = 120 klux or 170 W~m2 . (A cloud-covered summer
day or a clear winter day produces about 10 klux.) These numbers show that most of the
energy from the Sun that reaches the Earth is outside the visible spectrum.
On a glacier, near the sea shore, on the top of a mountain, or in particular weather
condition the brightness can reach 150 klux. The brightest lamps, those used during sur-
gical operations, produce 120 klux. Humans need about 30 lux for comfortable reading.
* This story revived an old (and false) urban legend that states that only three countries in the world do not
use SI units: Liberia, the USA and Myanmar.
b units, measurements and constants 1177
Museums are often kept dark because water-based paintings are degraded by light above
100 lux, and oil paintings by light above 200 lux. The full moon produces 0.1 lux, and
Ref. 1222 the sky on a dark moonless night about 1 mlux. The eyes lose their ability to distinguish
colours somewhere between 0.1 lux and 0.01 lux; the eye stops to work below 1 nlux. Tech-
nical devices to produce images in the dark, such as night goggles, start to work at 1 µlux.
By the way, the human body itself shines with about 1 plux, a value too small to be detected
with the eye, but easily measured with specialized apparatus. The origin of this emission
is still a topic of research.
**
The highest achieved light intensities are in excess of 1018 W~m2 , more than 15 orders of
magnitude higher than the intensity of sunlight. They are produced by tight focusing of
pulsed lasers. The electric field in such light pulses is of the same order as the field inside
Ref. 1223 atoms; such a beam therefore ionizes all matter it encounters.
**
The second does not correspond to 1/86 400th of the day any more, though it did in the
year 1900; the Earth now takes about 86 400.002 s for a rotation, so that the International Copyright © Christoph Schiller November 1997–January 2007
Earth Rotation Service must regularly introduce a leap second to ensure that the Sun is at
the highest point in the sky at 12 o’clock sharp.* The time so defined is called Universal
Time Coordinate. The speed of rotation of the Earth also changes irregularly from day to
day due to the weather; the average rotation speed even changes from winter to summer
because of the changes in the polar ice caps; and in addition that average decreases over
time, because of the friction produced by the tides. The rate of insertion of leap seconds
is therefore higher than once every 500 days, and not constant in time.
* Their website at http://hpiers.obspm.fr gives more information on the details of these insertions, as does
http://maia.usno.navy.mil, one of the few useful military websites. See also http://www.bipm.fr, the site of
the BIPM.
1178 b units, measurements and constants
**
The most precisely measured quantities in nature are the frequencies of certain milli-
second pulsars,* the frequency of certain narrow atomic transitions, and the Rydberg
constant of atomic hydrogen, which can all be measured as precisely as the second is
defined. The caesium transition that defines the second has a finite linewidth that limits
the achievable precision: the limit is about 14 digits.
**
The most precise clock ever built, using microwaves, had a stability of 10−16 during a
Ref. 1225 running time of 500 s. For longer time periods, the record in 1997 was about 10−15 ; but
Ref. 1226 values around 10−17 seem within technological reach. The precision of clocks is limited for
short measuring times by noise, and for long measuring times by drifts, i.e. by systematic
effects. The region of highest stability depends on the clock type; it usually lies between
1 ms for optical clocks and 5000 s for masers. Pulsars are the only type of clock for which
**
The shortest times measured are the lifetimes of certain ‘elementary’ particles. In particu-
Ref. 1227 lar, the lifetime of certain D mesons have been measured at less than 10−23 s. Such times
are measured using a bubble chamber, where the track is photographed. Can you estimate
Challenge 1569 n how long the track is? (This is a trick question – if your length cannot be observed with
an optical microscope, you have made a mistake in your calculation.)
**
The longest times encountered in nature are the lifetimes of certain radioisotopes, over
1015 years, and the lower limit of certain proton decays, over 1032 years. These times are
thus much larger than the age of the universe, estimated to be fourteen thousand million
Ref. 1228 years.
**
The least precisely measured of the fundamental constants of physics are the gravitational
constant G and the strong coupling constant α s . Even less precisely known are the age of
Page 1183 the universe and its density (see Table 86). Copyright © Christoph Schiller November 1997–January 2007
**
The precision of mass measurements of solids is limited by such simple effects as the
adsorption of water. Can you estimate the mass of a monolayer of water – a layer with
Challenge 1570 n thickness of one molecule – on a metal weight of 1 kg?
**
Variations of quantities are often much easier to measure than their values. For example,
in gravitational wave detectors, the sensitivity achieved in 1992 was ∆l~l = 3 ë 10−19 for
* An overview of this fascinating work is given by J.H. Taylor, Pulsar timing and relativistic gravity, Philo-
sophical Transactions of the Royal Society, London A 341, pp. 117–134, 1992.
b units, measurements and constants 1179
Ref. 1229 lengths of the order of 1 m. In other words, for a block of about a cubic metre of metal
it is possible to measure length changes about 3000 times smaller than a proton radius.
These set-ups are now being superseded by ring interferometers. Ring interferometers
measuring frequency differences of 10−21 have already been built; and they are still being
Ref. 1230 improved.
**
The Swedish astronomer Anders Celsius (1701–1744) originally set the freezing point
of water at 100 degrees and the boiling point at 0 degrees. Later the scale was reversed.
Ref. 1231 However, this is not the whole story. With the official definition of the kelvin and the
degree Celsius, at the standard pressure of 1013.25 Pa, water boils at 99.974°C. Can you
Challenge 1571 n explain why it is not 100°C any more?
**
where l is the length of a foot and the correction length depends on the manufactur-
Copyright © Christoph Schiller November 1997–January 2007
ing company. In addition, the Anglosaxon formula is not valid for women and children,
where the first factor depends, for marketing reasons, both on manufacturer and size
itself. The ISO standard requires, unsurprisingly, to use foot length in millimetres.
**
The table of SI prefixes covers 72 orders of magnitude. How many additional prefixes will
be needed? Even an extended list will include only a small part of the infinite range of
possibilities. Will the Conférence Générale des Poids et Mesures have to go on forever,
Challenge 1572 n defining an infinite number of SI prefixes? Why?
**
1180 b units, measurements and constants
The French philosopher Voltaire, after meeting Newton, publicized the now famous story
that the connection between the fall of objects and the motion of the Moon was dis-
covered by Newton when he saw an apple falling from a tree. More than a century later,
just before the French Revolution, a committee of scientists decided to take as the unit
of force precisely the force exerted by gravity on a standard apple, and to name it after
the English scientist. After extensive study, it was found that the mass of the standard
apple was 101.9716 g; its weight was called 1 newton. Since then, visitors to the museum
in Sèvres near Paris have been able to admire the standard metre, the standard kilogram
and the standard apple.*
1 n
σ2 = Q(x i − x̄) ,
2
(817)
n − 1 i=1
where x̄ is the average of the measurements x i . (Can you imagine why n − 1 is used in the
Challenge 1573 n formula instead of n?)
For most experiments, the distribution of measurement values tends towards a normal
distribution, also called Gaussian distribution, whenever the number of measurements is
increased. The distribution, shown in Figure 417, is described by the expression
N(x) e−
(x− x̄)2
2σ 2 . (818)
The square σ 2 of the standard deviation is also called the variance. For a Gaussian distri-
Challenge 1574 e bution of measurement values, 2.35σ is the full width at half maximum.
Lack of accuracy is due to systematic errors; usually these can only be estimated. This
estimate is often added to the random errors to produce a total experimental error, some-
Ref. 1233 times also called total uncertainty.
The following tables give the values of the most important physical constants and
Copyright © Christoph Schiller November 1997–January 2007
particle properties in SI units and in a few other common units, as published in the stand-
Ref. 1234 ard references. The values are the world averages of the best measurements made up to
the present. As usual, experimental errors, including both random and estimated system-
atic errors, are expressed by giving the standard deviation in the last digits; e.g. 0.31(6)
means – roughly speaking – 0.31 0.06. In fact, behind each of the numbers in the follow-
ing tables there is a long story which is worth telling, but for which there is not enough
room here.**
* To be clear, this is a joke; no standard apple exists. It is not a joke however, that owners of several apple
trees in Britain and in the US claim descent, by rerooting, from the original tree under which Newton had
Ref. 1232 his insight. DNA tests have even been performed to decide if all these derive from the same tree. The result
was, unsurprisingly, that the tree at MIT, in contrast to the British ones, is a fake.
** Some of the stories can be found in the text by N.W. Wise, The Values of Precision, Princeton University
b units, measurements and constants 1181
N
number of measurements
standard deviation
x x
average value measured values
What are the limits to accuracy and precision? There is no way, even in principle,
to measure a length x to a precision higher than about 61 digits, because ∆x~x A
Challenge 1575 ny l Pl ~d horizon = 10−61 . (Is this valid also for force or for volume?) In the third part of our
Page 1056 text, studies of clocks and metre bars strengthen this theoretical limit.
But it is not difficult to deduce more stringent practical limits. No imaginable machine
can measure quantities with a higher precision than measuring the diameter of the Earth
within the smallest length ever measured, about 10−19 m; that is about 26 digits of preci-
sion. Using a more realistic limit of a 1000 m sized machine implies a limit of 22 digits.
If, as predicted above, time measurements really achieve 17 digits of precision, then they
are nearing the practical limit, because apart from size, there is an additional practical
restriction: cost. Indeed, an additional digit in measurement precision often means an
additional digit in equipment cost.
the following constants using the equations of the standard model of high-energy physics.
Q ua n t i t y Symbol Va l u e i n S I u n i t s U n c e r t. a
Press, 1994. The field of high-precision measurements, from which the results on these pages stem, is a
world on its own. A beautiful introduction to it is J.D. Fairbanks, B.S. Deaver, C.W. Everitt & P.F.
Michaelson, eds., Near Zero: Frontiers of Physics, Freeman, 1988.
1182 b units, measurements and constants
Q ua n t i t y Symbol Va l u e i n S I u n i t s U n c e r t. a
Why do all these constants have the values they have? The answer is different in each
case. For any constant having a dimension, such as the quantum of action ħ, the numerical
value has only historical meaning. It is 1.054 ë 10−34 Js because of the SI definition of the
joule and the second. The question why the value of a dimensional constant is not larger
or smaller always requires one to understand the origin of some dimensionless number
giving the ratio between the constant and the corresponding natural unit. Understanding
Copyright © Christoph Schiller November 1997–January 2007
the sizes of atoms, people, trees and stars, the duration of molecular and atomic processes,
or the mass of nuclei and mountains, implies understanding the ratios between these
values and the corresponding natural units. The key to understanding nature is thus the
understanding of all ratios, and thus of all dimensionless constants. The story of the most
important ratios is told in the third part of the adventure.
The basic constants yield the following useful high-precision observations.
Q ua n t i t y Symbol Va l u e i n S I u n i t s U n c e r t.
»
Vacuum wave resistance Z 0 = µ 0 ~ε 0 376.730 313 461 77... Ω 0
Avogadro’s number NA 6.022 141 99(47) ë 1023 7.9 ë 10−8
b units, measurements and constants 1183
Q ua n t i t y Symbol Va l u e i n S I u n i t s U n c e r t.
Some properties of nature at large are listed in the following table. (If you want a challenge,
Challenge 1576 n can you determine whether any property of the universe itself is listed?)
TA B L E 86 Astrophysical constants
Q ua n t i t y Symbol Va l u e
Q ua n t i t y Symbol Va l u e
a. Defining constant, from vernal equinox to vernal equinox; it was once used to define the second. (Remem-
ber: π seconds is about a nanocentury.) The value for 1990 is about 0.7 s less, corresponding to a slowdown
Challenge 1577 n of roughly 0.2 ms~a. (Watch out: why?) There is even an empirical formula for the change of the length of
Ref. 1235 the year over time.
b. Average distance Earth–Sun. The truly amazing precision of 30 m results from time averages of signals
sent from Viking orbiters and Mars landers taken over a period of over twenty years.
b units, measurements and constants 1185
Warning: in the third part of this text it is shown that many of the constants in Table 86
are not physically sensible quantities. They have to be taken with many grains of salt. The
more specific constants given in the following table are all sensible, though.
TA B L E 87 Astronomical constants
Q ua n t i t y Symbol Va l u e
Q ua n t i t y Symbol Va l u e
a. The shape of the Earth is described most precisely with the World Geodetic System. The last edition dates
from 1984. For an extensive presentation of its background and its details, see the http://www.wgs84.com
Useful numbers
π 3.14159 26535 89793 23846 26433 83279 50288 41971 69399 375105
e 2.71828 18284 59045 23536 02874 71352 66249 77572 47093 699959
γ 0.57721 56649 01532 86060 65120 90082 40243 10421 59335 939923
Ref. 1236
ln 2 0.69314 71805 59945 30941 72321 21458 17656 80755 00134 360255
º
ln 10 2.30258 50929 94045 68401 79914 54684 36420 76011 01488 628772
10 3.16227 76601 68379 33199 88935 44432 71853 37195 55139 325216
If the number π is normal, i.e. if all digits and digit combinations in its decimal expansion
appear with the same limiting frequency, then every text ever written or yet to be written,
as well as every word ever spoken or yet to be spoken, can be found coded in its sequence. Copyright © Christoph Schiller November 1997–January 2007
The property of normality has not yet been proven, although it is suspected to hold. Does
this mean that all wisdom is encoded in the simple circle? No. The property is nothing
special: it also applies to the number 0.123456789101112131415161718192021... and many
Challenge 1578 n others. Can you specify a few examples?
By the way, in the graph of the exponential function ex , the point (0, 1) is the only
point with two rational coordinates. If you imagine painting in blue all points on the
plane with two rational coordinates, the plane would look quite bluish. Nevertheless, the
graph goes through only one of these points and manages to avoid all the others.
bibliography 1187
Biblio graphy
[...] moi, qui trouve toujours tous les livres trop
”
1214 Le Système International d’Unités, Bureau International des Poids et Mesures, Pavillon de
Breteuil, Parc de Saint Cloud, 92310 Sèvres, France. All new developments concerning SI
units are published in the journal Metrologia, edited by the same body. Showing the slow
pace of an old institution, the BIPM launched a website only in 1998; it is now reachable at
http://www.bipm.fr. See also the http://www.utc.fr/~tthomass/Themes/Unites/index.html
website; this includes the biographies of people who gave their names to various units.
The site of its British equivalent, http://www.npl.co.uk/npl/reference, is much better; it
provides many details as well as the English-language version of the SI unit definitions.
Cited on page 1170.
1221 G. Charpak & R.L. Garwin, The DARI, Europhysics News 33, pp. 14–17, Janu-
ary/February 2002. Cited on page 1176.
1222 Measured values and ranges for physical quantities are collected in Horst Völz &
Peter Ackermann, Die Welt in Zahlen, Spektrum Akademischer Verlag, 1996. Cited
on page 1177.
1223 See e.g. K. Codling & L.J. Frasinski, Coulomb explosion of simple molecules in in-
tense laser fields, Contemporary Physics 35, pp. 243–255, 1994. Cited on page 1177.
1224 A. Zeilinger, The Planck stroll, American Journal of Physics 58, p. 103, 1990. Can you
Challenge 1579 e find another similar example? Cited on page 1177.
* ‘[...] me, who always finds all books too long, first of all my own [...]’.
1188 b units, measurements and constants
1225 The most precise clock built in 2004, a caesium fountain clock, had a precision of one
part in 1015 . Higher precision has been predicted to be possible soon, among others by M.
Takamoto, F.-L. Hong, R. Higashi & H. Katori, An optical lattice clock, Nature
435, pp. 321–324, 2005. Cited on page 1178.
1226 J. Bergquist, ed., Proceedings of the Fifth Symposium on Frequency Standards and Met-
rology, World Scientific, 1997. Cited on page 1178.
1227 See the information on Ds mesons from the particle data group at http://pdg.web.cern.ch/
pdg. Cited on page 1178.
1228 About the long life of tantalum 180, see D. Belic & al., Photoactivation of 180 Tam and its
implications for the nucleosynthesis of nature’s rarest naturally occurring isotope, Physical
Review Letters 83, pp. 5242–5245, 20 December 1999. Cited on page 1178.
1229 See the review by L. Ju, D.G. Bl air & C. Zhao, The detection of gravitational waves,
Reports on Progress in Physics 63, pp. 1317–1427, 2000. Cited on page 1179.
1230 See the clear and extensive paper by G.E. Stedman, Ring laser tests of fundamental
The site also explains the newly discovered methods for calculating specific binary digits of
π without having to calculate all the preceding ones. By the way, the number of (consecut-
ive) digits known in 1999 was over 1.2 million million, as told in Science News 162, p. 255, 14
December 2002. They pass all tests of randomness, as the http://mathworld.wolfram.com/
PiDigits.html website explains. However, this property, called normality, has never been
proven; it is the biggest open question about π. It is possible that the theory of chaotic dy-
namics will lead to a solution of this puzzle in the coming years.
Another method to calculate π and other constants was discovered and published by
D.V. Chudnovsky & G.V. Chudnovsky, The computation of classical constants, Pro-
ceedings of the National Academy of Sciences (USA) 86, pp. 8178–8182, 1989. The Chud-
nowsky brothers have built a supercomputer in Gregory’s apartment for about 70 000
euros, and for many years held the record for calculating the largest number of digits of
π. They have battled for decades with Kanada Yasumasa, who held the record in 2000, cal-
culated on an industrial supercomputer. New formulae to calculate π are still occasionally
discovered.
PARTIC LE PROPERTI ES
T
he following table lists the known and predicted elementary particles.
he list has not changed since the mid-1970s, mainly because of the inefficient use
hat was made of the relevant budgets across the world since then.
Z0
Quarks d s t
(each in three colours) u c b
Copyright © Christoph Schiller November 1997–January 2007
Matter particles are fermions with spin 1/2; all have a cor-
responding antiparticle.
The following table lists all properties for the elementary particles (for reasons of space,
the colour quantity is not given explicitely).* The table and its header thus allow us, in
principle, to deduce a complete characterization of the intrinsic properties of any com-
posed moving entity, be it an object or an image.
* The official reference for all these data, worth a look for every physicist, is the massive collection of informa-
tion compiled by the Particle Data Group, with the website http://pdg.web.cern.ch/pdg containing the most
recent information. A printed review is published about every two years, with updated data, in one of the
major journals on elementary particle physics. See for example S. Eidelman & al., Review of Particle Phys-
ics, Physics Letters B 592, p. 1, 2004. For many measured properties of these particles, the official reference
is the set of CODATA values. The most recent list was published by P.J. Mohr & B.N. Taylor, Reviews of
Modern Physics 59, p. 351, 2000.
1192 c particle properties
a
Particle Mass m Lifetime τ Isospin I, Charge, Lepton
or energy spin J, c isospin, &
width, b parity P, strange- baryon e
main decay charge ness, c num-
modes parity C charm, bers L B,
beau ty, R-
topness: parity
QISCBT
electric dipole moment d = (3.7 3.4) ë 10−22 e m
tau τ 1.777 05(29) GeV~c 2 290.0(1.2) fs J= 1
2
−100000 1, 0, 1
Squark h J
Hypothetical elementary matter (fermions)
Higgsino(s) h J = 1
2
R = −1
Wino h (a chargino) J = 1
2
R = −1
Zino h (a neutralino) J = 1
2
R = −1
Photino h J = 1
2
R = −1
Gluino h J = 1
2
R = −1
Notes:
a. See also the table of SI prefixes on page 1171. About the eV~c 2 mass unit, see page 1175.
b. The energy width Γ of a particle is related to its lifetime τ by the indeterminacy relation Γτ = ħ. There
c particle properties 1193
is a difference between the half-life t 1~2 and the lifetime τ of a particle: they are related by t 1~2 = τ ln 2,
where ln 2 0.693 147 18; the half-life is thus shorter than the lifetime. The unified atomic mass unit u is
defined as 1/12 of the mass of a carbon 12 atom at rest and in its ground state. One has 1 u = 121 m(12 C) =
1.660 5402(10) yg.
c. To keep the table short, the header does not explicitly mention colour, the charge of the strong interactions.
This has to be added to the list of basic object properties. Quantum numbers containing the word ‘parity’
are multiplicative; all others are additive. Time parity T (not to be confused with topness T), better called
motion inversion parity, is equal to CP. The isospin I (or I Z ) is defined only for up and down quarks and their
composites, such as the proton and the neutron. In the literature one also sees references to the so-called
G-parity, defined as G = (−1)IC .
The header also does not mention the weak charge of the particles. The details of weak charge д, a quantum
number assigned to all left-handed fermions (and right-handed anti-fermion), but to no right-handed fer-
Page ?? mion (and no left-handed antifermion), are given in the section on the weak interactions.
d. ‘Beauty’ is now commonly called bottomness; similarly, ‘truth’ is now commonly called topness. The signs
of the quantum numbers S, I, C, B, T can be defined in different ways. In the standard assignment shown
here, the sign of each of the non-vanishing quantum numbers is given by the sign of the charge of the
corresponding quark.
Using the table of elementary particle properties, together with the standard model and
the fundamental constants, in principle all properties of composite matter and radiation
can be deduced, including all those encountered in everyday life. (Can you explain how
Challenge 1582 n the size of an object follows from them?) In a sense, this table contains all our knowledge
of matter and radiation, including materials science, chemistry and biology.
The most important examples of composites are grouped in the following table.
C o m p o si t e M a ss m, q ua n t u m L i f e t i m e τ , m a i n Size
numbers d e c ay m o d e s (diam.)
mesons (hadrons, bosons) (selected from over 130 known types)
º
pion π 0 (u ū − d d̄)~ 2 134.976 4(6) MeV~c 2 84(6) as, 2γ 98.798(32)% 1 fm
I (J ) = 1 (0 ), S = C = B = 0
G PC − −+
Copyright © Christoph Schiller November 1997–January 2007
kaon K S0
m K S0 89.27(9) ps 1 fm
kaon K L0 m K S0 + 3.491(9) µeV~c 2
51.7(4) ns 1 fm
kaon K (us̄, ūs) 493.677(16) MeV~c 2 12.386(24) ns, 1 fm
µ + ν µ 63.51(18)%
π+ π 0 21.16(14)%
kaon K 0 (ds̄) (50 % K S , 50 % 497.672(31) MeV~c 2 n.a. 1 fm
KL )
all kaons K , K 0 , K S0 , K L0 I(J P ) = 21 (0− ), S = 1, B = C = 0
1194 c particle properties
C o m p o si t e M a ss m, q ua n t u m L i f e t i m e τ , m a i n Size
numbers d e c ay m o d e s (diam.)
C o m p o si t e M a ss m, q ua n t u m L i f e t i m e τ , m a i n Size
numbers d e c ay m o d e s (diam.)
water (H2 O) 18 u A 1025 a
ATP 507 u A 1010 a c. 3 nm
(adenosinetriphosphate)
human Y chromosome 70 ë 106 base pairs A 106 a c. 50 mm
(uncoiled)
other composites
blue whale nerve cell 1 kg 50 a 20 m
cell (red blood) 0.1 ng 7 plus 120 days 10 µm
cell (sperm) 10 pg not fecundated: 5 d length
60 µm,
head
composite.
The Y (3940) resonance is a candidate for a hybrid meson, a composite of a gluon and a quark–
antiquark pair. This prediction of 1980 seems to have been confirmed in 2005.
Ref. 1245 In 2002, the first evidence for the existence of tetra-neutrons was published by a French group.
However, more recent investigations seem to have refuted the claim.
The number of existing molecules is several orders of magnitude larger than the number of
molecules that have been analysed and named.
Some nuclei have not yet been observed; in 2006 the known nuclei ranged from 1 to 116, but 113
and 115 were still missing.
The first anti-atoms, made of antielectrons and antiprotons, were made in January 1996 at CERN
Ref. 1246 in Geneva. All properties of antimatter checked so far are consistent with theoretical predictions.
The charge parity C is defined only for certain neutral particles, namely those that are different
1196 c particle properties
from their antiparticles. For neutral mesons, the charge parity is given by C = (−1) L+S , where L
is the orbital angular momentum.
P is the parity under space inversion r −r. For mesons, it is related to the orbital angular
momentum L through P = (−1) L+1 .
The electric polarizability, defined on page 554, is predicted to vanish for all elementary particles.
The most important matter composites are the atoms. Their size, structure and interac-
tions determine the properties and colour of everyday objects. Atom types, also called ele-
ments in chemistry, are most usefully set out in the so-called periodic table, which groups
together atoms with similar properties in rows and columns. It is given in Table 91 and
results from the various ways in which protons, neutrons and electrons can combine to
form aggregates.
Comparable to the periodic table of the atoms, there are tables for the mesons (made
of two quarks) and the baryons (made of three quarks). Neither the meson nor the ba-
ryon table is included here; they can both be found in the cited Review of Particle Physics
metals, semimetals, a liquid and gases; they have no special name. Groups 1 and 13 to 17
are central for the chemistry of life; in fact, 96 % of living matter is made of C, O, N, H;*
almost 4 % of P, S, Ca, K, Na, Cl; trace elements such as Mg, V, Cr, Mn, Fe, Co, Ni, Cu,
Zn, Cd, Pb, Sn, Li, Mo, Se, Si, I, F, As, B form the rest. Over 30 elements are known to be
essential for animal life. The full list is not yet known; candidate elements to extend this
list are Al, Br, Ge and W.
Many elements exist in versions with different numbers of neutrons in their nucleus,
and thus with different mass; these various isotopes – so called because they are found at
the same place in the periodic table – behave identically in chemical reactions. There are
TA B L E 91 The periodic table of the elements known in 2006, with their atomic numbers
Group
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18
I II IIIa IVa Va VIa VIIa VIIIa Ia IIa III IV V VI VII VIII
Period
1 2
1
H He
3 4 5 6 7 8 9 10
2
Li Be B C N O F Ne
11 12 13 14 15 16 17 18
19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36
4
K Ca Sc Ti V Cr Mn Fe Co Ni Cu Zn Ga Ge As Se Br Kr
37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54
5
Rb Sr Y Zr Nb Mo Tc Ru Rh Pd Ag Cd In Sn Sb Te I Xe
55 56 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86
6
Cs Ba Hf Ta W Re Os Ir Pt Au Hg Tl Pb Bi Po At Rn
87 88 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118
7
Fr Ra Rf Db Sg Bh Hs Mt Ds Rg Uub Uut Uuq Uup Uuh
57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
Lanthanoids
La Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb Lu
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Actiniumb Ac 89 (227.0277(1)) (188) highly radioactive metallic rare Earth
21.77(2) a (Greek aktis ray) 1899, used as alpha-
emitting source
1198 c particle properties
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Aluminium Al 13 26.981 538 (8) 118c, light metal (Latin alumen alum) 1827,
stable 143m used in machine construction and living
beings
Americiumb Am 95 (243.0614(1)) (184) radioactive metal (Italian America from
7.37(2) ka Amerigo) 1945, used in smoke detectors
Antimony Sb 51 121.760(1) f 137c, toxic semimetal (via Arabic from Latin
stable 159m, stibium, itself from Greek, Egyptian for
205v one of its minerals) antiquity, colours rub-
ber, used in medicines, constituent of en-
Bohriumb Bh 107 (264.12(1)) n.a. made in lab, probably metallic (after Niels
0.44 s д Bohr) 1981, found in nuclear reactions, no
use
Boron B 5 10.811(7) f 83c semimetal, semiconductor (Latin borax,
stable from Arabic and Persian for brilliant)
1808, used in glass, bleach, pyrotechnics,
rocket fuel, medicine
Bromine Br 35 79.904(1) 120c, red-brown liquid (Greek bromos strong
stable 185v odour) 1826, fumigants, photography, wa-
ter purification, dyes, medicines
c particle properties 1199
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Cadmium Cd 48 112.411(8) f 157m heavy metal, cuttable and screaming
stable (Greek kadmeia, a zinc carbonate mineral
where it was discovered) 1817, electroplat-
ing, solder, batteries, TV phosphors, dyes
Caesium Cs 55 132.905 4519(2) 273m alkali metal (Latin caesius sky blue) 1860,
getter in vacuum tubes, photoelectric
stable cells, ion propulsion, atomic clocks
Calcium Ca 20 40.078(4) f 197m Earth-alkali metal (Latin calcis chalk) an-
stable tiquity, pure in 1880, found in stones and
catalyst
Cobalt Co 27 58.933 195(5) 125m ferromagnetic transition metal (German
stable Kobold goblin) 1694, part of vitamin
B12 , magnetic alloys, heavy-duty alloys,
enamel dyes, ink, animal nutrition
Copper Cu 29 63.546(3) f 128m red metal (Latin cuprum from Cyprus is-
stable land) antiquity, part of many enzymes,
electrical conductors, bronze, brass and
other alloys, algicides, etc.
Curiumb Cm 96 (247.0704(1)) n.a. highly radioactive, silver-coloured (after
15.6(5) Ma Pierre and Marie Curie) 1944, used as ra-
dioactivity source
1200 c particle properties
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Darmstadtiumb Ds 110 (271) 1.6 min д n.a. (after the German city) 1994, no use
Dubniumb Db 105 (262.1141(1)) n.a. made in lab in small quantities, radioact-
34(5) s ive (Dubna, Russian city) 1967, no use
(once known as hahnium)
Dysprosium Dy 66 162.500(1) f 177m rare Earth metal (Greek dysprositos diffi-
stable cult to obtain) 1886, used in laser mater-
ials, as infrared source material, and in
nuclear industry
Einsteiniumb Es 99 (252.0830(1)) n.a. made in lab, radioactive (after Albert Ein-
ics
Gold Au 79 196.966 569(4) 144m heavy noble metal (Sanskrit jval to shine,
stable Latin aurum) antiquity, electronics, jew-
els
Hafnium Hf 72 178.49(2) c 158m metal (Latin for Copenhagen) 1923, alloys,
stable incandescent wire
Hassiumb Hs 108 (277) 16.5 min д n.a. radioactive element (Latin form of Ger-
man state Hessen) 1984, no use
Helium He 2 4.002 602(2) f (31n) noble gas (Greek helios Sun) where it was
stable discovered 1895, used in balloons, stars,
diver’s gas and cryogenics
c particle properties 1201
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Holmium Ho 67 164.930 32(2) 177m metal (Stockholm, Swedish capital) 1878,
stable alloys
Hydrogen H 1 1.007 94(7) f 30c reactive gas (Greek for water-former)
stable 1766, used in building stars and universe
Indium In 49 114.818(3) 141c, soft metal (Greek indikon indigo) 1863,
stable 166m used in solders and photocells
Iodine I 53 126.904 47(3) 140c, blue-black solid (Greek iodes violet) 1811,
stable 198v used in photography
Iridium Ir 77 192.217(3) 136m precious metal (Greek iris rainbow) 1804,
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Mercury Hg 80 200.59(2) 157m liquid heavy metal (Latin god Mercurius,
stable Greek hydrargyrum liquid silver) an-
tiquity, used in switches, batteries, lamps,
amalgam alloys
Molybdenum Mo 42 95.94(2) f 140m metal (Greek molybdos lead) 1788, used in
stable alloys, as catalyst, in enzymes and lubric-
ants
Neodymium Nd 60 144.242(3) c, f 182m (Greek neos and didymos new twin) 1885
stable
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Platinum Pt 78 195.084(9) 139m silvery-white, ductile, noble heavy
stable metal (Spanish platina little silver)
pre-Columbian, again in 1735, used
in corrosion-resistant alloys, magnets,
furnaces, catalysts, fuel cells, cathodic
protection systems for large ships and
pipelines; being a catalyst, a fine platinum
wire glows red hot when placed in vapour
of methyl alcohol, an effect used in hand
warmers
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Radium Ra 88 (226.0254(1)) (223) highly radioactive metal (Latin radius
1599(4) a ray) 1898, no use any more; once used in
luminous paints and as radioactive source
and in medicine
Radon Rn 86 (222.0176(1)) (130n) radioactive noble gas (from its old name
3.823(4) d ‘radium emanation’) 1900, no use (any
more), found in soil, produces lung can-
cer
Rhenium Re 75 186.207(1) c 138m (Latin rhenus for Rhine river) 1925, used
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Silicon Si 14 28.0855(3) f 105c, grey, shiny semiconductor (Latin silex
stable 210v pebble) 1823, Earth’s crust, electronics,
sand, concrete, bricks, glass, polymers,
solar cells, essential for life
Silver Ag 47 107.8682(2) f 145m white metal with highest thermal and
stable electrical conductivity (Latin argentum,
Greek argyros) antiquity, used in photo-
graphy, alloys, to make rain
Sodium Na 11 191m light, reactive metal (Arabic souwad soda,
Name Sym- At. Aver. mass a Ato- Main properties, (naming) h dis-
bol n. in u mic e covery date and use
(error), ra-
longest dius
lifetime in pm
Thulium Tm 69 168.934 21(2) 175m rare Earth metal (Thule, mythical name
stable for Scandinavia) 1879, found in monazite
Tin Sn 50 118.710(7) f 139c, grey metal that, when bent, allows one
stable 210v, to hear the ‘tin cry’ (Latin stannum) an-
162m tiquity, used in paint, bronze and super-
conductors
Titanium Ti 22 47.867(1) 146m metal (Greek hero Titanos) 1791, alloys,
stable fake diamonds
Tungsten W 74 183.84(1) 141m heavy, highest-melting metal (Swedish
a. The atomic mass unit is defined as 1 u = 121 m(12 C), making 1 u = 1.660 5402(10) yg. For elements found
on Earth, the average atomic mass for the naturally occurring isotope mixture is given, with the error in the
Ref. 1247 last digit in brackets. For elements not found on Earth, the mass of the longest living isotope is given; as it is
c particle properties 1207
the atomic mass of commercial lithium ranges between 6.939 and 6.996 u. The masses of isotopes are known
Ref. 1240 in atomic mass units to nine or more significant digits, and usually with one or two fewer digits in kilograms.
Ref. 1247 The errors in the atomic mass are thus mainly due to the variations in isotopic composition.
д. The lifetime errors are asymmetric or not well known.
h. Extensive details on element names can be found on http://elements.vanderkrogt.net.
1208 c particle properties
Biblio graphy
1237 The electron radius limit is deduced from the д − 2 measurements, as explained in the
Nobel Prize talk by Hans Dehmelt, Experiments with an isolated subatomic particle
at rest, Reviews of Modern Physics 62, pp. 525–530, 1990, or in Hans Dehmelt, Is the
electron a composite particle?, Hyperfine Interactions 81, pp. 1–3, 1993. Cited on page 1193.
1238 G. Gabrielse, H. Dehmelt & W. Kells, Observation of a relativistic, bistable hyster-
esis in the cyclotron motion of a single electron, Physical Review Letters 54, pp. 537–540,
1985. Cited on page 1193.
1239 The proton charge radius was determined by measuring the frequency of light emitted by
hydrogen atoms to high precision by T. Udem, A. Huber, B. Gross, J. R eichert, M.
Prevedelli, M. Weitz & T.W. Hausch, Phase-coherent measurement of the hydro-
gen 1S–2S transition frequency with an optical frequency interval divider chain, Physical
Review Letters 79, pp. 2646–2649, 1997. Cited on page 1194.
from 2005, however, seem to rule out that the 1540 MeV particle is a pentaquark. Cited on
page 1195.
1245 F. Marques & al., Detection of neutron clusters, Physical Review C 65, p. 044006,
2002. Opposite results have been obtained by B.M. Sherrill & C.A. Bertul ani,
Proton-tetraneutron elastic scattering, Physical Review C 69, p. 027601, 2004, and D.V.
Aleksandrov & al., Search for resonances in the three- and four-neutron systems in
the 7Li(7Li, 11C)3n and 7Li(7Li, 10C)4n reactions, JETP Letters 81, p. 43, 2005. Cited on
page 1195.
1246 For a good review, see the article by P.T. Greenl and, Antimatter, Contemporary Physics
38, pp. 181–203, 1997. Cited on page 1195.
1247 The atomic masses, as given by IUPAC, can be found in Pure and Applied Chemistry 73,
pp. 667–683, 2001, or on the http://www.iupac.org website. For an isotope mass list, see
bibliography 1209
”
M
athematical concepts can all be expressed in terms of ‘sets’ and ‘relations.’
We start with a short introduction to the vocabulary. Any mathematical system with the
same basic properties as the natural numbers is called a semi-ring. Any mathematical
system with the same basic properties as the integers is called a ring. (The term is due to
” Copyright © Christoph Schiller November 1997–January 2007
David Hilbert. Both structures can also be finite rather than infinite.) More precisely, a
ring (R, +, ë) is a set R of elements with two binary operations, called addition and multi-
plication, usually written + and ë (the latter may simply be understood without notation),
for which the following properties hold for all elements a, b, c > R:
— R is a commutative group with respect to addition, i.e.
a + b > R, a + b = b + a, a + 0 = a, a + (−a) = a − a = 0 and a + (b + c) = (a + b) + c;
— R is closed under multiplication, i.e. ab > R;
* The opposite approach can have the same effect: it is taken in the delightful text by Carl E. L inder-
holm, Mathematics Made Difficult, Wolfe Publishing, 1971.
numbers as mathematical structures 1211
* A set is mathematically complete if physicists call it continuous. More precisely, a set of numbers is complete
if every non-empty subset that is bounded above has a least upper bound.
A set is totally ordered if there exists a binary relation D between pairs of elements such that for all elements
a and b
— if a D b and b D c, then a D c;
— if a D b and b D a, then a = b;
— a D b or b D a holds.
In summary, a set is totally ordered if there is a binary relation that allows to say about any two elements
which one is the predecessor of the other in a consistent way.
1212 d numbers and spaces
Complex numbers
A complex number is defined by z = a + ib, where a and b are real numbers, and i is a
new symbol. Under multiplication, the generators of the complex numbers, 1 and i, obey
ë 1 i
1 1 i (821)
i i −1
º
often summarized as i = + −1 .
The complex conjugate z , also written z̄, of a complex number z = a + ib isºdefined
as z = aº− ib. The absolute value SzS of a complex number is defined as SzS = zz =
º
z z = a 2 + b 2 . It defines a norm on the vector space of the complex numbers. From
SwzS = SwS SzS follows the two-squares theorem
valid for all real numbers a i , b i . It was already known, in its version for integers, to Dio-
phantus of Alexandria.
Complex numbers can also be written as
ordered pairs (a, A) of real numbers, with their
addition defined as (a, A) + (b, B) = (a + b, A + ic
B) and their multiplication defined as (a, A) ë
(b, B) = (ab − AB, aB + bA). This notation al-
lows us to identify the complex numbers with the
points on a plane. Translating the definition of
multiplication into geometrical language allows ih = − i ab
c
Challenge 1585 e us to rapidly prove certain geometrical theorems,
such as the one of Figure 419.
Complex numbers a + ib can also be repres-
a 0 b
ented as 2 2 matrices
F I G U R E 419 A property of triangles easily
witha, b > R .
a b provable with complex numbers
(823)
−b a
Copyright © Christoph Schiller November 1997–January 2007
Matrix addition and multiplication then correspond to complex addition and multiplic-
ation. In this way, complex numbers can be represented by a special type of real matrix.
Challenge 1586 n What is SzS in matrix language?
The set C of complex numbers with addition and multiplication as defined above
Page 1221 forms both a commutative two-dimensional field and a vector space over R. In the field
of complex numbers, quadratic equations az 2 + bz + c = 0 for an unknown z always have
Challenge 1587 ny two solutions (for a x 0 and counting multiplicity).
Complex numbers can be used to describe the points of a plane. A rotation around the
origin can be described by multiplication by a complex number of unit length. Other two-
dimensional quantities can also be described with complex numbers. Electrical engineers
numbers as mathematical structures 1213
use complex numbers to describe quantities with phases, such as alternating currents or
electrical fields in space.
Writing complex numbers of unit length as cos θ + i sin θ is a useful method for re-
Challenge 1588 e membering angle addition formulae. Since one has cos nθ + i sin nθ = (cos θ + i sin θ)n ,
one can easily deduce formulae cos 2θ = cos2 θ − sin2 θ and sin 2θ = 2 sin θ cos θ.
The compex exponential function is periodic in 2πi; in other words, one has
e1 = e1+2π i . (824)
Quaternions
“ ”
The positions of the points on a line can be described by real numbers. Complex numbers
can be used to describe the positions of the points of a plane. It is natural to try to gen-
eralize the idea of a number to higher-dimensional spaces. However, it turns out that no
useful number system can be defined for three-dimensional space. A new number system,
the quaternions, can be constructed which corresponds the points of four-dimensional
space, but only if the commutativity of multiplication is sacrificed. No useful number
system can be defined for dimensions other than 1, 2 and 4.
The quaternions were discovered by several mathematicians in the nineteenth century,
among them Hamilton,* who studied them for much of his life. In fact, Maxwell’s theory
of electrodynamics was formulated in terms of quaternions before three-dimensional vec-
Ref. 1253 tors were used.
Page 1223 Under multiplication, the quaternions H form a 4-dimensional algebra over the reals
with a basis 1, i, j, k satisfying
Copyright © Christoph Schiller November 1997–January 2007
ë 1 i j k
1 1 i j k
i i −1 k − j . (826)
j j −k −1 i
k k j −i −1
* William Rowan Hamilton (b. 1805 Dublin, d. 1865 Dunsink), Irish child prodigy and famous mathem-
atician, named the quaternions after an expression from the Vulgate (Acts. 12: 4).
1214 d numbers and spaces
ki = −ik = j. The quaternions 1, i, j, k are also called basic units or generators. The lack of
symmetry across the diagonal of the table shows the non-commutativity of quaternionic
multiplication. With the quaternions, the idea of a non-commutative product appeared
for the first time in mathematics. However, the multiplication of quaternions is associat-
ive. As a consequence of non-commutativity, polynomial equations in quaternions have
many more solutions than in complex numbers: just search for all solutions of the equa-
Challenge 1592 ny tion X 2 + 1 = 0 to convince yourself of it.
Every quaternion X can be written in the form
X = x 0 + x 1 i + x 2 j + x 3 k = x 0 + v = (x 0 , x 1 , x 2 , x 3 ) = (x 0 , v) , (827)
where x 0 is called the scalar part and v the vector part. The multiplication is thus defined
as (x, v)(y, w) = (x y−vëw, xw+yv+vw). The multiplication of two general quaternions
can be written as
No relation of this type exists for complex numbers. In the language of physics, a complex
number and its conjugate are independent variables; for quaternions, this is not the case.
As a result, functions of quaternions are less useful in physics than functions of complex
variables.
The relation SXYS = SXS SYS implies the four-squares theorem
(a 12 + a 22 + a 32 + a 42 )(b 12 + b 22 + b 32 + b 42 )
= (a 1 b 1 − a 2 b 2 − a 3 b 3 − a 4 b 4 )2 + (a 1 b 2 + a 2 b 1 + a 3 b 4 − a 4 b 3 )2
Copyright © Christoph Schiller November 1997–January 2007
+(a 1 b 3 + a 3 b 1 + a 4 b 2 − a 2 b 4 )2 + (a 1 b 4 + a 4 b 1 + a 2 b 3 − a 3 b 2 )2 (830)
valid for all real numbers a i and b i , and thus also for any set of eight integers. It was
discovered in 1748 by Leonhard Euler (1707–1783) when trying to prove that each integer
is the sum of four squares. (That fact was proved only in 1770, by Joseph Lagrange.)
Hamilton thought that a quaternion with zero scalar part, which he simply called a vec-
tor (a term which he invented), could be identified with an ordinary three-dimensional
translation vector; but this is wrong. Such a quaternion is now called a pure, or homo-
geneous, or imaginary quaternion. The product of two pure quaternions V = (0, v) and
W = (0, w) is given by V W = (−v ë w, v w), where ë denotes the scalar product and
numbers as mathematical structures 1215
denotes the vector product. Note that any quaternion can be written as the ratio of two
pure quaternions.
In reality, a pure quaternion (0, v) does not behave like a translation vector under
coordinate transformations; in fact, a pure quaternion represents a rotation by the angle
Challenge 1593 ny π or 180° around the axis defined by the direction v = (v x , v y , v z ).
It turns out that in three-dimensional
space, a general rotation about the origin
can be described by a unit quaternion Q,
also called a normed quaternion, for which α~2
SQS = 1. Such a quaternion can be written as l π − γ~2
(cos θ~2, n sin θ~2), where n = (n x , n y , n z )
is the normed vector describing the direc-
tion of the rotation axis and θ is the ro-
tation angle. Such a unit quaternion Q =
n
(cos θ~2, n sin θ~2) rotates a pure quaternion
W = QV Q .
F I G U R E 420 Combinations of rotations
(831)
Thus, if we use pure quaternions such as V or W to describe positions, we can use unit
quaternions to describe rotations and to calculate coordinate changes. The concatenation
of two rotations is then given by the product of the corresponding unit quaternions. In-
deed, a rotation by an angle α about the axis l followed by a rotation by an angle β about
the axis m gives a rotation by an angle γ about the axis n, with the values determined by
(cos γ~2, sin γ~2n) = (cos α~2, sin α~2l)(cos β~2, sin β~2m) . (832)
One way to show the result graphically is given in Figure 420. By drawing a triangle on
a unit sphere, and taking care to remember the factor 1~2 in the angles, the combination
of two rotations can be simply determined.
The interpretation of quaternions as rotations is also illustrated, in a somewhat differ-
Ref. 1251 ent way, in the motion of any hand. To see this, take a green marker and write the letters 1,
i, j and k on your hand as shown in Figure 421. Defining the three possible 90° rotations Copyright © Christoph Schiller November 1997–January 2007
axes as shown in the figure and taking concatenation as multiplication, the motion of the
Challenge 1594 e right hand follows the same ‘laws’ as those of pure unit quaternions. (One still needs to
distinguish +i and −i, and the same for the other units, by the sense of the arm twist.
And the result of a multiplication is that letter that can be read by a person facing you.)
Challenge 1595 n You can show that i 2 = j2 = k 2 = −1, that i 4 = 1, and all other quaternion relations.)
The model also shows that the rotation angle of the arm is half the rotation angle of the
corresponding quaternion. In other words, quaternions can be used to describe the belt
trick, if the multiplication V W of two quaternions is taken to mean that rotation V is
Page 802 performed after rotation W. Quaternions, like human hands, thus behave like a spin 1/2
particle. Quaternions and spinors are isomorphic.
The reason for the half-angle behaviour of rotations can be specified more precisely
using mathematical language. The rotations in three dimensions around a point form the
1216 d numbers and spaces
1 k
j i
palm back
of right of right
hand hand
‘special orthogonal group’ in three dimensions, which is called SO(3). But the motions of
a hand attached to a shoulder via an arm form a different group, isomorphic to the Lie
Page 1235 group SU(2). The difference is due to the appearance of half angles in the parametrization
of rotations; indeed, the above parametrizations imply that a rotation by 2π corresponds
to a multiplication by −1. Only in the twentieth century was it realized that there exist fun-
damental physical observables that behaves like hands attached to arms: they are called
Page 802 spinors. More on spinors can be found in the section on permutation symmetry, where
belts are used as an analogy as well as arms. In short, the group SU(2) of the quaternions
Ref. 1254 is the double cover of the rotation group SO(3).
The simple representation of rotations and positions with quaternions is used by com-
puter programmes in robotics, in astronomy and in flight simulation. In the software used
to create three-dimensional images and animations, visualization software, quaternions Copyright © Christoph Schiller November 1997–January 2007
are often used to calculate the path taken by repeatedly reflected light rays and thus give
surfaces a realistic appearance.
The algebra of the quaternions is the only associative, non-commutative, finite-dimen-
sional normed algebra with an identity over the field of real numbers. Quaternions form
a non-commutative field, i.e. a skew field, in which the inverse of a quaternion X is X~SXS.
We can therefore define division of quaternions (while being careful to distinguish XY −1
and Y −1 X). Therefore quaternions are said to form a division algebra. In fact, the qua-
ternions H, the complex numbers C and the reals R are the only three finite-dimensional
associative division algebras. In other words, the skew-field of quaternions is the only
finite-dimensional real associative non-commutative algebra without divisors of zero.
The centre of the quaternions, i.e. the set of quaternions that commute with all other qua-
ternions, is just the set of real numbers.
numbers as mathematical structures 1217
a b c d
−b a −d c
with A, B > C, or as with a, b, c, d > R, (833)
A B
−c d a −b
−B A
−d −c b a
a b −d −c
−b a −c d
d a b
(836)
c
c −d −b a
Page 554 notation. (The now usual 3-vector notation was introduced later by Hertz and Heaviside.)
The equations can be written in variousº ways using quaternions. The simplest is achieved
Ref. 1253 when one keeps a distinction between −1 and the units i, j, k of the quaternions. One
Challenge 1597 n then can write all of electrodynamics in a single equation:
1 0 0 1 0 −i 1 0
, , , (835)
0 1 1 0 i 0 0 −1
σ0 = 1 = σ1 = σ2 = σ3 =
Page 1225 all of whose eigenvalues are 1; they satisfy the relations [σ i , σ k ]+ = 2 δ i k and [σ i , σ k ] = 2i ε i k l σ l . The linear
combinations σ = 21 (σ1 σ2 ) are also frequently used. By the way, another possible representation of the
quaternions is i iσ3 , j iσ2 , k iσ1 .
1218 d numbers and spaces
dF = −
Q
(837)
ε0
where F is the generalized electromagnetic field and Q the generalized charge. These are
defined by
º
F = E + −1 cB
E = iE x + jE y + kE z
B = iB x + jB y + kB z
º
(838)
d = δ + −1 ∂ t ~c
δ = i∂ x + j∂ y + k∂ z
º
Q = ρ + −1 J~c
Octonions
In the same way that quaternions are constructed from complex numbers, octonions
can be constructed from quaternions. They were first investigated by Arthur Cayley
(1821–1895). Under multiplication, octonions (or octaves) are the elements of an eight-
dimensional algebra over the reals with the generators 1, i n with n = 1 . . . 7 satisfying
ë 1 i1 i2 i3 i4 i5 i6 i7
1 1 i1 i2 i3 i4 i5 i6 i7
i1 i1 −1 i 3 −i 2 i 5 −i 4 i 7 −i 6
i2 i2 −i 3 −1 i 1 −i 6 i7 i 4 −i 5
i3 i3 i 2 −i 1 −1 i7 i 6 −i 5 −i 4 (839) Copyright © Christoph Schiller November 1997–January 2007
i4 i4 −i 5 i 6 −i 7 −1 i 1 −i 2 i3
i5 i5 i 4 −i 7 −i 6 −i 1 −1 i3 i2
i6 i6 −i 7 −i 4 i5 i 2 −i 3 −1 i1
i7 i7 i6 i5 i 4 −i 3 −i 2 −i 1 −1
Nineteen other, equivalent multiplication tables are also possible. This algebra is called the
Cayley algebra; it has an identity and a unique division. The algebra is non-commutative,
and also non-associative. It is, however, alternative, meaning that for all elements x and y,
one has x(x y) = x 2 y and (x y)y = x y 2 : a property somewhat weaker than associativity.
It is the only 8-dimensional real alternative algebra without zero divisors. Because it is
not associative, the set Ω of all octonions does not form a field, nor even a ring, so that
numbers as mathematical structures 1219
the old designation of ‘Cayley numbers’ has been abandoned. The octonions are the most
general hypercomplex ‘numbers’ whose norm is multiplicative. Its generators obey since
(i n i m )i l = i n (i m i l ), where the minus sign, which shows the non-associativity, is valid
for combinations of indices, such as 1-2-4, which are not quaternionic.
Octonions can be represented as matrices of the form
(a 12 + a 22 + a 32 + a 42 + a 52 + a 62 + a 72 + a 82 )(b 12 + b 22 + b 32 + b 42 + b 52 + b 62 + b 72 + b 82 )
= (a 1 b 1 − a 2 b 2 − a 3 b 3 − a 4 b 4 − a 5 b 5 − a 6 b 6 − a 7 b 7 − a 8 b 8 )2
valid for all real numbers a i and b i and thus in particular also for all integers. (There
are many variations of this expression, with different possible sign combinations.) The
theorem was discovered in 1818 by Carl Ferdinand Degen (1766–1825), and then redis-
covered in 1844 by John Graves and in 1845 by Arthur Cayley. There is no generalization
to higher numbers of squares, a fact proved by Adolf Hurwitz (1859–1919) in 1898.
The octonions can be used to show that a vector product can be defined in more than
three dimensions. A vector product or cross product is an operation satisfying
u v = −v u anticommutativity
(u v)w = u(v w)
Copyright © Christoph Schiller November 1997–January 2007
the -products of imaginary quaternions, i.e. of quaternions of the type (0, u), are again
imaginary, and correspond to the usual vector product, thus fulfilling (842). Interestingly,
Ref. 1252 it is possible to use definition (843) for octonions as well. In that case, the product of
Challenge 1598 e imaginary octonions is also imaginary, and (842) is again satisfied. In fact, this is the only
other non-trivial example of a vector product. Thus a vector product exists only in three
1220 d numbers and spaces
Grassmann numbers
With the discovery of supersymmetry, another system of numbers became important,
called the Grassmann numbers.* They are in fact a special type of hypercomplex numbers.
In supersymmetric Lagrangians, fields depend on two types of coordinates: on the usual Copyright © Christoph Schiller November 1997–January 2007
Challenge 1601 ny You may want to look for a matrix representation of these numbers.
* Hermann Günther Grassmann (1809–1877), schoolteacher in Stettin, and one of the most profound math-
ematical thinkers of the nineteenth century.
vector spaces 1221
Vector spaces
Vector spaces, also called linear spaces, are mathematical generalizations of certain as-
pects of the intuitive three-dimensional space. A set of elements any two of which can
be added together and any one of which can be multiplied by a number is called a vector
space, if the result is again in the set and the usual rules of calculation hold.
More precisely, a vector space over a number field K is a set of elements, called vectors,
for which a vector addition and a scalar multiplication is defined, such that for all vectors
a, b, c and for all numbers s and r from K one has
If the field K, whose elements are called scalars in this context, is taken to be the real (or
complex, or quaternionic) numbers, one speaks of a real (or complex, or quaternionic)
vector space. Vector spaces are also called linear vector spaces or simply linear spaces.
The complex numbers, the set of all real functions defined on the real line, the set of
all polynomials, the set of matrices with a given number of rows and columns, all form
vector spaces. In mathematics, a vector is thus a more general concept than in physics.
Challenge 1602 ny (What is the simplest possible mathematical vector space?)
In physics, the term ‘vector’ is reserved for elements of a more specialized type of
vector space, namely normed inner product spaces. To define these, we first need the
concept of a metric space.
A metric space is a set with a metric, i.e. a way to define distances between elements.
A real function d(a, b) between elements is called a metric if
Challenge 1604 ny Usually there are many ways to define a norm for a given space. Note that a norm can
always be used to define a metric by setting
so that all normed spaces are also metric spaces. This is the natural distance definition
(in contrast to unnatural ones like that between French cities).
The norm is often defined with the help of an inner product. Indeed, the most special
class of linear spaces are the inner product spaces. These are vector spaces with an inner
product, also called scalar product ë (not to be confused with the scalar multiplication!)
which associates a number to each pair of vectors. An inner product space over R satisfies
for all vectors a, b, c and all scalars r, s. A real inner product space of finite dimension is
also called a Euclidean vector space. The set of all velocities, the set of all positions, or the
set of all possible momenta form such spaces.
An inner product space over C satisfies*
for all vectors a, b, c and all scalars r, s. A complex inner product space (of finite dimen-
Copyright © Christoph Schiller November 1997–January 2007
sion) is also called a unitary or Hermitean vector space. If the inner product space is
Page 1211 complete, it is called, especially in the infinite-dimensional complex case, a Hilbert space.
The space of all possible states of a quantum system forms a Hilbert space.
All inner product spaces are also metric spaces, and thus normed spaces, if the metric
is defined by »
d(a, b) = (a − b) ë (a − b) . (851)
Only in the context of an inner product spaces we can speak about angles (or phase dif-
ferences) between vectors, as we are used to in physics. Of course, like in normed spaces,
* Two inequivalent forms of the sequilinearity axiom exist. The other is (ra) ë (sb) = rs(a ë b). The term
sesquilinear is derived from Latin and means for ‘one-and-a-half-linear’.
algebras 1223
inner product spaces also allows us to speak about the length of vectors and to define a
basis, the mathematical concept necessary to define a coordinate system.
The dimension of a vector space is the number of linearly independent basis vectors.
Challenge 1605 ny Can you define these terms precisely?
Challenge 1606 ny Which vector spaces are of importance in physics?
Algebras
The term algebra is used in mathematics with three different, but loosely related, mean-
ings. First, it denotes a part of mathematics, as in ‘I hated algebra at school’. Secondly,
it denotes a set of formal rules that are obeyed by abstract objects, as in the expression
‘tensor algebra’. Finally – and this is the only meaning used here – an algebra denotes a
specific type of mathematical structure.
Intuitively, an algebra is a set of vectors with a vector multiplication defined on it. More
precisely, an algebra is a vector space (over a field K) that is also a ring. (The concept is
for all vectors x, y, z and all scalars c > K. To stress their properties, algebras are also
called linear algebras.
For example, the set of all linear transformations of an n-dimensional linear space
(such as the translations on a plane, in space or in time) is a linear algebra, if the com- Copyright © Christoph Schiller November 1997–January 2007
system.*
An associative algebra is an algebra whose multiplication has the additional property
that
x(yz) = (x y)z associativity . (854)
Most algebras that arise in physics are associative.** Therefore, in mathematical physics,
a linear associative algebra is often simply called an algebra.
The set of multiples of the unit 1 of the algebra is called the field of scalars scal(A) of
the algebra A. The field of scalars is also a subalgebra of A. The field of scalars and the
scalars themselves behave in the same way.
We explore a few examples. The set of all polynomials in one variable (or in several
Challenge 1608 e variables) forms an algebra. It is commutative and infinite-dimensional. The constant
polynomials form the field of scalars.
The set of n n matrices, with the usual operations, also forms an algebra. It is n 2 -
(x ) = x
(x + y) = x + y
(cx) = cx for all c>C
(x y) = y x (855)
valid for all elements x, y of the algebra A. The element x is called the adjoint of x.
Star algebras are the main type of algebra used in quantum mechanics, since quantum-
mechanical observables form a -algebra.
* Linear transformations are mappings from the vector space to itself, with the property that sums and scalar
multiples of vectors are transformed into the corresponding sums and scalar multiples of the transformed
Challenge 1607 n vectors. Can you specify the set of all linear transformations of the plane? And of three-dimensional space?
Copyright © Christoph Schiller November 1997–January 2007
YxY2 = x x . (856)
(A Banach algebra is a complete normed algebra; an algebra is complete if all Cauchy se-
quences converge.) In short, C-algebra is a nicely behaved algebra whose elements form
a continuous set and a complex vector space. The name C comes from ‘continuous func-
tions’. Indeed, the bounded continuous functions form such an algebra, with a properly
Challenge 1611 n defined norm. Can you find it?
Every C-algebra contains a space of Hermitean elements (which have a real spec-
trum), a set of normal elements, a multiplicative group of unitary elements and a set of
positive elements (with non-negative spectrum).
We should mention one important type of algebra used in mathematics. A division
algebra is an algebra for which the equations ax = b and ya = b are uniquely solvable in
subfield.
In physics, it is the algebras related to symmetries which play the most important role.
We study them next.
Lie algebras
A Lie algebra is special type of algebra (and thus of vector space). Lie algebras are the
most important type of non-associative algebras. A vector space L over the field R (or C)
with an additional binary operation [ , ], called Lie multiplication or the commutator, is
1226 d numbers and spaces
for all elements X, Y , Z > L and for all a, b > R (or C). (Lie algebras are named after
Challenge 1612 e Sophus Lie.) The first two conditions together imply bilinearity. A Lie algebra is called
commutative if [X, Y] = 0 for all elements X and Y. The dimension of the Lie algebra is
the dimension of the vector space. A subspace N of a Lie algebra L is called an ideal* if
[L, N] ⊂ N; any ideal is also a subalgebra. A maximal ideal M which satisfies [L, M] = 0
is called the centre of L.
A Lie algebra is called a linear Lie algebra if its elements are linear transformations of
another vector space V (intuitively, if they are ‘matrices’). It turns out that every finite-
(This fact gave the commutator its name.) Lie algebras are non-associative in general;
but the above definition of the commutator shows how to build one from an associative
algebra.
Since Lie algebras are vector spaces, the elements Ti of a basis of the Lie algebra always
obey a relation of the form:
[Ti , T j ] = Q c ikj Tk . (859)
k
The numbers c ikj are called the structure constants of the Lie algebra. They depend on
the choice of basis. The structure constants determine the Lie algebra completely. For
example, the algebra of the Lie group SU(2), with the three generators defined by Ta =
Copyright © Christoph Schiller November 1997–January 2007
Page 1217 σ a ~2i, where the σ a are the Pauli spin matrices, has the structure constants C abc = ε abc .**
Challenge 1613 ny * Can you explain the notation [L, N]? Can you define what a maximal ideal is and prove that there is only
one?
** Like groups, Lie algebras can be represented by matrices, i.e. by linear operators. Representations of Lie
algebras are important in physics because many continuous symmetry groups are Lie groups.
The adjoint representation of a Lie algebra with basis a 1 ...a n is the set of matrices ad(a) defined for each
element a by
[a, a j ] = Q ad(a)c j a c . (860)
c
physics: among them are the Poincaré algebra, the Cartan algebra, the Virasoro algebra
Note that for any Lie algebra, a scalar product can be defined by setting
This scalar product is symmetric and bilinear. (Can you show that it is independent of the representation?)
The corresponding bilinear form is also called the Killing form, after the German mathematician Wilhelm
Killing (1847–1923), the discoverer of the ‘exceptional’ Lie groups. The Killing form is invariant under the
action of any automorphism of the Lie algebra L. In a given basis, one has
Lie superalgebras
Lie superalgebras arise when the concept of Lie algebra is extended to the case of super-
symmetry. A Lie superalgebra contains even and odd elements; the even elements cor-
respond to bosons and the odd elements to fermions. Supersymmetry applies to systems
with anticommuting coordinates. Lie superalgebras are a natural generalization of Lie
algebras to supersymmetry, and simply add a Z2 -grading.
In detail, a Lie superalgebra is a Z2 -graded algebra over a field of characteristic 0 –
usually R or C – with a product [., .], called the (Lie) superbracket or supercommutator,
that has the properties
where x, y and z are algebra elements that are ‘pure’ in the Z2 -grading. The expression
SxS denotes the degree of the algebra element x and is either 0 or 1. The second condition
is sometimes called the ‘super Jacobi identity’. Obviously, the even subalgebra of a Lie
superalgebra forms a Lie algebra; in that case the superbracket becomes the usual Lie
bracket.
As in the case of Lie algebras, the simple Lie superalgebras have been completely clas-
sified. They fall into five infinite classes and some special cases, namely
— A(n, m) corresponding to the Lie supergroups SL(n + 1Sm + 1) and their compact
‘cousins’ SU(NSM);
— B(n, m) corresponding to the Lie supergroups OSp(2n + 1S2m);
— D(n, m) corresponding to the Lie supergroups OSp(2nS2m);
— P(n);
— Q(n);
— the exceptional cases D α (2, 1), G(3), F(4);
— and finally the Cartan superalgebras.
where the number c, which may be zero, is called the central charge, and the factor 1~12 is
introduced by historical convention. This rather specific algebra is important in physics
because it is the algebra of conformal symmetries in two dimensions.* Can you find a rep-
Challenge 1614 ny resentation in terms of infinite square matrices? Mathematically speaking, the Virasoro
algebra is a special case of a Kac–Moody algebra.
Page 336 * Note that, in contrast, the conformal symmetry group in four dimensions has 15 parameters, and thus its
Lie algebra is finite (15) dimensional.
topology – what shapes exist? 1229
Kac–Moody algebras
In physics, more general symmetries than Lie groups also appear. This happens in particu-
lar when general relativity is taken into account. Because of the symmetries of space-time,
the number of generators becomes infinite. The concepts of Lie algebra and of superal-
gebra have to be extended to take space-time symmetry into account. The corresponding
algebras are the Kac–Moody algebras. (‘Kac’ is pronounced like ‘Katz’.) Kac–Moody al-
gebras are a particular class of infinite-parameter Lie algebras; thus they have an infinite
number of generators. They used to be called associated affine algebras or affine Lie al-
gebras; sometimes they are also called Z-graded Lie algebras. They were introduced inde-
pendently in 1968 by Victor Kac and Robert Moody. We present some specific examples.
Of basic physical importance are those Kac–Moody algebras associated to a symmetry
(n)
group G a C[t], where t is some continuous parameter. The generators M a of the cor-
responding algebra have two indices, one, a, that indexes the generators of the group G
and another, n, that indexes the generators of the group of Laurent polynomials C[t]. The
For example, if we take G as SU(2) with its generators T n = σ a ~2i, σ a being the Pauli
spin matrices, then the objects
= Ta a t n , =
(n) 1 tn 0
M (n)
a e.g. M3 (866)
2i 0 −t n
form a Kac–Moody algebra with the structure constants C abc = ε abc of SU(2).
In a simplified view of topology that is sufficient for physicists, only one type of entity ”
can possess shape: manifolds. Manifolds are generalized examples of pullovers: they are
locally flat, can have holes, boundaries and can often be turned inside out.
Copyright © Christoph Schiller November 1997–January 2007
Pullovers are subtle entities. For example, can you turn your pullover inside out while
Challenge 1615 n your hands are tied together? (A friend may help you.) By the way, the same feat is also
possible with your trousers, while your feet are tied together. Certain professors like to
demonstrate this during topology lectures – of course with a carefully selected pair of
underpants.
Topological spaces
Ref. 1259 The study of shapes requires a good definition of a set made of ‘points’. To be able to talk
about shape, these sets must be structured in such a way as to admit a useful concept
of ‘neighbourhood’ or ‘closeness’ between the elements of the set. The search for the
most general type of set which allows a useful definition of neighbourhood has led to
1230 d numbers and spaces
Of the many special kinds of topological spaces that have been studied, one type is par-
ticularly important. A Hausdorff space is a topological space in which for any two points
x and y there are disjoint open sets U and V such that x is in U and y is in V . A Haus-
dorff space is thus a space where, no matter how ‘close’ two points are, they can always
Copyright © Christoph Schiller November 1997–January 2007
be separated by open sets. This seems like a desirable property; indeed, non-Hausdorff
spaces are rather tricky mathematical objects. (At Planck energy, it seems that vacuum
appears to behave like a non-Hausdorff space; however, at Planck energy, vacuum is not
really a space at all. So non-Hausdorff spaces play no role in physics.) A special case of
Hausdorff space is well-known: the manifold.
* The Cauchy-Weierstass definition of continuity says that a real function f (x) is continuous at a point a if
(1) f is defined on an open interval containing a, (2) f (x) tends to a limit as x tends to a, and (3) the limit is
f (a). In this definition, the continuity of f is defined using the intuitive idea that the real numbers form the
Challenge 1617 ny basic model of a set that has no gaps. Can you see the connection with the general definition given above?
topology – what shapes exist? 1231
Manifolds
In physics, the most important topological spaces are differential manifolds. Loosely
speaking, a differential manifold is a set of points that looks like Rn under the micro-
scope – at small distances. For example, a sphere and a torus are both two-dimensional
differential manifolds, since they look locally like a plane. Not all differential manifolds
are that simple, as the examples of Figure 422 show.
A differential manifold is called connected if any two points can be joined by a path
lying in the manifold. (The term has a more general meaning in topological spaces. But
the notions of connectedness and pathwise connectedness coincide for differential mani-
folds.) We focus on connected manifolds in the following discussion. A manifold is called
simply connected if every loop lying in the manifold can be contracted to a point. For ex-
ample, a sphere is simply connected. A connected manifold which is not simply connec-
ted is called multiply connected. A torus is multiply connected.
Manifolds can be non-orientable, as the well-known Möbius strip illustrates. Non-
orientable manifolds have only one surface: they do not admit a distinction between front
Challenge 1618 e and back. If you want to have fun, cut a paper Möbius strip into two along a centre line.
You can also try this with paper strips with different twist values, and investigate the reg- Copyright © Christoph Schiller November 1997–January 2007
ularities.
In two dimensions, closed manifolds (or surfaces), i.e. surfaces that are compact and
without boundary, are always of one of three types:
— The simplest type are spheres with n attached handles; they are called n-tori or surfaces
of genus n. They are orientable surfaces with Euler characteristic 2 − 2n.
— The projective planes with n handles attached are non-orientable surfaces with Euler
characteristic 1 − 2n.
— The Klein bottles with n attached handles are non-orientable surfaces with Euler char-
acteristic −2n.
Therefore Euler characteristic and orientability describe compact surfaces up to
homeomorphism (and if surfaces are smooth, then up to diffeomorphism). Homeo-
1232 d numbers and spaces
In other words, through homotopy and homology theory, mathematicians can clas- Copyright © Christoph Schiller November 1997–January 2007
sify manifolds. Given two manifolds, the properties of the holes in them thus determine
whether they can be deformed into each other.
Physicists are now extending these results of standard topology. Deformation is a clas-
sical idea which assumes continuous space and time, as well as arbitrarily small action.
In nature, however, quantum effects cannot be neglected. It is speculated that quantum
effects can transform a physical manifold into one with a different topology: for example,
Challenge 1619 d a torus into a sphere. Can you find out how this can be achieved?
Topological changes of physical manifolds happen via objects that are generalizations
of manifolds. An orbifold is a space that is locally modelled by Rn modulo a finite group.
Examples are the tear-drop or the half-plane. Orbifolds were introduced by Satake Ichiro
in 1956; the name was coined by William Thurston. Orbifolds are heavily studied in string
theory.
1234 d numbers and spaces
808 017 424 794 512 875 886 459 904 961 710 757 005 754 368 000 000 000 or 246 ë 320 ë 59 ë
76 ë 112 ë 133 ë 17 ë 19 ë 23 ë 29 ë 31 ë 41 ë 47 ë 59 ë 71.)
Of the infinite groups, only those with some finiteness condition have been studied. It
is only such groups that are of interest in the description of nature. Infinite groups are
divided into discrete groups and continuous groups. Discrete groups are an active area
of mathematical research, having connections with number theory and topology. Con-
tinuous groups are divided into finitely generated and infinitely generated groups. Finitely
generated groups can be finite-dimensional or infinite-dimensional.
The most important class of finitely generated continuous groups are the Lie groups.
types and cl assification of groups 1235
Lie groups
In nature, the Lagrangians of the fundamental forces are invariant under gauge trans-
formations and under continuous space-time transformations. These symmetry groups
are examples of Lie groups, which are a special type of infinite continuous group. They
are named after the great Norwegian mathematician Sophus Lie (1849–1899). His name
is pronounced like ‘Lee’.
A (real) Lie group is an infinite symmetry group, i.e. a group with infinitely many ele-
ments, which is also an analytic manifold. Roughly speaking, this means that the elements
of the group can be seen as points on a smooth (hyper-) surface whose shape can be de-
scribed by an analytic function, i.e. by a function so smooth that it can be expressed as
a power series in the neighbourhood of every point where it is defined. The points of
the Lie group can be multiplied according to the group multiplication. Furthermore, the
coordinates of the product have to be analytic functions of the coordinates of the factors,
and the coordinates of the inverse of an element have to be analytic functions of the co-
A = UeH . (867)
The simple Lie groups U(1) and SO(2,R) and the Lie groups based on the real and
complex numbers are abelian (see Table 93); all others are non-abelian.
Lie groups are manifolds. Therefore, in a Lie group one can define the distance between
two points, the tangent plane (or tangent space) at a point, and the notions of integration
and differentiations. Because Lie groups are manifolds, Lie groups have the same kind of
1236 d numbers and spaces
structure as the objects of Figures 422, 423 and 424. Lie groups can have any number of
dimensions. Like for any manifold, their global structure contains important information;
let us explore it.
Connectedness
It is not hard to see that the Lie groups SU(N) are simply connected for all N = 2, 3 . . . ;
they have the topology of a 2N-dimensional sphere. The Lie group U(1), having the topo-
logy of the 1-dimensional sphere, or circle, is multiply connected.
The Lie groups SO(N) are not simply connected for any N = 2, 3 . . . . In general,
SO(N,K) is connected, and GL(N,C) is connected. All the Lie groups SL(N,K) are connec-
ted; and SL(N,C) is simply connected. The Lie groups Sp(N,K) are connected; Sp(2N,C)
is simply connected. Generally, all semi-simple Lie groups are connected.
The Lie groups O(N,K), SO(N,M,K) and GL(N,R) are not connected; they contain two
connected components.
Compactness
A Lie group is compact if it is closed and bounded when seen as a manifold. For a given
parametrization of the group elements, the Lie group is compact if all parameter ranges
are closed and finite intervals. Otherwise, the group is called non-compact. Both compact
and non-compact groups play a role in physics. The distinction between the two cases is
important, because representations of compact groups can be constructed in the same
simple way as for finite groups, whereas for non-compact groups other methods have
to be used. As a result, physical observables, which always belong to a representation of
a symmetry group, have different properties in the two cases: if the symmetry group is
compact, observables have discrete spectra; otherwise they do not.
All groups of internal gauge transformations, such as U(1) and SU(n), form compact
groups. In fact, field theory requires compact Lie groups for gauge transformations. The
only compact Lie groups are Tn , O(n), U(n), SO(n) and SU(n), their double cover
Spin(n) and the Sp(n). In contrast, SL(n,R), GL(n,R), GL(n,C) and all others are not Copyright © Christoph Schiller November 1997–January 2007
compact.
Besides being manifolds, Lie groups are obviously also groups. It turns out that most
of their group properties are revealed by the behaviour of the elements which are very
close (as points on the manifold) to the identity.
Every element of a compact and connected Lie group has the form exp(A) for some A.
The elements A arising in this way form an algebra, called the corresponding Lie algebra.
For any linear Lie group, every element of the connected subgroup can be expressed as a
finite product of exponentials of elements of the corresponding Lie algebra. In short, Lie
algebras express the local properties of Lie groups. That is the reason for their importance
Page 1225 in physics.
types and cl assification of groups 1237
and to so(3)
GL(n, R) general linear not connected, not M(n, R) n-by-n matrices, with n 2
group: compact; π 0 = Z2 , Lie bracket the
invertible no π 1 commutator
n-by-n real
matrices
n-by-n real simply connected, M(n, R) n-by-n matrices, with n 2
GL+ (n, R) matrices with not compact; π 0 = 0, Lie bracket the
positive for n = 2: π 1 = Z, for commutator
determinant n C 2: π 1 = Z2 ;
GL+ (1, R)
isomorphic to RA0
1238 d numbers and spaces
a. The group of components π 0 of a Lie group is given; the order of π 0 is the number of components of the
Lie group. If the group is trivial (0), the Lie group is connected. The fundamental group π 1 of a connected
Lie group is given. If the group π 1 is trivial (0), the Lie group is simply connected. This table is based on that
in the Wikipedia, at http://en.wikipedia.org/wiki/Table_of_Lie_groups.
b. The standard skew-symmetric matrix J of rank 2n is J k l = δ k,n+l − δ k+n, l .
c. Complex Lie groups and Lie algebras can be viewed as real Lie groups and real Lie algebras of twice the
dimension.
**
Mathematics provides many counter-intuitive results. Reading a book on the topic, such
as Bernard R. Gelbaum & John M.H. Olmsted, Theorems and Counter-examples
in Mathematics, Springer, 1993, can help you sharpen your mind.
**
The distinction between one, two and three dimensions is blurred in mathematics. This is
well demonstrated in the text Hans Sagan, Space Filling Curves, Springer Verlag, 1994.
**
Show that two operators A and B obey
e A eB = exp(A + B + [A, B]
1
(868)
2
for most operators A and B. This result is often called the Baker–Campbell–Hausdorff
formula or BCH formula.
**
There are at least seven ways to earn a million dollars with mathematical research. The
Clay Mathematics Institute at http://www.claymath.org offers them for major advances
in seven topics:
— proving the Birch and Swinnerton–Dyer conjecture about algebraic equations;
— proving the Poincaré conjecture about topological manifolds;
— solving the Navier–Stokes equations for fluids;
— finding criteria distinguishing P and NP numerical problems;
— proving the Riemann hypothesis stating that the nontrivial zeros of the zeta function
lie on a line;
— proving the Hodge conjectures;
Copyright © Christoph Schiller November 1997–January 2007
— proving the connection between Yang–Mills theories and a mass gap in quantum field
theory.
On each of these topics, substantial progress can buy you a house.
**
Challenge 1621 ny A theorem of topology says: you cannot comb a hairy football. Can you proove it?
Motion Mountain – The Adventure of Physics available free of charge at www.motionmountain.net Copyright © Christoph Schiller November 1997–January 2007
d numbers and spaces
”
Samuel Johnson
No man but a blockhead ever wrote except for
money.
“
1242
bibliography 1243
Biblio graphy
I
n the text, outstanding books introducing neighbouring domains are presented
n footnotes. The bibliographies at the end of each chapter collect general material
n order to satisfy further curiosity about what is encountered in this mountain ascent.
”
All citations can also be found by looking up the author’s name in the index. To find
additional information, either libraries or the internet can help.
In a library, review articles of recent research appear in journals such as Reviews of
Modern Physics, Reports on Progress in Physics, Contemporary Physics and Advances in
Physics. Good pedagogical introductions are found in the American Journal of Physics,
the European Journal of Physics and Physik in unserer Zeit. Another useful resource is
Living Reviews in Relativity, found at http://www.livingreviews.org.
Overviews on research trends occasionally appear in magazines such as Physics
World, Physics Today, Europhysics Journal, Physik Journal and Nederlands tijdschrift voor
natuurkunde. For coverage of all the sciences together, the best sources are the magazines
Nature, New Scientist, Naturwissenschaften, La Recherche and the cheap but excellent Sci-
ence News.
Research papers appear mainly in Physics Letters B, Nuclear Physics B, Physical Review
D, Physical Review Letters, Classical and Quantum Gravity, General Relativity and Grav- Copyright © Christoph Schiller November 1997–January 2007
itation, International Journal of Modern Physics and Modern Physics Letters. The newest
results and speculative ideas are found in conference proceedings, such as the Nuclear
Physics B Supplements. Research articles also appear in Fortschritte der Physik, Zeitschrift
für Physik C, La Rivista del Nuovo Cimento, Europhysics Letters, Communications in Math-
ematical Physics, Journal of Mathematical Physics, Foundations of Physics, International
Journal of Theoretical Physics and Journal of Physics G. There is also the purely electronic
New Journal of Physics, which can be found at the http://www.njp.org website.
Papers on the description of motion without time and space which appear after this
text is published can be found via the Scientific Citation Index. It is published in printed
form and as compact disc, and allows, one to search for all publications which cite a given
* Ivan Illich (b. 1926 Vienna, d. 2002 Bremen), Austrian theologian and social and political thinker.
e sources of information on motion 1245
To p i c A b b r e v i at i o n
paper. Then, using the bimonthly Physics Abstracts, which also exists in both paper and
electronic form, you can look up the abstract of the paper and check whether it is of
interest.
But by far the simplest and most efficient way to keep in touch with ongoing research
on motion is to use the internet, the international computer network. To anybody with
a personal computer connected to a telephone, most theoretical physics papers are avail-
able free of charge, as preprints, i.e. before official publication and checking by referees, at
Copyright © Christoph Schiller November 1997–January 2007
the http://arxiv.org website. Details are given in Table 94. A service for finding subsequent
preprints that cite a given one is also available.
In the last decade of the twentieth century, the internet expanded into a combination
of library, media store, discussion platform, order desk, brochure collection and time
waster. Today, commerce, advertising and – unfortunately – crime of all kind are also an
integral part of the web. With a personal computer, a modem and free browser software,
one can look for information in millions of pages of documents. The various parts of the
documents are located in various computers around the world, but the user does not need
to be aware of this.*
* Several decades ago, the provocative book by Ivan Illich, Deschooling Society, Harper & Row, 1971,
listed four basic ingredients for any educational system:
1246 e sources of information on motion
To start using the web, ask a friend who knows.* Searching the web for authors, organ-
izations, books, publications, companies or simple keywords using search engines can
be a rewarding or a time-wasting experience. A selection of interesting servers are given
below.
TA B L E 95 Some interesting servers on the world-wide web
To p i c W e b s i t e a d d r e s s o r ‘ URL ’
General topics
Wikipedia http://www.wikipedia.org
Information search http://www.altavista.com
engines http://www.metager.de
http://www.google.com
http://www.yahoo.com
1. access to resources for learning, e.g. books, equipment, games, etc. at an affordable price, for everybody,
at any time in their life;
2. for all who want to learn, access to peers in the same learning situation, for discussion, comparison,
cooperation and competition;
3. access to elders, e.g. teachers, for their care and criticism towards those who are learning;
4. exchanges between students and performers in the field of interest, so that the latter can be models for
Copyright © Christoph Schiller November 1997–January 2007
the former. For example, there should be the possibility to listen to professional musicians and reading the
works of specialist writers. This also gives performers the possibility to share, advertise and use their skills.
Illich develops the idea that if such a system were informal – he then calls it a ‘learning web’ or ‘opportun-
ity web’ – it would be superior to formal, state-financed institutions, such as conventional schools, for the
development of mature human beings. These ideas are deepened in his following works, Deschooling Our
Lives, Penguin, 1976, and Tools for Conviviality, Penguin, 1973.
Today, any networked computer offers one or more of the following: email (electronic mail), FTP (file
transfer to and from another computer), access to Usenet (the discussion groups on specific topics, such
as particle physics), and the powerful world-wide web. (Roughly speaking, each of those includes the ones
before.) In a rather unexpected way, all these facilities of the internet have transformed it into the backbone
of the ‘opportunity web’ discussed by Illich. However, as in any school, it strongly depends on the user’s
discipline whether the internet actually does provide a learning web.
* It is also possible to use both the internet and to download files through FTP with the help of email only.
But the tools change too often to give a stable guide here. Ask your friend.
e sources of information on motion 1247
To p i c W e b s i t e a d d r e s s o r ‘ URL ’
http://www.slac.stanford.edu/spires
Particle data http://pdg.web.cern.ch/pdg
Physics news, weekly http://www.aip.org/physnews/update
Physics news, daily http://www.innovations-report.de/berichte/physik.php
Physics problems by Yakov http://star.tau.ac.il/QUIZ
Kantor
Physics problems by Henry http://www.phy.duke.edu/~hsg/physics-challenges/challenges.
Greenside html
Physics ‘question of the week’ http://www.physics.umd.edu/lecdem/outreach/QOTW/active
Physics ‘miniproblem’ http://www.nyteknik.se/miniproblemet
Official SI unit site http://www.bipm.fr
Unit conversion http://www.chemie.fu-berlin.de/chemistry/general/units.html
To p i c W e b s i t e a d d r e s s o r ‘ URL ’
Mathematics
‘Math forum’ internet resource http://mathforum.org/library
collection
Biographies of mathematicians http://www-history.mcs.st-andrews.ac.uk/BiogIndex.html
Purdue math problem of the http://www.math.purdue.edu/academics/pow
week
Macalester College maths http://mathforum.org/wagon
problem of the week
Mathematical formulae http://dlmf.nist.gov
Functions http://functions.wolfram.com
Symbolic integration http://www.integrals.com
Weisstein’s World of http://mathworld.wolfram.com
To p i c W e b s i t e a d d r e s s o r ‘ URL ’
Pseudoscience http://suhep.phy.syr.edu/courses/modules/PSEUDO/pseudo_
main.html
Crackpots http://www.crank.net
Mathematical quotations http://math.furman.edu/mwoodard/~mquot.html
The ‘World Question Center’ http://www.edge.org/questioncenter.html
Plagiarism http://www.plagiarized.com
Hoaxes http://www.museumofhoaxes.com
Do you want to study physics without actually going to university? Nowadays it is possible
to do so via email and internet, in German, at the University of Kaiserslautern.* In the
near future, a nationwide project in Britain should allow the same for English-speaking
students. As an introduction, use the latest update of this physics text!
“ geschlossenen Anstalt.**
Matthias Deutschmann
“ answer.
John Wheeler’s motto
John Wheeler wanted people to estimate, to try and to guess; but not saying it out loud. ”
Challenge 2, page 23: These topics are all addressed later in the text.
Challenge 1, page 21: Do not hesitate to be demanding and strict. The next version will benefit
from it.
Challenge 3, page 30: There are many ways to distinguish real motion from an illusion of mo-
tion: for example, only real motion can be used to set something else into motion. In addition,
the motion illusions of the figures show an important failure; nothing moves if the head and the
paper remain fixed with respect to each other. In other words, the illusion only amplifies existing
motion, it does not create motion from nothing.
Challenge 4, page 30: Without detailed and precise experiments, both sides can find examples
to prove their point. Creation is supported by the appearance of mould or bacteria in a glass
of water; creation is also supported by its opposite, namely traceless disappearance, such as the
disappearance of motion. However, conservation is supported and creation falsified by all those
investigations that explore assumed cases of appearance or disappearance in full detail.
Challenge 6, page 32: Political parties, sects, helping organizations and therapists of all kinds
are typical for this behaviour.
Challenge 7, page 36: The issue is not yet completely settled for the motion of empty space, Copyright © Christoph Schiller November 1997–January 2007
such as in the case of gravitational waves. In any case, empty space is not made of small particles
of finite size, as this would contradict the transversality of gravity waves.
Challenge 8, page 38: The circular definition is: objects are defined as what moves with respect
to the background, and the background is defined as what stays when objects change. We shall
return to this important issue several times in our adventure. It will require a certain amount of
patience to solve it, though.
Challenge 9, page 39: Holes are not physical systems, because in general they cannot be
tracked.
Challenge 10, page 39: See page 835.
Challenge 11, page 40: A ghost can be a moving image; it cannot be a moving object, as objects
cannot interpenetrate. See page 805.
Challenge 12, page 40: Hint: yes, there is such a point.
f challenge hints and solu tions 1251
Challenge 13, page 40: If something could stop moving, motion could disappear into nothing.
For a precise proof, one would have to show that no atom moves any more. So far, this has never
been observed: motion is conserved. (Nothing in nature can disappear into nothing.)
Challenge 14, page 40: This would indeed mean that space is infinite; however, it is impossible
to observe that something moves “for ever”: nobody lives that long.
Challenge 15, page 40: How would you measure this?
Challenge 16, page 40: The number of reliable digits of a measurement result is a simple quan-
tification of precision. More details can be found by looking up‘standard deviation’ in the index.
Challenge 17, page 40: No; memory is needed for observation and measurements. This is the
case for humans and measurement apparatus. Quantum theory will make this particularly clear.
Challenge 18, page 41: Note that you never have observed zero speed. There is always some
measurement error which prevents one to say that something is zero. No exceptions!
Challenge 19, page 41: (264 −1) = 18 446 744 073 700 551 615 grains of rice, given a world harvest
Discorsi e dimostrazioni matematiche intorno a due nuove scienze attenenti alla mecanica e i mo-
vimenti locali, usually simply called the ‘Discorsi’, which he published in 1638 with Louis Elsevier
in Leiden, in the Netherlands.
Challenge 37, page 48: There is no way to define a local time at the poles that is consistent
with all neighbouring points. (For curious people, check the website http://www.arctic.noaa.gov/
gallery_np.html.)
Challenge 39, page 50: The forest is full of light and thus of light rays: they are straight, as
shown by the sunbeams in Figure 427.
Challenge 40, page 50: One pair of muscles moves the lens along the third axis by deforming
the eye from prolate to spherical to oblate.
Challenge 41, page 51: This you can solve trying to think in four dimensions. Try to imagine
how to switch the sequence when two pieces cross. Note: it is usually not correct, in this domain,
f challenge hints and solu tions 1253
description over the other, except possibly for reasons of simplicity or intellectual laziness.
Challenge 53, page 58: A hint is given in Figure 428. For the measurement of the speed of light
with almost the same method, see page 286.
Challenge 54, page 58: Colour is a property that applies only to objects, not to boundaries. The
question shows that it is easy to ask questions that make no sense also in physics.
Challenge 55, page 58: You can do this easily yourself. You can even find websites on the topic.
Challenge 70, page 60: The required gap is
¼ »
= (L − b)2 − w 2 + 2w R 2 − (L − b)2 − L + b , (872)
paper discs
d b
Challenge 72, page 60: The first solution sent in will go here.
Challenge 56, page 58: Clocks with two hands: 22 times. Clocks with three hands: 2 times.
Challenge 57, page 58: For two hands, the answer is 143 times.
Challenge 58, page 59: The Earth rotates with 15 minutes per minute.
Challenge 59, page 59: You might be astonished, but no reliable data exist on this question. The
highest speed of a throw measured so far seems to be a 45 m~s cricket bowl. By the way, much
more data are available for speeds achieved with the help of rackets. The c. 70 m~s of fast badmin-
ton smashes seem to be a good candidate for record racket speed; similar speeds are achieved by
golf balls.
Challenge 60, page 59: Yes, it can. In fact, many cats can slip through as well.
Copyright © Christoph Schiller November 1997–January 2007
1°
3° 2° 3°
10°
F I G U R E 432 The angles defined by the hands against the sky, when the arms
are extended
St. Louis arch is in shape of a catenary. A suspension bridge has the shape of a catenary before it
is loaded, i.e. before the track is attached to it. When the bridge is finished, the shape is in between
a catenary and a parabola.
Challenge 68, page 60: A limit does not exist in classical physics; however, there is one in nature
which appears as soon as quantum effects are taken into account.
Challenge 84, page 63: The inverse radii, or curvatures, obey a 2 + b 2 + c 2 + d 2 = (1~2)(a + b +
c + d)2 . This formula was discovered by René Descartes. If one continues putting circles in the
remaining spaces, one gets so-called circle packings, a pretty domain of recreational mathematics.
They have many strange properties, such as intriguing relations between the coordinates of the
circle centres and their curvatures.
Challenge 85, page 64: There are two solutions. (Why?) They are the two positive solutions of
l 2 = (b + x)2 + (b + b 2 ~x)2 ; the height is then given as h = b + x. The two solutions are 4.84 m
and 1.26 m. There are closed formulas for the solutions; can you find them?
Challenge 86, page 64: The best way is to calculate first the height B at which the blue ladder
touches the wall. It is given as a solution of B 4 −2hB 3 −(r 2 −b 2 )B 2 +2h(r 2 −b 2 )B−h 2 (r 2 −b 2 ) = 0.
Challenge 92, page 64: Problems appear when quantum effects are added. A two-dimensional
universe would have no matter, since matter is made of spin 1/2 particles. But spin 1/2 particles
do not exist in two dimensions. Can you find other reasons?
Challenge 93, page 65: Two dimensions do not allow ordering of events. To say ‘before’ and
‘afterwards’ becomes impossible. In everyday life and all domains accessible to measurement, time
is surely one-dimensional.
Challenge 94, page 66: From x = дt 2 ~2 you get the following rule: square the number of
Copyright © Christoph Schiller November 1997–January 2007
Challenge 98, page 67: Neglecting air resistance and approximating the angle by 45°, we get v =
»
d д , or about 3.8 m~s. This speed is created by a stead pressure build-up, using blood pressure,
which is suddenly released with a mechanical system at the end of the digestive canal. The cited
reference tells more about the details.
Challenge 99, page 67: On horizontal ground, for a speed v and an angle from the horizontal
α, neglecting air resistance and the height of the thrower, the distance d is d = v 2 sin 2α~д.
Challenge 100, page 68: Walk or run in the rain, measure your own speed v and the angle from
the vertical α with which the rain appears to fall. Then the speed of the rain is v rain = v~ tan α.
Challenge 101, page 68: Check your calculation with the information that the 1998 world re-
cord is juggling with 9 balls.
Challenge 102, page 68: The long jump record could surely be increased by getting rid of the
sand stripe and by measuring the true jumping distance with a photographic camera; that would
allow jumpers to run more closely to their top speed. The record could also be increased by a
small inclined step or by a spring-suspended board at the take-off location, to increase the take-
Challenge 111, page 72: A candidate for low acceleration of a physical system might be the
accelerations measured by gravitational wave detectors. They are below 10−13 m~s2 .
Challenge 113, page 73: One can argue that any source of light must have finite size.
Challenge 115, page 73: What the unaided human eye perceives as a tiny black point is usually
about 50 µm in diameter.
Challenge 116, page 73: See page 583.
Challenge 117, page 73: One has to check carefully whether the conceptual steps that lead us
to extract the concept of point from observations are correct. It will be shown in the third part of
the adventure that this is not the case.
Challenge 118, page 74: One can rotate the hand in a way that the arm makes the motion de-
scribed. See also page 804.
1258 f challenge hints and solu tions
definition of mass?
Challenge 129, page 81: The weight decreased due to the evaporated water lost by sweating and,
to a minor degree, due to the exhaled carbon bound in carbon dioxide.
Challenge 130, page 81: Rather than using the inertial effects of the Earth, it is easier to deduce
its mass from its gravitational effects. See challenge 243.
Challenge 134, page 83: At first sight, relativity implies that tachyons have imaginary mass;
however, the imaginary factor can be extracted from the mass–energy and mass–momentum
relation, so that one can define a real mass value for tachyons; as a result, faster tachyons have
smaller energy and smaller momentum. Both momentum and energy can be a negative number
of any size.
Challenge 135, page 83: Legs are never perfectly vertical; they would immediately glide away.
Once the cat or the person is on the floor, it is almost impossible to stand up again.
f challenge hints and solu tions 1259
Challenge 136, page 83: Momentum (or centre of mass) conservation would imply that the en-
vironment would be accelerated into the opposite direction. Energy conservation would imply
that a huge amount of energy would be transferred between the two locations, melting everything
in between. Teleportation would thus contradict energy and momentum conservation.
Challenge 137, page 84: The part of the tides due to the Sun, the solar wind, and the interac-
tions between both magnetic fields are examples of friction mechanisms between the Earth and
the Sun.
Challenge 138, page 85: With the factor 1~2, increase of (physical) kinetic energy is equal to
the (physical) work performed on a system: total energy is thus conserved only if the factor 1/2 is
added.
Challenge 140, page 86: It is a smart application of momentum conservation.
Challenge 141, page 86: Neither. With brakes on, the damage is higher, but still equal for both
cars.
Challenge 142, page 87: Heating systems, transport engines, engines in factories, steel plants,
Challenge 154, page 92: Yes, the ape can reach the banana. The ape just has to turn around its
own axis. For every turn, the plate will rotate a bit towards the banana. Of course, other methods,
like blowing at a right angle to the axis, peeing, etc., are also possible.
Challenge 156, page 93: The points that move exactly along the radial direction of the wheel
form a circle below the axis and above the rim. They are the points that are sharp in Figure 40 of
page 93.
Challenge 157, page 93: Use the conservation of angular momentum around the point of con-
tact. If all the wheel’s mass is assumed in the rim, the final rotation speed is half the initial one; it
is independent of the friction coefficient.
Challenge 160, page 97: The Coriolis effect can be seen as the sum two different effects of equal
magnitude. The first effect is the following: on a rotating background, velocity changes time. What
an inertial (nonrotating) observer sees as a constant velocity will be seen a velocity changing in
1260 f challenge hints and solu tions
time by the rotating observer. The acceleration seen by the rotating observer is negative, and
proportional to the angular velocity and to the velocity.
The second effect is change of velocity in space. In a rotating frame of reference, different
points have different velocities. The effect is negative, and proportional to the angular velocity
and to the velocity.
In total, the Coriolis acceleration (or Coriolis effect) is thus aC = −2ω v.
Challenge 161, page 97: A short pendulum of length L that swings in two dimensions (with
amplitude ρ and orientation φ) shows two additional terms in the Lagrangian L:
1 ρ2 l2 1 ρ2
L = T − V = m ρ̇ 2 (1 + 2 ) + z 2 − mω 20 ρ 2 (1 + ) (874)
2 L 2mρ 2 4 L2
where as usual the basic frequency is ω 20 = д~L and the angular momentum is l z = mρ 2 φ̇. The
two additional terms disappear when L ª; in that case, if the system oscillates in an ellipse
with semiaxes a and b, the ellipse is fixed in space, and the frequency is ω 0 . For finite pendulum
a2 + b2
ω = ω 0 (1 − ) ; (875)
16 L 2
most of all, the ellipse turns with a frequency
3 ab
Ω=ω . (876)
8 L2
(These formulae can be derived using the least action principle, as shown by C.G. Gray, G.
Karl & V.A. Novikov, Progress in classical and quantum variational principles, http://arxiv.
org/abs/physics/0312071.) In other words, a short pendulum in elliptical motion shows a preces-
sion even without the Coriolis effect. Since this precession frequency diminishes with 1~L 2 , the
effect is small for long pendulums, where only the Coriolis effect is left over. To see the Coriolis
effect in a short pendulum, one thus has to avoid that it starts swinging in an elliptical orbit by
adding a suppression method of elliptical motion.
Challenge 162, page 98: The Coriolis acceleration is the reason for the deviation from the
straight line. The Coriolis acceleration is due to the change of speed with distance from the rota-
tion axis. Now think about a pendulum, located in Paris, swinging in the North-South direction
with amplitude A. At the Southern end of the swing, the pendulum is further from the axis by
A sin φ, where φ is the latitude. At that end of the swing, the central support point overtakes
the pendulum bob with a relative horizontal speed given by v = 2πA sin φ~24 h. The period of
precession is given by TF = v~2πA, where 2πA is the circumference 2πA of the envelope of the
Copyright © Christoph Schiller November 1997–January 2007
light
source
Ωt
t=0 t = 2πR~c
Challenge 167, page 99: When rotated by π around an east–west axis, the Coriolis force pro-
duces a drift velocity of the liquid around the tube. It has the value
Challenge 171, page 105: The galaxy forms a stripe in the sky. The galaxy is thus a flattened
structure. This is even clearer in the infrared, as shown more clearly in Figure 211 on page 447.
From the flattening (and its circular symmetry) we can deduce that the galaxy must be rotating.
Thus other matter must exist in the universe.
Challenge 172, page 106: Probably the ‘rest of the universe’ was meant by the writer. Indeed, a
moving a part never shifts the centre of gravity of a closed system. But is the universe closed? Or
a system? The third part of the adventure centres on these issues.
Challenge 173, page 106: Hint: an energy per distance is a force.
Challenge 176, page 106: The scale reacts to your heartbeat. The weight is almost constant over
time, except when the heart beats: for a short duration of time, the weight is somewhat lowered
at each beat. Apparently it is due to the blood hitting the aortic arch when the heart pumps it
upwards. The speed of the blood is about 0.3 m~s at the maximum contraction of the left ventricle.
1262 f challenge hints and solu tions
The distance to the aortic arch is a few centimetres. The time between the contraction and the
reversal of direction is about 15 ms.
Challenge 177, page 106: The conservation of angular momentum saves the glass. Try it.
Challenge 178, page 107: The mass decrease could also be due to expelled air. The issue is still
open.
Challenge 179, page 107: Assuming a square mountain, the height h above the surrounding
crust and the depth d below are related by
=
h ρm − ρc
(880)
d ρc
where ρ c is the density of the crust and ρ m is the density of the mantle. For the density values
given, the ratio is 6.7, leading to an additional depth of 6.7 km below the mountain.
Challenge 183, page 108: The behaviour of the spheres can only be explained by noting that
elastic waves propagate through the chain of balls. Only the propagation of these elastic waves, in
Challenge 217, page 114: See the article by C. Ucke & H.-J. Schlichting, Faszinierendes
Dynabee, Physik in unserer Zeit 33, pp. 230–231, 2002.
Challenge 218, page 114: See the article by C. Ucke & H.-J. Schlichting, Die kreisende
Büroklammer, Physik in unserer Zeit 36, pp. 33–35, 2005.
Challenge 219, page 114: Yes. Can you imagine what happens for an observer on the Equator?
Challenge 220, page 114: A straight line at the zenith, and circles getting smaller at both sides.
See an example on the website http://antwrp.gsfc.nasa.gov/apod/ap021115.html.
Challenge 222, page 115: The curve described by the midpoint of a ladder sliding down a wall
is a circle.
Challenge 224, page 116: The plane is described in the websites cited; for a standing human the
plane is the vertical plane containing the two eyes.
1264 f challenge hints and solu tions
Challenge 225, page 116: As said before, legs are simpler than wheels to grow, to maintain and
to repair; in addition, legs do not require flat surfaces (so-called ‘streets’) to work.
Challenge 226, page 117: The staircase formula is an empirical result found by experiment,
used by engineers world-wide. Its origin and explanation seems to be lost in history.
Challenge 227, page 117: Classical or everyday nature is right-left symmetric and thus requires
an even number of legs. Walking on two-dimensional surfaces naturally leads to a minimum of
four legs.
Challenge 229, page 118: The length of the day changes with latitude. So does the length of a
shadow or the elevation of stars at night, facts that are easily checked by telephoning a friend.
Ships appear at the horizon first be showing only their masts. These arguments, together with the
round shadow of the earth during a lunar eclipse and the observation that everything falls down-
wards everywhere, were all given already by Aristotle, in his text On the Heavens. It is now known
that everybody in the last 2500 years knew that the Earth is s sphere. The myth that many people
Challenge 230, page 118: Robert Peary had forgotten that on the date he claimed to be at the
North Pole, 6th of April 1909, the Sun is very low on the horizon, casting very long shadows, about
ten times the height of objects. But on his photograph the shadows are much shorter. (In fact, the
picture is taken in such a way to hide all shadows as carefully as possible.) Interestingly, he had
even convinced the US congress to officially declare him the first man on the North Pole in 1911.
(A rival had claimed to have reached it earlier on, but his photograph has the same mistake.)
Challenge 231, page 118: Yes, the effect has been measured for skyscrapers. Can you estimate
the values?
Challenge 232, page 119: The tip of the velocity arrow, when drawn over time, produces a circle
around the centre of motion.
Challenge 235, page 120: The value of the product GM for the Earth is 4.0 ë 1014 m3 ~s2 .
Challenge 236, page 120: All points can be reached for general inclinations; but when shoot-
ing horizontally in one given direction, only points on the first half of the circumference can be
reached.
Copyright © Christoph Schiller November 1997–January 2007
Challenge 238, page 121: On the moon, the gravitational acceleration is 1.6 m~s2 , about one
sixth of the value on Earth. The surface values for the gravitational acceleration for the planets
can be found on many internet sites.
Challenge 239, page 121: The Atwood machine is the answer: two almost equal masses m 1 and
m 2 connected by a string hanging from a well-oiled wheel of negligible mass. The heavier one falls
very slowly. Can show that the acceleration a of this ‘unfree’ fall is given by a = д(m 1 − m 2 )~(m 1 +
m 2 )? In other words, the smaller the mass difference is, the slower the fall is.
Challenge 240, page 121: You should absolutely try to understand the origin of this expression.
It allows to understand many essential concepts of mechanics. The idea is that for small amp-
litudes, the acceleration of a pendulum of length l is due to gravity. Drawing a force diagram for
f challenge hints and solu tions 1265
ma = −mд sin α
d2 α
ml = −mд sin α
dt 2
d2 α
l 2 = −д sin α . (881)
dt
For the mentioned small amplitudes (below 15°) we can approximate this to
d2 α
l = −дα . (882)
dt 2
This is the equation for a harmonic oscillation (i.e., a sinusoidal oscillation). The resulting motion
is:
α(t) = A sin(ωt + φ) . (883)
(For arbitrary amplitudes, the formula is much more complex; see the internet or special mech-
anics books for more details.)
Challenge 241, page 121: Walking speed is proportional to l~T, which makes it proportional to
l 1~2 .
Challenge 243, page 122: Cavendish suspended a horizontal handle with a long metal wire. He
then approached a large mass to the handle, avoiding any air currents, and measured how much
the handle rotated.
Challenge 244, page 122: The acceleration due to gravity is a = Gm~r 2 5 nm~s2 for a mass
of 75 kg. For a fly with mass m fly = 0.1 g landing on a person with a speed of v fly = 1 cm~s and
deforming the skin (without energy loss) by d = 0.3 mm, a person would be accelerated by a =
(v 2 ~d)(m fly ~m) = 0.4 µm~s2 . The energy loss of the inelastic collision reduces this value at least
by a factor of ten.
Challenge 246, page 124: The easiest way to see this is to picture gravity as a flux emanating Copyright © Christoph Schiller November 1997–January 2007
form a sphere. This gives a 1~r d−1 dependence for the force and thus a 1~r d−2 dependence of the
potential.
Challenge 248, page 125: Since the paths of free fall are ellipses, which are curves lying in a
plane, this is obvious.
Challenge 250, page 127: The low gravitational acceleration of the Moon, 1.6 m~s2 , implies that
gas molecules at usual temperatures can escape its attraction.
Challenge 252, page 128: A flash of light is sent to the Moon, where several Cat’s-eyes have been
deposited by the Lunakhod and Apollo missions. The measurement precision of the time a flash
take to go and come back is sufficient to measure the Moon’s distance change. For more details,
see challenge 571.
Challenge 259, page 132: This is a resonance effect, in the same way that a small vibration of a
string can lead to large oscillation of the air and sound box in a guitar.
1266 f challenge hints and solu tions
Challenge 261, page 133: The expression for the strength of tides, namely 2GM~d 3 , can be re-
written as (8~3)πGρ(R~d)3 . Now, R~d is roughly the same for Sun and Moon, as every eclipse
shows. So the density ρ must be much larger for the Moon.
Challenge 262, page 134: The total angular momentum of the Earth and the Moon must re-
main constant.
Challenge 268, page 138: The centre of mass of a broom falls with the usual acceleration; the
end thus falls faster.
Challenge 269, page 138: Just use energy conservation for the two masses of the jumper and the
string. For more details, including the comparison of experimental measurements and theory, see
N. Dubel aar & R. Brantjes, De valversnelling bij bungee-jumping, Nederlands tijdschrift
voor natuurkunde 69, pp. 316–318, October 2003.
Challenge 270, page 138: About 1 ton.
Challenge 271, page 138: About 5 g.
Challenge 272, page 139: Your weight is roughly constant; thus the Earth must be round. On a
and the immobile sky at night. We shall find out later that the darkness of the night sky gives a
second argument for the big bang.
Challenge 284, page 140: Due to the plateau, the effective mass of the Earth is larger.
Challenge 285, page 140: The choice is clear once you notice that there is no section of the orbit
which is concave towards the Sun. Can you show this?
Challenge 286, page 141: It would be a black hole; no light could escape. Black holes are dis-
cussed in detail in the chapter on general relativity.
Challenge 287, page 141: A handle of two bodies.
Challenge 290, page 141: Using a maximal jumping height of h =0.5 m on Earth and an estim-
ated asteroid density of ρ =3 Mg~m3 , we get a maximum radius of R 2 = 3дh~4πGρ 703 m.
Challenge 291, page 143: For each pair of opposite shell elements (drawn in yellow), the two
attractions compensate.
f challenge hints and solu tions 1267
Challenge 292, page 143: There is no practical way; if the masses on the shell could move, along
the surface (in the same way that charges can move in a metal) this might be possible, provided
that enough mass is available.
Challenge 293, page 144: Capture of a fluid body if possible if it is split by tidal forces.
Challenge 294, page 144: The tunnel would be an elongated ellipse in the plane of the Equator,
reaching from one point of the Equator to the point at the antipodes. The time of revolution
would not change, compared to a non-rotating Earth. See A.J. Simonson, Falling down a hole
through the Earth, Mathematics Magazine 77, pp. 171–188, June 2004.
Challenge 296, page 144: The centre of mass of the solar system can be as far as twice the radius
from the centre of the Sun; it thus can be outside the Sun.
Challenge 297, page 144: First, during northern summer time the Earth moves faster around
the Sun than during northern winter time. Second, shallow Sun’s orbits on the sky give longer
days because of light from when the Sun is below the horizon.
Challenge 298, page 144: Apart from the visibility of the moon, no such effect has ever been
Challenge 312, page 148: There is no simple answer: the speed depends on the latitude and on
other parameters.
Challenge 313, page 148: The centrifugal force must be equal to the gravitational force. Call
dr~r 2 =
R+l
the constant linear mass density d and the unknown length l. Then we have GMd ∫R
ω 2 d ∫R r dr. This gives GMdl~(R 2 + Rl) = (2Rl + l 2 )ω 2 d~2, yielding l = 0.14 Gm.
R+l
Challenge 317, page 151: In reality muscles keep an object above ground by continuously lifting
and dropping it; that requires energy and work.
Challenge 318, page 151: The electricity consumption of a rising escalator indeed increases
when the person on it walks upwards. By how much?
Challenge 322, page 153: The lack of static friction would avoid that the fluid stays attached to
the body; the so-called boundary layer would not exist. One then would have to wing effect.
1268 f challenge hints and solu tions
Challenge 319, page 151: Knowledge is power. Time is money. Now, power is defined as work
per time. Inserting the previous equations and transforming them yields
money =
work
, (885)
knowledge
which shows that the less you know, the more money you make. That is why scientists have low
salaries.
Challenge 324, page 154: True?
Challenge 327, page 154: From dv~dt = д − v 2 (1~2c w Aρ~m) and using the abbreviation c =
»
1~2c w Aρ, we can solve for v(t) by putting all terms containing the variable » side, all
v on one
terms with t on the other, and integrating on both sides. We get v(t) = дm~c tanh c д~m t.
Challenge 329, page 156: The phase space has 3N position coordinates and 3N momentum co-
ordinates.
to ∂~∂x( 21 m ẋ 2 − V (x)) − d~[∂~∂ ẋ( 21 m ẋ 2 − V (x))] = 0, which is Lagrange’s equation for this
case.
Challenge 357, page 184: Do not despair. Up to now, nobody has been able to imagine a uni-
verse (that is not necessarily the same as a ‘world’) different from the one we know. So far, such
attempts have always led to logical inconsistencies.
Challenge 359, page 184: The two are equivalent since the equations of motion follow from the
principle of minimum action and at the same time the principle of minimum action follows from
the equations of motion.
Challenge 361, page 185: For gravity, all three systems exist: rotation in galaxies, pressure in
planets and the Pauli pressure in stars. Against the strong interaction, the Pauli principle acts
in nuclei and neutron stars; in neutron stars maybe also rotation and pressure complement the
Pauli pressure. But for the electromagnetic interaction there are no composites other than our
everyday matter, which is organized by the Pauli principle alone.
Challenge 363, page 188: Angular momentum is the change with respect to angle, whereas ro-
tational energy is again the change with respect to time, as all energy is. Copyright © Christoph Schiller November 1997–January 2007
Challenge 364, page 189: Not in this way. A small change can have a large effect, as every switch
shows. But a small change in the brain must be communicated outside, and that will happen
roughly with a 1~r 2 dependence. That makes the effects so small, that even with the most sensitive
switches – which for thoughts do not exist anyway – no effects can be realized.
Challenge 368, page 190: The relation is
c 1 sin α 1
= . (886)
c2 α2
The particular speed ratio between air (or vacuum, which is almost the same) and a material gives
the index of refraction n:
n= =
c 1 sin α 1
(887)
c0 α0
1270 f challenge hints and solu tions
Challenge 369, page 190: Gases are mainly made of vacuum. Their index of refraction is near
to one.
Challenge 370, page 190: Diamonds also sparkle because they work as prisms; different colours
have different indices of refraction. Thus their sparkle is also due to their dispersion; therefore it
is a mix of all colours of the rainbow.
Challenge 371, page 190: The principle for the growth of trees is simply the minimum of po-
tential energy, since the kinetic energy is negligible. The growth of vessels inside animal bodies
is minimized for transport energy; that is again a minimum principle. The refraction of light is
the path of shortest time; thus it minimizes change as well, if we imagine light as moving entities
moving without any potential energy involved.
Challenge 372, page 190: Special relativity requires that an invariant measure of the action ex-
ist. It is presented later in the walk.
Challenge 373, page 190: The universe is not a physical system. This issue will be discussed in
Page 1067 detail later on.
Compare: The inertia against motion of an extended body can be seen as sum of a mass, a
mass dipole, a mass quadrupole, a mass octupole, etc. The mass quadrupole is described by the
moment of inertia.
Challenge 391, page 201: The conserved charge for rotation invariance is angular momentum.
Challenge 394, page 205: An oscillation has a period in time, i.e. a discrete time translation
symmetry. A wave has both discrete time and discrete space translation symmetry.
Challenge 395, page 205: Motion reversal is a symmetry for any closed system; despite the ob-
servations of daily life, the statements of thermodynamics and the opinion of several famous
physicists (who form a minority though) all ideally closed systems are reversible.
Challenge 404, page 210: The potential energy is due to the ‘bending’ of the medium; a simple
displacement produces no bending and thus contains no energy. Only the gradient captures the
bending idea.
f challenge hints and solu tions 1271