Anda di halaman 1dari 8

CHAPTER 1

Introduction and Preliminaries


Introduction
The calculus of variations is almost as old as calculus itself. In Principia Mathematica
Isaac Newton posed the problem of finding the shape of a body moving along the x-axis
which minimized air resistance. This was the first problem in the calculus of variations.
He solved his formulation of this problem in 1694. However, his formulation was not to
the liking of other scientists, and the basic problem remains of interest to this day. Other
problems soon followed, posed and solved by the Bernoulli brothers, Euler and others.
Here are some examples:
 Geodesics. Find the shortest path connecting two points. If the two points are in
the plane you know the answer. If the two points and the path connecting them
are on a sphere you might well know the answer. In both cases can you prove
that your answer is correct? What if the points and the curve lie on a different
surface, such as the torus?
 The brachistochrone. A particle slides under the force of gravity along a smooth
curve joining the points a and b in the plane. What is the curve that minimizes
the time of transit?
 The path of a ray of light. A light ray proceeds from point a to point b. Fermat
proposed that the path it takes minimizes the transit time. If the velocity of light
is constant in the region containing a and b, the path is the straight line. What is
the path if the velocity of light is not constant? For example, what if the point a
is in open air, but the point b is under water?
 The shape of a heavy chain. Suppose a heavy chain has its ends fixed in space
and is attracted downward by the force of gravity. What is the shape of the chain?
 The shape of an elastic beam. Consider an elastic beam with its ends clamped
at two points. The beam is bent downward by the force of gravity. What is its
shape?
 Minimal surfaces. Given a closed curve in space what is the shape of the surface
with minimal area which has this curve as its boundary? You can think of the
curve as made of wire. The surface of minimal area is a soap film achieved by
dipping the wire in a soap solution. This is an extremely hard problem. Consider
the special case when the boundary curve consists of two circles centered on the
x-axis lying in planes perpendicular to that axis. Then the surface is a surface of
revolution about the x-axis. What is the curve that generates that surface?
 The isoperimetric problem. Find the region in the plane with fixed perimeter
L having the largest area. Many of you will realize that it is a circle, but can you
prove it?
In this book we will solve most of the problems listed above, as well as many others.
One of our goals is to show how important the calculus of variations is to research that is
1
2 1. INTRODUCTION AND PRELIMINARIES

being pursued today. We will occasionally use ad hoc methods, but we will spend most of
our time developing the indirect methods. Hopefully at the end of the course we will find
time to look at some direct methods as well.
Although we will spend most of the time solving problems with one spatial dimension,
we will also want to discuss the situation in several variables, such as the minimal surface
problem. This will allow us to consider Hamilton’s approach to mechanics.
Let’s start by examining some of the above problems in more detail.

1.1. Examples of variational problems


1.1.1. Geodesics
Let’s look at a couple of instances of the problem of finding the shortest path be tween
two points.
1.1.1.1. The shortest graph. The easiest problem of finding geodesics is finding shortest
graph connecting two points in the plane. Suppose that u is a continuously differentiable
function defined on the interval Œa; b. Let c D u.a/ and d D u.b/. Then the graph of u is
a curve u in R2 connecting the points .a; c/ and .b; d /. Among all such functions which
has the shortest graph?
Consider a short segment of length s of the graph centered at the point .x; u.x/.
Suppose it lies above a segment of length x in the x- axis. The corresponding segment in
y has length y  u0 .x/x, where  means approximately equal to. By the Pythagorean
theorem
s 2  x 2 C y 2  .1 C u0 .x/2 / x 2 :
Thus, if we define the element of arclength by ds 2 D dx 2 C dy 2 ; then
p p
(1.1.1) ds D dx 2 C dy 2 D 1 C u0 .x/2 dx;
and the length of the graph of u is given by
Z Z b p
(1.1.2) L.u/ D ds D 1 C u0 .x/2 dx:
u a

We want to find u which minimizes L.u/ subject to the endpoint constraints


(1.1.3) u.a/ D c and u.b/ D d:
1.1.1.2. Geodesics in R2 . Not all curves in the plane are graphs. How would we pose the
problem if we wanted to consider all curves instead of just graphs? Let a D .a1 ; a2 / and
b D .b 1 ; b 2 / be the two points. If is a piecewise continuously differentiable curve joining
a to b, according to Example A.8 in the Appendix, it has a parametrization x W Œ0; 1 ! R2
satisfying x.0/ D a and x.1/ D b. The length of is given by
Z 1
l. / D jx0 .t/j dt:
0

We will indicate the dependence on the parametrization x by setting


Z 1
(1.1.4) L.x/ D jx0 .t/j dt:
0

Then
Z 1
l. / D L.x/ D jx0 .t/j dt:
0
1.1. EXAMPLES OF VARIATIONAL PROBLEMS 3

The geodesic joining the points a and b is the curve joining them that has the shortest
length. The geometric problem of finding the geodesic joining a and b can now be stated
analytically as follows:
Problem 1. Find a continuously differentiable curve parametrization x defined on the
interval Œ0; 1 satisfying the end-point conditions
(1.1.5) x.0/ D a and x.1/ D b;
such that
(1.1.6) L.x/  L.y/
for all continuously differentiable curve parametrizations y W Œ0; 1 ! R2 which satisfy the
end-point conditions in (1.1.5).
In other words, we want to find a curve parametrization which minimizes L.x/ as
defined in equation (1.1.4), subject to the end-point conditions in (1.1.5). Notice L depends
on the curve parametrization x. It is a function of a function. We will call such an object a
functional. When a functional is defined by an integral, as is L in (1.1.4), we will call it a
variational integral. The functional L defined in (1.1.4) is called the length functional.
One difference between the problem in Section 1.1.1.1 and this one is that the func-
tional in this section depends on a curve parametrization, which is a vector valued function.
In Section 1.1.1.1 the functional depended on a real valued function.
1.1.1.3. Geodesics in RN . There is nothing in Section 1.1.1.2 that requires the dimension
to be 2. If we consider the problem of finding geodesics in RN , we end up with the exactly
the same formulation.
1.1.1.4. Geodesics in the sphere. Of course we all know that a solution to the geodesics
problem is a straight line, parametrized by x.t/ D a C t.b a/. We are interested in more
serious problems. An example is to look for geodesics in a sphere. The set
SR2 D x D .x; y; z/ 2 R3 x 2 C y 2 C z 2 D R2
˚

is called the sphere of radius R centered at the origin in R3 . Suppose that our end points a
and b are in SR2 , and we want the curve that connects them to lie completely in the sphere
as well. Stated analytically the problem becomes:
Problem 2. Find a continuously differentiable curve parametrization x defined on the
interval Œ0; 1 satisfying the end-point conditions
(1.1.7) x.0/ D a and x.1/ D b;
and the additional constraint
p
(1.1.8) jx.t/j D x.t/2 C y.t/2 C z.t/2 D R2 for 0  t  1:
such that
(1.1.9) L.x/  L.y/
for all continuously differentiable curve parametrizations y W Œ0; 1 ! RN which satisfy
the end-point conditions in (1.1.7) as well as the additional constraint in (1.1.8).
This is the same minimization problem we saw in Problem 1, except that our parametriza-
tions must satisfy the additional constraint in (1.1.8), which requires the curve to lie on the
sphere.
Perhaps you know that a geodesic on a sphere is a great circle, defined to be the
intersection of a plane though the center of the sphere with the sphere itself. However,
proving this is no easy matter.
4 1. INTRODUCTION AND PRELIMINARIES

1.1.2. The brachistochrone


This is one of the oldest problems in the calculus of variations. It was posed by Galileo
in 1638, but could not be solved until later, after the invention of calculus and the calculus
of variations.
Suppose we have a continuously differentiable function u defined on the interval Œa; b.
Let c D u.a/ and d D u.b/, where we will assume that d  c, so that the point .b; d /
is lower in the plane than the point .a; c/. The graph of u is a curve u in R2 connecting
the points .a; c/ and .b; d /. Suppose there is a wire connecting the two points having the
shape of u . Suppose, in addition, we have a mass m which is constrained to follow the
wire, and moves along it without friction under the force of gravity. If we release it at
the point .a; c/, the mass will follow the wire and end up at .b; d /. How long will it take
to complete the passage from .a; c/ to .b; d /? What function u will result in the shortest
passage?
The mass has kinetic energy T D 21 mv 2 , where v is its speed along the wire. If we
specify that the potential energy U is 0 at the starting point where y D c, then U D
mg.y c/. The total energy of the particle is

1 2
E DT CU D mv C mg.y c/:
2
Since energy is conserved under the force of gravity, we know that E is a constant. We
will set the mass free at .a; c/ with speed v0 , so T D 21 mv02 there and we know that U D 0
there. Hence
1 2 1
mv D E D mv 2 C mg.y c/:
2 0 2
Solving this equation for the velocity v we get
q
(1.1.10) v D v02 C 2g.c y/:

Suppose that we have a small segment s of the graph centered at the point .x; u.x/.
If t is the time needed to traverse s, then s  vt. Hence dt D v1 ds, and using
(1.1.10), we see that the total time needed to traverse u is

(1.1.11)
s
b b
p
ds 1 C u0 .x/2 1 1 C u0 .x/2
Z Z Z
T .u/ D D q dx D p ; dx;
u v a v02 C 2g.c u.x// 2g a C u.x/

where C D c C v02 =2g:


p
The constant 1= 2g in T .u/ does not affect things, so our problem is to find the
function u which minimizes
Z bs
1 C u0 .x/2
(1.1.12) F .u/ D dx; with C D c C v02 =2g;
a C u.x/

subject to the end conditions

u.a/ D c and u.b/ D d:


1.1. EXAMPLES OF VARIATIONAL PROBLEMS 5

1.1.3. Minimal surface of revolution


Suppose that once more we have a continuously differentiable function u defined on
the interval Œa; b, and that the graph of u is a curve u in R2 connecting the points .a; c/
and .b; d /. if we rotate u about the x-axis, we get a surface of revolution, which we will
denote by Su . We learn in calculus that the area of Su is given by
Z Z b p
(1.1.13) A.Su / D A.u/ D 2y ds D 2u.x/ 1 C u0 .x/2 dx:
u a

We want to find the function u for which Su has minimal area. The 2 in (1.1.13) is
not important, so we want to minimize
Z b p
(1.1.14) F .u/ D u.x/ 1 C u0 .x/2 dx;
a
subject to the end conditions in (1.1.3).
1.1.4. The shape of a heavy chain
Suppose that once more we have a continuously differentiable function u defined on
the interval Œa; b, and that the graph of u is a curve u in R2 connecting the points .a; c/
and .b; d /. The graph is a possible shape of a heavy chain of length l supported at the two
points if
Z bp
(1.1.15) L.u/ D 1 C u0 .x/2 dx D l:
a
If we assume that the chain is stable under the force of gravity, which function u satisfying
(1.1.15) gives us the actual shape of the chain?
Physics tells us unless the potential energy of the chain is at a minimum, the chain will
not be stable, so we want to find the chain of length l which minimizes potential energy.
Let  denote the density of the chain, which we will assume is constant. A small segment
s of the chain at the point .x; u.x// will have mass approximately equal to gs, so the
potential energy of that segment will be approximately gys. Hence the total potential
energy of the chain is
Z Z b p
(1.1.16) U.u/ D gy ds D gu.x/ 1 C u0 .x/2 dx:
u a
Since g and  are constant, we again want to find the function u which minimizes
F .u/ as defined in (1.1.14), but now in addition to the endpoint constraints in (1.1.3), u
must satisfy the additional length constraint in (1.1.15).
1.1.5. Summary
In previous sections we have seen several examples of problems in the calculus of
variations. Let’s try to generalize what it is that we want to minimize or maximize. The
examples are in equations (1.1.2), (1.1.11), (1.1.12), (1.1.14), and (1.1.16). These quanti-
ties have the form
Z b
(1.1.17) F .u/ D F .x; u.x/; u0 .x// dx;
a
where u is a function defined on the interval Œa; b, and F is a function of three variables,
which we will denote as x, u, and p. Thus F is a function that depends on the function u.
We will call such a thing a functional. An integral of the form we seen in (1.1.17) is called
a variational integral.
6 1. INTRODUCTION AND PRELIMINARIES

The problem is to find functions u which minimize or maximize the functional in (1.1.17)
subject to the boundary conditions
(1.1.18) u.a/ D c and u.b/ D d:

1.2. Methods in the Calculus of Variations


All of these problems involve finding extrema where the object being varied is a curve
or a surface and not a point in Euclidean space, as is the case in calculus. It is this difference
that defines the calculus of variations. It is clear that the ordinary tools of calculus do not
directly apply. Over time three methods of solution have been developed:
 Ad hoc methods. Methods depending on the nature of the specific problems.
 Direct methods. Start with a sequence of functions which come closer and closer
to the actual minimum. We have to find out if the sequence converges or not. If
it does converge, does it converges to a minimum?
 Indirect methods. These methods extend the methods of the ordinary calculus.
As is the case in calculus, we will find necessary conditions that are not sufficient,
and sufficient conditions that are not necessary.
Let’s look briefly at each of these.

1.2.1. Ad hoc methods


These methods involve using everything we know about the subject at hand, and avoid-
ing the special mathematics we will develop in this book. An example is the problem of
finding the shortest curve joining two points in R2 or in RN . Of course we know the answer
is a straight line, but we want to prove it.
We will give the proof in R2 . The same proof works in RN , but it is easier to under-
stand in R2 . Let A D .a; c/ and B D .b; d / be two points in R2 . Let
p
 D jB Aj D .b a/2 C .d c/2
denote the distance between them.
Let be any curve that joins A to B. We know that it has a parametrization defined
on the unit interval. Thus there is function u W Œ0; 1 ! R2 which parametrizes . u is a
vector valued function and we will set u.t/ D .x.t/; y.t//, where x and y are continuously
differentiable functions on Œ0; 1. The length of is
Z 1 Z 1p
(1.2.1) l. / D ju0 .t/j dt D x 0 .t/2 C y 0 .t/2 dt:
0 0

Now we use what we know about geometry.


We may assume that A D .0; 0/: We achieve this by translating the whole prob-
lem by the vector B A. Since a translation is a rigid motion, it does not change
the distance between points, and therefore does not change the formula for the
length of in (1.2.1).
We may assume that B D .l; 0/: The line segment AB makes an angle of  with
the positive x-axis. If we rotate around the origin by the angle  we send B
to a point on the x-axis. Again a rotation is a rigid motion, so it does not affect
length. Therefore the image of B must be .; 0/. Again formula (1.2.1) is not
affected by rotations.
1.2. METHODS IN THE CALCULUS OF VARIATIONS 7

The curve now connects .0; 0/ to .l; 0/ so the component x.t/ of the parametrization
u satisfies x.0/ D 0, and x.1/ D l. Thus we have the equality/inequality chain
Z 1p
L. / D x 0 .t/2 C y 0 .t/2 dt
0
Z 1 p
(1.2.2)  x 0 .t/2 dt
0
Z 1
(1.2.3)  x 0 .t/ dt
0
D x.1/ x.0/
D :
Thus L. /   D jB Aj for every curve joining A to B. The length of the straight
line joining A to B is equal to  D jB Aj, so we can conclude that the straight line is a
geodesic.
However, this does not prove that every geodesic is a straight line. Suppose, therefore,
that is a geodesic. Then L. / D  D jB Aj, so we must have equality at every step
of the equality/inequality chain. To have equality in (1.2.2) we must have y 0 .t/ D 0 for all
t. Thus y.t/ is a constant, and since y.0/ D 0 we must have y.t/ D 0 for all t. To have
equality in (1.2.3), we must have x 0 .t/  0 for all t. Thus the parametrization for has
the form u.t/ D .x.t/; 0/, where x 0 .t/  0. This is a parametrization for the line segment
AB. Thus any geodesic is a straight line.
The only difference in the proof in R3 or in general in RN is that there are more com-
ponents in A, B, and the parametrization u.t/. However, we still have the two important
geometric simplifications, and we still have the same equality/inequality chain, so the proof
goes through in RN .
1.2.2. Direct methods
Consider the functional
Z b
(1.2.4) F .u/ D F .x; u.x/; u0 .x// dx:
a
The calculus of variations studies how to find a function u which is a minimum of func-
tionals like this. The minimum of F .u/ over all available functions u may be 1, but for
this section let’s suppose that
m D inf F .u/ > 1:
u
Then for each integer n there is a function un such that
1
m  F .un /  m C :
n
Thus we have a sequence fun g of functions such that F .un / ! m as n ! 1. This is the
beginning of the direct methods in the calculus of variations.
There are many questions that arise:
 Does the sequence fun g converge? If not, perhaps it has a subsequence that
converges.
 If the sequence fun g (or a subsequence) converges to u, what can we say about
u? Assuming that each un is continuous or differentiable, can we say the same
thing about u? Is F .u/ defined?
 Can we conclude that F .u/ D m?
8 1. INTRODUCTION AND PRELIMINARIES

The answer to each of these questions is “not always”. We have to put conditions on
the Lagrangian F to get a “yes”. Finding the answers involves some mathematics at a very
high level. This approach has been very fruitful and continues to lead to important results.
However, since we are not assuming much in the way of mathematical background, we
will not say any more about the direct methods of the calculus of variations.
1.2.3. Indirect methods
Indirect methods do not require too much advanced mathematics. We start by assum-
ing that a function u is a minimizer for a functional like (1.2.4). We then use the methods
of calculus to derive properties of such a minimizer. These amount to necessary condi-
tions for a minimizer to exist. Often these conditions are enough to isolate one or a few
candidates for a minimizer. We are then left to show that a candidate is actually a mini-
mizer. This can frequently be done using ad hoc methods, but we will also derive general
conditions that will ensure that a function is a minimizer.
This is the subject matter of this book, and we will begin the study in the next chapter.

Anda mungkin juga menyukai