CALCULUS_
ALLEN DEVINATZ
Northwestern University
_./
Chicago
Toronto
San Francisco
London
Atlanta
Dallas
Copyright
1968
2689453
Printed in the United States of America
1 2 3 4 5 6 7 8 9
PREFACE
The contents of this book represent a somewhat expanded version of
a one-year course that I have given from time to time since 1961.
Those taking the course have been mainly undergraduate and first
students from engineering and the physical sciences have taken the
course and have told me they enjoyed it. I recommend that students
who select a course such as this should generally have a little more
of logic, but rather to make him aware of the formal processes involved
first reading, but if the student will reread it several times during the
course, it probably will begin to appear more reasonable. The notation
they can begin the discussion o(real numbers with Section 1.8. In that
section, I have given what amounts to a set of axioms for the real num
ber system, the more usual starting point for a beginning course in
analysis.
point should be waved away. I have triecrto convey the idea to the novice
that he should be sure that he can really prove these seemingly trivial
points.
calculus, I have chosen the more cumbersome and less general method
A. D.
CONTENTS
Preface
CHAPTER 1
I. I
1.2
1.3
1.4
1.5
1.6
1. 7
1.8
1. 9
Sets
Relations and Functions
The Natural Numbers
The Integers and the Rationals
Countability
The Reals
A Review of the Real Number System and Sequences
CHAPTER 2
2.1
2.2
2.3
2.4
Convergence Tests
Decimal Expansions
Sequences and Series of Functions
Infinite Products
69
76
84
90
INFINITE SERIES
CHAPTER 4
4.1
4.2
4.3
4.4
LIMITS
CHAPTER 3
3.1
3.2
3.3
3.4
3.5
1
16
22
26
31
38
44
55
62
99
110
118
126
131
DIFFERENTIATION
138
145
149
158
v
4.5
4.6
Power Series
The Weierstrass Approximation Theorem
165
178
CHAPTER 5 I INTEGRATION
5.1
5.2
5.3
5.4
5.5
Riemann-Darboux Integrals
Properties and Existence of Riemann-Darboux Integrals
Improper Integrals
Riemann-Stiel tj es Integrals
183
190
201
210
217
228
235
241
248
256
274
293
Motivation
Directional Derivatives and Differentials
Differentiation Rules
Higher-Order Differentials and Taylor's Theorem
The Inverse and Implicit Function Theorems
Maxima and Minima
CHAPTER
305
309
319
324
332
344
8 I HIGHER DIMENSIONAL
INTEGRATION
8.1
8.2
8.3
Riemann-Darboux Integrals
Jordan Content
Existence and Properties of Riemann-Darboux Integrals
353
359
366
8.4
8.5
Iterated Integration
The Transformation Theorem for Integrals
CHAPTER 9
374
380
THE INTEGRATION OF
DIFFERENTIAL FORMS
I. LINE INTEGRALS
9.1
9.2
9.3
9.4
396
403
407
416
9.5
9.6
9.7
9.8
9.9
9.10
Symbols
Index
429
437
452
455
461
467
479
482
However,
Negation
I.I
-A
f
b.
Implication A =>B
(To be read: A implies B, or if A then B, or B if A, or A only if
B, or A is a sufficient condition for B, or B 1s a necessary con
dition for A.)
A
c.
A =>B
A&B
Conjunction A&B
(To be read: A and B.)
A
d.
Disjunction AVB
(To be read: A or B.)
A
AVB
t
t
The preceding truth tables give a prescription for the use of the
symbols
'-
capital Latin letter and ending with a period, and having no period in
between is called a sentence, and so forth.
Statements in mathematics are formed in the same way, that is, by
placing symbols in various positions. However, it is usually the case that
we form these statements from a different collection of symbols than
those that we use for the English language. The symbols '
' . '&',
,
I.I
A, B,
and
[(AB)
(BC)] [A CJ
&
A, B, and C.
'AB', E for
'AC'. The table
To get the table to fit on one page, let us set 'D' for
'BC',
'F' for
'(AB)
&
(BC)',
'
'
looks as follows.
FG
t
t
Since the last column always has the 't' value we have shown what we set
out to show.
Once we have given the basic symbols of our mathematical language
we can
define
point we can
define
When we write
AB,
this is to be read: A is equivalent with
A if and only if B.
sentation
the basic
AB has
both haye the 't' value or whenever A and B both have the 'f'
value. Otherwise AB has the 'f' value attached to it.
Hn which
are meaningf ul in the sense that they have 't' op-"f' values attached to
them. In addition, we suppose that there are statements A1,
, Am
& Hn ==* C
always has a 't' value. Then,if all the H k have the 't' value we shall give
C a 't' value. This is a new rule for giving statements 't' or 'f' values and
is usually called the rule of inference. Our hope here,as with the truth
table rules, is that starting from our axioms we cannot also give C an
'f' value,that is, -C cannot be given a 't' value by the scheme that has
been outlined.
As an example of this new rule,suppose A1 =*A2 and A2 =*A3 are
axioms and therefore have 't' values, even though we don't know
whether A1, A2, and A3 can be given 't' or 'f' values. However, under
the supposition that they are meaningful we have established by a truth
table that
always has a 't' value. Hence we would give A 1 ==* A3 a 't' value. Another
example is
If Ai and A1 ==*A2 have 't' values,we would assign the 't' value to A2
In case a statement can be given a 't' value by means of the rules we
have prescribed,then the statement is said to be derived or proved from
the axioms. Suppose a statement C has a 't' value and it is obtained by
our rules through an implication Hi & H2 &
& Hn -==*C .
Each
1.1
Since
(i)(Q(x))
This is to be read: For every
or
Vx Q(x).
(3x)(Q(x)) .
This is to be read: There exists an
'(x)' and 'Vx' are called universal quantifiers and the symbol '(3x)' is
existential quantifier.
It is also possible that we may have a statement Q(x, y) which depends
called an
(x)(y)(Q(x,y)).
This is to be read:
For every
(x)(3 y)(Q(x,y)).
y the statement
Q(x, y)
This statement is to be read: For every x there exists a y such that Q(x, y)
x is chosen,
x. As an example, suppose
x and y may be replaced by real numbers. We may then write the true
statement:
(x)(3y)(x
<
y).
x) that is larger. On the other hand, we may write down the false
statement:
(3 y)(x)(x
<
y).
In the English language this says: There exists a (fixed) real number
that is larger than every real number. Many other situations will arise
later on in the text which will demonstrate again and again the dis
tinction between these two types of statements.
We shall now outline some of the rules for operating with the predi
cate calculus. These rules will show the connections with the rules we
reader may very well recognize that he has been using these rules in
an informal way inhis previous studies of mathematics.
given set of statements that have 't' or 'f' values attached to them, and
by means of a set of rules we obtain other statements that have 't' or 'f'
values attached to them. The first rules we shall give are those for
negating statements. The rule for negation is the following:
the right side is to be given the 't' or 'f' value, respectively, and vice
- (3x)(Q(x))
{::::>
(x)(-Q(x)).
example,
is equivalent to
1.1
-(x1)(Q1(x1))
is equivalent to
,xs))
(3xi)(-Q1(x1)),
and con
'x',
are
(x)(P(x) =>R(x))
has a 't' value attached to it. The rule here is to remove the universal
quantifiers to get the statements
'x'
by any member of
a given set, then the statements will be true. By the methods of the
propositional calculus we conclude that
&
P(x) =>R(x)
is a true statement. The rule now is to add the universal quantifier and
get the statement
(x)(P(x) =>R(x)).
'
1.1
11
ple of the type of difficulty that could arise if we did not follow this
procedure, let us consider the statements:
(x)(x< 1x+1<2),
(3x)(x<l).
(x)(x+y<x+y).
I.I
(3y)(3z)(x)(x+y <x+z).
We shall consider one final example. The variables are again assumed
to represent real numbers. Let us start with the true statements:
(x)(3 y)(x<y),
(x)(y)(x <y =}x + y <2y).
If we proceed according to the previous rules we get the statements:
x <a,
x <a =:::} x+ a<2a.
Using the propositional calculus we get the statement:
x+a<2a.
Replacing the quantifiers by our previous rules leads to the incorrect
statement:
(3y)(x)(x+y < 2y).
The reader has probably already recognized what has gone wrong
here. In removing the quantifier from the first statement the ambiguous
element a does not remain the same for all x but must change as x
changes. In mathematics one usually indicates this by writing 'ax' or
'a(x)'. If we do this we get the statements:
x <ax,
X <ax
==> X+ ax<2ax
(x)(3y)(x+y<2y).
In working with quantified statements the reader should be aware
of the fact that any letters or symbols may be used for the variables.
This is in accordance with accepted mathematical practice. For example,
the following three sets of symbols all have the same meaning:
J:!(x)
dx,
dt.
B; that is, the last statement has a 't' value. If A had the 't'
tached to it, which would contradict our hypothesis that B can have
only one value attached to it. Hence A must have the 'f' value.
Our discussion of the elements of logic has of necessity been rather
circumscribed, so that at this point there is not a sufficiently adequate
basis for a completely formal development of a branch of mathematics.
On the other hand, a completely formal development usually turns out
to be very cumbersome and, unless one already understands the sub
ject that is being formalized, is probably undesirable. After all, the
process of understanding is an intuitive psychological phenomenon.
In understanding the proof of a theorem or in discovering a new
theorem, the mind seems to operate in a haphazard manner rather than
in a formal step by step process that a logical proof requires. What
then is the role of logic in mathematics? There are several important
roles it can play: It can provide a mechanical check on our informal
reasoning processes; it can provide a system of automatic symbolic
operations to assist in complicated situations; it can provide a means
of increasing precision and generality; and, last but not least, it can
provide an accurate method of transmitting mathematical information.
When we begin the discussion of the natural numbers we shall
proceed in a somewhat more formal manner than we do later on in
the book. The reason for this is that we feel that most readers will be
very familiar with the basic facts of elementary arithmetic and hence
the formalism will not interfere with their understanding. As the mate
rial becomes less familiar, our approach shall become more informal
so that understanding may not be jeopardized. However, we shall
never completely abandon a certain amount of formalism, since we feel
References
Kershner, R. B., and L. R. Wilcox,
The Ronald
Mendelson, Elliott,
D Exercises
I.
(b)
7.
xgives xagain.
Suppose
1.
(x)(P (x)).
Use the rules of the propositional and predicate calculus to show that
the following statement is true:
(3x)(P(x)).
8. In the following list show that if the left statement is true, so is
the right one. The results of Exercise 1 may be helpful.
(c)
1.2
SETS
(a)
(b)
We shall adopt the naive, intuitive point of view that a set is a collection
of objects without questioning what these words mean. The term
Q(x)is a sentence containing a variable 'x'. Then we can form the class
'x' make the
sentence Q true. We denote this by means of the term '{x: Q(x)}',
and this is to be read: The collection of all xsuch that Q(x) is true. As
in the situation for quantifiers, x is understood to vary over a specified
set. The set consisting of the one element a will be designated by '{a}',
the set consisting of the two elements a and b will be designated by
'{a, b}', and so on.
Given two elements a EA and b EB we can form the ordered pair
(a, b). The reason we use the word 'ordered' is that in general {a, b)
and (b, a)are not considered the s.ame object. Indeed, the ordered pairs
(a, b) and {a1, b1) shall be identified if and only if a= a1 and b= b1
Recall that the symbol '=' has the meaning that the symbols that stand
of all objects that when their names are substituted for
to the left and right of it are simply different names for the same object
and we adopted the rule that different names for the same object may
be used interchangeably in any expression.
From two setsA and B we can form a new set, the Cartesian product
A X B, which is defined by the equality
AX B
(l.2.1)
1.2
SETS I 17
We can also form the intersection of two sets defined by the equality
(1.2.2)
n {A:A EV'6}
(l.2.2')
that is, the collection of all elements each of which belongs to every set
in <76. The union of two sets is defined by the equality
A U B={x: x EA V x EB}.
(l.2.3)
(1.2.3')
that is, the collection of all elements that belong to at least one set in A.
The complement of a set A is defined by the equality
(1.2.4)
We usually use the term 'x
ft A'
(1.2.5)
This latter set can be described as the set of all elements in A that are
not in B; that is,
(1.2.5')
FIGURE 1.2.1
FIGURE 1.2.2
The rectangle in which the sets are enclosed represents the entire
universe of elements of the discourse.
We shall define the inclusion symbol C' by means of the equivalence
(l.2.6)
The term
B.
If
of
to
be identified if and only if they are contained in each other, that is,
(1.2. 7)
A=B (ACB&BCA).
'(A)'
and
'(B)',
'{x: Q(x)}',
'{x: Q(x)}'
to
be a name for that set for which the following statement is true:
A n B
'{x: Q(x)}'.
(l.2.8)
An BCA.
First,
removing
the
universal
(l.2.9)
statement:
x EA n B [x EA &x E B],
(l.2.9')
which for the purpose of applying the rules of the propositional cal
culus is assumed to have a 't' value. The truth-table method of the prop
ositional calculus tells us that the following statement has a 't' value:
{x EA n B [x EA&x E B]}
=:} {x EA n B =:} [x EA &x EB]}.
(1.2.lO)
1.2
SETS j 19
From (l.2.9), (1.2.10), and the rule of inference we find that the follow
ing statement is true:
x EA n B ""* [x EA
&
x E B].
(l.2.11)
[x EA
&
x E BJ ===* x EA.
'S(x)',
'R(x)'
(l.2.12)
[R(x)
&
(l.2.13)
Using the rule of inference the following statement has a 't' value:
(l.2.14)
(l.2.15)
Using the statement (l.2.6) and the rules of the propositional calculus,
we arrive at the true statement
(l.2.16)
Using the fact that statements (1.2.15) and (1.2.16) are true, by the rule
of inference we finally arrive at the true statement:
An BC A.
We have presented above a formal proof of the last statement, being
careful to point out at each stage exactly what was being used. Of course,
we could have developed a scheme so that the proof would have been
more mechanical and the amount of space needed to write it down
would have been much less. Nevertheless, we think the reader now
sees how cumbersome a formal proof can be, even of the simplest
statements. For this pragmatic reason most of the discourse of mathe
matics is carried on in an informal way.
In an informal proof we do not write down all the steps but only
those considered to be essential. This is analogous to the situation
when in making an arithmetic or algebraic computation we usually
do not take cognizance of the fact that we are using, for example, the
commutative or associative laws, but suppose these are standard facts
which the reader recognizes. For example, the chain of argument
leading from ( l .2.9) to (l.2.11) or the chain of argument leading from
(1.2.11) to (l.2.14) is usually considered a standard argument and
would not be mentioned in an informal proof. Of course, just how much
is written down is at the discretion of the writer. Usually enough should
be written down so that it would be clear how to make the formal proof
if any question should arise about the validity of the informal proof.
As an example of an informal proof let us show the following:
(B
B)
(A
(B U C) [x EA
&
x EB U C].
C)
(A
C).
We have
x EA
Also,
x EB UC [x EB V x EC].
Now, it is easily checked by a truth table that
[ (x EA) & (x EB V x E C)]
[ (x EA & x EB) V (x EA
&
x EC)].
x E (A
B)
(A
C).
x EA
(B
C) x E (A
B)
(A
C)'
FIGURE 1.2.3
The reader may now object that we have not defined the symbol 'E'
in terms of the symbols of the predicate calculus. This is true, and in a
formal development of mathematics it is necessary to give the rules or
axioms that prescribe the use of this symbol. The situation is analogous
to that of Euclidean geometry, where points and lines are taken as
undefined objects and a set of axioms are given that give the relation
ships between points and lines. In axiomatic set theory, sets and the
symbol
'E' are taken as undefined things and a set of axioms is given that
will allow us to develop the kind of a theory of sets which seems intui-
1.2 SETS I 21
tively reasonable to us. These axioms deal mainly with prescribing the
conditions under which new sets can be formed from given sets. For
example, in axiomatic set theory the facts that
AnB and A
B can
axioms.
To try to give a reasonable axiomatic approach to set theory would
be too difficult at this stage and would delay our study of the calculus
for a long time. Hence, as we mentioned at the beginning of the dis
cussion on sets, we shall suppose that everyone understands what a
set is and we shall allow operations on sets and the construction of sets
that seem intuitively reasonable. Such a procedure can, on occasion, lead
to serious philosophical difficulties, but we shall pretend that they
don't exist.
Finally, let us remark that it is convenient to consider the set that has
no elements. It is defined by the equality
0= {x: x x}.
The set 0 is called the null set or the empty set or the void set.
D Exercises
1.
2.
3.
(b)
4.
5.
AU B U cc.
An(BU(C U D)c).
6.
(c)
(d)
0.
7.
8.
If A,, is any collection of sets and B is any set, show the following:
(a)
(b)
9.
(b)
Bn u {A:AEA,, }= u {AnB:AEA,,}.
B u n {A:AEA,, }= n {A u B:A E A,,}.
11.
If A,, is any collection of sets and B is any set, show the following:
(a)
10.
AB;,,,_A\(A\B).
An B= A\(A\B).
-n c
If A,, is any collection of sets and B is any set, use the results of
1.3
The concept of the Cartesian product of two sets leads to the concept
of a relation. We shall first give a formal definition and then comment
on the meaning.
1.3.1
Definition.
1.3
1.3.2
Definition.
that
(x)(y)(z)([(x,y)
F & (x,z )
E F]
==>y=z)'.
For example, if A and B are the sets given above, then the set of all
(x, y)
E A X B with
function.
Some people prefer the words multivalued function in place of the word
'relation.'
If Fis a function and
(x, y)
F(x)
F(x).
x, and
we also
is said
1.3.3
Definition.
A function F is
1.3.4
Proposition.
to-one function.
Given two or more functions there may be ways of combining these
tWe are supposing that every man has a relative.
functions to get a new function. We shall give one way here, the
com
F0G(x) =F(G(x) ) .
In very formal terms we can write
f by the symbol
{J(x) : x EE(f)}.
If A is a set, we can define a new function g as that subset off consisting
of those ordered pairs (possibly void) whose first members belong to
A.
g=JIA.
If
E(f ) ,when
we write
g.
X and
(c)
on a set
(reflexive)
,B (R) .
In fact, from
Hence from
'(x,y) E='we
shall write
'x
y'.
1.3
4.
Let
following:
(a)
(b)
(c)
(d)
6.
Vx
7.
m n
8.
=
(u, v) there exists a real number t > 0 so that (x,y) =(tu, tv). Show
9.
(a)
(/3)
1.4
In this section we shall give a set of axioms for the natural numbers
and derive some of their more important properties. The proofs we
give will be informal, as explained in Section 1.2, and the set theory
we shall use will be intuitive. One may, quite legitimately, ask why
we bother to be so formal about the development of the real number
system when we are being so informal about logic and set theory. One
answer is that the first serious questions about the nature of mathematics
arose in connection with the real numbers, first among the ancient
Greeks and later again among the nineteenth-century mathematicians.
Hence an enormous amount of intellect and energy have been expended
in trying to clarify the nature of these objects. Many people seem to
feel that between certain limits these efforts have been successful agd
that a usable system can be obtained from a few psychologically satis
fying and clearly stated principles or axioms. Of course, there may
be sharp disagreement on just where to start and how far one can go
without getting involved in contradictions. We shall start with a set
of axioms that are not as minimal and/or perhaps not as intuitively
satisfactory as others. However, we feel they are reasonably satisfac
tory and have the advantage that the development of the real numbers
can proceed quite rapidly from them.
The name 'the natural numbers' is given to any set N together with two
functions + and
following axioms:
1.4
(a)
(a')
(b)
(commutative laws)
(b')
(x)(y)(z)(x+(y+z)=(x+y)+z).
(x)(y)(z)(x (y z)=(x y) z).
(associative laws)
(c)
(x)(y)(z)(x
(distributive law)
(d)
1E
(e)
N & (x)(x
x, y in N, one
(I) x=y.
(2) (3z)(x=y+z).
(3) (3z)(y=x+z).
For every
x).
and only one of the following is possible:
(trichotomy/aw)
(f)
For every
(induction)
N.
'N'
to designate the
'(N +
, , )'.]
previous axioms. The axioms (a) through (c) are of course the familiar
ones from arithmetic. The first part of the cortjunction of axiom (d)
says that
=/:-
the axiom states a property for this element. Axiom (e) has been stated
rather informally for the sake of clarity. It simply says that we can have
one and only one possibility; either
is greater
than y, or xis less than y. More formally, we could have given this axiom
in terms of two axioms:
( e' )
(e")
The last axiom (f) is often stated in the following way: If P (x) is a state
ment depending on
x,
then
(f')
This can be translated to our statement (f) by the following device. Set
M={x: P(x)};
then if (f') is true, (f) is true for
M,
Let us now give some examples that show how these axioms may be
used to obtain other true statements about
that
N.
1.4.1
Proposition.
There is no
and no y in
that is,
-(3x)(3y)(x+y=x).
N,
so that
x + y=x;
Proposition.
1.4.2
z x=y).
Proof. The fact that x=y=>x z=y z follows from our rules
x and y are different names for the same thing and hence we may
that
(3x) (3 y)(3z)(xz=y
z &
=F
y).
(l.4.1)
=;/:
=F
(l.4.2)
z Vy z=x z+w
z) ) ],
1.4.3
Definition
(x)(y)(x < y(3z) (y=x+z)),
(x)(y)(xyx <y Vx=y).
1.4.4
Proposition.
Exercise.
1.4.5
Proposition
(x) (1 x).
Proof..
Let us set
that is,
1.4
Clearly 1
and
1.4.6
Proposition
::s;;
(1.4.3)
Vx E N,
x=:::} n+ 1
::s;;
n +x.
n+ 1
::s;;
Vx E N,
n +x.
(1.4.4)
m=n+k<n+l.
Replace
(l.4.5)
Let
ER,
R n S= 0= N n S=S,
+1 E S.
must have
By the
0.
= N.
which contradicts
For, if k+ 1 fj.
S,
3k E R,
so that (k + 1)
fJ.
But for
:::!? k
x E S
we
R.
We now claim that (x) (x E S :::!? k + I ,,,;;; x). Indeed, in the contrary
case, (3x)(x E S & x < k+I), which we have seen contradicts Prop
osition l.4.6. This shows that k+ l is the smallest element of S. The
uniqueness is an immediate consequence of trichotomy.
1.4.8
Proof.
y).
#l
:::!?
( 3 z) (y=I+z)).
#I:::!? (3z)(x
# I :::!? x
< x
y=x+x
z)),
l),
it follows
we shall
D Exercises
1.
2.
N.
3.
4.
5.
If we define
l+ l,
I:::!? x=I).
prove that V m E
N,
one k E N so that
m < k < m+
6.
2.
1.5
7. If we define 3
2n=3.
8.
ing in the axioms for the positive integers and prove (x)( 1 x).
9.
ing in the axioms for the positive integers and prove the principle of
induction as a theorem.
10.
1.5
If m and n are natural numbers with m > n, then it is not true that there
(p,q) n +q =m+ p.
(N X N) X (N X N) by writ
(n,m)
(n,m)
(r ,s).
with domain Z x Z and range in Z for which the rules (a) through (d)
of Section 1.3 are valid and moreover (x) (y) ( 3 z) (x=y+z) . We shall
define these functions as follows:
Z(n,m) +Z(p,q)=Z(n+p,m+q),
Z(n,m)
Z(p,q)=Z(np+mq,mp+nq).
and
If we add corresponding sides of both equations we get
n + P + mi + Qi = m + Q + n1 + P1.
This is precisely the condition that (n + p, m +
q)
O=Z(n,n),
and it is given the name zero. It is almost immediate that for every x
x
{ Z,
+ 0 = x.
Also,
q);
then z =Z(m+ p, n +
q)
x + z =y.
As we noted in Definition
1.3.3,
to be one to one.
The function i also satisfies the equalities
Indeed, the first equality is nothing more than the fact that (n + m +
1, I)
1.5
is given the name 'the integers', although usually we shall speak only
of Z as being the integers. The set N in Z will be called the natural num
bers or the positive integers. As in the case of the natural numbers, the
symbol for multiplication in Z X Z will usually be dropped and we shall
write 'nm' instead of 'n
m' ..
1.5.1
Definition
(x)(-x=Z(m, n) x=Z(n,m)).
(x)(y )(x - y=.x + ( -y )).
-N = {x: -x EN}.
1.5.2
Defiiiftio n
(x)(y)(x < y y - x EN).
(x)(y)(x ,,;;; y x < y V x= y).
(x)(y )(y > x x < y).
(x)(y)(y x x,,;;; y ).
y, y < x.
1.5.3
Proposition
(b)
-NnN=0.
-N U N U {O}=Z.
(c)
0 ft -N
(a)
U N.
1.4.1.
(b) Suppose now that x EZ and x NU {O}. Let us write x =
Z(m,n). If n < m, then 3p EN, so that m =n+ p. Since Z (p + n,n)
=Z(p+ 1,1), it follows that x EN, which is a contradiction. Also
m n, since otherwise x= 0. Hence m < n and -x=Z(n, m) EN,
which implies x E -N. Thus Z C -NUNU {O}. Since the converse
inclusion is obvious, the proof is finished.
(c)
Clear.
1.5.4
(a)
(b)
Proposition
(x)(xO= O).
(x)( y)( (-x)y=x(- y) =-(xy)).
(b)
have
0 = Oy= (x - x)y =xy+ ( -x)y.
Add -(xy) to both ends of the equality to get
-(xy) = (-x)y.
If we use the commutative rule and interchange the symbols 'x' and ' y'
in this formula we get
-(xy)
(x) (- y).
EZand m
: x
EZ & y EZ & y
0}.
[We are, of course, thinking of the ordered pair (x,y) as x/y in the usual
notation.] We shall say
(m,n)
(p, q)
::}
mq
np.
1.5
THE
1.5.5
If x and y are in
Lemma.
more formal
and xy
0, then x = 0
0. In
terms:
y=OJ).
Since
-N
and y belong to -N U
Hence we have
(x=0
(m,n)
(r,s). This
means
mq=np
and
s to get
ps=qr.
mqs = nps,
Then use the fact that
ps=qr to get
( ms - nr)q=0.
From Lemma
ms=
1.5.5
we get
ms
- nr=0
q=0. But
#0
and thus
nr.
(m,n),
'Q (m,n) '. The collection of equivalence classes shall be de
#.
0,
by
with domain
Q(m, -n)),
{Q(m, n) : (m >
0)
& (n >
O) }.
(r)(s)(r<s<=> s - r E Q+),
(r)(s)(rs<=>r<s V r=s),
and we shall also define
have 'YJ(m)<'YJ(n).
1.5.6
Definition.
and range in Q+
r<=>r 0,
Ir! = -r<=>r
1.5.7
<
0.
Proposition
1.5.8
Since
x lxl
and
1.5
by
-x
and
by
y+x to
get
by
-y
and
by
y+ x
we get
I lxl - IYI I
,,;;;
D Exercises
I.
Show that the properties (a) through (d) given in Section 1.4 as
2.
hold for Z.
3.
5.
6.
x,yE-N=>xyEN.
xE N, yE- N ==> xyE-N.
Using these facts and the known fact that the product of elements in
N, show that
x<y &z > 0 ==> xz <yz.
x<y & z<0 ==> yz<xz.
is again an element of
(c)
(d)
7.
8.
9.
IO.
(a)
(1)
(2)
(b)
for which
(m, n)
(m, O),
n =F 0. If we had allowed
12.
Show that
13.
in
14.
Let us define
Show that
Q-
15.
16.
If
Q+, respectively.
m EZ, let
MC Zm:
[m EM & (x)(x EM=>x+ 1 EM)] =>M=Zm
17.
18.
1.6
COUNTABILITY
N and range Q.
1.6.1 Definition. Any set that is the range of a one-to-one function with
domain N will be called denumerable.
1.6.2
Lemma.
(m, n), Fig. 1.6.1. Now follow along the paths of the arrows and attach
successive positive integers to the successive points on the paths. We
shall write this out as follows, calling the point (I,I) the first arrow.
1st arrow:
I---+ (1,I)
1.6
nth arrow:
COUNTABILITY I 39
- (n,1).
,-.
(I.5)
(1, 4)
"""'.-""'---+-- -+-----+--
(1, 3) 1.---+--+----l
1_....
.,__.
(1, 2)
We have taken m,, to be the integer that begins the nth arrow. Since
the nth row contains n elements, it is clear that
mn+i
=mn+ n.
Hence
mn =m1 + (mi-m1)
(m3 - mi) +
+ (mn-m..-1)
=1+1+2+ +n-1
n(n-1).
=1+
2
; I)+k+l ""(k+l,n-k),
Okn-l.
n(n
nEN,
Okn-l,
' ( m, n)'
(1.6.1)
Let us first show that the doman of <I> is N. We shall show the follow
ing is t rue:
(m) (m E N => (3n)(3k)
(n EN & k E ( O, n-
1) &
;- l)+k+l=m)).
n(n
..----
) &
; I) +k+1 =m).
n(n
show that Vm, P (m)=> P(m +1). Let nand kbe integers of the required
kind, so that
; 1) +k+1.
m= n(n
(1.6.2)
Then
m+1 =
If 0 k
<
n-
1,
n(n
then 0 k+ 1 n-
1,
that
(l.6.2).
Suppose that n, k, n.
i and k1 are integers with
0 k n- 1, 0 k1 n1 - 1, and
If n
<
ni. then
n (n-
l)
+k
,.:::. n(n-
.....,
1)
+n
< n( n + I)
,.:::.
n1 ( n1 -
I)
k1
'
<
and conversely if n1
n we get the reverse inequality. Hence we can
have equality if and only if n=ni. and this implies that k= k1. This
shows that for a given mthere is one and only one ordered pair (k + 1,
1.6
COUNTABILITY J 41
n(n
<1>
n1(n - I)
+k1 +I
= (k1+1, n1-k1),
then k = ki. n
1.6.3
Definition.
<I>
with domain
(1, n)
= { k: k E N
1.6.4
Theorem.
Proof.
(1,
domain
(1, m)
(n
, 'TT
1) }
'TTn
with domain
(1, n)
j <
element of
C.
If
P(n)
is true, define
1Tn+i (n+ 1)
<
n+ 1
and
leave to the
reader the easy task of verifying that all the conditions of the statement
are satisfied. Hence, by the principle of induction,
P(n+ 1)
(n) (P(n) ) .
Define
1T(n)
1Tn(n) .
n E N, then
k n, then
1T(j)
of
1Tj(j) = 1Tk(j)
1Tk
<
this restriction of
1Tn
must be
1Tk(k) = 1T(k)'
and if l
k n
so that/=
1.6.5
Theorem.
Proof.
Suppose
Q+ is
r=
denumerab/,e.
Q(m,
n)
E Q+ and let
P(r)
w(r)
This defines
=
=
of
P(r) .
<l>(j ) = w(q)
thenj =
k;
this implies j E
P(r) ,
w(r)
<l>(k),
which means
q = r.
1.6
COUNTABILITY j 43
N is denumerable
function w-
0 7T
7T
that takes
onto(w). The
D Exercises
I.
2.
P( n)
3.
where
4.
(a)
{k : 7r(k)
(b)
(c)
7r ( m + 1) n.
<
n}
7T
1.6.4 is unique in that it is the only function from N onto C that has its
properties.
5.
(a)
(b)
Show that if B C
is infinite.
These two facts will completely justify the proof of Theorem 1.6.5.
6.
Show that the rationals Qare denumerable and hence the integers
Z are denumerable.
7.
If
n elements
and
elements, where
"'"
n)
and range
n!
one-to
A.
9.
Let {Ai.
, An} be a collection of n sets. Define A,
An "inductively" by setting
Ai x A2 x A3= (Ai x A2) x A3,
Ai X
X Ak= (Ai X
X Ak_1) x A k,
Ai x ... x An= (A, x . . . x An-1) x An.
an )
The elements of A1 X
X An are written (ai.
m.
A2
(We shall
Ak
Ai
An
is countable.
{0
:::)
k
k
1 :::)
E
E
A,
A.
44 J THE
Show that <I> is a one-to-one map with range an infinite subset of Q+.
The results of Exercise 5 may be helpful.
From this result it might be natural to guess that the collection of all
subsets of N is denumerable. Disturbingly enough, this is not true, as
we shall show in Section 3.3.
1.7
THE REALS
FIGURE 1.7 .1
2ab+ (a - b)2=a2+b2
I. 7 THE REALS I 45
1.7.1
Definition.
nonnegative integers N0
A
=
(r + s)(n)
(r s) (n)
r(n) + s(n),
r(n)s(n).
prove that if
as
and
1.7.4 Definition. The maximum of two rationals, r,s EQ, is the value
at (r,s) of that function with domain Q X Q defined as follows:
max
(r, s)
rsr,
sr s.
Definition.
Vr EA, lrl
set A
m.
In the symbolism of the predicate calculus the last part of this state
ment is written as follows:
(3y)(x)(x EA==> lxl
y).
Note, at this stage, unless stated to the contrary, our variables are opera
tive on the universe of the rationals.
1.7.6
Lemma.
REMARK:
1.7.7
Proof.
Proposition.
1 +
M,
lr(M)j.
1.7
{r (n): n
The set
THE REALS I 47
(n )(lr(n)I
m).
Proof.
For every
lr(n)-r (m)I
Hence, if
n,m
Q.+, 3M such
<
e/2
that
and
n,m
implies
<
e/2.
M,
l(r+s )(n)-(r+s)(m)I
lr(n)-r(m)I + ls(n)-s(m)I
<
E.
Since r and s are Cauchy sequences they are bounded and hence
(3k )(n)(lr(n) I < k & ls(n)I < k). There exists an M such that n, m M
implies
lr(n)-r (m)I
Hence, if
n,m
<
e/2k
and
ls(n)- s(m)I
<
e/2k.
M,
<
sequencer by
R x R and
range
R obeys
the commutative
p(s) = R(r8),
where
r,(n) = s for
all
n E N0
It is clear that
1.5.6). In mos t ins tances no confusion will result if we label the elements
in
p by the same
Q. C R.
names used in
Q..
In fact,
3S
In the formalism of the predicate calculus the last part of the above
s tatemen t would read as follows:
1.7.12
Theorem
0 ft_ R+ U R-.
R+ n R-=0.
R+ U R- U {O}= R.
I.7
Proof.
-R(r)
THE REALS I 49
R(r)
Suppose
- R(r) =
is a contradiction.
-(e)(e
E Q+
(3M)(n)(n
E N0 &
n;;;,: M lr(n)I
<
e))
is a true statement. Using our rule for negating statements we find that
this statement becomes
(3e)(e
Since
E Q+ &
(M)(3n)(n
E N0 &
n;;;,: M
> N
&
lr(n)I;;;,: e)).
lr(n) - r(m)I
<
e/2;
Choose
r(n0) # 0,
we have r(n0) > 0 V r(n0) < 0. In the first case, since r(n0) - r(m) :,;;;
lr(n0) - r(m)I < e/2 for m;;;,: N, it follows that r(m) > r(n0) - e/2;;;,: e/2.
In the second case, since r(m) - r(n0) :,;;; lr(n0) - r(m) I < e/2 it follows
that for m ;;;,: N, -r(m) > -r(n0) - e/2 = lr(n0)I - e/2 ;;;,: e/2. Hence r
or -r is a positive sequence.
Now, we know that trichotomy holds in Qand hence since
1.7.13
Definition
(x)(y)(x<y<=>y-x ER+).
(x)(y)(x:,;;;y<=>.x<y V x=y).
(x)(y)(x >y<=>y <x).
(x)(y)(x;;;,:y<=>y:,;;;x).
1.7.14
Lemma.
Ifx <y,
then
3r
<y.
Proof.
such that
1.7 .15
(x)(y)(x,y ER+=>(3n)(n
EN
Proof. By the previous lemma 3 r,s E Q.+ , so that 0 < r < y and
x < s < x + 1. Since Q.+ is Archimedian-ordered (Exercise 13 of Section
1.5), 3n E N, so thatx < s:;;; nr < ny.
1.7 .16
range R+
Definition.
U
x :::) x 0,
lxl = -x :::) x 0.
:;;;
1. 7.17
Theorem.
x:;;; lxl,
-x:;;; lxl,
lxl =I-xi,
llxl - IYll:;;; Ix+ YI:;;; lxl + IYI
Proof.
real
for rational Cauchy sequences. Theorem l.7.20 below shows that we get
nothing new.
Definition.
is a real
:::)
(e)(e >0=> (3N) (n) (m)(n,m E N0 & n,mN=> lx(n) - x(m) I <e)).
Our variables, of course, are now assumed to take values inR.
a ER is
x :::)
1.7
THE
REALS I 51
1.7.20
Theorem.
Proof.
R, let us write
s=p(s),
p is the isomorphism taking Qinto R.
r be a rational Cauchy sequence with range in R. For every n
wheres E Q and
Let
E N0
we may write
R. Then
lr(n) -r(P)I
3N so that
Vp
E N0,
n,p N:::}
< E/2,
r(n) -r(p)
()
<
r(n) -r(p)
and
< E2
/ .
r"{n) - f'(p)
and
or what is equivalent,
-(E2
/ ) < rn\P)
,..-:-...
- r(p)
and
for all
n,p
e+
N.
r;{jj)
r.-+ [-T]
are positive, where
r is
r. ..- [....
rn .. -r']
....
and
and
If we now set
a= R(T),
then from the facts that
and
r(n) = R (r,;-),
Vn ;;;.: N,
lr(n)-al < E.
x is a real Cauchy sequence. Using the Archimedian
Un={m: m E Z & x(n) .;;; m/n} is nonvoid and
hence, by the well ordering of N (see Exercise 17 of Section 1.5), Un
has a minimal element mn. If we set r(n)= mn/n, then from the fact that
(mn-I)/n < x(n) we get 0 .;;; r(n) - x(n) < I/n. Since x is a Cauchy
sequence, the sequence r defined by the numbers r(n) is Cauchy.
Indeed, Ve> 0, 3e' E Q+ with e' < E and 3M so that n,m ;;;.: M
==> lx(n) -x(m)I < e'/2. Hence, if n,m;;;.: max {M, 4/e}, we have
Suppose now that
3a
Consequently, for
E R and
n;;;,:
max(L,
;;;.:
a is unique,
suppose
3b,
e> 0, 3N such
implies
lx(n)-bl < E.
Then
a= b;
get a contradiction.
This concludes the proof of the theorem.
REMARKS:
V2 <t,
(b) The uniqueness part of the proof of the last theorem shows that
we are justified in calling a limit of a sequence, the limit of the sequence.
As is usual, if
x,
we shall write
x(n)=a,
x(n) -+a
as
n--+oo.
converges to
a.
I. 7
NOTE:
as
THE REALS I 53
1. 7 .20'
Proof.
Theorem.
Suppose
is
a Cauchy sequence.
x(n) =a.
n- co
0, 3N so that n
N implies
n,m
N,
lx(n) - x(m) I
lx(n) - al
la - x(m) I <
E.
The sum and product of real sequences are defined in the same way
as the sum and product of rational sequences; see Definition 1.7.2.
The analogues of Propositions 1.7.7 and 1.7.8 are valid for real se
quences and we leave it to the reader to satisfy himself of these things.
There is an important concept that we have not noted yet and that
is the concept of a subsequence.
1. 7.2 1 Definition.
A sequence y is said to be a subsequence of a sequence
x <=:? there is a function <I> with domain N 0 = N U { 0} and range in N0 so
that j < k => <l>(j) < <l>(k) and
y=xo<I>.
In other words, y(n)
= (x(<l>(n))).
The condition
<l>(n))
j < k => <l>(j) < <l>(k) means that the range of <I> is
+ l; then
1)2
4n2
4n
+ 3.
D Exercises
I.
;;,.
&
N0 =>
n(
(1+h) n ;;,. 1+nh+
0
(b)
(I - h) n
h
(c)
N0 =>
n (n
1 - nh+
h2
0, n. E N, n
;;,.
(I+h) n
2.
&
n; I)
;;,. 2 =>
;;,. 1+nh +
; I) h2
h2
and a unique m in imum element. Use this fact to give another proof
of Lemma 1.7.7.
3.
between any two real numbers. Recall that there exists an irrational
number:
\/2.
If
lxl <
4.
1, show that
xn - 0
lxl
5.
If
6.
For every
as
n-oo.
E R, show that
xn
,-o
n.
as
n-oo.
If
8.
If
(x(n))
show that
subsequence of
9.
10.
Suppose
x(n) -a
as
la - bl
<
c.
that
n;;,. N =>
1.8
If
11.
as
x(n)- a
show that
n- oo,
lx(n) I - lal
as
n- oo.
Give an
example that shows that the converse is not always true. For what
value(s) of
12.
If
a,
if any, is it
(x(n))
always
true that
lx(n) I -l al
x(n) -a?
is a sequence and
as
1) -a
n-
oo,
show that
x(n)-a
13.
Let
(s(n)) be
as
n-oo.
<T(n)
s(O)+ s(l) +
n+l
s(n).
If
14.
<l> (j)
15.
<
For every
is countable.
N,
1.8
<
U {An : n
N}
defined by
'l'(k,n)
TT
0 TT
Let
(1, n)
<l>(k).
'I'
sequence
otherwise.
N and range N
0 TT ) . l
N. Then
(2) taking
(1)
as a
below as an axiom system for the real numbers. What is a theorem under
reading (l) may be an axiom under reading
(2)
56 I THE
(a)
x+y=y+x,
x . y = y x.
(commutative laws)
(b)
(c)
(associative laws)
x
(d)
I E R, 0 ER,
I -
(y+ z)= x y+ x z.
x+O=x,
x I= x.
(distributive law)
1.8
(e)
For every x in
R:
x+(-x)=O,
x # 0 =>x x-1
(f )
(g)
x
(h)
<
in
<
<
1.
y, x=y y
,
x.
<
(trichotomylaw)
R:
<
z.
y =>x+ z < y + z,
y & 0 < z) =>x z
<
<
R,
&
<
z=>x
R:
z.
1.8.1
A set A C
R is
said to be inductive
(a)
1 E A.
(b)
xEA=>x+l EA.
1.8.2
R,
Note that N is not the null set 0, since R itself is an inductive set and
1 belongs to every inductive set. Note also it is a simple matter to prove
1.8.3
For everyMC N, if
1 EM
M=N.
Indeed, M is an inductive set and hence NC M. Since MC N we
must have M=N.
Now that we have N we can state the next property of the real number
system.
(i)
> 0
and y>
0,
there is an n in N so
To state our last and rather crucial property for the real number
system, it is necessary to consider the concepts of a sequence, a Cauchy
sequence, and a limit of a sequence. For this purpose we have the follow
ing definitions:
1.8.6
x<==>x 0,
-x. <==> x < 0.
lxl =I-xi,
x
l l+ IYI
to mean
a < b
or
a=b.
(j )
Let us note that if a sequence has a limit, then the limit must be
unique.
3N so
that
have VE > 0,
la- bl
If
S =la - bl > 0,
E,
nN.
S= la- bl< S.
Hence, if a sequence
of
the
limit
a.
(x(n))
has a limit
We usually write
lim
n-oo
a,
x(n) =a,
or
x(n)-+ a
as
n-+
oo.
1.8
1.8.9
(x(n))
is Cauchy.
x(n) -a;
then 3N so that
1.8.11
defined
as
The sum and product of two sequences (x(n)) and (y(n)) are
follows:
(x+y)(n) = x(n)+y(n),
(xy) (n) = x(n)y(n).
''
for
multiplication.
1.8.12
set
Q=
{m/n: m,n
1.8.13
Vx
E R,
E Z,
O}.
> 0 and
or,
equivalently,
In the previous proof we have made use of some facts about the
relation < without explicitly mentioning them. For example, we said
lxl < n is equivalent to the fact that -n < x and x < n. Indeed,
x ,,;:;; lxl we get from (g) that x < n, and from -x ,,;:;; lxl we get
-x < n, and from (e) and (h), n + x > 0. Now, using (d), (e), and (h)
we get x = -n + n + x > -n + 0 = -n, which is what we set out to prove.
that
from
y= x
1.8.14
N and range Q.
The proof of this statement can be found in Section 1.6. More gen
erally we can make the following definition:
1.8.15 A set
tion with domain
A
N
is said to be denumerable
and range A .
::> there
Another way of phrasing 1.8.14 is to say that the rationals are denum
erable. We can also talk about finite sets.
1.8.16 A set A is said to be finite ::>A is the null set, in which case we say
that A has zero elements, or else there is a one-to-one function with domain the
{k: k E N & 1,,;:;; k,,;:;; n} and range A, in which case we say
set (I, n)
A has n elements. A set that is either finite or denumerable is called countable.
=
D Exercises
Use only statements (a) through U> as axioms in proving the following:
1.8
1.
2.
3.
4.
m E N.
7.
8.
m E Z.
Vn EA, m:;:;; n.
9.
a
If x(n) .- a and b,
- b :;:;;
c.
- b <c or la - bl <c.
10.
If Vn E N0, x(n)
11.
Set
O!
= 1 and Vn E N, n! = 1
._
0.
n. If k E N and
_(k!)R
,
n!
12.
If x(n)
._
oo.
and y(n)
._
b - 0, show that
x(n)
a
-----
y(n)
b
13.
Let
that
p(n)
q(n)
14.
._
_g_ .
3
(a)
15.
XnYn>2:;:;; (X12+
+Xn2HY12 +
+Y n 2)
Prove
(X12 +
xk2
+
+ Xn2)
(Y12
Yk2
+
+ Yn2 )
Then add both sides of all the inequalities as k varies from I to n.]
1.9
Suppose
(e) (E
>
(3e)(e
Take L
>
;,, E.
The absolute
value is not needed, since by monotonicity x(ni) ;,, x(m1). Next take
n1 + 1 and n2
> >
E.
every k ;,, 2 there exists a finite set of pairs { (n;, m;): j E (2, k>} so
that m1 < n1 < m 2 <n 2 <
;,,
E.
1.9
x(nk) - x(m1)=
+
- x(m2)
- x(m1).
Now x(n;) - x(m;) E
and
x(m2)
x(m;+1) - x(n;)
0.
Hence
n-oo
Definition.
A C R is a number that is
(a) an upper [lower] bound for A, and
(b)
ri[ YJ
].
Proof.
In= m: m
E Z &
;E
u},
64
If we set
r(n)
Uk+i
11
m,./2 , then the above inequality shows that (r(n)) is
We claim that
g
l.u.b. A.
The first thing to show is that g is an upper bound for A. For every
EA we have
x m11/2"
n, since the number on the right is always in U. For every E > 0,
3N so that n N implies
for all
m,.
0 2n-g < E.
This follows from the fact that
that
m11/2" g. Therefore,
VE >
0, 3N so
N=>
X
This means
g-x
m71/211 < g +
E.
- x < 0, choose E
(x - g)/2 and we
(g - x)/2 > 0. We have consequently shown
O; for if g
71
of the form
and let
{k: k
EZ
& 71 km/2"}.
1.9
T/
Take
p= k0m
m
ko n<
2
T/
m
<
2n
+8
T/
I 65
g.
p/2" < g.
TJ
m11/2n
p/2n.
Hence we must
have g TJ
If A is bounded below, then
A.
(a)
n =}x(n)
(x(n)) is bounded
(y(n)), where
(x(n))
with
x(m).
a sequence
y(k)= g.l
Since
y(k + 1)
{x(n): n;;;;. k
.
b .{ x(n):
n;;;;. k }
l} C {x(n): n;;;;. k}
(y(n)) is bounded
The sequence
it
follows
that
y(k)
a=
We claim that
a=
l.u.b { y(n) :
.
n;;;;. O}.
a -y(n);;;;. e.
3e> 0
66 I THE
NUMBER SYSTEM
REAL
y(n)
a. Now if
N < n, y(m)
y(n),
and therefore
a - y(m);;;., a - y(n);;;., e.
N = 1, 2,
every m,
If we successively choose
proved, we see that for
y(m)
a-
e < a.
y.
(x(n)). To do this we must show that Ve> 0, 3x(n) such that 0 <
lx(n) - al < e. Now, Ve> 0, 3N such that n;;;., N ly(n) - al < e/2.
Also, Vn;;;., N, 3n1;;;., n so that lx(n1) - y(n) I < e/2 and Vm> n1,
3n2;;;., m so that lx(n2) - y(m) I < e/2. Hence 3n1 and 3n2> n1 so that
Jx(n1) - al
lx(n2) -al
Since
(b)
x(n1) #- x(n2),
m =infA,
M=supA.
These numbers exist by virtue of the Theorem 1.9.5. Next, let us put
P = {x:
m<x<
E Q &
M}.
Vn
(r(n))
E N0 set
An= {x:
EA &
r(n)
x}.
x(n) =infAn.
We claim that the range of the sequence
(x(n)) is infinite,
and there
y1 ,
, Yk.
Y1 < Y2 <
<
Yk I f we put Yo=
B; = {x:
m,
EA &
1.9
must be void for 0 ,,;;;; j ,,;;;; k. But this means that A can have at most k + 2
elements, which is a contradiction.
If 3n so that x(n) A, then x(n) is an accumulation point of A. On
the other hand, if Vn E N0, x(n) E A, then A contains a denumerable
set and we get the existence of an accumulation point from the first
part of the proof.
If A is the conjunction of the statements (a) through (i) given in the
description for R in Section 1.8, then an examination of the proofs of
this section show that we have the following chain of implications:
A
[Incidentally, the reason we did not use Theorem 1.9.3 directly in the
proof of Theorem l.9.7(a) in taking a as the limit of the monotone
sequence (y(n)) is that we wanted to establish the above chain of impli
cation.] If we can show that A & Theorem I. 9.7 => A & (j), then all
these statements are equivalent and any one of the statements of
Theorems 1.9.3, 1.9.5, or 1.9.7 can be used in place of U).
1.9.8
Theorem.
lx(n)
a l ,,;;;;
E.
If
2.
If
3.
If
If a set
of
5.
If A is a denumerable set in R and a is an accumulation point
A, show that there is a sequence in A which converges to a. If you
use the axiom of induction, be sure you use it carefully and correctly.
6.
(a)
7.
3
a(n) =
2
8.
(2n+ 3)
2(n+ l)'
EN0
a(n) = I +
I
"+1
,
n+1
EN0
The reader may recognize that the limit of this sequence is designated
by 'e'.
10.
n =(I+
a11
) n [n(n-1)/2!]
2
an ,
an
and hence
for n 2.]
11.
12.
0.
n,p,q
EN?
CHAPTER
21 LIMITS
respectively.
2.1
If
2.1.1
a
x
lim J(x ) =
x-a
or
3M so that
;;;,: M and
E oecn implies
70 I LIMITS
when a = oo or a=-oo.
In dealing with the limit of a function f at a it is very natural and
often very convenient to want to conclude that lim.r-af(x) = La
is an accumulation point of ,E;(f) and for every sequence (xn) for which
Xn - a, f(x) - L. One implication is very easy to prove: If f(x) - Las
x - a and Xn - a, then f(xn) -1. Indeed, VE> 0, 3 S> 0 so that
Ix - al<S and x E .e(J)'\_{a} IJ(x) - ll<E. On the other hand,
3N so that n N lxn - al<S. Hence n N lf(xn) - LI<E. The
last statement is precisely the fact thatf(xn) - L.
However, it is not always so clear how to prove the converse implica
tion. We have the hypothesis that a is an accumulation point of ,E;(f )
and for every (xn) so that Xn - a, f(xn) - L. We want to conclude that
f(x) - Las x - a. Let us see how we could proceed. Suppose it is not
true that f(x) - l as x - a. Then we must negate the statement:
E).
L
I
(Note that, for the sake of brevity, we have shortened the symbolism
of the predicate calculus by putting restrictions on the quantified varia
bles together with the quantification symbols ). Let Eo be a number for
which the last statement is true. For n E N0, let Sn= l/(n + 1) and let
(2.1.1)
2.1
7% I LIMITS
2.1.3
Definition.
2.1.4
of (J )
(a)
(b)
(c)
Proposition.
x-a
x-a
x-a
x-a
x-a
0, then
x-a
x-a
x-a
and
(b) Take l and m as in part (a). Then 361 > 0 so that I x - al < 81
and x E (J )\{a}
{a}
and
al< 8 and x
(J )
(g)\{a}
(c) Again, take mas in part (a). Then 361 > 0 so that x
0 < I x - al < 81
(g) and
2.1
lml - lg(x) I
Thus lg(x) I > lml/2, and hence ]a - Si. a+ s.[ n J0(g) c 1>(1/g).
Now, V > 0, 3S so that 0 <S <S1 and x E J0(g) with 0 < Ix - al
< s =:::}
1 g1 (x) - ;;;
1
lmg(x) I
lg(x) - ml
2
lg(x) - ml < .
-2
m
x-a
l/m.
The proof of part (c) is completed by applying part (b) to the product
functionf(l/g).
It is by no means always obvious whether or not a function has a limit
at a point; and even if we know that it has a limit at a point, the value
of the limit may not be too easy to establish. A standard example that
shows this is the function defined by
s(x) =
smx
'
x-o
0.
= 1.
limf(x) =f(a).
x-a
74 I LIMITS
--+f(a). (AC) Conversely, if a E (J), and for every sequence (xn) with
range n
i (J) if Xn --+ a impl iesf(xn) --+f(a), then f is continuous at a.
,
so that
e0,
we get a contradiction.
and
In case
g,
tion 2.1.4 and the remark made prior to Proposition 2.1.6. In case
Proof.
and
o and
of continuity off0
g at a.
provided
(J0 g).
]O, I]
and
f(x)=
Letf
I
x
a.
]O, I],
g(x)= x
is continuous
2.1
Let us take
on
a,
and
>
and
so that
Therefore,
Ix - al
x 1.
since
E,
is fixed and
E goes
to zero,
EXa
<
Ea,
<
D Exercises
I.
,RJ(g)
2.
of l(x) at
3.
a;
Let
l(x)
4.
2.1.4
when
,RJ(l)
and
oo.
s be
s(x)
Show that
a=
s does
=sin
(1/x) .
x =0.
R given by
5.
Suppose
l is
l(x + y)
Iffis continuous at
then
6.
{ 0}
=fl(, (J) .
Suppose
is continuous at
a show
7.
3m
8.
=l(x)l(y) .
fJC,(f),
E R so that f
E R so thatl(x) =mx .
If f is a function, then
=,RJ(f) and
Theorem
E R,
l(x + y)
then
Vx, y
76 J LIMITS
f(x)
:atinal,
{ 1O(:::xx:) iss irrational
.
(::::)
max(J1(x).f2(x)).
12.
Assume the following properties for the sine and cosine func
tions:
lsin(x)I
Jcos(x) I
lxl,
sin(x)- sin(y)
2 sin
1,
Vx ER;
(x ;y ) cos(x ;y ) ,
Vx,y ER.
J(x)
{ l/q
2.2
and not on
the point chosen in the interval. In 1895 E. Borel distilled out the
essence of Heine's proof and stated a certain property about closed
bounded intervals from which Heine's theorem followed as an imme
diate corollary. The purpose of this section is to prove the Heine-Borel
theorem and use it to prove Heine's theorem on continuous functions.
We begin with a number of definitions.
2.2.1
Definition. A set in R is said to be closed:::} it contains all its
accumulation points. The closure A of any set A is A together with all its accumu
lation points.
A set in R is said to be compact :::} it is closed and bounded.
2.2.2 Definition. A set UC R is said to be open :::} VxE U there is
an open interval I(x) CU.
Recall thatx E I(x) so that if U is open we have U =
U {/(x): xE U}.
is continuous at
a:::} aE JE>(J)
a so thatf(V
U
JE>(J))
CU.
2.2.3 Proposition. The complement of a closed set is open and the
complement of an open set is closed.
Proof.
Suppose
3/(x) CAc.
belong to A.
is closed. Then
Otherwise
VxEAc,
the complement of A,
is an accumulation point of
and would
2.2.4
Definition.
c R :::}
A C U{U:UE1"'}.
The collection 1-" is said to be an open covering for A :::} every UE 1-" is open,
and 1-" is a covering for A.
2.2.5 Theorem (Heine-Borel). A set A C R is compact :::} in every
open covering for A there exists a finite number of sets that cover A .
Proof.
1-"
A is a
A and
78 j LIMITS
so that
..
A.
If the range of
(xn)
a which
to
a.
Conversely, suppose that every sequence with range in A contains
is an accumulation
a.
Indeed, forn
n A. The sets An are nonvoid and by (AC) we get a sequence (xn) with
Xn EAn; clearly Xn-+ a. By hypothesis there is a subsequence (yn)
that converges to an element of A. But any subsequence of a conver
gent sequence converges to the same limit as the original sequence.
To show
An=A n [n,n + l [ .
An which are nonvoid. Other
A is bounded above. Hence the collection of nonvoid An is de
numerable and there is a one-to-one function <I> that maps N0 into itself
so that the set A41<n> is nonvoid. Clearly it is possible to choose <I> so that
m < n => <l>(m) < <l>(n) (proof?). Let us put Xn=inf A41<n> From the
definition of Ak and the fact that <I> is increasing, we get
There are an infinite number of the sets
wise
<l>{n) ,;;:;; Xn
<
<l>(n) + 1
,;;:;;
<l>(n
1).
Then we have
>
n,
Ym - Y n 1.
Thus no subsequence of
can converge. This contradicts the original hypothesis, and shows that
2.2. 7
Theorem (Cantor).
Proof.
and since
1.2]
covering for
80 I LIMITS
that covers A0 Let
it follows that Ame covers A0. But this is impossible, since Am C A0,
2.2.8
>
Definition.
0, 38> 0
(E) (E>
J(x) = l /x
for
is given by
Another example
function is even
bounded.
0 so that if
For every x E
IY-xi
(J)
x E (J)}
so that
lx-yl <8.
Now,3k E
Hence
IJ(y)-f(x)I,,;;; IJ(y)-f(xk)I
which proves that
REMARK:
IJ(xk)-f(x)I <E,
f is uniformly continuous.
necessary to use the axiom of choice in order to pick a 8(E, x) for every
x E ( f). However,
2.2
and
x,
of all S's less than one for which the first statement in the proof holds,
and then take the supremum of this set. There are also a number of
other ways to construct a suitable covering for
(J)
and we are
sure that the reader can think of several alternate ways. There are
a number of places throughout the text where a point like this will
arise, but we shall not comment about it in the future.
There are several other important results about continuous functions
with compact domains that follow from the Heine-Borel theorem. We
shall give these below.
2.2.10
Theorem.
compact range.
Proof.
If
show that its range is closed and bounded. We shall consider only the
case where
if
fl(,(j)
o/:-
is an accumulation point of
An= {x: x
Each set
An
fl(, (f), Vn
N.
N let
Thus
closed.
f(a)
b,
Vn
N, An+i
An.
Thus by the
which shows
fl(,(j),
and hence
fl(,(j)
is
To show that
tinuous,
Vn
if(x) I - IJ(a) I
:;;;_;
IJ(x) - f(a) I
<
n.
Thus
lection
82 I LIMITS
If
last theorem that f/C,(f) is compact. Since f/C,(f) is bounded, the numbers
m =inf f/C,(f),
M =sup f/C,(f),
exist, and since f/(, (f) is closed, they belong to f/(, (f). This establishes
the theorem.
The next theorem gives more specific information about functions
that are defined on compact intervals. We first prove a lemma.
argument suppose
Proof.
- y, then
f/(, (f) C
D Exercises
I.
is an open set.
2.
2.2
A' is closed.
ACBA'CB'.
(AUB)'=A'UB'.
A=AUA' is closed.
(A)'=A'.
A is closed A=A.
For
5.
Show that the union of any number of open sets is open and the
6.
number of closed sets is closed and the union of any finite number of
closed sets is closed.
7.
If
C R and
x E R,
and
as
d(x,A)=inf{lx-yl:y EA}.
If
is closed and
x g A,
3y EA
show that
(Hint:
so that
8.
and
d(A,B) =inf{lx-yl: x EA
If A is compact and B is
d(A, B) = Ix-YI. Is this
9.
and
are subsets of
as
&
y EB}.
3x EA and 3y EB so that
B are merely closed?
and
10.
If
ACRand x E R,
define
x+A={x+y: y E.A}.
Show the following:
(a)
(b)
(c)
11.
A
A
A
x+A is open.
x+A is closed.
compact x+A is compact.
is open
is closed
is
84 ILIMITS
is nonvoid.
n{Ax:xES}
13. Assuming the Cantor intersection theorem, 2.2.7, and the axiom
of induction, prove the Bolzano-Weierstrass theorem. [Hint: If A is an
infinite bounded set, use the axiom of induction to establish the exist
ence of a sequence (In) of compact intervals, each of which contains a
point of A so that Vk E N0, h+i Ch, and the length of In goes to zero
asnoo.J
14. Show that the Heine-Borel theorem implies the Bolzano
Weierstrass theorem.
15. If f and g are uniformly continuous functions, show thatf+ g,
g,
J and f g are also uniformly continuous. Under what condition(s)
is l/g uniformly continuous?
0
MONOTONE FUNCTIONS
2.3
2.3.2
a setA
MONOTONE FUNCTIONS I 85
Definition.
Ve> 0,
0].
n >(!)
will be designated by
n >(!)]
f(a+)[ f(a-)],
and we
shall write
f(a+) =
lim
f(x),
x'a
f(a-) =
lim
X-"a
f(x).
We leave as an exercise the simple fact that if the right and left limits of
exist at
a,
then
has a limit at
a if and
are equal.
f(a+) =
f(a-) =
If
g.l.b.
l.u.b.
{ f(x): x
{ f(x): x
E >(!) &
E >(!) &
x > a},
x <a}.
f( a-) = f( a),
and if
-f
is monotone non
E N, let us set
Jn=
{x: x
E >(!) &
lf(x) I
n},
86JUMITS
We claim that Dn hs at most 2n2 points. Indeed, let us first note that
Vx,y E n we have
ll(x) -l(y)I
2n.
Next, suppose that {a;:} E (I,k)} is any finite set of points in Dn.
Without loss of generality we may suppose that these points are labeled
so that a 1 < a2 <
< ak. Let us adopt the convention that i f
a E (J) is not a right accumulation point of (J), we set l( a + )
=l( a ), and if a is not a left accumulation point of (J), we setl(a-)
=l( a ) Then, clearly, we may writel(ak+) -l(a1- ) as a "telescoping
sum" in the following way:
l( ak+ )
l( a 1 )
i=I
J=2
Since l is monotone, the terms that appear under the summation signs
are always nonnegative or always nonpositive. Thus
n
l( a; ) I k/n.
D = U {Dn: n E N}.
It follows from Exercise 15 of Section 1. 7 that D is countable.
The next theorem tells us that if the domain of a strictly monotone
function satisfies certain conditions, in particular if it is an interval,
then the inverse of the function is always continuous. This is true
regardless of whether the given function is continuous or not.
2.3.6 Theorem. If f is a monotone increasing or monotone decreasing
function, then1-1 is a monotone increasing or decreasing function, respectively.
If (J) has the property that Va, b E (J) with a b, [a, b] n (J)
is closed (hence compact), then 1-1 is continuous.
Proof. If we suppose that l is monotone increasing, it is clearly a
one-to-one function and hence 1-1 is a function. For every y1 and y2
in (J-1) letx1 andx2 in (J) be those elements for whichy1=l(x1)
andy2=l(x1). Ify1 < y2, then we must have l-1(y2) -l-1(y1) X2 -xi
> 0. For otherwise, ifx2 -x1 0, we would gety2=l(x2) l(x1) =Yi.
which is a contradiction. Thus 1-1 is increasing. If f is monotone de
creasing, a similar proof shows thatl-1 is monotone decreasing.
Suppose now that b E (J-1) and b =l( a ). If a is a right accumula
tion point of (J), then VE > 0, 3x, E (J) so that 0 < x. - a < E.
=
2.S
MONOTONE FUNCTIONS I 87
f is monotone increasing
y ,,;;; y., then since 1-1 is mono
y.=f(x,). If y
J(f-1) and b
,,;;;
E.
If 3e>0 so that
J(f-1)
[a+
E,
X]
1
EXPONENTIAL FUNCTIONS
generalized
E N, then
an can be
fa with
fa(n+ 1) =fa(n}fa(l) .
fa(l)=a and V n
E N,
E N there exists a
uni,que solution in
R+ to the equation
.
a>0,
which is labeled 'a1tn. If
If
E N 0, define (Exercise
9 of Section 2.3)
88 I LIMITS
Using these facts, and the uniqueness of the solutions of any equation
r,s E Q.,
ar-as=a(ar--
I).
Since a > 1, a simpie inductive proof shows that aP > 1. Now, if ar-s 1,
then (ar-)q 1, which would contradict the above equality.
Since ea is monotone on Q. it is bounded on every bounded set in Q..
. Further, it is continuous at r
0. To prove this let us first suppose that
a;;;,: 1. For any a;;;,: 1, it is always true that Vn E N,
=
1 a< (I +a/n)n.
For we can either expand the right side by the binomial theorem (which
we suppose known) or use the result of Exercise 1 of Section 1.7 to get
(I+
a/n)n> 1 +a > a.
2.5
MONOTONE FUNCTIONS j 89
(2.3.2)
ea(x+) <ea(s),
If now we use the fact that ea, defined on Q, is continuous at zero, and
is bounded on any bounded set, we see that we must have
ea(x-) =ea(x+).
3.
such that
90 I LIMITS
a>
E R
so thatf(x) = ea(x) .
a.
Also, if
bn= a.
9.
If m E N0 and n E N , we defined
10.
2.4
There is a concept which is more general than the concept of the limit
of a function at a point and is very useful in many instances. In this sec
tion we shall introduce this concept and obtain several facts about it.
Let us suppose that f is a bounded function with J>(f) - 0 and a is
an accumulation point of J>(f). For every r E R+ let us define
E
E
J>(f)
J>(f)
& 0 <
& 0 <
Ix
Ix
al
al
<
<
r},
r}.
2.4
2.4.1
Definition. If f
point of Je(f), we shall set
is
x-a
r-o
These numbers are called the limit superior and the limit inferior of f at a,
respectively.
The limit superior and the limit inferior of f at a are sometimes
designated by
Jim sup f(x)
x-a
and
x-a
Je(f) & x
Je(f) & x
{x : x
>
>
r},
r }.
x-oo
r-ao
r-oo
92 I LIMITS
-1
7T
-1
-2
Figure 2.4.1
x-o
-1.
'
Proposition.
0.
is an
accumu
lim f(x).
!e1(0+)= l= ;1(0+).
Conversely, suppose this last equation is satisfied. Then Ve > 0,
38> 0 so that 11(8) - l l < e and l<P1(8) - l l < e. However, Vx E
(f) for which 0 < Ix-al < 8 we hive,r1(8) os;f(x) ..-;;1(8). Hence
-e < :1(8)
l os;f(x)- l
,,,-;;
q;-1(8)
l < e.
Theorem.
is an
accumu
2.4
(a)
x-a
x-a
x-a
(b)
x-a
x-a
x-a
x-a
x-a
x-a
x-a
x-a
Proof.
lb
(a)
al
<
is
x-a
l/lim g(x).
x-a
0 <
l/lim g(x),
x-a
lim (l/g)(x)
""0
x-a
r and
r+u(r)
<
(f+g)(b)+e .:;;1h(r)+0(r)+e.
Letting r--+ 0 and taking account of the fact that Eis arbitrary, we get
the first inequality in (a). The second one follows by similar reasoning.
(b) For every E> 0 and Vr > 0, 3 b E .e(fg) so that 0 < lb - al < r
and
ru(r )
<
(fg)(b)+E.:;; 1(r)u(r)+ E.
The last inequality makes use of the facts that 0.:;;j(b).:;; r(r) and
0 .:;; g(b) .:;; ?u(r). Letting r--+ 0 we get the first inequality in (b). The
second one follows by similar reasoning.
(c) For every E> 0 and Vr > 0, 3b E .,e)(g) so that 0 < lb - a l < r
and
1/g(r)
<
(l/g)(b)+E.
Since g does not change its sign and l/g is bounded, 3m > 0 so that
Vx , g(x) m or g(x) .:;; -m. Consequently, 'Pu(r) m or ip0(r) .:;; -m,
respectively. Since Vx E .e(g) for which 0 < Ix - al < r we have
g(x) fu(r), we have in either case,
'P11u(r)
<
(l/g)(b)+E.:;; l/ipu(r) + E.
x-a
x-a
0 <
94 I LIMITS
,,
fa(r)
,b)
<
_f0(r)
E.
_!a(r)
<
1
g(b)
'P11a (r) .
x-a
This inequality when taken together with the previous one gives equality.
The second equality follows by similar reasoning.
NOTE:
5 at the end of the chapter. Also, it is clear that the above theorem
1imf1(x) = limf(x) ,
.x-a
x-a
lim f1(x)
x-a
lim f(x).
x-a
g(x) = cos(l/x),
f(x) = sin(l/x),
;t. 0.
These functions change their signs infinitely many times in any neigh
borhood of the origin. Now, by a well-known trigonometric identity
we have
limf(x)
x-o
=-
Jim g(x)
x-o
-1,
x:::;o
and hence
Jim f(x) Jim g(x) > lim (Jg) (x).
x:::;o
x:::;o
x=o
2.4
g(x) = cos(I/x),
0.
k = 0,
1, , we
(2k+ 1)7T,
get
Hence
(Jg)(x)
lim
-1.
x-o
x-o
= 0,
lim
x-o
g(x) =-1 ,
x-o
x-o
and therefore
lim
x-o
lim
x-o
(Jg)(x) 2.
x-o
x-o
x-o
Proof.
Clearly
may set
l= limf(x).
x-a
We claim that
l=
96 I LIMITS
lf(x) - ll
IJ(x) - f(y)I
lf(y) - ll < E.
D Exercises
1.
that
If f is bounded and
x-a
x-a
4.
x-a
x-a
x-a
x-a
x-a
x-a
Using this in conjunction with Theorem 2.4.3(a) show that if limx- af(x)
exists, then
Jim (f + g)(x) = Jim J(x) + Jim g(x) ,
x-a
x-a
x-a
5.
x-a
x-a
x-a
lim (fg)(x),
x-a
x-a
x-a
x-a
x-a
x-a
x-a
Ji
m (Jg)(x) = Jim f(x) lim g(x).
x-a
x-a
x-a
If x E [O, I] and _is rational of the form p/q (in lowest terms),
define f(x) = I/q. Also set f(O) = I and if x is irrationaf set f(x) = 0.
6.
7.
A= nlim an,
-oo
show that V E > 0 there are only a finite number of n E N0 with an > A
E and an infinite number of n so that A - E< an. State and prove an
analogous statement for limn_., an.
8. Compute the limit inferior and the limit superior for the
sequences whose terms are the following:
(a) sin (mr/2).
n
(b) (l + (-1) ) COS ( n1T) .
)
(c) (l + l/n (l +sin (mr/8))1'n.
9. Let ( rn) be a sequence, whos range consists of all rationals in
JO, 1 [. If rn= Pnfqn. Pn.qn E N' set Sn= (Pnfqn)11qn . Show that
Jim S = lim S = 1.
n-oo n n-oo n
10. A function f is said to be upper semicontinuous at a <=>a E
>(!) and Ve > 0, 3S> 0 so that Ix-al<Sand x E >(!)f (x)
<J(a) + E. If a E >(!)and is an accumulation point of>(!), show
that f is upper semicontinuous at a if and only if limx-a J(x) os; f(a).
Give an analogous definition for lower semicontinuity and prove a
similar result .
11.
{ 1 <=> x is rational,
0 <=>x is irrational.
CHAPTER
3j INFINITE
SERIES
There exists a unique function er with domain and range the collection of
all real sequences so that if a= (an) is a sequence, then
cr(a)o= ao,
Vn E
N0.
Although we shall not bother to carry out the proof, the general idea
how to use induction in this context is like that used in the proof of
Theorem l.6.4. It is not hard to prove, using the commutative, asso
ciative, and distributive laws for R, and the axiom of induction, that
Va,{3 E
a and b,
{3cr(b).
so that
Vn E
N0,
(aa)n=aa n .
We now set
n
k=O
ak= cr(a)n=ao + ai
a n.
{ ak: k E ( 0, n)},
( 0, n)
it is of
ak and domain
98
3.1
(O, n)
so that Vk E
then CT(a) n
= CT(b)n
Very often we shall use the symbol
If n < m, this is to be given the value zero. If n ;?.!: m and a= (an) is any
sequence that is an extension of the function with domain
( m, n)
and
k=m
ak= CT(b)n-m
( 0, n)
a'=
a EA
k=O
a<P(k)
3.1
3.1.1
Definition.
This is in keeping with standard practice and the notations are very
convenient. Very often we shall use a notation such as
Let
bk= ak+n;
3.1.2
Proposition.
lim a = 0.
n-co n
Proof.
Since
<T(a)
an ---"' 0
as n
< E.
---"' oo.
k=O
k
(<-I)k _
I+k
. .
!1.?!
k
= I'
I+ k
an ---"' 0
k=l
Clearly, 1/k
-0
as k
oo,
G)
sn=
Lk
k=l
n +k
2n. Hence
Vn E
N,
S2n - Sn=
L n+k
k=t
I
:;;;:,: 2
This means that the sequence of partial sums is not Cauchy and hence
cannot be convergent.
Let us now record a very simple but very useful fact: The sum of two
convergent series is again convergent.
3.1.3
a, {3 E
Theorem.
R,
00
00
00
k=O
k=O
k=O
Proof.
n-oo
[au(a)n + ,Bu(b)n] =a
lim
n-=
u(a) + ,B
n
lim
n-ao
u(b) .
n
V k E (O, n),
n .
k=O
( (-I)k )
k+I
3.1.6 Lemma. If V k
finite subset of N0, then
ak
E N0,
0,
co
L ak:,,;; L ak.
kEA
k=O
Proof.
Let
n) \A.
Then
<T(a)
<T(a)n:,,;; L an ,
k=O
which proves the lemma.
be a rearrangement and A=
n
L a
k=O
Hence
<T(a
ct>)
co
it converges and
co
L a
k=O
Actually, since
(a,<T(a))
co
is a rearrangement of
(a
ct>, <T(a
get the reverse inequality, so that the two numbers are equal.
For a general absolutely convergent sequence
+ _
an a
,,
lanl+an
2
lanl - an
2
(an)
let us set
ct>)),
we
3.1
Thus 0,,;;;;
and sirice
<T(a+)
and
k=O
<T(a-)
Hence
k=O
they are convergent. From the previous paragraph and Theorem 3.1.3
we get
"'
L a
k=O
"'
<l>k = L a+
"'
<l> k - L a- <l>k
k=O
k=O
Note that the first equality and the last equality come from Theorem
<T(a)
is convergent. Also,
<T(la <l>I)
0
is
la
<l>kl=a+
<l>k +a-
<l>k
bn =
<l>(11+1l-l
ak.
k=<l>(n)
k=O
<l>(k)
and consequently
((- I) k)
2k, then
u(b)
is convergent.
If a series
Theorem (Riemann).
convergent, then for any two real numbers a 13, there i,s a rearrangement
(a
<I>,
<T(a
<I>)) so
that
lim <T(a
nco
Proof.
<l>)n =a,
0 and a-n
0 as n
oo.
Let r0 be the smallest integer so that <T(a+)r0 > 13. This is possible
since <T(a+) is unbounded. Hence the difference between <T(a+)r0 and
13 is at most a+ro Next let s0 be the smallest integer so that <T(a+)r0
- <T(a-) < a. Again this is possible, since <T(a-) is unbounded. The
difference between a and <T(a+)r. - <T(a-)80 is at most a-. Now let r1
be the smallest integer so that <T(a+)ri - <T(a-)80 > 13; that is,
ro
k=O
so
k
k=O
a-k +
ri
k=ro+l
The first step is to get the subsequences (a+n) and (a-n) For this
A-={n: an< O} .
5.1
ak
Clearly
0 and hence
creasing.
Actually, both of these sequences are unbounded. For if, for exam
ple,
the set
{k: <1>+(k)
(o,<1>-(n))}
mn. Clearly
mn oo as n oo, since otherwise "(<1>+) is finite. Now, (<1>+j(O,mn))
=A+ n (O,<1>-(n)), and (<1>-j (I, n)) =A- n (O,<1>-(n) ). These sets,
which we shall label Bn+ and Bn-, respectively, are disjoint and together
give (O, <1>-(n)). Thus
is not void, and hence has a maximum which we label
But since
it follows that
(b)
quences
N0 and
qr(O)
qr(l)
qr(2)
= min
=min
=min
Tn =qr(2n),
Sn= qr(2n + 1).
E N0,
(3.1.1)
a(c)
E,
(3.1.2)
proper rearrangement of the series (a, u (a)). Let us put, for j E N0,
n_1 ro,
n2i = S; + Tj,
n2i l = Sj + T;+1
+
=
Since
(s;)
(3.1.3)
m) is mono
N0= U {Bk: k
N0 U {- 1} } .
<l>(k)=
E B2;-1,
B2i,
(3.1.4)
(3.1.4')
suppose p E B21_1, q E B2; 1 and P < q. We haven2<i l> < P < q .;:; n2; 1>
3.1
i,,;; j.
(rk) is monotone increasing we have p =s1_1 ,,;; r1 ,,;; r;-1 <q -s1-1 if
i <j, andp-s1_1 <q-s1_1 if i = j Thus
.
In the same way we show that <l>IB- is monotone. These facts, together
<I> is
<I>
is one to
and range N0
(d)
Let us now bring all these things together and show that
lim O'(a <l>)n =a,
0
n-co
n-.,
We shall prove only the second one, since the first one will follow by
similar reasoning.
Suppose we take
odd,
O'(a <l>)nm
o
O' (a <l>) ro +
o
L [O'(a
k=O
j
L [O'(a
k=O
Now,
sk+rk
a <l>(i)
k-1+1
i) =a <1>-(i) =-a-;
0
Thus we get
O'(a
forsk-I
< i,,;;sk.
.L
a-i
k-1+1
=:' -
L [O'(a
k=O
that
=-
L [O'(a-).k-
k=O
O'(a ) k-1]
=-O"(a-).i'
0.)
I 08 j INFINITE SERIES
and hence
j
+
+
L [<T(a 0 <l>)2k+i - <T(a <l>)2k] = <T(a )r;+1 - <T(a )r0
0
k=O
Consequently, we get
<T(a o <l>)n2J+i
If nm-1 <
n nm+1
<T(a+)r;+1 - <T(a-)sr
(3.1.5)
we have
nm-1
n
L a 0 <l>k= L a 0 <l>k + L a
n
k=O
n m-1+1
k=O
<l>k.
Now, nm-1 = n2J. and for n2J < k n2;+1 we have a <l>k= a 0 cJ>+(k - SJ )
= a+k-J 0. Also, for n2J+1 < k n2J+2 we have a 0 cl> k a 0 ct>-(k - r;+1 )
= -a-k-rJ+1 0. Hence, in either case, n < nm or nm n, we get
0
La
k=O
<l>k
nm
L a <l>k= <T(a
o
k=O
<l>) n2J+1
(3.1.6)
n n2N =:::::}
n
L a 0 <l>k
k=O
{3 + E.
N0, {3
<
<T(a
REMARK:
<T(a 0 <l>)n
{3 .
take
so
D Exercises
1.
a,
show that
2.
:L rk=
k=o
1 - rn+l
.
I
-r
3.1
3.
Let
If k=o
convergent.
5.
If
6.
Show that
n(n+ 1) = 1.
00
[Hint:
n(n+ 1)
7.
1
:;;
Show that
00
- n+ J
1
I n(n+ l}(n+2) = ;r
8.
is convergent or divergent.
9.
If limn-oo
n
L ak
n+ l k=O
converges to the same limit. This is called Cesaro summability.
bn
If
terms and
I I.
taneously, provided
Vk
00
N0, ak 0.
If
00
( 2k )
o ; a1
is divergent.
3.2
CONVERGENCE TESTS
3.2.1 Comparison Test. Suppose (an) and (bn) are sequences for
which there is an N so that n N::::} lanl .,;; lbnl. Then ifer(lbl) is convergent,
so is er(lal), and consequently if er(lal) is divergent, so is er(lbl).
Fix
Proof.
n0
N and for
>
n0 we have
o.;;;er<laDn-er(lal)no=
n
L lakl
k=no+l
.,;;
L lbkl
k=no+l
er(lbl)n-er(lbl)no
#-
l,
n
1 - rn+I .
rk=
L,,,
l-r
k=O
Hence, if lrl < 1 ,
If
rn
lrl
co
Consequently,
1
L n1
n=l
co
3.2.2
lim lanl11n.
n< 1 and diverges if p
p
co
>
1.
superior, 3N so that
t If
N::::}
3.2
lanl < (p + E) n.
By the comparison test (with the geometric series)
u( lai)
converges.
lanl11n
p- E,
>
lanl
(p - E)n
>
> 1.
N and n >
k,
we have
n!
>
(l/n!)
kn-k+1.
( n ! )l/n
This shows that
.
hm
n,
kk<l-k>Jn
>
( )l/n
1
-1
n-"' n.
>
k/2.
=O,
3.2.3
3N so
Suppose (an)
that n
is
l aann l
1 aan+n1 1
+l ,
R =Im
.
1
-n-
QO
r=I.1m
n:::"OO
If R < 1, u(jai)
Proof.
is
convergent and if r
-- .
> 1,
u{iai)
is
divergent.
chain of inequalities,
llm
n-co
1 aan+n 1 1
.;;;;
1
lffi
n-oo
I an 111n
n!/ (n + I) != l/n
10.
The ratio test, when it works, is usually very easy to apply and hence
is very useful. When it fails it may be possible to apply a slightly sharper
variant of it.
3.2.4
N => a
n- ( -laann+i I) ,
n n(1-1 aann+il)
= lim n
oo
{3= lim
-
If
oo
Proof.
Thus
If
kjak+il
jak+il
> 1,
la:l l
< k
(
Hencek
Vp so that
<
K=>
-x
<
(p - I) lakl
< (k
- I) lakl - klak+il
(p - I) L lakl
Therefore,
K.
k K=>
lakak+1 1
> l
f!..
Therefore,
3.%
CONVERGENCE TESTS I I 13
Both the root test and the ratio test fail for the series
k=l
{: )
2
Thus
I
(
-
k +I
I
> I - I+ 2/k
=k
2
+ 2'
and hence
Jim
k-oo
(k +
2 ) [l
(k/k + 1)2]
=Jim
k-oo
k [I
(k/k + 1)2 ]
;:.;,: 2.
Proof.
unbounded.
Using the monotone nonincreasing character of (an) we get
2k+l-1
J=2k
2k-1
i=2k-1
k =0
to
n we
k = 0.
Summing from
get
u(a)2 n+l-i,,;;;
ao + L
k=O
This set of inequalities when taken together with the comments in the
first paragraph constitute the proof.
As an example of the use of the condensation test, let us consider
the convergence properties of the
p series,
diverged. Hence the comparison test shows that the p series diverges
for p < 1. We can apply the condensation test only for p 0, since
otherwise the sequence ( l/nP) is increasing. For p 0 we must examine
the convergence of the series
00
'L
k=O
k
( l/2<p-l) ) .
Lemma (Abel).
3.2.6
k=O
akbk
bn+1 u(a)n -
k=O
(bk+1 - bk ) u(ah.
Proof.
(3.2.l)
0,
ak = u(ah- u(ah 1
Therefore,
n
k=O
akbk=
k=O
bku(ah-
k=I
bk u(ah-1
(3.2.2)
The last sum begins at k= 1, since u(a)_1= 0. This last sum can also
be written
n
k=O
3.2.7
Abel's Test.
is
is convergent.
Proof.
3.2
(bn)
is monotone,
k=O
Since (bn)
n
M L lbk+l - bk l
k=O
M [bn+l
- ho]
to
zero,
Dirichlet's Test.
then
is convergent.
Proof.
L (bk+1 - bn)cr(ah
k=O
er
(a),
then
lcr(a)nbn+il
and since
bn
- 0, cr(a)nbn+i - 0.
Mlbn+1I.
gent only if an
- 0.
(a, er(a))
is conver
3.2.9
Leibnitz' Test.
is convergent.
Proof.
(an)
L <- t)k
k=O
is bounded by 1.
k=I
kx ) ,
si
x E ] -oo,
oo
[.
cos
2 sin (x/2)
Thus, if
k=l
sin kx
cos (x/2)
cos
(n
1/2)x.
we may apply Dirichlet's test and we see the original series converges.
If x
O Exercises
1.
ratio test,
2.
(b)
3.
3.2.3.
o (i : )
2 (
)
k2
k3
1 + k! .
00
k=O
(a)
(b)
(c)
4.
k =l
(akka).
i( )
i C10 )
k( lo k)a
k)a
(a)
(b)
(c)
k=I
( (-l)k21tk).
0 (
(
).
)
211<k+i>J .
3.l!
5.
summation formula.
fG
k =I
6.
log
(I+ l/k )
If
(wk ) so that
is absolutely convergent.
7.
8.
12 of Section 3.1.
b k = (-l )k a k and
IO.
(
:i (
k =O
(Vk+i - vk) .
(a)
L
k =l
<3-11k [211k
2 11< k + l ) J) .
Prove the
co
co
n
lanl11 = p
<
1, there is an N so that
Is - <T(a)nl
<
n
r +l/(l - r).
13.
that
Suppose limn-
Vr for which
p <
co
lan+ifanl = p
<
1,
Is - <T(a)nl
14.
Suppose limn-co
Show that
Vr for which
<
lanl
Is - <T(a)nl
15.
<
3N so that
<
n
r-
--
lan+1I
Dirichlet's
test.
Is - <T(ab)nl .
16.
Suppose
If
(-l)kak
(-l)kak an+1.
k=l
(ak/mk) ,
is convergent to a number in
ak
(0, m - 1),
k=l
ak /mk (m -
1) L l/mk
co
k=l
1.
co
k=l
ak/mk.
3.3
so that
(3.3.1)
In this way we can prove by induction that there exists a unique sequence
(ak) so that Vn EN, (3.3.1) is satisfied and Vj E (1, n) if c E (O, m- I)
& c >a;, then
kEAj
oo
ak/mk L ak/mk = b a.
k=l
k=l
;;;,: p
w e have l/mn -1 a
n-1
k=l
b and consequently
Thus, because of the way (ak) was defined, we must have an= m-1.
Let q be the smallest integer so that n ;;;,: q :::} an m-1. Clearly q > l,
for otherwise, using the formula for the sum of a geometric series, we
get
=
b=
(m-1)/mk=
1;;;,:
a,
k=l
k=q
it follows that
b
q 1
=
k=l
q-2
(m - l)/mk
l/m"-1,
ak /mk + l/m"-1
duimal expansion
of a number in
(ak)
[O, l]
in suc
cession to get
must be specified.
a=
n-1
k=l
k=n+l
(m - l)/mk.
ak= 0
for all
terminating decimals.
These rationals are the only reals that do not have unique series repre
sentations, and as a matter of fact they have exactly two representations.
We shall now prove these statements.
Proof.
Suppose we have
a=
If we set
ck= ak - bk ,
00
k=l
then
akfmk=
ickl .:;;; m
00
k=l
Let
00
k=l
bk/mk.
1 and
ck/mk = 0.
{k: ck
7':-
O},
icnl/m n=
lk+i ck/mk l
00
.:;;;
k=n+l
lck l /mk
00
.:;;; (m
Thus we must have
lcnl =
m.
Since
ak
and
bk
1)
k=n+l
l/mk= l/mn.
=1
are in
3.5
ak = m
and bk =
0,
or V k > n, ak =
0,
DECIMAL EXPANSIONS I Ul
and bk = m
1.
These cases
{k: ck
=fa
O}
We can use the series or decimal representation to show that the set
of numbers in
[O, I]
its range in
if the set
respectively.
Let us suppose that the set of sequences each having range in
( 0, m
1)
ak=
a\
m
1 ::::} a k
::::}
=fa
0,
k
a k = 0.
This sequence exists since m 2. Now the sequence (ak) has range in
( 0, m
1)
[O, I]
[O, l]
is an example of
[O, l]
by which this was shown to be true is, for obvious reasons, called Cantor's
Theorem (Cantor).
CANTOR'S SET
[O, l].
[O, l]
that can be
2/3k
k=;+l
we see that
lf3i'
>
j so that
(3.3.2)
b= :L bkf3k + 2w.
k=l
b is in the complement of C. Also,
j, so that bk 0; otherwise we see from (3.3.2) that
"'
J-1
b= :L bkf3k + :L 2t3k,
k=l
k=;+l
3k
>
J-1
q/3i-l = :L bk/3k.
k=l
Since
<
(3.3.3)
(3.3.3) show that every bEcc is in an open interval
(p + 1)/3i [, where p is odd and p < 3;. We claim that
every open interval of this form is contained in cc. Indeed, let us write
The inequalities
p131= :L Pkf3k ,
k=I
Since p is odd,
<
(p + 1)/31 and
3 kE {I, j)
PkE{0,2).
so that
<
"'
a= :Lak/3k;
k=l
VkE{l,j), ak=Pk
{k: kE{l,j) & ak Pk}.
then
L Pk/3
k=l
T
Pr+ 1, then
k + l/3
r
"'
:L Pkf3k + :L 2/3k
k=r+l
k=l
j
"'
.;;;
k=l
T
.;;; L
k=l
Pr - 1, then
r
Pk/3k - l/3 +
oo
k=r+l
2/3k
k
Pk/3 .;;; PW'
Pk
C is closed.
Actually, the last two paragraphs prove more than we have stated.
Namely, the inequalities
cc
where
q/3n-1
n-1
=
:L
k=l
qkf3k,
qkE{0,2} .
Further, the proof of the last paragraph shows that these intervals are
pairwise disjoint. For suppose a Elp,k n
(q,n).
n-1
=
q;/31 + l/3n +
P1 /3
1=1
k-1
1=1
co
J=n+I
l /3k +
00
j=k+I
a1f3i
a1/31.
qk
1,
q1 for j E ( 1, n - 1), which
If k < n, the proof in the last paragraph leads to the fact that
which is a contradiction. If k
n,
then p1
n=I
Every point of the Cantor set is an accumulation point of C. A closed set
with this property is called a perfect set. The proof is very easy. Suppose
a EC and
akE{O,2},
where
ak
0 for
an infinite number of
a,
a.
If, on
ak
then for
>
{O, 2},
j, the numbers
a, and converge
The Cantor set is uncountable. The proof
to
a.
ak=<l>(k)
and define
ak=
0::::}a\=2,
2=}akk=O .
(ak)
in its ternary
] 1/3, 2/3[,
1
9
2
9
2
3
1
3
7
9
1].
8
9
The interval
FIGURE 3.3. 1
/0,2 is
] 1/9, 2/9[,
)7/9, 8/9[.
These intervals
3.3
O Exercises
1.
If
m,p, n E N, m
write
p /mn
2.
If
>
p,m E N, m
1,
k
Pk/m ,
>
1,
k=l
p < mn,
and
pk E ( 0, m
1 ).
in the form
Pk E (O,m-1), Pn- 0.
(Hint:
Exercise
may be helpful.)
ak E (O,m-1).
Show that
4.
is rational{:::::>
Suppose that
(ak)
is eventually periodic.
k=I
if and only if A=
5.
k
ak/m <
00
L bk/mk
k=l
{k: ak-bk}- 0
and
an< b n,
where
=min A.
is also uncountable.
6.
7.
If x
EC
[O, l] is uncountable.
and
f(x) =
Show that (J) =
[O, l]
k=l
and that
k
xk/2 +i.
junction with Exercise 5 this provides another proof of the fact that C
is uncountable.
8.
[O, l].
f
C
9.
[O, I],
function.
Cantor's
let us set
n
and
n=
oo
x E C.
if
n(x)
Then
F(x)
3.4
n-1
=
k=l
min { k:
xk
xk /2k+i
l},
l/2n.
3.4.1 Definition. Let .t;t be the collection of all functions each having
domain and range in R. A function sequence is a function with domain N 0
and range in .f;i .
As in the situation for a real sequence, we shall designate a function
sequence by
(Jn).
IJn(x) - g(x) I
<
E.
3.4
Un(x))
is independent of
x.
The
3.4.4
fn
Suppose
Definition.
lfn(x)-fm(x)I
< e.
3.4.5
Theorem.
uniformly Cauchy.
Suppose
Proof.
Un)
(g) =.so
fn
having the
that Ve> 0, 3N
lfn(x) -g(x)I
< e/2.
lfn(x)-fm(x)I
lfn(x)- g(x)I
lg(x)-fm(x)I
< e.
Conversely, suppose
real sequence
which we
'.
Vn,m;;.:N,
lfn(x) - fm(x)I
For each
x, fn(x) g(x),
< e/2.
3.4.6
to
g and
Theorem.
g uniformly.
If
Un)
Proof.
Since
(Jn)
e/2 < e.
g is
continuous at a.
lfn(x) -g(x)I
< e/3.
of
fno
at
Hence, if
al < 8
and
x E />,
at
a.
(Jn)
is uniformly conver
Vn E N0,fn
has domain
[O, I]
and is defined by
fn(x) = x n .
Then for
x E [O, I [,
x
0,
n- oo fn( )
lim fn(l) = 1.
n-oo
lim
I.
[O, I [
f = (Jn)
Then
n
u(J) n(x) = L fn(x )
k=O
is well defined on It>.
and if the limit exists at every point of the common domain of all the
fk,
,Lfk
k=O
3.4.8
a and
is
er
Corollary.
(i)
is
Un) is continuous at
continuous at a.
There is a very simple but very useful test for uniform convergence
of function series.
Proof.
Since for
>
m,
n
L lik(x )I
k=m+l
<T(M)
that
er(Iii)
and
erU) are
-1
lim
n-"'
xn
+ X2n
1 1/n
for which
< 1,
x2n, we have
-.
hm
n-"'
x2"
I --x"
1 +
1 1,.. 1
lxl'
lxl
UO I INFINITE SERIES
1 + x2n
1 + l/x2n
lxl
I1 nx2nl
:s;;
:s;;
p< 1
lxl
<
1.
we see that
n'
l/xn
1 + l/x2n
1 + x2n'
----=
so that
lxl
=Jc.
1.
If
xn
--o ,
+
x211
x.
0 Exercises
I.
2.
uniformly on R \ { 1,
-1} .
(a ) fn(x) = 1
[Hint:
(b )
fn(x)
(c)
fn(x)
nx
' x
JO, l[.
, x E [O, oo[.
xn
nx(I - x)n, x E [O, l].
1 +
n-1
(1-; ) n
3.
3 ! n2
( a)
(b )
4.
(n- l)(n-2)
( ! x2 ) , x ]-00,oo[.
(0 :x2)k ) x ]-oo,oo[.
k2
J-oo,oo[ .
3.5
[Hint:
5.
Vn
15
1.8).]
of Section
6.
Vn
vergent and
E N0,
Jn
(Jn)
is uniformly con
7.
and
gn
is bounded. Then
8.
9.
10.
(<Pn)
<Pn-1 (x)=
<=>
{O
.I
Let
(rk)
Vx
ER,
EN
co
Jn(x)=
Show that
Vn
E N 0,
Jn
L <Pn(X
k=O
rk)/2k.
is continuous, and
(Jn) does
Vx
ER,
Un(x))
is con
any interval in R.
3.5
INFINITE PRODUCTS
There exists a unique function '1T with domain the collection of all real se
quences and range in the collection of real sequences so that if a= (an) is a
sequence, then
7T(a0) = ao ,
7T(a) n+i = 7T(a)nan+t
Vn
N0
We shall not carry out the proof here, but shall note only that the use
of induction for this result is very much like that used in the proof of
Theorem 1.6.4. Actually, the existence of the
function discussed at
are all special cases of a very general logical theorem on the possibility
of
or
recursive
inductive definition.
n ak
k=O
7T(a)n
{ak: k
for
m.
If
k=m
ak
7T(b)n-m
TI
qEA
(O, n)
n+
aa
Il a<f>(k)
k=O
The last definition, of course, raises the question whether or not we have
given a definition that is independent of the function <I>. We think that
the reader can easily establish by use of the commutativity property
of the product of two real numbers and the axiom of induction that
for every one-to-one function <I> with domain and range
( 0, n), we have
S.5
INFINITE PRODUCTS
I 13!
The reason for excluding zero factors and a zero limit when speak
ing about the convergence of products is that it leads to neater criteria
for deciding when a product converges. This will be borne out by fol
lowing results. An infinite product will often be designated by
Thus we have
But
7T(a)n
7T(a)m-1
l 1
7T(a)n - 7T(a)m-1
l =
7T(a) n
7T(a)m-1
--
7T(a)m-1
IJn
k=m
ak
'
<
E.
(3.5.2)
Conversely, suppose
If we fix m
;.:;.
N, we see that
Pi = lim 7T ( a ) n ,
n-oo
From
(3.5.2)
-E
<
( i
- 1
7T a ) m-1
j= 1, 2.
E,
<
<
1,
2.4.3(c)
Pt
m=oo 7T ( a ) m-1
lim
m-oo
we get
I in
P1
= P1
7T(a ) m-1 p/
and thus
-E
Since this is true, VE > 0.
Pt
- 1
P2
<
p1 = p2
=;i=
<
E.
From the last theorem it follows that a necessary condition that the
nk;;.o (ak)
infinite product
converges is that
as
ak - 0
as k
ak = 1 + ak
k-
oo.
oo.
In {I+ f3d- l1 n {l
:o;;;
({3k)
N0,
lf3kl}-l.
n = j,
and V n E
n = 0.
If it is true for
then since
i+t
k=O
k=O
k=O
{l +
f3k}'
3.5
Consequently, we have
0, 3N so that n
hand, if Vk E
N0, ak
TI
0,
I
let us write
{I+ ak}
k=O
fI { 1 + -2.J_
}
I+ ak
k=o
ak 0, 3K
so that k
ft{l
K => 0
<
lakl} - I
it {I+ l:klJ- 1 .
K,
m and n are
Ih,.0(1 + lakl)
ak
Ilk,.0 {I + ak}
lish that Vk E
N0, 3f3k
with
log(l +
Thus, if
k;.o (lakl)
1/(1 + Jakl)
lakl)
f3k
1, so that
lak l.
f3k lakl
!i..?! log TI
k=O
{1 + lakl}
(3.5.3)
il
{l
+ lakl}
!i exp
1og
( I!
{l +
lakl}
Vk
;<!: K =>
that
lakl l. Thus Vk
;<!: K,
f3k
;<!:
;:;i!=K.
k=O
(log(l +
lakj)
D Exercises
I.
If
a2k
is conditionally convergent. Show that this product has the same value
as the absolutely convergent product
fl
k=2
( ;2)
-
kTI
=O
is uniformly convergent for
{l
+ ak(x)}
x in a bounded set in
x2
TI I - k2
k=I
}.
3.5
3.
N0,
infinite product
nk,,,O (I
ak)'
I +
ak
> 0,
is conditionally convergent.
6.
Suppose
the infinite
product
nk,,,O (I
ak)
is conditionally
CHAPTER
41
DIFFERENTIATION
4.1
4.1.1
E
x-a
exists. In case this limit exists, it is called the derivative of f at a and is denoted
by any one of the symbols 'f'(a)', 'df(a)/dx', or 'D f(a)'. If f is differentiable at
every point of its domain it is called differentiable.
The derivative of a function f is that function f' (or dj/dx or DJ) with domain
J(J') = {x:f'(x) exists} (possibly void) whose value at the point a E J(f')
is the derivative of f at a.
From a logical point of view the notation
DJ,
D as a function with
all real-valued functions each having domain
in R (including that function whose domain is the null set), and the
range of
D2 = D oD, D3 = D2 oD,
F with
F(l) = D ,
If we set
F(n+I) =F(n) oD .
of
f.
4.1.2
at a a
lll8
Definition.
E
4.1
A
a
E N0,
E N 0,
4.1.3
Theorem.
Proof.
&
By the definition of
LJ(J)
I J(x)x-- af(a)
J' (a)
<Ix
- al <S'
< 1.
IJ(x)-f(a) I
Now, for a given
e > 0,
take
< (I
+ I f' (a)I)Ix-al.
<min
(S', e/ (I
lf'(a)I)).
Then the
IJ(x)-f(a)I
provided
Ix-al < S
&
<e,
LJ(J) .
The converse of this theorem is not true. Namely, it is not true that
if a function is continuous at a point, then it is differentiable at the
point. The simplest example that illustrates this is given by the function
f (x)
This is continuous at
0,
lxl.
but
lim
x-o
E R+. We
shall suppose that all the standard additive and multiplicative proper
ties have been proved, as, for example,
140 I DIFFERENTIATION
loga xy
loga
loga
+ loga y,
y Ioga x =
loga x-1,
loga
X11
ea
of
2.3).
- loga x
=
Let
v =
xh
/ ; then ash \i 0,
loga
( l + ;)l/h.
(4.1.l)
lim
v-.,
(1 + .! )
(4.1.2)
'e.'
v--oo
( 1 .! )
+
= e.
Assuming, for the moment, that we have proved these facts, we have
from (
4.1. l)
loga
(4.1.2)
1
e1 x =
- loga
e.
l,
a0 =
n EN,
is monotone increasing and bounded above (see Exercise 8 of Section
1) .. n(n-1) l _!_
(l +.!.n)n l +!".n + n(2n!n2 . +
n!
nn
1 + 1 + _!_2! (1 .!.n)
+ 3\ (1-)(l-) +
! (1-) ... (1-n:l).
+
If we use
n+ l
instead of
n (4.1.3),
in
(4.1.3)
k
n,
,,;:;
4.1
(
l+
n+l
)n+1
has one more positive term than the corresponding expansion for
Hence
n.
+
(
(
1+
< 1+
n!1r I.
r
(4.1.3) that
(
1) n
I
I
l+<l+t+-++-
n
n!
2!
Since
k!
2 3
'
k ;??:
(4.1.4) we get
(4.1.4)
2k-I,
(
I )n
1
1
l+<l+l+-++
n
2
2n-1
1-(1/2)"
l+l-(1/2) 3 .
(an) is monotone and bounded and its limit exists. We denote the
n-oo
lim
I "
(
1+- .
n
n v n+ I ; then
(Why?)
Now,
n-oo
lim
lim
11-""
1)n + 1
1+ n
I+
)"
n+l
--
n-oo
lim
n-oo
lim
n-oo
lim
(
(
I+
I+
1)"
n
-
n-oo
lim
)"+1
n+l
--
(
1 + ! ".
n
1)
n
I +-
n-oo
lim
n-oo
lim
1 )"
I +-
1 )-1
1+ -n+l
142 J DIFFERENTIATION
Therefore,
v
lim
V-+00
If
( + !)
1
lim
n-oo
1+
e.
0, then we have
h /'
lim
v- - oo
(+ )
1 v
lim
v-oo
lim
u-oo
(-)
(
( !u ) ( + !u )
1
and loge
1+
is called the
-v
function
( !)n
n
ex
lim
v-oo
1+
-1
--
e.
natural logarithm
of
exponential
and usually is
x'.
of Real Functions
Primer
(4.1.5)
To prove this we know that
with 0<
Ix - aJ < 8
Ve
> 0,
we have
e/2.
f(x) - f(a)
x-a
4.1
where
l11(x) I
e/2.
<
Thus, if t1
a,
we have
Hence we get
l[(t2) - f(t1)
- f' ( a)
I t2 - ti
<
e/2 .
x E (-1/2, 1/2],
g(x) = lxl,
f0(x) = g(x - k)
<===> x E
[k - 1/2, k + 1/2],
kE
Z.
l;
that is,
Vx E
R,
fo(x + 1)
Now, for
Vk E N0
and
Vx E
fo(x).
R let us set
fk(x) =fo(2kx).
Each function
VxE
R,
fk
l /2k;
Figure 4. 1.1
that is,
144 I DIFFERENTIATION
fk (x) = 2kx-p,
fk(x) =-2kx + p + 1.
We now set
00
f(x) = L fk(x)/2k .
k=O
By the Weierstrass M Test this is a uniformly convergent series of con
tinuous functions and hence f is continuous. Let a E R, S > 0, l/2n < S,
and m be that integer so that m/2n a < (m + 1) /2n. Set b1 m/2n,
b2 = (m + l )/2n, and b = (b1 + b2)/2. If k > n, then sincefk has period
l/2k, we have f (b1) =f (b) =fk(b2). If k < n, let p be that integer so
k
k
that p/2k b1 < (p + 1) /2k. Then, of course, p/2k < b< (p + 1) /2k and
p/2k < b2 (p + l)/2k. If pis even we have, for j E (1, 2),
=
or k <
n,
we have
0.
Hence we have
f(b2)
b2
f(b1)
b1
f(b;) - f(b)
b; -b
=
b2 -b1
b; - b
4.2
2.
(a)
(b)
if
g(x)
3.
Vx
(bn)
Suppose
4.
bn = 1+1
Show that
bn
- e.
1
+ f"
= 1 + I + ...!._
2!
1m
[Hi nt:
For
+ ...!._
n!
( ; r=bnm
1+
Then bnm
5.
- bn
(O!=l) .
n.
E N0 is given by
and
Tnm - Tn;;;. 0
as
;;;. n,
m -
- 1m
that is
set
),
nm.
oo. ]
6.
Let us set
f(x)
-1/x
oF-
0,
0.
0,
Show thatj<n>(o) exists and i s zero
Vn
= g(O)
4.2
DIFFERENTIATION RULES
146 I DIFFERENTIATION
4.2.1
Theorem. If f and g are both differentiable at a, and a is an
accumulation point of J:>(f ) n J:>(g), then f+ g, Jg, and, if g(a) "" 0, l/g
are differentiable at a. Moreover,
(f+g)'(a)=J '(a)+g'(a).
(Jg) '(a)=f(a)g'(a)+f '(a)g(a).
(I/g)'(a) =-g'(a)/g(a)2
(a)
(b)
(c)
Proof.
a+ h
J:>(f )
J:>(g).
h is
(Jg)(a+h)- (Jg)(a)
h
=f(a+h)
the fact that
g(a+
g(a)
+g(a)
f(a+h -f(a)
'
product is the sum of the limits and the product of the limits, respect
ively.
Formula (c) is a consequence of the equation
(I/g)(a+h)- (I/g)(a)
g(a+h)- g(a)
-1
- g(a h)g(a)
h
h
+
_
4.2.2
C
(fog)'(a)=f '(g(a))g'(a).
Proof.
(f
g)(a+h)- (f g)(a)
h
0
4.2
above equality we should get the formula stated in the theorem. The
g(a + h) - g(a)
could
in every neighborhood of
g(a + h)-g(a).
ferent way.
f is differentiable
81 & y E .B(J ) ==>
Since
<
at
>
so that
IY - g(a)I
IJ(y)-f(g(a))-j'(g(a))(y-g(a))I:;;; e,ly-g(a)I.
Also, since g is differentiable at
Hence, if
so that
(4.2.1)
Ix-al
lg(x)-g(a)-g'(a)(x-a)I:;;; e,lx-al.
Ix-al
<
8&x
.B(g),
<
(4.2.2)
IJ(g(x))-f(g(a))-f'(g(a))g'(a)(x-a)I
:;;; IJ(g(x))-f(g(a))-f'(g(a))(g(x)-g(a))I
(4.2.3)
lg(x)-g(a)I
and a
corresponding
(4.2.4)
x =;6 a, we can divide both sides of (4.2.4) by Ix - al, which shows that
f 0 g has a derivative at a and indeed the derivative at that point is given
If
(j0g)'(a)/f'(g(a)).
>
0 so that Ix - al
<
81
(4.2.5)
148 I DIFFERENTIATION
gx
E1IY - g(a)I .
(4.2.6)
Ix
IJ(g(x))-f(g(a))-f'(g(a))(g(x) - g(a))I
eiJg(x)- g(a)I.
,
(4.2_6 )
Let us put 8 =min (8i. 83) and m = I/If'(g(a))I If we use (4.2.5) and
(4.2.6'), then Vx E Je(g) with Ix-al < 8, we get
lg(x) -g(a)-Cfoc:l)
I
E1 m{lg(x)-g(a) I+ Ix-a}
l .
(x-a)
(4.2.7)
Now set M= I (f0g)' (a)I and use the triangle inequality on (4.27) to
get
(I - me1)
lg(x) -g(a)I
m(E1+ M) Ix.:._ al .
lg(x)-g(a)I
(I+ 2Mm) Ix - al .
(fog)'(a)
lg(x)-g(a)- f'
(x-a)I
(g(a))
e1 A Ix - al .
(4.2.8)
(f g)'(a)
< E
f'(g(a))
0
(4.2.9)
REMARKS.
Ix
D Exercises
4.3
p(x)
k=O
akxk,
is differentiable.
Use Theorem 4.2.3 to solve Exercise 1 of Section 4.1.
2.
g'(a) = (f g)'(a)/J'(g(a)).
0
h<n>(x) =
n
) pn-k>(x)g/-l<>(x)'
(
k=O
where
4.3
All the mean value theorems of the differential calculus are based on
two principles: (a) a continuous function on a compact set assumes a
maximum and a minimum, and (b) if a differentiable function is defined
in an open interval about a point where it has a local maximum or mini
mum, then the derivative must be zero at this local maximum or
minimum.
150 I DIFFERENTIATION
4.3.1
f(c + h - f(c)
{;;;:.:
if h > 0 ,
.;;; 0
if h< 0 .
c.
Then
h - 0,
must be zero.
4.3.2
Rolle's Theorem.
Since
[a, b],
it has a
=0
by Theorem 4.3.1,
f'(c) = 0.
f(b) - f(a)
= f'(c) .
b-a
Proof.
says that there is a point on the graph off where the tangent line to
(a, f(a))
to
f(b)-f(a)
y=f(a) +
(x-a).
b-a
y
Figure 4.3. 1
4.!
F(x)
f(b) - f(a) x a
f(a) + b-a
( - )- f(x).
F'(c)
f(b)- (a)
c
f
b-a - f'( )
differentiable on
0.
4.3.4
<
g'(c)[J(b)- f(a)]
Proof.
F(x)
[a, b], is
f'(c)[g(b)- g(a)].
Set
through by the latter factor. Now Fis continuous on [a, b], differentiable
on
F'(c)
0,
and
(x,f(x))
x,
f (x) )
(a, f (a))
Figure 4.3.2
(b, f (b))
152 I DIFFERENTIATION
F(x)
f(x)
J(a)
J(b)
Now, if f is continuous on
[a, b]
Clearly,
theorem
3c
]a, b[
F.
so that
x
a
b
I
I
and differentiable on
F' (c)
F(a)
F(b)
]a, b[,
the same
0, so that by Rolle's
0. The derivative
F' (x) is
obtained
and set the resulting determinant to zero, we get the Mean Value
Theorem.
This procedure can be generalized in the following way. Suppose
f, g,
and
are continuous on
F(x)
Then
F
F(b)
F' (x) is
and differentiable on
g(x)
g(a)
g(b)
f(x)
f(a)
J(b)
is continuous on
[a, b]
h(x)
h(a)
h(b)
3c
]a, b[
Set
differentiable on
[a, b],
0. By Rolle's theorem,
]a, b[.
so that
Thus we get
F'(c)
Note that if
h(x) =
h' (c)
h(a)
h(b)
g' (c)
g(a)
g(b)
f'(c)
f(a)
f(b)
0.
[a, b]
and for
and
x+ h
in
[a, b]
61,f(x)
If x+2h is
6h2f(x) and
also in
[a,b],
is
is a function defined on
J(x+h) - f(x).
then
we have
6h2f(x)
If n
we define
6h(6,,J(x))
x+ nh
[a, b],
6h"f(x)
we define
6hnf(x)
6h(6hn-1J(x)).
Proof.
Let
P(n) should
P(n)
4.3
[a, b]
and
7'=
The statement
P(l)
P(n- l)
g(x)
n> l
is true for
=
and set
6hf(x) .
h
[a, b- h] and is n- l
[a, b], then x + (n- l)h
P(n- l) to g and find a
[a, b - h].
81
so that
]a, b- h[.
If
x + nh
0 < 81 < 1.
Now,
1
g<n- >(x + (n - l) 81h)
=
Apply the Mean Value Theorem to the right side and we find a
82
so
that
0 < 82 < l.
If we set 8
since
6hn-l(6_hf(x)/h)
8 < 1.
L,_hnf(x)/h ,
(4.3.l)
Further,
(4.3.2)
6hnf(x)
If
h < 0,
pn>(x + n8h)hn.
(4.3.3)
x-a
(//g)(x)
Jim /( x ) / lim
x-a
x-a
g(x),
lim,,._ag(x)
7'=
154 I DIFFERENTIATION
oo
and it may
oo
4.3.6
X?b f
lim
(a)
or if
lim g(x) =
x
?b
(b)
oo,
and if
lim
.r?b
{f'/g')(x) = l,
then
lim {f/g)(x)
.r ?b
Proof.
l.
(a) Let us first give the proofs under the supposition that
-/ <
/ =
.r? b
E.
limx?bg(x) =
oo.
Let us set
(h'/g')(x) = 0 .
Now, since
>.
x < g < b.
<
2'
c < g < x.
x < b =>g(x)
4.3
Also, since
then 0 <
]d, b[
we have
lh(x)l
g(x)
Since
lh(c)/g(x)I
<
e/2
ford<
x < b,
<
lh(x) -g{c)I
g(x) -g(c)
for these
{x) I
lhg(x)
Ig(x) l
= M- z <
<
x we get
e.
reader.
g(x)
-oo
as
x b.
h(x)
cos x
.
Sill
# 0.
D sin x =cos x,
and
x = 1, limx-o sin x = 0.
All the conditions of L'Hospital's
D cos x =-sin x.
1.Im
x-o
h (x )
sm
l"
=Im--=
0
x-o cos x
rule more than once in a given situation. For example, consider the
function given by
f(x)
g(x)
smx-x
x2
# 0.
Then we have
.Jil. cos x - I
g' (x) - 2x
_
f' /g'
to get
0 as x
l(x) 1. -sin
.
IIm = Im
x-o g(x)
x-o
2
--
= 0.
are differentiable at a point, then the sum of the two functions is also
differentiable at that point and the derivative of the sum is the sum of
the derivatives. By the principle of induction this result can be stated
for any finite number of functions. Under suitable hypotheses the result
156 I DIFFERENTIATION
will extend for an infinite sum of functions. This is what we now wish
to establish.
Proof.
we get
is between x and
c.
E N0
Hence we get
and
x.
If
= (z - x)[f',. (0
z -x of:- 0,
z-x
- f 'm (')] ,
f:,.(,) j.
3N so
that
Vz
If we let
oo
z-x
J(z) -f(x )
z-x
m > N so
fm(z) - fm(x)
z-x
that
< .
3
in this last inequality and use the fact that the absolute
Choose a fixed
m, 38 > 0
so that 0 <
l fm(Z:-=-!m(x) _ j:,.(x) I
<
]a, b[ =>
4.3
From the last three inequalities and the use of the triangle inequality
lz - xi
-f(x)
lf(z)zx g(x) I
f' (x)
f'(x) g(x).
If V n
fn is defined and differentiable on ]a, b[,
if k:o U'n)
uniformly convergent on ]a, b[ and 3c
]a, b[ so that
k:o Un(c)) is convergent, then k:o Un)
uniformly convergent and
moreover
(Jn)' 0f'n
we find that if
]a, b[ &
< S, then
0 <
<
E.
exists and
=
4.3.8
Corollary.
E N0,
is
is
D Exercises
f
Vx,y .e(J), IJ(x) -J(y) I Mix - YI f
f'
f
[a, b]
Vx
A function
1.
If
If
is differentiable on
and
> 0 so that
has domain
2.
[a, b],
[a, b]
and
is Lipschitz.
f'(x)
0, show
3.
4.
Assume that
domain
.e(J).
]a, b[, g
f
(xn)
limn-oof(xn)
5.
Suppose
that
exists.
Xn
c E ]a, b[
be f'(c).
and for
7.
(g)
Suppose that
(x)
f'
is a differentiable function on
must
and
g.
bounded. If
6.
]c, d[,
Use the Mean Value Theorem to derive the chain rule for differ
]a, b[
[a, b]
and that
F(a)
lim
x'>a
f(x) - f(a) ,
x-a
F(b)
lim
X/'b
f(x) -J(b)
x- b
158 I DIFFERENTIATION
8.
9.
10.
.
ex - I x-o
x2
hm
lim
x-o
log(l - x) +x.
x2
cos x
.
1im
I + x2/2
x4
x-o
4.4
f(x) =ao+a1(x-a)
+an_1(x-a)11-1
that
J<k>(a)
ak=,
kE (O,n-1).
n-1 j<k>(a)
k=O
k-!
(x - a)k + R,.(x, a) ,
-
Rn
is
due to Taylor himself. The method for obtaining expressions for the
remainder is an application of Rolle's theorem.
Theorem. t
is
has
<I>
<
<
r
...t.
(4.4.l )
Proof.
on
If
is continuous on
and differentiable
0.
1-
Clearly
(4.4.2)
is continuous on
is given by
(4.4.3)
Further,
0.
(4.4.4)
Using (4.4.3) and (4.4.4) in (4.4.2) and solving the resulting equation
for
to
160 I DIFFERENTIATION
Rn(x,a)=
(b)
<l>(x)-<l>(a) p n>(c)
(x-c)n-1,
<l>'(c)
(n-1) !
(c)
pn>(c)
1 ! (x-a)P(x-c)n-v,
)
p{n-
a< c< x.
(d)
a< c< x.
a< c< x.
Proof.
a< c< x.
p
set = n
p
and
n, in part
expand around the right end point b. In the future we shall use this fact
without comment even though we refer to Theorem 4.4.1 or Corol
lary 4.4.2.
ex=
n-1 xk
ec
L-+x n'
!
n!
kO
If lxl b, then
4.4
Since
bn/n! 0
161
[-b,b].
Thus
.,
ex=:Lxk/k!,
k=O
R.
The proof we have given above to show that the exponential func
tion is the sum of an infinite series leads immediately to a more general
result: If a function f is defined and infinitely differentiable in an open
interval
I(a)
compact subinterval of
I(a)
a,
and if
(J'kl)
f(x) =
.,
k=O
r<k r a \
(x-a)k,
k
>
I(a). Indeed,
M= sup{IJ'k>(x)I: x E j
suppose
lpn>(c)
let
IRn(x,a)I= -1
- (x-a)n
n.
This estimate on
Rn
bn
.;;M-,.
n.
-m
,,;_:;
pn>(x) .
remainder,
Vx E [a,b[, we
f (b)
where
<">
(x),,;;;
<
<
n! M
(b- x)n
m= 0.
get
J<k>(x)
J<n+ll(c)
_ (b-x)k+
(b - x)n+l ,
k!
(n + I)!
n
b.
__
16l! I DIFFERENTIATION
must have
pn> x)
(b -x)n f(b).
n.
In case
g(x)
f(x) +mer-a,
[a,b].
Then we have
g<n>(x)
Thus
Vx
pn>(x) +mer-a -m + m=
0.
[a,b[,
pn>(x) n!g(b)/(b-x)n.
4.4.4
is
infi
I(c)
-m pn>(x).
If Ve E [a,b[, we set I(c)= {x: Ix-cl< b-c}, then Vx
n [a,b[,
f(x)=
j<k>(c)
k=O n.
co
--1 (x-c)k.
Ind e ed, the infinite seri es on the right converges uniformly in every compact
set in I(c) .
Proof.
By hypothesis
n!/(b - a)n
3M
pn>(x)
oo
as
oo,
and
Vx
n! M .
1pn>(x)I (b -x)n
Let us use the Cauchy form of the remainder in Taylor's formula:
pn>(O
R (x,c)=
(x-c)(x-On-1
n
(n-I)!
Using the above estimate for
IJ<n>({)I.
I Rn(x, c) I nM
If
<
<
b, then c
<
<
<
Jc,x[.
x-c
{
Ib
l 1 x-{ 1 n-1
b_{
we get
with values
(x-{)/(b - {) cannot
{, when restricted
to
[c, x],
must. take its minimum and maximum value at the end points
x and consequently
c. We get the same result if x < < c;
Ix - Wlb-I is taken at= c. If c d < b,
-d-c
b-c
p=
then for
Ix-cl d -c we
< l;
have
Thus for
Ix-cl d-c.
--'>
0 as n
c) I ,,,:::.
--'> oo,
and
Vn EN,
d -c n 1
n - .
b d p
_
c)
--'>
0 uniformly for
f(x)
"'
<k>( a
k=O
k.
)
:L J
-,- - (x-a)k,
the neighborhood of a point it does not necessarily mean that the func
f(x)
e-1/x '
0,
=I=
0,
0,
Vn E N0, pn>(O)
0 (see Exercise 6 of
164 I DIFFERENTIATION
0 Exercises
1.
Do Exercise 10 of Section 4.3 by using an appropriate form of
Taylor's remainder formula and assuming thatj<2> is continuous .
2.
f(x) =1
1
-
x.
Show that f is analytic at zero and that its Taylor expansion at zero
converges uniformly on every compact subset of ]-1, 1[.
3. Generalizing the considerations of Exercise 2, letfa be the func
tion with domain ]-1, 1 [ defined by
a
ER.
Show that fa is analytic at zero and its Taylor expansion at zero con
verges uniformly on every compact subset of ]-1, 1[.
4.
e= L llk'
k=O
l/k! + Rn+1
k=O
Show that Rn+i < I/n!n. This estimate for Rn+i shows that e is irrational.
Indeed , if e is rational it can be written e = p/q, p, q E N and
q
L
k=O
11k1 + lfq!q.
k=O
4.5
]
[ f(x) - n-1
f'(kl f(n\)
(x- a)k gin>(c)
k
[ g(x) - k=O
'f gk< >a)(x - a)k ] pn>(c).
.
By choosing g to be a suitable function, obtain the Taylor formula with
Lagrange remainder.
Give an example of a function
8.
f(
x) =
J<kl ( 0) k
x
k!
'
9.
<
[O, a[,
0.
]a, b [.
f is analytic at
0, 3/(c), and 3M so that Vk E N0 and Vx
]a, b [
I(c),
is that
3p
>
jj<k>(x)j
:s;;
k!M/pk.
IO.
at a. Show that
J is
4.5
a and is analytic
a. (Hint: Use
POWER SERIES
We have seen in the previous section that some (but not all) infinitely
differentiable functions may be analytic at a given point; that is, they
may be represented in a neighborhood of the given point by their Taylor
a =0
about
k=O
( c k (x- a)k ) .
converges at
a,
analytic function?
a,
of an
series converges is given by the Cauchy root test. For every sequence
(4.5.1)
166 J DIFFERENTIATION
4.5.1
oo
Theorem (Cauchy-Hadamard).
00
L (ck(x - a)k)
k=O
is uniformly absolutely convergent on any compact subset of the interval
]a - r, a+ r[ and diverges on the complement of [a - r, a+ r], where r
is given by (4.5.1).
Proof.
From the Cauchy root test the power series will converge
for which
lim
k-oo
and will diverge for every
for which
lim
k-
icki1'klx - al < 1
00
lckl1'klx - al > 1.
Indeed, the proof of the Cauchy root test shows that the absolute con
vergence is uniform on every compact subinterval of
]a - r, a+ r[.
k..olc kl Ix - al k
s < r,
then
[a - s, a+ s].
The number
]a - r, a+ r[ is
called the
4.5.2 Theorem. If the power series k..o (ck(x - a)k) has a nonzero
radius of convergence r , then the function f defined on ]a - r, a+ r[ by
00
f' (x)
L kck(x - a)k-l,
k=l
where the latter series also has the radius of convergence r.
Proof.
tion 2.4)
4.5
it follows that both of the series above have the same radius of con
vergence.
If we set
fk
k=O
(fk')
Corollary.
00
f (x)
then Vn
k=O
ck (x-a)k,
]a - r, a+ r[,
E N0,
Cn
Proof.
pn>(x)
x
pn>(a)/n!
Vn
Setting
E N0,
00
L k(k - 1)
k=n
(k- n
+ I)c k(x-a)k-n.
Cn .
n=O
(anxn )
x,
we get
00
00
( bnxn) = L (cnxn),
n=O
n=O
where
Cn
2 akbn-k
k=O
Cauchy product
of the series
on the left. The natural question to ask concerns the value of the radius
of convergence of the Cauchy product in relation to the radii of con
vergence of the series that make up the product. We shall prove a
theorem of this nature for series of constant terms that will immediately
answer this question for power series.
168 I DIFFERENTIATION
n
L a kbn-k
k=O
(4.5.2)
Proof.
domain of
Let us set
J(n,k)
akbn-k
fork E
(O, n)
and
N0 The
Cn
is given by (4.5.2),
m [n
o f(n,k)
Cn=
m 1
1 -
D=ID
-l+tL
i ++- 1
1--c. ---+-
, =t=f+t
Figure 4.5.1
What we are doing is adding together all the values that f takes on the
triangle of lattice points shown in the figure. Note we are first adding
the terms along the columns and then adding the resulting numbers.
We get the same result if we first add the terms along the rows and
then add the resulting numbers. Thus we get
4.5
n=k
n=k
and thus
m
Cn =
n =O
Let us set A= li m n - oo
L ak<T(b)m-k
k=O
<T
(a) n
and
B= lim n - oo <T ( b) n .
Then we may
write
m
Cn
-AB=
n=O
oo
k=O
k=O
Let
k=O
oo
ak
[ <T ( b) m-k - B]
<T
( lal)
B L
k=m+l
and
<T
(b) ,
ak.
(4.5.3)
which exists by
Ve> 0, 3m0
E N
<
e/2M,
00
k=mo+l
If we take
m 2m0
lakl
<
lo
:s;;
e /6M.
ak i
+l
k=O
Note that
:s;;
k=mo+l
oo
m+l
vergence and if that open interval is not the null set, the power series
converges to a function that is analytic. Now, the reader can easily
show that nothing can be said about the convergence of a power series
at the end points of its interval of convergence. In other words, exam-
170 I DIFFERENTIATION
pies of power series can be given which converge at both end points
of the interval of convergence, at neither end point, or at only one end
point.
There is another question about the end points of the interval of
convergence of a power series which is well illustrated by the following
example. It is very easy to show that for
lxl
<
1,
xn
(-l)n+1 -,
n
n=J
ao
log(l
+ x) =
1 the
]-1, 1[ .
on the left has the value log 2. The question is whether these two
quantities are equal. The answer is given by the following theorem due
to N. Abel.
4.5.7
Theorem (Abel).
ao
f(x) =
k=O
akxk,
lxl
<
1,
J( l-)
:L
k=O
a k.
Proof.
n
akxk= u(a) nX"+i - L u(ah[xk+i - xk]
k=O
k=O
n
= u(a)nx n+I+ (1-x) L u(a)kxk.
k=O
Noting that
jxj
<
f(x)
Let
A be
the limit of
u(a).
(1- x)
k=O
ao
:L
k=O
we get
u(ahxk.
oo
lxl
<
1,
xk,
we get
<
00
[u(ah- A]x k.
k=O
ju(ah-AI
1,
00
(1- x)
k=n
ju(ah-Ajxk
:s:::
e/2.
4.5
;;,, N,
(I - x)
is less than
n-1
L [cr(ah-A]xk
k=O
of Abel's theorem.
The converse of Abel's theorem is, in general, not true. That is to
say, suppose
f(x)
L akxk,
00
lxl
k=O
<
1,
L (-l)kxk,
1
1+X
00
-- =
k=O
1/2 as x -
l, but
lxl
1.
<
k..o (-l )k
is not convergent.
Tauberian theorems.
4.5.8
Lemma (Cesaro).
Cn
Proof.
of Sec
If
=
n
1
s l
L
n + k=O k S.
We may write
"
1
L [s
n+ l k=O k -S] .
that k;;,, K =::::} lsk -SI < e/2.
Cn
Now
13
1. 7).
Ve> 0, 3K
n;;,, m,
so
get for
h-SI
-S
l
n+ 1
m-1
4.5.9
e/2
e/2
Theorem (Tauber).
n.
<
1,
If
L akxk,
00
lxl
k=O
and ifkak - 0 ask - oo and J( 1-) exists, then
J(x)
0 lsk-SI + n+ 1 k l sk-SI.
k..o ak.
172 I DIFFERENTIATION
Proof.
lxl
If
1,
<
we can write
n
n
L ak-f(l-)=J(x)-J(l-)+ Lak(l-x k)
k=O
k=O
(4.5.4)
Now, for
>
1,
1 -xk
and .since
x
J l
<
1 we
k-1
( 1 -x) Lxi,
j=O
get
1-xk k (I -x).
Using this in (4.5.4) we get, for 0 x <
I ak- f(l-) 1
1,
Let us set
Xn
(I
x) L klak l + L J akJxk .
k=n+l
k=O
(4.5.5)
1 - 1/ ( n + 1) ; then, by hypothesis,
klakl
as
n - oo.
0, we get
(4.5.6)
(4.5.7)
Also, since
kak -
0,
Ve> 0, 3N so that
,.,;:: _
.c:::
;=: N
L.J
+ 1 k=o
Xn
l akl
e
1
--.c:::
e
+ 1 1-xn
(4.5.8)
--
<
Xn
replacing x, we
4.5
(4.5.9)
If there exist such functions, it is clear from these equations that they
must be infinitely differentiable and moreover V k E N 0,
c<2k>(x) =(-l)kc(x) ,
Since
(4.5.10)
c<2k+ll(x) =(-l)k+is(x).
(4.5.10), for every compact set in R all the derivatives are uniformly
c(x) =c(O)
oo
k=O
oo
k=O
l)
(-l)k
x2k
oo
+ c(O)
(2k) !
x2k
(2k) !
s(O)
k=O
oo
k=O
x2k+1
( - l) k
(
(2k + 1) ! '
x2k+ 1
k
l)
(2k + 1) !
(4.5.11)
Hence, if there exist functions that satisfy (4.5.9), they must be of the
form (4.5.11). From Theorem 4.5.2 we may differentiate the series on
the right termwise, and it is a simple exercise to establish that these
functions actually satisfy the differential equations (4.5.9).
The equations (4.5.11) show that the solutions to (4.5.9) are not
uniquely determined. However, once s(O) and c(O) are specified, they
are uniquely determined. By taking s(O) = 0 and c(O) =1, we obtain
the trigonometric functions sine and cosine:
sm x =
cos x =
oo
k=O
oo
k=O
(-1) k
2k+I
X
(2k + 1) !
'
(4.5.12)
x2k .
( -O k
2k'
Cy(x)=
Cy
ferential equations (4.5.9) and moreover su(O) =siny, and cu(O) =cosy.
Hence from (4.5.11) and (4.5.12) we get
sin (x + y) = sin x cos y + cos x sin y ,
cos(x + y) =cos x cos y - sin x sin y .
(4.5.13)
174 I DIFFERENTIATION
(4.5.12)
From
-x,
x+
x=1 ,
sin2
(4.5.13) we
xi
Vx
Vx
E R,
!sin
xi
1,
(4.5.14)
ER.
1.
(4.5.14')
sin
cos
(4.5.15)
x =cos x
and indeed it is increasing in that interval around the origin for which
cos
x > 0.
Now,
22
cos 2=1 2!
24
4!
26+4k
""
[ 6 +14k)
(
4
(8 + 4k) !
and since
4
(8 + 4k) !
(6 + 4k) !
>
k;;;. 0,
is
an
even
function,
(4.5.14')
is [-1, l].
From the first formula of
(4.5.15)
ond formula of the same number we get cos 1T =-sin2( 7T/2) =-1.
Repeating this process with 27T in place of 1T, we find that cos 27T = 1,
sin 27T=0. If we use these facts in conjunction with the addition
formulas
(4.5.13),
cos
we find that
Vx
(x + 27T) =cos x,
ER.
x.
(4.5.16)
4.5
176 I DIFFERENTIATION
Theorem 4.2.3 tells us that D arc sinkx exists wherever cos (arc sinkx)
#- 0 and moreover
D(arc sinkx)
1
.
cos (arc sinkx)
Now,
cos2 (arc sinkx)
so that
cos2 (arc sinkx)= 1 -x2
For k odd, cosx is non positive forx E [(k - 1/2) 1T, (k + 1/2) 1T] , and for
k even, cosx is nonnegative on this interval. Hence
k
.
(-l)
D arc sinkx = ;,-o ,
lxl < 1 .
v 1-x2
k
(-l) +l
D arc coskx= . ;,-o .
v 1 -x2
lxl
<
1.
Once the sine and cosine have been defined, the other trigonometric
functions can be defined in terms of these. The most important other
trigonometric function is the tangent defined by the equation
tanx=
Sill X
cosx
--
#-
(2k + 1) 7T/2.
x,
and D tanx is never zero, it follows from Theorem 4.2.3 that arc tankx
has a derivative
D arc tankx=
1
D tan (arc tankx)
--------
y #-
(2k + l)TT/2.
4.5
thus
= I
Thus
D arc tank
x= 1 + 2
x
D Exercises
1.
Suppose that Vx in [a
00
k=O
Show that Vk E N0,
2.
ck=
B,
a+
BJ, B
> 0,
ck(x-a)k=O.
0.
f (i ! k x
k=O
(b)
k .
oo
k=O
00
(c)
3.
k=l
x2k+1
2k+ I
(kkxk).
Suppose that
3M so that Vk
)
)
E N0,
Jk(a) =ck.
4;
Suppose that
00
f(x) = L ckxk,
k=O
g(x)
k=O
k3ckxk?
178 I DIFFERENTIATION
5.
Suppose
]a - r, a+ r[, r
power
series
has
the
interval
of
convergence
6.
k;.o(ak)
k;.o(ck) is
and
con
k;.o(bk)
vergent, then
7.
Suppose that
f(x)
and VkE N0,
ak
L akxk,
00
k=O
lxl < l,
bounded.
8.
4.6
same
power series.
4.6
4.6.1
domain
Theorem.
N0 X
Bn(J,x) =
then Bn(J,x)
Proof.
L f(k/n)An(k,x).
k=O
f(x) uniformly in x as n
oo.
f(x)
L f(x)A,.(k,x).
k=O
Bn(J,x)- f(x) =
If we use the fact that
L [f(k/n)- f(x)]A,.(k,x).
k=O
An(k,x)
IBn(J,x) - f(x)I
we find that
L lf(k/n)- f(x)IAn(k,x)
k=O
IBn(J,x)- f(x)I
1x-k/nl<6
lx
nl ;;.6
k/
L
lx-k/nl<6
An(k,x)
2 L.J An(k,x) = 2
k=O
(4.6.1)
180 I DIFFERENTIATION
2M.
L An(k, x).
lx-k/ni"'ll
that n;;:.: N implies
Ve, 3N so
e/2, uniformly in x. Thus
Ve> 0, 3N so that n;;:.: N
By condition (b)
(4.6. l)
than
IBn(f, x) - J(x) I
<
e.
(An)
previous theorem arise again and again. Such sequences fit under the
generic name "approximate identity." The reason for this name will
become clearer to those readers who investigate the theory of Banach
algebras.
The Bernstein proof of the Weierstrass theorem is obtained by choos
ing the sequence
(An)
4.6.2
Lemma.
For every n
E N0
defined by
A n(k,
N0 X
[O, l]
x) = () xk(I-x)n-k ,
n
(n _
<=> k
! kl
:s.;;
n
,
O<=>k>n.
(x+ l
-x ) n=
k=O
(n)xk(l-x)11-k.
k
It therefore remains to prove the crucial condition (b). This will require
some computations.
From the binomial theorem,
(x+ y)n
If we fix
Vn
=
E N0 and
()
Vx,y
E R, we have
n kyn-k
.
x
k=O k
y n-1
n(x + )
f k (kn)xk-lyn-k.
k=l
x we get
(4.6.2)
nx
I -x to get
i k ( nk ) xk(I-x)n-k .
(4.6.3)
k=O
n(n-I) (x + y)n-2
i k(k-I) ( nk ) xk-2yn-k.
k =2
n(n- I)x2=
k(k- I)
k2
k=O
=
I -x to get
k=O
From the fact that (k-nx)2
()
n
k
k2
()
n k
x (I-x)n-k
k
(4.6.4)
and (4.6.4)
()
( nx-k)2 n xk(I-x)n-k
k
k=O
nx(I -x).
(4.6.5)
Using (4.6.5) we get
.s2
l.r-k/nl;;.ll
()
xk (1
x)n-k
.::;
l.r-k/n l;;.ll
( )(
n
k
2
x - ! xk(I - x)n-k
n
x(l - x)
I
n
4n
This shows that V.S > 0, the sum on the left goes to zero uniformly in
x as n
-+ oo,
Proof.
b]
[O, I].
Bn (f, x)
converges uniformly to f.
f(k/n)
( )
xk(l-x)n-k
182 I DIFFERENTIATION
In the general case, set g{y) f( (b - a)y +a) for y E [O, I]. As y
ranges over [O, I], x = (b - a)y +a ranges over [a, b] Now,
.
g(ktn) (;) G:::::)\ :::::r-k.
=
Bn(g,y)
CHAPTER
5.1
51 INTEGRATION
RIEMANN-DARBOUX INTEGRALS
[a,b]
If Ik
{Ik: k E (l, n) } .
ak, and
A,
A* is
A*.
a refinement of
We also think it
184 I INTEGRATION
The function R1 is called the Riemann sum function for f and any number in
its range is called a Riemann sum for f.
The function R1 is said to have the limit R (J) ::::? VE > 0, 36., so that
a>-a. IR1(fi, {x k}) - R(f) I < E. In case R1 has a limit, we say that
f is Riemann integrable and the limit R(J) is the Riemann integral of f.
Note that in case R1 has a limit R (J), we are justified in calling R (J)
the limit of R1 since it is unique. Indeed suppose R1 (J) is also a limit of
R1. Then Ve> 0, 3fi, and 3fi1, so that if a>-a. and a>-d1,, then
DJ(D,,)
k=l
n
QA6.)
'L
k=l
Mkllkl,
mk11k1,
Mk=
mk
sup
{J(x): x Eh} ,
inf {f(x):
x E Ik} .
The functions D1 and !J.1 are called the upper and lower Darboux sum functions
for f, respectively. The numbers DJ(D,,) and !J.J(D,,) are called upper Darboux
and lower Darboux sums for f, respectively.
Set
and call these numbers the upper and lower Darboux integrals of f, respectively.
case D(f) !J.(f) D(f), we say that f is Darboux integrable and call
D(f) the Darboux integral off.
In
R(f)
and
D(f),
J:!(x)
dx,
5.1
By definition
n
!l1(A)=L mklikl,
k=l
For each Ik EA, let
Ak
II*I.
/*EAk
L mk IIkl
k=l
L 2, m*(I*)I
I *!.
k=l JE A k
Since A*= {/*: I* EA k & k E (l, n) }, it follows that the right side of
the above inequality is precisely !l1(A*) . This proves the left-hand in
equality of Lemma 5.1.6. The right-hand inequality follows by similar
reasoning, and the middle inequality is obvious.
,,;_;;
(5.1.l)
We shall suppose that a < b, since otherwise the fact that D( f ) exists
and is equal to R(f ) is trivial. If we choose xk EI k so that Mk -f(xk)
< e/2(b - a), then
-
I kl < e/2.
2, (Mk - J (xk))I
(5.1.2)
k=l
On the other hand, if we choosexk E Ik so thatf(xk) - mk < e/2(b- a),
we get
(5.1.3)
=
186
\INTEGRATION
(5.1.4)
(5.1.5)
e'
(5.1.6)
This shows that the Riemann integral of f exists and R(J) = D(f) ,
which completes the proof of the theorem.
Theorem. The Riemann-Darboux integral of the function f exists
the function R1 is Cauchy in the sense that Ve> 0 ,3 .6., so that .6.> A,
and .6.' >.6., ==::}
5. 1. 7
A' >A},
5.1
<
e/2,
<
e/2.
(5.1.8)
e/2.
<
(5.1.9)
<
e.
(5.1.10)
-;;;(A)- limR,
<
e,
-;;;(A)-R1(A , {xd)
<
e.
< E.
so
lS g.
< E.
Let A1,, A2,, and A3, be the subsets of A, which are decompositions of
[a,c], [c,d], and [d,b], respectively. If A2 is a decomposition of
[c,d] and A2 >-A2. , then A= A1, U A2 U A3, is a decomposition of
[a b],which is a refinement ofA,. IfA '2>-A2, and A' = A1, U A'2 U A3.,
then
188 I INTEGRATION
Proof.
For every
>
0 , 3d, so that d
>-
d, ==?
I 1 af(xk)IIkl - J:f(x) I
I J3g(xk)IIkl - J3 J: g(x) l
<
dx
e/2,
dx < e/2.
Thus
I [af(xk)
J3g(xk)] II kl - a
J:f(x)
dx - J3
J: g(x) I
dx
<
E.
<
E.
f2 - 2mf + m 2 ,
Du(A) -Qu(A)
< e2
5.1
kEA
E
kEB
Hence
n
k=l
D Exercises
l.
If
a*
of intervals in
2.
If
a*.
is a decompositiOn of
a is a union
show that
b-a= L II kl
k=l
3.
[a, b],
that
4.
5.
following:
J: x dx= 1/2.
r x2 dx 2/3.
f dx= -
(a)
(b)
-1
(c)
6.
If
1.
is a decomposition of
llll
=max
[a, b]'
we have defined
{III: IEa}.
VE> 0, 36> 0
so that
lal<6 ==>
O:s;;Dr(a)-ll.r(a)<E.
[a, b],
is Darboux in
190 j INTEGRATION
7.
of
IR,(A,
8.
If f is Riemann-Darboux integrable on
E.
[O, 1]
and if
show that
Sn J:J(x) dx.
(Hint:
9.
1
( k )2
n-oo -n kL=I -n .
1
k
n-oon- k=LI e ln.
(a)
lim
Jim
(b)
(Hint:
10.
is defined on
[a, b]
same in
tegral as f.
5.2
(b)
on
a and f3,
J: [af(x) + f3g(x)] dx=a J: f(x) dx + f3 J: g(x) dx,
J 0 implies J: f(x) dx 0,
5.l!
(c)
(d)
b implies
Proof.
The proof of part (a) is the same as the proof of part (a)
of Theorem 5.1.9.
Part (b) follows from the fact that all the Riemann sums are non
negative, and hence the limit must also be nonnegative.
To prove part (c) we note that it is always true that
Iii - f;;:;.
IJI + f;;:;.
0 and
of/to
[a,c]
and
positions a .. and
e/2,
e/2,
If a.= a,. u
of
inequality
Ll,
e.
Ll1
Ll2,
we are
done.
J: f(x) dx
Proof.
lk
Let
[ak> bk]
Ll = {Ik: k E ( 1, n)}
ak <bk. Then
and
g( b )
- g(a) .
be any decomposition of
[a,b]
with
192 I INTEGRATION
n
g(b) - g(a) = L [g(bk) - g(ak)]
k=l
n
L g'(xk)[bk - ak]
k=l
=
n
L f(xk) IIkl ,
k=l
We have, of course, used the Mean Value Theorem, and this is where
>
0, if we choose a so that
<
{xd
E,
<
e.
Proof.
f(x)
Clearly,
Rn(t,t)
n-1 <k>(t
j
)
L k !- (x - t)k + Rn(X,t) ,
k=O
=
R'n(x,t)=-
[a,b].
fixed, we get
Hence, if
Rn(X, c)
b, we
Rn(X, c) - Rn(x,x)
-
tE]a,b[.
J: R'n(x,t ) dt
(x - t) n-1 n>
p (t) dt.
(n - I)!
5.2
If {3 ;;;.:
and
g is
integrable on
[a,{3], define
l) ! J:
Rn(X,c) =
(n
(x - t)n-ipn>(t) dt.
b.
(5.2.1)
fg(b) f(x)
d:x=
g(a)
fb f0g(x)g' (x)
d:x.
If d
{ [ak,bk]: k E (1, n)} is a decomposition of [a,b],
{ [a'k,b'k]: k E (1, n)}, where a'k = g(ak), b'k g(bk) is a de
composition of [g(a),g(b)]. Conversely, since g is continuous and
increasing, it is one to one and every decomposition of [g(a),g(b)]
comes from a decomposition of [a,b J in this way. Further, xk E [ak,bk]
<=> x'k = g(xk) E [a'k,b'k] and d1 >-d <=> d'1 >-d'.
Since f is integrable, for every E > 0 there exists a decomposition
d', of [g(a),g(b)] so that if a' >-d'., then
Proof.
d'
then
d:x <
(5.2.2)
e.
g(a)
Ik
a,
[a,b].
If I'k E
d',
to get
(5.2.3)
If we take
x'k = g(xk)
in
(5.2.2),
then from
(5.2.2)
and
(5.2.3)
we get
va >-a.,
d:x <
Since
f0g(x)g' (x)
E.
10
of Section
5.1).
REMARK:
va >-a.'
(f0g)g'
is integrable
{xk}
>-a.
194 I INTEGRATION
J:
In this case
f(x) = -Let
dX .
f g(x) =
0
cos
us take
x,
0,,;.;;
x,,;.;; 7T/2.
1. Thus the
g'(x) =cos x,
and thus
J: f(t) dt,
F(x) =
F(x+h -F(x)
- f(x)=
h
[a, bJ which
is
a point of continuity
we have
If
[a, bJ,
> 0, we have
Hence
f is continuous at x, Ve >
IJ(t) - f(x) I < E. If we take h
Since
=>
h\"}
0, 38 so that
It - xi
< 8 &
fe(f)
F (x+h)-F (x)
=J(x) .
h
5.2
If
lim
h)'O
F(x + h) - F(x)
= J(x) ,
h
5.2.6 First Mean Value Theorem. If f and g are defined on [a, b],
g 0, Jg and g are integrable, and J is bounded, then there exists a number c
such that inf J ::% c ::% sup f and
J: J(x)g(x)
Proof.
Let
dx= c
m =inf f, M =sup
J: g(x)
f. Since
dx.
mg(x)
::%
J(x)g(x)
::%
Mg(x),
it follows that
m
Now, if
J: g(x)
J: g(x)
f J(x)g(x)
dx ::%
dx ::% M
J: g(x)
dx.
J: g(x)
dx
> 0, set
c=
dx
::%
::%
dx,
M.
5.2. 7 Second Mean Value Theorem. If f and g are defined on [a, b],
g 0, Jg and g are integrable, J is bounded, and m ::% inf J ::% sup J ::% M ,
then 3c E [a, b] such that
.
f J(x)g(x)
Proof.
dx= m
G(x)= m
G
J: g(x)
on
dx + M
[a, b]
J: g(t) dt
f g(x)
dx.
by the equation
f g(t) dt.
min
::%
G(b) = m
J: g(t) dt J: J(t)g(t) dt
M J: g(t) dt= G(a)
::%
::%
Since
::% max
and max
G,
G.
196 I INTEGRATION
J: f(t)g(t) dt
G(c)
[a, b]
such that
J: J(t)g(t) dt
J: fn(x)
dx -
J: f(x)
dx.
lsup{fn(x): x
Iinf Un(x): x
Fix
N and let
E
E
A} - sup{f(x): x
A} - inf {f(x): x
E
E
<
e/3
a.
>-
=>
Hence
Thus
is integrable. Further,
I J:fn(x)
dx-
ll21m(.:1) - l2r(.:1) I
< E
N =>
dx e/3,
,,,/
Corollary.
We shall leave the obvious proof of this corollary for the reader. We
should remark that it is possible to relax the hypothesis about the uni-
5.2
Suppose
fn(x)
n+ l
{::}
x E ]0,1/(n+ I)],
0,
otherwise.
fn(x)
0. But Vn E N0,
Let
a= {lk:
bn
b, and ak
.,;;
respectively. Hence
+ f(an)(b - an).
Thus
D,(a) - Q,(a)
Now, for
>
"
_L
k=l
D,(a) - [},(a)
Since
D(j)
.,;;
15,(a)
and
[},(a)
E'
b.
and we get
.,;; E [f(b)
.,;;
an+l
- f(a)] .
198 I INTEGRATION
it follows that
r f(x)
Lb f(x)
dx-
E> 0,
Proof.
For every
:L llkl,,;;; 2m8,.
kEBo
L I Ijl
}EB
L lfkl
kEBo
Hence we have
JEB
ing by a set of open intervals the sum of whose lengths is less than
It turns out that a bounded function
is Riemann-Darboux integrable
5.2
Proof.
Since
b. Setting
J: f(t) dt,
such that
F'
f.
collection of primitives of
The
that
D Exercises
I.
If
La J(x)
If a,b, and care
5.2.l(d) is valid.
2.
dx
[a,]
-J: f(x)
define
dx.
and g are
3.
is integrable by
[a,b]
If
F(x)
show that
5.
[a,b] , and F
is defined on
by
F is
J: J(t) dt ,
continuous.
200 f INTEGRATION
6. Suppose g and g' are defined and continuous on [a, b], and J is
defined and continuous on (g). If Fis a function defined on (g)
by the equation
F(y)
[Note: If
y <
g(a),
g(a)
=.
o<a>
J(t) dt
[a, b] ,
g(x)
J: Jo g(t)g'(t) dt.
b,
then we get the formula of Theorem 5.2.4. Hence for a certain dass
of functions this result is a generalization of Theorem 5.2.4.]
7.
J: J(x)g(x)
8.
dx = J (c)
3c
[a, b] so that
J: g(x)
dx.
9.
x is irrational,
if
x is rational .
10.
g(x)
11.
If g is continuous on
[O, 1] as follows:
[a, b], g
J exist?
;;;;,; 0 and
J: g(x)
dx
0, show that
0 for every x.
J: Jn(x)
dx
J: J(x)
dx.
1 2.
on
5.3
13.
in Theorem
5.2.8 using
14.
00
--= "" xk
1-x
,.,,
k=O
15.
Expand
l/
'
lxl < 1,
(1- x)
around
0.
16.
Rn (x ' c)
(x- c) n-1
=
(n - 1) !
J<n+O
Rn(x,c)
But
Rn(b,c)
f(b), and
5.3
Rn(x,c)
c) ds
(=r-1 Rn(b,c).
this implies
0 Rn(X, c)
This implies
G=r-l f(b).
0.
IMPROPER INTEGRALS
[a,oo[.
One natural
3.
or
I(J)(x)
J: f(t) dt.
The ordered pair (J, J(J)) is called an improper integral of the first kind
b= oo , and it is called an improper integral of the second kind b E R.
202 I INTEGRATION
The improper integral (f, I(f)) is called convergent<::) limx-b I(f) (x)
exists. An improper integral is called divergent <::) it is not convergent. The
improper integral (f, I(f)) is called absolutely convergent<::) (ifl, J(ifl))
is convergent. In case (f, I(f)) is convergent, then limx-b I(f) (x) is called
the limit of the improper integral and it is denoted by
J: f(t) dt .
As in the case of infinite series, it is often convenient to denote an
J: (f(t) dt).
As an example of a convergent improper integral of the first kind
.r- 00
.r
e-" dt
e-1
Then
I,
L'" e-1 dt
I.
we have
lim
.r-1
ix
0
dt
7T
- -
\!T=t2 -
Clearly Definition 5.3.1 does not exhaust all the possibilities for defining
]a , b] (a
E R or
a = -oo )
and V x E
I(f) =
J f(t) dt.
The type of theorems that one can prove about improper integrals
of the first and second kinds resemble very closely analogous theorems
for infinite series. As examples we prove the following results.
5.3
Vx,y
Proof.
If lim.r-b I(f)
(x)
<
e.
vious. On the other hand, if J(f) is Cauchy, the theorem follows from
Proposition 2.4.4.
Proof.
Suppose
and x,y E
[a, b[
with
y > x.
Then
If f(t) dt l
f lf(t)l dt
f lg (t)I dt
II(lgl) (y) -J(lgl) (x)I.
Since
I (lgl)
(lgl, I (lgl))
I(IfI)
and
hence J(f) is Cauchy. Thus, using Theorem 5.3.2 again, we have com
[k
f(t)
for
(5.3.1)
However,
J(f) (n)
Hence we see that if lim.r- .,,/ (J) (x) exists, the monotone nondecreasing
sequence defined by the left side of (5.3.1) is bounded and hence con
vergent. On the other hand, if the series k,.,0
(J(k))
is convergent, the
204 J INTEGRATION
f(t) = (t + I)P'
0.
J( k)
(k + I)P
(I/kP)
converge or
x -dt
1
=
tp
{ _l_
1- p
log
[x1-p
l]
'
for p =I= 1,
for p=
x,
1.
Thus we see that the given integral of the first kind converges{:=:} P > 1.
Thus the p series converges {:=:} p > 1 .
Let us give another example that shows how the techniques used in
the proof of the integral test can be used to obtain rather refined esti
mates for certain finite sums. Let us show that
n
k}:
=2
k log k =
-
0 < a<
0 <
bn
1
2 log 2
<
log log 2
1
n logn
1/ (t
log
(Fig. 5.3.1). Let ek be the area as shown in the figure; that is,
1
k
e = k log k
k+l dt
J t t'
k
log
2.
t), t > 1
5.3
1
t
log
k+l
FIGURE 5.3.1
n-1
n
n-1 k+l dt
1
1
2: k log k = 2: k t log t+ 2: ek+n logn
k=2
k=2
k=2
--
(5.3.2)
--
n-1 k+l dt
n dt
= = log logn - log log 2 .
L k 1
t
og
t
t 1og-t
k=2
Further,
I
< ek< k log k -
O
and thus
<
n-1
n-1
2 ek<
I
I
+ l)log(k+I)'
(k
1
k log k - (k
+I)
l)log{k
2 log 2
(5.3.3)
(5.3.4)
n logn
l::k,,.2 (ek)
< Lek< n
k=n
I
1ogn
(5.3.5)
Let us set
co
(5.3.6)
(5.3.6')
bn .
n
I
= log logn+ a+ bn.
L
k=2 k log k
-
and from
l!06 I INTEGRATION
(5.3.7)
is not absolutely convergent; that is, the function with values ! sin
xl/x
does not have a convergent integral of the first kind. To see this we
simply note that
Vn
E N,
lmr j
sin
Now,
Vt
ti
k
< +ll1r j sin tj
n -1
dt = 2: f
dt.
k=O k1T
Hence
Jo
n"
!sin ti
t
E N0,
1
7Tk=ok+l
n-I
dt-2:
Since the sequence defined by the sum on the right is divergent, we see
that the integral in
(5.3. 7)
On the other hand, the integral in (5.3. 7) is convergent and the proof
is very much like the proof for Abel's test,
lx
0
sin
--
dt =
f"'2
sin
dt +
--
Ix
1T/2
sin t
--dt.
t
t/t,
continuously
to t= 0 by giving it the value 1 there. Hence the first integral on the right
exists as an ordinary Riemann-Darboux integral. Let us use integration
by parts on the second integral. We have
fx -- d
sin t
7T/2
Since
Vt, ! cos ti
t--
fx
cos x
cos t
-- dt.
x
7T/2 -2t
100
0
f"'
sin t
1"'2 sin t
--dt=
-- dtt
t
7T/2
0
5.3.3
cos t
-2
dt.
- loge=
that the function with domain
I.I dt
]O, l]
5.3
>
0,
. [f- -+
dt f dt
t
t ]
1
hm
E-0
-1
=0.
has a
Ve
I(j) (e) =
c- f( t )
l>
f
dt +
c+E
f(t) dt.
Then the ordered pair (f, I (J)) is called a Cauchy principal value integral.
Similarly, if f has domain ]-oo,oo[ and Vx 0, Jl[-x,x] is integrable,
and I(j)
I(f )(x) =
J:/(t) dt,
then the ordered pair (f, I (f)) is also called a Cauchy principal value inte
gral. The Cauchy principal value integral (f, I (f)) is said to be convergent
=>Jim,_0I(J)(e) or limx-ool(J)(x) exists, and the latter numbers are
called Cauchy principal values.
We shall now give some examples which show that the symmetry used
in the definition of the Cauchy principal value may be very important.
Since
t2
sin
[-x,x]
x 1
+
-x
t2
[-x,x]
is 2 Arc tan
sin
1 +t2
dt
is zero. Also
Thus
x.
7T.
-x
+ t2
dt = 2
Arc tan
x+2
cos
x+
x+1l
I - sin t
dt,
1 + t2
x - oo.
hm
x-oo
Jim
2x
-xI+t
x -co
I+t
dt = 7T.
t2
-x 1 +
fx
-x
x-co
2x
..!....i_ dt.
I+t2
208 j INTEGRATION
Now,
2X df
J
x-00 x I+t2
. 2x -1-t - dt I .
x-oo Jx +t2 x-oo
hm
hm
Thus
hm log
(1+4x2 2)
1+x
. zx,I+t
+t2 dt +
x-oo J-x 1
hm
7T
log 2 .
log 2.
The definitions we have written down do not exhaust all the possi
bilities for defining improper integrals and the reader can undoubtedly
think of cases we have not discussed. However, in most instances a
suitable definition of an improper integral will be either a variant or a
combination of the definitions we have discussed. Some of the follow
ing exercises are designed to exhibit the various possibilities.
D Exercises
I.
2.
(b)
(c)
3.
(b)
(c)
4.
!100 (/ 1).
L"" C2 )
J:""C3 )
L1 (t t dt).
J: c- t dt) .
in/2 ( )
0
log
in
cost
5.3
(b)
(c)
5.
J: (;!).
Loo (;!).
L (k
)
J:oo C( (2 )
J-oooo ( t2 t I tltl1/2 ) dt
00
(c)
6.
1112
7.
00
f C t)a )
J: ( J
L"' Ca g t)
1
( lo
r I g
sin
8.
9.
integrals.
IO.
(a )
(b)
(c)
I I.
L"' ( t dt) .
"'
L ( I t ti dt ) .
oo C! t dt .
)
L(
i
5
si
""
k=I
(k3e-k).
210 I INTEGRATION
00
(b)
(c)
k=2
((log k)P/k).
( k (log og k)")
12.
an=
pn
L
k=qn+l
1 and let
Use the ideas involved in the proof of the integral test to establish that
()
an log
(Hint:
"1
L-
k=l k
0 <
as
oo.
n
dt
- + a + bn,
t
1
In the equation
n
l og k =
L kk=2
log log n
a + bn,
show that
log log 2
log 2 -
<
a <
and
0 <
log log 2 ,
2 log 2 -
bn < 2
n l ogn
[Hint: Extend the techniques used in the text and show that V k 2,
I
I
2 k log k - (k + I) log (k + 1)
5.4
<
ek <
1)
k log k - (k + I) log (k +
RIEMANN-STIELTJES INTEGRALS
5.4
S1,0(d , {xd)
k=I
The function S1,0 is called the Riemann-Stieltjes sum function for f with respect
to g and any element in its range is called a Riemann-Stieltjes sum.
The Junction s l.o is said to converge to the limit s (f,g ) v E > 0' 3 a.
so that a >- a.
IS1.o(d, {xk})-S(f,g )I
<
E.
In case the limit of S1,0 exists, the number S( f, g ) is called the Riemann-Stieltjes
integral of f with respect to g and it is denoted by
S(f, g )
J: f(x) dg(x).
L mk[g(bk)-g(ak) ],
[l(f,g)
su p {!l1.0(d):
d E >([21,0)}
J:
J: f(x) dg(x) ,
arJ,d call these numbers the upper and lower Darboux-Stieltjes integrals of f
with respect to g, respectively. In case D(f,g) [l(f,g) D(f,g), we say
that f is Darboux-Stieltjes integrable with respect to g and call D(f,g) the
Darboux-Stieltjes integral of f with respect to g.
=
212 I INTEGRATION
ISr.o(a,{xd)-Sr.o(a',{x'k}I
<
e.
5.4.5 Corollary. Suppose [a1, b1] C [a, b], f and g are defined on
[a, b], and f1 and g1 are the restrictions of f and g to [a1, b1], respectively.
If f is Riemann-Stieltjes integrable with respect to g, then fr is Riemann
Stielt.Jes integrable with respect to g1
The proofs of Theorem 5.4.4 and Corollary 5.4.5 follow
mutandis
mutatis
J: f(x) dg(x)
?!! 0.
5.4
The proofs of (a), (c), (d), and (e) follow the proofs of Theorems
and
5.2. l
5.1.9
ple and we also leave it as an exercise. The proof of (b) also follows from
part (a) and Theorem
5.4.8.
Proof.
with
Ik
dx.
s,.u(a, {xd>
Since the limits on the left side and on the right side exist, they must
be equal. This establishes the theorem.
An immediate corollary of Theorem
if we take
f(x) =
5.4.7
is Theorem
5.2.2. Indeed,
J: dg(x)
g(b) - g(a) ,
5.2.2.
5.2.3.
f(b)g(b) - f(a)g(a)
J: df(x)g(x).
214 I INTEGRATION
Proof.
.1 {h: k E ( l, n)}
[ak,bk]. We may write
Let
where/k =
be any decomposition of
[a,b],
=L
k=l
Now
[ak,xk]
.1'
f(bk)[g(bk) - g(xk)]
L f(ak)[g(xk) - g(ak)].
k=l
and hence
{[ak,xk]: k E (l,n)}
{ [xk,bk]: k E (l,n)}
S0,,(a, {xk})
x'k = ak for
[xk, bk]. Since, by
where
= f(b)g(b)
the interval
hypothesis,
[ak, xk]
S(f,g)
and
x'k =bk
Proof.
g(x) dh
g(x) .
is a decomposition of
versely. Further,
versely. Hence
Since
'1 1
>
> .1'2, and since the limit on the left exists, the
h(x) = x.
5.4
dx
for any continuous function f. Assuming that the integral on the left
exists (which we shall prove in Section 5.5), set g(x)
Arc sin x. Since g
is monotone increasing and g(O) 0, g(l)
7T/2, if we use the formula
of Theorem 5.4.9 we get
=
But since g is the inverse of the sine function, we have sin g(x) x.
Of course, in this case Theorem 5.2.4 in conjunction with Theorem 5.4.7
will also justify this change of variable. (See also Exercise 6 of Section
5.2.)
The next two theorems are generalizations of the mean value the
orems, 5.2.6 and 5.2.7.
=
c[g(b) - g(a)J.
J: f(x) dg(x)
Proof.
J(a)
f dg(x)
+ J(b)
J: dg(x).
J: f(x) dg(x)
f(b)g(b) - f(a)g(a)
- J: g(x) df(x).
J: g(x) df(x)
g(c)
J: df(x)
g(c)[J(b) - J(a)].
216 \INTEGRATION
Hence
J:
J:
dg(x) + f(b)
dg(x).
NOTE:
5.5.2
J,
D Exercises
If f is defined and bounded on
I.
[a, b]
g,
J:
F(x) =
f(t) dg(t)
2.
[a, b],
r
r
1:
(a )
(b)
(c)
is nondecreasing on
f(x) dh(x) =
f(x) dh(x)
r
J:
f(x) =
f:
f(x) df(x)
f(x) dh(x)
f(x) dh(x) +
f(x) dh(x) +
3.
Does
[a, b]
show that
r
1:
[O, l] in
g(x) dh(x).
g(x) dh(x).
{I'
for
x E [O, 1/2],
0,
for
x E ]1/2, l].
exist?
with
a1
Suppose
=
5.5
uous on
[a, b],
J:
5.
Suppose
Iff(x)
6.
x for x
If f,
and J and
g) exists and
J(x) dg(x)
is defined on
g(x)
[O, l]
x,
x + 3/4,
x + 9/8,
[O, I],
f(ak)8k.
x
x
for x
compute
for
for
[O, 1/3],
]1/3, 2/3[,
[2/3, I].
f(x) dg(x).
7.
Prove Theorem
8.
5. 4 6(d).
.
Vn E
[a, b].
J:
9.
fn(x) dg(x)
J:
f(x) dg(x) .
3.2.5,
5.4.8.
IO.
Supposef,
J: [J:
(J:
=
) (f
f(x)2 dh(x)
) (f
g(x)2 dh(x) -
r.
f(x)g(x) dh(x)
(f
5.5
r (J:
f(x)g(x) dh(x)
) (f
f(x)2 dh(x)
g(x)2 dh(x) .
To get an idea of the kinds of functions that will guarantee the existence
of the Riemann-Stiehjes integral, we shall start with a continuous func
tion
a function
218 j INTEGRATION
Let
a1 and a2 be
decompositions of an interval
and a=
2,
of
{I k :
I :,;;; k
[a, b]
Suppose
a;= {l;k:
(1, n;)}, j = l,
:,;;; n}. Now, suppose we have labeled the elements
a2
k E
so that
l1k = {/;: lk
<
j:,;;; lk+1}.
g(b1k) - g(a1k) = L
i=lk +I
x1
and hence if
11
and
x1k
Ilk
[g(b;) - g(a;)] ,
we have
lk+I
'k+l
j=l k+I
n1
1k+I
k=I
i=lk+I
(5.5.1)
is continuous on
E.
and
L lg(b;) - g(a;) I.
J=I
j=I
then we have
(5.5.2)
That is,
S1,g
5.4.4
with respect to
and
(5.5.2)
that
g exists.
5.5
L Jg(bk)-g(ak) I
k=l
:;;;; M.
g(x) - g(a)
g+(x)-g-(x),
(5.5.3)
and so that for every pair of monotone nondecreasing functions J+ and 1- for
which
g(x) - g(a)
we have
j+(x)-J-(x),
[a, x]
[a, b],
let
sg+(a(x))
2 L {Jg(bk)-g(ak)I
k=l
1
sg-<a<x)) = 2
[g(bk)-g(ak)]},
L {Jg(bk)-g(ak)I- [g(bk)-g(ak)]}.
k=l
%20
I INTEGRATION
Sg+(A(x)) -s0-(A{x))
g(x) - g(a).
(5.5.5)
sup{sg+(A{x)): A{x)
.#(x)},
Now, if x < y, then VA(x), 3A{y) such that A{x) C A{y). Indeed, if
A(x,y) is any decomposition of [x,y], then A(x) C A(x) U A(x,y)
=A{y). Consequently, it follows from this fact and (5.5.6) that
g+(x)
sup{S0+(A(x)): A{x)
.#(x)} g+(y) ,
S0(A{x)) =2
k=l
'
Sg+(A(x)) + sg-(A(x))
k=l
E
lg(bk) -g(ak) 1.
.# (x)}
I
2 [g(x) - g(a)].
If we subtract g -(x) from g+(x) and use the above equality, we get
(5.5.3).
;;;,:
0 andB ;;;,: 0,
we always have
l+(x) - 1-(x),
2 {lg(bk) -g(ak)I
Thus
sg+<<x))
1l
.;;;
k=I
It follows that
l+(a)
1-(a),
and
1-(x) -g-(x)
l+(x) -g+(x).
;;;.:
Thus
0,
decomposition of
g -g(a)
into functions
canonical decomposition
ofg -g(a).
g(a)
andg(b+)
g(b).
g(x+) -g(x)
g(x) -g(x-)
{lg(x+) -g(x)I
=4 {lg(x) -g(x-)1
[g(x+) -g(x)]},
(5.5.7)
[g (x) -g (x-)]}.
222 I INTEGRATION
Proof.
SO that
0 g+(b) - so+(a)
[g+(x) - so+(a(x))]
+
(5.5.8)
E.
Now, by the method used in the last paragraph of the proof of Theorem
5.5.3, we can easily establish that all the terms in the sum on the right in
(5.5.8) are nonnegative. Thus we get
0 g+(y) -g+(x) - S0+(d(x,y)) <
E,
E.
we get
+
If g is of bounded variation and g and g form the
canonical decomposition of g - g(a), then
5.5.5
Definition.
(5.5.9)
is
called the variation of g and v(g) = v0(b) is called the total variation of g.
The next proposition gives another way of computing the variation
of a function.
5.5.6
Proposition
(5.5.10)
where
Sg(A(x))
n
L lg(bk) -g(ak)I.
k=l
Moreover,
Vg(x+)-vg(x)
Vg(x) - Vg(x-)
Proof.
lg(x+) -g(x)I,
(5.5.11)
l (x) - g(x-)1g
4 sup{Sg(A(x)): A(x)
tt (x)}
1
2 [g(x) -g(a)].
Thus, adding g + and g-, we get the formula (5.5.10). The formulas in
(5.5.11) follow immediately from the formulas (5.5. 7).
As we have already noted, a function g of bounded variation can
have only jump discontinuities. From Theorem 5.5.4 there are only a
countable number of jump discontinuities and moreover from Proposi
tion 5.5.6,
J (x) -g(x-)I= Vg(x+) - Vg(x-).
Jg(x+) -g(x)I+ g
Let J be the set of jump discontinuities of g and {xk: k E (1, n)} any
finite set in]. Suppose these are labeled so that x1 < x2 <
< Xn
Then
n
L [ Jg(xk+) -g(xk)J + Jg(xk) -g(xk-)J ]
k=l
n
L [vg(xk+) -vg(xk-)]
k=l
n-1
:;;:;; L [v9(xk+) - Vg(xk-) + Vg(Xk+i-) - Vg(xk+)] + Vg(xn+)
k=l
- vg(xn-)
The last three inequalities follow from the fact that v9 is a monotone
nondecreasing function. Now, if] is denumerable, let (xk) be a sequence
with range]. The last inequality shows that
00
L g
J (xk+) -g(xk-)I
k=O
:;;:;; v(g),
and thus the series on the left is absolutely convergent and consequently
independent of the order of summation of the terms. Therefore, if
J is finite or denumerable, the number
%24 I INTEGRATION
( +)-gx
( -)]
L [gx
xE}
[a ,x [ , and Vx E [a,b ],
g,(x )=
[g(y+)-g(y-)] +gx
( )-g(x -).
g8is
called the
facts about the saltus function are easily established and we have
left the proofs for an exercise at the end.
g8(x +)-g8x
( )= gx
( +)- gx
( ),
(a)
(b)
( -)= gx
g8(x )- g8x
( )-gx
( -),
Vx E [a,b].
and
(c)
Vx E [a,b ].
If we set
gc = g-gs
it is clear from condition (a) that
gc is
gc is
of bounded variation.
g8 We
know that
this integral exists by virtue of Theorem 5.5.2. By the proof of the latter
theorem
Ve>
0, 311
> 0
<
E/2.
ak and
definition of g. that
where
case
the right.
g+b
( )-g+ (b-)to
the sum on
5.5
lf(x) - f(y)J
<
inequality
L Jg(x+) - g(x-)J
<
xE}
Thus if bk#
b, we get
f(k)[g,(bk) - g,(ad]
and, if
where
M.
LJekJ
k=l
<
e/2.
k, we see that
< E.
g(x) - g,(x) ,
It is clear that
Vk
N,
g(O+) - g(O)=a0,
g. Since the series is absolutely
g is of bounded variation. The function g is a saltus function
and indeed g,
g. Further,
and these are the only discontinuities of
convergent,
Lak =
dg(x).
Ol
k"'=O
226 I INTEGRATION
D Exercises
If f and g are functions of bounded variation on [a, b], show
f + g and Jg are functions of bounded variation. If lgl m > 0,
I.
that
g1
gl [a, c], g2
tion and
v(g)
v(g1) + v(g2) .
4.
g(x)
[O, l] as follows:
x sin (7T/x),
for
#- 0,
0,
for
0.
A function
[a, b],
Suppose f is defined on
IJ(x)-f(y) I
12
- Yl 1
[a, b] ,
Suppose f is defined on
IJ(x)-f(y)I
M Ix-
YI
5.5
IO.
Suppose f is continuous and of bounded variation on [a, b].
Show that
6j HIGHER
CHAPTER
DIMENSIONAL SPACE
Stokes' theorem, but the names of Gauss and Green may also be associated
with it.
In this chapter we shall lay the basic foundation for the subsequent
chapters. We shall discuss real vector spaces with a certain distance
function acting on pairs of points and shall also discuss general proper
ties of continuous functions as well as special continuous functions called
linear transformations.
6.1
(x1,
"
,x
),
where
ER fork E (l,n).
(1, n)
(1, n)
domain Rn X R" and range R" and the second one having domain
228
6.1
,xn)= (ax1,
axn).
x= (x1
'
'xn).
k=l
akxk=O::::::} ak=O,
Vk E (l,m).
L
XEA
a(x)x
0 ::::::}
a(x) =
0,
Vx E A.
2:
a(x)x
.xEA
2:
k=l
a(<l>(k) )<l>(k) =
2:
k=l
akxk .
6.1.2
space of
Definition.
vn
EL.
6.1.3 Definition. A vector x E vn is said to be a linear combination of
the vectors in the set {xk : k E (1, m)} C vn ::::> there exist numbers a1 ,
am E R so that
so that
6.1
(6.1.1)
where
eki =0
ifj -
k
ek =1.
and
V3
is given by the
three vectors
X1=(1,0,0},
X3 = (1,1,l) .
X2=(1,l,O),
6.1.5
Theorem.
Since y1 -
0, we
may write
r
Y1 = L <X1kXk,
k=l
where not all the
xk1
a1k 1
0.
Then
is a linear combination of
Y2=/3Y1
k.. k,
xk
we find
<X2kXk.
Hence the latter set generates the same linear subspace generated by
{xk: k
(l,r)},
and thus
r
Yr+t = L f3kY k
k=l
This contradicts the linear independence.
The formal way to do this is as follows. Forj E
statement: There exist distinct integers
vector in the set
{xk;: i- E ( l ,j)}
k1,
ki
in
in the set
{y;: i
For j >
s,
P (j) be any
P(l) is true.
let
have shown
( l,j ) }
{xk: k
(l,r) & k
k;, i
( l,j ) } .
1 =1. Now, we
P(j) is true.
The set
{yi: i
E (l,j)}, which is impossible. Now, the process we have carried out
for y2 in the previous paragraph can be carried out for YJ+i, and we se
that P(j + l) is true. If j ;;;,, s, P (j) :::::} P (j + 1) automatically. Hence
we have the statement (j) (P(j)), and taking j
s we see thats.;;; r.
wise YJ+i is a linear combination of the elements in the set
6.1.6 Corollary. If L is a linear subspace of vn, then any two bases for
have the same number of elements.
6.1. 7
Theorem.
independent set in
Proof.
If L is a linear subspace of
can be extended to a basis for L.
Suppose
{xk: k E ( 1, r)} C
V",
n elements, it follows
6.1.5 that any linearly independent set contained in L
{xk: k E (I, r)} has at most n elements. Hence, among
m.
If this set is
L. Otherwise,
{xk: k E ( 1, m + 1)}
{xk: k E (I, r)}. But this is a
L - L1, so that
6.1.8
Corollary.
If L is a linear subspace of
V", L #-
{O}, then
has
a basis.
Proof.
Since L #-
{O},
formed from this one element is linearly independent and hence can
be extended to a basis for L .
Corollaries
6. J .6
and
6.1.8
sense.
6.1
( )
We shall usually abuse the language and designate a real vector space
by 'V' instead of the triple ' (V, +, ) .' Also we shall follow the usual
custom and drop the dot when we "multiply" an element of R by an
element of V; that is, we shall write 'ax' instead of 'a x.' We shall also
define -x to be the vector Ix.
All the definitions, theorems, and corollaries that we have given for
the vector spaceV" will carry over mutatis mutandis to a finite-dimensional
vector space V. We shall leave the verification of this as an exercise.
A simple example of a finite-dimensional vector space is the space
R,. [x] consisting of all polynomial functions of degree at most n, that
is, all functions given by
-
p (x) = L
k=O
akxk,
We shall soon see other examples of a real vector space, and indeed
we shall even see an important example of one that is not finite-dimen
sional.
D Exercises
I.
Let V be a real vector space of dimension n. Show that V is
isomorphic to vn; that is, there exists a one-to-one function <P with
domain V and range V11 so that Vx ,yEV and Va , f3 ER ,
3.
Decide which of the following sets of vectors form a basis for V4:
(a) { (I,l, 0, O), (O,0, 1, 1), (I, 0,0,4), (O, 0,0,2)}.
(b) { (I,0, 3,1), (1, 1, 0, 2), (0,1, 2,1), (2,2,5, 4)}.
(c) { (l, 0,1,0), (0,1,0,0), (0,0, 1, 0), (2, 1, 3, 0)}.
(d) { (a, b, 0, O), (I, 0, c, O), (0, 1, I, a ) , (I, 0, 1, b)},
where ab< 0.
4.
{ k=l
xk=O}
(b) { x: (xk)2= }
k=I
(c) { x: i kxk 1} .
k=I
( a)
x:
6.
Is the vector (I, -3,2,-2) in the linear subspace generated
by the vectors in the set {2,3,-1, 4), (3, 0,1,2), (1, 6,-3,6)}?
For
( a)
(b)
(c)
6.2
(2, 3,
1,
L+M and L n
10.
dimension by
12.
6.2
M?
EUCLIDEAN SPACES
use the symbol 'lxl' to denote the absolute value of a number and we
can intuitively think of this as being the distance from x to 0. We shall
use the same notation for elements of vn.
6.2.1
vn
to
Definition.
defined by
lxl
vn
is a Junction from
{ (xk)2 } 1'2.
The pair (V", I I), consisting of V" and the length function I I,
is a prototype example of what is called a Euclidean space of dimen
sion n and we shall designate it 'En.' The number Ix - YI will often be
called the distance between x and y. By an abuse of language we shall
say
length function.
The reader will perhaps recall from the elementary theory of analytic
geometry that the cosine of the angle between two vectors x and y in
the plane (Fig. 6.2.1) is given by
cos
(J =
x1y1 + x2y2
.
lxl IYI
FIGURE 6.2.1
The number
of the vector
6.2.2
Definition.
defined by
Proposition.
conditions:
(a)
(b)
(c)
(d)
(e)
xy =y x,
Vx, y E vn.
VaER & Vx,yEV".
a(xy) = (ax)y=x (ay),
Vx,y,zEV".
(x+y)z=xz+yz,
xxO, VxEV",andxx=0:::}x=O.
x x= lxl2,
VxEV".
6.2.4
Theorem (Cauchy-Bunjakovsky-Schwarz).
I x YI
lxl IYI ,
6.2
Note that in
V2
! cos Bl
:;;;; 1.
Proof.
a>..2+b>..+c=a
_
>..+l._
2a
a ;C 0, it must be positive;
VA.,
Since
that
)2
_.!_ (b2-4ac)
4a
V>..
E R,
(6.2.1)
0.
2
4a2 >..+:J :;;;; (b2- 4ac),
b2- 4ac=0,
r,
>..+
:a
b2 - 4ac,,;;;; 0.
real root,
and hence
V>..,
r =-b/2a.
b2- 4ac=0.
For every>.. E R ,
(y+ >..x)
r, then
0.
(6.2.2)
(x
If
y =ax
or x
y /lxl2,
it follows from
6.2.6
y=ax.
For every x, y
I lxl - IYI I
,,;;;;
I x+ YI
,,;;;;
lxl + IYI,
E En,
a 0
and equality
Proof.
:y+
IYl2
Thus
the proof of the inequalities. The statements about equality are immedi
ate consequences of Theorem 6.2.4 and the above computations.
The reason that the previous theorem is called the triangle inequality
is that the length of any side of a triangle is less than or equal to the
sum of the lengths of the other two sides (Fig. 6.2.2).
x +y
length lxl
FIGURE 6.2.2
6.2. 7
Definition.
Two
vectors
normal Vj, k
= 1.
{ek:
E (I, m)}
6.2.8
{ 0} ,
Theorem (Gram-Schmidt).
Proof.
Let
{xk:
E (l, m)}
Y1
X1f lx1 I,
FIGURE 6.2.3
Yi is the vector
Y2 = z2/lz2I
j=l
The vector zk+i - 0, for otherwise zk+t is in the linear space generated
by {x1:j E (l,k)}. If we set
Yk+1 = zk+d lzk+1 I.
it follows that {y; : j E ( 1, k + 1)} is an orthonormal set. It is an easy
exercise to show that this set generates the same linear space as the set
{xi: j E ( 1,k + 1)}. The principle of induction shows that there is an
orthonormal set, having the same number of elements as the dimension
of L, which generates L. This proves the theorem.
6.2.9 Definition. If L is a linear subspace of a linear space M C En,
then the orthogonal complement L.1. of L in M is the set of all elements in M
which are orthogonal to every element of L; that is,
U = {x:
x E M & (y) (y E L x
y = 0)}.
6.2.10
Proposition.
where
LEBL1-,
LEBL1-= {z: (3x)(3y)(x E L,y E L1-, & z=x+y)}.
M
Further,
LH
L.
Proof.
x,x1 E
Landy, y1
Ll. and x
+ y=x1 + y1,
then x
- x1 = y - y1 The
x=x1 and y = y1
6.2.3, then
6.1.
q=
J:p(x)q(x)dx.
We put an inner
(6.2.3)
We shall leave as an exercise the fact that this function satisfies condi
tions (a) through (d) of Proposition
6.2.3.
D Exercises
1.
2.
(b)
(c)
6.3
3.
E"
is linearly independent.
4.
En
can be extended to an
E".
5.
Show that
Vx,y E E",
6.
If
y.
x, y
En
= IYI
lxl
and
E2
show that
7.
Suppose that
the form
x + ay
and
x-y
is orthogonal to
E2
varies over
R.
8.
{I,x,x2,x3,x4}
9.
Let
for
R4[x].
vector space
x ;=
Compute the numbers
ak.
x
10.
x E E"
akxk.
k=l
n
y = L (x
k=l
show that
xk)(y xd.
Suppose that
For a given
If
"
set
E".
Ix
f .akxk l =minimum.
k=l
TOPOLOGY IN E"
6.3
E1
R,
functions are discussed in terms of the absolute value, which is the same
as the length function. An alternative, but equivalent, way of discussing
these matters is through the use of open intervals or more generally
open sets. In the same way, we can discuss limit and continuity questions
in
En
open sets.
In
E1
the same thing in En, replacing the open intervals by open cubes or more
generally by higher-dimensional rectangles. Another way is to replace
B(a,p)
The set
B (a, p)
E" defined
by
is called an
6.3.1
Definition.
'
The concept of compactness for E" is the same as the concept of com
pactness for 1
relevant definitions.
6.3.3
Definition.
VxEA,Jxl,,;;;M.
6.3.4
Definition.
bounded.
To discuss the Heine-Borel theorem in
E"
6.3.5
Definition.
ACE"<=>
AC U {U: UEU}.
The collection is said to be an open covering <=> each UE U is open.
6.3.6 Theorem (Heine-Borel). A set ACE" is compact <=> in every
open covering for A there are a finite number of sets which is a covering for A.
6.3
Proof.
ment: An open covering for every closed and bounded set in En reduces
to a finite subcovering. We have shown in Theorem 2.2.5 that
P (1)
A=BX{t},
t E E1 Suppose U is
U E U we set U1 = {x: x EEk
& (x, t) E U}. It is almost immediate that U1 is an open set in Ek and
U1 = {U1: U E U} is an open covering for B. Hence, by P( k) , this re
duces to a finite subcovering for B which in turn implies that U reduces
to a finite subcovering for A.
where
A=B XI,
[a, b] C E1 Let U be an open covering for
[a, b] so that BX [a, t] is covered by
a finite number of sets in U. J is nonvoid, since, by the first paragraph,
a E]. Let us set t0 = sup]. By the first paragraph of the proof, BX{t0}
is covered by a finite number of sets {Ui: j E (1, n)} CU. At each
point of BX {t0} place an open cube with center at the point and whose
closure is in one of the U1 This is an open covering for BX{t0} and
where B is as before and I=
B X [a, t0 - l] is
- l, t0 + l] is
covered by { U1: j E ( l, n)} it follows that B X [a, t0 + l] is covered
by a finite number of sets in U. Hence we must have t0 = b.
Finally, if A is closed and bounded, it is contained in a large enough
covered by a finite number of sets in U and since BX [t0
cube,
A C Ik
where I=
U
U
[a, b] C E'
and
{Ac}
I,
and
Ac
is the
I taken k times. If
complement of A, then
sequently, U reduces
P(k) P(k + l).
The sufficiency part of the theorem follows exactly the same reason
ing as the sufficiency part of the proof of Theorem 2.2.5, and we shall
leave it for the reader.
The Bolzano-Weierstrass theorem is also true in En, and as we shall
see its proof is an immediate consequence of the Heine-Borel theorem.
6.3. 1
Theorem
(Balzano-Weierstrass).
Every bounded
infinite set
Proof.
accumulation points,
The collection
reduces to a
CONNECTED SETS
relatively
the concept of
open and
relatively
closed sets.
Using the concepts of relatively open sets and relatively closed sets,
it is possible to describe the concept of a connected set.
6.3.9
written
as
the union of two disjoint nonvoid sets that are relatively open in A.
of two or more disjoint pieces. For example, the set that is the union of
the open balls
B( (2, 2), 1)
and
B( (-2, O), 1)
in E2 is a disconnected set.
On the other hand, connected sets in E1 are easy to describe, and this
is often very useful in deciding whether or not a set in higher dimen
sions is connected.
6.3.10
Theorem.
Proof.
A set IC E1 is an interval<=::?
A set in
E1
is connected<=::?it is an interval.
Vx,y E /, {z: x ,s; z ,s; y} C I.
3x,y EA and 3z EAc
J-oo,z[ and ]z,oo[ are open in E1 and
<
<
y.
The intervals
if we set
B=J-oo,z[nA,
then
and
C=]z,oo[nA ,
and
C= 0.
6.3
Conversely, suppose
A n B
xEA, yEB
and suppose x < y. Since I is an interval [x, y] C I. Since A is relatively
open in /, there is an open U so that A= U n I. Now 3 so that
x < < y and [x,U CU. Thus [x,[ CA. Let E= { : [x,U CA};
the set E is nonvoid and bounded above and hence z = I.u.b. E exists
and zE [x, y]. Now z fj. 4,; otherwise; since A is relatively open in/,
there is a > z so that [z, U c A and thus [x,U c A, which means
thatE E and z l.u.b. E. This means that zE B. But since Bis rela
tively open 3 < z so that ]' z] C B, which again contradicts the fact
that z is the least upper bound of E.
Using Theorem 6.3.10 it is possible to prove the connectedness of a
= 0, A
is an interval and
I=A
U B, where
6.3.11
Definition.
straight-line segment
L(x, y)={z: (3t)(tE [O, 1) & z= (I- t)x+ ty}
is contained in
C.
6.3.12
Corollary.
Proof.
Let us set
[O,1) =A1
[O, 1).
[O, 1)
A and B are
A 1 and B1 are
rela
rela
6.4
we shall be able
Theorem.
ponents.
Proof.
ting
If A C En,
y x and y
by set
every
6.3.15 Corollary. IfA C En is open, then the components ofA are open
and there are a countable number of them.
Proof.
B (x,p) CA.
Now,
B (x,p)
x EE.
Let
> 0 so that
B (x,p)
is equivalent to
and thus
is in E. Hence E is open.
Suppose
Q" is
QA
times, and
Q" n A;
that is,
QA
is open,
Q with
itself
QA
is denumerable. (Proof?)
QA
If E is a
component of A, let
NE
E E}.
nE.
= nE.
QA,
6.3
6.3.16
Corollary.
TOPOLOGY
IN E"
I 247
open intervals.
Proof.
A.
The
is
It is clear that
it is clear that
=t= 0. The
A C B0
It is quite possible that a set may be rather "thin" and yet its boundary
may be rather "thick." For example, the rationals in
[O, I]
are in some
sense rather "thin" but the boundary of this set consists of the whole
interval
[O, l].
D Exercises
In the following exercises all sets are to be taken in En, unless otherwise
specified.
I.
open sets is relatively open and the union of any number of relatively
open sets is relatively open.
2.
Show that the union of any finite number of relatively closed sets
3.
Suppose
tively closed. If
that
that
A\U is
A\C is
rela
rela
tively open.
4.
Let
C be relatively compact in A.
d
then
5.
3c E C,
so that d
inf {Ix
le
If
E C},
al?
6.
al :
If a
are
n B 0, show that
U Bis
connected.
7.
Let
A,
8.
If
X B is open in 1+m
{:::A
::}
{:::::}A
If
10.
are compact.
11.
if it is a Cauchy sequence.
12.
If
X Bis connected
{:::A
::}
and
Bare connected.
13.
14.
[A0 U (Ac)oy.
(AoV\ (Ac)o.
6.4
{3A
{3A
A is closed{:::::} {3A
A is open {:::::} {3A
C
C
A.
Ac.
CONTINUOUS FUNCTIONS
6.4
6.4.1
Definition.
If f is a function with J0
(
f) C P andf
( ) C Em
E [B
( a ,8)\ { a }] n J0
(
f) l
x
( )EB
(l ,
E)
.
x-a
6.4.2
Definition.
or f
(
x) - l
asx- a.
If f is a function with J0
(
f) C En and(
f) C Em,
The function I
ZS
6.4.3
Theorem.
A function f with J0
(
f) C P and f
( ) C Em
is
(
is relatively open in J0
J
( ).
continuous :::f:} or every openU C Em, 11
- U)
6.4.4
Definition.
6.4.5
Corollary.
6A.6
Tl(j)
Theorem.
CEm
C E"
and
CE"
is
Proof.
6.4.10
Theorem.
uniformly continuous.
Proof.
6.4.11
Tl(j)
Theorem.
C Em
CE"
and
6.4
l(t)
for
[O, l].
The function f is continuous and since its domain is connected its range
is connected. Butf'/C,(J)
[f'/C, (J) n A] U [f'/C, (J) n BJ, where clearly,
f'/C,(J) n A and f'/C,(J) n B are disjoint, nonvoid, relatively open sets in
(f). This is a contradiction.
We think it is clear that the procedure we have just given can be
generalized in a very simple way.
=
V x, y E
6.4.13
Proposition.
The details of the proof are essentially the same as the proof we have
just given above for convex sets, and we shall leave it as an exercise.
As an example of how Proposition 6.4.13 can be used, let us consider
the ring in E2, which is the set
Let x,y
x1
y1
lxJ cos 8,
JyJ cos <(J,
x2
y2
Jx J sin
Jy I sin
(J,
<P
f1(t)
f2(t)
l(t)
2(1- t) J
G) + (2t- l)y,
'
FIGURE 6.4.1
Proposition.
0 <
x I}.
In other words,A1 is a line segment along they axis and A2 is the graph
of the function given by sin (I/x) for 0 < x 1. The set A is not arcwise
1 2
connected. For suppose g= (g ,g ) is a continuous function defined
on [O,I] and whose range is inA,g(O)= (O,y) and g(l) = (x, sin (l/x)).
Let
B= {t: g(t)
A1},
t0 =sup B.
6.4
PEANO CURVES
[O, l]
[O, l]
where V k E N,
xk
0 or
xk
2. Set
f1(x)
j2(x)
f xi2'
k=I
- 2
f X2k /
'
2
k=I
(J'(x), f2(x) ).
We think it is clear that &>,(J') = &>,(f2 )
[O, l] (see Theorem 3.3.2)
and that &',(J )
[O, l] X [O, l]. To show that f is continuous, it is
enough to show that f1 and f2 are continuous.
l/3 2<n+1l. Suppose that x and a are in Cantor's
Let En= l/2n and 811
J(x)
set and
xk
ak for k E ( 1, 2(n
1))
L,,
Similarly, we get
- 2n
< - 2n+1
00
k=n+2
2k
(x2k-a2d/2
2k
k=I
00
,,,;;;;
Ix- al < 8n
and
x, a EC
and
Ix-al < 8n
==>
Ve> 0,
3n so that
1/2" <
E,
[O, l]
[O, I]
is connected,
[O, l]
(Corollary 6.3.16). If
Consequently, on
joining
g(x) =
Of course,
[(x-a)f(b)
b-a
(b-x)f(a)].
a EC ,
and is a left
x,a
If
= g(a) .
< () ==>
VE >
0, 3 8 so
b,c EC, 0
Vx E Jb, c[,
Suppose
g(x)-g(a) =
b-a < 8, 0
c-b
E.
c-a < 8,
and ]b,c[
E cc.
Then
and consequently
lg(x)-g(a)I
c -b
[(x-b) + (c-x)]e=e.
x,a
g(x) = g(a).
6.4
lim
x .. a
g(x)=g(a).
O Exercises
I. Suppose f is a continuous function with domain the cube
C= {x: jxkl < 1, k E (l,n)} and range in Em. Let a EC and define
a function fk on ] -1, 1 [ by means of the equality
fk(t) J(a+(t ak)ek).
Show that fk is continuous.
2. Let J be a real-valued function defined on 2 as follows:
=
J(x,y)
Let
xy
x 2+y 2'
0,
(x,y) = (O,O).
(Hint:
(x,y)#(O,O),
is not continuous at
x = y.)
(0,0).
3.
(a)
(b)
(c)
(d)
{
{
r
{
f(x,y)=
f(x,y)=
f(x,y) =
f(x,y)
x2 + y
4'
0,
x'+y'
lxl+IYI'
0,
-y
lxl + IYI'
0,
lxl
IYI'
0,
(x,y)#(O,O),
(x,y) = (O,O).
(x,y)#(O,O),
(x,y)=(O,O).
(x,y) #(O,O),
(x,y)=(O,O).
(x,y) # (0,0),
(x,y)= (0,0).
4.
If
(f)
is compact,
5.
If g is continuous at
(a)
and
is continuous at
(b)
If
g(a),
show
a.
Ill
is continuous. Is the
converse true?
6.
If
7.
!7C,(f)
(f),
may be extended to
the closure of
(f)
(f).
is bounded. Give examples which show that this may not remain
pk
8.
Suppose
'fa 0, Vk
9.
open
connected set in
E"
is arcwise connected.
or disconnected?
1 1.
Show the existence of a Peano curve with range the unit cube
inE".
Show that no Peano curve (range [O, l] X [O, l]) can be one-to
(Hint: Consider how two intersecting perpendicular line segments
[O, l] X [O, l] would map under 1-1.)
12.
one.
in
13.
E Em. Define
y)
and
(x,O).
14.
15.
6.5
LINEAR TRANSFORMATIONS
linear transformations.
6.5
6.5.1
Definition. A function T with domain a linear subspace L c V"
and range in vm is said to be a linear transformation \fa,{3 E R and
Vx,y EL,
T(ax
a ER, T(O)
0.
6.5.2
:::::>
Proof.
is one to one we can show that this latter set is linearly independent, we
will have proved the necessity.
If T is one to one it follows that
T(uk)
- 0
Vk E (1,r) , since
0 is
akT(uk)=r( t1 ak uk )
0.
and since
then
T(x)
0 =::::}
r
k=I
ak T(uk)
0.
noted,
is
range in M. If for
(aT)(x)
(S + T) (x)
=
=
aT(x),
S( x) + T(x),
then these two operations will make the set of all linear transformations
with domain L and range in M into a vector space. Indeed, this vector
space is finite-dimensional, and if
pq.
sider a basis of a vector space as a function rather than as a set (see the
discussion in Section 6.1 ). We shall use the terminology
ordered basis
A(uk)
2:
j=l
ajkVj,
kE(l,p).
a21
a12
a22
a,.
a2P
aql
aq2
aqp
[""
qXp
q Xp
Suppose
N. Let
( w1,
X (I,p)
and range in R.
, wr
A (uk)
2:
j=l
a;kB(v;)
kE(l,p).
6.5
It follows that if c1k is in the lth row and kth column of the matrix
representation of B
L
i=I
b1;a;k
bu
b21
b12
b22
b,.
b2q
au
a21
a12
a22
alp
a2P
Cu
C21
C12
C22
Ct p
c 2P
br i
br2
b rq
a q,
aq2
aqp
Crt
cr2
C rp
Thus we see that we get c1k by "multiplying" the lth row of the matrix
of B with the kth column of the matrix of A; that is, bu is multiplied by
a;k and then the result is summed over j to get c1k.
Suppose now that A is a linear transformation with domain a linear
subspace L C vn and range in L. Let 'U
(u 1,
up) be an ordered
basis for L and let us designate the matrix representation of A with
respect to the pair ('U,'U) by [a;k]. We want to investigate the ques
=
ordered basis 'U' = (u'i. , u'v) for Land get a matrix representation
[a' ;k] with respect to the pair ( 'U', 'U'). From this point on,for the sake of
simplicity, let us agree that if [au] is a matrix representation of A with respect
to a pair of ordered bases ( 'U,'U) we shall say that A has the matrix represen
tation [au] with respect to the basis 'U.
Let Q be the linear transformation with domain and range L that is
defined by
u'k = Q(ud =
j=l
Q;ku;.
'U
is
'U'.
We have
A(u'k) = A Q(uk) =
0
L
j=l
s;ku;.
where
s ik
L a i,qlk
l=l
Now,
Q-1 (u i)
L ruui.
l=l
u' 1, we get
Hence, if we write
[q;k]-1
[ rlisik ] u'1
[rik],
we have
(6.5.1)
In Section
of the entries of
[q;k].
vn x vn.
6.5.4
linear subspace L
Vx EL,
IT(x) I
lxl.
(6.5.2)
x EL,
Let
we may write
lxl2
If we apply the transform
T we
T(x)
L 1gk12
k=l
get
p
L gk T(uk),
k=I
L.
If
6.5
and taking the norm of both sides and using the triangle inequality we
get
p
IT(x)i :!S
x by x - y and
6.5.5
3m>
0,
Since
theorem that
Vx
JV(T) we
JV(T),
(6.5.3)
have
Vx
EL we have
llaTll
la l llTll.
Va
E R,
and that
llTll=sup{IT(x)I: lxl=l}.
The proof of this is very simple. Indeed, if
(6.5.3')
and
hence the right side of (6.5.3 ') is dominated by II Tll. On the other hand,
let M0 be the right side of (6.5.3'). Then,
IT(x)I .;;
Vx
oF- 0,
M o.
llTll.;;
M0 This
E (T)}.
(6.5.3")
llTll
=sup {IT(x )
Vz
is a linear subspace of
En,
then
EL,
lzl
=sup { lz
YI: IYI
1 &
y E
(6.5.4)
L}.
lzl=z z/lzl
lz l .
oF- 0,
=
0,
.;; llTll,
for
(6.5.4) we
l}.;;
M o.
Definition.
linear functional.
Mo.
is called a
6.5. 7
Theorem.
EL
A(x)=x
y.
6.5
A(x)
k=l
kA(uk) .
If we set
n
y= L A(uk ) uk>
k=l
A(x) =x y.
z E L, so that Vx E L, A(x) =x z. Then Vx EL, x (y - z)
=0. In particular, take x
y - z and we get jy - zj =0, which implies
y=z.
it is clear that
Suppose
(6.5.5)
Fix
A(x)y=xy1
Since
Aay+13Ax) =aAy(x)
(6.5.5')
A1(y)
N(A)l. =(A1).
Proof.
For every
Thus
suppose that
&2.(A1).J..
Thus we have
Vy
E M,
A(z) y=zA1(y)=O,
from which it follows, upon setting y
E N(A) n N(A).J.. It follows that
Hence
N(A).J.=&C.(A1).
6.5.10
Corollary.
Proof.
Since
The range of
AJN(A).J.
and hence
z= 0, which is a contradiction.
A=
rank
A1
A=dim N(A).J.
=dim
A1
A1
with
A(ud = L aikvi,
i=I
and thus
A1(vJ = L a1ki uk
k=I
and hence
Thus
The matrix
The rank of a linear transformation can be comp uted from its matrix
representation with respect to any ordered p air of ordered bases. The
p recise facts are given in the following proposition.
row
vectors
Proof.
(v1,
,Vq)
(u1,
,Up
C Rand
Since
q
A(uk1) = L aik; vi
J=l
A(ud
L /3;kA(uk;)
i=l
Hence
r
jE(l,q).
ak= L /3;kak;,
i=l
ep)
{ak: k E {l,p)}.
{bk: k E (1, q)}, let ( e1,
for P and (f 1, ,fq)
subspace of
l
y + y ..
transformation
P(x) =y.
The linear transformation
x E L:::P
:> (x) =x.
p2 = p p=p.
(c)
P=P1
{uk:
P(x) = L akuk.
k =l
If we take the dot product of both sides with respect to
ak=P(x)
uk=x
uk, we get
uk.
P1
Hence
r
P(x) = L (x
k=l
uk)uk.
spherical representation
P1 be the projection of En onto the linear subspace
1
O}. P1 may also be described as the projection of
{x: x E En & x
En onto the space generated by the vectors {ek: k E (2, n)}. Here we
i
are taking ek
(e/,
ekn ) where ek = 0 ::::> j # k, el= 1. Let P2
2
1
be the projection of En onto the linear subspace {x: x E En & x = x
O}. The projection of P2 may also be described as the projection of
En onto the space generated by {ek: k E (3,n)}. Note that (P2)
=(P2j(P1)), so that P2 restricted to (P1) is the projection of
the latter subspace onto the subspace (P2). In general, let Pi, j
E (I, n I), be the projection of En onto the subspace {x: x E En
1
& x =
x; =O}. Clearly, the last subspace is the space generated
by {ek: k E (j + l,n)}, and (Pi)=(Pij(Pi_1)),j E (2,n-1).
1
The vector (t , 0;
, 0) is the projection of t onto the subspace
generated by e1, and we have
of a vector in En. Let
=
6.5
COS
e 1)/ltl,
(JI,
e 2)/
FIGURE 6.5.1
If P n ( t) =F
-2
Pn_
P,._3
and moreover
It" I = Iti
(O ,
If
t = 0,
0 , t") =
Pn
-1
Pn
-2
P1 (t) = P n (t).
-1
(JI,
,en -I
0,
=F
but
follows that 61
tn-I
tn
= It I
= It I
sin 61
(Jk E [O,'TT]
set of n-tuples
E [O,27T [.
t2
tk
tn-I
tn
for k
defined by
ltl cos 61
Iti
sin 61 cos 62
It I
sin 61 sin 62
= It I
= It I
sin
(Jk-l
cos
(Jk
(6.5.6)
'
has range all of E". If this function is restricted to S then it is one to one and
its range is all of E" with the exception of the subspace generated by the set
{e1:j E(l,n-2)} U {O}.
0,
6.5
Note that if
= 2,
the formulas
Symmetric Transformations.
and
A is
linear transformation
If
( u1,
ur
A is
A with
A=A1,
we get that
[a;;]
is
A1
C L. The
said to be symmetric
a1ii =a;;=aii.
that
a1;;=a;;.
symmetric.
Vx E
En,
IA(x) xi M lxl2
sup { IA(x)
l} ,
It is clear that
A(x) y = [A(x+y)
"4
(x+y)-A(x-y) (x-y)].
(x-y)]I M[lx+yl2+lx-yl2]
IA(x) xi
xi: lxl = l}
llAll.
(6.5.7)
sphere in L,S ={ x:
lxl = l},
3x0 E S,
Let us set
,0=A(x0) x0
A direct computation shows that
so that
Since
IA(xo ) I
llAII= 11.to l.
:!f::
it follows that
IA(xo)-,o xol2=0.
This means that
(6.5.8)
Any number
,0
x0 E(A)
for which
called an
eigenvector
or
proper vector
A.
for
(6.5.8).
A(x) y = x
,0
A(y)
A.(x
y)=0.
A(x) EMA.l'
This means
Let
A1
be the restriction of
to
M.,,1-;
that is,
A,= AIM..1-.
A1 is a symmetric linear trans
M./
x1 EM./,
so that
lx1I =I
,1
and an
and
{ A.i.
A. r}
so that
(v1,
v r)
is
A.,
(v1,
0
0
vr)
6.5
where the entries off of the main diagonal are zero. This means that
[a;i],
the matrix
( u1,
Ur)
with respect
[bii]
so that
>..
>--1
>..r.
For
E L let us write
Hence we get
r
A(x) = L xk>..kvk>
k=l
r
A(x) . x L >..k(xk)2.
k=l
=
Vx
E L,
>..1 =sup
>..r =inf
Orthogonal Transformations.
and
linear transformation
=Ix!.
l(A)
(A) C L. The
Vx E L, IA (x) I
L and
orthogonal is that
Vx, y
E L,
A(x)
A(y) =x y.
(6.5.9)
A(x)
A(y) =
A(x-y)l2J
A x y)
4 [I { + l2-I
1
=4 [Ix+Yl2-Ix-Yl2J
x Y
At oA
(6.5.10)
E L, I
(x)= x. Indeed,
D Exercises
I.
If T is a linear transformation, show that its range is a linear
space. If T is one to one, show that T-1 is also a linear transformation.
N= {x: x E V
&
T (x)=O},
A0B=IB0A=I,
where I is the identity transformation; that is, Vx
V, I (x) =x.
6.5
ER with
a;;.:
0,
Mlxl.
Vx, y
E En
J (x + y)= J(x)
Show that if f is continuous at
J(y).
x= 0,
point of En.
9.
Let
Ay be
Ay(x) =x y.
Show that
10.
Let
[aii]
( u1,
ur
is any ordered
with
L aiiakJ
if
i # k.
j=I
11.
Let
[aiJ]
[biJ]
( u1,
,Ur
).
Show
is a diagonal matrix.
12.
If L and Mare linear subspaces of E" that have the same dimen
13.
Vx
Show that if
E L,
IA (x) I= lxl.
Vx
E L,
14.
that P2
Give an example that shows that P2= P does not imply that P is a
symmetric linear transformation.
15.
of
Let
(A),
and
P the projection
A= A P.
into itself::::? P
16.
If
P is
(A)
vector in En and
of
of
takes M
11P11= 1 .
If y is a nonzero
onto the linear space generated by y and the result of the first sen
17.
6.6
DETERMINANTS
permutations,
selves.
6.6.1
as
For example, if
(1, n),
then a permutation is
simply a function that takes these integers into each other. It is not
difficult to show, although we shall not do it, that each permutation is
a composite of permutations that permute or interchange only two
elements in the original set and leave the remaining elements fixed.
This leads to the important concept of an
even
or
odd
permutation:
(I, n)
Sn
S n(<T)=
IJ (<r(k)-<r(j)).
l:Ej<kn
(6.6.1)
6.6
DETERMINANTS I 275
The definition of the product on the right has already been indicated
in Section 3.5. To make this precise we first remark that the set
elements.
An
Let
An.
(I , n(n-I) /2)
is defined as
Sn(<r)=
n<n-1)/2
TI Jrr0<l>(k),
k=l
An as
1 <r(k) - <rU)
U'jk = -1
<r(k) - <r(j)
follows:
> 0,
< 0.
We claim that
TI <T1k[<r(k)-<r(j)J = TI < k- 1)
1j<kn
li<kn
where these products are defined in a manner indicated in the last
paragraph. Indeed, for each permutation
having domain
A"
Fcr(j, k)
If
<T
let
Fcr
be that function
<Tjd<r(k)- <r(j)].
qr
F, (j,k)
k - j= f, (j, k).
An by
n<n-1)/2
TI
k=l
F<T 0<l>(k) =
n(n-ll/2
TI
k=l
F<T 0 qr 0<l>(k)=
n(n-1)/2
TI
k=l
F, 0<l>(k)'
<T =
TI <Tjk,
l:!Si<kn
(6.6.2)
Sn( <r)
where
<rand
sgn <rSn ( L )
(6.6.3)
T,
sgn
T 0
(6.6.4)
<r.
{l,n).We set
Sn(f)
TI (f(k) - J(j))
1E.i<kn
We claim that
S,,(f
0 a-
) = sgn
<r
Sn(f).
Indeed, the proof of this carries over mutatis mutandis from the proof
of (6.6.3) given just previous to the statement of this theorem, where
in that case f was taken as the identity permutation L. If in the last
formula we replace f by T we get (6.6.4).
Suppose now that [ a ii] is a square n X n matrix. One way of defining
a determinant of a matrix, which the reader has probably already seen
in his previous studies, is as follows:
(6.6.5)
<r
6.6
DETERMINANTS I 277
n
A multilinear functional D defined on the m-fold Cartesian product of v
<T
D(a1,a 2,
,am)
,en) = I ,
6.6.4
Theorem.
(6.6.5) is a
determinant function.
Proof.
Let us set
D(a1, ,am )
anun.
Now, Va,{3 ER
D(a1, ,aa k + f3bk>,a,.)
=
f3
(aakuk +
f3bkud
a11Un
bkuk anun
1
T- (k). Thus, since Tis a permutation and
TI
j=l
aTjuj =
L
u
(sgn <r)
TI
i=l
TI
k=l
1
T- )
a Tiui = sgn T
akUT -lk.
(sgn <r
'T-1
TI
k=l
aku ,-1k .
Now, as <T goes over the set of permutations of ( 1, n) , <T 0 T-1 also goes
over this set. Thus the function <l>A<r)= <T 0 T-1 is a permutation of the
collection of permutations of ( 1, n). Because a finite sum is independent
of the order of summation, we have
2:
(sgn
<T 0
1
T- )
<T
IJ ak<N-1k
k=I
2:
(sgn
<T
akrrk.
TI
k
=I
a m )= sgn
TD ( a 1 , ,an),
so that D is alternating.
Finally,
D ( e 1,
,en)=
2:
(sgn
<T)e1 rr
enrr".
<T
Since ekrrk=0, unless <Tk k, we see that the only nonzero term on
the right appears when <T is the identity permutation. Thus
=
D ( e 1, - ,
e 11 )=
l,
Theorem.
satisfies the
following:
(a)
(b)
xJ=xk=>D(x1,-,xi,,xk>,xm)=O.
(c)
Proof.
Since
(c)
X;
6.6
DETERMINANTS I 279
By part (b) the second term on the right is zero, which completes the
proof.
D(e;., , e;m)
Proof.
a(ii, .im)
.im). If we set
F=D
, e;m) =a (j1,
E,
im <S: n.
...'Xm be vectors in vn and set
xk = L xkie;.
j=i
Using the multilinearity of F we find that
F(xi,
",Xm) =
L
im=l
L
i1=l
If ir = i, for some rands, it follows from part (b) of the last theorem that
F(e;,, , e;m) = 0. If the ik are all different, let ii , , im be a re
numbering of the ik so that ii < i2 < < im Let CT be the permuta
tion given by i.,.k
ik. Then
=
F(e;1,
e;m)
,
, e;m )
6.6. 7 Corollary. There exists one and only one determinant function on
the n-fold Cartesian product of vn with itself.
Proof.
6.6.8
Theorem (Binet-Cauchy).
and [bjj] an n
<let
Let
m :o:;;
n, [a;;] an m
n matrix,
m matrix. Then
[a;j][b;;]
Let
of the matrix
(aiki
aikm),
the vectors
ek1[b;i]
Thus we see that
, bk1m)
6.6
DETERMINANTS I 281
e ktkt
ek/
"'
=l.
ekmk1
ekmkm
-,km),
jt
then
jm
{j;: i
< <
3j
and
k1
k1
<
D(ekp ,ekm)
Corollary.
6.6.9
< km :;;:; n,
<
If [a;;] and
det[a;;] [bu]
km
:;;:;
we have
E(ekp , ekm).
D
[b;;]
<
E,
are n
n matri.ces, then
det[aii]det[b;;] .
From the formula (6.6.5) it is very easy to see that the determinant
of a square matrix may also be considered as an alternating multilinear
functional of the column vectors of the matrix. Indeed, as the permu
tation
<T
aa-lkk
a1a1a2a2 .
Since sgn
<T
(j )
and
<T =
sgn
t
<T- ,
. llnan= lla-t11llu-l22 .
it follows that
. lla-tnn.
<T
6.6.10
Theorem.
satisfy a1u
a;;, then
det[au]1
det [ au J .
(6.6.7)
1)
(n
1)
usual set
"
ai
If we set D(a1,
(a;i. ,ai,,)
k=l
aikek.
D we have
det[a;;]
k=l
,a,,),
(6.6.8)
,a n). Set
(a;2,
,a;,,).
The number on the right and hence the number on the left side depends
only on a'2,
6.6
DETERMINANTS I 283
e)=D(e1, e2,
1) fold
,en)= 1,
(6.6.9)
:s;;
n and
<r
be
i
1,
iE(2,j),
iE(j+l,n).
=
sgn
Thus, from
<r =
(6.6.9),
= ( l)i-1
-
<r
a u-1>n
au+ on
,an, then
rr(i) =
Then
au-u2
au+u2
,an), where
Let
I)i-1
=>i=l ,
1 =>iE(2, k)
i =>iE(k+l,n).
Finally, let us take the transpose of the matrix having rows ( ba1 ,
The new matrix has
e1
brr,.).
we get
Finally, if
ek is
a2ck-o
a2ck+o
aack-o
aack+o
anck-1>
anck+O
e1
find that
D ai.
au
(- I ) i+ k
ek,
.. 'a,.
)
aHk-o
aHk+i>
aln
au-m
au-1xk-o
ac;-O<k+O
au-on
au+o1
au+ock-o
au+i><k+o
a< i+l)n
ant
an<k-0
anck+O
a,.n
ai2
an2
(6.6.10)
'
6.6.11
Theorem.
If [aij] is any n
n matrix, then Vj
det [aij]
k=l
aik
Co (aid =
aki
Co (aki ) .
( 1, n} ,
(6.6.11)
k=l
6.6
[au]
[bu] be
i j,
[au] by
is an
with
and let
of
DETERMINANTS I 285
obtained from
[au]
and
[bu] has
of
[bu]
Vi,j
(6.6.11)
b;k= aik ! )
2,
k=l
aik Co(a;k)= 0.
(1, n)
we may write
2,
k=I
where
B;i= 0
adjoint of [a;1],
{)ii= 1.
Co(a1;). This
j &
(6.6.12)
(i,j)
(6.6.12)
shows that
(6.6.13)
By working with the columns of
(6.6.13')
(6.6.13')
{x1
{y1,
y.. }
x.. }
(6.6.14)
is satisfied. If we perform the matrix multiplication and equate corre
sponding entries on each side, we see that this is the same as a system of
n
(6.6.13')
on the right
(adj[a; ]) [Ytn]
Y
(det[a.,])
r:l ]
(6.6.15)
x,.
xk d et[ai;]= 2,
j=l
Y;
Co(a1k).
(6.6.15')
[bii]k which
[aii]
consisting of the
Y;.
Now, if det
[aii] = 0,
be said at this point. However, if det [aii] =/= 0, from (6.6.15') it follows
that
xk-
det[b0] k
det [a ;;]
(6.6.16)
This shows that if det[a;;] =/= 0 and the equation (6.6.14) has a solution,
the solution must be of the form (6.6.16) which implies the solution
is unique.
Conversely, if we suppose det
[a;;]
=/=
by (6.6.16) satisfy the equation (6.6.15') or, what is the same thing, the
equation (6.6.15). Multiply both sides of equation (6.6.15) on the left
by the matrix
[aii]
every ordered
8iJ = 0 if i =/= j, 8;; 1. If A is a linear transformation
inverse, [a;;] a matrix representation of A with respect to an
basis, and [aiJJ-1 denotes the matrix representation of A-1
[ 8;;],
with an
ordered
where
[a;;]-1
(det
[a;;))-1
(q;;]
6.6
DETERMINANTS I 287
Hence
det
[ti;]
det
[t;;].
6.6.13
Theorem.
linear transformation
is nonsingular<==> det T
- -0.
Proof.
I and hence
1.
0. Suppose
( u1,
,Un
T(uk)
j=l
t;ku;,
Then
"
T(x)
Since the
u;
k=1
k
x T(uk)
equations
n
k=1
k
t;kx
o,
( l , n) .
k
(1, n), x
0. Hence
t11
t21
/:;.k=
t12
tz2
t1k
t2k
(6.6.17)
tk l
tk2
tkk
VkE(l,n).
(6.6.19)
(u1 , , un). Then the matrix [t;;] itself is symmetric; that is, tiJ
t;;.
T
are bounded below by m. If {i1.k: kE ( 1, n)} are the eigenvalues of T,
then since det T is independent of any matrix representation, we have
=
n
/:;.n=det T= I1 A.k
k=I
Now,
mn > 0.
Tk(u;)= L tuui.
jE (l,k).
i=l
The symmetry of
get that
n=1, it
n=p
1 < q. Let
T be a
VkE (l,p). By
6.6
lip
l11
DETERMINANTS I 289
l11
S22
S2p
Sp2
Spp
Lip =
lpp
lpl
where,sincelrs=lsr.
Vr,s E(l,p),
VkE (2,p) ,
rk-1 =
it follows that
VkE(2,p).
Thus
(6.6 2 0)
VkE(l,p-1).
Sui =
L s u ui
i=2
j E(2,p).
>
Vx E .:B(S),
o/:- 0, we have
S(x)
> 0.
kE(2,p).
It is an easy matter to check that ( u1, u2,
and we leave this for the reader. Now,
T(u;)
Vj,kE (2,p),
t it k
ufc=tki- i t =S(ui)
t11
T(u 1)
Suppose now that
u;,
u1 T(u;,) = 0.
uk.
Then, since
u1
T(uk)
T(x) x
1
x u1
k =2
0, we have, if
xkuk,
1
t11(x )2
t11(x1)2
1
t11(x )2
+ S(x')
#- 0,
f f xixkT(uj)
p
L L
k=2 j=2
xixkS(u;)
x '>
ufc
uk
0.
k=2 j=2
x > 0.
T (x)
If x
m lxl2
T(x) x
-m lxl2
(6.6.21)
are
VkE(l,n).
(6.6.22)
In case <let T
an #- 0, but the conditions (6.6.19) do not hold and the
conditions (6.6.22) do not hold, then 3x so that T(x) x > 0 and 3y so that
T(y) y < 0.
=
Proof.
If we set S =
transformation S, then
the conditions (6.6.22) do not hold, then T must have at least one posi
tive and one negative eigenvalue. Taking x a nonzero eigenvector for
the positive eigenvalue and y a nonzero eigenvector for the negative
eigenvalue completes the proof.
6.6
DETERMINANTS I 291
D Exercises
1.
(c)
-1
I 1
-2
-2
-1
-1
1
1
3
(b)
( d)
-3
-1
a
a
1
c
2
c
(a)
x1 - 2x2+ x3=0
3x1+ x2+3x3=0.
(b)
2x1+ 4 x2 - 3xa = 3
3x1 - 8x2+ 6x3=
a =
, n)
det [a ii ] D (e i .
,en ) .
4.
Let <r be a permutation of (1, n) that interchanges only two
elements; that is, u(j ) = k, u(k) = j, and Vi, i j, k, u(i) =i. Assuming
j k, give all the details of the proof that sgn <r = 1
-
5.
Let [a u ] be an n
n matrix with
det[a;;] =
a;;= 0
if i
> j.
Prove that
n akk
kI
1.
-3
1
-2
-n
1
A-
-A.
is odd, prove
0.
(r + s)
(r + s)
[ i ].
where
is an
rXr
matrix,
is an
J
(Hint: Fix
r X s matrix, C is
s X r zero matrix.
(<let
an
X s matrix,
Show that
A)(<let C).
D (A, B, C)
<let
[i l
vn.
Hence by Exercise 3
D(A,B,C)
where I is the
( det C)D(A B I)
,
identify matrix
[Sii].
A.)
I 0.
an
a2n
ann
,
has a nonzero solution
11.
(x1,
Xn)
x'
l f ] fl
<let
0
.
.
0
O
X2
.
.
.
Xn
[aii]
0.
out rows and/or columns from the original matrix. Let T be a linear
transformation and
[tii]
p there exists a
p X p submatrix of [tiJ] with nonvanishing determinant, and any sub
ordered pair of ordered bases. Show that rank T
6.7
6. 7
FUNCTION SPACES
Vx EK,
Vx EK.
11111=0 <=> f= 0.
llafll = la l llJll,
and Vf, g E Cm(K) the triangle inequality
II!+ gll
:s;;
11!11 + llgll
is valid.' The distance between J and g is taken as the number llJ- gll.
Using this definition of distance it is clear how to define a Cauchy
sequence in Cm(K). In Theorem 3.4.5 we proved that a uniformly
Cauchy function sequence is uniformly convergent. This was done for
function sequences whose elements were real-valued functions with a
common domain in R. However, the proof carries over mutatis mutandis
to Cauchy sequences in Cm(K). In Theorem 3.4.6 we proved that a
uniformly convergent sequence of continuous functions converges to a
continuous function. This also carries over to Cm(K). Thus we have the
following fact.
<
p}.
is said to be the open ball in Cm(K) with center g and radius p. Using
the concept of open ball, an open set in Cm(K) is a set for which every
Theorem.
Cm(K)
space
Cm(K)
Cm(K)
Cm(K)
I covers A. Hence there must be one of these balls, say B0, so that no
B0 Now, B0 (being a subset of A) is totally
bounded. Indeed, V > 0 we may cover A by a finite number of balls
of radius e/2 and hence a finite number of these balls covers B0 The
only question is whether these balls have their centers in B0 However,
if a ball of radius e/2 has a nonvoid intersection with B0 we can put a
finite subset of 'U covers
'U covers
complete, it follow.s
that
covering for A,
N ==> gn E U
3U
and
The part of the proof that requires the axiom of choice is the rigorous
function
G n( O ) =B(go. 1),
G,,(k+ 1) =F(Gn(k)) ,
kE(O,n-1), nl.
Theorem.
A.
'Pn(k)
=min {Ak\{q;n(l),
",'{)n(k-1)}}.
g EA.
If the set A is not totally bounded, then 3E: > 0 so that no finite set of
balls of radius E covers A. Pick g0 EA, and since B(g0, E ) does not
cover A we may pick g1 E A n B(g0, E ) c. Since B(g0, E ) and B(g1, E )
6.7
g2 in A and
outside of B(g0,
(gk)
E)
with range in
Hence no subsequence of
(gn)
B(g1, e ) .
A so that
is Cauchy
(gk)
of the last
X
X which satisfies the
defined on XX
following:
(a)
(b)
(c)
(d)
Vx,yEX.
d(x,y)O.
d(x,y)=0 {::::} x = y.
d(x, y) = d(y, x),
Vx,y EX.
d(x,y) d(x,z) +d(z,y),
Vx,y,zEX.
C m(K ).
Cm(K)
En we get equivalence
6. 7.6
Ve
Definition.
> 0, 36 > 0
IJ(x) - f(y)I
<
E.
&
II/II l}.
The set
fn(x) = x"
are in
A,
A.
lgdx) - gk(y)I
<
e/3.
E.
Hence A is equicontinuous.
To prove the converse statement we shall prove that A is totally
bounded. Since A is closed it will follow by Theorem 6.7.4 that A is
compact. Note that the axiom of choice is needed, since it is needed
in part of the proof of Theorem 6.7.4.
Since A is equicontinuous, VE > 0, 3 8 > 0 so that V x, y E K with/
Ix - Y I < 8 and VJ EA we have IJ(x) - J(y)I < e/4 . Since Kis co n:f
pact, it is totally bounded and hence there is a finite set of balls {B(xk, 8}':
kE ( 1,p)} which covers K. Since A is bounded, each set
kE(l,p) ,
is bounded, and thus totally bounded. Hence there is a finite set of
balls{B(f;(xd, e/4): i E (l,pk)}which coversAk.
Suppose, for the moment, that we are able to find a set { hk: k
E ( 1, p)} of real-valued nonnegative continuous functions on K so
that Vx E B(xk, 8)c n K, hk(x)= 0, and Vx EK,
k=l
Letj = (j1,
g i (x)
k=l
J(x)
L hk(x)J(x),
k=l
6.7
we get
jgi(x)-f(x)I
For every
lgi(x)-f(x)I
<
(E/2) L hk(x)
kEN(Xl
E/2.
E/2.
From this we conclude that A is totally bounded. The only small point
to be clarified is that the centers of the balls may not lie in A. However,
as in the proof of Theorem 6.7.4, for each of these balls which has a
nonvoid intersection with A we choose a point in this intersection and
take the ball with the chosen point as center and radius
a finite set of balls with centers in A and radius
E which
E.
This gives
cover A.
( 1, p)}.
Let
cp
'P(x)
The function
cp
{hk(x):
{ 1- lxl
<:::::}
0 <:::::}
lxl
l xl
1,
;;.;,: 1.
'Pk
with domain
E'', by setting
Vx E B(xk,o)c, cpk(x)= 0.
If we set
B= u {B(xk,o): k E (l,p)},
then
Vx EB,
p
Let us
<l>(x)= L 'Pk(x) # 0.
k=l
define hk on K by putting Vx EK
hk(x)
cpk(x)/<J>(x).
We see immediately that this set of functions has the required properties.
This completes the proof.
REMARK:
{hk: k E (1,p)}
C1(K).
(a)
[0, I].
If
P (x, y) =
ai,!cX'iyk'
k=l j=l
(x, y) E K .
algebra in
point of
vanish at a fixed
]-a, oo[
and values (t
) 112
a>
0, the
is infinitely differ
entiable. By using, for example, the Lagrange form of the remainder, the
reader may easily convince himself that the Taylor expansion of this
function around t = 1/2 converges uniformly in the interval
If we replace t by y2 and
by
2
E , E
[O, I J .
6.7
nomial
p so
that
jp(O)I< 2e and Vy
[O, l] ,
e[ (y2 +E2)1/2
(y2) 112)
,,,,;; [ (y2 +e2) 112 + (y2)112][ (y2 +e2) 112
Thus, since
we have
If
Vx
E K,
Since A is an algebra,
lgl
can be
lgl
EA. If
l lgll
> 1, then
g/llgll
h=
max
EA=>
lgl
EA.
(g,h) and g
/\
h=
min
(g,h)
g (\ h
and the facts that
(c)
If
and Va, b
Vx
g+ h
E K,
ER,
3f
By hypothesis
3g
h= 2 [g +h +lg- IL
1
E A and
jg- hi
EA.
EA
EA
3g,h
EA so that
3p
EK
so that x
= y
EA so that
let us set
371
> 0 so that if
iol <
11 and
lei<
11 then
ip(o, e)
E,
IEI
< YJ,
so that if we set
f= aq+ f3q2,
where
a= aq(x)+ f3q2(x),
b
aq(y)+ f3q2(y).
>
Pxu(x)
f(y)
Fix
y and
Pxu
contains
<
Pxu(z)}
{ Uxk :
qy
then by (b) we know that
Px1Y
qy EA.
f(z)
Pxmy;
Moreover,
- E
<
Vz E K,
qy(z).
Since
qy
and
f are
Vu is
- E
<
h (z ) < f(z) +
E,
/\
6.7
E.
A is closed, f EA.
If 3a EK
(e)
& g(a) = O} .
so
that VJ E A,f(a) =
0,
which has the value e(x) = 1 for every x EK. Then Ai is a closed,
separating algebra which satisfies the additional hypotheses of (c).
Thus Ai= C(K) which means that Vf E C(K) so that f(a) = 0 and
V E>
0,
3g EA and c E R so that
llf-gll<
E,
and this shows that f EA. This completes the proof of the theorem.
D Exercises
1.
Show that Cm(K) is finite-dimensional K has a finite number
of points.
2.
3.
6.7.4.
Let A be a bounded equicontinuous set of functions in C(K).
4.
Show that
Show
6.
C
E.
7.
Let K = [l,
oo[,
Vx E K and Vn E N set
and
f,.(x)=x-1111
and A= {!11:
9.
[O, 27T[
x E
and defined by
g(8)=(cos 8,
sin
8)
E A
n
g(8)= L ak
k=O
cos
k8
+ L bk
k=l
sin
k8.
Show that the closure of A is C(T). This result is a slightly less refined
version of a result called
10.
If I=
[O, l]
Fejer's theorem.
gk
E C(I).
form
f(x)= p(x)e-lxl'
where p is a polynomial. Show that the closure of A is the set
C(R) & limx-oo
{f: f
CHAPTER
71 HIGHER-DIMEN
SIONAL DIFFERENTIATION
7.1
MOTIVATION
can be defined in
is in Em,
{ (x, f(x)): x
defined
trace of a path,
F with
values
F(x)
(x, J(x)).
305
The function
vector
to F at
F has
c E 1
tangent
(interior of I) is
F'(c)
The tangent line to F at
(l,f'(c)).
;eF<cJ by a
affine space.
constant vector
x2
FIGURE 7 .1.1
{ (x, g(x)): x E
'(g)}, considered as a subset of E3 Such a graph is often called the
trace of a suface or trace of a surface element, the surface or surface element
E2 The graph of
has domain
'(g)
at a point
interior to
'(g).
g,
x3
with all
lines the collection of tangent lines if they form an affine space, could
possibly be called the
tangent plane
to
G at c
(Fig. 7 .1.2).
7.1
MOTIVATION\ 307
x3
Al
FIGURE 7 .1.2
Let us make a little more precise the things we have said in the
previous paragraph. A two-dimensional linear space which contains
the vector
(u,O) =(u1, u2,O) and the set of all vectors (0, O,x3) is the set
.Pu={t(u,0) +(O,0, x3): t,x3 E R}.
A p lane through
c E .B(g)
and parallel to
Pu(c)=Pu+C.
If we intersect this space with the graph of
{(c+tu,g(c+tu)): c+tu
(g)}.
Gu(t)
where
(c+tu,g(c+tu)) ,
If we assume that
G'u(O)
u.
0 E /0
and
c+tu E (g).
Dug(C) - r
direction
Gu defined by
g(c+hu) - g(c)
h
directional derivative
(7. l. l)
of
g at c in the
We shall set
directional derivative
h
of
at
in the direction
(7.1.l')
u.
08 I HIGHER-DIMENSIONAL DIFFERENTIATION
Vu
E2, DuG(c)
de c<c>
{DuG(c): u
E2}
(7.1.2)
is a linear subspace of E3, then there is some temptation to say that the
surface
c.
dec<c>
DuG(c)
range in 3 Indeed,
Vu, v
and
Va, {3
E R,
E2
and
3w E 2 so that
DwG(c).
Dau+nv G(c)
au + {3v
=
and hence
aDuG(c) + f3DvG(c).
Lc<c>
(7.1.3)
with domain
E2
and
defined by
Lc<c>(u) = D,,G(c)
is a linear transformation with range in
is the linear space
dec<c> defined
E3
(7.1.4)
Indeed, the range of
Lc<c>
by (7.1.2).
In the case where the domain of g is in R, the fact that g' (c) exists
before that
which
R for
(7.1.5)
However, if we can say that VE> 0, 31) > 0 so that for all
lul
c.
is continuous
Vv
E2 with
we have
IG(c+v)-G(c)-Lc<c>(v)I,;;; E lvl.
Lc<c> is a linear
ILc<c>(v)I,;;;; M lvl. If
Since
E2 with
transformation, 3M so that
Vv
(7 .1.6)
E
E2
we have
7.2
Vv
lvl+ IL c<ci(v)I
(M + 1) lvl.
G at c.
affine space
T G<c>=Lace>+ G(c).
There are any number of examples which show that
DuG(c) exists
G is not continuous at c. From our previous remarks
it follows that such a G does not have a tangent plane at c. The example
Vu
E E2 but that
,
we shall give is a very standard one. We feel sure that the reader will
be able to construct many more without difficulty.
Let us write the points in
g(x,y )_
{ xy2/[x2+y4],
0
if
Vx
x
0,
0.
g is not
(O,O). However, let us show that Dcu.vig(O,O) exists
2. Since g(O,O) = 0 we have
continuous at
V(u, v)
Hence, if
=O.
7.2
0, we get Dcu.v>g(O,O)
7 .2.1
Definition. Suppose f is a function with domain in En and range
in Em. The function f is said to have a directional derivative at a in the direction
u ::::) a is an interior point of If) (J ), u E En, and for h E R the limit
Duf(a) =Jim
h-0
J(a+hu)- J(a)
h
in the direction
only if
lul
af(a)
oxk
Dk f(a) .
an
have in mind, for example, the formula for the chain rule which we
shall develop in Section
fxk
and
7.2.1 has
become more popular with the advent of the modern theory of partial
differential equations since
D k can
and V f E .P,
Duf
is well defined;
more likely than not it will be the function whose domain is the null
set. However, regardless of this, the composite function
Dv Du
0
can
be defined and its domain is also .P. This way of looking at things
will be convenient when we discuss higher-order derivatives and we
shall consider the matter again in Section
7.4.
Em,
IJ(x) - J(a) - L (x - a) I
,,;;;
Ix - al.
(7.2.l)
df with domain
FJ(df) is df(x).
Some authors prefer to use the word 'derivative' for df and reserve
the word 'differential' for the function defined on FJ(df) X En with
values df(x) (u). Since from a strictly logical point of view df(a) is not
the derivative of f at a when n = 1, we shall not use this terminology.
The differential of
Vx
L the
differental of
7.2.2.
at a.
38> 0
so that
IL(u)-L1(u)I
Vu
IJ(a+u)-J(a)-L(u)I +IJ(a+u)-f(a)-L1(u)I
2elul < 2e8.
:s:;
:s:;
Now,
Vv
E En
E En, if
0,
then
L1(av)
aL1(v)
Vv
8v/2lvl
E E" and
lul < 8.
aL(v) and
Va
E R,
L(av)
we get
(8/2lvl) IL(v)-L1(v)I
Hence
L1(v).
7.1
and we shall
not repeat it here. However, we should call attention to the fact that
the proof of Proposition
7.2.3
linearity of the differential. On the other hand, the proof given above
of the unicity of the function L which satisfies the conditions of Defi
nition
that is,
L(au)
aL(u).
7 .2.4
Em
Duf(a)
Proof.
Suppose
Ve> 0, 38> 0
so
E E"
that
Vh
and
df(a) (u) .
lul
E R with
E -B{f) and
If we divide by
e lhl.
v E
If
E" and
Thus
1
Dv11v1f(a) =
df(a)(v).
But
Va E R, a
0,
so that if
Thus
Duf(a)
Dauf(a) = aDuf(a) .
As we saw at the end of Section 7.1, the converse of the last proposi
tion is not true. That is to say, if
necessarily true that
Vu E
has a differential at
E",
a.
Duf(a)
exists, it is not
lau+ f3vl.
(7.2.2)
lf3vl
<
8i.
Ve
> 0,
381
> 0,
then
lf3vl .
laul + lf3vl
D1F(a, {3)
Since
f/2,(F )
l
lim _
h-0 h
lim
h-0
Duf(a+au+{3v).
C Em we may write
F(a, {3)
where
< p. We have
f/2,(Fk)
L Fk(a, {3)ek>
k=l
we get
thus
If we now write
f(a+au+{3v)-J(a)
and
take
we have
J(a+au+{3v)-f(a+{3v)
+ f(a+{3v)-J(a),
83 = min (81 , 8 ),
2
then
Va, {3
E R with
IJ(a+au+{3v) - f(a)-aDuf(a)-f3Dvf(a)I
:;;; e
laul +lf3vl
[laul +lf3vlJ.
<
83
(7.2.2')
that
lau+f3vl2
laul2+2au {3v+lf3vl2
;:::,7/2 [laul2+lf3vl2 ]
;::: 7/2 [laul+lf3vl ]2/2 .
=
If we take o
<
from (7.2.2') and the above inequality we get (7.2.2), up to the unimpor
tant factor
\12/71.
(b)
D,,f(a)
Further, VE
> 0,
38
> 0
"
2: ukD.kf(a).
so that lul
<
o =::::}
Dau+fJvf(a)
Dau+fJvf(a) exists
(7.2.3)
k=l
lul.
(7.2.4)
and moreover
aD,,J(a) + ,BDvf(a).
Let us now prove the first statement in (b) by induction. For every
k E (O, n - 1)
let
P(k)
and
u which
{,j: j E ( 1, n) \{i}},
is a linear combination of
(7.2.3')
ji
Lj,,.; ui,i. For every k E N and k n, let P(k) be the state
1 1. Clearly, Vk E N0 and k n, P(k) =::::} P(k + 1).
Now, P(O) is certainly true, since a zero-dimensional subspace consists
only of {O}, and thus (7.2.3') holds. If n
1, there is nothing more to
prove. Hence, suppose n > 1, k < n - 1 and P(k) is true. Suppose u is
a nonzero linear combination of (k + 1) vectors in A. Then 3l E
(1, n) \{i}, so that u1 u - u1,1 is a linear combination of k vectors in
A. From the hypothesis P (k), Va E R, Du1+a./(a) exists and
where
ment:
ji
Now, by part (a) of the proof,
-.
(7.2.3")
since by hypothesis
D,,1.J
is defined on
1
u ,1 and u1 +a,; are linearly independent, the fact that D,,1.J
is defined on B(a, p) and continuous at a, formula (7.2.3"), and the
fact that
inequality (7.2.2).
(c) . The function f has a differential at a. From part (b) we know that
Vu E E",D,,f(a) exists. If we set L(u)
D,,f(a), then (7.2.3) shows
=
7.2
REMARK:
{ei: j
( l, n)}
a was not done for reasons of sophistry. This was done to include the
n 1, where a function has a differential at a point if it has a
case
a,
then
df(a) (u)
Vu
E R,
uf' (a) .
Em
and
df(a)
En
and
( (e1,
n,
ek
df(a) (ek)
m
Dd(a) = L DdJ(a)ei.
df(a)
(7.2.5)
Jacobian matrix
of f at
a,
and, if
m,
the
and is
denoted by ]J(a) . Of course, the reader should be aware of the fact that
the Jacobian matrix of a function can exist at a point without the func
tion having a differential at the point in question.
If f has a differential at
a,
df(a) (u) in
J and the components of u with
(e., ,en) We already noted such a
then it is easy to compute
and apply the linear transformation df(a) to both sides. Noting that
df(a)(e )
k
D f(a), we get
k
df(a)(u)
k=l
ukD f(a).
k
(7.2.6)
The last formula can be put into a form that we feel sure the reader has
seen in elementary calculus, even though the meaning may not have
been clear at that time.
For every k E
(1, n),
range E1 defined by
(7.2.7)
Note that we have also used the same symbol 'xk as a variable. We think
it will always be clear from the context in which way we are using this
symbol, and when we use it as a function it will be clear on which space
it is acting. The function xk is clearly a projection and an almost trivial
calculation shows that Vx, u E En
Duxk(x)= uk.
Since Vu E En, this is a continuous function of x, <fxk(x) exists and
(7.2.8)
Note that <fxk(a) is independent of a so that we will usually write simply
'dxk ' instead of 'dxk(a).' Also note that <fxk(a)(u)
xk(u).
k=l
D f(a) <fxk(a).
k
(7.2.9)
right side of (7.2.9) can be interpreted in a way that will make the
Vu E En we interpret D f(a) dxk(a)(u) as ukD f(a),
k
k
and thus both sides of (7.2.9) are the same when evaluated at every
equality valid:
u E En. The reader may have seen (7.2.9) written in the form
df=
i dxk.
k=l
From the above discussion he should be able to supply the interpretation
of this formula.
D Exercises
I.
follows:
f(x' x2)
f(x1,x2)
(a)
(b)
Show that
Dif(O)
Vu= (u1,u2),
(xx2) t/3.
lx1x21112.
and
so that
2.
D2f(O)
u1u2
E3
'/:-
0.
Duf(O)
and is defined by
df(a).
(a)
(b)
( c)
follows:
f (x )
lxl2
sin
, 12
<=>x
if
'/:-
E2
defined as
0,
x = 0.
5.
Compute
4.
E2
df(O)
E2
but are
exists.
ential at
En.
Show that
Jg
has a differential at
and
g(a)
'/:-
0,
show that
tives exist at
En.
Define
Vf(a)
L Dkf(a)ek>
k=I
a.
If
df(a)
df(a)
Vu
En
u.
En
and that
En, {u:
u E U &
iul
I } by
,
df(a)
when
u= Vf(a)/IVJ(a) I
8.
at
Va E En,
df(a) = f.
9.
Suppose
Em.
Writef componentwise as
J= (J1'
'
Jm) '
where Vk E
Show that
Vu E En,
df(a)(u)= (df1(a)(u),
10.
dfm (a)(u)).
and Vt E R+ so that tx E
En and range in Em
JFJ(f) we have
J(tx) = tPf(x).
df(a) exists,
show that
df(a)(a)
This is known as
Pf(a).
E uler's relation.
11.
E n,
(a)
and set
(b)
If
[bt;] is any
g(x) =
matrix, then
j=l
i=l
L L b;;xixi.
12.
Vx E En set
E"
and both
have range in
Em.
If
J(a) df(a)(u) = 0.
7.3
If we think of
geometric point of view this says that the tangent plane to the unit
sphere at a point is perpendicular to the line from the origin to the
point.
7 .3
DIFFERENTIATION RULES
ence her is that the rules will be stated in the language of differentials.
7 .3.1 Theorem. (a) If f and g each have their domain in E" and range
in Em, and if df(a) and dg(a) exist, then d(j + g)(a) exists and
d(J
(b)
En
d(Jg)(a)
d .!
g (a)
E",
- 0,
dg(a).
g(a)2
(a)
Since
interior point of
Ix - aj/2,
Ix - al/2.
Ix - al.
The point
interpretation of
a is an interior point
g(a) df(a), it follows
of
that
so that
that M
VE>
lg(a)I
Vu
:E; Mand
E En,
==>
lf(x) - f(a)I
<
e/3M,
:E;
(e/3M) Ix - al,
:E;
(e/3M) Ix - al.
Thus we have
:E;
:E;
E Ix - al.
Since
dg(a)
a. Thus,
Ix - al <
x E JV(g) and
g(a) of; 0, 3M>
TJ==>g(x) #;O and ll/g(x)g(a)I :E;Af. We may suppose M is large
enough so that Vu E R, ldg(a)(u) I .s; Mlul. Further, Ve> 0, 38>
0 so that 8 .s; TJ,_ and x E DE9(g) and Ix - al < 8 ==>
0 ano 3"f'/ > 0 so that
since
<
e/2M2
dg(a)(x - a)
g(a)2
Thus
:E;
.s;
E Ix - al.
7.3
df
g(a)
df(g(a))
dg(a).
(7.3.1)
l df(b)(g(x)-g(a))-df(b) 0dg(a)(x-a)I
N lg(x)-g(a)-dg(a)(x-a) I.
Next, Ve> 0, 38' > 0 so that Vx
lg(x)-g(a)-dg(a)(x-a)I
<
(7.3.2)
8' we have
(7.3.3)
elx-al/2N.
If we apply the triangle inequality to (7.3.3) and use the fact that
E F, ldg(a)(u)I Mlul, we get that Vx E cB(g)
3M > 0 so that Vu
with Ix-al < 8',
lg(x)-g(a) I
eIx-al/2N+ ldg(a)(x-a)I
L Ix"'.""' al,
IJ0g(x) -f(b)-df(b)(g(x)-b)I
Let us set 8
min(8', 8"/L); then Vx
have from (7.3.2) through (7.3.5),
=
(7.3.4)
cB(f 0g)
e lg(x) -bl/2L .
(7.3.5)
<
8 we
IJ0g(x)-J0g(a)-df(g(a)) 0dg(a)(x-a)I
IJ0g(x)-f(b)-df(b)(g(x) - b)I
+ ldf(b)(g(x)-b)-df(b) 0dg(a)(x-a) I
elg(x)-g(a)l/2L+N lg(x)-g(a)-dg(a)(x-a)I
eIx-al .
This shows that df 0g(a) exists and moreover has the value given by
(7.3.1).
WARNING:
we get
Dk(J g)i(a)
0
L D;Ji(g(a))Dk gi(a).
i=l
(7.3.6)
i, j, and k. How
agi(a)
axk
Dk gi(a) '
a(J0 g)j(a)
aji(g(a)) agi(a)
=
..,
axk
agi
axk .
i=l
(7.3.6')
dg(a)
df(g(a))-1
df
g(a).
Theorem 4.2.3 and we shall not reproduce it here. However, the reader
may find it rewarding to trace through the proof. Note that in Theorem
of
in the proof.
F or
[O,I] set
F(t)
Either by direct computation or the use of the chain rule we find that
7.3
]O, I[
so that
F(I)-F(O) =F'(y).
But
F(l)
J(b)
and
F(O)
J(a),
J(b)-f(a)
Proof.
L(b
a) .
Write
k=l
Now, Vk E
(I, n),
3 ck E A so that
Jk(b)
For every
u E
fk(a)
dfk(c ) (b - a).
k
En define
L(u)
It is clear that
dfk(c )(u)e .
k
k
k=l
O Exercises
1.
Suppose
f(x)
Let
(x1 )2 - x2x3 -
x1 x3
g1 (x) x1
g2(x) x1
g3(x) = x1
=
cos
sin
sin
x2,
x2 cos x3,
x2 sin x3
2.
sin
Let
of
f 0 g.
kth component of
g,
gk is
the
suppose that
gk(x)
a function depending only on
xk.
g (xk)'
k
Suppose that
is a function with
Jr.g(x) =Jr(g(x))
IJ
k=l
g' (xk) .
k
Suppose
3.
a(x)
Show that
Va E
aa(a)
axu
If
det [xij] .
E9,
=
Co(au) .
D(a g)
0
g;l
g21
g31
g3
g23
g33
g;2
g22
g32
g13
g3
g33
gll
gl
g31
4.
5.
6.
Suppose
3 to
gll
g21
gl
determinants.
df-1(f(a) ).
'
7.
Lobtained in Corollary
bases
8.
Vu E
Em,
3c E A so that
[J(b) - J(a)]
9.
sis that
df(c) (b - a)
u.
V c E A, 11df(c)11
llLll :,;;;; M , where Lis any linear transformation satisfying the conclusion
of Corollary 7.3.5. Hence, deduce that
If(b) - f(a) I
[For definition of the symbol
7.4
II
II
:,;;;; M
see
I b - aI .
(6.5.3).]
Du
on the collection P of all functions each of which has its domain in some
7.4
D uf(x)
happen that
form the
I. I
::::} Vk
N0, tFJ(Fk)=M,
and
5(,(Fk)
(Fk)
tFJ(f) =, 5(,(f)
C so that
VF=(Fd
f(F)(O)=Fo,
f(F) ( n +I) = Fn+ i
E,
f(F) (n).
always
be that one whose domain is the null set. This, of course, allows us the
possibility of establishing the existence off. If
F= (Fk)
E we shall
write
n
TI
k=O
Fk=f(F)(n).
II=m
Fk
(j1,
Fk,
and so on. In
Di=
TI
k=I
D;k=D;n Di n- I
o
D}].
Em,
Definition.
fare
the
Di Dk=Dk Di . How
f it may often happen that Di Dkf=Dk DJ , and
a E J8(f ) it is more likely to happen that Di Dd(a)
=Dk DJ(a).
0
(7.4.1)
Proof.
is real
D;f(x) exists
[D;f(a + hek)
D1 Dkf(a).
Now,
and, of course,
D;f(a)]
Vx E B(a, p)
(7.4.2)
we know that
x E B(a, p)
and we set
Dkdtf(x)
thus
exists.
we can write
Further,
(7.4.3)
t-o
where
depends on
a, h, t, j,
Dkdtf(a + Ohek) in
depends on
(7.4.4)
7.4
O'tei)
<
8 =>
Di Dkf(a) I
0
<
Dkf
e,
\Dkd(a + Ohek)
Di Dd( a ) \
<
e.
If we use the fact that the absolute value is a continuous function, then
by allowing
-7
I [Dd(a
hek)
Dif(a)] - Di Dkf(a)
0
Dk Dd(a)
0
e.
Dkf(a),
DJ=DCTJ.
P(q)
P(l) is true and P(2) is
true by the previous theorem. Suppose q ;;;:-: 2 and assume P ( q) is true;
we shall prove that P(q + 1) is true. Suppose cr(l)=r. If r
1 we may
Proof.
write
(g
follows from
P(q).
If
D;k (D;J) =
q, we
11 D;kf=(ft
(g
D;"K (D;J)
have from
P(q),
D; k (D;J)
kr
P(q)
D;q+I
we get
to both sides.
DIf
(D
= (Ii
=
lq+l
k=2
q 0D- (D f)
fl
I,
'k
k=l
k,,.r
D;uk
1) ,
q+
1. Hence
Definition.
Ck, then from Corollary 7.4.3 it follows that any partial deriva
f of order k is independent of the manner in which the partial
If f E
tive of
derivatives are composed. Hence the operator D'>, which we shall define
in an informal way below, becomes rather useful. Suppose that
C En and
se(f)
C Em and a=
(a1,
set
J(f)
N . We shall
0
(7.4.5)
If
Ck and lal
k, we shall set
D<>J= (J]
where
0Dk<>k (f),
(7.4.6)
f.
The notation
D"f
has become popular with the advent of the modern theory of partial
differential equations.
dkJ(a) (u1,
, ud =fl 0D u J(a) ,
i=l
(7.4.7)
dkj(a) (u)k
dkJ(a) (u,
u),
(7.4.8)
first partials at a point, then the function has a differential at the point.
The same type of result holds for kth-order differentials, although for
the sake of simplicity we shall state it in a slightly less general form than
is possible.
7 .4.6 Theorem. Suppose f has domain in En and range in Em, and there
is a ball B(a, p) C JFJ (J), so that all the partials of f of order k, (k 1)
have B(a,p) in their domains and are continuous there. Then f has a kth
order differential at every x E B(a,p) and
dkJ(x) (u1 ,
uk )
L
n
i1=1
Proof.
U1;,
uk 1 k
IJ
j=l
D1if(x).
(7.4.9)
P(l) is true by Theorem 7.2.5. Assume P(k) is true and we shall try
to prove P(k
+ 1).
+ 1)
1),
7.2.5 has a differential at each point of this ball. Further, Vuk+l E E11
and Vx E B( a,p)
(fI
Duk+1
J=l
D;J (x)
'k+I =1
uk+1ik+1 D;k+1
( fJ
D;J (x).
J=l
Hence, if we apply Duk+1 to both sides of (7.4.9) and use the last equality
and we could have stated the previous theorem in terms of the direc
tional derivatives of any basis for 11 Note that the right side of (7.4.9)
shows that
dkf(x) is multilinear.
d 3f(x) (u)3 in
d3f(x) (u)3
L L .L ii i2uia
u u
.
1 =1 11=1
n
11
a3J(x)
axiaaxt.ax;"
t3=l 2
i
where of course u i is the ii component of the vector u.
f(x)=
Proof.
For
1
m-1 1
L kl. dkf(a)(x - a)k +m.1 dmf(c)(x - a)m.
k
-
VtE [O, l]
(7.4.10)
let us set
F(t)
f(tx +(I - t) a) .
F'(t)
E[O, l]
VkE (1, m)
and
Vt
we have
F (I)=
4.4 .2(c),
we get
i 1i p<k>(O) +
p<m> (y)'
m.
k
k=O .
If we substitute in for
F(l), p<k>(O),
and
yE]O,l[.
p<m>(y)
we have completed
>
1,
Em,
[a, b]
we define
7 .4. 7
Theorem. Suppose the hypotheses of theorem 7.4.7 are satisfied
and in addition VxEL, dmf(tx +(I - t)a)(x - a)m is a continuous func
tion of t. Then VxEL,
m-1 1
f(x)= L I dkf(a)(x - a)k
k.
k=O
7.4
Proof.
F (t)
f(tx + (I - t) a) .
F(l)
I - t)m1 pcm>(t)dt.
i F<k>(O) + Jo{ 1 ((m-1)
.
k=Ok.
ii dkf(x)(u)k
where
"
L u"caDaf(x),
1al=k
p(x) = L Pa (X - a)",
lai"'m
where (x - a)a =Ilj= (xi - ai )ai. From Theorem 7.4.7 we can write
1
p(x) = L (x - a)acaDap(a),
lal.:;m
to
where a!= nil aj ! . If we choose pso that Pa - 0, we see that Ca= I/a!
1
1
f(x) = L -, naf(a)(x - a)a + L I naf(c)(x - a)o:.
a
a.
lal=m .
lal.:;m-1
D Exercises
1.
point a:
(a)
(b)
(c)
2.
3.
Suppose
p(x1' x2)
x1 (x2)2 + 3(x1)2 x2
x1 + 2.
Suppose
convex set in
IJ(b) - f(a)I
,,s; Mlb
(x1 - 1)
and
(x2
1).
0. If B is a compact
- a l2
Suppose
open set in E
6.
n
.
Suppose B C
7.5
n > 1.
Since
that this is essentially the case and our first object in this section is to
prove this, and indeed somewhat more.
7.5.1
Proposition. Suppose f is of class C1 with an open domain in E"
and range in Em. For every compact set K C JFJ(f) and Ve > 0, 3 f> > 0 so
that Vx, y E K with Ix - YI < f> and Vu E E" we have
ldf(x)(u) -df(y)(u)I
,,s;
elul,
Let
{u: u
E P &
lul
l}
(7.5.1)
,,s;
elx - YI.
(7.5.2)
df(x)(u)
L uiDJ!(x),
j=l
ldf(x)(u/lul) - df(y)(u/lul)I
lul
we get
e.
<
last in
7.5
[f(x)-f(y)-df(x)(x-y) ]
u= [df(c)(x-y)-df(x)(x-y)]
u.
lf(x)-f(y)-df(x)(x-y)I
ldf(c)(x-y)-df(x)(x-y)I.
B(a,8(a)).
ldf(x)(u)I
m lul.
(7.5.3)
m lul.
Thus
jdf(x)(u) I
ldf(a)(u)I-m lul
m lul.
3M I
HIGHER-DIMENSIONAL DIFFERENTIATION
Proposition.
(J)
and
ldf(x) (u) I
2m l ul.
(7.5.3')
7.5.l
Thus
that
38,0
Vx,y
<
B(a, 8),
we get from
IJ(x) - f(y)I
<
28
we have
(7.5.4)
ldf(x) (x - y)I-m Ix - YI
m lx-yl.
If B
(a,p)
C f
( ) , we shall show that
B(a, p)
FIGURE 7 5 1
.
We claim that B (f(a), m/2) C f(B (a, p)). To see this lety E B (f(a),
m/2) and let hu be that function with domain B(a, p) and defined by
S , then
m - m/2 =m/2 .
n
h/ (x) = L [Ji (x)
yi]2,
j=l
gk(t) =h/(b+tek),
then gk is defined in some open interval around t =0, and gk(O) is a
local minimum for gk Thus Vk E (I, n),
dgk(O)
= Dkh y 2(b) =0 '
dt
and hence Vu E E",
"
dhi/(b)(u) = L uiDih/(b) =0.
j=l
dhu2(b)(u) =2df(b)(u)
[f(b) -y ] = 0.
Since df(b) is nonsingular, its range is all of E", and thus we must have
y = f(b).
7.5.5 Inverse Function Theorem. Suppose f is of class C1 with (an
open) domain in E" and range in E". If df(a) is nonsingular, then there exist
open sets U and V in En so that a E U, f(a) E V, JIU is a one-to-one June"
tion with range v' and the inverse function is also of class c.
Thus the hypotheses of Theorem 7.3.3 are satisfied, and it follows that
Vy E
V,
dg(y)
exists and
dg(y)
df(g(y))-1
0 l =
df(g(y))-1
(7.5.5)
Vu
dg(y)(u) is continuous.
df(g(y)). Since g is con
L(y)
1
f E C , it follows that Vu E En, the function on V with
values L(y)(u) is continuous. Let b E V and b = f(a). By Corollary
7.5.2 371 and 3M> 0, so that Vx E B (a, 71) and Vu E En
For the sake of simplicity let us set
tinuous and
idf(x)(u)I
This is the same as saying that
JuJ/M.
Vy E f(B(a,71)),
JL(y)(u)J
and
Vu
E En,
JuJ/M.
J[L(y)-1-L(b)-1](u)J
Now,
dg(y)
1
[L(b)-L(y)] L(b)- (u).
IL(y)(u) I
we get
M J[L(b)-L(y)] L (b)-1(u)J.
L(y)-1
3()> 0 so
that
last inequality
Proof.
Corollary.
which is
df(g(y)) dg(y)
0
e;,
l.
we get
df(g(y))(D;g(y))
e;.
(g(y))
DJgk(y) = k
]J(g(y))'
(7.5.6)
7.5
where
by
(7.5.7)
where
0
we get the proof of the corollary. Of course, the precise way to do this
is by induction, but we shall not bother to write this out in a formal
way.
In most instances the easiest way to decide whether or not
REMARKS:
df( a)
{::::} ]1 ( a)
df(a) is
nonsingular
df(x) is
example
] 1, 2[
and
JO,47T[
f1 ( r, 8) = r
2
f (r, 8) = r
and, of course
cos
sin
8,
] 1 (r, 8)
For r
let us set
cos
sin
sin
r cos
-r
Then
r.
df(O, O) = 0,
df(O, O)
is singular.
On the other hand, there are any number of examples which show
that if
df(a)
a.
A specific
J i (x,y) =bJlx1+
+ b3711X711 + aily1+
+ ll1nYn
and range in En. Let us identify E111+" with E111 X En and designate the
points in the latter space by
duf(a, b)
(x,y).
is nonsingular, where by
f(a,y).
Suppose that
dyf(a, b)
f(a, b) =
0 and that
linear function we discussed in the last paragraph and the fact that
J(x,y) =x2+y2-1.
Now, for
of
x.
lxl
<
f(x,y) =
g1(x)=,
g2(x)=-,
so that
f(x, g1 (x)) =0
for j E
(1, 2)
and
lxl
<
1.
already illustrates several points. First, we can only "solve" the equation
7.5
in
in an open neighborhood of
paragraph.
As a second example we shall consider a function with domain in
and range in
R,
f1(x,y, z) = x2 + y2 + z2 - 1,
f2(x,y,z) = xy - z ,
and f(x,y,z) = (f1(x,y,z), f2(x,y,z)). If we "solve" the equation
f(x,y,z) = 0 by elementary techniques, for y and z in terms of x, we
get the following two C00 "solutions" for lxl < 1:
{l,x),
g {x) =- {l,x).
2
g1 (x) =
V(a, b, c)
3 so that lal
one
<
2 and
2z
-1
=-2(y+az).
y + az = 0, and the only points
(a,b,c) where both the Jacobian and f(a,b,c) vanish are (1,0,0).
d<y,zJ
is singular.
Proof.
Let us identify Em+n with E"' XE" in the obvious way, and
Vx E >(!)
we set
F(x)=(A0P(x),f(x)) ,
then
Further,
Vu EEm+n
>(/)
we have
dF(a)(u)=(dA0P(a)(u),df(a)(u))
=(A0P(u),df(a)(u)).
dF(a) is Em+n. Indeed, let (v1,v2) EEm+n and u1 E Af1.
A0P(u1)=v1,u2EM so that df(a)(u2)=v2-df(a)(u1),
and u=u1 +u2 Then A 0P(u)=v1, df(a)(u)=v2, and we see that
dF(a)(u)=(v1,v2). Consequently dF(a) has rank m + n, which means
The range of
so
that
it is nonsingular.
If we apply the Inverse Function Theorem to F, we find that there is
an open set V C
>(/)
containing
ing F(a) so that FIV is a one-to-one function with range Wand having
an inverse function
W1= { T:
T E W1
A P (a) E W1 .
0
let u s set
h(T)=G(T,0).
Then
h E Cq
and since
d G (T,0)
dG(T,O)JE"'
has rank
m.
But
Vu EE"'
we have
7.5
d G(r,O) (u,O) =
Hence rank dh(T)
i=I
m.
Now,
h(A0P(a))=G(A0P(a),O)=G(A0P(a),f(a))
(7.5.8)
=GF (a)=a.
Further,
Vr
E W1
AoP0h(r)=r ,
(7.5.9)
f 0h(r)=O.
x
Note also, if
E V and
(7.5.10)
E Wand
Finally,let us setU=W1-A
&
E W1}
Then
Vt
g (t)=h(r),
Clearly
E Em &
(7.5.11)
t=r-A0P(a)
t=r-A0P (a).
itself. We can get any number of other functions that satisfy the con
Proof.
Em X En by (x,y).
E". From the formula
u= (0, u 2)
E Em X
11
see that
F(x,y) = (x,f(x,y)).
If we apply the proof of the last theorem, we find that there is an open
neighborhood U C
C (J)
containing
(7.5.8) we have
Further, from
(7.5.9)
we have
h1(x) =P0h(x) =x ,
and thus from
(7.5.10)
w e get
(x,y)
Dm+d1(a,b)
1
Dm+nf (a,b)
Dm+nf"(a,b)
Jacobian, that is, the determinant of the above matrix, does not vanish.
The reader may find it instructive to go back and review the examples
given before Theorem 7 .5. 7 in the light of that theorem and its corollary.
O Exercises
1.
Define a function f on
f'(x,y)=x2-y2,
f2(x,y)= 2xy.
Show that f has a nonsingular differential at every point except the
origin and thus at every point of E2\ { (0,O)} is
locally a
one-to-one func
2.
f'(x,y)=
{x
E2 defined by
oF-
0,
if x = 0,
f2(x,y)=y.
Show that
(a)
E2 by
f(x,y)=x-y2
Does there exist a real-valued function g defined in a neighborhood of
f(x,g(x))
0 and g(x)
>
0.
E2 defined by
f(x,y)=x2 - y2.
How many continuous functions g do there exist, defined on a neighbor
hood of x= 0 so that
5.
f'(x,y)= e cosy,
x
Show that
f/2,(J)
6.
f2 (x, y)
ez,
= x + iy , f1 (x, y)
tinuous function with domain U which satisfies (a') and (b'), show that
h=g.
7.
Suppose
J0(J)
C E2 and satis
8.
where the domain and range off are in higher dimensions. In fact,
show that
Vu
Note that
dxf(x, g (x)).
E Em and Vv E En
Suppose
C1
=JIB(a, p),
then
Va
E R so that
a= 1, 3m
lg(x)-g(y) I
lx-yl"
Suppose
lim
lg( x) - g(y)I
Ix - YI
C1
0,
>
Vu
Vx E J0(f ) ,
[Hint: Use the
[J(x)-J(y)] when u = x - y.]
E E",
=ft 0, and
7 .6
df(a)
g
so that if
u df(x)(u)
J0(J)
> 0
lx-yl-O
10.
1 we have
Jim
lx-yl-O
and if
a<
p)
1, it is, of course,
possible to talk about the maximum and minimum of the function. The
purpose of this section is to develop some of the criteria that will tell
when a multivariate function has a (local) maximum or minimum. We
are sure that the reader knows the definition of the maximum and mini
mum of a function, but for the sake of completeness we shall repeat it.
7.6
Let us set
gk (t)
This defines a function
f(a + tek).
gk with >(gk)
f has
a differential at
f(x,y)
Clearly
df(O)
x3 + y3.
saddle point.
Em,
then a is
(7.6.2)
ak(x) =
[dn (x) is called the Hessian offat x]. IfVk E (I, n), dk(a) > 0,
then fhas a local minimum at a, and ifVk E (l,n), (-I) kdk(a) > 0,
then fhas a local maximum at a. If the Hessian dn(a) 0, but neither of
the previous conditions hold, then fhas a saddle point at a, that is, has neither
a local maximum nor minimum at a.
Since
Proof.
df(a) =0,
(f)
so that
I
f(x)-J(a) =2 d2f(c)(x - a)2.
For every
En
into
x E B(a, p),
let
T(x)
(7.6.3)
E"
T(x)
Theorem
7.4.6,
n n
i - a i)(x k - ak) Di Dkf(c)
d2f(c)(x - a)2 = L L (x
k=I j=l
=T(c)(x-a) (x-a).
o
(7.6.4)
3cr
To prove the last statement of the theorem we use the last statement in
T(c)(Oiv)
so that
f(x) - f(a)
LAGRANGE MULTIPLIERS
If
le(J)
that we are not interested in the local extrema offbut rather in the local
le(f )
restricted to a subset of
le(f ).
{x:
h(x)=O} n le(J), where his a function with domain in E" and range
m ,,;;:; n. This is a standard type of problem that arises, for example,
in Em,
h(x)=0.
that a point
h(x)=0.
on that theme.
7.6.5
in
E".
(7.6.5)
F(x,y)=f(x) + h(x) : y
has a critical point at (a, A.); that is,
k=I
A_k dhk(a)= 0.
(7.6.6)
Vt
E U, rank
dg(t)=n - m, h
and
g(t)=0,
g{O)
=a.
Since
7.6.2 to f
g, and
dh
g(0) = dh (a)
df
g(O)= df(a)
dg ( 0)
O,
(7.6.7)
dg(O)= 0.
Let N be the null space of dh(a) and N1- its orthogonal complement in
En. Now, (dh(a))= (dh(a) IN1-), and dh(a) IN1- is a one-to-one func
tion. Hence, since rank dh(a)= m, we must have dim N1-= m, and since
dim N + dim N 1n, we must have dim N=n - m. Since rank dg(0)
=n - m, it follows from the first equality of (7.6.7) that N (dg(O)).
Since df(a) is a linear functional on E", it follows from Theorem
6.5.7 that 3b EE" so that Vu EEn,
=
df(a)(u)
From the second equality of
df(a)
u b.
(7.6.8)
dg(O)(u)
dg(O)(u)
b= 0.
b= -dh(a)1(A).
If we use this in
Now,
(7.6.9)
dy(v),
(7.6.10)
k=l
_Ak dhk(a)
0.
Dk f(a)+
L
i=l
A1Dkhi (a)
0,
VkE(l,n).
(7.6.11)
7.6
VjE(l,m).
(7.6.12)
its uses are given in the exercises at the end of the chapter.
We shall prove the following statement:
IJVk
E (l,n), ak;;,: 0, th en
(7.6.13)
f(x)
(il x1 )2.
xi
>
0,
Vj E (1, n) ,
n
h(x)= L (xi)2 -1=0.
J=l
n+ 1 equations
kE(l,n),
n
(7.6.14)
L (xi)2=1.
i=l
Sppose
equation in
(7.6.15)
If we sum up over k and use the last equation of
(7.6.14)
we get
nf(b)+A.=0.
Putting this value of
(7.6.16)
and thus
A= -n-n.
(7.6.16')
f is defined. To see
f, let us extend J, in
the obvious way, to a continuous function F defined on the set D
{x:
x E E" & Vj E (l, n), xi ;;;.: O}. If Sis the unit sphere in En, then since
F is continuous, FI (D n S) must take on a maximum and minimum.
Clearly, t_he minimum is taken on when 3j E (l, n) so that xi= 0,
and the minimum is 0. Thus the maximum of FI (D n S) is taken on
when Vj E (l, n), xi> 0, and by Theorem 7.6.5 the point where the
tem has a unique solution in the domain where
(l,n)}
Let {a : k E
and x = (x1,
viously we have
But
(l, n)
so that
E"
2.
0.
f(x,y) = ax2
If
-,!:. 0
and b2 - 4ac =
0,
Let
bxy
J has
= (O, 0).
show that
3.
0,
show 3 a E A0
E2
by the equation
cy2
either a relative maximum
1
x2 + xy + 2 y3.
1
equation
7.6
Find all the relative maxima and minima for f restricted to the triangle
and its interior which has vertices at the points
( -1, 6) .
and
this
Let
f be
J(x,y) =2+2+2xy,
x
y
x=F-0, y=F-0.
Find all the critical points of the function and decide whether they are
at a relative minima, a relative maxima, or at a saddle point.
5.
3x+ y- 2z = 5.
(12, 1, 5)
is a mini
mum.
6.
7.
Let
xy - z=0.
equation
0.
However, if
8.
& Vj
Suppose
E
with
.B{f) = {x: x
E En
1 n
n i=t
f(x) =- L xi.
Use the method of Lagrange multipliers to find the minimum of this
function under the constraint
n
(a)
(7.6.13).
h(x,y)=xy-1=0
is
(l/p) + (l/q).
(b)
b 0,
(l/p) + (l/q)
1 , and
0 and
> 1 and
> 1,
then
b a2 b2
b]
10.
[a,
to show that if
(l/p) + (l/q)
1 , then
1 and
oo,
inequality.
define
=sup g.
11.
and
= 1,
and
ak
12.
p 1
k 0 bk 0,
[ ak rp [ b kqrq.
(l/p) + (l/q)
Vk
10
to show that if
(l,n), a
for 1 p <
and
then
inequality
and
oo.
10
to obtain
Minkowski's
b
[ f)f(x)
g(x)IP dh(x) ]l/p [ fablf(x)IP dh(x) ]l/p
[ J:ig(x)Ip dh(x) rp.
[Hint:
81 HIGHER-DIMEN
CHAPTER
SIONAL INTEGRATION
8.1
RIEMANN-DARBOUX INTEGRALS
n we shall often
identify E"' with that subspace of En consisting of all vectors of the form
(x,O)
(or
(O,x)),
where
8.1.1
Definition.
n nonvoid intervals in
/ =/1X X/ll,
Jk
C E1,
Vk E (l,n).
I I I = II I/kl'
k=I
[ 11k12 r2.
Jk
n,
I = J1 Xj2 X
Xj",
8.1.3
Proposition.
sition of I, then
Proof.
II' I
Now,
and, by
2: If' I
J'Eli'
P(I),
Hence
where the notation on the right means we are summing over all ordered
pairs in
A' X A"+',
A,
and if
II' I II 11+' I= II I .
is complete.
corresponds to
P ( n + I)
I E A,
A11+1
then
8.1
The function Rf is called the Riemann sum function for f, and any number
in its range is called a Riemann sum for f
The function Rf is said to have the limit R (f) <==> VE> 0, 3A, so that
A >A, IRf(A,{xd) R(f)I < E. In case Rf has a limit, we say that
f is Riemann integrable and the limit R (f) is called the Riemann integral
off.
-
In case
Rf has
R (f) the
limit since
R.
IRf(A,{xd)-Rf(A',{xD)I
<
E.
The proof of this is exactly the same as the proof of Theorem 5.1. 7
and we shall not repeat it. We go on to define upper and lower Darboux
sums and integrals.
M(])=
sup {J(x):
m (])=
inf {f(x):
x E ]},
Jea
x E ]} .
Jea
The functions i5f and Qfare called the upper and lower Darboux sum functions
for f, respectively. The numbers Df(A) and !2f(A) are called upper and lower
D(J)
=inf {D1(Li):
Q(J)
=sup
i f(x)dx,
i f(x)dx,
Li
10(D1)}
{Q1(d): Li
.f0(Q1)}
and call these numbers the upper and lower Darboux integrals off, respectively.
In case D (J) =Q (J) =D (J), we say that fis Darboux integrable and call
D (J) the Darboux integral off.
As in Chapter 5, to show the connection between Riemann integrals
and Darboux integrals it is necessary to prove the following lemma.
and denote it by
GENERALIZED LIMITS
It is possible that by this time the reader may have begun to wonder
whether or not the various concepts of limit we have used, for example,
the limit of a sequence, the limit of a function at a point in En, or the
limit of the function
(a)
( b)
(c)
>-
m) .
>- on
8.1
A real net is a real-valued function with domain a directed set G/V. The
real net S is said to have a limit s <=> VE> 0, 3N E <IV so that n >N
==} IS(n) -sl < E. If a net has a limit, the limit is unique. Indeed, sup
pose tis another limit. Then 3M E <IV so that n >M ==} IS(n) -ti < E.
Now, by condition (c) for a directed set, 3P E <IV so that P >M and
P > N. Hence if n >P we have
0 (n1) -s < E.
(8.1.1)
Also, 3n2 >M so that m, n >n2 ==} IS{m) -S(n) I < E/2, and 3n' >n2
so that
(8.1.2)
Since n' >n2, it follows that Vn >n2 we get
(8.1.3)
From the inequalities (8.1.2) and (8.1.3) we get that n >n2 ==}
(8.1.4)
0 (,O(n) -s < E,
0 (n) -S(n) < E.
From these two inequalities it is immediately clear that n >N ==}
IS(n)-sl < E .
Let us see how we can apply this concept to the various definitions of
limit that we have given. In case <IV = N 0, a net is just a sequence and
the relation > is taken as the relation - Suppose f is a real-valued
function with rB(J) C En and a is an accumulation point of rB(J).
If x, y E rB(J) \{a} set x >y<=>Ix -al IY-al. It is easily checked
(J) \{a}
(J) \{a}
a<==>
a.
(A*, {xk*}) >- (A, {xd) <==>A*
that
is a directed set under this relation. The function J
restricted to the directed set
is a net and the function f has
the limit l at
the net f has the limit l at
In the case of the function Rf we take ,;Vas the domain of Rf and
define
is a refinement
Note that with
this definition our discussion of Cauchy nets provides a general proof
for Theorems 5.1.7 and 8.1.6. For the functions Df and[]! we can take
,;V as the set of all decompositions of the given interval I and take
as a refinement of
Then Df becomes a monotone non
increasing net and !2.f a monotone nondecreasing net. The numbers
are the limits of these nets, respectively.
and
Of course, everything we have said for real nets will work as well for
vector-valued nets with ranges in E", n 1.
A* >-A<==> A*
D(J) Q(J)
A.
A.
D Exercises
I.
Suppose that f and g are real-valued bounded functions each
having as domain the closed interval I C E". Show that
f,,,:; g
4.
Suppose that I and j are closed intervals in E" so that I U J
is an interval and I n j is at most an (n
!)-dimensional interval. If
J is a real bounded function with domain I U J, show that
-
5.
>-
n
is nonvoid.
8.2
{K,.: n E <IV}
JORDAN CONTENT
8.2.1
Definition. Let A be any bounded set in E" and I any closed interval
in E" so that A C I. If f is a real-valued, bounded function with J:>(J) =A
ex tend f to I by taking
Then define
and
In case these numbers are equal, we define the common value
of f over A and denote it by
as
the integral
In order that the previous definition make sense, it is clear that the
numbers we have defined should be independent of the interval I that
contains A. Although this seems almost obvious, we shall state and prove
it formally.
8.2.2 Proposition. If A C
with A C I, then the numbers
E" is
and
are independent of I.
Proof.
We shall make the proof only for the upper integral, the proof
C ] C I,
I= /1 X
XI",]=J1 X
X]". Since ] C I, it
follows that Vk E (I, n),Jk C Ik, and hence there is a decomposition
!!,.k Jk
l2\l3k,
l2k =lk.
!!..
{t:..k:
n)}
(8.2.1)
n)
(!!,.k\{Jk})U t:..1k C
!!.'. \l!..i.
=
DrA(t:..') = L M(/')JI'J+ L M(/')JI'I =DrA(t:..1).
Jk;
(8.2.2)
/'El>.'\L>.1
!A(x) dx-
IfA(x)
dx.:;;
DrA
(!!..1) -
IfA(x)
dx < e.
(8.2.3)
< e.
f
D1A
l!..1* = {/*: /* !!..* I* C j};
C !!..*\l!..1*,
*
M(/ )
DrA(!!.. *) = DrA *).
(!!..*) -
0.:;;
Now,take
that
!A(x) dx
(8.2.4')
E
&
then if/*
it follows
0. Thus we again have a formula like (8.2.2), that is,
(!!..1
(8.2.2')
IfA
(x) dx-
IfA(x)
dx.:;;
i51A(l!..1*) - IfA(x)
dx < e.
(8.2.5)
The inequalities (8.2.3) and (8.2.5) show that the upper integral of
over j is the same as its upper integral over /.
In case whereA
/,but it is not true thatA
1, let us proceed
as follows. If
and e> 0, set I/=
e], and
- e,
I,= 1.1 X
XI/. In the same way we construct the interval]. We
leave for the reader the very easy, but slightly tedious,task of verifying
that 3M> 0, depending on and/, so that
fA
Cl C
lk = [ak, bk]
C
[ak bk+
II.fA(x)
I I/
dx-
A(x) dx-
I
l <Me,
IfA(x) l <Me.
fA(x) dx
dx
The analysis is very similar to that carried out in the first part of the
proof.
Since
Ve>
A Cj Cj, CI.,
0,
Using this fact and the two inequalities above, we see that the upper
integral of
and
A C K,
8.2.3
fA
then
A is
Definition.
I.
x(A)=
f-
dx
and
En,
Finally, if
A CI
n K and
let
x(A)=
dx,
A
and call these the outer and inner Jordan content of A, respectively.
In the case where X'(A)= x(A), we say that A is Jordan measurable and
designate this common value by IAI. calling the latter number the Jordan
content of A.
The next theorem, although of a rather simple nature, is nevertheless
relatively important in the development of the theory of Jordan content.
In loose language it says that the outer content is a subadditive mono
tone function.
8.2.4
(a )
Theorem.
inf{lx
- YI: x EA &
EB}, is
Proof.
I on
=I
[
x(A u B)
XAuB (x) dx
xA(x) dx+
the value
called the
Since
Vx
ILll
tion of I so that
<
LlB = {]:I E Ll
E Ll \ (LlA U Ll8)
&
it
follows
Thus
DxA uB(Ll)
2: M(j) II I+ 2: M( j ) II I
JEAA
=DxA
JEAB
and consequently
[ XA
( x)
dx +
[ XB
(x)
dx DxA (Ll)
Dx8 (Ll)
DxAuB (Ll)
[ XAuB ( ) dx +
ILll
=DxAuB (Ll) .
<
d/2 and
E.
Consequently,
B).
x(A) = XA (x) dx
REMARKS:
[ XB(x) dx
x(B).
x(A) =g.I.b.
{ J2: II I:
LP E r
E<P
},
C E",
(8.2.6)
tion we have
x(A) x (
CP E f,
u {]:] ECP} )
'2: 111.
JE<P
A.
It is not hard to give examples of sets that are not Jordan measurable.
The easiest example is obtained by considering the set A as all the
rationals in
check that x
[O, I];
(A)
that is,
1, while
A= Q
K (A)
[O, I].
A
A
A0
0, then x(A)
0;
hence, if
interval
Q)(A (a)
I,
x(A) > 0,
into a closed
then
L II I,
JEt;.
JcA
A0
8.2.5
Theorem.
A bounded set A
C E"
is Jordan
measurable<=>
I.BA I
=O.
Proof.
371>0
Since
so that
Lipschitz,
we have
3M >0 and
lg{x)-g{y)I
x(Ufk) :L 11k1
k=l
(MV;;)n
L lhl
k=l
<
<
E.
f3g(A)
g(A) \g(A)
g(A) \g(A0)
g(A \A0)
g({3A).
I. Show that an (n
1)-dimensional interval in En has zero n
dimensional Jordan content.
4.
A Jordan arc or path in E" is a one-to-one continuous function
J with domain [O, l] and range in En. It is piecewise differentiable
the domain off' is all of [O, l] except for a finite number of points.
If J is a piecewise differentiable Jordan arc in E2 and f' is bounded,
show that the two-dimensional Jordan content of f,1(,(j) is zero.
k=l
lhl,,,;;;; 1/2.
x(f3A)
x(A) - x(A).
10. Let g
(x, y) be the polar coordinate function with domain and
range E2 defined by
=
x(r, 8)
r cos 8,
y(r, 8)
= r
sin 8.
n
n
c(J)}],
c(J)}].
(8.3.1)
<
f(y)
<
8.3
Thus Vy , z EB (x , 8(x) )
(f) we have
+ e/2
(f) ,
+ e/2 <
e.
(8.3.2)
and V'Y/>
Let a' = {]: } Ea & } n f!(f, e) oF- 0}; if} Ea', it follows that
M(}) - m ( } ) e. Also, since f!(f, E) c u {]: J Ea'} it follows
from Theorem 8.2.4 that
,
Consequently,
ex(f!(f, e) ) L [M(})
JEA'
we have
x(f!(f, e) )
<
T/,
E.
0.
clearly
IJI,
M(L)
m(L)
:s;;
2e.
Consequently, if
we have
0 :s;;
i5,(Li1) - Q,(6.1)
[M(]) - m(]) ] IJ I
JE!J.1*
[M(])
m(]) ] Ill
JE!J.1/!J.1*
:s;;
2e II I+2MDxn<f,l (Li)
:s;;
2e[III+M].
e)
we have
Lim
t-a
f(t) - T}/2
<
f(x)
< Lim
t-a
J(t) +T}/2.
f(t)
I,,p. f(t)
Hence,
Vx
B(a,p)
w(f, x)
If we fix TJ < E -
w(f, x)
T}/2
- TJ/2
:s;; Lim
t-x
:s;;
f(t)
1tlr;i J(t)
:s;; Lim
t-a
:s;;
J(t) +T}/2,
T J(t) +TJ/2.
n /,we have
t-x
w(f, a) ,
< E. Consequently,
f(t)
:s;;
w(f, a) +TJ.
J\!l(f, e)
Vx
B(a, p)
I,
{3A. For
n (XA' E)
us that
if and only if
XA (x) dx =
J!l(xA, e) I = lf3A I
XA (x) dx
= 0.
An immediate corollary of Theorems
is the following.
8.3
Proof.
{3A.
Embed
Thus V e >
and Theorem
8.3.4
Definition.
as
If A
1(A)
C En,
g.Lb.
{ 111: <U E r }.
(8.3.4)
IE'U
0.
to a set R U
to R U
{oo}
{oo},
where
by taking x <
oo if
oo
!JI (A) = 0, we shall usually say that A has zero Lebesgue measure rather
than zero outer Lebesgue measure. Of course, as we already have noted,
the null set always has zero Lebesgue measure. Also note that every
countable number of points in En has zero Lebesgue measure. Indeed,
since each point can be covered by an open interval of arbitrarily small
8.3.5
and B
1(B).;;;
L l(A).
(8.3.5)
AE-t
IfA
(b)
Proof.
m elements,
k o 1(<l>(k)) < oo, since
..
then
suppose that
'Uk
of
<l>(k)
VE>
0 and
Vk
N0,
so that
'U
1(B).;;;
VE>
A.
Proposition.
8.3.6
(8.3.6)
E>
{Ik: k
For every
of closed intervals
(1, m)}
so that
k=I
Clearly,
Vk
( 1, m)
_
>..<A>.;;; :L lhl
k=I
<
m
E
:L 11k1 +2
k=I
<
x<A > + E.
8.3
VE>
0 we have
(8.3.6).
E>
0 there is a covering
Since
A.
x<A)
Since this is true,
VE>
x< u 1k)
k-1
0 we get equality in
(8.3.6).
Proof.
where
VE>
0, since
f!(f, E) C
f!(f, E) is compact, it follows from Proposition 8.3.6
measure. Since
(b)
(c)
(d)
0.
Jg is integrable.
En,
fAuB f(x) dx+ fAnB f(x) dx= fA f(x) dx+ JB f(x) dx.
(8.3.7)
Since
Further, since
!A U B+ fA n B =!A + fB
Since the boundaries of all the sets in question have zero Lebesgue
measure, it follows that all the above functions are continuous except
on sets of zero Lebesgue measure. Hence by Theorem 8.3. 7 the integrals
of all these functions exist. The formula (8.3.7) is then a consequence
of Theorem 8.3.8(a).
8.3.10
Corollary.
A U B and A
IA u BI+ IA
B I= IA I + IB 1.
The first mean value theorem, 5.2.6, has a natural analogue in higher
dimensions and we leave it for the reader to formulate. The analogues
of some of the other theorems in Section 5.2 become very much more
complicated in the higher-dimensional situation. For example, Theorem
5.2.4 must be interpreted in a somewhat more complicated way and has
a considerably more complicated proof. We shall carry this out in Section
8.5. Theorem 5.2.2 and its corollary on integration by parts becomes
Stoke's theorem in higher dimensions and we shall delay this until
Chapter 9.
8.3
D Exercises
I. The set of rational numbers in [O, l] has zero Lebesgue measure.
Since the characteristic function of this set of rationals is not Riemann
Darboux integrable, why does this not contradict Theorem 8.3. 7?
2.
C En
(A)
show that
"X(A).
Give an example of a set for which equality does not hold. At any rate,
if A is Jordan measurable, this inequality shows that IA I = X(A).
3.
5.
C En
L f(x)
6.
dx=
L g(x)
dx.
g= max (J, g) ,
/\
g= min(J, g)
C En.
h defined by
h(x, y) = f(x)g(y)
is integrable on
I X }.
8.4
ITERATED INTEGRATION
1(x, y)dy
and
1(x, y)dy
IxJ
f(z) dz
Proof.
J [JJ
I [[
]
]
f(x, y) dy dx
f(x,y) dy dx
LxJJ(z)
dz.
(8.4.1)
Let us set
L(f) =
L [L
f(x,y) dy dx.
8.1):
f gL(f) L(g).
L(f) + L(g) L(f + g).
(c) L(af) = aL(f), Va ER.
(d) If K C I X] is an interval, then
(a)
(b)
L("X.Ko) = L(X.K)
IKI
ft.= L m(K)X.K*,
KEl1.
then
f t.
L(ft.) L(f).
From the properties (b), (c), and (d) we get
8.4
!}r(A)
L m(K) IKI
Ket;.
Thus
!2t(A)
KE!;.
,,;; L(f),
on the left we
fx(y)=f(x,y)
and h given by
g(x) = Jf(x,y)dy ,
h(x) Lf(x,y)dy,
=
Proof.
Clearly g
l we get
(8.4.2)
This shows that the upper and lower integrals of g are equal, and thus
g is integrable on /. In a similar way, we show that
is integrable.
L [h(x) - g(x)] dx
Now, the function F
0.
every point
suppose
f F(x) dx
J1
F(a)
IMI
2
>
0,
which is a contradiction.
We have proved that
E2 let I= [O, l]
[O, I]
form
(P/2n, q/2n), where p and q are odd integers. It is, we think, clear
that A is dense in /. Also, every line parallel to the x axis and every line
[O, I]
Further, we have
J: f(x,y)dy =I
J: f(x,y)
and
dx= l,
and thus
Let us now work out a simple example that makes use of iterated
integrals. This is the type of exercise that is usually done in the ele
mentary calculus. However, the reader may possibly find it instructive
to look at it from the slightly more sophisticated point of view that we
have developed in this section. The problem is to find the volume
enclosed by the ellipsoid whose equation is
a,b,c ER+.
In precise terms this means that we wish to find the integral of the
constant function, with value 1, whose domain is the set
A=
{ (x, y, z) : x2
a2
y2
2
z2
2
+b + c
,,,;;;
I .
Let us embed A
[-b, b],
where I = [-a,a],
(y,z)
r
(x, y,z) d(x,y,z)
J1xJxK XA
so that
XA(x,y,z) _
(y/b)2 + (z/c)2.::;
0 otherwise
XA(x,y,z) = 0
and, of course,
if (y/b)2 +
(z/c)2
<I>
Then we get
1, we have
XA(x,y,z) dx
(y/b)2 + (z/c)2.::;
1,
otherwise.
Thus
v
r
JK
[J <l>(y,z) dy ] dz.
J
z,
fJ <l>(y,z) dy J'i'<z>
=
1,
<l>(y,z) dy.
-'i'(Z)
Hence
V
<z>
J
lK [ 'i'
-'i'(Z)
DIFFERENTIATION OF INTEGRALS
g(y) =
E2,
continuous on [a, b]
Dd
g(y)= J:!(x,y)dx,
then g' is defined and continuous on [c, d], and moreover
(8.4.3)
Proof.
Now,
continuous,
Ve>
Hence
and
Since
0, 35 > 0 so that
< 5, and thus
is uniformly
and
<
<
and
<
<
E.
is complete.
8.4.4 Corollary. Under the hypotheses of Theorem 8.4.3 and the addi
tional hypotheses that a <P
<I>
b, where <P and <I> are defi,ned and
differentiable on [c, d], and V E [c, d] the function fv on [a, b] with
values v
is continuous, then
(y)
f (x) J(x,y)
=
(y)
8.4
g(y)=
J<l>(y) f(x,y)dx
<p(y )
is
differentiable, and
g'(y)=
Proof.
Set
G(y,u,v) =
f J(x,y)dx.
and the other terms are obtained by the use of Theorem 5.2.5.
D Exercises
I.
X
[a, b]
a 'P(Y) <l>(y) b, where 'P and <I> are
real-valued functions defined on [c, d]. Show that
[c, d].
continuous
g(y)=
J<l>(y) J(x,y)dx
<p(y)
is continuous.
2.
g(x)
in the interval
3.
J:
cos[ (y
- x)2]dy
]O, oo[.
interval
(x,y)
E J,
E J, define
together with
or its
5.
Suppose
f, D1 D2f,
on a rectangle in
E2
and
D2 D1f are
that
D1 D2f= D2 D1f
6.
Suppose g is defined on
[O, l]
J: [{ g(t) dt ]
7.
En.
E [O,f(x)]} is
8.
J: tg(t) dt.
domain A C
dx
If
L f(x)
V n(r) is
V2dr)
V2k-1 (r)
En show
that
r2k7Tk/k!
r 2k-l7Tk-l4kk!/(2k) !
Vn+i(r)
8.5
2Vn(l)
5.2.4
to
C1
formula for integrals can be obtained in this way. This is actually the
way it works and we shall develop this in the subsequent pages.
We shall begin by considering how the volume of an interval changes
under three very simple types of linear transformations of En into
E n.
g(x)=x+(A.-l)xkek, A.ER.
g(x)=(x"\
xrrn), <Fa permutation of (1, n).
g(x) = x + x2e 1
I.
II.
III.
Proposition.
1.
II, III.
Proof.
I =1 1 x
... x P, then
/".
It is clear that
g(J)=]
x K.
L
FIGURE 8.5.1
is an open
g(J)
is Jordan
measurable.
Let us apply Corollary 8.4.2 on iterated integration. Let us first note
that if we designate the elements of
En-2
by
(x, y),
then
Xum(x,y)= xJ(x)xK(y).
Now, since the transformation that takes
into
x + x2e1
is non
XJ is
X K is
\g(J) I=
xg(f) (x,
JrK
[I
LX/2
J(
LX/2
Xow
is integrable.) Thus
y) dx dy
XJ(x) dx
dy
Xk(y) dy.
Now, clearly
n ,Ii,_
J(
LX/2
For fixed
x2
/2
X J(x) dx=
we have
XJ(x1, x2)=
J [f
I ::::>
/2
ai
xJ(x) dx1
x2
xi :;;; bi
dx2
x2'
0 otherwise.
Thus we get
Lxi
and thus
lg(J) I
III.
L II;I
<
IAI + e/l>..1.
j=l
j=l
j=l
IAI = lg-I
g(A)I
TiT lg{A) 1.
(8.5.1)
This clearly defines a function VA with domain the n-fold Cartesian
product of En, and range in R. In Exercise 8 at the end of the chapter
we have asked the reader to show that h(A) is Jordan measurable.
8.5.3 Proposition. The function VA defined by (8.5.1) satisfies the fol
lowing properties:
(a) VA(h,, ,A.kb,hn)= l>..I VA(h,, ,hn).
(b) VA(h1, ,hn)=VA(h<I1,,h<In), for every permutation <T of
(1, n).
(c) VA(h,,,hn)=VA(h,,,hv+Ahq,,hq,,hn), Vp,
q E (1, n)so that p - q, and VA. E R.
(d) VA(e,,,en)=IAI.
h with
sentation of
= A.ek.
Thus, since
basis
(e1 , ,en)
are
to the
using
VA(h1,
A.hk
Thus
h1 and
and leaves the other rows fixed, use the result (b) and the
equality we have just obtained for a transformation of type III, and then
use the result (b) again, we have arrived at the fact that
hn).
From this, if
A. =
0 we get (c). If
A.=0,
8.5.4 Proposition. There exists only one function defined on the njold
Cartesian product of En that satisfies the conditions (a) through (d) of the
preceding proposition.
hk +
#k
Ajhj
W(h., , hk +
j#k
Ajhj,
hn)
0.
hk,
n
=
i=l
A.iki eJ,
Also,
hk,= A.1k,hk, +
j= 2
A.;k,e;.
Using the previous equality and a repeated application of (c) and (a)
we get
Now let <r be that permutation of (1, n) so that <r(j) = ki. Then, applying
(b) we get
lg(A)I
ldet
En
gl IAI.
(8.5.2)
Proof.
to
8)
g(A)
8.5.3,
and thus
is a Jordan-measurable set
and thus
En
lg(I)I
(8.5.3)
Proof.
follows that
formula
7.5.1,
Theorem
ble sets.
Vx, a
From Proposition
E K with
Ix - a l
<
a) I
0,
38' >
e' Ix - al.
21,
where
21 < 81/Vn.
(8.5.4)
we get,
(8.5.4)
a and of side
d(I) < 8',
0, so that
Vk
Thus, since
and
we get
(8.5.5)
e'
> 0 we choose
have established
dg(a) = I.
(8.5.3)
so that (1 +
Let
M be a positive
dg(a)-1
we get
and the
(8.5.6)
Let us set
h = dg(a)-1 g.
0
(g)
(8.5.6)
dh(a)
inequality
(8.5.5)
for
Me'. Consequently,
<l + e/2. Applying
for a given
> 0, choose
e'
'
and further
leads to the
(I + Me Vn)"
'
we find that
8.5.5
we get
I <let dg(a)-1I
Ifn(a) 1-1,
we get
(8.5.7)
Let us now finish the proof. From the compactness of K, and the
Vx E K we have
lfn(x)I m. Further, from the uniform continuity offn(x) IK, Vri > 0,
3 8 with 0 < 8 < 8' so that Vx, y E K with Ix - YI < 8 we have
-71 < lfn(Y)l- IJg(x)I <
T/
(8.5.8)
Take
11
so that
Now, if
from Theorem
set
d(I) < 8,
V, > 0 there
and
Let
a k be
the center of
I k.
Then using
(8.5. 7)
it follows
is a finite
we get
re
JIJ
Proof.
and AC
Jg(x)-g(y)-dg(x)(x-y)J
:s;
Jx-yJ.
Let
(8.5.9)
that
U {I k: k
L II kl :s; 2nx(B).
k=I
The factor
2"
I k in A.
(8.5.9) we get
Let] be any one of these cubes and a its center. Then from
j11(a)
(8.5.9')
r.
For every
g(x)-g(a)
x Ej
h(x)+k(x),
let us write
8.5
lh(x)I
,,;:; ElVn,
(8.5.10)
,,;:; ElVn.
ldg(x) (u ) I
,,;:;
M lul.
(8.5.10')
IUh(x)I
,,;:; ElVn,
IUk(x)I.;;; (M+E)Vn
l
.
Thus Uh(x) is contained in a cubel1 C En-r of side length 2e!Vn and
Uk(x) is contained in a cubel2 C Er of side length 2(M+E)lVn. Hence
Ug(x) - Ug(a) is contained in 11 Xl2 and Ug(x) is contained in the
cube 11 Xl2 + Ug(a), whose content is, of course, the same as the
content ofl1 x12 Thus
Set
x(Ug(j)),,;:; EN Il l .
From this we get
n
n
.;:; EN :L IIkl
k=l
.;:;
EN2nx(B).
lg(A)I
we get
E"
(8.5.. 11)
}g(x) =I' 0
Jg (x)
E A0, it follows from the Inverse Function Theorem that g is
map. It follows from Theorem 8.2.8 that g(A ) is Jordan mea
Proof.
for every x E
=I' 0, Vx
an open
A.
Since g
E C1,
A= B;
that is,
glA0
takes sets
g) IJgl
is continuous on
ii
A0
A0
A0
be a compact
so that
to vc, and
K C V
VE> 0
let 8 be a number so that 0 < 8 < TJ, and which is_small enough so that
the conclusions of Proposition 8.5.6 hold with V the compact set of
that proposition. Embed
sition of I so that
0
By '( J0
lal
< 8 and
Du,g>IJylK(a)
J0g(x)
IJy(x)I dx
<
E.
(J0 g)IJgl on K
Ea&] n K =I' 0}. Clearly
V,and thus fromProposition 8.5.6it follows thatVx Ej,
g)IJglK'
VJ Ea*,} c
M(})=sup{f0g(x):x E]n K}
and
M'(})=sup{f0g(x) IJg(x)I: x
Jn K}'
then we get
8.5
f!l
Clearly,
g(K)
2 M(] ho(J)
JEil *
lg(J)I
fg(K)f(x)dx,;;; fg(f) fll(x)dx=l:MU)
JEA*
< (1 + E) D(fog)fJgfK (6.)
,;;; (l+E)
Since this is true
VE > 0,
[ LJ0g(x)IJ0(x)I dx+E J .
we have
(8.5.12)
set
Now, of course,f0
g 0g-1
L= g(K).
Then
(8.5.12')
Now argue exactly the same as in the previous paragraph and we see
For an f that takes on both positive and negative values, let us set
f -(x)=2 [if(x)l-f(x)].
392 j HIGHER-DIMENSIONAL
INTEGRATION
Then
J.
A
AC (g) it follows that A\B is bounded and its
closure is in (g). Sincejg(x) =0 on A\B, it follows from Proposition
8.5.7 that /g(A\B) / =0. But since g(A) \g(B) C g(A\B), we see that
/g(A)\g(B) / =0. Now, since g/B0 is certainly an open map, it follows
from Theorem 8.2.8 that g(B) is Jordan measurable. Hence it follows
from Corollary 8.3.10 that g(A) = g(B) U (g(A)\g(B)) is Jordan mea
(b)
surable. From the hypothesis on fit follows that the left side of (8.5.11)
(f g) /Jg /
B except for a set of Lebesgue measure zero.
Further, Vx E A\B, I g(x) /Jg(x) / =0 and thus, since B is measur
able, (f g) /Jg / is integrable over A regardless of whether or not A
certainly exists. On the other hand, as we noted in part (a),
is
continuous
on
l(x)dx=
g(A)
l(x)dx+
g(B)
J.
l(x)dx= {
Y<Al \g(B)
l(x)dx,
J g(B)
and also
J.
g<B>
Jn
If we combine this with the previous two equalities we get (8.5.11) and
the proof is complete.
REMARK:
AC (g)
B.
8.5
6.5)
g1 (r,(},rp)= r cos (},
g2(r,(},rp) = r sin (} cos rp ,
g3(r,0, rp )= r sin (} sin rp.
r
If
{3 cp {3
0 (} 1T,
0,
{3 arbitrary.
27T'
properties and hence we may apply the transformation theorem and get
g(A)
J(x,y,z) dx dy dz=
A= {(r, 0, cp): r
then
J f 0 g(r,(},cp) r2
(} dr d(} drp.
and
[O, l], (}
sin
[O, 7r], cp
[O, 2 7T]},
g(A) is the closure of B (O, 1) and we can thus compute the volume
8.5.8
following:
0,
a,
aOa+1T,
{3 arbitrary.
d(} d'(J is an
r this element of surface area is magnified (or shrunk) to r2 sin (} d(} dcp
and hence r2 sin (} d(} d'(J dr is an element of volume in the (r,(},cp)
space.
There is a rather amusing way to remember the transformation
formula
jy(x)
(8.5.11)
of Theorem
8.5.8.
Vx
o(g),
a(gl'...'gn)
Jg(x)=a(x1,,xn) (x).
g(A)
f(x)dx=
' ,
dx n.
(8.5.11 )
'
dx1
,g") as dg1
over into
g(A)
A,
g(A).
x variables vary
(8. 5. 11').
O Exercises
1.
Suppose
J(x,y)
Compute the integral off over the interior of the square having vertices
at
v=x-y.)
u=x+ y,
2.
Suppose J is a real-valued
g(x,y)
(See Exercise 9 of Section
3.
(x,y-f(x))
8. 3.)
'
Let g be the function on
g(x,y)
2\{0} defined
by
x'
JI
dxdy
(x2+y2)2
{(x,y) : (x - 1) 2
l}, {(x,y): (x-1/2)2+y2 > 1/4},
function of class
C2 with
3.
Suppose f is a real-valued
V f(x)=,L Dkf(x)ek>
k=l
8 .5
0,
Vx
JE>(f).
Show that
CHAPTER 9 j THE
INTEGRATION OF
DIFFERENTIAL FORMS
I. LINE INTEGRALS
9.1
y(t) (y1(t),
y2(t), y3(t)) defines a function of the time t and represents the motion
under the action of a force field w(x)
(w (x), w (x), w3(x)), which
2
1
depends only on the position vector x
(x1, x2, x3). Let Ll be a decom
position of the time interval [a, b]. If 1 E Ll and T E 1, then
The concept of a line integral arises quite naturally in physics in con
nection with the motion of a particle. Suppose that
dyk (T)
1I
dt 1
dyk(7)
wkoy(-r)---11 1
dt
k=I
3
the
x k direction
ILll
Ll,
wk
fb 3
0
y(t)
d (t)
7
dt.
(9.1.1)
By definition, this is the work done on the particle in the given time
interval
w is defined as a function
En to E1 If x
w(x)(u)
If we set
396
wk(x)
1!
k=I
.B(w) and u
u kw(x)(ek).
En, then
9.1
dxk (x)(u)
uk , then we have
(9.1.2)
k =l
range in
y(t)
k =l
wk
y(t) dxk
y(t).
(9.1.3)
d xk
y(t)
dxk (y(t))
dy(t)
dyk (t).
y(t)
k =l
wk
(9.1.3')
We define the
JY
J:
wk
y(t) d yk (t),
(9.1.4)
provided the integrals on right exist in some sense. They will certainly
y k is a continuous function
wk is continuous. Note that we may have
such a situation even though dyk (t) does not exist for every t E [a, b].
In case dyk (t)/dt exists V t E [a, b], is continuous, and each w is
k
continuous, then of course dyk (t) may be replaced by (dyk (t)/dt) dt
in (9.1.4). For n
3, the right side of (9.1.4) is the same as (9.1.1).
exist as Riemann-Stieltjes integrals if each
curve
5.4.9 we have
and thus
(9.1.5)
5.4.9)
(9.1.6)
The functions
a, {3, and y all have the same range, but the line
w with respect to these various paths may not be the same.
In other words, the line integral of w depends on more than just the
integral of
J: f(t)dt = f J(t)dt.
(9.1.7)
J13 w = J: f(a+b - T) dT
-
I w.
w over {3.
On the basis of all the previous motivation we are now ready to give
some formal definitions.
9.1
9.1. l
NOTE:
(9.1.2).
in the future, we shall not comment about it and assume that the
rea<,:ler is aware of this convention.
{
where
w=
J:
y(t) dyk(t),
(9.1.4)
dyk(t).
jdyk(t)I
in
(9.1.4) instead of
is piecewise
'Y
W =
fb
11
y(t)
d k(t)
Ti
dt.
(9.1.4')
and
( x2
-y
) dx- 2xy dy ,
y(t) =
cos
t , sin t ) ,
tE[O, l].
'Y
w=
(1
Jo
'!!..
_
cos2
{'
Jo
'!!..t 2
[(
cos2
sin2
'!!..t2
sin2
cos
'!!..
sin
'!!..t dy2(t)
2
'!!..
'!!..
'!!..
'!!..
i sin i + 2 cos2 i sin i
] dt
1
3
On the other hand, let us take
a(t) =
(1- t,O),
(0, t -1),
tE[O,l],
tE[l,2].
The range of this curve consists of portions of two straight lines, one
proceeding along the x axis from
9.1
(0, O)
to
dadt t) 1l
da2(t)
dt !
[O, 1 [,
w=
tE
tE]l,2]
tE
tE]l,2].
t)2 dt
[O, 1 [,
Hence we get
{1 (1Jo
_!,
3
We see from the two computations that we have just made that
This fact is not an accident and indeed we would get the same value
of the integral of
proceeds from
is so.
If
{'Yk: k
(1,0)
(1, m)}
to
(0, 1).
where Vk
(1, m), ak
k=l
R. If
(9.1.8)
ak'Yb
w=
aY
k =l
ak
(9.1.9)
w.
Yk
(9.1.8) may
(1, m),
zero at every other curve. Such functions, defined on curves, are called
chains, and the integral of a first-order differential form over a chain is
defined by
(9.1. 9).
tE [O,l].
D Exercises.
1. Compute the value of the line integral of the first-order differ
ential form with domain E2,
w(x,y) =xdx-ydy,
over the following curves:
(a) y(t) = (cos 7Tl, sin 7Tl), tE [O,l].
(1- t,O), tE [O,2].
(b) y(t)
(c) y(t)
(1- t,1- jl - tj), t E [O, 2].
=
3.
[0,3].
y
x
w(x,y)= - 2dx+ 2dy,
x +y
x +y
over the following curves:
(a)
Note that these line integrals are not the same, even though the two
curves have the same initial and final points.
4.
(x,y) in E2
so that
5.
y,
9.1.3).
monotone function
9.2
Let
L,,(a)
L ly(bk) - y(ak) 1.
k=l
is a function with
be a decomposition of
[a,b] ,
(9.2.1)
of a curve as follows.
En,
then the
(9.2.2)
to say that
J: g(t) dl(y, t)
exists for every real-valued contin.uous function g. In particular, if
l(y, t) defines a coritinuous function and f is a real-valued continuous
function whos domain contains the range of y, then
J: f :y(t) dl(y, t)
(9.2.3)
exists. This last integral may be viewed as another type of line integral
where the function f is being integrated along 'Y with respect to the
length along
is
l( y ) ..
way.
9.2.2
Theorem.
If y is of class
t(y)
then
J: Jy' (t) I d t.
Suppose .:i
{[ak, bk]: k E (1, m)} 1s any decomposition
[a, b]. By the mean value theorem we get
Proof.
of
C1,
Now, since
E Ik we have
v .:i
9.2
=
where
l7Jkl
<
e;
405
(9.2.4)
moreover,
DIY'I (!J.)
If
J: IY, (t)I dt + E.
then using
(9.2.5)
(9:2.4) we get
).
f21Y'I (!J.)
k=I
ly(bk)-y(ak)I+e.
J: ly'(t)J dt l(y)+e.
In a similar way, and using
(9.2.5), we get
Thus
l(y)
Since
(9.2.3).
by a force field
w x
w1
, w2
be given by
yW
(y1 (t),
y2(t), y3(t)).
Since we
have a physical system we may assume that the particle has an accelera
tion at every time
t and
y'(t)
is continuous. Let us
t.
y'(t)
[a, b]
the
The vector
O(t) =y'(t)/ly'(t)I
is a tangent vector of unit length and is often ca,Iled the
the oriented curve 'Y.
(9.2.6)
orientation
of
O(t), the
w 0 y(t)
[w 0 y(t) O(t)]O(t).
This is the compom:nt of the force that lies in the space generated by
be a decomposition of
[a ,b] and
integral
acting on
[a, b].
(9.2.7)
To show that this actually coincides with the integral (9.1.1), we note that
w
0 y(t)
__l
O(t) - '
IY (t) I
wk
11
'
y(t)
dyk (t)
,
dt
and that
/(y,t)
{ ly'(T)ldT.
Thus
From Theorem 5.4.7 it follows that the integral (9.2.7) is the same as
(9.1.1).
Let us close this section by showing how a continuous oriented curve
[a, b]
s(t)
Since
y is
y is
J d/(y,
t1
<
/(y, t).
t2 ==>s(t1)
<
s(t2).
For,
Thus, if
y is
9.3
inverse function by
s(t)
y'
and if
never vanishes,
parameterize
J ly'(r)I dr;
D Exercises
Justify the remarks made in the last paragraph of this section;
I.
YI [t1, t2] is
If
2.
and
3.
is a curve of class
C1
that is regular
[a,b], and Vt
on
l(y, t)
Suppose
4.
2.
[a,b],
If
is a function of
'
t-a.
9.3
(y'
I.
C1 defined
I
I dyd(t)
t
then
s(t2)
I dys)I
class
a constant function.
l(a)= l({3)?
Let I=
[O,I]
[O,I]
and let
given as follows:
Vt
[O, I],
(I, t-1), Vt
[l,2],
(3-t,l), Vt
[2,3],
(0,4-t), Vt
[3,4].
(t,O),
y(t)
(9.3.1)
Jy W f W1
'}'(t)
1
dy (t) + W2
f40 k=l
:L
y(t)
'}' (t)
2
dy (t)
dyk(t)
- dt.
dt
-
Note that since yk is piecewise smooth the last integral exists. If we note
-1, 0, or 1, we get
1 w = f [w2(1,T)-w2(0,T)] dT
-i1 [w1(t, 1) -w1(t,O)J dt.
(9.3.2)
(9.3.3)
(9.3.4)
2
/ 0 (t)
1
/0 (t)
(t, O),
(O, t),
2
11 (t)
1
11 (t)
(t, 1) '
(1, t).
(9.3.1 )
'
iJI=LL (-l)HkJ/.
j=O k=l
9.3
(-1) i+k J
i=O k=t
If
J w.
(9.3.5)
(9.3.4) over more general regions than the unit cube /. Although this
can be carried out for rather general situations, we shall confine our
selves at present to showing that the formula holds for regions that are
smooth mappings of/.
Further, let us suppose that Vx E !0,j<fJ(x) > 0 and rpj/0 is one to one.
Then we may apply the transformation theorem 8.5.8 to get
(9.3.6)
On the other hand, ,
rp
f,
140 k=12
2
wk
,(t)
d (t)
7dt,
t
(9.3.7)
d, k(t)
dt
so that
2 wk
,(t)
d (t)
T,
DJ'P k ( 'Y(t))
L..
j=t
2 [ 2 wk
dyi(t)
dt '
,(t)Dirpk(y(t))
] ?,
d i(t)
Let us set
wj(x)
w*(x)
2 wk
k=t
2
i=I
wj(x) dxi.
(9.3 8)
.
(9.3.8')
{ w=
J.poy
f w*.
y
(9.3.9)
(9.3.4)]
that
r [D1w!(x)-D2wf(x)] dx= { w* .
(9.3.10)
To connect the left side of (9.3. l 0) with the right side of (9.3.6), we must
compute the quantity
w.
form
Since
ip is
of class
(9.3.11) in (9.3.10),
we get
(9.3.12)
Parenthetically, let us introduce another notation for
w*
which is
more or less standard and will arise again in higher dimensions. From
we get
w*(x) =
Since
wk
ip(x)and
[ J DJipk(x) dxi J.
2
2
w*(x) = L wj(x) dxi = L w
k
j=I
k=I
w*
is [see
we have
(9.1.3)]
w*(x) = ip*w(x) = w
ip(x).
Theorem.
'!'(/)
where
y is
w,
(9.3.12)
<p,y
given by (9.3.1).
We can also write the formula (9.3.12) in terms of a chain. Indeed, set
aip(I)
= LL <-oi+k q;o1/,
J=O k=l
9.3
where
I/
is given by
(9.3.1 ').
1
2 <- o i+k
L L
o IP(/)
j=O k=l
IPll
f,
w.
IPY
ip
9.3.1
can be used.
(9.3.12)
(9.3.12)
9.3.1,
ip,
which
in any
ip(r, 8) = ( r
cos
E2:
8, r sin 8).
"
FIGURE 9.3.1
[r1, r2]
[O, 27T],
where
r1 ;;;.
0. The function
'P is of class C2, the Jacobian of 'P is positive in the interior of J and
iplf0
(9.3.12)
is replaced
9.3.1.
ip
Note that the two vertical edges of J map onto the circular
boundaries of the annulus, while the horizontal edges map onto the
portion of the positive
aries. However, note that the two different horizontal boundaries map
ip y this portion
9.3.1 takes the form
J f [D1w2-D2w1] dx dy
R
is the outer
direction."
As a second example, let us show that Stokes' theorem is valid for a
triangle and its interior. Let us set
given by
I
= 1-x.
j.,,(x,y)=
-y
(x,y)
rpjI0 is
(1-x)
I,
then
J.,,(x,y)
Hence, if
Further,
> 0.
9.3.2.
I
I
!
FIGURE 9.3.2
u1
and
u2
and set
Then extend
2 Suppose that
Jl/J(x,y)=
> 0.
chosen so that
u1
and
u2
are
9.3
Then
!fl
FIGURE 9.3.3
we may move the triangle on the right away from the origin so that
Stokes' theorem is valid for any triangle.
As a third example we shall show how to obtain a somewhat more
exotic region for which Stokes' theorem will hold. Let us set
cp(x, y)
(x,y(x5 sin
;: + 1)),
(0, y)'
0,
x= 0.
'1 t is not difficult to check that cp is of class C2, and cpj/0 is one to one and
(9.3.12). The
9.3.4.
FIGURE 9.3.4
9.3. l and
the form
w
(x, y)
d 2 (s)
ds
d1 (s)
'
ds
[
[-
a u(x,y)
Thus
ax
n 1+
au(x,y)
n2 ds
.
ay
.
Dnu(,(s))ds=
(s).
w 0
.
U >au((s))
ds
.
an
o
au
ds.
.an
The operator
a2
a 2
6.=
+
ax2 ay2
II
R
6.u(x,y) dx dy =
aR
au
ds,
(Jn
(9.3.13)
D Exercises
1. Use the Stokes-Green-Gauss theorem to evaluate the following
line integrals:
(a) f. x2y dx x dy, where proceed "clockwise" around the
boundary of]= { (x, y): 3 x 5, 1 y 3}.
7.
- -y
x
w(x,y)-+ dx++ 2dy.
x
y2
x
y
IfJ is any interval containing the origin in its interior , show that
0,
'J
but that
dJ
27T,
where a] is the suitably oriented boundary of]. Why does this not
contradict Theorem 9.3. l?
Under suitable conditions on u,v, and R
8.
rf [u
where
av + au av+ au av
ax ax
ay ay
] dxd =I
Y
E 2, show that
a
u v ds.
an
an
u,v,
(uL\v-vL\u) dx dy= iR(u:-v:)ds,
ff
R
9.
where
IO.
then
where
9.4
u,
Au=
0,
dy= u ds.
(
(
ff
R [ r r J dx LR :
+
Suppose
En and moreover
Vx ..(w),w(x)= df(x),
E
where
is a real-valued
(w)
..
k
w 0 y(t)= Dk f( y(t)) dydt(t) dt df 0/t (t) dt.
k=I
[a,b],
a= y(a),/3 y(b),
(9.4.1)
J w=J d J=fbdjodty(t) dt=f(/3)-J(a).
w
a {3,
(9.4.1)
w
=
and we set
then
to
domain is zero.
9.4. 1 Definition.
first-order difef rential form is called exact there
exists a real-valued function f defined on ..(w) so that Vx ..(w),w(x)
= df(x).
A
"Y
Proof.
df
'Y
Let
so
w=
'Y
df=
'Y
Since
over
is closed,
is zero.
.L
fak+l dj
y(t)
dt
dt
k=l ak
m
L [J y(ak+i) - f y(ak)]
k=l
0
y(a1)= y(am+1),
.B(w)
component of
.B(w).
x0
Fix a point
.B(w) and Vx
.B(w) with x0
yx
.B(w)
let
'Yx
Let us set
yx We claim
of 'Yx Indeed,
x0
and
it is
ax is another piece
x0 to x. If yx has
that its domain is [b, c].
suppose
[a, b]
suppose
Define
{3(t)=
ax
is parameterized so
{ 'Yx(l),
Vt
Vt
ax(b + c - t) ,
[a, b],
[b. c].
E
E
.B(w).
Hence
x0
we may set
f(x)= F(yx) ,
and this defines a real-valued function on
w= df
u E En
'Yx+hu(t) -
Since
and
.B(w)
E R so that
{ 'Yx{t)' -b)u,
x+h(t
.B(w).
is open
Vt
Vt
E
E
[a, b],
[b, b + I).
Hence we get
{f w- w }
J
b+t
n
d k
(t)
f
"' Wk
(t)
dt
=dt
h
J(x+hu)-f(x) !
=
h
h
'Yx
'Yx+hu
'Y
'Y x+hu
x+hu
,,,c.,
b k=I
(b+I n
wk(x +h(t-b)u)uk dt.
=J
b
o
Ash 0 we get
n
Duf(x)= L wdx)uk = w(x)(u).
k=I
Vu E En the right side is a continuous function of x, it follows
that df(x) exists, and hence w(x)=df(x).
1
In case w is of class C it is possible to give conditions on the partials
of wk so that w is "locally exact." For the moment we shall restrict our
2
selves to the case where (w) is an open set in E Later on we shall
n
consider the case where (w) C E . Let B be an open ball in (w)
Since
If
Now, if
V(x,y)
[D1w2-D2w1] dxdy =
iJ
w.
EB,
D1w2(x,y)=D2w1(x,y) ,
w
in Bis zero .
Now, if
(a,b) is
(x,y)
EB, set
f1(X, y) =
(x,b)
to
(x,y).
(a,y)
(x,y).
V(x,y)
Dz/1(x,y)=w2(x,y),
Dif2(x,y)=w1(x,y).
Since w1 and
w2 are continuous,
if we setf
f1
wlB
is exact.
DJwk(x)
Dkwi(x).
- ......:::.1_
w(x, y) 2 +
2 dx
x
y
is defined on
2
\{0}
+ z--- 2
+y
x
dy
w#-0.
aJ
locally exact.
From the previous example it would seem that for a closed form to
be exact, there would need to be additional conditions on its domain.
This is actually the case, and for the purpose of obtaining these addi
tional conditions we introduce the following definition. To make the
notation easier we shall suppose, for the remainder of this section, that
we shall only work with representatives from a given curve that have
domain
[O, l].
9.4.4 Definition. Two closed, piecewise smooth, oriented curves 'Yo and
y 1 in a set E C En are said to be homotopic in E there exists a continuous
function r with domain [O, l] X [O, l] that is piecewise smooth in each
variable, has range in E, VT E [O, l], f(T, O]
f(T, 1), and Vt E [O, l],
,
t
t
y
y
O,
t
and
f(
0(t),
1( ).
f( )
l )
A piecewise smooth oriented curve y in E is said to be homotopic to zero in
E y is homotopic in E to a constant curve; that is, a curve 'Yo so that Vt
E [O, l], y0(t)
y0(0).
=
connected. From the point of view of the homotopy relation, the last
statement says that a region is simply connected if all the closed,
oriented, piecewise smooth curves belong to the same homotopy class.
E3
set is simply connected. To be more precise, let us say that the set
every convex set is star-shaped with respect to every point in the set.
Suppose S is open and is star-shaped with respect to
a E S. If y is
f ( T, t) = (1 - T) a + TY (t) ,
Clearly,
variable,
VT E [O, l].
homotopic to zero.
E2,
9.4
Proof.
Let
w be
B(a, r)
[O,I]
B(a, r)
.e9(w).
by
s(x,t)
Clearly, for fixed
x,
( I - t)a + tx.
the range of
and
a.
Let us set
Di to
8.4.3
to move this operator from the outside to the inside of the integral. Now,
Di wk
s(x,t)
ask(x,t)
ask(x,t)
= [Di wk0s(x,t)]
at
at
ask(x, t) .
+Wk0s(x,t) Dj
at
s,
we get
Diwk0s(x,t) = tDiwk(s(x,t)).
Also,
ask (x,t)
D)
at
Now use the fact that
- =
8)k
l
0
<=>
<=>
j = k.
k,
is closed so that
Thus we have
- t aw;0s(x,t) + Wj
at
o S X,
t).
Consequently,
(1 aw-0
) s(x , t) dt+ (1 Wj0s(x,t) dt.
DJ(x) =Jot
Jo
at
If we integrate the first integral hy parts we get
DJ(x) = w;(x).
Since
equal to
w(x).
df(x)
The theorem we have just proved is one of the crucial steps in showing
that every closed first-order differential form in a simply connected
region is exact. A second crucal step is the following lemma. For the
sup{ja(t) - y(t) I: t
Suppose w
domain an open set in En. For
.,(w) , 3 8 > 0 so that for every
with a ( 0) = 1'(0) , a (I) =1' (I)
9.4. 7
Lemma.
Proof.
Let
compact and
and
Vx
28
B(y(T), 28)
y,
by
to
>
set
'Y,.
(0, T]
5C,(y)
where
f,
w,
s-r.x
y ,(t) = y(t ) ,
and
Vx
B(a(T), 8)
let us set
g,(x) =
w+
a,
where
a,
w,
TT,X
=(I - t)a(T) +
Since
B(a(T), 8)
Vx
B(y(T), 28)
it follows
d[f,(x) - g,(x)] =0 ,
from which it follows that there is a number
Vx
c(T)
so that
B(a(r) ;8),
is a continuous function of
T.
Let us put
To< I.
Take
To< T1 I
so
To= I.
that t
To
[T0,T1] =}y(t)
B(y(T0),28)
9.4
and
of
Vt
Vt
Vt
The function
f3x
oriented curve in
E
E
E
[T0, T i],
[Ti.Ti+ l]'
[T1 + l, Ti+ 2].
B(y(T0), 26)
FIGURE 9.4.1
{ w=
J f3x
This shows that
Vx
w-
'Yn
w+
'YTO
w-
8n,x
8To,x
w=O.
B(y(T0), 26) ,
J,0(x) =f,,(x).
Vx
B{a(T0), 6)
Vx
B(a(r0), 8)
we get
f,0(x) - g,0(x)
c(T0) = 0,
y( 1) = a ( 1),
we get
fi(y(l)) =g1(a(l)).
But this says nothing more than
Proof.
[O,l]
Let r be a function on
I.
w=
where
f7(t)= f(T,t).
w,
fT
To= sup E
Lemma 9.4. 7, 3 e > 0 so
EE. Further,
To= 1. Indeed, by
that if y is in an
neighbor
I.fTO
w=
J')'
3o > 0
Vt
so that
lro - Tl < o
[O, I],
lf(T,t)-f(To,t)j <
( w= (
JrTO
JrT
To = 0, then r 70 = a and we are
3a < To so that jT0
al < o, and
If
w=
e.
led to a contradiction. If
'
To > 0,
then
hence
( w= ( w= r w
JrO"
Jr TO
JrT
To=
shows that
(b)
w=
( w=
Jr,
w.
fJ
We shall now reduce the general case to the previous case. Let
be a function on
and
/3
with
f(O,t)
9.4
A(t)
and
VT
[O, l],
{ r(
f(3t,O),
{3(3t- 1),
f(3 - 3t,1)'
Vt
Vt
Vt
A(t),
A(7,t)
t- 7/3
,
T,
1 - 2T/3
A(t),
E
E
E
[O, 1/3],
[1/3,2/3],
[2/3,I],
Vt
[O,7/3],
Vt
[7/3,I - 7/3]'
vt E
[I - T/3' 1] .
FIGURE 9.4.2
Now,
with
a
proof is complete.
class C2 so that
- -y
x
w(x,y)-
2dy,
2dx+
x +
y
x +
y
which we have considered before. Let 1/1 be any closed differential form
0
defined on Bo and let] be any closed interval in B(O, 1) with 0 E ] .
Leta be that real number so that
27Ta'
r
JaJ
[. - aw]= 0.
g(x,y)
J: [l/11(t,b)-aw1(t,b)]dt+f [l/12(x,t)-aw2(x,t)]dt.
This is a line integral along a horizontal line from (a, b) to (x,b), and
then along a vertical line from (x,b) to (x,y). It is always well defined
and clearly D2g(x,y) i/J2(x,y) - aw2(x,y). Also, V(x,y) E B0, which
is not of the form (x,O) withx > 0, set
=
This is a line integral first along a vertical line from (a, b) to (a, y) and
then along a horizontal line from (a, y) to (x,y). It is also well defined,
1/11(x,y)-aw1(x,y).
andD1 h(x,y)
Since the integral of I/I-aw is zero around every closed rectangle in
B0, it follows that on the common domain of g and h we have g(x,y)
=
h(x,y). Now,
D2h(O,y)
lim
+t) - h(O,y) .
h(O,y
t-o
If we use the fact that the integral of I/I-aw around any closed rec
tangle in B0 is zero, we get
h(O,y
+t) - h(O,y)
+ J:+t
[1/1 (0,
2
T)-aw2(0,T)] dT.
Thus
D2 h(O,y)
1/12(0,y)-aw2(0,y) .
df= I/I-aw.
The result we have just proved can be interpreted in the following
way. Suppose we say that two closed forms on B0 are equivalent they
differ by an exact form. It is not hard to verify that this is really an
equivalence relation. The result we have just obtained tells us that the
disjoint equivalence classes are determined by all the forms {aw:
a E R}. Another way of putting it is that the real vector space of
equivalence classes is one-dimensional.
D Exercises
1.
Determine which of the following forms are exact, and if exact
find a function f for which w = df
(a)
2.
(h)
w(x,y)
w(x, y)
(c )
w(x,y,z)
(d )
w(x,y,z)
Let
xydx
(2xyz3
+2" x2 dy
+z) dx
+z3 dz.
+x2z3dy
+(3x2yz2
y
x
w(x,y)- dx+ dy ,
+y2
+y2
x
x
+x)
dz.
3.
w be defined on [a, b]
Let
X R by
w(x,y) =p(x)ydx+dy ,
where pis a continuous function on
[a,b]. If we set
qw is exact.
w(x,y) =xdx+y3dy
along the oriented curve defined by
a(t)
5.
( e1 log cos
Show that
w(x,y)is exact on
6.
t E [O, l].
x2- y2
2xy
dx+ 2
dy
(x2 +y2)2
(x +y2)2
2\{0}.
Show that if
If
8.
a and {3.
intervals in
in 2\{0}.
be a simply
,an)
=\{ak: k E (l,n)}. Show that there exists a set {wk: k E (l,n)}
of linearly independent closed forms in (ai. ... 'an) so that for
every closed form /I in (ai. ,a,.) there is a set {ak: k E (l,n)}
10.
C R so that
9.5
1/1 -
k=l
akwk
k=l
11.
akwk =O=>ak=O.
B (O, I) C E2;
t:m
a2u
ax2 + ay2
=0
1)
so
av
ax
ay'
av
au
-=--
ay
ax
12.
Extend Exercise
Our purpose in the next few sections will be to extend to higher dimen
sions some of the facts we have proved for line integrals. In this con
nection it will be necessary to decide the meaning of an integral taken
over a surface. This section will be devoted to making definitions
together with the motivation for making these definitions.
The first thing we want to do is define a surface and the area of a
surface. We think it will be best if in the beginning we give a rather
discursive motivational discussion. Let
T be a
n;;;.. m.
is a
us
linear transformation
n m, and A C
J ordan measurable set, then the considerations of Section 8.5
that T(A) is J ordan measurable and
flC,(T)
If
Em
tell
(9.5.1)
is in an m-dimensional subspace of
T(A)
define
by
(9.5.2)
IV
T(A)I = IV1T(A)I=0.
Let us now give an effective way of computing the right side of (9.5.2)
so that the operator V does not intervene. Let us first note that T1 T
is a nonnegative symmetric linear transformation from E"' into itself,
that is, Vu E Em, T1 T(u) u;:,, 0. Thus this linear transformation
has a matrix representation consisting of nonnegative eigenvalues
down the main diagonal (see Section 6.5). Suppose we arrange them
in nondecreasing order d<?wn the main diagonal. By taking the non
negative square roots of these nonnegative eigenvalues and arranging
these square roots in nondecreasing order down the main diagonal we
get the matrix representation of a nonnegative symmetric linear
transformation B so that B2 = T1 T. We leave to the reader the easy
task of showing that there is only one nonnegative symmetric linear
transformation B with this property (Exercise 1 of Section 9.5). Since
Bis symmetric, the null space of Bis orthogonal to Tt(B)and moreover
BlTt(B) is one to one and takes Tt(B) onto Tt(B), For simplicity sake,
designate the inverse of BlTt(B) by B-1. Let V be that linear trans
formation with domain Tt(B) and range Tt(T) and defined by
0
V= T B-1
0
Now, Vu
Tt(B) we have
jV(u) 12 = V1
since
l l2,
V(u) u = u
\/fl:T,
(9.5.3)
9.5
IT(A)I=IV
Vo
(A)I
(9.5.4)
D1cp1(c)
:
D1cpn(c)
dcp(c)=
l:EiJ<<imn
where
a(cpji,... 'cpjm)
(c)
a(x1,. ,x"')
,m).
is the Jacobian at c of the function (1,
Suppose {Ak: k E (1, q)} is a decomposition of the Jordan measur
able set A into Jordan measurable sets that intersect at most on their
boundaries. If ak E Ak0, then it is not unreasonable to consider the sum
k=l
[det dcp(ak)1
dcp(ak)]'121A kl
L [det dcp(x)1
dcp(x)]112 dx.
(9.5.5)
A little later we shall write this again as a formal definition and show
that this definition is independent of the parameterization of the sur-
cp
x(y).
di/I (y)1
di/I (y)1
di/I (y)]112
dcp(x(y))1
det
[det
dx (y)
dcp(x(y))
dx(y).
we get
dcp(x(y) )1
dcp(x(y))]112 IJx(Y) J.
[det
dtfJ(y) 1
dtfJ(y) ]112 dy
[det
dcp(x)1
dcp(x)]112 dx.
En
t For this definition we shall say a function is C' if the restriction of the function to the
interior of its domain is of class c.
J
- [
-J
( )=
J)(, cp
.IJ('{>)
.EJ('/))
[det dcp(x)1
dcp(x)]1'2 dx
l:O::::::l,<<imn
(a(cp)ii,
a( 1
)2]112
i )
. . . ,cpm
(x)
)
x
, x
,
m
dx.
(9.5.6)
[ We are supposing ta
h
t cp
( ) C E m, m:;;;; n.]
As an example of the use of formula (9.5.7) let us compute the area
E3
r E3
L1 (x,y) r
L2(x, r
L3(x,y) r
in
cosx,
y)
where
(x,y)
are given by
sinxcosy,
sinxsiny,
a(L1,L2)( ) 2.
a(x,y) .
a(L1 L3)
a(x,y)
a(L2 L3)(x,y) r2
a(x,y)
xy
,
=
'
sin2 xsiny,
(x,y)=-r2 sin2xcosy,
--' -
sinx cosx .
Squaring and adding, taking the square root, and integrating we get
J)(,(L)=r2 Jro
r
Jo
2 1T
1T
sinxdxdy=47Tr2
in
E3
E3
and domain in
E2
vis a function with domain and range in E3 We shall suppose that vis
continuous and its domain contains the trace ofcp. We suppose that the
trace ofcpis of such a nature that the fluid can flow through the surface
without altering its velocity.
If we suppose that dcp(a) is nonsingular, then we can think of the
range of the affine transformation dcp(a)+<P(a) as being the tangent
plane to cp at cpa
( ). Suppose n(a) is a vector emanating from rp(a)
which is orthogonal to this tangent plane. Then
dcp(a)(u)
n(a)
Vu E
d<P(a)1(n(a))= 0.
2 we have
D cp1(a)n1(a)
1
D cp1(a)n1(a)
2
+
+
Assume that
D1cp2(a)n2(a)
D2cp2(a)n2(a)
component
D cp3(a)n3(a) = 0.
1
D2cp3(a)n3(a) = 0.
+
+
Cl(cp1,cp2)
(a) -0
a(x,y)
and take n3(a) to be this Jacobian. Then we can use Cramer's rule on
the previous two equations to solve for n1(a) and n2 (a) to get
n(a)= a
(9.5.7)
If
n(a)/n
j (a)I
(9.5.8)
Jlf,(cp)=
The component of the velocity
normal to cp at x is given by
v a
EJ (cp)
cp(x)
cp
jn(x)jdx.
(9.5.9)
:: , ] ,:=I'
n(x)
cp(x) . j
n(x)j
JI(,
(Vx)'
r,o(x)
v o
n(x) dx
.EJ(cp)
(9.5.10)
where we have taken
surface integral.
oriented
w(x)
LL
k=1 i=1
w;k(x) dx i
/\
dxk ,
where
to indicate that we are not thinking of this as the composite linear trans
formation
dxi
w(x) =
im=1
Jt=t
witim(x) dxii
/\
/\
d,xim,
(9.5.11)
where
436 I THE
<P
EJ !<P)
.L ... .L
im=I
it=l
witim
cp(x)
a (cpit, . . . 'cpim )
m ) (x) dx.
a 1
'
'X
(9.5.12)
y(x).
(9.5.12),
tent zero, then it follows from the transformation theorem for integrals
that we could use l/J. in place of cp in
(9.5.12).
D Exercises
I.
linear transformation
Show that
is
2.
sentation
wherex E
cp(x,y)
[O, 1]
y E
[O, 27T]
l/J (x,y)
where If)(l/J)
B (0, 1) C
this surface.
(x y, Yx2 +y2),
3.
4.
cp1(x,y)
cp2(x,y)
(R+rcosy) sinx ,
cp3(x,y)
2.
r siny ,
7T ,
1T]
[ -7T, 1T]
5.
of radius
same for
6.
E".
of Section
8.5.
the equation
y, z)
that satisfy
7.
[Hint:
8.
(If!) =
.f>(f{!)
JE>(l{I)
D 21{1(x))2] 112 dx .
6.2).]
Show that
JJ(,
= IrJ.e
(If!)
{!{!)
[I
tion of class
C1
10.
If
at
11.
with itself
surfaces.
9.6
E"
[O, I ]
it is clear that
(9.5.12),
J dxi
"'
/\
k=-
J cJxk
/\
dxi.
"'
'
dxi
/\
cJxk=-dxk
/\
dxi.
(I, m),
then we
rJxh
(\ . . . (\
(\
. . ./\rJxi.:rm.
We have already run into this type of behavior in our study of deter
minants in Section 6.6. Actually this should not be a very surprising
development for us in view of the fact that formula (9.5.12) defines a
surface integral in terms of determinants. If we want to demand also
that Va,{3 ER,
rJxit
(\
(\
s /\
(adxir+[3rJxi)
/\dxim
= a[rJx1/\
1 .../\rJx ir/\.../\rJxim]
+{3[dx i1
(\ . . . (\
rJxis
(\ ... (\
<fx1m],
(a)
y
z), Vx,y,z EA.
(x/\y)/\ z=x/\(/\
b
()
zf\(x+y)=(z/\x)+(z/\y)
v x ,y,z E A
z +(/\
y
z)
(x+y)/\z= (x/\)
(c )
x)/\y
y =(a
a (x/\)
=x/\(a )
y , Va ER & Vx,y EA.
'ax'
instead
x.'
9.6
(v1,
=A (v,'
v,
u1,
. 'Vp
u )
q
) , ( u,' ...
'U ) .
q
(9.6.1)
quite clear that A , E Mv+q(V). Note that it is not usually true that
covariant tensors of
V of order k.
(a)
(a.A)n = a.An .
(b)
(A+,)n=An+, n.
(c )
n
(A ,)n =
Ak ,n-k
k=O
simple facts for the exercises. The algebra M (V) is usually called the
V.
For our purposes the algebra M (V) is much too large, since there are
already too many elements in each M k (V) in the sense that they contain
A 0(V) =R .
Now, if
tion d&k with domain Mk (V) and range A k(V) by means of the equations
d&k(A) =
kl
,L
UETTk
(sgn <r)<r*(A),
k;;;.: 1,
(9.6.2)
d&o(A) =A,
where <r*(A)(v1,
v k ) =A(v"''
1Tk
is the set
E 1Tk, (sgn <r)<r*(A) =A. Hence, since 1Tk has kl elements, it follows
that A E A k (V) => d&k (A)=A.This is one reason for the normalizing
factor I/kl
and indeed is modeled on the latter formula.To show that d&k has range
E 1Tk, then
T*(d&k (A))=
I
k!
L
<Te
1Tk
(sgn <r)(T0<T)*(A).
T*(d&k(>,))= (sgn T) kl
PE
(sgn
1Tk
p)p* (.\)
= (sgn T) d&k(.\),
which proves our assertion that
d&k{.\) is alternating.
d& on M (V) bysetting
(9.6.3)
The range ofd& is, ofcourse, M (V).
Let
,\k E
(a)
(b)
(c)
E N0,
A (V) becomes a
, is called the
9.6.2
Lemma.
d&(d&(.\),)
Proof.
q
A (V) we have
Vp, q E
N0,
rr JJ
is identified with that element ofrrp+q which leaves invariant all elements
in
(1, p + q) \ (1, p)
(1, p)
is
write
d&p+q{v'&p(.\),)
(p
=
(p
1 q)
TE 1Tp+q
sgn
L
1 q) p\ UE1Tp
!
sgn
p
er
(sgn
(sgn
(]"E
1Tp
TE1Tp+Q
<T)(T
<T)* (.\ ,)
) ( ro <T)* (.\,) .
'(E1Tp+q
ihp+q(vtp(.\) ,)
'TrP,
it follows that
(p + q)
( sgn p)p*(.\ ,)
E11J>+q
ihp+q(A. ,).
a/\.\=aA.=A./\a
(9.6.4)
q
and VA. E AP(V) and V, E A (V),
A./\,= (-l)PQ,/\.\.
Proof.
(9.6.5)
The fact that A(V) satisfies all the conditions for a real
associative algebra can be very easily checked. Actually the only thing
that may cause some difficulty is the proof of associativity. However,
this is an immediate consequence of the last lemma. Indeed, using the
fact that M(V) is an associative algebra we get
,\ /\ (,/\ T/).
We shall leave as an exercise for the reader the proofs of the other facts
needed to establish that A(V) is an algebra.
The statement that Ak(V) is contained isomorphically in A (V) means
that there is a nonsingular linear transformation with domain A k(V)
and range in A(V).
manner, because the formal proofs of many of the things we say would
require an induction argument, and this would get rather tedious.
However, the induction arguments are not difficult and the reader
wishing to do so can easily fill in the formal details.
Let {ek: k E
E
j
j
k,
k.
Thus if
er
rrk we have
>._i1
/\
/\
\ uLe 'TTk
(sgn
er
) Vu/ 1
Vukik.
A.(v1,-
,vk)
k!A.(e;1,'
i1=l
Jk=l
(J)
[j)
-,e;k)
[;
-,e;k)
( sgn
)vu1i1 .
er
vukik
(TE7Tk
-,vk).
where [j] means we are summing over all ordered k-tuples (j1,
< jk n. Consequently, we have shown
jk) for which 1 j1 <
what we set oqt to prove: The set {A.ii /\ /\ >._i k: 1 j1 < < jk
k
n } generates A (V).
L,,,
[j]
.
J1 ... -.Jk
.
u.
\
I\
i1
\h
/\ . . . /\ I\
ik
n !/k ! ( n -
k) !
9.6
A(V), if
L Laj,,.ik >._i1/\
/\ >.. ik = o,
k=I [j]
then
-O
"" aii .... ,jk A.ii/\ .../\ A.ik.L.J
(j)
Vk
>
n
nl
=
(l+l) =
" k!(nk)! 2n.
>
if we permute
and
,ei k)
=O.
/\ >._ip,
(j)
. \ i1 /\
"" /3t1..tq/\.
L.J
/\
\
I\
iq
'
[ii
where recall that by the symbol [j ] we mean we are summing over all
P-tuples of integers (j1,
<
<
=LLa;,,,i,,/3iv,iq >._it
[j) [i)
(\
>._ it /\
/\
A_ip/\ >._i1/\
/\
/\ >._ip/\ >._i1/\
A.k = - A.k
/\
/\ >._iq.
Ai, we have
A_iq
. 9.6.4
E
j
j
k,
=
k.
Then the set {,\it /\ /\ ,\ik: 1 j < < jk n} is a basis for Ak( V) ,
1
k > 0, which in turn implies that the set consisting of the number 1 together
with
{A.it f\ ... f\ ,\ik:
is a basis for
j1
<
jk n, k
<
(l,n)}
A( V) .
is that vector whose jth component is 1 and all of whose other com
ponents are zero. Thus, from formula
(9.6.6), Vx
.(w)
we may
write
(9.6.7)
(j)
(j1,
,jk)
[j]
En,
j 1 <
< Jk n.
(9.6.7) changes if we choose a different
for which 1
d f1
g(x) /\ /\ dfq
= :L
[j]
g(x)
au , . . . fq)
,
. , tiq) (gCx> )dgit(x) /\ ... /\ dgiq(x)
a(tit ' ..
'
where
a( f1,... r )
k(g(x))].
; (g(x))-det[D;f
,
a (.
111,
, t q)
'
(9.6.s)
9.6
Proof.
dfk g(x)
0
dfk(g(x))
Also, we have
dg(x).
dti
Hence, since
dg(x) = dgi(x),
"
dfk(g(x))
Thus
df1 g(x)
/\
iq=l
(ih
iq)
dgi1(x)
< jq
/\
/\
dgiq(x).
/\ . . . /\
dgiq(x)= 0.
jq) is
(j1,
7T[j] be the
dr g(x)
then
1 j1 <
n q
L TI D;kfk(g(x)) dgi i(x)
i1=l k=l
L
If
we have
n,
let
q-tuples
{ ( <Tj 1 ,
dga'i1(x)
<TE 7T[j]
/\
.'
/\
we have
dga'jq(x)
sgn
<T dgii(x)
/\
/\
dgiq(x).
Thus we get
df1 g(x)
=L
[j]
/\
L
crerr[j]
/\
sgn
dr g(x)
<T
k=l
/\
/\
dgiq(x).
a( f 1,
a(tji,
.r)
(g(x))
tjq)
,
VaE En let
n
us write
n
a= L ake' = L ake
k
k
k=I
k=I
For every j E
( 1, n)
of Corollary
formula
L w'it ....,;k(x)dyit
/\
(1, n)}.
(1, n)}
Thus from
/\ dyik .
(9.6.7')
[j]
(1, n),
equation
y(a)
y1(a)e
1
j=l
Then we have
n
y(a) =
yi(a)e'; =a.
j=l
9.6.6
dx'l /\ . . . /\ dx'q
(9.6.8)
a(gh,.,giq)
.
i
> dyi1 /\ .
a(t t ' .. ' tiq
'L
[i]
..
/\ dyiq.
(9.6.9)
J acobian
a(gi1,.. .,giq)
a(tii, .. , ti q)
L
(i]
['L
[j]
Now, by Corollary
.;,;;
n}
(9.6.9)
9.6.4,
in
a
(9.6.7)
we get
(ri:: : : : ':;) J
'
'
'
dyii /\ . . . /\ dyiq.
(9.6.7')
with equation
I
we get
w it ;...,iq (x) =
L
[j]
t
a (gi ' giq)
.
a(tit ' ... ' tiq)
(9.6.10)
9.6
I 447
L
[j)
/\ dxik.
(9.6.11)
dw(x)
L dw'iJ,...,;q(x)
/\ dyi i /\
/\
dyiq.
[j]
Proof.
If
is of class
(9.6. l 0) we get
dw i1,...,;q(x)
L
[j)
'gi )
dw;l,. . .,;q(x).
a(ti1, ... 'tiq)
a(gii,
7.
L
rn
(9.6.9) we get
q
dw';1,...,;q(x) /\ dyi1 /\ . .. /\ dy i
-
-L
[j)
dwh ...,;q(x) /\
[iJ
.
.
a (gJ1,.. ,gJq)
q
dyi1 /\ ... /\ dyi
a< t i1' ' t iq)
dw;1,...,;q(x) /\ dxi1 /\
dw(x).
[j)
/\ dxiq
dw(x)= dw1(x)
dwdx) = D1wk(x) dx1
Since
w is
/\
dx1
dw2(x)
D 2wk(x) dx2,
/\
dx2
dx2
/\
We have already seen the coefficient of the right side in Section 9.3.
Suppose I/I is a function of class
interval
I= [O, l]
[O, l],
C2
and let
w.
<p = I/III.
&R,(<p)
Suppose that
is
Vx
!0,j<P(x) > 0
and if
<plI0
<P(/)
[D1w2(x)-D2w1(x)] dx.
J dw= J
IP
iJq;
<p 0
w,
'Y in formula
(9.3.12).
If
has an open
w(x)= L wk(x) dx k.
k=l
1
domain and is of class C ,
then
/\
k
dx .
From_,....the
Vx
previous
Vx
o(w)
formula
E
.(w),
dw(x)= 0.
9.6.9 Proposition. (a) If w and
class C 1, then on their common domain
11
(9.6.12)
(9.6.13)
d(w
/\ v)(x)
= dw(x)
(9.6.14)
Proof.
dw(x)
Hence
=LL L D,
d2 w(x)
Since dx8 /\
/\ dxii /\
[JJ r=l
fil r=l
dxr
s=l
-
dxr
r
D rw i1, .. .ik(x)dx /\ dx /\ dxi1
/\ dxik.
/\
/\ dxik.
= f(x)dxi1 /\
v(x) = g (x)dxit /\
w(x)
where
d(w
/\ dx;P,
/\ dx iq,
/\ v)(x)
=
d(Jg)(x)
/\ dxi1 /\
k
Dkf(x) dx /\ dxi1
+ f(x)
11
L Dkg(x) dxk
k=l
/\ dx;P
/\ dxii /\
/\
/\ dxip
/\ dxi1 /\
/\ dxiq
/\ v(x)
/\ dx;P /\ dxi1 /\
/\ dxiq.
we get
dx k /\ dxi1
/\ dxiq
(\ . . . (\ dxip (\ dx k (\ dxii (\ .
= (-})P dxi1
(\ d,xiq.
Hence
d(w
= w(x) +
v)(x) = w(x) /\
(w + v)(x)
v(x) ,
(w
v(x).
/\
cp*w(x)
w0cp(x)
/\
iq
dx 0cp(x).
(9.6.15)
[j]
We shall leave it as an exercise for the reader to show that cp*w is inde
pendent of the particular representation we use for w.
Proof.
cp*dw.
The proofs of the first three statements are rather easy and
cp*w(x)
L wi1...,iq
cp(x) dxi1
cp(x) /\ . . . /\ d,xiq
cp(x).
lj]
Thus from formulas (9.6. l 2), (9.6. l 3), and (9.6. l 4) we get
i
dcp*w(x)=L dwi1...,iq 0cp(x) /\ dx 1
cp(x) /\
/\
d,xiq
[j]
cp(x).
k
dwi1.-.iq(x)= L Dkwi1,,iq(x) dx '
. k=I
and thus
cp*dw it.,iq(x)
k
L Dkwi1,- ..,iq(cp(x)) dx 0cp(x)
k=l
n
dwit..,iq 0cp(x).
Hence from (a) and (b) and the previous equality we get
i
iq
/\ dx ]
L cp*dwit.--.iq(x) /\ cp*[dx i /\
i
l l
iq
=L dw i}.--.iq cp(x) /\ dxii cp(x) /\
/\ d,x
cp*dw(x)
[j]
=dcp*w(x).
This completes the proof.
cp (x)
D Exercises
I.
Prove that
M( V)
2. Complete the proof of Theorem 9.6.3; that is, show that A(V)
is a real algebra and that A k( V) can be isomorphically embedded into
A(V). Can we consider V as embedded in A (V)?
3. If V is a real finite-dimensional vector space and {e :k E(I,n)}
k
is a basis for V, show that there exists a linear functional >..i, acting on V,
so that
j =;t. k,
j = k.
4.
5.
7.
Suppose that
f..i =
2: au,i,
jE(l,n),
i=l
whereVjE(l,n),A.iand ,1are in A1(V),andVi,j E(I,n),a;1 ER.
Show that
A.1/\
/\ ,".
E".
df1(x)/\
/\dfk(x)
0.
9. 7
9.4.6
dw = d27J =
9.4.6
star-shaped
regions.
Definition. A set S CE" is called star-shaped 3a E S so that
the set {y: y= (1- t)a + tx, t E [O, l]} is contained in S.
9.7.2
Vx
E S,
It is clear that every convex set is star-shaped, but of course not every
star-shaped set is convex. We shall now state and prove the analogue
9.4.6.
of Theorem
9. 7.3
Theorem.
the function
'(w)
is star-shaped,
defined on
'(w)
dw
Supposing that
dw
s(x, t)
dw(s(x, t))
ds(x, t)= 0.
(9.7.1)
s(x, t)
dwu1
s(x, t)
dxi1
s(x, t)
[j]
Now,
dxi1
s(x, t)
= [xii dt + t dxii]
dxik
dxik
s(x, t).
(9.7.2)
and thus
s(x, t)
A
[xik dt + t dxik]
i=l
(9.7.3)
where the caret over
dxi;
9.7
If we use
tk
(9.7.3)
and
(9.7.4)
LL (-I)i-txi; dwc;1(tx)
fj] .i=l
tk
in
aw(j](tx)
at
(9.7.2),
/\ dxii /\
at
+ t k+i
_.......__
/\ d,xii /\
(9.7.4)
(9.7.1)
/\ dt /\ dxii /\
L awc;i(tx) dt
ui
dt+t dwui(tx) .
"d,x i k
we get
/\ dxik
L dwrn(tx) /\
[j]
dxii /\
/\ d,xik
O.
Now, the last term on the left is tk+i dw(tx), which is zero since dw(tx)
=
we get
k
LL
( -1) i-txi; dw1n(tx)
[j] i=l
/\ dxi1 /\
=
......--:-..
/\ dx3i /\
L aw(j]Ctx>
at
ui
dxi1 /\
/\ dxi k /\ dt
.. .
/\ d,xik /\ dt.
(9.7.s>
Let us set
71(x)
(-l)i-t
[j] i=l
[ J(
tk-t w1n(tx)x1i dt
/\ dxJi /\ . . . /\ dxik.
dxii /\
[J:
tk-1wrn(tx)xii dt
J
[J:
tk-1w c;1(tx) dt
dxi;
dxm
J:
m=I
[Jo{
tkx1iDmwrn(tx) dt
Consequently, we get
d71(x)
L
[j]
f
Jo
tk
[f
[2:
[j) i=I
tk-1wu1(tx) dt
dxi1 /\
(-I)i-xi;dw1n(tx) /\ dxii /\
/\
d'): /\
/\ dxi k
dt.
where the last integral must be given the obvious interpretation. Now,
integrating by parts, we get
(1 k
t -lw
Jo
[j]
(tx)dt
[j]
(1 k awui(tx)
(x) t
dt
.
at
Jo
Putting this into the expression for d11 and using (9.7.5) we get
d11(x)
w(x) .
tx. In case
0, that is,
a # 0, set
v(x)
w(x
+a),
x E .B(w)
a.
(x
- a), we get
D Exercises
1.
L w;,.12(x) dxii
[j]
/\ dxi2
D3w1.2(x) -D2w1,3(x)
2.
+D1w2,3(x)
C1 defined on E3 by
w(x,y,z)
3.
d11.
Suppose
w is exact.
9.8
Suppose that
5.
MANIFOLDS I 455
w /\ dw = 0.
9.8
MANIFOLDS
9.8.1
Definition.
E }.
(9.8.1)
If the collection <I> of all m-dimensional charts on M is an atlas forM, then the
ordered pair (M, <I>) is called an m-dimensional topological manifold. Jn the
latter case <I> is called the continuous structure for M, and Mis called the trace
of the manifold.
For the sake of simplicity we shall usually designate a manifold by its
trace M rather than by the ordered pair consisting ofM and the struc
ture for M. Generally speaking, topological manifolds are not "smooth
enough" to be able to carry out very much analysis on them. Thus it
is necessary to describe classes of differentiable manifolds.
oo
also. If we replace
we call M a regular analytic manifold. For the sake of rounding out the
terminology we can designate a topological manifold as a
If M is a
<p-1
"'
C0 manifold.
456
d<p(,(x))
Now, at every point
d,(x)
,(x), d<p(,(x))
dijJ(x).
has rank
rule we can solve for the entries of the Jacobian matrix of, in a neigh
borhood of
and the partials of 'P and ijJ. We can then proceed as in Corollary
7.5.6. We shall ask the reader to give all the precise details in an exercise.
Let us look at some simple examples. Let us take M as the unit circle
in
2,M
{x: lxl
1}.
The functions
t
t
E
E
]-7T, 1T[,
]0,27T(,
3;
that is,
S2
{ixl: x
3 & lxl
= l}. On
S2 be the
S2 we shall
V/ = {x: x
V;{x: x
=
E2
E
E
1T;
taking
V;
U;
in
as follows:
1T1(x) = (x2,x3),
1T (x) (x1,xa) ,
2
1T 3(x) (x1,x2) .
=
It is a very easy matter to check that these are one-to-one functions and
the inverse functions are of class C''' with nonsingular differentials at
each point of their respective domains. Thus
S2
is a regular, two
9.8
MANIFOLDS J 457
an
Ck
manifold is orientable. As an
band. We shall
first describe the trace of this manifold geometrically, which will enable
us to get a parametric representation for it.
Let us take a circle of radius 2 in the
(x, y)
plane as represented by
the dashed line in Fig. 9.8. 1. Take a line segment of length 2 and keep
z
FIGURE 9.8.1
its center on this circle. Starting with the line segment on the x-axis in
a vertical position, move it continuously around the circle and rotate
swept out by this moving line is the trace of the Mobius band.
[O,27T] to E3
2(cos
c(8) =
8,
sin
8, 0) .
d) / Id) I
= (-sin
c(8)
is, by definition,
8, cos 8, O).
c(8)
given by
at the point
c(8).
It is
given by
{(x,y,z): -x
8+y
sin
cos
8 = O}.
(x, y,z)
in
E3
x(r, v,)
y(r, v,)
Z( T, V,)
where
c(8)
is the collection of
given by
]-1,1(, v
[O,7T], and
[O,27T].
(9.8.2)
lf we fix, say=
8,
at
c(8).
If we fix
c(8)
and allow
c(8). If we
0 and v(27T)=
so that
v(O)
make
7T,
some con
then we have a
and
v(8)
8/2.
resulting function is
E
[0,27T]},
and the
range of this function is the trace of the Mobius band. However, just
using this one function will not justify the fact that this point set in
is the trace of a C"' manifold, since the function does not have an open
domain in
E2,
E2
to show that this set is the trace of a manifold. Let us consider the
(
(2
cp1(r,8)= 2
sin
cp2(r,8)
sin
cp3(T,8)
COS
)
U
cos
sin
(9.8.3)
9.8
ip2 be ip
]-7T/2, 7T/2 [. The functions ip1
and ip2 are C"' homeomorphisms and (ip1) U (ip2) is all the trace of
the Mobius band. Further, it is a rather easy matter to check that dip1
and dcp2 are nonsingular at every point of '(ip1) and '(ip2), respec
Let
ip1
be
ip
MANIFOLDS I 459
The function
U2 and
'/)2-l oip1('T,8)
(-T,8-27T),
(T,8) E U1
'1'
ip1
and
ip2
is an oriented struc
1
('T,8) for which 3l/J E '1' so that
1
('T,8) E '(l/J- ip1) and J.i,-1.p, ('T, 8) > 0. Also let Q be the set of all
1
1
points ('T, 8) for which 3tjJ E '1' so that ('T, 8) E '(tjJip1) and
1
]w- <P1 ('T,8) < 0. It is clear that P and Q are open. Since dip1 (x) is
1
always nonsingular and since VtjJ E '1' , dtjJ(x) is nonsingular, it follows
1
ip1 never vanishes. Hence the union of P
that the Jacobian of tjJand Qis '(ipi).
1
1
Since '11 is an oriented structure, P n Q 0. For, if tjJ, t/J1 E '1'
1
and tjJip1 has a positive [negative] Jacobian at ('T,8), it follows
1
1
tjJ1- tjJ tjJ- ip1 has a positive [negative] Jacobian at
that tjJ1-i ip1
('T,8). Finally, P and Q are both nonvoid. For suppose that V is a
connected open neighborhood of (0, 0) and ip2(V) is in the domain
1
1
of tjJ- , where tjJ E '1' Clearly, there is a neighborhood W C U1
]-1, l[ X ]37T/2,27T[ so that ip1(W) C ip2(V). Now, let ('T1,81)E
1
V n J0(ip2) with 81 > 0. Then the Jacobian of tjJip1 at ('Ti. 81) is
1
the same as the Jacobian of tjJip2 at this point, since ip1 ip2 in a
neighborhood of ('T1, 81). On the other hand, let ('T2, 82) E W. Now,
1
1
1
1
tjJ- ip1 tjJ- ip2 ip2- 'Pi. and it follows that the Jacobian of tjJ- ip1
1
i
at (T2,82) is negative the Jacobian of tjJip2 at ip2- ip1(T2,82), since
i
we have shown that the Jacobian of ip2ip1 is -1 at (T2, 82). Since Vis
1
i
connected, ip2ip1('T2, 82) E V and the Jacobian of tjJ- ip2 never
1
vanishes,- it follows that the Jacobian of tjJip2 at ('T1,81) has the
1
same sign as the Jacobian of this function at ip2ip1(T2,82). Thus
1
ip1 at ('Ti. 81) is negative the Jacobian of this func
the Jacobian of tjJtion at (T2, 82). This shows that P and Qare not void.
ture. Let P be the set of all points
0
J0(ip1)
'(ip1)
460 I THE
OPPOSITE ORIENTATIONS
If qrk is an orientation for M, then there is another orientation for M
that is canonically associated with qrk and which we label _qrk_ It can
be called the orientation for M that is opposite or negative to the
prientation qrk_ Let us describe -'l'k. Suppose that M is an n-dimen
sional manifold - so the elements of '}lk are defined on open subsets of
En. There is an atlas '11 C qrk so that every element of qr has its domain the
open unit ball. Indeed, Vr/J E '}lk and Vx E JF>(rjJ) let B(x, Px) be a
ball contained in JF>(rjJ). Let Tx be the function with domain B(O, I)
defined by
Tx(t) =pxt+ x.
It is clear that T x is a C"" homeomorphism that takes B (0, 1) onto
B(x , px ) . Let us set
rfix=r/J0Tx.
Since Tx has a positive Jacobian, it is clear that rfix E '}lk. Now take
It is clear that '11 is an atlas with the prescribed properties: All elements
in qr have domain B(O, 1).
For every rjJ E qr let us set
(x)= r/J-1
!Ji_(x)
it follows thatj(x) < 0, so that rfi-fi- '11. On the other hand, if cp E '11
and we set
D Exercises
I. Give all the details of the fact that if
Vcp, rjJ E <I>k, the transition function
cp-1 0!Ji
is a C k manifold, then
9.9
is a
Ck
function.
2.
C00
3.
4.
regular, orientable
5.
C'"'
manifold.
manifold.
C1
there exist exactly two oriented structures for M. In other words, there
exist exactly two orientations for M.
6.
Show that the results of Exercise 5 are not valid if M is not con
9.9
INTEGRATION ON MANIFOLDS
partition of unity.
We
e a(r)
The function
{el/(r2-a2>
lrl <a,
lrl a.
lflk (x)
lflk is
x E K,
The function
therefore, if
of class
C00,
p
:L
k =I
Let us setB
epk ( l x - xk l) .
vanishes only for
Ix - xk I Pk>
l/lk (x) ,e o.
1Tk (x)
and
Then
9.9.1
Definition. Let K be a bounded set in En. A Ck partition of unity
for K is a finite collection { 1T J : j E (I ,p)} of real-valued Ck functions each
having as domain an open set in En containing K and having the following
properties:
(a) 7TJ 0, Vj E (l,p).
p
( b)
J=l
1T;(x)
l, Vx EK.
the
9.9.2
K, and
Definition.
M
M, then when we speak of the boundary
of K we shall mean those points x E M so that every relatively open set in
M that contains x contains points of K as well as points in M\K. We shall
over suitable sets in oriented manifolds. Let us first remark that if
is a manifold in En and K C
9.9.4
in
En
9.9
Kin Mis compact and Vcp E <I>k the set cp-1(aK) has m-dimensional Lebesgue
measure zero.
r
JK
i
i=l
r
Jx
1T;W,
Let us make some remarks about this definition. First, the Riemann
integral used to define the integral of the differential form 7T; w
over K exists. This is one point where we make use of the fact that
{ rr;: j E (1, p)} is subordinate to 'I'1 Indeed, if the function under
the integral sign is defined to be zero outside cp;-1 (K), then the extended
function is continuous except possibly at the points in the set cp;-1(aK).
Since, by hypothesis, this set has Lebesgue measure zero, the result
follows from Theorem 8.3. 7. Note that the definition of the integral
of the differential form 1T;W coincides with the Definition 9.5.4, although
in the latter definition we did not demand that the function
rank
cp
have
m.
to '111
Proof.
Suppose
{7r;:j
(l,p)}
and
{cr;:j
(l,q)}
are partitions
has support in
7T;IM
has
(t/I;).
f;(t)='TT; 0 cp;(t) L
W[i] 0 cp;(t)
lil
rp;-l<K>
f;(t)dt
qJ
iJ
:L
i=l
=
i=l
rp;-l<K>
CT; 0 cp;(t)fj(t) dt
(9.9.1)
1T;CT;W.
The last equality follows from the first formula in Definition 9.9.6,
since we can think of
'TT;
cr;w.
Let us set
g(S)
f0g-1(t)}0-1(t)dt=
f J(t)dt,
S
(9.9.2)
i=l
l/J;
-l(
CT;
..-1(t)dt.
l/J1(t)f; 0 g;;-1( t) }gtJ
(9.9.3)
Note that we can use l/J;-1(K) in place of l/J;-1(K n K0), since the
support of the integrand is l/J;-1 (K0). It is at this point that we make essen
tial use of the fact that our partitions of unity are subordinate to 'I' 1 Now,
(9.9.4)
= 1T; 0 l/J;(l) " W[k]
fi
l/J;(t)
iJ(rp/t,
Further, since
rp/m)
(giJ-l(t))}g ..-1(t)
lJ
p fK
j=l
1T3W=
L Lq
p JK
CT;1T;W
=l i=l
q 11 r 1T;<T1W=
q r CT ;W.
JK
JK
= j
The first equality comes from the above computations. The first and
third sums are equal by formula (9.9.1). The last equality is, of course,
obtained by interchanging the roles of {7T;:j E (l,p)} and {er;:
j E ( 1, q)} in the previous proof. This completes the proof.
and we
L,uK2 w L, w L. w.
(9.9.7)
y:>(A)
J 'P*w,
(9.9.8)
By definition
y:>(AJ
JL
[j)
Wen
ip(t)
a ('Ph'
. . . ' 'Pim)
(t) dt .
l
o(t ' . . . ' em)
(9.9.9)
'{)*w(t)
L wu1
[j]
If in Proposition 9.6.6
(\ ... (\
ip(t) diph(t)
/\
/\
dipim(t).
.('Ph,
, ipim ) and g to be the
Em onto itself we get
we take f
identity transformation of
diph(t)
dipim(t)
a ( ipi
=
. . . ipim)
'
(t) dti
a(ti, . . ,tm)
i,
(\ ... (\
dt m.
9.10
Thus
REMARK:
dq;(x)
is of rank
m.
manifold.
surface,
as given in Section 9.5, we would have only had to make the assump
tion that 'P is a
C1
function.
D Exercises
En
I.
2.
3.
Suppose
(M, c:I>1)
M.
9.10
STOKES' THEOREM
En
given by
/={x:xkE[O,l],
Let I0k be the
(n
Vk E (l, n)}
/1k be
the
(n -
I 1k={x: x E I & xk = 1} .
The space
En
C'
mani
En
structure for
468 I THE
(n -
1)-dimensional interval J0
'Pk(t)
where the
e1
k-1
j=l
J=k+l
2: tiei + L
ti-lei,
(9.10.1)
(9.10.2)
If '1'1 is the oriented structure on Nk that contains t/Jk, then (Nk '1'1)
is an oriented manifold and
w(x)
L wr;1(x)
n- 1
whose
rJxi1 /\ /\ dxin-1,
[j]
then, by definition,
dw(x)
L dw1il(x)
[j)
LL
[j]
k=l
/\ dxit /\ /\ rJxin-1
n,
it follows that
1),
Hence
dw(x)
L (-I)k+IDkwk(x) dx1
k=l
/\ /\ dx",
wk(x)
If
W1.--.k-1.k+t..n(x)
Vx E E",j,(x)
l dw
I
k=I
i
where xk= (x1,
i [J:
Dkwdx)dx=
xk-1, xk+l,
Dkwk(x)dxk dxk>
E /}. Thus
From the definition of the functions 1/Jk and cpk, and the definition
of a surface integral given in section 9.9, we get
1ik
w=
L W[j]
[j]
lfJk(t)
( l , n - 1 ) }.
Now, if (ji .
'
1/lk;n-1 )
(t)= O '
1n-I)
I1k
w=
w=
{ wk
J lk
Ik
wk
1/Jdxk) dxk.
To/<
cpk(xk) d:xk.
Consequently,
and thus
dw=
k=l
(-l)k+1
[f
11k
w-
f ]
1
n
= :L :L (-l)i+k
w.
f; k
j =O k=I
Iok
(9.10.3)
1 n
a1= :L :L (-l)i+kf/,
i=O k=I
(9.10.4)
and call this chain the oriented boundary of/. We shall leave to the
reader the simple task of giving a precise definition of a chain in this
(9.10.3).
r dw
11
=.
r w,
Jar
(9.10.5)
where al is the chain given by (9.10.4) and the integral on the right is defined
as the last sum in (9.10.3).
To be able to obtain a formula like
iJip(/)
l IPW
0
(
LL
J=O k=l
(9.10.6)
-l)Hkip(J/)'
J=O k=I
(9.10.7)
JIPUJk) w.
(9.10.8)
r dip*w
Jr
r ip*w
Ja1
I n
J=O k=I
We claim that
r ip*w
J1;k
Let us show this for
1,
Jjk
ip*w.
f1PU;k> w.
(9.10.9)
(9.10.10)
=
being similar. As we
(Nk,
9.10
{ (cp
En-1;
then
cp(I1k)
I/Id *w.
cp*w = f lflk*
J cp*w = J
11k
J
o/Jk(J)
= L (cp
cp*w
lflk)*w.
Thus the two integrals in (9.10.10) are the same. Consequently, the
right side of (9.10.8) is the same as the right side of (9.10.9).
On the other hand, from Proposition 9.6. lO(d) we get
From Proposition 9.9.10 we get
f dcp*w= r cp*dw=J
J1
<p(J)
dcp*w=cp*dw.
dw.
Thus using the last equality, (9.10.8), (9.10.9), and (9.10.10) we see
that (9.10.6) is valid.
REMARK:
has rank
cp
is a homeomorphism and
dcp(x)
cp
is of class
C2
used as given in Section 9.5. See the remark after the proof of Propo
sition 9.9.10.
Hn= {x:
XE
En &xn
En-1
O}.
En
given
x E (cp).
.B(cp) n Hn=cp-1(K) .
.B(cp) n En-1=cp-1(aK).
se('P)
n aK oF- 0
and conditions (b) and (b') of Definition 9.10.3 are satisfied. Let 'I' be that
collection of all functions 1/1 with the property that 3 'PE <I> so that
1/1 = 'Pl.B('P)
n E"-1.
= (t1,
n-1
k=I
Now, dip(t,O) has rank n, and thus dip(t,O) IE"-1 must have rank n - I.
This means that dl/l(t) has rank n - I. By conditions (a) and (b') of
Definition 9.10.3,
aK = u
j= 1,2.
Let us set
.(t) = 1/12 -l
v(t,t") = i/)2 -l
0
0
'Pt (t,tn).
Vj,k E(l,n-1).
On the other hand, v"(t,0)
0, so that
D;v"(t,0) = 0,
Thus it follows that
VjE(l,n-1).
9.10
}v(t,O) =Dnv"(t,O)j,,(t).
Now, since
(M, <J>k)
Dnv"(t, O)
v"(t,t")
> 0 when
tn
'IJ'k
regular,
( aK, 'IJ'k) is a
(n - 1)-dimensional, oriented Ck manifold. For reasons that
(aK, (-1) "'IJ'k) r:ather than (aK, 'IJ'k). The reader is advised to reread
the last two paragraphs of Section 9.8 to refresh his memory on the
notation
( - I) n'IJ'k.
the structure on
'-Pa(t) = (t3 cos t1,t3 sin t1 cos t2,t3 sin t1 sin t2).
The Jacobian of
'Pa at t is
l<f!a ( t ) = (t3)2 sin t1
and thus
Va
R, 'Pa
7r(x)
E <J>k. If
> 0,
a(t)
7T <P
> 0.
Let
Then
]-a-27T,-a[
1>( I/la)
H3,
l/Ja-'(aK) = 1>(1/Ja)
l/Ja-1 (K)
{l/Ja: a
R}
(1/la)
<I>k).
'Pa./ (t)
(9.10.11)
>
0.
{cpa. p : a, ,B E R} belong to
]O, R [, let cp be defined on [O,2rr] X [O,2rr] X (0, r]
by the right side of (9.10.11), and set K=92.(cp). K is an anchor ring
in 3 and aK is a torus (Fig. 9.10.1) (see Exercise 4 of Section 9.5).
This shows that the collection of functions
FIGURE 9.10.1
1/Ja,P='Pa,P
0 T
1/la,p-1(K) = (1/la.P)
H3 ,
9.10
Since
(3' <t>k).
r dw=
JK
aK
w.
(9.10.12)
Proof.
For each
9.10.2,
Note that if j E
r d'lT;W =
JK
Since
aK
'lT;W.
ordinate to
n
w( x)= L 'lT; ( x)w( x)
j=i
for all x in some open set in EP that contains K. Thus, for the x in this
open set,
dw(x) =
drr;w(x).
j=l
dw =
'i:, l
i=l
drr;w.
Thus we see that formula (9.10.12) holds and the proof is complete.
The theorem we have just proved does not include Theorem 9.10.1
or 9.10.2 as special cases. Aside from the relatively minor fact that al
and acp(I), when considered as chains, are logically different from
manifolds, there is the more important fact that a/ and acp(/) when
considered as boundaries of I and cp(J), respectively, cannot support
structures that will make them into regular manifolds. The trouble,
of course, lies in the fact that these boundaries have sharp corners.
These things can be rectified by considering "piecewise Ck manifolds,"
but we shall not go into these matters.
D Exercises
1.
L
2.
dw=O.
n.
L
Show that
3.
w=O.
is closed.
position vectors (x, y, z). The flow is said to be irrotational if and only
if v is of class C1 and
JY
( v1 dx + v2 dy + v3 dz) = 0,
for every smooth, regular, oriented, closed curve yin E3. Show that the
flow is irrotational if and only if there exists a real-valued function cp
of class C2 so that
v=-'ilcp.
9.10
4.
Let
n-dimensional
for E" which contains the identity transformation. Show that the Jordan
content of K is given by
IKI
/\
d;J" /\
/\ dx".
iJK j=l
Suppose
Suppose
oriented, n-dimensional
7.
8.
sions.
v
(x, y, z). Suppose
also that p(x, y, z, t) is the density of the fluid at the position (x, y, z)
and at the time t. We shall also suppose that the flow has no sinks or
9.
sources; that is, the rate of increase of mass of the fluid which lies inside
any fixed, closed surface equals the rate at which the mass flows through
the surface. Assuming that
equation of continuity,
dp
at
IO.
0.
condition (b).
SYMBOLS
PAGE
2
negation
=::::}
&
conjunction, and
{=}
for every
(x)(Q(x))
for every
there exists
16, 20
element of
16, 18
intersection
17
{x: Q(x)}
AXB
A\B
ACB
An B
n {A:A E v(..}
AUB
U {A:A E vt}
Ac
17
not an element of
21
0
JFJ(R)
5ll(R)
R -
R-1(A)
null set
24
F0G
24
!IA
f(A)
16
17
17
17
17
17
17
22
22
22
22
24
24
x, Q(x) is true
Cartesian product
A and not in B
A is contained in B
elements in
inclusion,
R
A under R
equivalence relation
the functionf restricted to the set
26
31
the integers
35
the rationals
39
Q
Q+
(m, n)
45
No
47
R
R+
the reals
48
52, 69
lim
limit
53
(x(n)), (xn)
a sequence
36
m and n
479
480 I SYMBOLS
54, 235
lxl
63
g.l.b., inf
63
1.u.b., sup
69
[a,b], ]a,b[
[a,b[, ]a,b]
I(x)
half-open intervals
vector
69
69
71
(AC)
77
91
lim,lim
98
CT
98
99
132
133
133
138
138
183, 354
183, 354
183, 354
184, 355
184, 355
184
(a)n
L ak
k=O
(a, u(a))
2: <ak)
k=O
n
II ak
k=O
(a, Il(a))
II (ak)
k=O
f', df /dx, DJ
pn>, djn/dx", D nJ
A
IAI
A* A
R1(A, {xk})
i5,(A)
Q,(A)
J:
r
f
finite sum
infinite series
infinite series
finite product
infinite product
infinite product
derivative off
nth order derivative off
decomposition of an interval
norm of a decomposition
A* is a refinement of A
Riemann sum forf
J(x) dx
f(x) dx
f(x) dx
_a
185
201
( f, /(J ))
improper integral
211
Riemann-Stieltjes integral
J(x) dg(x)
236
xy
239
240
Ll.
L EBLl.
242
B(a,r)
247
247
{3A
Ao
259
[ai!]
matrix
orthogonal complement of
LJ.
a
and radius r
SYMBOLS I 481
263
transpose of T
T'
sign of
275
sgn
276
det
309
D.J
310
af
Dk,-k
310
315
316
325
325
328
353
356
356
[a;;]
determinant of
[a;;]
directional derivative
partial derivative
ax
differential of f at
df(a)
],(a)
dxk
Du Dvf
Jacobian of f at
differential of projection
akf
ax' ... axk
Ck, C""
d(I)
I dx
I f(x) dx
fA
f(x)
) <ix
higher-order partial
classes of differentiable functions
diameter of
361
'X( ),K(A)
IAI
Jordan content of
361
XA
characteristic function of
393,431
Jacobian of f at
361
399,444
400
Jy
410
440
444
>.. /\ ,
(j]
A
A
ip *w, w
xk
/C x
'P
447
dw(x )
463
462,473
aK
Jordan domain
relative boundary; also oriented
boundary of a Jordan domain
INDEX
Boundary, 247
Bounded set, 46
left, right, 85
Cantor, G., 44
closed, 300
function, 126
intersection theorem, 79
exterior, 440
set, 121
Cartesian product, 16
Grassman, 440
Cauchy, A., 45
separating, 300
tensor, 439
Alternating multilinear functional,
277,439
Buajakovsky-Schwarz inequality,
217, 236
condensation test, 113
net, 357
sequence, 50
Chain, 401,469
Atlas, 455
oriented, 457
Axioms, 2
for the natural numbers, 27
Axiom of choice, 71
Ball, 242
Component, 245
Basis, 230
Composition of functions, 24
ordered, 258
Bernstein, S., 178,180
Conjunction, 3
Connected set, 244
polynomials, 182
arcwise, 251
theorem, 162
simply, 420
Content,
of an interval, 353
482
INDEX 1483
Continuity, 73, 249
uniform, 80, 250
Convex sets, 245
Corollaries, 2
Cosine, 173
addition formula, 173
Countability, 38, 41
Covering, 77, 242
open, 77, 242
Disjunction, 3
Curves, 398
Domain
closed, 399
of a function, 23
homotopic, 419
of a relation, 22
oriented, 398
piecewise regular and smooth, 399
Eigenvalue, eigenvector, 270
Equality, 12, 16
D' Alembert's ratio test, 11 l
Equicontinuity, 297
Equivalence, 5
integrals, 184
integrable functions, 184
relation, 24
Euclidean spaces, 235
Existential quantifier, 7
Decimal expansions, l l 8
terminating, 120
Decompositions of intervals, 183
generalized, 87
Exterior product, 441
Denumerable, 38
Finite set, 41
Functionals,
Derivative, 138
higher order, 138
of the logarithm, 139
of the trigonometric and inverse
analytic, 163
176
Cantor, 126
characteristic, 36 l
484 I INDEX
Functions (cont.)
equicontinuous, 297
Geometric-arithmetic means
inequality, 349, 35 l
Geometric series, l l 0
Gradient, 316
harmonic, 429
Graph, 305
linear, 257
Lipschitz, 363
logarithm, 139
maximum and local maximum of,
81, 345
minimum and local minimum of,
81, 345
monotone, 84
theorem, 80
Hessian, 346
Homeomorphism, 250
Higher-order differences, 152
differentials, 325, 328
partial derivatives, 325
multivalued, 23
one-to-one, 23
open, 249
oscillation of, 366
periodic, 175
Implication, 3
permutations, 274
polynomial, 75
product of, 72
quotient of, 72
convergent, 202
Riemann-Darboux integrable,
divergent, 202
184, 371
saltus, 224
Inclusion of sets, 17
Index, 482
Induction, 27
Inductive
definition, 87
set, 57
Inequalities,
Cauchy-Bunjakovsky-Schwarz,
217, 236
geometric-arithmetic means, 349,
351
Holder, 352
Minkowski, 352
triangle, 36, 237
Inference, 6
151
INDEXl485
Infinite series, 98
Abel's test, 114
Intervals (cont.)
higher dimensional, 353
absolutely convergent, l 0 l
Intersection of sets, 17
conditionally convergent, l 03
relation, 22, 23
convergent, 99
Isomorphism, 32
Jacobian, 315
matrix, 315
harmonic, 100
Jump discontinuity, 85
p-series, 113
power, 165
Pringsheim's theorem, l 09
Lagrange,
ratio test, 11 l
rearrangement, 10 l
Riemann's theorem, l 04
Infimum, 63
Infinite set, 41
Integers, 31, 60
Lemmas, 2
215
Integrals,
Limit,
Darboux-Stieltjes, 211
line, 400
generalized, 356
371
Riemann-Stieltjes, 211
Integration, 183, 353
of differential forms, 396
iterated, 374
on manifolds, 461
functional, 262
by parts, 192
manifold, 230
space, 233
subspace, 230
Linear transformations, 257
486 IINDEX
Linear transformations (cont.)
eigenvalues and eigenvectors of,
270
matrix representation of, 258
nonsingular, 257
norm of, 261
orthogonal, 271
projections, 266
proper values and proper vectors
of, 270
rank of, 257
symmetric, 269
transpose of, 263
Linearly dependent and inde
pendent vectors, 229
Lipschitz functions, 363
Logarithm, 139
natural, 142
Open,
ball, 242
covering, 77, 242
function or map, 249
set, 77, 242
Order preserving, 36
Ordered,
basis, 258
n-tuple, 228
pair, 16
Oriented,
curves, 398
manifolds, 457
surfaces, 432
Orientation,
of a curve, 405
induced, 473
of a manifold, 457
Orthogonal complement, 239
Orthogonal transformations, 271
Oscillation of a function, 366
Outward normal, 414, 434
Jacobian, 315
skew-symmetric, 292
symmetric, 269
transpose of, 264
Maximum, 45, 81, 345
local, 81, 150, 345
Mean Value Theorem, 150, 332
Generalized, 151
Merten's theorem, 168
Mikolas, M., 142
Minimum, 46, 81, 345
local, 81, 150, 345
Minkowski's inequality, 352
Monotone functions, 85
INDEX 1487
Power series (cont.)
Cauchy,50,58
convergent,50, 58
Predicate calculus, 7
of functions, 126
Products, 131
monotone, 62
infinite, 133
subsequence, 53, 58
Projections, 266
Proof, 6
Sets, 16
by contradiction, 14
Propositional calculus, 7
Propositions, 2
Cantor, 122
Pythagorean theorem, 44
convex, 245
countable, 60
covering for, 77, 242
dense, 60
Ratio test, 11 l
denumerable, 38, 60
Rationals, 35, 60
derived, 242
Reals, 44, 56
directed, 356
finite, 41, 60
101, 104
infimum of, 63
Relation, 22
equivalence, 24
reflexive, 24
open, 77
symmetric, 24
perfect, 123
transitive, 24
353
existence of, 197, 371
properties of, 190, 371
Riemann-Stieltjes integrals, 210
existence of, 219
properties of, 212
supremum of, 63
totally bounded, 294
Simply connected sets, 420
Sine, 173
addition formula, 173
Spherical coordinates, 268
Statement, 3
Structure, 455
oriented, 457
Subsequence, 53, 58
Subset, 18
4881 INDEX
Support of a function, 462
Trace (cont.)
Surface, 306
of a manifold, 455
area, 432
integrals, 435
oriented, 432
integrals, 390
Symbols, 479
Truth table, 2, 3
Symmetric,
matrix, 270
transformation, 269
Uncountable, 121
Uniform continuity, 80, 250
Uniform convergence, 126
Tangent,
Union of sets, 1 7
function, 176
Universal quantifier, 7
plane, 309
vector, 306
Tauberian theorems, 171
Taylor's,
Variable, 7
expansion, 163
series, 165
Vectors,
theorem, 330
independent, 229
alternating, 440
Venn diagram, 1 7
Theorems, 2
Weierstrass, K., 47
approximation theorem, 129
M Test, 129
Well ordering, 29
Weyl, H., l