This Topic begins by introducing the gradient of a curve. This concept was
invented by Pierre de Fermat in the 1630s and made rigorous by Sir Isaac Newton and
Gottfried Wilhelm von Leibniz in the 1670s.
The process of finding the gradient by algebra is called differentiation. It is a
powerful mathematical technique and many scientific discoveries of the past three
centuries would have been impossible without it. Newton used these ideas to discover
the Law of Gravity and to find equations describing the orbits of the planets around
the sun.
Differentiation remains a powerful technique today and has many theoretical and
practical applications.
The Topic has 2 chapters:
Chapter 1 explores the rate at which quantities change. It introduces the gradient
of a curve and the rate of change of a function. Examples include motion and
population growth.
Chapter 2 introduces derivatives and differentiation. Derivatives are initially found
from first principles using limits. They are then constructed from known
results using the rules of differentiation for addition, subtraction, multiples,
products, quotients and composite functions. Implicit differentiation is also
introduced. Applications include finding tangents and normals to curves.
Contents
1 Gradients of Curves
1.1 Describing change
1.2 Gradients of curves
1.3 The rate of change of a function
2 Differentiation
2.1 From first principles ...
2.1.1 Terminology and Notation
2.2 Constructing derivatives
2.2.1 Powers
2.2.2 Polynomials
2.2.3 Products and quotients
2.2.4 Composite functions and the chain rule
2.2.5 Implicit Differentiation
A First Principles
B Answers
Chapter 1
Gradients of Curves

1.1 Describing change

How can we describe the rate at which quantities change?

Example (constant velocity, gradient of a line)
The distance-time graph below shows the distance travelled by a car that has
a constant velocity of 15 m/s.¹

[Figure: distance-time graph; distance (m) against time (s)]

Velocity measures how distance changes with time. You can see that the car
travelled 15 m in 1 second, 30 m in 2 seconds, etc. When the velocity is constant,
it can be calculated using the ratio:²

\[
\frac{\Delta\,\text{distance}}{\Delta\,\text{time}}
  = \frac{\text{change in distance}}{\text{change in time}}
  = 15 \text{ m/s}
\]

This ratio is the gradient of the line on the graph, so the constant velocity of
the car can also be thought of as the gradient of a line.
Observe:
• the distance-time graph provides information about velocity
• the distance-time graph is a straight line with gradient 15
• the gradient of a straight line is the same between any two points on the line,
  so the velocity of the car is the same between any two points on the line
• the velocity of the car is 15 m/s

¹ meters per second
² ∆ is an uppercase Greek letter called "Delta". It is used in mathematics to mean "change in".
Example (velocity, acceleration)
Here is the velocity-time graph for the same car ...

[Figure: velocity-time graph; velocity (m/s) against time (s)]

Acceleration measures how velocity changes with time. When the acceleration
is constant, it can be calculated using the ratio:

\[
\frac{\Delta\,\text{velocity}}{\Delta\,\text{time}}
  = \frac{\text{change in velocity}}{\text{change in time}}
  = 0 \text{ m/s}^2
\]

Observe:
• the velocity-time graph provides information about acceleration
• the velocity-time graph is a horizontal line with gradient 0
• the acceleration of the car is 0 m/s².
Example (shape of graph, vertical velocity)
This is the height-time graph of an experimental rocket.

[Figure: height-time graph; height (m) against time (s)]

The rocket climbs to about 120 m and then returns to the ground again. The
graph is curved, so the change in height with time is not constant.

What does the shape of the graph tell us about the vertical velocity of the
rocket? Can the velocity be found from the graph?

Questions like these motivated Fermat, Newton and Leibniz in their exploration
of the gradient of a curve and led to the invention of differentiation.³

The graph shows that the rocket climbs 44 m between t = 0 and t = 1, but
only 5 m between t = 4 and t = 5.

The shape of the curve shows that the rocket slows down as it climbs, until it
reaches its maximum height at t = 5, ... and that it then speeds up as it falls
to the ground.
Example (population growth)
This population-time graph models the growth of aphids on broad bean plants
over 60 days.

[Figure: population-time graph; population against time (d)]

The graph shows that the aphid population grows slowly for the first 20 days,
grows faster from day 20 to day 40, then grows more slowly again and finally
levels off.

The graph shows that the population is growing faster at day 30 than it is at
day 50. But exactly how fast is the population growing on each of these days?
This is another example of the type of question that led to the invention of
differentiation.
Exercise 1.1
1. A car starting from rest travelled the first 300 m in 10 seconds at a constant
velocity.
(a) Represent this on a distance-time graph.
(b) What is the constant velocity of the car?
(c) Draw the corresponding velocity-time graph.
(d) What is the acceleration of the car?

³ Pierre de Fermat (1601 - 1665), Sir Isaac Newton (1643 - 1727), Gottfried Wilhelm von Leibniz
(1646 - 1716)
1.2 Gradients of curves

[Figure: height-time graph of the experimental rocket; height (m) against time (s)]

The second question can be approached by estimating the velocity near t = 2:

• at t = 2 the rocket's height is 78.4 m, and at t = 3 the height is 102.9 m,
  so the vertical velocity between t = 2 and t = 3 is approximately:

\[
\frac{\Delta\,\text{height}}{\Delta\,\text{time}}
  = \frac{\text{change in height}}{\text{change in time}}
  = \frac{102.9 - 78.4}{3 - 2}
  = 24.5 \text{ m/s}
\]

  ... the velocity at t = 2 is about 24.5 m/s.

• at t = 2.5 the height is 91.8 m,
  so the vertical velocity between t = 2 and t = 2.5 is approximately:

\[
\frac{\Delta\,\text{height}}{\Delta\,\text{time}}
  = \frac{\text{change in height}}{\text{change in time}}
  = \frac{91.8 - 78.4}{2.5 - 2}
  = 26.8 \text{ m/s}
\]

Interval                                    ∆time   ∆height   ∆height/∆time
t = 2 (h = 78.4) → t = 3.0 (h = 102.9)      1       24.5      24.5
                 → t = 2.5 (h = 91.8)       0.5     13.4      26.8
                 → t = 2.1 (h = 81.3)       0.1     2.9       29
                 → t = 2.01 (h = 78.7)      0.01    0.3       30

You can see that the velocity of the rocket at t = 2 will be very close to 30 m/s.
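
The estimates in the table can be reproduced numerically. The sketch below is illustrative only: it assumes the rocket's height follows the model h = 49t − 4.9t² quoted later in this Topic, and the function names are hypothetical. Because it works with unrounded heights, its values differ slightly from the rounded figures in the table.

```python
def height(t):
    """Height (m) of the rocket after t seconds, assuming h = 49t - 4.9t^2."""
    return 49 * t - 4.9 * t ** 2


def average_velocity(t0, t1):
    """Change in height divided by change in time between t0 and t1."""
    return (height(t1) - height(t0)) / (t1 - t0)


# Shrink the interval that starts at t = 2 and watch the estimate settle.
for t1 in (3.0, 2.5, 2.1, 2.01):
    print(f"t = 2 -> t = {t1}: {average_velocity(2, t1):.2f} m/s")
```

With unrounded heights the estimates settle near 29.4 m/s, which agrees with "very close to 30 m/s" to the accuracy of the rounded table.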
These estimates can be interpreted as gradients:

[Figure: the height-time curve near the point (2, 78.4), with a chord of gradient ∆h/∆t = 26.8]

The first estimate of the velocity was 24.5 m/s.
... it is the gradient of the line from (2, 78.4) to (3, 102.9).

The second estimate was 26.8 m/s.
... it is the gradient of the line from (2, 78.4) to (2.5, 91.8).

The third estimate was 30 m/s.
... it is the gradient of the line from (2, 78.4) to (2.01, 78.7), and is very close
to the gradient of the straight line that just touches the curve at (2, 78.4).

The velocity of the rocket at t = 2 is equal to the gradient of the straight line that
just touches the curve at t = 2.
Terminology
• The tangent line to a curve is a straight line that just touches the curve.
• The gradient of a curve at a point is the gradient of the tangent line that touches the
  curve at that point.

[Figure: population-time graph for the aphid population; population against time (d)]

In the graph ...
• the gradient of the curve increases from t = 0 to t = 30, so
  ... the growth rate of the population increases from t = 0 to t = 30
• the gradient of the curve stops increasing at t = 30, begins to decrease
  and becomes close to zero after t = 50, so
  ... the growth rate of the population stops increasing at t = 30, begins to
  decrease and becomes close to zero after t = 50.⁴

⁴ The size of the population is still increasing even though the growth rate is decreasing. The
population levels off as the growth rate falls to 0.

Example (changing velocity)
The vertical velocity of an experimental rocket can be found from the gradient
of the height-time graph. When the rocket is climbing the vertical velocity is
positive, and when it is returning the vertical velocity is negative.

[Figure: height-time graph; height (m) against time (s)]

In this graph ...
• the gradient of the curve is positive but reducing from t = 0 to t = 5, so
  ... the velocity of the rocket is positive (climbing) and reducing for 0 ≤ t < 5
• the gradient of the curve is zero at t = 5, so
  ... the (vertical) velocity is zero at t = 5
• the gradient of the curve is negative and growing in size for 5 < t ≤ 10, so
  ... the velocity of the rocket is negative (falling) and growing in size for 5 < t ≤ 10

[Figure: velocity-time graph for the rocket; horizontal axis time (s)]

You can see that:
• the initial velocity is about 50 m/s
• the velocity is positive for 0 ≤ t < 5 (climbing)
• the velocity is zero at t = 5 (maximum height reached)
• the velocity is negative for 5 < t ≤ 10 (returning)
• the velocity is changing at a constant rate which is equal to the gradient
  of the line (−9.8)

Acceleration measures how velocity changes with time. In this example, the
rocket has a vertical acceleration of −9.8 m/s², due to the downward pull of
gravity.
Example (change in height with distance)
The experimental rocket followed a parabolic path. The height (metres) is
given on the y-axis, and the distance travelled (metres) on the x-axis.

[Figure: height-distance graph; height y (metres) against distance travelled x (metres)]

In this height-distance graph, the gradient of the curve can be interpreted as the
rate at which height changes with distance travelled.
Exercise 1.2
1.3 The rate of change of a function

[Figure: graph of f(x) with the chord from (a, f(a)) to (b, f(b)); horizontal step b − a, vertical step f(b) − f(a)]

The average rate of change of the function f(x) from x = a to x = b is

\[
\frac{\Delta f}{\Delta x}
  = \frac{\text{change in } f(x)}{\text{change in } x}
  = \frac{f(b) - f(a)}{b - a}.
\]

This is equal to the gradient of the chord⁶ from (a, f(a)) to (b, f(b)).

The average rate of change across an interval is an approximation to the rate of
change of a function at a point. As the width of the interval [a, b] decreases, the
approximation

\[
\frac{\Delta f}{\Delta x} = \frac{f(b) - f(a)}{b - a}
\]

becomes closer to the rate of change of f(x) at x = a.⁷

You can see that the rate of change of f(x) at x = a is equal to the gradient of the
tangent line to the graph of f(x) at x = a.
The gradient of the curve y = f(x) at a point x = a (the gradient of the
tangent line) measures the rate of change of the function f(x) with respect
to x at the point x = a.

The phrase instantaneous rate of change is often used to emphasise that the rate
of change of the function f(x) is for a specific value of x. This contrasts with the
previous use of average rate of change.

⁶ A chord is a line segment joining two points on a curve.
⁷ The phrase rate of change is used to describe the change in f(x) as x changes, even though x
may not be a time variable.
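
The idea that chord gradients approach the rate of change at a point can be tried out numerically. The sketch below is an illustration, not part of the original text: the function f(x) = x² − 3x and the names used are hypothetical choices made for this example.

```python
def average_rate_of_change(f, a, b):
    """Gradient of the chord from (a, f(a)) to (b, f(b))."""
    return (f(b) - f(a)) / (b - a)


def f(x):
    # Hypothetical example function, chosen only for illustration.
    return x ** 2 - 3 * x


# Keep a = 2 fixed and shrink the width of the interval [a, b].
widths = [1, 0.1, 0.01, 0.001]
rates = [average_rate_of_change(f, 2, 2 + w) for w in widths]

for w, r in zip(widths, rates):
    print(f"width {w}: average rate of change {r:.4f}")
```

For this function the chord gradient from x = 2 to x = 2 + w works out algebraically to 1 + w, so the printed values move towards 1 as the interval narrows.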
Exercise 1.3
1. Sketch the graph of $y = x^2$ for 0 ≤ x ≤ 4 and draw the chords from P(2, 4) to
the points Q(1, 1), R(3, 9) and S(4, 16).
2. What is the average rate of change of $x^2$ between
(a) x = 1 and x = 2
(b) x = 2 and x = 3
(c) x = 2 and x = 4
3. Use your answer to question 2 to deduce that the rate of change of $x^2$ with
respect to x at x = 2 is between 3 and 5.
Chapter 2
Differentiation

The gradient of a curve shows the rate at which a quantity changes on a graph.
If the quantity is described by a function¹ then the rate of change of the function
can be found directly by algebra without drawing a graph. This process is called
differentiation. We call the rate of change of a function the derivative of the function.
There are two ways of finding derivatives of functions:
• from first principles, following the footsteps of the early mathematicians. This
  is mainly of historical interest; however, it introduces the important idea of a
  limit and explains the notation used in differentiation.
• constructing derivatives from known results using the rules of differentiation.

2.1 From first principles ...

Example (from first principles)
To find the gradient of the parabola $y = x^2$ at the point P(1, 1):

1. Find the gradient of the line through P(1, 1) and a second point Q on
the parabola. (See the diagram below.)
We take the second point to be $Q(1 + h, (1 + h)^2)$ where h stands for a
number.²

¹ A function is a formula that has only one value for each input value, for example $y = x^2$.
The function keys on a calculator give one value for each input value.
² If x = 1 + h, then $y = x^2 = (1 + h)^2$.
[Figure: the parabola $y = x^2$ with the points P(1, 1) and $Q(1 + h, (1 + h)^2)$ and the line through them]

You can see that when h is a very small number, the line through the
points P and Q will be very close to the tangent line at (1, 1).

2. The gradient of the line from P(1, 1) to $Q(1 + h, (1 + h)^2)$ is

\[
\begin{aligned}
\frac{\Delta y}{\Delta x} &= \frac{(1 + h)^2 - 1}{1 + h - 1} \\
  &= \frac{(1 + 2h + h^2) - 1}{1 + h - 1} \\
  &= \frac{2h + h^2}{h} \\
  &= \frac{h(2 + h)}{h} \\
  &= 2 + h \qquad \ldots \text{as long as } h \neq 0.
\end{aligned}
\]

3. When h is very small, the gradient

\[
\frac{\Delta y}{\Delta x} = 2 + h
\]

will be very close to the gradient of the tangent line at (1, 1).
We can deduce that ...
as h becomes smaller and smaller, $\frac{\Delta y}{\Delta x}$ becomes closer to 2.

This shows that the gradient of the parabola $y = x^2$ at (1, 1) is exactly 2.
Alternatively, we can say the derivative of the function $x^2$ at x = 1 is 2.
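
The algebra above can be checked with a few numbers. The following sketch is illustrative (the function name is hypothetical): it computes the chord gradient from P(1, 1) to Q(1 + h, (1 + h)²) directly and compares it with the simplified form 2 + h.

```python
def chord_gradient(h):
    """Gradient of the line from P(1, 1) to Q(1 + h, (1 + h)^2) on y = x^2.

    h must not be 0, because the two points would then coincide.
    """
    return ((1 + h) ** 2 - 1) / (1 + h - 1)


# As h shrinks, the chord gradient should track 2 + h and approach 2.
for h in (1, 0.1, 0.01, 0.001):
    print(f"h = {h}: chord gradient = {chord_gradient(h):.4f}, 2 + h = {2 + h:.4f}")
```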
When the early mathematicians explored differentiation they discovered some simple
relationships between functions and their derivatives that we can use to calculate
derivatives quickly.
For example, the derivative of the function $x^2$ is always equal to $2x$ for every value
of x. This is shown in the next example, using the method of first principles.
[Figure: the parabola $y = x^2$ with the points $P(a, a^2)$ and $Q(a + h, (a + h)^2)$ and the line through them]

When h is a very small number, the line going through the points P and
Q will be very close to the tangent line at $(a, a^2)$.

2. The gradient of the line from $P(a, a^2)$ to $Q(a + h, (a + h)^2)$ is

\[
\begin{aligned}
\frac{\Delta y}{\Delta x} &= \frac{(a + h)^2 - a^2}{a + h - a} \\
  &= \frac{(a^2 + 2ah + h^2) - a^2}{a + h - a} \\
  &= \frac{2ah + h^2}{h} \\
  &= \frac{h(2a + h)}{h} \\
  &= 2a + h \qquad \ldots \text{as long as } h \neq 0.
\end{aligned}
\]

3. When h is very small, the gradient

\[
\frac{\Delta y}{\Delta x} = 2a + h
\]

will be very close to the gradient of the tangent line at $(a, a^2)$.
We can deduce that ...
as h becomes smaller and smaller, $\frac{\Delta y}{\Delta x}$ becomes closer to 2a.

This shows that the gradient of the parabola $y = x^2$ at $(a, a^2)$ is exactly 2a.
Alternatively, we can say the derivative of the function $x^2$ at x = a is 2a.

³ If x = a + h, then $y = x^2 = (a + h)^2$.
\[
\lim_{h \to 0} \frac{\Delta y}{\Delta x} = 2a
\]

... which is read as
the limit, as h approaches 0, of $\frac{\Delta y}{\Delta x}$ is 2a.

(d) The symbol $\lim_{h \to 0} \frac{\Delta y}{\Delta x}$ in (c) is the origin of the traditional symbol $\frac{dy}{dx}$ that is
used to represent a derivative. Instead of writing
the derivative of $y = x^2$ is $2x$
we just write

\[
\frac{dy}{dx} = 2x \quad \text{or} \quad \frac{d}{dx}(x^2) = 2x
\]

These are read aloud as
This notation is adjusted when different variables are used. For example, if
the relationship between population (P) and time (t) is given by $P = t^2$, then
the population growth rate is 2t and we can write either

\[
\frac{dP}{dt} = 2t \quad \text{or} \quad \frac{d}{dt}(t^2) = 2t
\]

(e) These traditional symbols are clumsy to type without special software and are
often replaced by dashes. For example, we can write
$y'$ or $y'(x)$ instead of $\frac{dy}{dx}$ and $P'$ or $P'(t)$ instead of $\frac{dP}{dt}$
the derivative of $y = x^2$ at x = 1 is 2
and
the population growth rate when t = 10 is 20
can be written as $P'(10) = 20$.
... and read aloud as either
Exercise 2.1
3. If $y = x^2 - 2x$, then it is known that $\frac{dy}{dx} = 2x - 2$. Use this to:
(a) find the gradient of the parabola at the y-intercept
(b) find the equation of the tangent line at the y-intercept

⁴ Formulae representing real life situations are frequently called models.
2.2 Constructing derivatives

2.2.1 Powers
The most common functions are powers, and their derivatives have this pattern ...⁵

If $y = x^a$ for any power a, then $\frac{dy}{dx} = ax^{a-1}$.

Example (derivatives of powers)
a. If $y = x^2$, then $\frac{dy}{dx} = 2x^{2-1} = 2x^1 = 2x$
b. If $y = x^{100}$, then $\frac{dy}{dx} = 100x^{100-1} = 100x^{99}$
c. If $y = \sqrt{x}$, then $y = x^{1/2}$ and $\frac{dy}{dx} = \frac{1}{2}x^{1/2-1} = \frac{1}{2}x^{-1/2}$ or $\frac{1}{2\sqrt{x}}$
d. If $y = \frac{1}{x}$, then $y = x^{-1}$ and $\frac{dy}{dx} = -1x^{-1-1} = -x^{-2}$ or $-\frac{1}{x^2}$

Example (velocity v, distance s, time t)
The displacement s (meters) travelled by a car in time t seconds is given by
the model

\[
s = t^2 .
\]

What is the velocity of the car after 5 seconds?
Answer:
The velocity is the rate of change of distance with respect to time:

\[
v = \frac{ds}{dt} = 2t^{2-1} = 2t
\]

When t = 5, v = 2t = 10 m/s.

⁵ Any variables can be used, not just x's and y's.
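
The power pattern can be compared with a direct numerical estimate of the gradient. This is a minimal sketch under stated assumptions: the function names are hypothetical, and the numerical estimate uses a short chord centred on x, which is only an approximation to the tangent gradient.

```python
import math


def power_rule(a, x):
    """Derivative of x**a at x, using the pattern a * x**(a - 1)."""
    return a * x ** (a - 1)


def numerical_gradient(f, x, h=1e-6):
    """Gradient of a short chord centred on x (an approximation only)."""
    return (f(x + h) - f(x - h)) / (2 * h)


# Velocity of the car with s = t^2 after 5 seconds.
print(power_rule(2, 5))

# Compare the pattern with a numerical estimate for y = sqrt(x) at x = 4.
print(power_rule(0.5, 4), numerical_gradient(math.sqrt, 4))
```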
There are two powers that occur frequently and whose derivatives are worth
memorising: 1 $(= x^0)$ and x $(= x^1)$.

If $y = 1$ $(= x^0)$, then $\frac{dy}{dx} = 0$. If $y = x$ $(= x^1)$, then $\frac{dy}{dx} = 1$.

Exercise 2.2.1
1. Differentiate the following functions
(a) $y = x^{20}$
(b) $y = \frac{1}{x^2}$
(c) $u = v^3$
(d) $v = \frac{1}{\sqrt{u}}$
2. What is $h'(2)$ if $h(t) = t^4$?
3. Use differentiation to find the gradient of the curve $y = x^3$ at (1, 1).

[Figure: graph of $y = x^3$ for −2 ≤ x ≤ 2]
2.2.2 Polynomials
Many mathematical functions are built from simpler functions such as powers. We
use this to construct their derivatives.
The general form of a polynomial of degree n in x is

\[
ax^n + bx^{n-1} + \ldots + dx + e ,
\]

Rule 1 (constants)
The derivative of a constant is zero.

\[
f(x) = c \implies f'(x) = 0
\]

Rule 2 (multiples)
The derivative of a constant multiple is the multiple of the derivative.

\[
y = cf(x) \implies y' = cf'(x)
\]

Rule 3 (sums)
The derivative of a sum of terms is the sum of their derivatives.

Example

\[
y' = 2x + 2 \times 1 + 0 = 2x + 2.
\]

3. If y = (x + 3)(x + 7), expand the brackets first, then apply rules 3 and 2

\[
\begin{aligned}
y &= (x + 3)(x + 7) \\
  &= x^2 + 10x + 21 \\
y' &= 2x + 10
\end{aligned}
\]
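
Rules 1 to 3, together with the power pattern, can be combined into a short routine. The sketch below is one possible illustration, not the text's own method: it assumes a polynomial is stored as a list of coefficients starting with the constant term, and the function name is hypothetical.

```python
def differentiate_polynomial(coefficients):
    """Differentiate a polynomial given as [constant, x, x^2, ...] coefficients.

    Each term c * x**n becomes n * c * x**(n - 1) (power pattern with Rule 2),
    the constant term drops out (Rule 1), and the terms are handled one at a
    time (Rule 3). An empty list stands for the zero polynomial.
    """
    return [n * c for n, c in enumerate(coefficients)][1:]


# y = x^2 + 10x + 21, the expanded form of (x + 3)(x + 7)
derivative = differentiate_polynomial([21, 10, 1])
print(derivative)  # coefficients of the derivative, constant term first
```

Here the result lists the coefficients of 10 + 2x, matching y′ = 2x + 10 in the example above.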
Example (vertical velocity & maximum height)
The height h (metres) of an experimental rocket after t seconds is given by

\[
h = 49t - 4.9t^2 .
\]

[Figure: height-time graph; height (m) against time (s); maximum height 122.5 m where $\frac{dh}{dt} = 0$]

The rocket reaches its maximum height when its vertical velocity is 0 m/s.
This occurs when

\[
\frac{dh}{dt} = 49 - 9.8t = 0 \implies t = \frac{49}{9.8} = 5 \text{ s}
\]

⁶ In mathematics, we use the smallest number of rules that are necessary.
This makes it easier to see how they can be altered or extended when exploring new situations.
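
The calculation can be mirrored in a few lines of code. This is a small sketch using the model h = 49t − 4.9t² from the example; the function and variable names are illustrative.

```python
def height(t):
    """Height (m) after t seconds: h = 49t - 4.9t^2."""
    return 49 * t - 4.9 * t ** 2


def vertical_velocity(t):
    """Derivative of the height: dh/dt = 49 - 9.8t."""
    return 49 - 9.8 * t


# The maximum height occurs where the vertical velocity is zero.
t_max = 49 / 9.8
h_max = height(t_max)
print(f"maximum height {h_max:.1f} m at t = {t_max:.1f} s")
```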
Example (square roots)
If $y = 2 - 3\sqrt{x}$, then

\[
\begin{aligned}
y &= 2 - 3\sqrt{x} \\
  &= 2 - 3x^{1/2} \\
y' &= 0 - 3 \times \tfrac{1}{2}x^{1/2 - 1} \\
  &= -\frac{3}{2}x^{-1/2} \\
  &= -\frac{3}{2\sqrt{x}}
\end{aligned}
\]

Note. The final line uses the same notation as in the original question, a square
root symbol rather than a half power.
Exercise 2.2.2
3. Repeat the example of the experimental rocket above, assuming that the height
is given by
$h(t) = 98t - 4.9t^2$.
2.2.3 Products and quotients

Rule 4 (products)
The derivative of a product is the derivative of the first function multiplied by
the second function, plus the first function multiplied by the derivative of the
second function.

Example (products: $f'g + fg'$)
(a) If $y = (x + 1)(x^2 + 2)$, then

\[
\begin{aligned}
y' &= 1 \times (x^2 + 2) + (x + 1) \times 2x \\
   &= 3x^2 + 2x + 2
\end{aligned}
\]
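
The product rule can be checked against the derivative obtained by expanding the brackets first. In the sketch below (function names are illustrative), expanding (x + 1)(x² + 2) gives x³ + x² + 2x + 2, whose derivative by the polynomial rules is 3x² + 2x + 2.

```python
def f(x):
    return x + 1


def f_dash(x):
    return 1


def g(x):
    return x ** 2 + 2


def g_dash(x):
    return 2 * x


def product_rule(x):
    """Rule 4: f'g + fg' evaluated at x."""
    return f_dash(x) * g(x) + f(x) * g_dash(x)


def expanded_derivative(x):
    """Derivative of the expanded form x^3 + x^2 + 2x + 2."""
    return 3 * x ** 2 + 2 * x + 2


# The two routes should agree at every x.
for x in (-2, -1, 0, 1, 2, 3):
    print(x, product_rule(x), expanded_derivative(x))
```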
Exercise 2.2.3
(a) $y = x^2(2x - 1)$
(b) $y = (x + 1)(x^3 + 3)$
(c) $y = (x^3 + 6x^2)(x^2 - 1) + 20$
(d) $u = (7x + 3)(2 - 3x) + (x + 3)^2$
(e) $u = 80(x^2 + 7x)(x^2 + 3x + 1)$
(f) $f(x) = 2 - (x^2 - 5x + 1)(2x + 3)$
(g) $g(t) = \left(t + \frac{1}{t}\right)\left(5t^2 - \frac{1}{t^2}\right)$
2
(h) h(x) = (x + 1)(3x − 1)(2x − 3)
Rule 5 (quotients)
The derivative of a quotient is the derivative of the numerator multiplied by
the denominator, less the numerator multiplied by the derivative of the
denominator, all divided by the square of the denominator.

Example (quotients: $\frac{f'g - fg'}{g^2}$)
(a) If $y = \frac{2x + 1}{x + 3}$, then

\[
\begin{aligned}
y' &= \frac{2 \times (x + 3) - (2x + 1) \times 1}{(x + 3)^2} \\
   &= \frac{5}{(x + 3)^2}
\end{aligned}
\]

(b) If $f(x) = \frac{1}{x + 3}$, then

\[
\begin{aligned}
f'(x) &= \frac{0 \times (x + 3) - 1 \times 1}{(x + 3)^2} \\
      &= -\frac{1}{(x + 3)^2}
\end{aligned}
\]
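
The quotient rule can be written as a small helper and compared with the simplified answers above. This is an illustrative sketch; the helper name and its argument order are assumptions made for this example.

```python
def quotient_rule(f, f_dash, g, g_dash, x):
    """Rule 5: (f'g - fg') / g^2 evaluated at x (requires g(x) != 0)."""
    return (f_dash(x) * g(x) - f(x) * g_dash(x)) / g(x) ** 2


def derivative_a(x):
    """Example (a): y = (2x + 1) / (x + 3)."""
    return quotient_rule(lambda t: 2 * t + 1, lambda t: 2,
                         lambda t: t + 3, lambda t: 1, x)


def derivative_b(x):
    """Example (b): f(x) = 1 / (x + 3)."""
    return quotient_rule(lambda t: 1, lambda t: 0,
                         lambda t: t + 3, lambda t: 1, x)


# Compare with the simplified forms 5/(x + 3)^2 and -1/(x + 3)^2.
for x in (0, 1, 2):
    print(x, derivative_a(x), 5 / (x + 3) ** 2, derivative_b(x), -1 / (x + 3) ** 2)
```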
Exercise 2.2.3
2.2.4 Composite functions and the chain rule

$y = u^{50}$, where $u = x^2 + 1$.

Example (composite functions)
1. If $y = (2x - 1)^3$, then
   $y = u^3$, where $u = 2x - 1$.
2. If $y = \sqrt{1 - x^2}$, then
   $y = \sqrt{u}$, where $u = 1 - x^2$.
3. If $y = \frac{1}{x^2 + 4}$, then
   $y = \frac{1}{u}$, where $u = x^2 + 4$.

Example ('function of a function')
If $f(x) = x^7$ and $g(x) = 1 - x^2$, then
(a) $f(g(x)) = (1 - x^2)^7$
(b) $g(f(x)) = 1 - (x^7)^2 = 1 - x^{14}$
Exercise 2.2.4
(a) f (g(x))
(b) g(f (x))
(a) h ◦ j(x)
(b) j ◦ h(x)
(a) l(m(x))
(b) m(l(x))
5. Identify outside and inside functions for the composite functions below.⁷
(a) $(x + 1)^5$
(b) $\sqrt{x - 4}$
(c) $(x^2 - 3x + 4)^2$
(d) $(3x + \sqrt{x})^3$

⁷ When there is more than one answer, choose the inside and outside functions that are simplest.
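Example (chain rule)
Differentiate $y = (1 - x^2)^7$.

Method 1
Put $y = u^7$, where $u = 1 - x^2$, then

\[
\frac{dy}{du} = 7u^6 \quad \text{and} \quad \frac{du}{dx} = -2x
\]

so

\[
\begin{aligned}
\frac{dy}{dx} &= \frac{dy}{du} \times \frac{du}{dx} \\
  &= 7u^6 \times (-2x) \\
  &= -14x(1 - x^2)^6
\end{aligned}
\]

Method 2
Put $y = f(g(x))$, where $f(x) = x^7$ and $g(x) = 1 - x^2$, then

\[
\begin{aligned}
\frac{dy}{dx} &= f'(g(x)) \times g'(x) \\
  &= 7(1 - x^2)^6 \times (-2x) \\
  &= -14x(1 - x^2)^6
\end{aligned}
\]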
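
Both methods can be followed step by step in code. The sketch below is illustrative only (the function names are hypothetical): it builds the derivative from the inside and outside functions as in Method 1, then compares it with the simplified answer and with a numerical estimate from a short chord.

```python
def y(x):
    return (1 - x ** 2) ** 7


def chain_rule_derivative(x):
    u = 1 - x ** 2        # inside function u = 1 - x^2
    dy_du = 7 * u ** 6    # derivative of the outside function u^7
    du_dx = -2 * x        # derivative of the inside function
    return dy_du * du_dx


def simplified(x):
    return -14 * x * (1 - x ** 2) ** 6


def numerical_gradient(f, x, h=1e-6):
    """Gradient of a short chord centred on x (an approximation only)."""
    return (f(x + h) - f(x - h)) / (2 * h)


for x in (-1, 0, 0.5, 2):
    print(x, chain_rule_derivative(x), simplified(x))
```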
Exercise 2.2.4
(e) $(x^2 + 1)^3$   (f) $(x^2 - 3x)^2$   (g) $(2x^2 - 3x + 1)^3$   (h) $(3x - x^3)^5$
8. Differentiate:
(a) $\sqrt{x + 1}$   (b) $\sqrt{2x + 3}$   (c) $\sqrt{x^2 + 3x - 1}$   (d) $\sqrt{2x - x^3}$
(e) $\frac{1}{\sqrt{x - 1}}$   (f) $\frac{1}{\sqrt{x^2 + 3}}$   (g) $\frac{3}{\sqrt{3x^2 - 1}}$   (h) $\frac{5}{\sqrt{10 - x^2}}$

⁸ This is called the chain rule because of the chain of derivatives. In the case of a 'function
f(x) of a function g(x) of a function h(x)', the derivative is $f'(g(h(x)))\,g'(h(x))\,h'(x)$.
(a) Find $\frac{dy}{dx}$
(b) Solve $\frac{dy}{dx} = 0$
(c) Find the maximum value of $\frac{x}{\sqrt{x^4 + 1}}$ for $x \geq 0$.

2.2.5 Implicit Differentiation
When functions are explicitly defined in the form y = f(x) they can be differentiated
using the previous rules of differentiation. However, functions can also be implicitly
defined.

Example (implicit relationship)
Consider the equation of the circle

\[
\begin{aligned}
x^2 + y^2 &= 4 \\
y^2 &= 4 - x^2 \\
y &= \pm\sqrt{4 - x^2}
\end{aligned}
\]

The gradient of the tangent line to $x^2 + y^2 = 4$ at $(\sqrt{2}, \sqrt{2})$ can now be found
by differentiating $y = \sqrt{4 - x^2}$, giving ...

\[
\frac{dy}{dx} = \frac{-x}{\sqrt{4 - x^2}} = \frac{-\sqrt{2}}{\sqrt{4 - (\sqrt{2})^2}} = -1
\]

Example (implicit differentiation)
To find $\frac{dy}{dx}$ directly from the implicit relationship $x^2 + y^2 = 4$ ...
1. Assume that y is a function of x, writing it as y(x).
Example (tangent to an ellipse)
Use implicit differentiation to find the gradient of the tangent line at the
general point P(x, y) on the ellipse $x^2 + xy + y^2 = 1$. Then find the equation
of the tangent line at (1, −1).
Answer
1. Assume that y is a function of x, writing it as y(x).
Substituting (1, −1) into this equation shows that c = −2, and that the equation
of the tangent line at (1, −1) is y = x − 2.
Example (normal to an ellipse)
Use implicit differentiation to find the gradient of the tangent line at the general point $P(x, y)$ on the ellipse $\dfrac{x^2}{4} + \dfrac{y^2}{16} = 1$. Then find the equation of the normal line at $(\sqrt{2}, 2\sqrt{2})$.
Answer
1. Rewrite the equation of the ellipse as $4x^2 + y^2 = 16$.
2. Assume that $y$ is a function of $x$, writing it as $y(x)$.
3. Differentiate both sides of $4x^2 + y^2 = 16$, i.e.
$$\begin{aligned}
4x^2 + y^2 &= 16 \\
\frac{d}{dx}(4x^2 + y^2) &= \frac{d}{dx}(16) \\
8x + \frac{d}{dx}(y^2) &= 0 && \text{. . . differentiating each term} \\
8x + 2y\frac{dy}{dx} &= 0 && \text{. . . by the chain rule} \\
\frac{dy}{dx} &= -\frac{4x}{y} && \text{. . . provided } y \ne 0
\end{aligned}$$
The gradient of the tangent line at $P(x, y)$ is $-\dfrac{4x}{y}$.
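The gradient of the normal at $P(x, y)$ is
$$-\frac{1}{-\dfrac{4x}{y}} = -1 \times \left(-\frac{y}{4x}\right) = \frac{y}{4x}$$
At $(\sqrt{2}, 2\sqrt{2})$ this gradient is $\dfrac{2\sqrt{2}}{4\sqrt{2}} = \dfrac{1}{2}$, so the equation of the normal at $(\sqrt{2}, 2\sqrt{2})$ is
$$y = \frac{1}{2}x + c, \text{ for some number } c.$$
Substituting $(\sqrt{2}, 2\sqrt{2})$ into this equation shows that
$$c = \frac{3\sqrt{2}}{2},$$
and that the equation of the normal line at $(\sqrt{2}, 2\sqrt{2})$ is $y = \dfrac{1}{2}x + \dfrac{3\sqrt{2}}{2}$.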
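The arithmetic in this example can be verified with a few lines of code. The sketch below assumes the gradient formulas derived above ($-4x/y$ for the tangent, $y/(4x)$ for the normal). The variable names are chosen here for illustration only.

```python
import math


def tangent_gradient(x, y):
    """Gradient of the tangent to 4x^2 + y^2 = 16 at (x, y), for y != 0."""
    return -4 * x / y


def normal_gradient(x, y):
    """Gradient of the normal at (x, y): the negative reciprocal, y/(4x)."""
    return y / (4 * x)


x0, y0 = math.sqrt(2), 2 * math.sqrt(2)

on_ellipse = 4 * x0**2 + y0**2        # should be close to 16
m_tangent = tangent_gradient(x0, y0)
m_normal = normal_gradient(x0, y0)
c = y0 - m_normal * x0                # intercept of the normal y = m x + c

print("4x^2 + y^2       :", on_ellipse)
print("tangent gradient :", m_tangent)
print("normal gradient  :", m_normal)
print("intercept c      :", c, " vs 3*sqrt(2)/2 =", 3 * math.sqrt(2) / 2)
```

The product of the two gradients should be $-1$ (up to rounding), as expected for perpendicular lines.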
Exercise 2.2.5
1. Find $\dfrac{dy}{dx}$ if:
(a) $x^2 + y^2 = 16$
(b) $x^2 + 3y^2 = 9$
(c) $x^2 - y^2 = 25$
(d) $x^2 + xy + y^2 = 10$
(e) $x^3 + 2x^2 y + y^2 = 10$
(a) $x^2 + y^2 = 8$ at $(2, 2)$
(b) $x^2 + \dfrac{y^2}{2} = 3$ at $(1, 2)$
4. Show that the normal to the circle $x^2 + y^2 = 1$ at the point $(a, b)$ with $ab \ne 0$ always passes through the origin.
Appendix A
First Principles
[Figure: the chord from $(a, f(a))$ to $(b, f(b))$ on the graph of $f(x)$, with horizontal change $b - a$ and vertical change $f(b) - f(a)$.]

The gradient of the chord from $(a, f(a))$ to $(b, f(b))$ is
$$\frac{\Delta y}{\Delta x} = \frac{\text{change in } y}{\text{change in } x} = \frac{f(b) - f(a)}{b - a}.$$
This is the average rate of change of $f(x)$ over the interval from $a$ to $b$, and it approximates the instantaneous rate of change of $f(x)$ at $x = a$.

As the width of the interval $[a, b]$ decreases, the approximation
$$\frac{\Delta y}{\Delta x} = \frac{f(b) - f(a)}{b - a}$$
becomes closer to the gradient of the tangent line to the graph of $f(x)$ at $x = a$, and so to the derivative of $f(x)$ at $x = a$.

If we put $b = a + h$, then the derivative of $f(x)$ at $x = a$ is given by the limit
$$\frac{dy}{dx} = \lim_{\Delta x \to 0} \frac{\Delta y}{\Delta x} = \lim_{h \to 0} \frac{f(a + h) - f(a)}{(a + h) - a}$$
Definition
The derivative of $y = f(x)$ at the point $(a, f(a))$ is given by the limit
$$\frac{dy}{dx} = \lim_{h \to 0} \frac{f(a + h) - f(a)}{h}$$
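The limit in this definition can be explored numerically by evaluating the difference quotient for smaller and smaller values of $h$. The sketch below does this for the illustrative choice $f(x) = x^2$ at $a = 3$. The function names, the point $a = 3$ and the list of step sizes are assumptions made here, not part of the text.

```python
def difference_quotient(f, a, h):
    """Gradient of the chord from (a, f(a)) to (a + h, f(a + h))."""
    return (f(a + h) - f(a)) / h


def square(x):
    """Example function f(x) = x^2."""
    return x**2


a = 3
for h in (1.0, 0.1, 0.01, 0.001, 0.0001):
    print(f"h = {h:<7}  chord gradient = {difference_quotient(square, a, h):.6f}")
```

The printed gradients settle towards 6 as $h$ shrinks. A numerical table like this only suggests the value of the limit; the algebraic argument from first principles is what establishes it.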
Example (first principles at $x = a$)
Find the derivative of $y = x^2$ at $x = a$ using first principles.
1. From the definition . . .
$$\begin{aligned}
\frac{dy}{dx} &= \lim_{h \to 0} \frac{(a + h)^2 - a^2}{h} \\
&= \lim_{h \to 0} \frac{(a^2 + 2ah + h^2) - a^2}{h} \\
&= \lim_{h \to 0} \frac{2ah + h^2}{h} \\
&= \lim_{h \to 0} (2a + h) \\
&= 2a
\end{aligned}$$
The derivative at $x = a$ is $\dfrac{dy}{dx} = 2a$.
Example (first principles without using $x = a$)
Differentiate $y = x^2 + 4x + 2$ using first principles.
1. From the definition . . .
$$\frac{dy}{dx} = \lim_{h \to 0} \frac{[(x + h)^2 + 4(x + h) + 2] - [x^2 + 4x + 2]}{h}$$
The derivative is $\dfrac{dy}{dx} = 2x + 4$.
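The algebra between the definition and the final result is not shown in this example. One way it could be filled in, following the same pattern as the $y = x^2$ example, is sketched here:

```latex
\begin{aligned}
\frac{dy}{dx}
  &= \lim_{h \to 0} \frac{[(x + h)^2 + 4(x + h) + 2] - [x^2 + 4x + 2]}{h} \\
  &= \lim_{h \to 0} \frac{(x^2 + 2xh + h^2 + 4x + 4h + 2) - (x^2 + 4x + 2)}{h} \\
  &= \lim_{h \to 0} \frac{2xh + h^2 + 4h}{h} \\
  &= \lim_{h \to 0} (2x + h + 4) \\
  &= 2x + 4
\end{aligned}
```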
Example (cubic function)
Differentiate $y = x^3$ using first principles.
1. From the definition . . .
$$\frac{dy}{dx} = \lim_{h \to 0} \frac{(x + h)^3 - x^3}{h}$$
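The remaining steps of this example are not shown here. A sketch of how the limit can be evaluated, using the expansion $(x + h)^3 = x^3 + 3x^2 h + 3xh^2 + h^3$, is:

```latex
\begin{aligned}
\frac{dy}{dx}
  &= \lim_{h \to 0} \frac{(x^3 + 3x^2 h + 3xh^2 + h^3) - x^3}{h} \\
  &= \lim_{h \to 0} \frac{3x^2 h + 3xh^2 + h^3}{h} \\
  &= \lim_{h \to 0} (3x^2 + 3xh + h^2) \\
  &= 3x^2
\end{aligned}
```

This agrees with the answer to Exercise 2.2.1, question 3, which gives $\dfrac{dy}{dx} = 3x^2$.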
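Example (rational function)
Differentiate $f(x) = \dfrac{x + 1}{x + 2}$ using first principles.
1. From the definition . . .
$$\begin{aligned}
f'(x) &= \lim_{h \to 0} \frac{\dfrac{(x + h) + 1}{(x + h) + 2} - \dfrac{x + 1}{x + 2}}{h} \\
&= \lim_{h \to 0} \frac{(x + h + 1)(x + 2) - (x + 1)(x + h + 2)}{(x + h + 2)(x + 2)h} \\
&= \lim_{h \to 0} \frac{h}{(x + h + 2)(x + 2)h} \\
&= \lim_{h \to 0} \frac{1}{(x + h + 2)(x + 2)} \\
&= \frac{1}{(x + 2)(x + 2)} \\
&= \frac{1}{(x + 2)^2}
\end{aligned}$$
The derivative is $f'(x) = \dfrac{1}{(x + 2)^2}$.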
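This result can be compared with the difference quotient it came from. In the sketch below the sample point $x = 1$ and the step sizes are arbitrary choices, and the function names are introduced for illustration only.

```python
def f(x):
    """The rational function f(x) = (x + 1)/(x + 2), defined for x != -2."""
    return (x + 1) / (x + 2)


def derivative(x):
    """The first-principles result f'(x) = 1/(x + 2)^2."""
    return 1 / (x + 2) ** 2


def difference_quotient(func, x, h):
    """(func(x + h) - func(x)) / h, the quantity whose limit is taken."""
    return (func(x + h) - func(x)) / h


x = 1.0
print("formula:", derivative(x))
for h in (0.1, 0.01, 0.001):
    print(f"h = {h:<6} quotient = {difference_quotient(f, x, h):.6f}")
```

For this function the quotient simplifies to $\dfrac{1}{(x + h + 2)(x + 2)}$, so the printed values approach $\dfrac{1}{(x + 2)^2}$ from below as $h$ decreases through positive values.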
Exercise A
Appendix B
Answers
Exercise 1.1
1(a)
Exercise 1.2
1(a) (i) $m_{PQ} = \dfrac{1 - 0.25}{1 - 0.5} = 1.5$  (ii) $m_{PR} = \dfrac{2.25 - 1}{1.5 - 1} = 2.5$
1(b) This follows from $m_{PQ} < m_{\text{tangent}} < m_{PR}$.
1(c) Using the points $L(0.9, 0.81)$ and $M(1.1, 1.21)$, $m_{\text{tangent}} \approx 2$ is an estimate to within $\pm 0.1$ as
$$m_{PL} = 1.9 < m_{\text{tangent}} < m_{PM} = 2.1$$
2. (i) matches (a), (ii) matches (c), (iii) matches (d), (iv) matches (b)
Exercise 1.3
1. [Figure: graph of $y = x^2$ for $0 \le x \le 4$, marking the points $Q(1, 1)$, $P(2, 4)$, $R(3, 9)$ and $S(4, 16)$.]
Exercise 2.1
1(a) & 1(b)
[Figure: graph of $y = x^2$ for $0 \le x \le 4$, marking the points $R(3, 9)$ and $S(3 + h, (3 + h)^2)$.]
[Figure: graph marking the points $U(2, 0)$ and $V(2 + h, (2 + h)^2 - 2(2 + h))$.]
4(a) [Figure: graph of $P(t) = 600t - t^2$ for $0 \le t \le 600$.]
Exercise 2.2.1
1(a) $\dfrac{dy}{dx} = 20x^{19}$  1(b) $\dfrac{dy}{dx} = -\dfrac{2}{x^3}$  1(c) $\dfrac{dv}{du} = 3u^2$  1(d) $\dfrac{dv}{du} = -\dfrac{1}{2u\sqrt{u}}$
2. As $h'(t) = 4t^3$, $h'(2) = 32$.
3. The derivative is $\dfrac{dy}{dx} = 3x^2$, so the gradient of the curve is 3 at $(1, 1)$.
Exercise 2.2.2
1(a) $\dfrac{dy}{dx} = 9x^2 - 2$  1(b) $\dfrac{dy}{dx} = 40x - 60$  1(c) $\dfrac{dy}{dx} = 4x + 4$  1(d) $\dfrac{dy}{dx} = -\dfrac{1}{x^2}$
2. As $\dfrac{dy}{dx} = 2x - 4$, the gradients are $m = -2$ and $m = 2$.
3(a) The vertical velocity is $\dfrac{dh}{dt} = 98 - 9.8t$ m/s.
3(b) At $t = 0$, the initial velocity is 98 m/s, and at $t = 2$, the velocity is 78.4 m/s.
3(c) The maximum height is reached when $t = 10$ s. This height is $h(10) = 490$ m.
Exercise 2.2.3
1(a) $\dfrac{dy}{dx} = 6x^2 - 2x$  1(b) $\dfrac{dy}{dx} = 4x^3 + 3x^2 + 3$
1(c) $\dfrac{dy}{dx} = 5x^4 + 24x^3 - 3x^2 - 12x$  1(d) $\dfrac{du}{dx} = -40x + 11$
1(e) $\dfrac{du}{dx} = 320x^3 + 2400x^2 + 3520x + 560$  1(f) $f'(x) = -6x^2 + 14x + 13$
1(g) $g'(t) = 15t^2 + 5 + \dfrac{1}{t^2} + \dfrac{3}{t^4}$  1(h) $h'(x) = 24x^3 - 33x^2 + 18x - 11$
2(a) $f'(x) = \dfrac{-3}{(x - 1)^2}$  2(b) $g'(x) = \dfrac{2x(x + 1)}{(2x + 1)^2}$
2(c) $\dfrac{dy}{dt} = -\dfrac{t^3 + 3}{(t^2 - 3)^2}$  2(d) $\dfrac{dy}{du} = \dfrac{4u}{(u^2 + 1)^2}$
2(e) $\dfrac{dx}{du} = \dfrac{2(u^2 - 1)}{(u^2 + u + 1)^2}$  2(f) $\dfrac{dt}{dx} = \dfrac{1 + 2x}{2\sqrt{x}(1 - 2x)^2}$
2(g) $\dfrac{dy}{dx} = -\dfrac{2x}{(x^2 + 1)^2}$  2(h) $\dfrac{dy}{dx} = -\dfrac{2}{(x + 1)^3}$
3. Density is given by the function $D(t) = \dfrac{\sqrt{t}}{\sqrt{t} + 1}$, so the rate of change of density is $\dfrac{dD}{dt} = \dfrac{1}{2\sqrt{t}(\sqrt{t} + 1)^2}$.
Exercise 2.2.4
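8(a) $\dfrac{1}{2\sqrt{x + 1}}$  8(b) $\dfrac{1}{\sqrt{2x + 3}}$
8(c) $\dfrac{2x + 3}{2\sqrt{x^2 + 3x - 1}}$  8(d) $\dfrac{2 - 3x^2}{2\sqrt{2x - x^3}}$
8(e) $-\dfrac{1}{2(x - 1)\sqrt{x - 1}}$  8(f) $-\dfrac{x}{(x^2 + 3)\sqrt{x^2 + 3}}$
8(g) $-\dfrac{9x}{(3x^2 - 1)\sqrt{3x^2 - 1}}$  8(h) $\dfrac{5x}{(10 - x^2)\sqrt{10 - x^2}}$
9(a) $\dfrac{3x + 2}{2\sqrt{x + 1}}$  9(b) $\dfrac{5x^2 + 6x}{\sqrt{2x + 3}}$
9(c) $\dfrac{3x + 8}{2(3x + 4)\sqrt{3x + 4}}$  9(d) $\dfrac{x^3 + 2x}{(x^2 + 1)\sqrt{x^2 + 1}}$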
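10(a) $\dfrac{dy}{dx} = -\dfrac{x^4 - 1}{(x^4 + 1)\sqrt{x^4 + 1}}$  10(b) $x^4 = 1$, so $x = \pm 1$
10(c) For $x \ge 0$ the maximum occurs at $x = 1$: max $= \dfrac{1}{\sqrt{1 + 1}} = \dfrac{1}{\sqrt{2}} \approx 0.707$.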
Exercise 2.2.5
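1(a) $\dfrac{dy}{dx} = -\dfrac{x}{y}$  1(b) $\dfrac{dy}{dx} = -\dfrac{x}{3y}$
1(c) $\dfrac{dy}{dx} = \dfrac{x}{y}$  1(d) $\dfrac{dy}{dx} = -\dfrac{2x + y}{x + 2y}$
1(e) $\dfrac{dy}{dx} = -\dfrac{3x^2 + 4xy}{2x^2 + 2y}$
2(a) $m = -1$  2(b) $m = -1$  2(c) $m = -1$
3(a) $y = x$  3(b) $y = x + 1$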
4. The gradient of the tangent to the circle at the point $(a, b)$ is
$$m_{\text{tangent}} = -\frac{a}{b},$$
so the gradient of the normal is
$$m_{\text{normal}} = \frac{b}{a},$$
and the equation of the normal is $y = \dfrac{b}{a}x$. This shows the normal passes through $(0, 0)$.
Exercise A