A X Athens
March 2006
Contents
1 Introduction 1
3 Chasles’ Identity 6
4 Affine Combinations 7
5 Affine Subspaces 9
6 Affine independence 14
7 Affine functions 17
7.1 Affine groups . . . . . . . . . . . . . . . . . . . . . . . . . 20
A Appendix 21
1 Introduction
Conventionally, the vector spaces R2 and R3 are taken as models of 2-
space (the plane) and 3-space. More generally, an n-space is represented
as an n-dimensional vector space. While this is essential for doing math-
ematics, it imposes certain choices which are certainly not intrinsic to
the physical understanding of 2-space or 3-space, namely a choice of ori-
gin, axes and scale. As we now see, even the more abstract notion of an
n-dimensional vector space has its shortcomings.
1
Figure 1: General change of coordinates.
direction. Here lies the first point of confusion because there is no explicit
information about the physical location of the origin or the physical
directions and scale represented by the basis vectors. However, this lack
of specific physical meaning is unavoidable for an abstract framework.
The frame {0, e1 , e2 } is thus devoid of physical meaning in itself, but
can be taken as a fundamental frame in terms of which other frames can
be defined. A second frame is thus a triple {a, b1 , b2 } where a = a1 e1 +
a2 e2 is a point in the fundamental frame and the linearly independent
vectors b1 and b2 give the directions and scales of the u-axis and v-axis
respectively in terms of the fundamental frame. A given point P can now
be described by its x and y coordinates or by its u and v coordinates.
To obtain the coordinate transform, we express the fundamental basis
{e1 , e2 } in terms of the new basis {b1 , b2 }. This defines a 2 × 2 matrix
2
C such that
u = c1,1 (x − a1 ) + c1,2 (y − a2 )
v = c2,1 (x − a1 ) + c2,2 (y − a2 )
(P + v) − w = P + (v − w) = P − (w−v),
(P − v) + w = P + (w − v) = P − (v − w),
3
Figure 2: Geometric interpretation of an affine space.
and
(P − v) − w = P − (v + w)
so that expressions like P − v + w can be interpreted unambiguously.
However, it should be noted that the addition or subtraction of two
points and the subtraction of a point from a vector have not been defined.
Axiom (c) can be restated as the existence of a unique mapping
P × P → V, (P, Q) 7→ P Q
such that
P + P Q = Q for all P, Q ∈ P.
Given that P, Q ∈ P and v ∈ V, the identities
(P + v)Q = P Q − v and P (Q + v) = P Q + v
4
This construction is known as the canonical affine structure on V.
Henceforth, whenever we refer to a vector space as an affine space, the
canonical affine structure will be implied.
vn
We take the view that this is not strictly necessary because we are merely
interpreting vectors in a new way. In other words, an element of Rn can
stand for a vector or a point, depending on context. It is nevertheless
convenient for presenting examples and performing computations.
L = {(x, y) ∈ R2 : x + y = 1}
which does not pass through the origin and is therefore not a linear
subspace of R2 . An affine space is obtained by defining addition L+R →
L as
(x, 1 − x) + v = (x + v, 1 − x − v).
We shall see later that L forms a 1-dimensional affine subspace of R2 .
H = {(x, y, z) ∈ R3 : x + y + z = 1}.
5
Figure 3: Chasles’ identity and the parallelogram law.
P = {(x, y, x2 + y 2 ) : x, y ∈ R}.
3 Chasles’ Identity
If P, Q and R are points in an affine space then
R = Q + QR = P + P Q + QR
P Q + QR = P R.
QP = −P Q. (2)
6
Given four points P, Q, R and S, applying Chasles’ identity yields
P Q + QR = P R = P S + SR
P S = QR ⇔ SR = P Q.
4 Affine Combinations
A linear combination of vectors is one of the fundamental concepts of
linear algebra. The analogous concept in an affine space is that of an
affine combination of points.
(a) If λ1 + · · · + λN = 1 then
N
X N
X
A+ λn APn = B + λn BPn
n=1 n=1
Thus, if λ1 + · · · + λN = 1 then
N
X N
X
P = λn Pn , A + λn APn
n=1 n=1
7
The scalars λ1P, . . . , λN , whose sum must equal unity, are called weights.
N
We say P = n=1 λn Pn is the barycentre of the weighted points
λn Pn , (Pn , λn ). By construction, this barycentre is the unique point
X such that
XN
AX = λn APn for all A ∈ P.
n=1
In particular,
1P = P = P + 0Q for all P, Q ∈ P.
as required.
8
Example 7 Consider the canonical affine structure on a vector space V.
If v1 , . . . , vN ∈ V and λ1 + · · · + λN = 1 then
N
X
a+ λn (vn − a) = λ1 v1 + · · · + λn vn
n=1
for all a ∈ V. This shows that affine combination are the same as linear
combinations in this setting.
5 Affine Subspaces
A linear subspace of a vector space V is defined as a non-empty subset of
V which is closed under linear combinations. The corresponding notion
in an affine space is that of an affine subspace, which is a subset that is
closed under affine combinations.
Let A = {P, V} be an affine space. Then a subset S of P is an affine
subspace of A if λ1 P1 + · · · + λN PN ∈ S whenever P1 , . . . , PN ∈ S and
λ1 + · · · + λN = 1. It follows immediately from this definition that the
empty set is a trivial affine subspace and that any intersection of affine
subspaces is an affine subspace.
By definition, an affine subspace S of A must contain λP + (1 − λ)Q
whenever λ ∈ R and P, Q ∈ S. In fact, this condition is sufficient to
ensure that S is an affine subspace.
P + λP Q ∈ S for all λ ∈ R, P, Q ∈ S
9
and λ0 + · · · + λn = 1 with λ0 6= 1. We must show that
P = λ0 S0 + · · · + λn Sn ∈ S.
λ1 S1 + · · · + λN SN with S1 , . . . , SN ∈ S (5)
is an affine subspace of A.
By definition of an affine subspace, it is clear that hSi is the smallest
one containing S, i.e. the intersection of all affine subspaces T of A such
that S ⊂ T . We call hSi the affine subspace generated by S. We also
10
Figure 4: Affine subspaces of R2 .
VA , {AP : P ∈ S} = {P Q : P, Q ∈ S} (6)
S + U , {P + u : P ∈ S and u ∈ U}.
11
If S is an affine subspace of A then (a) shows that {S, VA } is an affine
space in itself with respect to the operation S + VA → S induced by
P + V → P. We call the vector space VA the direction of the affine
subspace S and take dim VA to be the dimension of S. It follows from (a)
that the affine subspace generated by a finite set of points P1 , . . . , Pn
must have dimension ≤ n − 1.
P Q = P A + AQ = AQ − AP ∈ VA
A + VA ⊂ S ⊂ S + VA .
Moreover, if P, Q ∈ S then
P + AQ = A + AP + AQ ∈ A + VA .
This completes the proof of (a). The proof of (b) is left as an easy
exercise for the reader.
Statement (b) of Lemma 12 shows that the line and plane of Exam-
ples 3 and 4 are affine subspaces of R2 and R3 respectively.
P = {(x, y, x2 + y 2 ) : x, y ∈ R}
12
Since P contains the point (0, 0, 0) and is not a linear subspace of R3 , it
cannot be an affine subspace of R3 by part (a) of Lemma 12. Fixing y,
we obtain the one-dimensional affine subspace
2 2 2 1
S = {(x, y0 , x + y0 ) : x ∈ R} = (0, y0 , y0 ) + R
0
which is a parabola.
An affine subspace of dimension one is called a line and an affine
subspace of dimension two is called a plane. Given an affine space
A = {P, V} and a subset S of P, the points contained in S are said to
be
(a) collinear if S is contained in a 1-dimensional affine subspace of A
(b) coplanar if S is contained in a 2-dimensional affine subspace of A.
13
6 Affine independence
Lemma 15 Let (P, V) be an affine space and Pj ∈ P for each j in
an index set J. If the family of vectors (Pi Pj : i 6= j ∈ J) is linearly
independent for some i ∈ J then the same is true for all i ∈ J.
Since Pk Pj = Pi Pj − Pi Pk , we have
X X X
λ j Pk Pj = λj Pi Pj − λ j Pi Pk
j6=k j6=k j6=k
X X
= λ j Pi Pj − λj Pi Pk
j6=i,k j6=k
P
so that λj = 0 for i, k 6= j ∈ J and j6=k λj = λi = 0 as required.
P = λ0 P0 + · · · + λn Pn with λ0 + · · · + λn = 1. (7)
λ0 P0 + · · · + λn Pn = P0 + λ1 P0 P1 + · · · + λn P0 Pn
14
Let A = (P, V) be an affine space and J be an index set with 0 ∈ J.
A family of points F = (Aj : j ∈ J) is called an affine frame for A if
the family of vectors (A0 Aj : 0 6= j ∈ J) forms a basis of V. If F is an
affine frame for A then the family (A0 , A0 Aj : 0 6= j ∈ J) is called an
affine frame with origin A0 . Thus, if dim V = n then an affine frame
of A must consist of n + 1 points A0 , . . . , An and any point P ∈ P can
be written uniquely in the form
P = A0 + x1 b1 + · · · + xn bn
= x 0 A0 + x 1 A1 + · · · + x n An
λ0 P P0 + · · · + λn P Pn = 0 for all P ∈ P.
λ1 P0 P1 + · · · + λn P0 Pn = 0. (8)
15
of vectors is defined to be linearly independent if every finite subfamily
of G is linearly independent and is therefore linearly dependent if some
finite subfamily is linearly dependent.
Two points A, B ∈ P are affine independent if and only if AB 6= 0,
i.e. A 6= B. If A 6= B then
λ0 + · · · + λn = 1 and λ0 , . . . , λn ≥ 0
16
(b) Any intersection of convex sets is convex.
Statements (b) and (c) show that CH(S) is the smallest convex set
containing S.
P = λ0 C0 + · · · + λn Cn
with C0 , . . . , Cn ∈ C,
λ0 + · · · + λn = 1 and λ0 , . . . , λn ≥ 0.
λ1 λn
C= C1 + · · · + Cn ∈ C
1 − λ0 1 − λ0
and
P = λ0 C0 + (1 − λ0 )C ∈ C
as required. The proof of (b) and (c) is left as an easy exercise for the
reader.
7 Affine functions
Let (P, V) and (Q, W) be affine spaces. A map f : P → Q is called an
affine function if
17
(a) If P1 , . . . , Pn ∈ P and λ1 + · · · + λn = 1 then
Proof. The definition of an affine function ensures that (10) holds for
n = 2. Assume now that the relation holds for some positive integer
n ≥ 2. Let P0 , . . . , Pn ∈ P and λ0 + · · · + λn = 1 with λ0 6= 1. Putting
λ1 λn
P = P1 + · · · + Pn
1 − λ0 1 − λ0
we have
λ0 P0 + · · · + λn Pn = λ0 P0 + (1 − λ0 )P
so that
f (λ0 P0 + · · · + λn Pn ) = λ0 f (P0 ) + (1 − λ0 )f (P )
= λ0 f (P0 ) + λ1 f (P1 ) + · · · + λn f (Pn )
thus establishing (a). The proof of (b) and (c) is left as an easy exercise
for the reader.
f (P ) = B + L(AP ).
18
(b) If f : P → Q is an affine map and A ∈ P then a linear map
Lf : V → W is given by
Proof. The proof of (a) is left as an easy exercise for the reader. To
establish (b), fix A ∈ P. If λ ∈ R and v ∈ V then
A + λv = (1 − λ)A + λ(A + v)
so that
If u, v ∈ V then
1 1
A+u+v = (A + 2u) + (A + 2v)
2 2
whence
1 1
Lf (u + v) = f (A) f (A + 2u) + f (A + 2v)
2 2
1 1
= f (A)f (A + 2u) + f (A)f (A + 2v)
2 2
1 1
= Lf (2u) + Lf (2v) = Lf (u) + Lf (v).
2 2
To complete the proof of (b), we observe that if u, v ∈ V then
f (B)f (B + v) = f (A + b)f (A + b + v)
= Lf (b + v) − Lf (b) = Lf (v)
19
as required.
An affine function f : P → P is called a translation if Lf is the
identity map. It follows from (b) of Theorem 20 that f is a translation
if and only if, for any fixed point A ∈ P, we have
The easy proof of the following results is left as an exercise for the reader.
Given an affine space A = (P, V), the set AG(A) of all affine transforms
of P forms a subgroup of the permutation group SP and is known as
the affine group of A. The set GL(V) of all linear automorphisms of V
forms a subgroup of SV . Lemma 22 shows that the mapping
ker L = {f ∈ AG(A) : Lf = IV }
20
We denote the multiplicative group of non-zero real numbers by R∗ .
The linear mappings R∗ IV , called dilations, form a subgroup of GL(V)
which is isomorphic to R∗ under the homomorphism λ 7→ λIV . The
members of the affine subgroup DIL(A) , L−1 (R∗ IV ) are called affine
dilations. Given A ∈ P and λ ∈ R, the affine map
d(P, Q) = kP Qk .
A Appendix
21
Figure 5: Affine dilation of a triangle with fixed point A and ratio λ = 2/3.
22