Algebraic Equations
Mathematics
Tomas Johansson
LiTH–MAT–Ex–98–13
Contents
1 Introduction 3
4 Elliptic functions 25
4.1 General theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
4.2 The Weierstrass ℘ function . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
4.3 Solving the quintic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
4.4 Theta functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
4.5 History of elliptic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
7 Conclusion 51
7.1 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
A Groups 53
B Permutation groups 54
C Rings 54
D Polynomials 55
References 57
2 CONTENTS
3
1 Introduction
In this report, we study how one solves general polynomial equations using only the coef-
ficients of the polynomial. Almost everyone who has taken some course in algebra knows
that Galois proved that it is impossible to solve algebraic equations of degree five or higher,
using only radicals. What is not so generally known is that it is possible to solve such equa-
tions in another way, namely using techniques from the theory of elliptic functions. Galois
theory and the theory of elliptic functions are both extensive and deep. But to understand
and find a way of solving polynomial equations we do not need all this theory. The aim
here is to pick out only the part of the theory which is necessary to understand why not all
equations are solvable by radicals, and the alternative procedures by which we solve them.
We will concentrate on finding an algorithm for the quintic equation.
In Chapter 2 we examine the history of polynomial equations. We also present the
formulas for solutions to equations of degree less then five. A basic tool, the Tschirnhaus
transformation, is introduced. These transformations are used to simplify the solution
process by transforming a polynomial into another where some coefficients are zero.
In Chapter 3 we introduce some Galois theory. In Galois theory one examines the rela-
tionship between the permutation group of the roots and a field containing the coefficients
of the polynomial. Here we present only as much theory as is necessary to understand why
not all polynomial equations are solvable by radicals. The history of the quintic equation
is also presented.
In Chapter 4 we introduce the elliptic functions and examine their basic properties. We
concentrate on one specific elliptic function, namely the Weierstrass ℘ function. We show
that with the help of this function one can solve the quintic equation. However, this is just
a theoretical possibility. To be able to get a practical algorithm, we need to know some
facts about theta functions, with which we also deal in this chapter. The history of elliptic
functions is included at the end of this chapter. Here we also present the general formula for
solving any polynomial equation due to H. Umemura. This formula is also only of theoretical
value, so we will look at other ways of solving the quintic. We present three such methods
in the last chapter. The one that we will concentrate on is Kiepert’s, because it uses both
elliptic functions and Tschirnhaus transformations in a clear and straightforward way.
In Chapter 5 we present two algorithms for solving the quintic, which are due to Hermite
and Gordan. Hermite uses a more analytic method compared with Gordan’s solution, which
involves the use of the icosahedron.
In Chapter 6 we present Kiepert’s algorithm for the quintic. This lends itself to imple-
mentation on a computer. This has in fact been done by B. King and R. Canfield, and at
the end of the chapter we discuss their results.
We conclude with an Appendix. Here one can find the most basic facts about groups,
rings, and polynomials.
Acknowledgment
I would like to thank the following people: my supervisor Bengt Ove Turesson for helping me
with this report and for teaching me LATEX 2ε in which this report is written, Peter Hackman
for helpfull hints about Galois theory, and Gunnar Fogelberg for historical background. I
also thank Thomas Karlsson who told me that this examination work was available, and
Bruce King for telling me about his work on the quintic. I also express my gratitude to John
H. Conway, Keith Dennis, and William C. Waterhouse for giving me information about L.
Kiepert.
4 1 INTRODUCTION
5
2.1 History
Polynomial equations arise quite naturally in mathematics, when dealing with basic math-
ematical problems, and so their study has a long history. The attempts to solve certain of
these equations have lead to new and exciting theories in mathematics, with the result that
polynomial equations has been an important cornerstone in the development to modern
mathematics.
It is known that in ancient Babylonia (2000 – 600 B.C.), one was able to solve the second
order equation
x2 + c1 x + c0 = 0, ci ∈ R,
beginning of the fifteenth centuary in Italy, when Scipione del Ferro (1465–1526) succeeded
in solving the equation
x3 + c1 x + c0 = 0.
In Section 2.3, we will show that every third degree equation can be transformed to the above
form, which means that del Ferro had solved the general cubic equation. Before his death,
del Ferro told his student Fior of his method for solving cubics. Fior challenged another
Italian, Niccolo Tartaglia (1500–1557), to a mathematical duel. This forced Tartaglia, who
knew that Fior could solve such equations, to find a general method for solving the cubic.
Tartaglia succeded with this and thus won the competition.
Tartaglia told Girolamo Cardano (1501–1576) about his method. Cardano published
the method in his book Ars Magna (around 1540). Here he observed that, in certain
cases, the method yields roots of negative numbers, and that these could be real numbers.
This means that he was touching upon the complex numbers. Cardano made his student
Lodovico Ferrari (1522–1565) solve the fourth degree equation, i.e., the quartic. Ferrari
succeeded with this, and Cardano also published this method in Ars Magna.
The case when Cardanos formulas lead to roots of negative numbers was studied by
Rafael Bombelli (1526–1573) in his book Algebra in 1560.
After the Italians, many famous mathematicians examined these equations, for example
Francois Viète (1540–1603) and Ehrenfried Walter von Tschirnhaus (1651–1708).
It was finally the German mathematician Gottfried von Leibniz (1646–1716) who, with-
out any geometry, verified these formulas.
More information about these Italians and the solutions to third and fourth degree
equations can be found in Stillwell [24].
x3 + c2 x2 + c1 x + c0 = 0, ci ∈ C. (1)
w = cos 2π 2π
3 + i sin 3 .
√
If a0 = 0, then this equation has roots y = 0 and y = ± −a1 . Consequently,
c2 √ c2
x=− or x = ± −a1 − ;
3 3
if a1 = 0, then we get the roots
√ c2 √ c2 √ c2
x= 3
−a0 − , x = w 3 −a0 − and x = w 2 3 −a0 − .
3 3 3
If a0 a1 6= 0, then we make the substitution
a1
y=z− .
3z
This substitution gives us an equation in z of the form
a31 a31
z 3 + a0 − ⇒ z 6 + a0 z 3 − = 0,
27z 3 27
which can be factorized as follows:
à à r !! à à r !!
3 a0 a20 a31 3 a0 a20 a31
z + + + z + − + = 0.
2 4 27 2 4 27
Proof From The Fundamental Theorem of Algebra, we know that there are three roots
to our original cubic equation. Therefore, there must be at least one root from each paren-
thesis which gives the same root to (1). Denote them by z1 and z2 . If they are equal, there
is nothing to prove, so assume that this is not the case. We then have
a1 a1 a 1 z1 − z 2 a1
z1 − = z2 − ⇔ z1 − z2 = − ⇔ z 1 z2 = − . (2)
3z1 3z2 3 z1 z2 3
x3 + 3x2 − 3x − 11 = 0.
We follow the method described above. Put x = y − 1. We then get the equation
y 3 − 6y − 6 = 0.
Now substitute
2
y=z+ .
z
This leads to
8 (z 3 − 2)(z 3 − 4)
z3 − 6 + = 0 ⇒ = 0.
z3 z3
We choose to solve z 3 − 2 = 0. The roots to the original equations then are
√ √ √ √ √ √
2 + 4 − 1, w 2 + w2 4 − 1 and w 2 2 + w 4 − 1.
3 3 3 3 3 3
If we had chosen the other parenthesis, we would have found exactly the same roots as
asserted.
We now investigate an arbitrary fourth degree equation:
x4 + c3 x3 + c2 x2 + c1 x + c0 = 0, ci ∈ C. (3)
pr = a0 ,
2q(r − p) = a1 ,
r + p − 4q 2 = a2 .
If we substitute t = q 2 , then the above equation becomes a third degree equation in t, which
can be solved by our third degree formula. We take one of the roots qi , i = 1, 2, . . . , 6, and
use it to make the factorization. Then we solve both second order equations in y, reverse
the transformations, and we find the roots of (3).
y 4 − 6y 2 − 24y + 16 = 0.
(2 + 4y + y 2 )(2 − 4y + y 2 ) = 0.
If we make a substitution where we involve the roots of (5), then, if we are clever,
coefficients can disappear. For example, if we succeed to obtain
where xk is a root of (5) as coefficient of y n−1 , then the coefficient for this term in our new
polynomial will disappear. Therefore, Tschirnhaus transformations will have the general
form
yk = an−1 xn−1
k + an−2 xn−2 + . . . + a1 xk + a0 , k = 1, 2, . . . , n,
where xk is a root of (5).
In order to make clever substitutions, we need to understand some more theory.
10 2 QUADRATIC, CUBIC, AND QUARTIC EQUATIONS
f (x) = xn + cn−1 xn + . . . + c1 x + c0 = 0.
Then
P1 + cn−1 = 0,
P2 + cn−1 P1 + 2cn−2 = 0,
P3 + cn−1 P2 + cn−2 P1 + 3cn−3 = 0,
..
.
Pn−1 + . . . + c4 P3 + c3 P2 + c2 P1 + (n − 1)c1 = 0,
and, for k ≥ n,
f (x) = (x − x1 )(x − x2 ) . . . (x − xn ).
Now differentiate this equality with the help of logarithmic differentiation, to obtain
f (x) f (x) f (x)
f 0 (x) = + + ... + ,
x − x1 x − x2 x − xn
where
f (x)
= xn−1 + an−2 xn−2 + . . . + a1 x + a0 .
x − xi
By comparing coefficients, we get
an−2 = cn−1 + xi ,
an−3 = cn−2 + an−2 xi ,
..
.
a0 = c 1 + a 1 xi .
Substituting the expression for an−k in the expression for an−k−1 , we have
an−2 = cn−1 + xi ,
an−3 = x2i + cn−1 xi + cn−2 ,
..
.
an−k = xik−1 + cn−1 xik−2 + . . . + cn−k+1 .
2.3 Tschirnhaus transformations 11
It follows that
By comparing coefficients of these two expressions one obtains the first part of the theorem.
The last part follows from summing the following identity over i = 1, 2, . . . , n:
xn+k
i + cn−1 xn+k−1
i + . . . + c1 xk+1
i + c0 xk = 0.
• Carry out the substitution with the roots as we just described above.
• Look at the Newton identities for both polynomials, then substitute these expressions,
and solve for the coefficients.
xn + cn−1 xn−1 + . . . + c1 x + c0 = 0.
yk = x k + a 0 , k = 1, 2, . . . , n,
y n + bn−1 y n−1 + . . . + b1 y + b0 = 0.
yk = x k + a 0 ,
gives
P1 (y1 , . . . , yn ) = P1 (x1 , . . . , xn ) + na0 .
But by Newton’s formula, we have
To get rid of the xn−1 -term we shall thus make the substitution
cn−1
yk = x k + .
n
If one has a second order equation and makes the substitution
c1
yk = x k + ,
2
then the equation in y is
³ c1 ´ 2 ³ c1 ´ c21
y− + c1 y − + c0 = 0 ⇒ y2 − + c0 = 0.
2 2 4
The roots to this equation are
r r
c21 c21
y1 = + c0 and y2 = − + c0 .
4 4
The roots of the equation in x will therefore be
r r
c1 c21 c1 c21
x1 = − + + c0 and x2 = − − + c0 .
2 4 2 4
Thus we have obtained the well-known formula for solving quadratics with the help of
a Tschirnhaus transformation.
To find a Tschirnhaus transformation where xn−1 , xn−2 and xn−3 vanish is quite messy;
see King [11].
If we have transformed a fifth degree equation to the Bring-Jerrard form, that is to the
form
x5 + c1 x + c0 = 0, c1 , c2 6= 0, (6)
then the transformation
c0
yk = xk ,
c1
transforms this equation to the form
y 5 + ax + a = 0,
s = wk + w2k + . . . + w(n−1)k ,
Therefore, if k 6= 0 modulo n, the sum equals 0, otherwise it equals n. We then get the
roots to the equation as
n−1
X
nxk = w−k+1 r(wj ), k = 1, 2 . . . , n. (8)
j=1
There exists different ways of finding the Lagrange resolvent for an equation.
Example 3.1 For a third degree equation, the method of Lagrange resolvents means by
(7) that we put
z = x1 + wx2 + w2 x3 ,
where the xi are solutions to a third degree equation and w is a cube root of unity. We can
permute these roots into each other which gives six different z values z1 , . . . , z6 . Then we
form an equation
(x − z1 )(x − z2 ) . . . (x − z6 ) = 0,
where zi are the six permutations of z. This equation will be quadratic in x3 , so it is
solvable. We now show that the equation above is a quadratic in x3 . We list the six
different permutations:
• z1 = x1 + wx2 + w2 x3
• z2 = x3 + wx1 + w2 x2 = wz1
• z3 = w 2 z1
14 3 SOLVABILITY OF ALGEBRAIC EQUATIONS
• z4 = x1 + wx3 + w2 x2
• z5 = wz4
• z6 = w 2 z4
Therefore we have
(x − z1 )(x − z2 ) . . . (x − z6 ) = (x3 − z13 )(x3 − z43 ) = x6 − (z13 + z43 )x3 + z13 z43 .
If we make similar constructions for the quartic, we get an equation of degree 24, which
is still also possible to solve. But when we do the same for the quintic, we get an equation
of degree 120. Observe that the degree of the resolvents equals the order of S 4 and S5 ,
respectively. We see that the degree grows rapidly and the resolvents get more and more
complex. This observation convinced Lagrange that quintic and higher order equations were
not solvable by radicals.
In 1799, the Italian mathematician Paolo Ruffini (1765–1822) published a proof that
equations of degree five or higher are not solvable by radicals. The proof was not complete,
and Ruffini tried during the rest of his life to improve the proof.
The Norwegian mathematician Niels Henrik Abel (1802–1829) presented a proof around
1825. There was an error in the proof which Abel corrected in 1826. He presented his proof
in the mathematical journal Crelle’s Journal and other mathematician subsequently began
studying the proof. Abel’s proof showed that it was impossible to solve the general equation
of degree ≥ 5.
Ruffini and Abel looked at Sn . The concepts were not at all clear, but they observed
that certain things that worked with S3 and S4 did not work for S5 . From this observation,
they concluded that the quintic and higher equations are not solvable by radicals.
Abel’s and Ruffini’s proofs did not answer the question as to which equations actually
can be solved by radicals. Neither did they present an equation not solvable by radicals.
The answer to these questions were unknown until Evariste Galois (1811–1832) answered
them. Galois did not know Ruffini’s work, but he knew Abel’s.
We shall present parts of Galois work below. In the next section we will use modern
concepts to prove a criterion for deciding when an equation is solvable by radicals. It may
3.1 History of the quintic 15
be a good idea to first look a little informally at Galois’ thoughts to understand better the
rather abstract concepts of Galois theory.
The concepts of ‘group’ and ‘ring theory’ were not invented at the time of Galois. In
fact, it was Galois who invented them, and understood their full power. Galois also invented
the term ‘field’. As mentioned, Galois made a lot of contributions to this area; for example,
he defined subgroups, normal groups, cosets, and permutation groups.
y
6
· T
· T
· T
· T
· T -x
T ·
T ·
T ·
T ·
T ·
Figure 1
If one is given a figure with nodes in the plane, as in Figure 1, one can permute the
nodes so the distance to the origin is preserved. These permutation are called distance
preserving. If one has two such permutations σ1 and σ2 , then obviously the permutation
σ1 σ2 is distance preserving. Also, to each element σ1 , there must be an element that reverses
the permutation. We call it the inverse and denote it by σ1−1 . Compositions of permutations
are also associative. All these properties mean that the set of permutations form a group.
The roots to the equation
x5 = 1,
lie symmetrically on a circle. There are other polynomials where the zeros exhibit a much
more antisymmetric appearance when plotted in the complex plane. It is tempting to
permute the roots onto each other and investigate the symmetry. That is what Galois did.
Galois considered a field F that contained the coefficients of a polynomial. One express
the coefficients in terms of the symmetric polynomials (see Section 3.2). Hence, permuting
the roots into each other will not affect the original field. Galois also constructed the Galois
Resolvent. If we have a polynomial with coefficients in F , then the resolvent
v = a 1 x1 + a 2 x1 + . . . + a n xn , a1 , . . . , an ∈ R,
is denoted a Galois resolvent if one gets n! different functions v when permuting the roots
onto each other. Then we form the equation
P (v) = (v − v1 )(v − v2 ) . . . (v − vn ) = 0,
where each vi is a certain symmetric function of the roots of the polynomial. But the
coefficients can be represented symmetrically in terms of the roots. Hence P is a polynomial
with coefficients in F .
16 3 SOLVABILITY OF ALGEBRAIC EQUATIONS
Galois also investigated when an arbitrary function of the roots can be expressed ratio-
nally by the resolvent v. Such functions include, for example, the roots of the polynomial.
In this case, Galois observed that one gets a permutation group that one can reduce to a
single element. More clearly we can put it this way. Assume that we have a polynomial
with coefficients in a field F , for example Q. We must adjoin certain nth roots to get a field
that contain the roots x1 , . . . , xn . The coefficients of the polynomial can be expressed by
the symmetric polynomials, so we can look at extensions of the field
Q(x1 , x2 , . . . , xn ),
(for the notation, see Section 3.2 and the Appendix). When we adjoin an nth root to this
field, then we destroy symmetry. But by adjoining nth roots in all the other variables
we regain symmetry. If we permute the roots onto each other, the original field will be
unaffected. These permutations form a group called the Galois group. When we adjoined
the roots, we got a chain of extensions
F ⊆ F (a) ⊆ . . . ⊆ E.
This is a key observation, because if it is impossible to have such a chain of groups for the
permutation group of index ≥ 5, then it is also impossible to solve equations of degree ≥ 5.
Galois succeeded in proving this. With the help of his theory he was also able to give a
criterion for when equations are solvable. Galois also produced an equation that was not
solvable by radicals. That is, Galois settled all the remaining question that Ruffini and
Abel had left unanswered.
We see that Galois’ theory is not very different from Lagrange’s theory. But because
Galois invented the group and field theory, he was able to prove much more than Lagrange.
Because of Galois short life (his life ended in a duel), he did not write down all his
thoughts in a strict and concise way. Galois’ brother Alfred and his friend August Chevalier
sent his posthumous work to several mathematician, but they did not get any response.
Joseph Liouville was the first mathematician to study Galois’ results conscientiously. Liou-
ville understood their importance and presented them in 1846.
Definition 3.3 Let u ∈ E. A polynomial m(x) ∈ F [x] that has minimal degree, which is
monic, and satisfies m(u) = 0 is called a minimal polynomial of u and is denoted min(F, u).
3.2 Galois theory 17
c0 + c1 u + . . . + cn−1 un−1 ,
where ci ∈ F and n is the degree of min F (u). The degree of the extension is denoted [E : F ]
and equals the degree of the minimal polynomial.
We are also interested in finding the smallest field that contains F and some elements
u1 , . . . , un ∈ E. We can build such an extension by simple ones, and then get a field which
we denote by F (u1 , . . . , un ). The extension F (u1 , u2 ) equals F (u1 )(u2 ), see Nicholson [18].
So we can always build extensions as simple ones.
f (x) = c(x − x1 ) . . . (x − xn ),
f (x) = c0 + c1 x + . . . + xn ∈ F [x].
If the zeros of f (x) are x1 , . . . , xn ∈ E, then the coefficients can be expressed by the
elementary symmetric polynomials. More precisely, if we define
X
σk = xi1 xi2 . . . xik , k = 1, 2, . . . , n,
i1 <i2 <...<ik
then
ck = (−1)n−k σn−k , k = 0, 1, 2, . . . , n − 1.
If we consider the automorphisms θ : E → E on E = F (u1 , . . . , un ), u1 , . . . , un ∈ E, for
which θ(F ) = F , we will have some control over the zeros of f . These automorphisms form
a group, so we can also make use of all the theory developed for groups. This motivates the
following definition.
Definition 3.5 Let E be a field extension of F . The Galois group Gal(E/F ) is the set of
automorphism θ : E → E that leaves F fixed.
Theorem 3.6 Assume that we have a field extension E = F (u1 , . . . , un ). The automor-
phisms θ : E → E that leaves F fixed only depend on how they permute the set {u1 , . . . , un }.
Proof We will use the fact that we can build extensions as simple ones. Put F (u1 ) = F1 .
The extension F1 (u2 ) = F (u1 , u2 ) consists of all elements
where n1 is the degree of the extension. But F1 can also be identified with the vector space
of sums
c00 + c01 u1 + . . . + c0n0 −1 un1 2 −1 , c0i ∈ F,
where n2 is the degree of the extension. If we substitute this expression for ci in (9), we see
that we get a binomial in u1 and u2 with coefficients c0i ∈ F . The automorphisms θ : E → E
that leaves F fixed, will not affect the c0i . Because of this fact we only have to know what
the automorphisms do with the u1 and u2 . Similarly for extensions with more than two
elements, but we will get a multinomial in u1 , . . . , un with coefficients c0i ∈ F . ¤
The idea behind the Galois group is to go back and forth between E and the Galois
group as a pair, and just like a function and its derivative in analysis, we can get a lot of
information by doing this.
If we have F ⊆ L ⊆ E, then Gal(E/L) is a subgroup of Gal(E/F ), and we call L an
intermediate field of the extension E/F . Conversely, if we have a subgroup S of the group
of all automorphism of E, then the set
is called the fixed field of S and is a subfield of E. What we really have is a one-to-one
inclusion reversing bijection between the set of intermediate fields L and the set of subgroups
H of Gal(E/F ).
Theorem 3.8
Proof Suppose that L = F(S). Then S ⊆ Gal(E/L) and this fact implies that
F(Gal(E/L)) ⊆ F(S) = L. But L ⊆ F(Gal(E/L)), and it follows that L = F(Gal(E/L)).
If H = Gal(E/L) for some subfield L of E, then L ⊆ F(Gal(E/L)) and
Gal(E/F(Gal(E/L)) ⊆ Gal(E/L) = H. But H ⊆ Gal(E/F(H)), so it follows that
H = Gal(E/F(H)). ¤
3.2 Galois theory 19
all the roots will lie in the extension field E. We adjoined a root of an element in each
extension to get the field E. It is clear that if we can solve the equation by radicals, then
we can make an extension E like the one above.
Definition 3.13 We call a field extension F (u1 , . . . , ur ) radical if there exist integers
n1 , . . . , nr such that un1 1 ∈ F and uni i ∈ F (u1 , . . . , ui−1 ), i = 2, 3, . . . , r.
Equations like xn − 1 = 0 will be of interest in Section 3.4. An element that satisfies
such an equation is called an nth root of unity. If charF does not divide n, an nth-root will
exist; see Morandi [16].
20 3 SOLVABILITY OF ALGEBRAIC EQUATIONS
Theorem 3.14 If F is a field that contains an nth root of unity w, then the extension
F (u)/F , where u is in some field E, is Galois and Gal(F (u)/F ) is abelian.
Proof Assume that un = a. Then xn − a = 0 has roots u, uw, . . . , uw n−1 . The poly-
nomial xn − a is separable over F (u). Hence F (u) is Galois by Theorem 3.10. This means
that if σ and τ ∈ Gal(F (u)/F ), then σ(u) and τ (u) are zeros of xn − a. So assume that
σ(u) = uw i and τ (u) = uw j . We then have
x4 − 7x2 + 10 = 0.
√ √
We can factorize the left hand side as√(x2 − 2)(x 2 − 5), so the zeros are ± 2 and ± 5.
√ √
We make the extensions F = Q ⊆ Q( 2) ⊆ Q( 2, 5) = E which are radical. What is
Gal(E/F )? Because√an irreducible
√ polynomial
√ √splits in E, we know that the extension is
Galois. This gives ± 2 → ± 2 and ± 5 → ± 5, because if u is a root of a minimal poly-
nomial, then σ(u), where σ ∈ Gal(E/F ), should
√ also be a root of this minimal polynomial.
The field Q should be fixed so for example 2 → 1 is impossible. So we have |G| = 4, as
predicted. The elements of the Galois group are
µ√ √ √ √ ¶
ε = √2 −√2 √5 −√5 ,
2− 2 5− 5
µ √ √ √ √ ¶
√2 − 2
√ √ 5 − √5 ,
σ1 =
− 2 2 5− 5
µ√ √ √ √ ¶
√ 2 −√2 √5 −√5
σ2 = ,
2− 2− 5 5
µ √ √ √ √ ¶
√2 −√2 √5 −√5
σ3 = .
− 2 2− 5 5
{e} = H0 ⊆ H1 ⊆ . . . ⊆ Hn = G,
It is clear that G0 is closed under the operation in G, and because of the lemma above,
we see that G0 really is a subgroup of G. But even more can be said.
Proof We have to show that ghg −1 ∈ G0 for every g ∈ G and every g ∈ G0 . The trick
is to insert g −1 g because we have
It follows from the theorem that G/G0 is abelian. We also see from the definition
preceeding Definition 3.17 that a normal subgroup H of G contains every commutator
(G0 ⊆ H) if and only if G/H is abelian. Conversely, if G0 ⊆ H, where H is a subgroup of
G, g ∈ G, and h ∈ H, then
G ⊇ G1 ⊇ G2 ⊇ . . . ⊇ G n ⊇ . . . ,
where G1 = G0 and Gi+1 = (Gi )0 . In this chain Gi+1 / Gi and Gi /Gi+1 is abelian. This is
connected to solvable groups by the following theorem.
{e} = H0 ⊆ H1 ⊆ . . . ⊆ Hn = G,
We have now developed all the theory necessary to give a criterion for when an equation
is solvable by radicals. This will be the subject of the next section.
Theorem 3.22 Suppose that char(F ) = 0. Let f (x) ∈ F [x] and let E be a splitting field
over F . Then the equation f (x) = 0 is solvable by radicals if and only if Gal(E/F ) is a
solvable group.
where wini ∈ Fi−1 for i = 1, . . . , k. We may assume that F contains all nth roots of unity
(otherwise adjoin them), so every extension will be Galois by Theorem 3.14. By Theorem
3.11 this implies that Gal(Fk /Fi ) / Gal(Fk /Fi−1 ). Consider the chain
Every subgroup is normal in the one preceding it. The field Fi is a Galois extension of
Fi−1 , hence Gal(Fi /Fi−1 ) is isomorphic to Gal(Fk /Fi−1 )/ Gal(Fk /Fi ) by Theorem 3.11. But
Gal(Fi /Fi−1 ) is abelian and so is each quotient group in the chain above. This means that
Gal(Fk /F ) is solvable. Since E ⊆ Fk and the extension is Galois, Gal(Fk /E) / Gal(Fk /F )
and Gal(Fk /E) ∼ = Gal(Fk /F )/ Gal(Fk /E), by Theorem 3.11. Thus Gal(Fk /E) is a homo-
morphic image of Gal(Fk /F ) which is a solvable group, i.e., Gal(Fk /E) is solvable.
For the converse, suppose that Gal(E/F ) is a solvable group. We then have a chain
{e} = H0 ⊆ H1 ⊆ . . . ⊆ Hr = Gal(E/F ),
3.4 Solvability by radicals 23
such that Hi+1 / Hi and Hi /Hi+1 is abelian. Let Ki = F(Hi ). Then Ki+1 will be Galois
over Li by Theorem 3.11 and Gal(Ki+1 /Ki ) ∼ = Hi /Hi+1 . Let w be an nth root of unity,
where n is the least common multiple of the orders of the elements in G. Set Li = Ki (w).
Then we have a chain of fields
F ⊆ L0 ⊆ . . . ⊆ Lr .
One can show that this is a radical extension; see Morandi [16] for the details. ¤
Proof The group An is simple and non-abelian for n ≥ 5, so the derived group of An
equals An i.e. A0n = An . This means that Gn 6= {e} in Theorem 3.20, because, by Theorem
3.19, the only simple groups that are solvable are abelian. Thus An is not solvable. The al-
ternating group An is a subgroup of Sn , so Sn is not solvable for n ≥ 5 by Corollary 3.21. ¤
With this theory one can show that the general polynomial of degree n ≥ 5 is not solvable
by radicals. However, for example, xn = 1 is solvable, because it has a well-behaved Galois
group.
24 3 SOLVABILITY OF ALGEBRAIC EQUATIONS
25
4 Elliptic functions
We have seen in the previous chapter that, in general, it is impossible to solve a polynomial
equation by radicals if the degree is greater than four. However, there are other ways one
can solve such equations. In this chapter we shall describe one possible method.
If there is a convergent sequence {ωn }n∈Z+ of periods of f , then the limit point ω is a
period. In fact, by continuity,
f (z + ω) = lim f (z + ωn ) = f (z),
n→∞
Theorem 4.4 A sequence of periods {ωn }n∈Z+ cannot have a finite accumulation point.
Proof If there were a finite accumulation point, then by the Cauchy criterion, ωn − ωm
tend to 0, as m, n → ∞. But ωn − ωm is a period of f . By Lemma 4.3 this is impossible. ¤
From the theorem we see that if we have a period ω, then among all periods with this
direction in the complex plane, there must be a period with least distance to 0. Such a
period is denoted a primitive period. If we have a primitive period ω, then every integer
multiple of ω also is a period, because
Could rω be a period even for real numbers r? The answer is no, because if there existed a
non-integer multiple of ω, denoted ω1 , then ω1 /ω = r, where r is a real number. By writing
r = n + t, where n is the integer part and t is the fractional part of r, it would follow that
ω1 − nω = tω. But tω is a period of f which is closer to zero than ω. This is impossible
because of the discussion above. We say that two periods are different if they are not integer
multiples of each other. Obviously two different periods satisfy Im( ωω21 ) 6= 0.
We can now prove the following theorem.
Theorem 4.5 (Jacobi theorem) There does not exist a non-constant meromorphic func-
tion with more than two primitive periods.
Proof In view of Theorem 4.4, we can always create a parallelogram which is spanned
by two different periods, such that there are no periods in it other than the vertices. Assume
that there is a third period ω3 . Then it can be represented as a linear combination of ω1
and ω2 :
ω3 = r 1 ω1 + r 2 ω2 , r1 , r2 ∈ R.
Let ri = ni + ti , were ni is the integer part and ti is the fractional part of ri . Then we have
ω3 − n 1 ω1 − n 2 ω2 = t 1 ω1 + t 2 ω2 .
If a function f has two different periods, then the function is called doubly-periodic. The
parallelogram with corners c, c + ω1 , c + ω1 + ω2 , and c + ω2 , where c is an arbitrary point
in the complex plane, is called a period parallelogram and is denoted by Ω. The only node
that is assumed to belong to the parallelogram is c, and the edges in it are the ones between
c and c + ω1 and c and c + ω2 . The complex plane is divided into a lattice induced by the
point c and the two periods. We only have to study a doubly-periodic function inside this
parallelogram.
4.1 General theory 27
»»
»» »»» ¤
»» » ¤
»» ¤ ¤
¤ » ¤
¤ ¤ » » »
:¤»
» ¤
¤ » »» ¤ » ¤º
»
¤ » ¤
¤º Ω ¤
¤ »¤
»
¤ » :
» ¤»» ¤
¤ »¤» »
¤ »»¤ c
» ¤ ¤
¤ ¤ »¤
¤ » »
»¤»
¤
»» ¤ »»
¤»»»
A lattice induced by the point c and two periods
Figure 2
But f (w + ω2 ) = f (w), and thus the first and third integral cancel. Similarly the second
and fourth cancel. Hence Z
f (z) dz = 0.
∂Ω
By the residue theorem,
Z n
X
f (z) dz = 2πi ri ,
∂Ω i=1
where ri are the residues of f , so the sum of residues is zero. ¤
It follows from the above theorem that there cannot be an elliptic function of order one.
28 4 ELLIPTIC FUNCTIONS
The function F (z) = f (z) − C, where C is a constant, is also an elliptic function, having
the same poles as f in a period-parallelogram. We thus have the following corollary.
Corollary 4.11 For every C ∈ C, the number of points z in Ω where f (z) = C, is equal
to the order of f .
Theorem 4.12 Let f be an elliptic function which has p zeros at a1 , . . . , ap and q poles at
p1 , . . . , pq . Then
p
X Xq
ord(ai ) − ord(qj ) = mω1 + nω2
i=1 j=1
and let
Λ0 = Λ − {0}.
The series on the right hand side converges for p > 2. Hence we have proved the theorem. ¤
In view of Theorem 4.13, the function
X 1
G(z) = −2 ,
(z − ω)3
ω∈Λ0
converges absolutely and uniformly in each bounded region of the z-plane that does not
intersect Λ0 . Hence the sum defines a meromorphic function.
We have
X 1 X 1
G(z + ω1 ) = −2 = −2 = G(z),
(z + ω1 − ω)3 (z − ω)3
ω∈Λ0 0 ω∈Λ
because of the infinite summation range, and similarly G(z + ω2 ) = G(z). Thus, we see
that G is doubly-periodic with poles kω1 + lω2 . By definition, this means that G is elliptic.
Moreover,
X 1 X 1
G(−z) = 2 3
= 2 = −G(z).
0
(z + ω) 0
(z − ω)3
ω∈Λ ω∈Λ
Hence G is odd.
We now introduce the Weierstrass function ℘ as
1 X 1 1
℘(z) = 2
+ 2
− 2.
z 0
(z − ω) ω
ω∈Λ
30 4 ELLIPTIC FUNCTIONS
Observe that ℘0 (z) = G(z). It follows that ℘ is an even function. From the above we have
that ℘0 (z) = ℘0 (z + ω1 ) for all z. Notice that the number 12 ω is not a pole of ℘. Integrating
the periodicity identity for ℘0 , we obtain
℘(z) = ℘(z + ω1 ) + c.
where X 1 X 1
g2 = 60 and g3 = 140 .
0
ω4 0
ω6
ω∈Λ ω∈Λ
The numbers g2 and g3 are called invariants.
Proof Make a Laurent expansion of ℘ at z = 0. Since odd terms in the sum cancel, we
obtain
1 X 1 X 1 1 g2 g3
℘(z) = 2 + 3z 2 4
+ 5z 4 6
+ . . . = 2 + z2 + z4 + . . . .
z 0
ω 0
ω z 20 28
ω∈Λ ω∈Λ
This gives
2 g2 g3
℘0 (z) = − 3
+ z + z3 + . . . ,
z 10 7
4 ³ g 2 g3 ´
℘0 (z)2 = 6 1 − z 4 − z 6 + . . . ,
z 10 7
3 1³ 3g2 4 3g3 6 ´
℘(z) = 6 1 + z + z + ... ,
z 20 28
and hence that
where A and B are constants. The left hand side is an elliptic function, and in view of the
right hand side, it is entire, and hence a constant. Thus A = B = 0. ¤
If we differentiate (10), we see that all the derivatives of ℘ can be expressed in terms of
℘ and ℘0 . For example,
2℘00 ℘0 = 12℘0 ℘2 − g2 ℘0
After division with 2℘0 , we get
℘00 = 6℘2 − 21 g2 . (11)
The right hand side of the differential equation (10) is a third degree polynomial in ℘(z),
so it can be factorized as
℘( 21 ω1 ) = ℘( 12 (ω1 + ω2 )),
then the second order elliptic function ℘(z) − ℘( 21 ω1 ) would have two second order zeros
counted according to multiplicity, which is impossible. Therefore we have
is defined to be Y
D = cn2n−2 (xi − xj )2 ,
1≤j<i≤n
This is an elliptic function of order 5. We know that it takes every value five times inside
a period parallelogram, and in particular it takes the value zero five times.
The fifth degree equation
4x5 − x4 − g2 x3 + (2 − g3 )x2 − 1 = 0
can be written as
x2 (4x3 − g2 x − g3 ) − x4 + 2x2 − 1 = 0.
If we put x = ℘(zj ), where zj is one of the five zero of f , and use the differential equation
(10) for ℘(z) , we get
Thus the function f has five points that solve a fifth degree equation. This shows that the
elliptic functions seem to have the properties we were looking for at the beginning of this
chapter. This observation will be extended in the next section.
32 4 ELLIPTIC FUNCTIONS
There are other functions that are connected to the Weierstrass function. These are the
Weierstrass ζ and σ functions which are defined by the expressions
Z z Z z
1 1 σ(z) 1
ζ(z) = − ℘(t) − 2 dt and ln = ζ(t) − dt.
z 0 t z 0 t
The zeta function is an meromorphic function with simple poles. It is not doubly periodic
but it is odd. The sigma function is an entire function with simple zeros.
Another important class of functions in the theory of elliptic functions are the elliptic
modular functions. One studies a function depending on a parameter τ , with Im(τ ) > 0,
under certain transformations, for example a Moebius transformation. We get these different
functions when we transform the function and they form a group called the modular group.
An example of a elliptic modular function is
g2 (1, τ )3
J(τ ) = ,
g2 (1, τ )3 − 27g3 (1, τ )2
where g2 and g3 are the invariants of ℘ and τ = ω2 /ω1 . With the aid of modular functions
one can prove Theorem 4.25 below.
We end this section by showing a connection between elliptic functions and elliptic
integrals.
If we have an integral
∞
dt
Z
u= √ , (12)
x 4t3 − at − b
where a and b are constants, then it is, by definition, an elliptic integral. If the constants
a and b satisfy a3 − 27b2 6= 0, we can take them as invariants and construct a Weierstrass
function.
If we see x as a function of u, then one might wonder if x is an elliptic function. It is,
and to prove this, make the substitution t = ℘(v). Then we get the integral
Z v
u= dv
0
P (x) = x5 + cx + c = 0. (13)
4.3 Solving the quintic 33
Recall from algebra that a polynomial has a double zero if and only if the zero is a zero to
the derivative of the polynomial. The derivative of P is 5x4 + c. Using long division on P
and P 0 , we get
x5 + cx + c = (5x4 + c) 51 x + ( 45 cx + c).
If we have a double zero, it follows that
c( 54 x + 1) = 0,
and we have
5 55
x=− and c=− .
4 44
We now define the elliptic function fc by
√ √
fc (z) = ℘(z)℘0 (z) + i c℘(z) + 2i c.
The order of fc is five, so it assumes every value five times inside a period parallelogram.
Especially, there are five points zj , j = 1, 2, . . . , 5, counted according to multiplicity, were
fc takes the value zero.
Lemma 4.18 The numbers ℘(zj ) are the roots to the equation x5 + cx + c = 0.
√ √
Proof Since fc (zj ) = 0, we have ℘(zj )℘0 (zj ) = −i c℘(zj ) − 2i c. If we square both
sides, we can use the differential equation for ℘. This manipulation gives
5
If we can show that fc (zj ) has five simple zeros for c 6= − 544 , then we are very close to
solving (13).
Lemma 4.19 If c 6= −55 /44 , then the function fc has five simple zeros.
Proof Assume on the contrary that c 6= −55 /44 and that fc has a double root, that is
√ √
fc (zj ) = ℘(zj )℘0 (zj ) + i c℘(zj ) + 2i c = 0,
√
fc0 (zj ) = ℘0 (zj )2 + ℘(zj )℘00 (zj ) + i c℘0 (zj ) = 0.
Now use the differential equation (10) and the formula (11) for the second derivative of ℘.
Then the derivative becomes
√
fc0 (zj ) = 10℘(zj )3 + i c℘0 (zj ) − c = 0. (14)
We must show that if z1 and z2 are distinct zeros of fc , then ℘(z1 ) 6= ℘(z2 ). If we can
show this, then we have solved (13)! We need another three lemmas.
Lemma 4.20 If z1 6= z2 and ℘(z1 ) = ℘(z2 ), then ℘0 (z1 ) = −℘0 (z2 ).
Proof By Theorem 4.12 and the fact that the ℘ has only one pole of order two at zero,
we have z1 + z2 = mω1 + nω2 , where m and n are integers. Thus z2 = mω1 + nω2 − z1 . The
derivative ℘0 is periodic and odd. Thus
℘0 (z2 ) = ℘0 (mω1 + nω2 − z1 ) = ℘0 (−z1 ) = −℘0 (z1 ). ¤
Lemma 4.21 If z1 , z2 are distinct zeros of fc , then ℘(z1 ) 6= ℘(z2 ).
Proof Suppose that ℘(z1 ) = ℘(z2 ) with z1 = 6 z2 . Then
0 √ √
fc (z1 ) = ℘(z1 )℘ (z1 ) + i c℘(z1 ) + 2i c = 0.
By the previous lemma, we can write the equality fc (z2 ) = 0 as
√ √
fc (z2 ) = −℘(z1 )℘0 (z1 ) + i c℘(z1 ) + 2i c = 0.
Subtracting this from the first equality yields
2℘(z1 )℘0 (z1 ) = 0.
But ℘(z1 ) 6= 0, because c 6= 0, whence ℘0 (z1 ) = 0. This means that the function ℘(z)−℘(z1 )
assumes the value 0 three times counted according to multiplicity, since it has a double zero
at z1 and a simple zero at z2 . But the order of ℘(z) − ℘(z1 ) is two, so we have obtained a
contradiction. ¤
Lemma 4.22 If c = 55 /44 , then fc has a double and three simple zeros.
Proof Because ℘ is elliptic it assumes every value, so there is a point√z1 such that
℘(z1 ) = −5/4. Using the differential equation for ℘, we have ℘0 (z1 ) = ± 16
15 5. Using the
numerical values of c, ℘(z1 ), and ℘0 (z1 ), we get
fc (z1 ) = fc0 (z1 ) = 0
and fc00 (z1 ) 6= 0. The other zeroes xi , i = 1, 2, 3, to the quintic are simple, and there are
numbers zi , i = 1, 2, 3 such that ℘(zi ) = xi . As above one can show that they are zeroes
to fc . From Lemma 4.21 we conclude that they are distinct. Hence fc has one double and
three simples zeroes and we have proved the lemma. ¤
We define a function
∞
2 τ +2mz)πi
X
θ3 (z, τ ) = e(m , Im(τ ) > 0.
m=−∞
Since Re(iτ ) < 0, it follows that the series converges uniformly in every set |z| ≤ R, where
R is a real number. One often puts q = eiπτ . Then we have
∞
2
X
θ3 (z, τ ) = q m e2πimz .
m=−∞
If we replace z with z + 1 in the function above, we get the same series. Therefore θ 3 (z) =
θ3 (z + 1), so θ3 is periodic. Now, if we substitute z + τ for z above, we get
∞
2 τ +2mz)πi+2iπmτ
X
θ3 (z + τ, τ ) = e(m .
m=−∞
were k is a constant with |k| < 1. It converges absolutely for x 6= 0, as can be seen by
comparison with the series
∞
X
log |1 − k 2m−1 x||1 − k 2m−1 x|.
m=1
2
from which it follows that aj = a0 (−1)j k j , and hence that
∞
2
X
f (x) = a0 (−1)j k j xj .
j=−∞
Then we have
∞
2
X
2πiz
f (e ) = a0 (−1)j k j e2πijz .
j=−∞
where
∞
Y
c= (1 − q 2m ),
m=1
4.5 History of elliptic functions 37
For more details, see Wang and Guo [26]. These formulas will be used in Chapter 6, where
we study an algorithm for solving the quintic.
all the previous methods for solving the quintic. For more information about Klein’s book,
see [12].
In Chapter 6, we study an algorithm due to Ludwig Kiepert (1846–1934). Kiepert
discovered many formulas in the theory of elliptic functions. His algorithm for solving the
quintic equation was presented in 1883.
During the last century, a lot has happened both in Galois theory and in the theory of
special functions. Group theory was developed and was given its modern form. More ways
of solving the quintic using special functions were developed. The development of com-
puters led to more and better algorithms for solving polynomial equations in an iterative
way. Bruce King and E. Rodney Canfield at Georgia University implemented Kiepert’s al-
gorithm on a microcomputer in the 1980’s. The commercial program package Mathematica
implemented Hermite’s algorithm for solving the quintic in 1994.
Kronecker believed that there would be a general theorem for expressing the roots of
any polynomial with elliptic functions, but he did not find any theorem of this kind. But
in 1983, Hiroshi Umemura proved a general formula for the nth degree equation. We just
state his theorem here.
where Ω is the period matrix of the hyperelliptic curve y 2 = F (x), and F (x) = x(x − 1)P (x)
if the degree of P (x) is odd, and F (x) = x(x − 1)(x − 2)P (x), otherwise.
x3 − 3x + 2c0 = 0, c0 ∈ R,
where c0 where expressed by trigonometric functions. By observing the similar form of this
equation and the Bring–Jerrard quintic, he thought that a corresponding formula would
exist for the quintic. We can summarize his technique as follows. Define the numbers k
and k 0 by
z2 − z 3 z1 − z 2
k2 = and (k 0 )2 = 1 − k 2 = ,
z1 − z 2 z1 − z 3
where the numbers zi are the roots to (10) for a Weierstrass function with periods ω 1 and
ω2 . The number k 2 can also be expressed by theta functions in the following way where we
put τ = ω1 /ω2 :
θ2 (0, τ )4
k2 = ;
θ3 (0, τ )4
see Wang and Guo [26]. We define the following functions:
s
√
4 θ2 (0, τ )
ϕ(τ ) = k = ,
θ3 (0, τ )
s
√
4 θ4 (0, τ )
ψ(τ ) = k0 = .
θ3 (0, τ )
Observe that
ϕ(τ )8 + ψ(τ )8 = 1,
because k 2 + (k 0 )2 = 1. We then define the parameters
v 6 − u6 + 5v 2 u2 (v 2 − u2 ) + 4uv(1 − v 4 u4 ) = 0.
40 5 HERMITE’S AND GORDAN’S SOLUTIONS TO THE QUINTIC
To relate this sixth degree equation with the Bring–Jerrard form, Hermite defined the
function
µ µ ¶¶ µ µ ¶ µ ¶¶ µ µ ¶ µ ¶¶
5 τ + 16 τ + 64 τ + 32 τ + 48
Φ(τ ) = ϕ (5τ ) + ϕ ϕ −ϕ ϕ −ϕ .
τ 5 5 5 5
Because of the definition of ϕ, it is built up by theta functions. Therefore, if we, for example,
replace τ with τ /5, ϕ will have a regular behaviour. Therefore,
µ ¶ µ ¶ µ ¶ µ ¶
5 τ + 16 τ + 32 τ + 48
Φ (5τ ) , Φ , Φ , Φ , and Φ ,
τ 5 5 5
2(1 + ϕ(τ )8 )
x5 − x − √
4 5
= 0.
5 ϕ(τ )2 ψ(τ )4
x5 − x − a0 = 0,
k 4 + z 2 k 3 + 2k 2 − z 2 k + 1 = 0,
√4
where z = a0 55 /2. When we have found this value for k, we can calculate the value of τ
from the definition of ϕ. The roots of the Bring–Jerrard quintic can then be expressed as
ϕ(τ + 16k)
xk = √
4 4 3
, k = 0, 1, . . . , 4.
2 5 ϕ(τ )2 ψ(τ )4
By using the series expansions of the functions involved above, one finds that this way
of calculating the roots gives series that converge very rapidly. For more details and a
derivation of the last expression, see King [11] and Prasolov and Solovyev [20].
cube. Doing the same thing with an octahedron gives a cube inscribed in it. Any symmetry
of a solid is clearly a symmetry of the dual. The dual of the icosahedron is the dodecahe-
dron which has 12 pentagonal faces, 30 edges, and 20 vertices (three edges meeting at each
vertex). Using distance preserving maps, which we dealt with in Chapter 3, shows that the
symmetry group of the icosahedron has order 60. The order of A5 is also 60, and one can
show that they are isomorphic; see Shurman [22]. Therefore we can view the icosahedron as
concrete picture of the group A5 . In Chapter 3, we saw that the unsolvability of the quintic
was connected to the group A5 . Because of this fact, when we study the icosahedral group,
we are also studying the quintic.
We now consider polyhedral polynomials. They are polynomials with roots that lie on
• the vertices,
©©H
¡@ @HH
©©¡ »X
» X @ HH
©»© ¡
» ·T X
X H
»©»© ¡ @ H
X H
X
C@
C ¡ · T @ ¡¤¤
@¡ @.¡
¤T · T ·C
CC ¤ T · C ¤¤
¤ T
· ·
T C
¤ T · C
¤ CC· T · T ¤¤ C
¤ T · C
¤
¤X T · C
H XXX » » »C
©
HH XX T · »»» ©
HH XX · » ©©
T.»
©
HH ©©
H©
An icosahedron
Figure 3
Often one expresses equations involving polyhedral polynomials in homogeneous form, that
is
see [11] for a derivation. Observe that the order of each form is equal to the number of
faces, edges, and vertices, respectively.
42 5 HERMITE’S AND GORDAN’S SOLUTIONS TO THE QUINTIC
1728f 5 − H 3 − T 2 = 0. (15)
Gordan expressed the different permutations of the icosahedron with the help of a complex
fifth root, ω = cos 2π 2π
5 + i sin 5 , and the variables u and v. He then expressed the same
rotations of the icosahedron with ω 2 and variables denoted w and z. This gives another set
of permutations of the icosahedron. Gordan then constructed following function:
ξ = u3 w2 z + u2 vz 3 + uv 2 w3 − v 3 wz 2 ,
which is unchanged by the permutations of the icosahedron. From this function Gordan
found two other functions, φ and ψ, with the help of techniques from invariant theory. He
then could derive a principal quintic in these three functions:
x5 + 5ξx2 − 5φx − ψ = 0.
The roots to this equations can now be expressed in the variables u, v, w, and z. Using
these roots he was able to derive another equation:
H
y 5 − 10y 3 + 45y − p = 0,
f5
√
where H and f are parameters. By making the substitution y = t f one gets a Brioschi
quintic (see Chapter 6):
t5 − 10ωt3 + 45ω 2 t − ω 2 = 0,
where ω is a parameter. It is possible to solve this equation with the help of elliptic functions
and by a partition of the icosahedron. This is done in Kiepert’s algorithm which we study
in Chapter 6. For more details, see King [11].
43
x5 + c4 x4 + c3 x3 + c2 x2 + c1 x + c0 = 0. (16)
z = x2 − ux + v. (17)
We then get
z 5 + 5a2 z 2 + 5a1 z + a0 = 0. (18)
The formulas for the transformation are
(2c24 − 5c3 )u2 + (4c34 − 13c4 c3 + 15c2 )u + 2c44 − 8c24 c3 + 10c4 c2 + 3c23 − 10c1 = 0,
and v is defined by
−c4 u − c24 + 2c3
v= .
5
where w1 and w2 are parameters. We now transform the equation (18) to the Brioschi form
(a22 + a2 a1 a0 − a31 )λ2 + (−11a32 a1 + a2 a20 − 2a21 a0 )λ + 64a22 a21 − 27a32 a0 − a1 a20 = 0,
and
(a2 λ2 − 3a1 λ − 3a0 )3
V = ,
a22 (λa2 a0 − λa21 − a1 a0 )
1
w= ,
1728 − V
V a22 − 8λ3 a2 − 72λ2 a1 − 72λa0
µ= .
λ2 a2 + λa1 + a0
In the equation for λ it suffices to solve the equation and take one of the two roots.
To understand and find the transformation (20) one studies the polyhedral polynomials
mentioned in Section 4.5. Briefly we do the following. We use the midpoints of the edges
of the icosahedron to divide it into five octahedrons. We then find the vertex and face
polynomials for these octahedrons. The expressions for them are
where ² = e2πi/5 ; see King [11]. The vertex polynomials tk satisfy the Brioschi equation
where f and T are the vertex and edge polynomials for the icosahedron. By using this fact
and making the substitution
µ ¶ µ 3¶
λf µf
zk = Wk + tk W k ,
H HT
where f and T are as above and H is the face polynomial for the icosahedron, we can
obtain the coefficients of the principal quintic (18). By reverting these steps we get the
transformation (20). For details, see King [11].
y 6 − 10f y 3 + Hy + 5f 2 = 0, (22)
where f and H are parameters. It is denoted the Jacobi sextic equation. One can show
that if the parameters satisfy
1728f 5 − H 3 − T 2 = 0, (23)
6.1 An algorithm for solving the quintic 45
(compare with (15)) and the roots to the Jacobi sextic are denoted y ∞ and yi , i = 0, 1, . . . , 4,
then the roots to (21) (observe that f , T , and H are parameters now) can be expressed as
s
1
tk = √ (y∞ − yk )(yk+2 − yk+3 )(yk+4 − yk+1 ), k = 0, 1, . . . , 4, (24)
5
where the subscript are calculated modulo five. For a proof of this result, see Perron [19].
By noticing that (19) is a special case of (21), where f 2 = T and t = yT /f 2 , we only have
to solve (22) to get the solution to the quintic.
Solving the Jacobi sextic: We shall solve (22) using elliptic functions. We begin by
expressing the coefficients of (21) in terms of the elliptic invariants g2 and g3 . Put D =
g23 − 27g32 . Write the equation (21) in the form
10 3 45 216
t5 + t + 2 t − 3 g3 = 0. (25)
D D D
We then get the relationships
1 216g3
f= and T = .
D D3
Substituting this into the identity (23), we get
¶3
HD2
µ
D=− − 27g32 .
12
10 3 12 5
y6 + y − 2 g2 y + 2 = 0. (26)
D D D
By comparing (25) with (19), we see that
1
D=− .
w
In order to solve (26), the key equation is
where σ is one of the Weierstrass functions mentioned in Chapter 4. This equation was
derived by Kiepert; see [10]. The values of z that are roots to this equation are z kl =
(kω1 + lω2 )/5, where ω1 and ω2 are the periods that corresponds to the invariants g2 and
46 6 KIEPERT’S ALGORITHM FOR THE QUINTIC
g3 constructed above, and k and l are integers satisfying |k| + |l| 6= 0. There are 24 different
pairs k, l that gives different roots to this equation inside a period parallelogram of ℘.
Now expand the determinant (27) using the fact that the derivatives can be expressed
by ℘ and ℘0 ; see Section 4.2. We then arrive at the equation
Using this equality and the fact that ℘00 and ℘(3) can be expressed by ℘ and ℘0 , we get the
following formula for χkl :
(℘00 )2 − 12℘kl (℘0kl )2
χkl = kl . (29)
4(℘0kl )2
We will now derive properties valid for every pair χkl , ℘kl . Therefore we will drop the
subscripts for χkl and ℘kl below. By substituting (29) into (28) we arrive at an equation
for
χ3 − ℘00 χ − (℘0 )2 = 0. (30)
Using the differential equation for ℘0 and the identity (11) in (30) and in (31) gives two
equations:
1
χ3 − (6℘2 − g2 )χ − (4℘3 − g2 ℘ − g3 ) = 0, (32)
2
1 2 1
(6℘ − g2 ) − (4χ + 12℘)(χ3 − 6℘2 χ + g2 χ) = 0.
2
(33)
2 2
Now introduce the variable ℘ = 1/2(z − χ). Use this in the expressions above and solve for
z 2 . We then get p
3z 2 = 5χ2 + g2 + 2χ 5χ2 + 3g2 .
To obtain (26), we introduce the variable χ2 = 1/s and multiply by s6 /D2 . From this we
can derive the following expression for the roots of (26):
¶¶−1
√
µ µ ¶ µ
2ω1 + 48kω2 4ω1 + 96ω2
sk = ℘ −℘ , (34)
5 5
¶¶−1
√
µ µ ¶ µ
2ω1 4ω1
s∞ = ℘ −℘ ; (35)
5 5
see Kiepert [10]. The thing to do know is to evaluate ℘ at these points, which means that
we must evaluate an infinite series. We do this with the help of theta functions. By using
the identity
z1 − z 2 θ4 (0, τ )4
=
z1 − z 3 θ3 (0, τ )4
(see Wang and Guo [26]), where zi is a root to the left hand side in the differential equation
(10) for ℘, and using the same differential equation and the fact that theta function can be
expanded as infinite products, one can derive the following formulas:
∞
√ 1 X 2 2
sj = (−1)m ²j(6m+1) q (6m+1) /60 , j = 0, 1, 2, 3, 4,
B m=−∞
√ ∞
√ 5 X 2
s∞ = (−1)m q 5(6m+1) /12 ,
B m=−∞
and
√ ∞
2
X
(−1)m q (6m+1) /12 ,
6
B= D
m=−∞
where ² = e2πi/5 and q = eπiω1 /ω2 . To find these expressions, one uses properties of the σ
functions. For a complete proof, see Akhiezer [1] and King [11].
We have now an algorithm for solving the general quintic using only its coefficients.
But one thing remains: we must find the periods of the elliptic functions since the formula
contains the factor q = eπiω1 /ω2 . In fact, it is sufficient to determine q. To do this we use
the elliptic integral (12) in the following way. Solve the equation
4x3 − g2 x − g3 = 0.
The number k 2 can also be expressed by theta functions in the following way:
θ2 (0, q)4
k2 = ;
θ3 (0, q)4
θ2 (0, q 4 )
2L = .
θ3 (0, q 4 )
Even if L is close to 1, this expression gives a good numerical value for q. The expression in
L for q is denoted the Jacobi nome. When we solve (10) we get three roots. It is important
how we order the roots, because for half of the orderings we will get |q| > 1 and then
the series for the roots does not converge. We must therefore make a check of this if we
implement this method of solving the quintic on a computer. To be sure that we have done
everything right, we can carry out the multiplication
4
Y
(s − s∞ ) (s − sk ),
k=0
Reversing the transformations: We now only have to reverse the transformations that
we have done. Use
1
yk2 = √ (s∞ − sk )(sk+2 − sk+3 )(sk+4 − sk+1 ),
5
ω2
yk = 2 ,
yk + 10/Dyk2 + 45/D 2
where the first equation comes from (24) and the last is a rewritten version of (21). To
reverse the first transformation use the following equation derived by Kiepert
one can derive an equation for xk which gives the equation above.
We now have a complete algorithm for solving the quintic.
6.2 Commentary about Kiepert’s algorithm 49
7 Conclusion
7.1 Conclusion
We have seen that algebraic equations have interested mathematicians for a long time. A
new era began when Galois proved the impossibility of solving equations of degree higher
than four, using only radicals. After this attention concentrated on finding other ways
of solving these equations. This was done in several ways, mainly using the theory of
special functions, such as elliptic and theta functions. This lead to several algorithms for
solving the quintic; these are presented in Chapter 5 and 6. These algorithms are not only
interesting from a theoretical point of view, but are also practically useful. The program
package Mathematica has already implemented a part of Hermite’s approach of solving
the quintic, and Kiepert’s algorithm has also been implemented on a computer. But these
solution methods only deal with the quintic. A goal for future research might be to find new
algorithms, or to simplify Umemuras solution, so that one can solve higher degree equations
in a similar way.
With this in mind we can finally say that polynomial equations have been a part of
mathematics for a long time, but there are still things to do; so they will certainly accom-
pany us into the 21th century.
52 7 CONCLUSION
53
Appendix
A Groups
Definition A.1 A nonempty set G, on which a binary operation ◦ is defined, is called a
group if, for arbitrary a, b, c ∈ G,
• (a ◦ b) ◦ c = a ◦ (b ◦ c) (associativity);
• for each a ∈ G there exists an element a−1 ∈ G such that a ◦ a−1 = a−1 ◦ a = e
(existence of inverse).
Definition A.2 If (G, ◦ ) and (G1 , · ) are groups, then a mapping σ : G → G1 is called an
homomorphism if σ(a ◦ b) = σ(a) · σ(b) for every a, b ∈ G.
Two cosets aH and bH are equal if and only if ab−1 ∈ H. We have also that aH = bH or
aH ∩ bH = ∅, and the same is true for the left cosets. This means that the cosets will form
a partition of G. If H / G, then the distinct cosets form a group called the factor group and
is denoted G/H. By definition aH ◦ bH = a ◦ bH. If G is finite, then |Ha| = |aH| = |H|,
and we have the following famous theorem due to Lagrange.
B Permutation groups
The set of all permutations of the set {1, 2, 3, . . . , n} is called the symmetric group and is
deoted Sn . The order of Sn is n!. We write a permutation as
µ ¶
1 2 3 ... n
σ=
i1 i2 i3 . . . in
One can show that the permutations form a subgroup of Sn called the alternating group
and denoted An .
C Rings
A nonempty set R is said to form a ring with respect to two binary operations, often called
addition and multiplication and denoted by + and ∗, if the following holds:
• (a ∗ b) ∗ c = a ∗ (b ∗ c);
• a ∗ (b + c) = a ∗ b + a ∗ c;
• (b + c) ∗ a = b ∗ a + c ∗ a.
Definition C.1 We define char(R) to be the smallest integer such that n ∗ a = 0 for all
elements in R. If no such integer exists, then char(R) = 0.
• Ring with unity: A ring with a multiplicative identity element. This element is
often denoted with 1.
Definition C.3 A subgroup H of a ring R having the property that r ∗ x ∈ H for all x ∈ H
and r ∈ R is called left ideal. Similary a subgroup H having the property x ∗ r ∈ H for all
x ∈ H and r ∈ R is called a right ideal. If a subgroup is both a left and right ideal, then it
is called an ideal.
55
Observe that the additative group is abelian, so the subgroups will be normal. If a
subgroup g is an ideal, we can define factor groups R/g = {r + g : r ∈ R} as the set of all
different cosets of g. This will in fact be a ring, denoted the factor ring.
Definition C.4 Assume that R and R1 are two rings, then a mapping σ : R → R1 is a
homomorphism if it preserves addition and multiplication.
D Polynomials
Let R be a ring and let x be any symbol not in R. By a polynomial in x over R is meant
an expression
n
X
n
P (x) = cn x + . . . + c1 x + c0 = cj x j .
j=0
Every cj xj is called a term and the coefficients of xn is denoted the leading coefficient. Two
polynomials are equal if every term are equal. If the operation in R are denoted by ◦ and ∗ ,
then we define addition as
and multiplication as
n µX
X k ¶
f (x) ∗ g(x) = (f0 ∗ g0 ) + (f0 ∗ g1 ◦ f1 ∗ g0 )x + . . . = fj ∗ gk−j xk .
k=0 j=0
Theorem D.1 (Division Algorithm) Let f, g ∈ R[x], where R is a ring. Suppose that
f 6= 0 and that the leading term in f is a unit. Then there exist uniquely determined
polynomials q and r such that
When a polynomial has a root, it is reducible in that ring. We end this section with a
useful criterion for when an polynomial in Z is irreducible in Q.
56 D POLYNOMIALS
• p divides each ci , i = 0, . . . , n − 1,
References
[1] Akhiezer, N., Elements of the Theory of Elliptic Functions, AMS, Rhode Island,
1990.
[2] Boyer, B. and Merzbach, C., A History of Mathematics, John Wiley & Sons, New
York, 1989.
[3] Brioschi F., Sur les équations du sixième degré, Acta math. 12 (1888), 83-101.
[4] Eddy, R. H., The Conics of Ludwig Kiepert – A Comprehensive Lesson in the Ge-
ometry of the Triangle, Math. Mag. 67 (1994), 188-205.
[5] Gordan, P., Sur les équations du cinquième degré, Journal de math. 1 (1885),
445–458.
[6] Gordan, P., Über Gleichungen fünften grades, Math. Annalen 28 1 (1887), 152–166.
[7] Hausner, A., The Bring-Jerrard Equation and Weierstrass Elliptic functions, Amer.
Math. Monthly 69 (1962), 193–196.
[8] Hermite, C., Sur l’équation du cinquième degré, C.R.A.S. Paris 48–49 (1865/66),
347–424.
[9] Kiepert, L., Auflösung der Gleichungen fünften grades, J. für Math. 87 (1878),
114–133.
[10] Kiepert, L., Wirkliche Ausführung der Ganzzahligen Multiplikation der Elliptischen
Funktionen, J. für Math. 76 (1873), 21–33.
[11] King, B., Beyond the Quartic Equation, Birkhäuser Verlag, Boston, 1996.
[12] Klein, F., Vorlesungen über das Ikosaeder , Birkhäuser Verlag, Basel, 1993.
[13] Klein, F., Weitere Untersuchungen über das Icosaeder, Math. Annalen 12 (1877),
503–560.
[14] Kronecker, L., Sur la résolution de l’équation du cinquième degré, C.R. Acad. Sci.
46 (1858), 1150–1152.
58 REFERENCES
[15] Kronecker, L., Über die classe der Gleichungen von denen die Theilung der ellip-
tischen Funktionen abhängt, Monatsberichte der Berliner Akad. der Wiss 4 (1879),
88–96.
[16] Morandi, S., Fields and Galois Theory, Springer-Verlag, New York, 1996.
[17] Mumford, D., Tata Lectures on Theta 2, Birkhäuser Verlag, Boston, 1984.
[18] Nicholson, W., Introduction to Abstract Algebra, PWS Publishing Company, Boston,
1993.
[19] Perron, O., Algebra II, Walter de Gruyter & Co, Berlin, 1933.
[20] Prasolov, V. and Solovyev, Y., Elliptic Functions and Elliptic Integrals, AMS,
Rhode Island, 1997.
[21] Rigatelli, T., Evariste Galois 1811-1832, Birkäuser Verlag, Basel, Boston, 1996.
[22] Shurman, M., Geometry of the Quintic, John Wiley & Sons, New York, 1997.
[23] Slodowy, P., Das Ikosaeder und die Gleichungen fünften Grades, in Aritmetik und
Geometri: Vier Vorelesungen, Birkäuser Verlag, Basel, 1986.
[24] Stillwell, J., Mathematics and Its History, Springer-Verlag, New York, 1989.
[25] van der Waerden, B., Science Awakening, P. Noordhoff, Groningen, 1961.
[26] Wang, Z. X. and Guo, D. R., Special Functions, World Scientific, Singapore, 1989.
[27] Weierstrass, K. and Schwarz, H. A., Formeln und Lersätze zum Gebrauche der
Elliptischen Funktionen, Springer-Verlag, Berlin, 1893.
Avdelning, Institution Datum
Division, Department Date
Titel
Title
Analytic Solutions to Algebraic Equations
Sammanfattning
Abstract
This report studies polynomial equations and how one solves them using
only the coefficients of the polynomial. It examines why it is impossible
to solve equations of degree greater than four using only radicals and how
instead one can solve them using elliptic functions. Although the quintic
equation is the main area of our investigation, we also present parts of
the history of algebraic equations, Galois theory, and elliptic functions.