(PDE2)
Jaroslav Vondrejc1
1
Technische Universitat Braunschweig, Carl-Friedrich-Gau-Fakultat, Institute of
Scientific Computing, Muhlenpfordstrasse 23, 38106 Braunschweig, Germany
Contents
I Theory to PDEs 4
3 Sobolev spaces 10
3.1 Prerequisites for Sobolev spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.1.1 Continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.1.2 Weak derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.1.3 Norms on finite dimensional spaces . . . . . . . . . . . . . . . . . . . . . . 11
3.2 Definition of Sobolev spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.3 Sobolev inequalities & embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . 13
3.4 Trace operator values on boundary . . . . . . . . . . . . . . . . . . . . . . . . 15
3.5 Singularities of Sobolev functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.6 Properties of Sobolev spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1
5.4.1 Boundary conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
5.4.2 Heat equation homogeneous Dirichlet boundary conditions . . . . . . . . 24
5.4.3 Helmholtz equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5.4.4 Helmholtz equation with Neumann boundary conditions . . . . . . . . . . 26
5.4.5 Neumann problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
6 Abstract approaches 27
6.1 Discretisation approaches . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
8 Introduction 32
8.1 Motivation example . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
8.1.1 Minimisation problem without a constraint . . . . . . . . . . . . . . . . . 32
8.1.2 Minimisation problem with a constraint . . . . . . . . . . . . . . . . . . . 32
8.1.3 Minimisation with a constraint that do not work . . . . . . . . . . . . . . 33
8.2 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
9 Abstract formulations 34
9.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
9.2 Model problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
2
IV Appendix 46
3
Part I
Theory to PDEs
1 Introduction and repetition
1.1 Literature:
[Braess, 2007]: Chapter II, 1-3: Sobolev spaces and PDEs;
[Ern and Guermond, 2004]: Appendix A and B: overview of Banach and Hilbert spaces
and functional analysis
(explanation: it means that the space is also closed under those two operations) and following
conditions are satisfied for all u, v V and a, b R:
u+v =v+u
1u = u
a(bu) = (ab)u
Examples 1.2. The vector space has to be closed for addition and scalar multiplication.
For example the space Rd is closed for those two operations for d N.
Let a R2 , (e.g. a = (2, 1)), then the set {x R2 : a x = 0} is also closed under those
two operations for scalar product .
The set V = {x R2 : kxk2 1} is not a vector space, because it is not closed under
addition nor scalar multiplication, e.g. u = (1, 0) V because kuk2 = 1 but u + u =
(2, 0)
/ V because ku + uk = 2.
Definition 1.4 (Norm). For vector space V , norm is a map k k : V R satisfying for v V
and a R:
4
(iii) kvk = 0 imply v = 0 (zero vector)
kAxkp
kAkp = sup , (1.2) {?}
xRn ,x6=0 kxkp
which is a norm of linear operator, when we consider the matrix A as a linear operator on
Rn
Definition 1.7 (Scalar product). For vector space V , scalar product is a symmetric positive
definite bilinear map (, ) : V V R, i.e. for u, v, w V and a, b R
5
2 Introduction to Lebesgue spaces
2.1 Lebesgue spaces
Definition 2.1 (Lp norm). Let p 1 and f : R be a measurable function. Then
(
ess supx |f (x)| := inf{a : f (x) a for almost all x } for p =
kf kLp () := R 1 (2.1) eq:Lp_norm
( |f (x)|p dx) p otherwise
However, the function k k : R is not a norm on this space because kf k = 0 do not imply
that f = 0. It only implies that f = 0 almost everywhere. Therefore, we identify all functions
that equal almost everywhere; let
be a class (set) of those functions. In short words, the functions are equal almost everywhere if
their integrals are the same for different integration domains. For more details, see Appendix A.
Definition 2.3 (Lp space). We define Lp space as a set of function classes equal almost every-
where
Examples 2.5 (One-dimensional integrals). Decide whether the following functions f belong to
Lp (I) for
f (x) = x2 1 on I = (0, 1)
1
f (x) = x on I = (0, 1)
1
f (x) = x on I = (1, )
f (x) = x for R and I = (0, 1) or on I = (1, )
(
log x for = 1
Z
f (x) = x+1
+1 otherwise
R1 R
if = 1 then 0 |f | = and 1 |f | =
R1 R
if > 1 then 0 |f | < and 1 |f | =
R1 R
if < 1 then 0 |f | = and 1 |f | <
6
2.2 Some inequalities
hdef:convexi
Definition 2.6. A function f : R R is convex if f (tx + (1 t)y) tf (x) + (1 t)f (y) for
t [0, 1] and x, y R.
1
Lemma 2.7 (Young inequality). Let p, q (1, ) such that p + 1q = 1 and a, b are nonnegative
numbers. Then
ap aq
ab + (2.5) ?eq:Young_ineq?
p q
Proof. We use the fact that x 7 exp(x) is a convex function. Then
1 1 1 1 ap aq
ab = exp[log a + log b] = exp[ log ap + log bq ] exp[log ap ] + exp[log bq ] = + .
p q p q p q
where the inequality is from Def. 2.6 with x = log ap , y = log bq . and t = p1 .
Proof. The proof is a quite straightforward for p = 1 and q = , when we take essential
supremum. For other cases, we can take u = kffkp and v = kgk g
q
, which has a unit norm
kukp = kvkq = 1. Using the Young inequality, we calculate
Z Z Z
1 p 1
kuvk1 = |uv| |u| + |v|q = 1.
p q
kf gk1
Since kuvk1 = kf kp kgkq 1, we have the required inequality.
hlem:Minkowskii Lemma 2.9 (Minkowski inequality). Let f, g : R be measurable functions and 1 p .
Then
Proof. For the case when p = , the proof comes from the triangle inequality and the properties
of supremum. Otherwise, we can calculate
Z Z Z Z
p p p1 p1
kf + gkp = |f + g| |f + g| (|f | + |g|) = |f + g| |f | + |f + g|p1 |g|
Z p1 "Z 1 Z 1 #
p p p
|f + g|p |f |p + |g|p
= kf + gkp1
p (kf kp + kgkp )
p
where we have used Holder inequality (2.6) with coefficients p1 and p.
Lemma 2.10. Let be bounded and f Lp () for some p 1. Then f Lq () for any q
such that 1 q p.
Proof. We will proof it by showing that kf kqLq is finite. We can calculate it with the help of
p p
Holder inequality with constants pq , q (it satisfy the assumption of Holder space pq q
p + p = 1)
Z Z
q q q
kf kLq = |f | = 1|f |q k1k pq
p kf k p ,
L
L
7
Remark 2.11. The previous lemma fails when the domain is unbounded, e.g. = R. Then
the norm also controls the singularities in infinity. However, some interpolation between Lp
spaces can be established. See following lemma.
1 (1 )
= + (2.8) {?}
r s t
and f Lp Lq . Then also f Lr and it holds
T
fi f in V,
Examples 2.15.
spaces R, N are complete with norm corresponding to the norm given by the absolute value,
i.e. k k := | |.
space Q of rational numbers is not complete space; the proof is by counter example. The
number is irrational. Let set the sequence {xi }
i=1 using elements being the partial decimal
expressions of , i.e.
and so on. The the sequence converges to and all xi are rational.
Definition 2.16 (Closure). Let V be a vector space with norm k k. Then we denote by V or
cl V the closure of the set, i.e. the set consisting of all elements of V and also all limiting points
(the points that arise as a limits of all convergent subsequences, convergent with respect to k k).
kk
If we can consider more norms, we write V
Examples 2.17.
(0, 1) = [0, 1]
||
R=Q
8
the space of continuous functions C(I) on closed interval I is not complete if we consider
Lp norm with 1 p < . Consider e.g. a function on [0, 1]
(
1 nx for x = [0, n1 ]
fn (x) = ,
0 otherwise
which converges to
(
1 for x = 0
f (x) = .
0 otherwise
9
3 Sobolev spaces
Sobolev spaces are fundamental for the analysis of PDEs because naturally the norm of solution
kukLp and the norm of its gradient kukLp naturally arises. We will establish the relations
between different norms. This is important not only for theoretical analysis of PDEs but also
for numerics, where we can have norms derived e.g. from Lp -norms or Sobolev norms. On finite
dimensional spaces occurring in numerics, all norms are equivalent but there can be a huge
constant of equivalence, which blow up for infinite dimensional spaces spaces of functions.
The Sobolev spaces are also very important for the convergence of FEM method; particularly,
the smoother the solution (measuring in Sobolev norms), the higher convergence rate.
Lemma 3.4. This space is a Banach space (complete vector space with norm), where the norm
is defined as
|f (x) f (y)|
kf kC 0, = sup
x,y,x6=y |x y|
Definition 3.5 (Infinitely differentiable functions). Let C () denote the space of infinitely
differentiable functions f : R. Then C0 () is its subset containing functions with compact
support, i.e.
Remark 3.7. If a function has a derivative, then classical derivative coincide with a weak
derivative concept; the formula (3.4) is a Greens formula for integration by parts.
10
Example 3.8 (Weak derivative). Show that a derivative of f (x) := |x| on I = [1, 1] is g such
that g(x) = 1 for x 0 and g(x) = 1 for x < 0:
Z 1 Z 0 Z 1
0 0
|x| (x)dx = x (x)dx x0 (x)dx
1 1 0
Z 0 Z 1 Z 1
= 1(x)dx + 1(x)dx = g(x)(x)dx.
1 0 1
Notice that the formula hold independently on the value of g at x = 0! The function g is in
L2 (I).
Example 3.9 (Weak derivative that is a measure). Show that a derivative of g = f 0 on [1, 1]
from previous example has a derivative g 0 = 20 , which equals to Dirac measure concentrated at
zero:
Z 1 Z 0 Z 1
0 0
g (x)dx = 1 (x)dx + 10 (x)dx
1 1 0
= (x)|01 (x)|01 = [(0) (1)] + [(1) (0)] = 2(0),
because (1) = (1) = 0. Now, the derivative of g, is some object g 0 that has to satisfy
Z Z
0
g (x)(x) dx = 2(0) = 20 (x)(x) dx, (3.5) eq:dirac_int
I I
which is exactly the double of Dirac delta measure g 0 = 20 . It can be imagined that it is
something like
(
for x = 0
0 (x) = ,
0 otherwise
but it has only a good meaning when it is evaluated under integral with some test function as in
(3.5).
Lemma 3.12 (Norms on finite dimensional spaces). Let there are two norms k k and ||| ||| on
finite dimensional space V . Then those two norms are equivalent.
11
Proof. First we prove kvk1 nkvk2
n n
X X
kvk1 = |vi | = 1 |vi | = 1 v k1k2 kvk2 = nkvk2 (3.8) {?}
i=1 i=1
where 1 = (1, . . . , 1) Rn . Then we prove kvk2 nkvk
!1 !1 1
2 2
X X 2
2 2 2
kvk2 = |vi | max |vi | = n max |vi | = n max |vi | = nkvk . (3.9) {?}
i i i
i=1 i=1
Remark 3.15. We will show soon that above definition is really a norm on a suitable space of
functions.
Remark 3.16. In previous semester, we have introduced the Sobolev spaces in the following
sense
Sobolev_space2i
W 1,p = {v Lp () | v Lp (; Rd )} (3.13a) {?}
W01,p = {v W 1,p () | v(x) = 0 for x } (3.13b) {?}
But what is a meaning of function values on boundary ? It will be explained in section about
trace operator.
12
Definition 3.17 (Sobolev spaces). Let be a subset of Rd with piece-wise Lipschitz boundary.
Then we define for p [1, )
:Sobolev_spacei
kkW 1,p kkW 1,p
W 1,p = C , W01,p = C0 . (3.14a) {?}
13
Remark 3.22 (Embeddings inequality). The inequalities in previous lemma do not only say
that if f W 1,p () than f is in some other space such as Lp but it also says following: if we
have a Cauchy sequence {fi }iN in W 1,p (), than this sequence is Cauchy also in other spaces,
e.g.
kfm fn kLq () Ckfm fn kW 1,p () . (3.22) {?}
for some m, n > N N. Since those spaces are complete, we have
kfm f kLq () Ckfm f kW 1,p () 0 for m . (3.23) {?}
with a limit function f ; the convergence in one space means a convergence in another space,
which is generally not true in infinite dimensional spaces.
re-fried-ineqi?
Lemma 3.23 (Poincare-Friedrichs inequality). Assume that Rd is contained in d-dimensional
cube of side s. Then
kukL2 skukL2 (3.24) eq:poincare-fri
for all u H01 ()
Proof. See [Braess, 2007, Chapter II, 1.5].
This proof holds only for functions with zero values on boundary, or alternatively with zero
mean, see next Remark.
Remark 3.24 (Poincare-Wirtinger inequality). The variant of Poincare inequality is following
inequality
ku u kL2 CkukL2 (3.25) eq:Poincare-Wir
1
R
for u = u denoting the mean over .
Lemma 3.25 (Equivalent norms on H01 ). On H01 () with bounded in s-sized cube, the mapping
u 7 kukL2 (;Rd ) is a norm and it is equivalent norm to k kH01 .
Proof. We will use Poincare-Friedrichs inequality to get estimates
kuk2H 1 = kuk2L + kuk2L s2 kuk2L + kuk2L = (s2 + 1)kuk2L (s2 + 1)kuk2H 1 ,
which reveals
1
kukH 1 kukH 1 kukH 1 .
(s2 + 1)1/2
This relation is exactly required for the equivalent norms. So, in order to show that u 7
kukL2 (;Rd ) is a norm, we have to show homogeneity, triangle inequality, and separation of
points. The homogeneity comes from linearity of gradient and linearity of integration, i.e.
Z 1/2 Z 1/2
2
kaukL2 = kaukL2 = kau(x)k2 dx = a2 ku(x)k22 dx
Z 1/2
2
=a ku(x)k2 dx = akukL2
Triangle inequality comes from Holder inequality. For separation of points, we have to show that
kuk2L = 0 imply u = 0 a.e. But if kuk2L = 0, then from the Poincare-Friedrichs inequality,
we have also k0kL2 = 0, which is a norm and it implies u = 0 a.e.
14
3.4 Trace operator values on boundary
Assume that our computational domain Rd has a boundary . For a continuous functions
u C() we can easily consider its values on boundary. The problem arises for Sobolev functions
u W 1,p (), which are generally not continuous. Since has d-dimensional Lebesgue measure
zero, it is not clear what the function value on boundary means. This is solved out with trace
operator.
Lemma 3.27 (Trace theorem). Let be a bounded domain with sufficiently nice boundary
(e.g. C 1 boundary or piece-wise smooth with cone property). Then there exists a bounded linear
mapping T : W 1,p () Lp () for 1 p < such that
\
T u = u| for all u W 1,p () C() (3.26) {?}
kT ukL2 () CkukW 1,p for all u W 1,p () (3.27) eq:ineq_trace
Definition 3.28. The operator T from previous definition is called trace operator.
Remark 3.29 (Trace as limit of continuous functions). Using the denseness of continuous
functions in Sobolev spaces, for each f H 1 (), there exists sequence of continuous functions
{fn }nN such that fn f in H 1 , i.e. kf fn kH 1 0. Each continuous function has a good
meaning of values on boundary. Therefore, the trace at the boundary can be understood as a
limit of continuous functions at boundary
T f = u| = lim fn | .
n
However, it is necessary to show that the limit is independent of the choice of a sequence; there
are more sequences converging to f .
For d = 1,
(
2
1 2+1 |1 < for > 21
Z
kf k2L2 = 2 |x|2 dx = 2+1 x 0
0 for = 12
Use polar coordinates for d = 2, (r, t) = [r cos t, r sin t] = x, which is a map from (0, 1)
(0, 2) with determinant of Jacobian det = r
Z 1 Z 2 Z 1 (
< for > 1.
Z
2 2 2 2+1
kf kL2 = kxk2 dx = r r dt dr = 2 r dr .
0 0 0 = otherwise
Use polar coordinates for d = 3, (r, t, s) = [r cos t sin s, r sin t sin s, r cos s] = x, which is a
map from (0, 1) (0, 2) (0, ) with determinant of Jacobian det = r2 cos s
(
< for > 23 .
Z Z 1 Z 2 Z Z 1
kf k2L2 = kxk2
2 dx = r 2 2
r sin s dt dr ds = 4 r 2+2
dr .
0 0 0 0 = otherwise
15
for general dimension d, it leads to
(
for > d2 .
Z 1Z 1
<
Z Z
kf k2L2 = kxk2
2 dx = r 2
dS dr = C 2 d1
r r dr .
0 Br 0 = otherwise
where dS denote integration over a ball of radius r, i.e. Br = {x Rd : kxk2 = r}. The
constant C corresponds to the surface area of unit ball.
Example 3.31 (Sobolev
Pd functions). Let f : R be a function on Rd defined again as
f (x) = kxk2 = ( i=1 xi )/2 . The function is measurable so we will show that it belongs to
2
Sobolev space H 1 if the square of Sobolev norm kf k2H 1 = kf k2L2 + kf k2L2 is finite; the crucial
term is kf k2L2 because this term has stronger singularity. The gradient and Euclidean norm is
expressed as
d
X /21
f (x) = /2 x2i 2x, kf (x)k2 = kxk2
2 kxk2 = kxk1
2 .
i=1
Proof that the space is complete for addition of two vectors. Let f, g H 1 we will show that
kf + gkH 1 < . It can be shown using Mikowski inequality
Proof that norm is really a norm. I will show only that H 1 -norm, to show that (, )H 1 is an
inner product it follows similarly. Hence
1/2
kukH 1 () = kuk2L2 () + kuk2L2 (;Rd ) ,
16
triangle inequality: ku + vkH 1 () kukH 1 () + kvkH 1 () is based on triangle inequality
(Minkowski inequality) of L2 () norm, i.e.
1/2
ku + vkH 1 () = ku + vk2L2 () + ku + vk2L2 (;Rd )
1/2
(kukL2 () + kvkL2 () )2 + (kukL2 (;Rd ) + kvkL2 (;Rd ) )2 .
separates points: kukH 1 () = 0 implies u(x) = 0; it follows the same arguments as for L2
space.
17
4 Hilbert and Banach spaces
Definition 4.1 (Hilbert space). Let V be a vector space over R with scalar product. Then V is
Hilbert space if it is complete space.
Lemma 4.2 (Cauchy-Schwartz p inequality). The following hold (u, v) kukkvk for norm in-
duced by scalar product kuk = (u, u).
Proof. For v = 0, the proof is trivial, so we can assume v 6= 0. Then
Definition 4.7 (Banach space). Let V be a vector space over R with norm. Then V is Hilbert
space if it is complete space.
Examples 4.8.
Lp () for p 1 with norm (2.1) is a Banach space.
18
4.1 Linear operators, linear functionals, and dual space
Definition 4.9. Let V, W be two vector spaces over R. Then a map L : V W is linear if
Definition 4.10 (Bounded linear operators, norm). Let V, W be two Banach spaces. Then
linear mapping L : V W is bounded linear operator if there exists C > 0 such that
For simplicity, we also write Lu instead of L(u). The norm of the operator is defined as
Proof. Assume that an operator L is bounded. Then the continuity follows from inequality
Conversely, let an operator is continuous. Then, from definition of continuity, for each > 0
there exists such that following hold for all khk
So from linearity and homogeneity of norm, we have the estimate for arbitrary v V
kvkV v kvkV v
kL(v)k = k L( )k = kL( )k kvkV . (4.11) {?}
kvkV kvkV
Definition 4.12 (Linear functionals, dual space). For W = R from previous definition, we call
the map L continuous linear functional. Then the dual space V of V is a space of all bounded
linear functionals, i.e.
V = {f : V R; kf k < }.
Definition 4.13 (Separable Hilbert space). Hilbert space H is separable if there exists countable
basis.
19
5 Existence of solutions to elliptic PDEs
In this section, we will focus on existence results of PDEs, which will be treated within the
following abstract weak formulation.
Then the following problem is called abstract weak formulation: Find u V satisfying
20
Hence, u is a unique minimizer.
Conversely, if J has a minimum at u V , then the derivative of must vanish for t = 0 for
arbitrary v V , i.e.
0 (0) = a(u, v) l(v) = 0.
Since v V is arbitrary, we have (5.8).
Proof. First, we have to show that the functional J is bounded from below
1 kf k2 kf k2
J(v) kvk2V kf k kvkV = (kvkV kf kV )2 .
2 2 2 2
Then, we denote M = inf vV J(v), and let {vi }iN be an infimizing sequence (limi J(vi ) =
M ). Then we calculate
kvn vm k2V a(vn vm , vn vm ),
= 2a(vn , vn ) + 2a(vm , vm ) a(vn + vm , vn + vm ),
vn + vm vn + vm vn + vm
= 2a(vn , vn ) 4F (vn ) + 2a(vm , vm ) 4F (vm ) 4a( , ) + 8F ( ),
2 2 2
vn + vm
= 4J(vn ) + 4J(vm ) 8J( ),
2
4J(vn ) + 4J(vm ) 8M ;
the first inequality comes from the ellipticity assumption, the last inequality comes from the
fact that V is a convex, which implies vn +v
2
m
V and thus M J( vn +v
2
m
). Since {vi }iN is an
infimizing sequence, we have
kvn vm k2V 0 for m, n ,
21
which means that {vi }iN is a Cauchy sequence in V , and its limit exists
u = lim ui
i
since the space is complete. Since V is closed then also u V . The continuity of J implies that
J(u) = lim J(vi ) = inf J(v).
i vV
We now show the uniqueness of the solution. Assume that there are two solutions u1 , u2 V .
Then u1 , u2 , u1 , u2 , . . . is also an infimizing sequence, which has to be Cauchy sequence implying
u1 = u2 .
A L (, Rdd ) tensor valued material coefficients that are uniformly positive and
bounded, i.e. there exists 0 < < < such that
kuk22 A(x)u u, A(x)u u kuk2 kvk2
22
5.4 Derivation of convection-diffusion equation
we want to derive a convection-diffusion equation:
where
We derive it from continuity equation. We start from total concentration at some specimen
Rd , i.e. Z
Q= u(x, t) dx.
Then we derive the heat equation from postulate: change of total concentration in time
equals to a concentration flux through boundary and a source. It can be written as
Z Z
Q = n q(x, t) + f (x, t) dx (5.18) {?}
where q : R3 is a concentration flux density. Then we use Fourier law q(x) = D(x)u(x, t)+
v(x, t)u(x, t), which is a material
R law obtained Rfrom e.g. from observation or experiments, and
divergence (Gauss) theorem: n f (x) ds = f (x) dx to derive
Z Z
u(x, t) D(x)u(x, t) + [v(x, t)u(x, t)]dx = f (x, t)dx. (5.19) {?}
Since Q is defined on domain constant in time, we moved derivative into the integral. Then we
use following lemma, from which we can deduce the equation (5.17).
23
5.4.2 Heat equation homogeneous Dirichlet boundary conditions
Strong formulation: find u C 2 () C 0 () such that
T
Lemma 5.9. The solution continuously depends on load. It means that for two loads f, f L2
and corresponding solutions
a(u, v) = F (v) := (f, v)L2 , (5.21) {?}
a(u, v) = F (v) := (f, v)L2 , (5.22) {?}
we have
1
ku ukH kf fkL2 (5.23) {?}
Proof.
ku uk2H a(u u, u u) = a(u, u u) a(u, u u) = (f, u u)L2 (f, u u)L2
= (f f, u u)L2 kf fkL2 ku ukL2 kf fkL2 ku ukH
24
5.4.3 Helmholtz equation
Strong formulation: find u C 2 () C 0 () such that
T
Weak form:
V = H01 ()
kuk = kukH 1
Z Z
a(u, v) = u(x) v(x) dx + b(x)u(x)v(x) dx
Z
Lemma 5.10. There exists a unique solution to above heat equation for b L () and b(x)
b0 > 0.
Proof. We will proof the existence of unique solution with Lax-Milgram lemma; we have to satisfy
its assumptions. Linearity of bilinear form a and linearity of F comes from the properties of
integrals. Ellipticity follows from
Z Z
a(u, u) = u(x) u(x) dx + b(x)u(x)u(x) dx
Z Z
Boundedness of F comes from Holder inequality (the same like previous proof for heat
equation)
Z
F (v) = f (x) v(x) dx = kf vkL1 kf kL2 kvkL2 kf kL2 kvkH 1 .
25
5.4.4 Helmholtz equation with Neumann boundary conditions
Strong formulation: find u C 2 () C 0 () such that
T
26
Part II
Discretisation of variational problems
6 Abstract approaches
6.1 Discretisation approaches
Assume the abstract variational formulation from Definition 5.1. Since the corresponding
trial/test space V is infinite dimensional, we have to approximate this problem.
Here, we will mainly talk about conforming methods. It is based on approximation of the
infinite dimensional space V with a discrete one Vh that is a subspace
Vh V ; (6.1) {?}
Remark 6.3 (Ritz-Galerkin method). Since the Ritz method corresponds to Galerkin method
when the bilinear form is symmetric, we often talk about Ritz-Galerkin method.
Although, we focus on Ritz-Galerkin method, there are also other approaches, e.g. when the
trial and test space are different.
Remark 6.5. Using a priori estimates, based on ellipticity and continuity (boundedness) of
linear functional, we can show that the approximate solutions of Galerkin approximations are
bounded
kF k
kuh k . (6.5) {?}
See Remark 5.6 for a case in a continuous setting.
C
ku uh kV inf ku vh kV , (6.6) {?}
vh Vh
with a constant ellipticity constant and continutity constant C of bilinear form.
27
Proof. The solution u V and approximate solution uh Vh satisfy
a(u, v) = F (v) v V,
a(uh , v) = F (v) v Vh .
a(u uh , v) = 0 v Vh ,
The inequality
C
ku uh kV ku vh kV (6.7) {?}
holds for all vh Vh and therefore it has to hold also with infimum.
Remark 6.7 (Ceas lemma in symmetric case energetic-like estimate). When the bilinear
form a is symmetric, it is also a scalar product, which induces an energetic norm
p
kvkA = a(v, v).
ku uh k2A = a(u uh , u uh ) ku uh kA ku vh kA
where we have used Galerkin orthogonality and Cauchy-Schwartz inequality. Therefore, in en-
ergetic norm, the estimate is without a constant, i.e.
ku uh kA inf ku vh kA ,
vh Vh
Remark 6.8. Ceas lemma is a base for a proof of convergence of approximate solutions to the
continuous one. For details, see a corresponding section below.
28
7 Finite element method in general setting
Here, we will focus on some elements of a conforming method, i.e. when the approximation
space Vh is contained in the trial space V , written as Vh V .
For additional informationf, I refer to lecture; see also attached lecture notes written by Dr.
Noemi Friedman or see a literature for a detailed overview.
for each T T , the set T is closed and interior is nonempty and connected
the intersection of two elements is either empty set or a common vertex, edge, or facet.
For conforming method
Now we have a look on the fact why we need continuous basis function (continuous over
elements) for approximation of Sobolev functions.
Lemma 7.1. Let Vh be composed of polynomials over element, which are continuous on facets
(edges in 2D or faces in 3D), i.e. Vh C 0 (). Then
Vh H 1 () (7.1) {?}
Proof. Since any function v Vh is continuous, it is also in L2 . In order to show that this
function is in H 1 , we have to find its weak derivative and show that it is integrable. Notice that
this function generally does not have normal derivative on facets. I.e. we have to find wi (as a
weak partial derivative w.r.t. xi ) such that
Z Z
(x)
wi (x)(x)dx = v(x) dx. (7.2) {?}
xi
where the sum over all boundary integrals cancels because of continuity over elements and zero
values of at . The weak derivative is just a function with derivatives defined in interior
points of the elements.
Remark 7.2. For conforming approximation of H 1 space, the approximation space has to con-
sists of continuous functions. Similarly approximation of H 2 has to consists of functions with
continuous derivatives over elements. On other side L2 can be approximated with discontinous
Galerkin space (no continuity over elements is required).
29
7.2 Simplicial finite elements
This FEM space is based on complete polynomials:
X
Pk = p : Rd R | p(x) = c x1 i . . . xd d (7.6) {?}
P
; i i <k
where h is a characteristic size of some regular mesh (some non-degenerate mesh see literature
for details, e.g. [Gockenbach, 2006]). By hT , we denote the diameter of the element
30
The the characteristic size of the mesh can be defined as the maximum of hT over all elements,
i.e.
Theorem 7.3. Let an elliptic PDEs, defined for a trial space H 1 , has a solution from H s for
s > 1. Then the following estimates with a suitable constants can be derived
when we use a polynomial approximation of order at least s. The value s is called an order of
convergence; for s = 1 it is linear convergence, for s = 2 quadratic convergence, etc. It means
that for quadratic convergence we need at least quadratic basis functions.
This integration (typically over reference elements) with integration points xi and corresponding
weights wi introduces another numerical error (first error is introduced by Galerkin approxima-
tion).
Therefore, the bilinear forms and linear functional are approximated by
where u, v are trial and test functions from approximation (FEM) spaces.
The error in Galerkin approximation is controlled by Ceas lemma. In the case of numerical
integration, we have its generalisation: first Strang lemma
" #
a(vh , wh ) ah (vh , wh ) F (wh ) Fh (wh )
ku uh k inf ku vh k + sup + sup (7.20) {?}
vh Vh wh Vh kwh k wh Vh kwh k
Observe that for an exact integration, the first Strangs lemma reduces to the Ceas lemma. The
additional terms are called inconsistency error.
There is also a second Strangs lemma, which is useful for non-conforming methods (the
approximation space is not contained in the trial space Vh * V ) such as discontinuous Galerkin
method (when the finite element basis functions are not continuous over elements they are not
in a Sobolev space).
31
Part III
Saddle point problems
This topic is covered by the following book [Braess, 2007], particularly by chapter III, paragraph
4 Saddle point problems and paragraph 5 Mixed methods for the Poisson equation.
8 Introduction
8.1 Motivation example
vation-examplei
Let assume the minimisation of the following function J : Rd R defined as
1
J(v) = (v12 + v22 ) v1 v2 2v1 v2 = Av v b v (8.1) {?}
2
for
2 1
A= , b = (2, 1). (8.2) {?}
1 2
or in a weak form
Bu w = g w w R. (8.6) {?}
over a space
Vg = {v R2 | Bv = g}.
However, this is not a vector space!
The solution can be searched using Lagrange multipliers
32
Similarly this min-max problem is equivalent to the linear system:
A BT
u b
= (8.10) {?}
B 0 g
where the first constraint is the same as in initial constraint problem and second constraint is
just its multiple. The minimisation problem
is well-defined because the minimisation space Vg remains the same. However, the min-max
problem with Lagrange multipliers
do not have unique Lagrange multipliers because any combination = 1 + 22 with from
previous example is a possible solution. This is reflected in corresponding linear system
A B T 2B T
x f
B 0 0 1 = g , (8.14) {?}
2B 0 0 2 2g
where the matrix is singular (see that the last two equations are linearly dependent).
Alternatively, the corresponding constraint
8.2 Applications
optimisation
Stokes problem
dual problems
33
9 Abstract formulations
9.1 Introduction
In the motivation section 8.1, we have seen that some constraint minimisation problems leads
to linear system (saddle-point problems) that are not-defined. The proper treatment will be
developed here in abstract setting that also covers the PDEs with constraint.
where u is called a minimiser. However, it is usually not easy to find a basis for V to allow
direct numerical approximation; the space Vg is not a vector space. Therefore, we can write it
as
min J(v) + Vg (v) = J(u) + Vg (u) = J(u) (9.7) {?}
vX
34
hprob:saddlei Lemma 9.3 (Saddle-point problem). From the minmax problem with Lagrangian, stationary
equation (optimality condition) is expressed as: Find (u, ) such that
:saddle_systemi
a(u, v) + b(v, ) = f (v) v X (9.10a) {?}
b(u, ) = g() M (9.10b) {?}
The gradient of thus defines the (Gateaux) directional derivative of Lagrangian L at point
(u, ) in the direction of (v, ).
We can calculate split the Lagrangian
r2
L(u + rv, + s) = L(u, ) + r[a(u, v) + b(v, ) f (v)] + a(v, v) + s[b(u, ) g()],
2
which is used to calculate partial derivatives; one w.r.t variable r is then
r2
L(u + rv, + s) L(u, + s) r[a(u, v) + b(v, ) f (v)] + 2 a(v, v)
r (r = 0, s) = lim = lim
r0 r r0 r
= a(u, v) + b(v, ) f (v).
The derivatives have to be equal to zero for all possible directions (v, ), which gives the required
saddle-point system (9.10).
Remark 9.4. From the min-max problem, we can see that we have saddle-point at (u, )
Theorem 9.5 (Existence theorem). Assume that we have a saddle-point problem 11.2 and
spaces (9.4) with following assumption:
b(v, )
inf sup (9.12) {?}
M vX kvkX kkM
b(v, )
sup kkM M (9.13) {?}
vX kvkX
35
10 Discrete (matrix-vector) formulations
Spaces:
holding for all vectors u, v X and w M , where matrices A Rnn and B Rmn and
vectors f X and g M .
Saddle point problem: find (u, ) X M such that
A BT
u f
= (10.4) {?}
B 0 g
or
The following two conditions are crucial (along with continuity of the forms) for the existence
and uniqueness of the solution. Those conditions will be examined in general setting in next
section.
Remark 10.1 (Ellipticity condition). What does it mean ellipticity condition?
It means that the symmetric part of the matrix A is positive definite, its eigenvalues are positive,
recall spectral decomposition.
Remark 10.2 (Inf-sup condition). What does it mean inf-sup condition?
b(u, )
sup kkV (10.8) {?}
uX kukX
b(u, ) Bu
sup = sup kk (10.9) {?}
uX kukX uX kukX
the condition is not satisfied if Bu = 0, which means that there is a vector orthogonal to
range B. Therefore B has to be a full rank matrix.
Remark 10.3. The spaces from (9.4) can be characterised as
Proof of existence for this special case. First, we find a vector x0 Vg , which satisfy a con-
straint, e.g.
36
the inverse exists because BB T is positive definite
A BT
x f Ax0
= , (10.16) {?}
B 0 0
where now x V . Then, if we multiply the first equation with v V and use B T v = Bv = 0,
we get a problem for x V only (without )
Proof of solution uniqueness. It is based on standard arguments. Assume that we have two
solutions (x1 , 1 ) and (x2 , 2 )
A BT A BT
x1 f x2 f
= = (10.21) {?}
B 0 1 g B 0 2 g
Then we show that both solutions have to coincide. After substraction of both equations we
have
A BT
x1 x2 0
= (10.22) {?}
B 0 1 2 0
From second equation, we have x1 x2 V . From first equation, after multiplication with
(x1 x2 ), we obtain
37
10.1.2 Conjugate gradients on a subspace V
In previous section, we have seen that it is possible to find as solution as x = x0 + x where x0
is obtained from (10.13) and x can be found from a minimisation problem
1
x = arg min Ay y (f Ax0 ) y. (10.24) eq:min_over_V
yV 2
We can use the orthogonal projection on a space V . First, we can show that the following
operator/matrix P : X X
is projection, indeed
P B T = B T (BB T )1 BB T = B T
P w = B T (BB T )1 Bw = 0
Ax v = (f Ax0 ) v v V.
Ax Qv = (f Ax0 ) Qv v V,
Ax Qv = (f Ax0 ) Qv v X,
QAx v = Q(f Ax0 ) v v X,
QAx = Q(f Ax0 ).
This can be solved by conjugate gradients for initial approximation x(0) V . Indeed the Krylov
subspace for residuum r(0) V
A BT
B C
38
with symmetric and positive definite matrix A Rnn , symmetric and positive semidefinite
matrix C Rmm , and a full rank matrix B Rmn . Then the system is similar to
A 0
0 (C + BA1 B T )
as
A BT In A1 B T
In 0 A 0
=
B C BA1 Im 0 (C + BA1 B T ) 0 Im
1 T
A 0 In A B
= 1 T
B (C + BA B ) 0 Im
.
We note that the matrices V, W are similar if there exists invertible matrix P such that
V = P 1 W P.
Similar matrices U, V have the same eigenvalues and the eigenvector u, w are connect as v = P w
because
V v = v
P 1 V v = P 1 v
P 1 V P w = P 1 P w
W w = w.
has n positive eigenvalues because A is positive definite and m negative eigenvalues because the
matrix (C + BA1 B T ) is negative definite, i.e. for all Rm
(C + BA1 B T ), Rm = C, Rm + BA1 B T , Rm
(10.27) {?}
= C, Rm + A1 B T , B T Rn > 0
(10.28) {?}
| {z } | {z }
0 >0
39
11 Mixed-formulations and its numerical approximation
11.1 Applications to PDEs: Poisson equation with general boundary condi-
tions
Set of differential equations on = (0, 1) (0, 1) R2
= f, = Ae, e = u (11.1) {?}
with Dirichlet boundary conditions (for simplicity considered homogeneous)
u(x) = uD (x) for x D , (11.2) {?}
n (x) = N (x) for x N (11.3) {?}
T S
where N D = and N D = .
Weak formulation is based on a bilinear form a : V V R and linear functional F : V V
for some Hilbert space V , stating find u V such that
a(u, v) = F (v) v V (11.4) {?}
Primal formulation:
In order to satisfy boundary conditions, we have to introduce two spaces
VD = {v H 1 () : u|D = uD }, (11.5) {?}
1
V = {v H () : u|D = 0}. (11.6) {?}
The boundary conditions have to be some that there exists at least one function satisfying this
boundary condition u0 Vd . Then we search a solution u = u0 + u, where u V and satisfy
Z Z Z
a(u, v) = Au v = F (v) = fv + N v a(u0 , v) (11.7) {?}
N
Primal-mixed formulation:
Strong form:
A1 u = 0, (11.9) {?}
div = f (11.10) {?}
The primal-mixed formulation is obtained by a multiplication with test function and by a
modification of a second equation with the help of Greens theorem.
Spaces and forms
X = L2 (; Rd ), M = {v H 1 () : u|D = 0}. (11.11) {?}
a(, ) = (A1 , )L2 b( , u) = ( , u)L2 (11.12) {?}
Similarly to primal problem, we have to decompose u = u0 + u with
u0 MD = {v H 1 () : u|D = uD }.
The weak formulation than states: Find in (, u) X M
(A1 , )L2 ( , u)L2 = ( , u0 )L2 X (11.13) {?}
(, v)L2 = (f, v)L2 (N , v)L2 (N ) v M (11.14) {?}
40
Dual-mixed formulation:
Strong form:
A1 u = 0, (11.15) {?}
div = f (11.16) {?}
The essential boundary conditions have to be incorporated into the finite dimensional space.
Dirichlet boundary conditions are essential in primal-mixed formulation, while Neumann bound-
ary conditions are essential in dual-mixed formulation.
In dual-mixed formulation, it seems that u L2 () only. However, from the first equation
tested with C (; Rd ) such that n = 0, we have
41
11.1.2 Ellipticity and inf-sup condition in continuous setting
Primal-mixed formulation: We can derive the inf-sup inequality by choosing appropriate
element from the space X, i.e.
b( , ) ( , ) (, )
sup = sup = kkL2 CkkH 1 (11.24) {?}
X k kX X k kL2 kkL2
The discrete finite element spaces Xh and Mh have to satisfy inf-sup condition. However,
sometimes it is difficult to prove.
where Th is regular triangulation with triangles denoted by T . The inf-sup condition is satisfied
as in the continuous setting because Mh Xh .
42
11.3.2 Dual-mixed formulation
Xh = RTk X = Hdiv ()
= {v L2 (; Rd ) : v|T = P + px for all p, Pi Pk (T ), v n is continous on T \ and T Th }
Mh = {v L2 () : v|T Pk (T ) and T Th } M = L2 ()
Showing that RTk space is really a subspace of Hdiv : We derive for P RTk
Z X Z X Z X Z Z
P = P = P + P n = P
T Th T T Th T T Th T
| {z }
=0 because of continuity
where Ejk = {tPj + (t 1)Pk | t [0, 1]} is edge opposite to Pi , point ejk = 12 (Pj + Pk ) is the
middle point of Ejk , and njk is the outer normal of triangle T that is orthogonal to (Pj Pk ),
i.e. (Pj Pk ) njk = 0.
43
which has a stationary point given by condition
Dual-mixed formulation:
1 1
inf (A , ) for Zf = {v L2 (; Rd ) | v f = 0}
Zf 2
1 1
inf sup (A , ) ( f )
L L2 () |2
2
{z }
=L(,)
From the first equation, after Green theorem, we can deduce that
= u.
1
(u, p) = J(u, u + p) = (A(u + p), u + p) (f, u). (12.8) {?}
2
Dual problem
44
Using substitution (u + p) = q, we derive
(12.14) {?}
u q 2
q 2 2
Therefore
(
for p + f = 0
(0, p ) = 1 1 (12.15) {?}
2 (A p , p ) otherwise
Duality: From
(0, p ) p , 0 (u, 0)
(12.17) {?}
(0, p ) (u, 0) (12.18) {?}
sup (0, p ) inf (u, 0) (12.19) {?}
p u
because left- and right-hand side holds for any p and u. In many cases, it is possible to show
even equality. Indeed, we can continue with the duality to obtain
inf
(u , 0) sup (0, p ) inf (u, 0). (12.20) {?}
u p u
In our case of Poisson equation and in many others, we have (u , 0) = (u, 0), which gives
us equality in the primal-dual relation
sup (0, ph ) sup (0, p ) = inf (u, 0) inf (uh , 0) = (uh , 0). (12.23) {?}
ph Xh p X uH01 uh Yh
45
Part IV
Appendix
This part is here only for interested students; it will not be taught and thus examined.
Definition A.1 (Open and closed interval). Let a, b R such that a < b then we define an open
and a closed interval as
Definition A.2 (Outer and inner Lebesgue measure on R). Let A R be an arbitrary set, then
X
[
A := inf{ (bi ai ) : (ai , bi ) A} (A.2) {?}
i=1 i=1
is called outer Lebesgue measure of a set A. Let I be bounded interval such that A I, then
A := I (I A) (A.3) {?}
Remark A.3. The outer Lebesgue measure starts from measuring the sizes of intervals (both
open and close gives the same value). Then outer measure is defined as in infimum over the
sizes of open intervals that cover the set A. The inner measure can be defined as a supremum
of closed disjoint intervals that are contained in A, i.e.
X
[ \
A := sup{ (bi ai ) : [ai , bi ] A and [ai , bi ] [aj , bj ] = for i 6= j}. (A.4) {?}
i=1 i=1
For some sets the outer and inner measure coincide, we call this measurable sets, see next
definition. However, there are sets where outer and inner measure differs. This is the reason
why proper theory has to be developed.
A = A. (A.5) {?}
Definition A.5 (Lebesgue measure on R). We say that := from previous definition is a
Lebesgue measure on measurable sets.
46
Definition A.6 (Lebesgue measure on Rd with dimension d from N). Let I = (a1 , b1 )(a2 , b2 )
(ad , bd ) be a parallelepiped (square in two-dimensional setting d = 2) and vol I = (a1 , b1 )
(a2 , b2 ) . . . (ad , bd ) its volume. Then the d-dimensional Lebesgue measure is defined as
X [
A := A = { vol Ik : Ik A, Ik is a parallelepiped for each k N}. (A.6) {?}
kN kN
2 {(1, 2) (2, 4)} = 2, 2 {(1, 2) [2, 4]} = 2, 2 {(1, 2)} = 0, 3 {(1, 2) [2, 4]} = 0,
1 {(1, 2) [2, 4]} =
There exists non-measurable sets, see e.g. Hausdorff paradox and BanachTarski paradox.
Remark A.8 (Finite, countable, and uncountable sets). There are different notions how to
measure the size of the set. One option is the length of intervals, area of surfaces, or volume
of three-dimensional objects. Another notion is cardinality, which measure somehow a number
of elements of the set. We say that two sets have the same cardinality if there exists bijection
between them (one-to-one map which is onto). For examples set A = {2, 4, 6} is finite, it has
three elements so its cardinality is #A = 3. For infinite sets, we say that countable sets have the
same cardinality as natural numbers N, e.g. Z, Q, N2 ; other infinite sets are uncountable sets,
e.g. R, R2 , C.
Example A.9 (Countable sets). Show that #N = #Z. We will find an isomorphism between
them, i.e. map f : N Z defined as
(
i
for i|2 = 0
f (i) = 2 i1 (A.7) {?}
2 otherwise
where i|2 denotes the remainder after division by 2 (sometimes called modulo operation, e.g.
0|2 = 2|2 = 4|2 = 0, 1|2 = 3|2 = 1).
Example A.10 (Uncountable sets). Show that #(0, 1) = #(0, 1) (0, 1). There is again an
isomorphism defined as follows. Let a (0, 1), then each such number can be expressed as
decimal number 0, a1 a2 a3 . . . , i.e. each number is defined with a sequence {ai }iN . Then
f (a) = (0.a1 a3 a5 . . . , 0.a2 a4 a6 . . . ) (A.8) {?}
and again each tuple (0.a1 a3 a5 . . . , 0.a2 a4 a6 . . . ) represents one point in square (0, 1) (0, 1).
Remark A.11 (Cantor set). Cantor set is an uncountable set of Lebesgue measure zero.
47
A.2 Abstract measures
In previous section, we have introduced Lebesgue measure, which measures the sizes of sets
(their lengths, areas, or volumes). There are some other types of measuring sets, which do not
exactly corresponds to our understanding of sizes, areas, or volumes, but those measuring has
still a good sense. Those abstract theory is very useful also in engineering, e.g. in probability
theory.
In this section and later on, we will talk about measure space, which is a triple (, , ),
where is our basic set (e.g. computational domain, the set over which we usually integrate),
is a -algebra (the set of subsets of , which we can measure), and [0, ] is a measure,
i.e. a map that takes a set A (which also means A ) and assigns to this set its measure
(A) [0, ], which can also attain infinity; measurable space is a pair (, ) of set and some
-algebra.
Definition A.12 (-algebra). Let be a set and 2 be a power set (set of all possible sets).
Then 2 (we take only some of all possible subsets) is a -algebra if
(i) is in
Remark A.13. It can be proved that -algebra is closed under countable intersections, it also
contains empty set, etc.
Definition A.14 (Borel -algebra). Borel -algebra is a -algebra generated by all open sets,
i.e. smallest -algebra that contains all open sets.
= {1, 2, 3}, = {, , {1}, {2}, {3}, {2, 3}, {1, 3}, {1, 2}} is the biggest -algebra, and
equals to the space of all subsets = 2
= (0, 1), the set of all possible subsets 2 is not a -algebra because it contains unmea-
surable sets
(i) is a -algebra.
(ii) = 0
(iii) -aditivity: for each sequence {Ai }iN of pairwise disjoint sets from
X
(iN Ai ) = Ai (A.9) {?}
iN
48
Remark A.17. The measure allows the values. When the measure of the whole set is finite
< , we talk about finite measure. Generalizations of here introduced measure include e.g.
sign measures (allowing for negative values of measure) or complex measures (with values in
complex plain).
discrete measure: measures on finite sets, i.e. let = {1, 2, 3, 4, 5}, then we define A =
#A (the number of elements=cardinality) for each A , or A = 2 ;
probability measure ifR = 1; for example let f = 1/2 on (0, 2) and f = 0 otherwise, then
, defined as A := A f d with Lebesgue measure , is a probability measure
We then also write f = g a.e. (or for almost all x or shortly for a.a. x ). In the similar
setting, we can talk about other relations such as f < g a.e.
Remark A.21 (Operations with measurable functions). One example of measurable function is
characteristic function of measurable set. But this set is also closed under standard operations,
also limit operations. It means that if f, g, fi are measurable for i N and a R then also
following functions are measurable
49
heq:simple_funi
Definition A.22 (Nonnegative simple function). Let s : R be a measurable function on a
measurable space (, ). Then we say that it is a non-negative simple function if it has finite
range (finite number of elements) on measurable sets, i.e. it can be expressed as
n
X
s= i Ai (A.12) {?}
i=1
where Ai are pairwise disjoint sets from , and i R are positive i 0 for all i, and A is a
characteristic function of set A, i.e.
(
1 for x A
A = (A.13) {?}
0 otherwise
Examples A.23 (Simple functions).
function f : R R, defined as f (x) = 1 for all x R, is a simple function because its
range is {1}, which is a finite set
function f : R R, defined as f (x) = x for all x R, is not a simple function because its
range is R, which is not a finite set
function f : R R, defined as f (x) = x for all x (0, 1] and f (x) = 0 otherwise, is not a
simple function because its range is [0, 1], which is not a finite set (is uncountable)
function f : R R, defined as f (x) = n1 for all x (n, n + 1], n N and f (x) = 0
otherwise, is not a simple function because its range is {1/n}nN = {1, 12 , 13 , . . . }, which is
not a finite set (is countable)
function f : R R, defined as f (x) = n1 for all x (n, n + 1], n N, such that
n 1000 and f (x) = 0 otherwise, is a simple function because its range is {1/n}1000
n=1 =
{1, 12 , 13 , . . . , 1/1000}, which is a finite set
some other function such as sin(x), exp(x), x2 , x3 are not simple function because their
range is not finite
Definition A.24 (Lebesgue integration of nonnegative simple functions). Let s : R be a
nonnegative simple function like in Def. A.22 and D be a measurable set. Then we define
integral as
Z Xn
s d = i (D Ai ). (A.14) {?}
D i=1
For values like 0 that can possibly occur in the sum, we consider 0 = 0
Definition A.25 (Lebesgue integration of positive functions). Let f : R be a nonnegative
measurable function on measurable space (, , ), i.e. f 0 a.e.. Then we define integral on
D as
Z Z
f d = sup{ s d : s is a nonnegative simple function such that 0 s f }. (A.15) {?}
D D
Definition A.26 (Lebesgue integration). Let f : R be a measurable function on measurable
space (, , ), i.e. f 0 a.e., and f + = max{f, 0} and f = max{f, 0} be its decomposition
into positive and negative parts. Then we define integral on D as
Z Z Z
f d = +
f d f d (A.16) {?}
D D D
when at least one of the integrals is finite.
50
Remark A.27 (Integrals with discrete measure).
assume that (, , ) is measurable space with = {1, 2, 3, 4, 5}, = 2 and discrete measure
A = #A (the number of elements=cardinality) for each A ; then functions u, v : R
can be represented as standard Euclidean vectors u, v R5 in dimension 5 with components
ui = u(i) and v i = v(i). These maps u, v are measurable and are even simple functions
5
X
u= u(i){i}
i=1
Remark A.30 (Difference between integrals). If the Newton, Riemann, and Lebesgue integrals
exists then they equal each other. Usually, the function is not Newton integrable if there is no
primitive function (antiderivative); then still it can have Riemann and Lebesgue integral. It can
also happen that a function is Newton or Riemann integrable but not Lebesgue integrable; this
happens if
Z Z Z
f d = +
f d f d = ,
D D D
i.e. when the Lebesgue integral is not absolutely integrable, i.e f (x) = sinx
x on (0, ) for
(0, 1).
There are also functions, which are Lebesgue integrable but not Riemann integrable, e.g.
Dirichlet function (the indicator function of all rational numbers, Q ). The advantage of
Lebesgue integrals lies in its generality (different measures can be considered), which is espe-
cially advantageous in higher dimensions. It also allows for easy limit theorems, discussed in
next section, which are necessary in analysis of PDEs.
51
A.4.2 Properties of Lebesgue integration
Lemma A.31 (Basic properties). Let a, b R and f, g : R be measurable.
R R R
(i) af + bg d = a f + b g d
R R
(ii) f = g almost everywhere then f = g
R R
(iii) f g almost everywhere then f g
R R
(iv) if f < then also |f | <
R R
(v) | f | |f |
R
Lemma A.32. Let f : R be a positive measurable function and f d = 0. Then f = 0
almost everywhere.
Proof. Set E := {x : f (x) 0}, then E f d = E f + d = 0 and using the previous theorem
R R
The following theorems describe the convergence theorems, which are of particular in PDEs.
Lemma A.35 (Dominated convergence theorem). Let R{fi }i N be a sequence ofR measurable
functions such that limk fk = f a.e. and fk g with |g| d < . Then also |f | d <
and
Z Z
lim fk d = f d
k
References
aess2007bookFEM [Braess, 2007] Braess, D. (2007). Finite elements: Theory, fast solvers, and applications in solid
mechanics. Cambridge University Press.
arlet1991HNAFEM [Ciarlet, 1991] Ciarlet, P. G. (1991). Handbook of Numerical Analysis: Finite Element Methods.
Elsevier Science Publishers B.V. (North-Holland).
eland1976convex [Ekeland and Temam, 1976] Ekeland, I. and Temam, R. (1976). Convex Analysis and Varia-
tional Problems. North-Holland, Amsterdam.
uermond2013book [Ern and Guermond, 2004] Ern, A. and Guermond, J.-L. (2004). Theory and practice of finite
elements. Springer Science & Business Media, New York.
52
Evans2000PDE [Evans, 2000] Evans, L. (2000). Partial Differential Equations, volume 19. American Mathe-
matical Society.
Fortin1991 [Fortin and Brezzi, 1991] Fortin, M. and Brezzi, F. (1991). Mixed and hybrid finite element
methods. Springer Science & Business Media.
Gockenbach2006 [Gockenbach, 2006] Gockenbach, M. (2006). Understanding and implementing the finite element
method.
Fenics2017 [Langtangen and Logg, 2017] Langtangen, H. P. and Logg, A. (2017). Solving PDEs in Python.
Springer International Publishing.
Rudin1986real [Rudin, 1986] Rudin, W. (1986). Real and complex analysis. McGraw-Hill, New York, third
edition.
Wandzurat2003 [Wandzurat and Xiao, 2003] Wandzurat, S. and Xiao, H. (2003). Symmetric quadrature rules
on a triangle. Comput. & Math. with Appl., 45(12):18291840.
53