Links to UNSW TV
Click on the topic to go to the online recording:
Week 1: Probability space, Calculating with probabilities,
Counting.
Week 2: Bernoulli distribution, Binomial distribution,
Geometric distribution, Negative Binomial distribution.
Week 3: Numerical methods to summarize data, Graphical
procedures to summarize data.
Week 4: Sampling with and without replacement, Properties
of sample mean and variance.
Week 5: Chi-squared distribution, Student-t distribution,
Snecdors F distribution, Distribution of sample mean and
variance.
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
S
(null/empty set) for all i 6= j, then
Ek F.
k=1
Example:
Consider the experiment of selecting a card at random from a
deck of four cards with four different suits, and noting its suit:
hearts (H), diamonds (D), spades (S), or clubs (C ).
The sample space is: = {H, D, S, C }.
All the sets of possible events are a -algebra of F.
3/33
(H, D)
(H, S)
(H, C )
(D, S)
(D, C )
(S, C )
4/33
Sample Spaces
5/33
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Probability Measure
A probability measure on is a function Pr from the subsets
of to < that satisfies the following axioms:
1. For all events Ei , 0 Pr (Ei ) 1;
2. Pr () = 1;
3. For any events E1 , E2 , . . . where Ei Ej = for every i 6= j,
then:
!
[
X
Pr
Ek = Pr (E1 ) + Pr (E2 ) + . . . =
Pr (Ek ) .
k=1
k=1
6/33
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Definitions
Union of two events: C = A B
is an event that both A and/or B occurs, i.e., C iff A
or B.
Intersection of two events: C = A B
is an event that both A and B occurs, i.e., C iff A
and B.
Complement of an event: B = Ac
is the event that A does not occur, i.e., Ac iff
/ A.
Events are disjoint: if they have no outcomes in commun
i.e., A and C are disjoint iff A C = .
7/33
Useful laws
Commutative laws:
AB =B A
and A B = B A.
Associative laws:
(A B) C = A (B C )
and (A B) C = A (B C ).
Distributive laws:
(A B) C = (A C ) (B C )
and (A B) C = (A C ) (B C ).
DeMorgans laws:
(A B)c = Ac B c and (A B)c = Ac B c .
8/33
9/33
10/33
Pr ((E1 E2 ) E3 )
Pr (E1 E2 ) + Pr (E3 )
Pr ((E1 E2 ) E3 )
Pr (E1 E2 ) + Pr (E3 )
Pr ((E1 E3 ) (E2 E3 ))
11/33
Inequality rules
Using the definition of probability measure one can prove the
following inequalities:
Booles Inequality: If E1 , E2 , . . . , En are any n events, then
!
n
n
[
X
Pr
Ek
Pr (Ek ) ,
k=1
k=1
Inequality rules
13/33
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Conditional Probability
The conditional probability of A, given B, as:
Pr (A |B ) =
Pr (A B)
Pr (B)
S
P
Pr
Ak |B =
Pr (Ak |B ).
14/33
k=1
k=1
Conditional Probability
15/33
X
k=1
16/33
Pr (A |Ek ) Pr (Ek ) .
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Independence
Events A and B are said to be independent if:
Pr (A B) = Pr (A) Pr (B) .
Equivalently, we have events A and B independent if:
Pr (A |B ) = Pr (A)
and
Pr (B |A ) = Pr (B) .
18/33
19/33
Example
Consider the experiment of selecting n cards at random from a
deck of 52 cards with 13 hearts (H), 13 diamonds (D), 13 spades
(S), and 13 clubs (C ) cards.
False For n = 2, the second card is independent of the first card.
False For n = 14, the 14th card is mutually independent of the first
13 cards.
True Previous two questions, but now when the card is put back in
the stock before the next one is selected.
True For n = 3, A=at least 2 H and B=at least 2 C are
mutually exclusive events.
False For n = 4, A=at least 2 H and B=at least 2 C are
mutually exclusive events.
20/33
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Bayes Theorem
Bayes Theorem: Suppose E1 , E2 , . . . represent a complete
partitioning of the sample space , then for any non-empty
event A F, we have:
Pr (A |Ek ) Pr (Ek )
,
Pr (Ek |A ) = P
j=1 Pr (A |Ej ) Pr (Ej )
for any k = 1, 2, . . ..
21/33
Example
An insurance company classifies its policyholders according to
three risk classes: L (low risk), M (medium risk) and H (high
risk). The proportion of H policyholders is 20% and the
proportion of L policyholders is 50%. For each of the risk
classes, the probability of a claim is 0.01 for L, 0.02 for M,
and 0.04 for H.
a. Question: If a claim occurs, what is the probability that it is
from a L (low risk) policyholder?
22/33
Example (cont.)
b. Question: If a claim occurs, what is the probability that it is
from a M (medium risk) policyholder?
c. Question: If a claim occurs, what is the probability that it is
from a H (high risk) policyholder?
0.30.02
0.019
0.20.04
0.019
= 32%.
= 42%.
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Counting Principles
Multiplication Rule: Suppose S1 , S2 , . . . , Sm are m sets with
respective number of elements n1 , n2 , . . . , nm . The number of
ways of choosing one element from each set is given by:
n1 n2 . . . nm .
r ! (n r )!
r
n
n1 , n2 , . . . , nr
25/33
n!
.
n1 ! n2 ! . . . nr !
Example
Question: An airline has 6 flights daily from Sydney to
Honolulu, and 8 flights daily from Honolulu to Los Angeles.
The airline offers no direct flight from Sydney to Los Angeles.
If the flights are to be made on separate days, how many
different flight arrangements can the airline offer from Sydney
to Los Angeles?
27/33
30/33
Introduction in Probability
Probability space
Sample Space & -algebra
Probability Measure
Counting
Counting Principles
Computing Probabilities
Computing Probabilities
If the elements of the sample space all have equal
probability and assuming: (a) there are N elements in and
(b) the event A can occur in any of n mutually exclusive ways,
then:
n
Pr (A) = .
N
Question: A drawer contains 10 pairs of socks. If 6 socks are
taken at random and without replacement, compute the
probability that there is at least one matching pair among
these 6 socks.
31/33
32/33
6
9
5
8
= 6 0.0758 =
5
11 .
5
8
= 0.0758.
a
.
a+b
Week 1
Week 3
Week 4
Probability: Week 2
Week 6
Review
Estimation: Week 5
Week
7
Week
8
Week 9
Hypothesis testing:
Week
10
Week
11
Week
12
Linear regression:
Week 2 VL
Week 3 VL
Week 4 VL
Video lectures: Week 1 VL
Week 5 VL
101/144
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
Course overview
Week 1
Week 2-4
Week 5-6
Week 7-9
Week 10-12
102/144
This week
Describing distribution using measures of location and
dispersion:
-
mean;
variance;
skewness;
kurtosis.
103/144
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
Sample spaces
An insurer offers health insurance. An individual can have file
either one, two, three, or no claim in a given quarter. For
simplicity, it is not possible to issue more than three claims.
Questions:
a. State all possible events.
b. Give the sample space.
c. Give the algebra.
Solutions:
a. E {{0}, {1}, {2}, {3}, {0, 1}, {0, 2}, {0, 3}, {1, 2}, {1, 3},
{2, 3}, {0, 1, 2}, {0, 1, 3}, {0, 2, 3}, {1, 2, 3}, {0, 1, 2, 3}, }.
b. = {0, 1, 2, 3}.
104/144
c. {{0}, {1}, {2}, {3}, {0, 1}, {0, 2}, {0, 3}, {1, 2}, {1, 3}, {2, 4},
{0, 1, 2}, {0, 1, 3}, {0, 2, 3}, {1, 2, 3}, {0, 1, 2, 3}, }.
Solutions:
a. Pr (C = {2, 3}|C = {1, 3}) =
Pr(C ={3})
(Pr(C ={1})+Pr(C ={3}))
= 1/10, and
Pr(C ={2})
(Pr(C ={0})+Pr(C ={2}))
= 1/10.
106/144
Counting principles
The history of an individuals claiming record is had he had 3
quarters with 2 claims, 2 quarters with 1 claim and 15
quarters without a claim.
Questions:
a. What is the probability that the insured had first 15 quarters
without a claim and then 5 quarters with at least one claim?
b. What is the probability that the insured had first 15 quarters
without a claim and then 2 quarters with one claim and then 3
quarters with two claims?
c. Comment on your results.
Solutions:
107/144
a. Use Combination,
n = 20, r = 5, number of ways choosing
objects: 20
=
15,
504. Thus, probability is 1/15,504.
5
b. Use Multinomial, n = 20, r1 = 15, r2 = 2, r3 = 3, number of
20!
= 155, 040. Thus, probability is
ways choosing objects: 15!2!3!
1/155,040.
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
for all x.
Distribution function
Properties of a distribution function:
1. FX () is a non-decreasing function, i.e., FX (x1 ) FX (x2 )
whenever x1 x2 ;
2. FX () is right-continuous, that is for all x,
lim+ FX (x + ) = FX (x);
0
3. FX () = 0;
4. FX (+) = 1.
109/144
- continuous;
- discrete;
- mixed.
A
B
C
D
E
F
F(x)
0.8
0.6
0.4
0.2
0
10
110/144
0
x
Which one
is/are
distribution
functions?
Solution:
B and F.
Not nondecreasing:
A;
Not rightcontinuous:
C;
Not
F () = 0:
D;
Not
F () = 1:
10
E.
Note that:
p.m.f. satisfies
FX (x) .
x
pX (xk ) = 1;
k=0
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
x pX (x) ,
all x
if
is discrete;
ZX
X = E[X ] =
x fX (x) dx,
if X is continuous.
112/144
E [X ] = x fX (x)dx = 0 x fX (x)dx = 12 x 2 0 = 12 .
h (x) pX (x) ,
all x
if
is discrete;
ZX
E[h(X )] =
if X is continuous.
1
Exercise: Let X be a r.v. with pdf: pX (x) = 10
for
x = 0, 1, 2, . . . , 9 and zero otherwise, let h(X ) = X 2 .
Calculate E [h(X )].
Solution: P
P
1
E [h(X )] = all x h(x) pX (x) = 9x=0 x 2 10
= 28.5.
113/144
xpX ,Y (x, y ) +
x,y
E[X Y ] =
X
X
x
114/144
x,y
(x y ) pX ,Y (x, y ) =
x,y
XX
x
(x pX (x))
X
y
(x pX (x)) (y pY (y ))
(y pY (y )) = E[X ] E[Y ].
Example
An insurance company offers motor vehicle insurance. The
probability that an insured files a claim is 20%. Assume that
the insured files not more than one claim.
a. Question: What is the probability mass function?
b. Question: What is the expected number of claims.
c. Question: Assumes that there are 150 insured. What is the
expected number of claims.
The claims will be paid at the end of the year. The required
capital will depends on investment return. A 10% increase in
asset value occurs w.p. 20% and a decrease of 5% occurs
w.p. 15%. The claim value is $1,000 for each claim.
115/144
Solution
0.2, if x = 1;
a. pX (x) =
and zero otherwise.
0.8, if x = 0,
P
b. E [X ] = all x pX (x) x = 0.2 1 + 0.8 0 = 0.2.
c. E [150 X ] = 150 E [X ] = 150 0.2 = 30.
d. Let Y be
the random variable of the asset value. We have:
0.2, if y = 1/1.1;
0.65, if y = 1;
and zero otherwise.
pY (y ) =
0.15, if y = 1/0.95,
E [1000 Y ] =1000 E [Y ]
1
1
=1000 0.2
+ 0.65 1 + 0.15
1.1
0.95
=1000 0.99689 = 996.89.
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
Measures of dispersion
X X gives deviation from the expected value.
117/144
Variance of X
MAD is not easily to calculate, and has not the nice
properties (not related to moments of distribution).
Another measure of dispersion: Let X be a random variable,
the variance is given by:
2 = Var (X ) =E (X X )2
=E X 2 2X .
The function E (X )2 is minimized when = X .
=Var (X ) + Var (Y ) ,
119/144
Exercises
The claims of the motor vehicle insurance are itself stochastic.
The distribution of the claim value of an insured is:
0,
if x < 0;
fX (x) =
5 exp ( x) , if x > 0;
pX (0) =0.8.
a. Question: Find the expected value of the claims size.
b. Question: Find the variance of the claims size.
c. Question: The price of an insurance contract is the expected
value plus half the standard deviation. Find the price of the
MVI contract.
d. Question: Same as c., but now for the 150 contract together.
120/144
Solution
R
a. E [X ] = 0 0.8 + 0 5 exp ( x) xdx
i
h
= 5 0
1)
= 0 + 5 exp(x)
()2
c. Price = E [X ] + 0.5
p
Var (X ) =
1
5
+ 0.5
3
5
q
i
P150
Xi + 0.5 Var ( i=1 Xi ) =
p
3 150
150E [X ]+0.5 150 Var (X ) = 150
+0.5
5
5 =
d. Price = E
121/144
(1) =
exp ( x) x 2 dx
E X 2 =02 0.8 +
5
0
2
x
2x
2
=0 + exp( x)
+
5
()2
()3 0
2
2
=
.
= 0
3
5
5 2
2
1
9
2
Var (X ) =E X 2 (E [X ]) = 2
=
.
5
252
252
b.
1
2
1
2 .
hP
150
i=1
913.5
30.22
.
1
5 .
Skewness
Let X be a random variable. The skewness of X is given by:
"
#
X X 3
.
X = E
X
Coefficient of skewness:
3 =
E X3
(E [X 2 ])3/2
X X3 +Y Y3
X3 +Y
if X , Y are independent.
Pr(X=x)
Pr(X=x)
Pr(X=x)
distribution i)
1/2
1/3
1/6
0
1/2
1/3
1/6
0
2
1/2
1/3
1/6
0
10
123/144
0
x
distribution ii)
0
x
distribution iii)
4
x
Snedecors F p.d.f.
Snedecors F c.d.f.
1
v = 20, v = 40
1
0.9
0.8
0.8
0.6
0.4
0.7
0.6
0.5
0.4
0.3
0.2
0.2
0.1
0
0
124/144
2
x
0
0
Beta p.d.f.
Beta c.d.f.
1
a= 4, b= 2
2
0.9
0.8
1.5
0.5
0.7
0.6
0.5
0.4
0.3
0.2
0.1
a= 4, b= 2
0
0
125/144
0.5
x
0
0
0.5
x
Kurtosis
Let X be a random variable. The (excess) kurtosis of X is
given by:
"
#
X X 4
X = E
3.
X
Measures the peakedness (positive) or flatness (negative) of a
random variable.
Kurtosis coefficient:
4 =
126/144
E X4
(E [X 2 ])2
1/18
3
Pr(X=x)
Pr(X=x)
distribution i)
8/9
0
x
distribution ii)
1/8
0
x
distribution iii)
3/4
1/8
10
127/144
3/4
Pr(X=x)
4
x
p.d.f.
c.d.f.
1
0.7
0.9
0.6
0.5
0.4
0.3
0.2
0.8
0.7
0.6
0.5
0.4
0.3
0.2
Uniform( 3, 3)
Normal(0,1)
Laplace(0,1/ 2)
0.1
0.1
0
0
x
0
x
Exercises
An insurance company is pricing its policies using the
standard deviation pricing principle.
The regulator requires that insurers have enough reserves in
order to reduce the probability of ruin to 0.5%.
Holding capital is a cost.
a. Would the insurer company prefer claims with a distribution
I. with mean $100 and standard deviation of $5.
II. with mean $50 and standard deviation of $10.
129/144
Exercises
c. Would the insurer company prefer claims with a distribution
I. with mean $100 and standard deviation of $5 and a skewness
of $0 and a kurtosis of $5.
II. with mean $100 and standard deviation of $5 and a skewness
of $0 and a kurtosis of $2.
Solution:
a.
b.
c.
d.
130/144
II.
II.
II.
Cannot say from the question.
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
r th central moments
Let X be a r.v., the rth central moment is given by:
X
(x X )r pX (x) ,
all x
if
is discrete;
r
ZX
E [(X X ) ] =
(x X )r fX (x) dx,
if X is continuous.
Relation central & non-central moments:
r
h i
X
r
r
E [(X X ) ] =
E X k (X )r k .
k
k=0
(Non-central) Moments
Let X be a r.v., the rth (non-central) moment is given by:
X
x r pX (x) ,
all x
if
is discrete;
ZX
E [X r ] =
x r fX (x) dx,
if X is continuous.
Consider the Motor Vehicle insurer from slide 115.
Recall the mean of the claim size from b. on slide 116.
a. Question: Find the second central moment for an insured.
a. Solution:
Var (X ) =
E hX 2 (E [Xi ])2 = (0.2 12 + .08 02 ) 0.22 = 0.16 or
132/144
Exercises
b. Question: Find the skewness using the third central moment.
b. Solution: Start with the third central moment:
h
i
E (X X )3 =0.2 (1 0.2)3 + 0.8 (0 0.2)3 =0.1024 0.0064=0.096.
h
i
#
"
3
3
E
(X
)
X
0.096
(X X )
=
=
= 1.5.
X =E
3
3
0.163/2
c. Question: Find
and third non-central moments.
2 the second
2
c. Solution:
E X = 1 0.2 + 02 0.8 = 0.2 and
3
E X = 13 0.2 + 03 0.8 = 0.2.
d. Question: Find the skewness using only non-central moments.
d. Solution:
h
i
E (X X )3 = E X 3 3X X 2 + 32X X 3X =
E X 3 3X E X 2 + 32X E [X ] 33X =0.2 0.12 + 0.024 0.008=0.096.
133/144
= E e Xt ,
* using exp(x) = 1 + x +
134/144
x2
2!
x3
3!
+ . . ., with x = X t.
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
MX (0) = E [X r ] ,
for r = 0, 1, 2, 3, . . . ;
MmX +b (t) = MX (m t) e bt , for constants m, b;
MX +Y (t) = MX (t) MY (t), only if X , Y are independent.
135/144
h
i X
tk
MX (t) = E e X t =
k .
k!
k=0
=
x r e xt fX (x) dx
h
i
=E X r e X t .
Set t = 0 and you get the desired result.
Remark: If the m.g.f. exists for t in an open interval
containing zero, then it uniquely determines the probability
distribution.
137/144
Example: Consider the MVI from slide 115. The m.g.f. is:
h i X
MX (t) = E e Xt =
pX (x) e xt = 0.2 e 1t + 0.8 e 0t = 0.8 + 0.2e t .
all x
138/144
Exercises
Consider the Motor Vehicle insurer from slide 120.
a. Question: Determine the m.g.f. of the claim size.
b. Question: Use the m.g.f. to determine the skewness.
a. Solution: We have:
Z
h
i
x tx
tX
0t
MX (t) = E e
=0.8 e +
e
e dx
5
Z 0
(t)x
=0.8 +
e
dx
5
0
=0.8 +
e (t)x
5 (t )
0
=0.8
.
5 (t )
139/144
5(t) .
1
2
E [X ] =
=
1 (t )
=
t
5
5
t=0
t=0
2
MX (t)
2
= 2 (t )3
= 2
E X2 =
2
t
5
5
t=0
t=0
3
MX (t)
6
2
E X3 =
3 (t )4
= 3.
=
3
t
5
5
t=0
t=0
2
1
9
2
=
Var (X ) =E X (E [X ])2 = 2
5
(5)2
252
E (X X )3
E X 3 3X E X 2 + 32X E [X ] 3X
X =
=
Var (X )3/2
Var (X )3/2
6
12
11
6 52
25
1
3 3 253 + 2 1253
= 5
=
3 3 + 2 3 = 122/27.
3
3
(3/(5))
3
3
3
140/144
h i X
PY (t) = E t Y =
pY (i) t i .
i=1
Properties of p.g.f.:
- The relationship between p.g.f. and m.g.f. is as follows:
PY (t) = MY (log(t)) .
(r )
- Probabilities: Pr(Y = r ) = P Y (t) /r !
t=0
X
1 i
h i or MX (t) = e CX (t) ,
CX (t) = log(MX (t)) =
i!
i=1
where i =
(i)
CX (0)
is the i th cumulant.
Properties of c.g.f.:
i = i th central moment, for r = 2, 3;
CmX +b (t) = CX (m t) + b t,
for constants m, b;
CX +Y (t) = CX (t) + CY (t),
X , Y independent.
(r )
142/144
for r = 4, 5, 6, . . ..
Introduction in probability
Exercise
Mathematical methods
Random variables and distributions
Measures of location: probability distributions
Measures of dispersion
r th central/non-central moments
Generating functions
Summary
Summary
Mathematical Expectation
The mathematical expectation of h (X ) is:
X
h (x) pX (x) ,
if X is discrete;
all x
E [h (X )] =
Z
Mean: = E [X ].
Moments: r = E [X r ] refers to the r th (non-central)
moment.
Central moments: E [(X x )r ] refers to the r th central
moment.
143/144
Dispersion
h
i
Variance: 2 = Var (X ) = E (X )2 = E X 2 2 .
#
X X 3
refers to the skewness. It
Skewness: X = E
X
measures the lack of symmetry in the p.d.f..
"
#
X X 4
Kurtosis: X = E
3 refers to the kurtosis.
X
It measures the peakedness or flatness of the p.d.f..
Moment generating function: MX (t) = E e X t , it
uniquely defines a density function.
It is useful to calculate
(r )
r
moments: r = E [X ] = M X (t) .
"
t=0
144/144