Chapter 5 2
Arises frequently as the null
distribution of a test statistic
especially in likelihood-ratio test
and most notably in Analysis of
Variance.
Comparing statistical models that
have been fit to the data.
F cannot be negative, and it is a
continuous distribution.
The F distribution is positively
skewed.
Its values range from 0 to
As F the curve approaches the
X-axis.
Chapter 5 3
3
Chapter 5 4
The F distribution is also used for testing whether two or
more sample means came from the same or equal
populations.
Assumptions:
The sampled populations follow the normal distribution.
The populations have equal standard deviations.
The samples are randomly selected and are independent.
Chapter 5 5
5
Whether the differences between the groups are
significant depends on
Chapter 5 6
A division of the overall variability in data values in
order to compare means.
Overall (or “total”) variability is divided into two
components:
the variability “between” groups, and
the variability “within” groups
Summarized in an “ANOVA” table.
Chapter 5 7
Two variables: 1 independent variable,
1 dependent variable
12 22 k2
Chapter 5 9
Comparing Means of Two or More
Populations
• State the hypothesis
• H0: µ1 = µ2 =…= µk
Step 1 • H1: The means are not all equal
ANOVA
table! • Test Statistics
SST k 1
Step 2 F
SSE n k
• Critical value
Step 3
Chapter 5 10
10
Comparing Means of Two or More
Populations
• Decision
Step 4 • Reject H0 if F > F,k-1,n-k
• Conclusion
Step 5
Chapter 5 11
11
Only one classification factor is considered
Factor
1 Response/ outcome/
dependent variable
Treatment 2
(samples)
(The level of
the factor)
Replicates (1,…,j)
The object to a
given treatment Chapter 5 12
Degrees Sum of Squares (SS) Mean of Squares (MS) F Value
of
Freedom
(Df)
Between Ftest
sample 2
MS (Tr )
Treatment
Treatment / Factor 2
1 k 2 1 2
(Factor
SS (Tr ) Ti. T..
Variation) k -1 n i 1 kn
SS (Tr ) / Factor
k 1 Error
2
Within
samples
SSE
(Error T- k SSE SST SS (Tr ) Error
2
MSE
variation)
N k
k n
1 2
Total T-1 SST xij2 T..
i 1 j 1 kn
Chapter 5 13
• n = number of data all together
• a = number of levels
• x = mean for entire data set is
Group i has
• ni = # of data in level i
• xij = value for data j in level i
• xi = mean for level i
• si = standard deviation for level i
Chapter 5 14
An experiment was performed to determine whether the
annealing temperature of ductile iron affects its tensile
strength. Five specimens were annealed at each of four
temperatures. The tensile strength (in ksi) was measured
for each temperature. The results are presented in the
following table. Can you conclude that there are differences
among the mean strengths?
Chapter 5 15
Subjects: 25 patients with blisters
Treatments: Treatment A, Treatment B, Placebo
Measurement: # of days until blisters heal
Chapter 5 19
The ANOVA F-statistic is a ratio of the Between
Group Variaton divided by the Within Group
Variation:
Between MSA
F
Within MSE
A large F is evidence against H0, since it
indicates that there is more difference
between groups than within groups.
Chapter 5 20
SST ( xij x ) 2
x
x 2 ij
2
ij
obs n
SSE ( xij xi ) 2
obs
SSA ( xi x)
2
( xi ) 2
x ij
2
obs ni n
SS MSA
SST SSA SSE ; MS ; F
DF MSE
Chapter 5 21
One-way Analysis of Variance
Source DF SS MS F P
Factor a-1 SS(Between) MSA MSA/MSE
Error n-a SS(Error) MSE
Total n-1 SS(Total)
From F-distribution
with a-1 numerator and
n-a denominator d.f.
MSA = SS(Between)/(a-1)
n-1 = (a-1) + (n-a) MSE = SS(Error)/(n-a)
SS(Total) = SS(Between)Chapter 5
+ SS(Error) 22
“F” means “F test statistic”
One-way Analysis of Variance
Source DF SS MS F P
Factor 2 2510.5 1255.3 93.44 0.000
Error 12 161.2 13.4
Total 14 2671.7
P-Value
Source DF SS MS F P
Factor 2 2510.5 1255.3 93.44 0.000
Error 12 161.2 13.4
Total 14 2671.7
Source DF SS MS F P
Factor 2 2510.5 1255.3 93.44 0.000
Error 12 161.2 13.4
Total 14 2671.7
1255.2 = 2510.5/2
13.4 = 161.2/12
14 = 2 + 12 93.44 = 1255.3/13.4
2671.7 = 2510.5 + 161.2 Chapter 5 25
1 1
X1 X 2 t MSE n n
1 2
Chapter 5 26
26
Chapter 5 27
2 Independent Variables
Examples
IV#1 IV#2 DV
Drug Level Age of Patient Anxiety Level
Key Advantages
Compare relative influences on DV
Examine interactions between IV
Chapter 5 28
Only two classification factor is considered
Factor B
1 2 j
1
Factor A 2
i
Chapter 5 29
The standard two-way ANOVA tests are valid under the following
conditions:
xij1 , , xijk
Chapter 5 30
Source (Df) Sum of Squares (SS) Mean of Squares (MS) F Value
1 a b 2 x...2
SSAB xij .
SSAB MSAB
Ftest Interaction
(a-1)(b-1)
Interaction MSAB
n i 1 j 1 abn a 1 b 1 MSE effect
Total abn-1 a b n
x...2
SST x 2
ijk
i 1 j 1 k 1 abn
Chapter 5 31
A chemical engineer is studying the effects of various
reagents and catalyst on the yield of a certain process. Yield
is expressed as a percentage of a theoretical maximum. 4
runs of the process were made for each combination of 3
reagents and 4 catalysts. Construct an ANOVA table and
test is there an interaction effect between reagents and
catalyst.
Reagent
Catalyst
1 2 3
Chapter 5 32
Miller, I and Miller, M (1999). Mathematical Statistics, 6th Ed.
Prentice-Hall, Inc.
Chapter 1 33