(KVT3)
Lecturers:
Programme
Date
Topic
Lecturer
Sept 2
Sept 5
Introductory statistics
Funding academic research
AZ
DK
Sept 9
AZ
Sept 12
Medical writing
DK
Sept 16
Contingency tables
CD
Sept 21
Parametric analysis
DK
Sept 23
Non-parametric analysis
CD
Sept 26
DK
Sept 30
Regression analysis
CD
Okt 5
Study design
AZ
Okt 7
Survival analysis
CD
Okt 19
AZ
Learning material
Software: SPSS
The students are expected to bring a laptop.
SPSS should be installed before the course starts using guidelines at:
http://spss.software.aau.dk/
Examination
Written exam - 4 hours, in January
Pensum: slides + learning material specified
for each lecture
Exam questions will reflect lectures and course
assignments
Some questions will require use of a software (SPSS)
Hjlpemidler: everything, but not Internet
and communication with others
More detailed Exam info can be found at
http://person.hst.aau.dk/az/MedIs7
6
Alina Zalounina
Center for Model-based
Medical Decision Support
7
Learning material:
Chapter 1: Data
Chapters 2-5: Descriptive Statistics
Chapters 7-8: Statistical Inference
Learning Objectives
Type of data
Categorical
data
Nominal
Ordinal
etnicity
score
gender
marital status
type of operation
smoking status
Metric
data
Discrete
Continuous
number of
children
weight
height
temp.
age
blood pressure
time
cholesterol
body mass index
Type of Statistics
Descriptive used to organize and
describe a sample
Inferential used to extrapolate from a
sample to a larger population
12
Learning Objectives
Measures of Variability
- Variance
- Standard deviation
- Standard error
Descriptive Plots
- Boxplot
- Histogram
- Q-Q plot
Data distibutions
- Normal
- Binomial
14
Frequency table
Relative frequency
15
x
i 1
Population
Mean
x
i 1
Sample Mean
16
Median (middle)
17
Measures of Variability
Variance
( xi x)
i 1
n -1
Sample Variance
( xi )
N
i 1
Population Variance
18
s
se=
n
n
( xi x)
i 1
n -1
Sample SD
( xi )
N
Standard
Error
2
i 1
Population SD
19
Descriptive Plots
Boxplot
20
Histogram
Overall shape curve shows distribution
Normal distribution
Bell-shaped
1 x- 2
P(a X b) f(x)dx
a
95%
-1.96*
+1.96*
X = a continuous variable
f(x) = probability distribution function of X
= mean
= standard deviation
Check normality
Without inspecting the data it is risky to assume a normal
distribution.
There are a number of graphs that can be used to check the
deviations of the data from the normal distribution:
A histogram should reveal a bell shaped curve.
QQ plot: Curvature of the points indicates departures of
normality
24
Skew distribution
25
Binomial distribution
binomial variable
26
n!
x
p (1 p)n-x
x!(n-x)!
Note: n!=n(n-1)(n-2)1
27
P(X=2)=?
5!
5-2
2
=> P(X=2)=
(1
0.3)
0.31
0.3
2!(5-2)!
probability
distribution
28
Learning Objectives
Inferential Statistics
Can your experiment make a statement about
the general population?
Two types of tests:
1. Parametric
2. Non-Parametric
Learning Objectives
Absolute risk
Relative risk
Odds
Odds ratio
outcome
Apgar
score <7
Yes
No
Totals
Yes
11
No
17
19
Totals
10
20
30
risk factor
Interpretation of RR:
Mothers who smoked during pregnancy had more than 5 times
the risk of getting low Apgar score as those who did not smoke.
Odds
outcome
Apgar score<7
risk
factor
Mother
smoked
during
pregnancy
Yes
No
Totals
Yes
10
No
17
20
Totals
11
19
30
The ratio between the odds is the odds ratio for smoking
among mothers with low score compared to mothers with
high score:
OR = odds1/odds2 = 22.67
Interpretation of OR:
Mothers with low Apgar score were more than 22 times as
likely to have smoked during pregnancy as those with high
Apgar score.
RR versus OR
Exposed Non-Exposed
Outcome
No Outcome
Outcome
A(B D)
RR
B(A C)
No Outcome
Exposed
NonExposed
AD
OR
BC
Learning Objectives
Introduction to SPSS
Example
39
Data view
Variable view
Smoking
LowApgarScore
41
Frequences
Cross - Tabulations
42
Risk estimate
Relative risks:
4.25=(17/20)/(2/10)
0.188=(3/20)/(8/10)
43
Descriptives
Box-plot
44
Histogram
45
Q-Q Plot
46
Learning Objectives
Exercises:
http://person.hst.aau.dk/az/MedIs7
48