Not a statistician! I do work in the area of chemometrics So emphasis will be on practical knowledge and application of statistics, rather than theorems and proofs.
Objectives
Understand the fundamental principles of statistical inference. Understand the general principles underlying the most common tests. Know the assumptions of common tests and understand impact of violations. Be able to perform standard statistical analyses
Ernest Rutherford
To call in a statistician after the experiment is done may be no more than asking him to perform a postmortem examination; he may be able to say what the experiment died of.
Silver
Moderate familiarity coupled with a strong desire to demonstrate same; statistical reach exceeds grasp
Knows when statistical issues are (and are not) important; recognizes limitations (of self and statistical science)
Overwhelming concern with statistical minutae; scientific forest often obscured by statistical trees.
That to which we can/should all aspire.
Gold
11
Getting Started Statistics: Collection Organization Analysis Interpretation of DATA to answer questions and make decisions.
12
Populations and samples: Study a part in order to gain information about the whole
Population
Sample
Analyse
13
An Ideal Sample Ideally, a sample should be representative of the population as a whole. representative: what is true of the sample is true of the population (apart from the number of observations). This is more likely to happen if the sampling process is free of bias. bias: systematic favouring of certain members of the population over others
14
Im in!
Me too!
And me!
15
The study of 1236 men aged over 60 found that those who had three alcoholic drinks a day had a 60% lower risk of death than teetotallers.
- Many drinkers never made it to age 60
Survey of mothers with young children on welfare. Those who voluntarily went to job training courses were found to have an increased chance of leaving the welfare state.
- volunteers are more motivated anyway
16
Population
17
quality control
If there are periodic, seasonal and/or systematic effects, our sampling process will be biased.
18
1 2
5 8
7
9
10
11
13
12
Each individual in the population has the same chance of being selected in the sample.
19
20
$25,000-$49,999/year
Under $25,000/year
50%
30%
Select 20%, 50%, 30% of your sample from each income category, respectively. Can be more efficient than simple random but more difficult to analyse.
21
Types of Data
Nominal Categorical M/F Group 1, 2 or 3 Red, blue, green Poor, moderate, good Left, middle, right Small, average, large
Ordinal
Quantitative