Francis J. DiTraglia
University of Pennsylvania
Lecture # 1
2/ 29
Sample
Observed subset, or portion, of a population
Sample Size
# of items in the sample, typically denoted n
Examples...
3/ 29
Parameter
Numerical measure that describes specific characteristic of a
population.
Statistic
Numerical measure that describes specific characteristic of sample.
Examples...
4/ 29
Population
Sample
Numerical
Summary
Statistic
Numerical
Summary
Parameter
5/ 29
Descriptive Statistics
Graphical and numerical procedures to summarize data.
Inferential Statistics
Using data to estimate, predict, quantify uncertainty.
6/ 29
This Course
Summary Statistics
Graphics
Ive only every seen black ravens, so all ravens must be black.
7/ 29
2. Nonsampling Error
I
8/ 29
9/ 29
Huge Sample
Sent out over 10 million ballots; 2.4 million replies! (Compared to
less than 45 million votes cast in actual election)
Prediction
Landslide for Landon: Landonslide, if you will.
10/ 29
Spectacularly Mistaken!
Roosevelt
Landon
41%
57%
Actual Result:
61%
37%
11/ 29
Biased Sample
Some units more likely to be sampled than others.
I
Non-response Bias
Even if sample is unbiased, cant force people to reply.
I
In this case, neither effect alone was enough to throw off the result
but together they did.
12/ 29
13/ 29
14/ 29
ME 2
p
P(1 P)/n
We Report: P ME
16/ 29
17/ 29
Problem
18/ 29
Confounder
19/ 29
Essential Point!
Random assignment neutralizes effect of all confounding factors:
since groups are initially equal, on average, any difference that
emerges must be the treatment effect.
20/ 29
Pool of
Experimental
Subjects
Randomly divided
into two groups
Treatment
Subjects Blind
Control
Evaluation
Experimenters Blind
Evaluation
21/ 29
22/ 29
23/ 29
24/ 29
Observational Data
Data that do not come from a randomized experiment.
25/ 29
26/ 29
Regression
Technique that allows us to remove influence of confounders.
Works well if we can identify and gather data on all of them. But...
27/ 29
28/ 29
Reminders:
29/ 29