1. There is a lower infant mortality rate for athome births assisted by midwives than for hospital births attended by physicians. Does this mean that concerned women would be wise to opt for home deliveries by
midwives?
A. B.
Yes No
1 - 61
1 - 62
Chapter 1
Introduction to Statistics
1-1 Overview 1-2 The Nature of Data 1-3 Uses and Abuses of Statistics 1-4 Design of Experiments
Collecting Data
We typically obtain our data from two common sources; observational studies and experiments. In an observational study, we observe and measure specific characteristics but we dont attempt to manipulate or modify the subjects being studied. Example: Surveys. In an experiment, we apply some treatment and then proceed to observe its effects on the subjects. Example: Clinical drug trials.
1 - 63
MATH 1050Y-A (FW 2012-13)
1 1
1 - 64
Design of Experiments
When designing an experiment, one should follow the steps below. 1. Identify your objective. What question are you trying to answer? What is the population of interest? 2. Collect sample data. The sample must be large enough to detect the treatment effect, representative of the population under consideration, and unaffected by extraneous factors. 3. Use a random procedure to avoid bias. 4. Analyze the data and form conclusions.
MATH 1050Y-A (FW 2012-13)
Design of Experiments
Example: Hudson Auto Repair
The manager of Hudson Auto would like to have a better understanding of the cost of parts used in the engine tune-ups performed in the shop. She examines 50 customer invoices for tune-ups and records the costs of parts, rounded to the nearest dollar
1 - 65
1 - 66
2. A sample of 50
engine tune-ups is examined.
1 - 67
1 - 68
Design of Experiment
When designing an experiment to test the effectiveness of one or more treatments, subjects should be assigned to the different treatments in such a way that those groups are very similar. One simple yet effective approach is known as a completely randomized design. In this approach, subjects are assigned to the treatments at random. Another approach is a rigorously controlled design wherein subjects are chosen carefully so that the different groups are carefully arranged to be similar.
MATH 1050Y-A (FW 2012-13)
2 2
1 - 69
1 - 70
Confounding
The results of experiments are sometimes ruined by confounding. Confounding occurs when the effects from two or more variables cannot be distinguished from each other. Example: In a study of the effects of education on salary, a group of older subjects without university degrees might be compared to a group of younger subjects with university degrees. If a difference in salary is observed, it would be difficult to tell if the difference is due to education or age.
MATH 1050Y-A (FW 2012-13)
Replication
Replication is the repetition of an experiment on more than one subject. Samples should be large enough so that the erratic behaviour that is characteristic of very small samples will not disguise the true effects of different treatments.
1 - 71
1 - 72
Randomization
Randomization is used when subjects are assigned to different groups through a process of random selection. The idea is to use chance as a way to create two groups that are similar. Randomization plays a crucial role in how the samples are chosen. We will now explore several such sampling methods. REMEMBER: Data carelessly collected may be so completely useless that no amount of statistical torturing can salvage them.
MATH 1050Y-A (FW 2012-13)
Random Sample
In a random sample, members of the population are selected in such a way that each individual has an equal chance of being selected.
1 - 73
1 - 74
3 3
Example: The Excel function RANDBETWEEN(1,X) can be used to generate random numbers between 1 and X
1 - 75
1 - 76
Stratified Sampling
With stratified sampling, we subdivide the populations into at least two different subpopulations (or strata) that share the same characteristics (such as gender), then we draw a simple random sample from each stratum.
Stratified Sampling
Stratified sampling can help us reduce variation in our results that might result from a simple random sample. Example: In surveying views on pay equity, we might use gender as a basis for creating two strata and use proportionate sampling. A simple random sample might be biased toward males or females.
1 - 77
1 - 78
Systematic Sampling
In systematic sampling, we select some starting point and then select every kth element in the population.
Cluster Sampling
In cluster sampling, we first divide the population area into clusters, then randomly select a few of those sections, and then choose all the members from those selected sections.
Example: When selecting a sample of objects for inspection on an assembly line, you might select every 100th object.
MATH 1050Y-A (FW 2012-13)
1 - 79
1 - 80
Convenience Sampling
In convenience sampling, we simply use results that are readily available. Convenience sampling does not employ randomization and can be heavily biased.
4 4
1 - 81
1 - 82
Summary
Three very important considerations in the design of experiments are as follows 1. Use randomization to assign subjects to different groups. 2. Use replication by repeating the experiment on enough subjects so that effects of treatment or other factors can be clearly seen. 3. Control the effects of variables by using such techniques as blinding and a completely randomized experimental design.
MATH 1050Y-A (FW 2012-13)
Summary
1 - 83
1 - 84
Errors
No matter how well you plan and execute the sample collection process, there is likely to be some error in the results. Sampling error is the difference between a sample result and the true population result; such an error results from chance sample fluctuations. Nonsampling error occurs when the sample data incorrectly collected, recorded, or analyzed (such as by selecting a nonrandom and biased sample, using a defective instrument, or copying the data incorrectly).
MATH 1050Y-A (FW 2012-13)
1 - 85
1 - 86
5 5
A. B. C. D. E.
1 - 87
1 - 88
Coming up
Our next step will be organizing and summarizing data Assignment #1 is posted Due Sept. 18th in seminar For next class: Read sections 2-1 to 2-2
1 - 89
1 - 90