Anda di halaman 1dari 8

HYPOTHESIS TESTING

A hypothesis is a claim. Usually there are two


rival claims.
Shower hypothesis: Water temperature is
acceptable /not
How to Develop a Hypothesis?
We need to translate a problem into a statement
involving a statistical measure. The measure or a
parameter like µ or p is then used in the
derivation of hypothesis. For example, the
CLAIM may be that education increases
earnings so µ =average earnings of the
educated. If the average earnings of the entire
population is known to be $30,000
Now the two rival claims are:
H0 µ ≤ 30000 versus Ha µ > 30000 .

H0 is called the null hypothesis. It says that the


educated earn no better than others.
Ha is called the alternative hypothesis, which
says that the educated earn more.
In this example, the alternative is one-sided. If
the claim were that the educated earn LESS than
others, then the alternative would be set up as Ha
µ < 30000.
Hypothesis testing is a scientific method of
choosing between two claims H0 and Ha
Some principles:
1) H0 is presumed true unless overwhelming
evidence rejects it. E.g., H0: defendant is not
guilty!
2)Sample data give test statistics for µ , p or σ 2
3)We reject the Null H0 if statistic falls in
Rejection Region
4) Null is usually a zero value (hence the name
null).
5) Instead of saying ACCEPT one says FAILS
TO REJECT.
6) Absolute certainty does not exist.

If there is a standard value for µ the null is H0


µ =std value (or true value). e.g. H0 µ =10 and
two-sided alternative is Ha µ ≠ 10, where it
could be larger than 10 or smaller than 10.

Need Skill to decide (1) appropriate statistical


parameter µ , p, etc. (2) appropriate Null (3)
One-sided or two-sided.
When one of the alternatives is selected, there
can be error.
Type I (α ) and Type II (β ) errors
Definition: Type I error: Selecting Ha when H0
should be selected.
H0 DOGS are dead, Ha DOGS are alive
Truth is DOGS are dead, still selecting Ha is
Type I error

Definition: Type II error: Selecting H0 when Ha


should be selected.
H0 DOGS are dead, Ha DOGS are alive
Truth is DOGS are alive, still selecting H0
means Type II error.

Definition: α denotes the probability of Type I


error = This is also called the “Level of test.”
Definition: β denotes the probability of Type II
error. This is usually hard to determine since it
depends on unknown parameter µ itself.
It is desirable to formulate the hypothesis so that
α is the most serious consequence. The two
types of errors are inversely related. There is a
trade-off between α and β . The smaller we
make α the larger the β we have to accept.
Hence one usually chooses α largest tolerable !
Steps in the Test of Hypothesis
1.Define the hyp. to be tested in plain English.
2.Select the appropriate statistical measure (such
as µ p, σ 2) to rephrase the hypothesis.
3.Determine whether hyp. should be 1 or 2-
sided.
4.State the hypothesis using the statistical
measure selected in step 2.
5.Specify α , the “level” of the test.
6.Select the appropriate test statistic, based on
the information at hand and the assumptions
you are willing to make.
7.Determine the critical value of the test statistic.
Three factors for critical value:
a. the type of alternative hypothesis,
(1) Two-sided (2) 1-sided left (3) 1-sided Right
b. the specification of a, the level of the test,
c. the distribution of the test statistic.
8.Collect sample data and compute the value of
the test statistic.
9.Make the decision. Is the value of the test
statistic in the rejection region?
a. If yes, reject the null hypothesis in favor of
the alternative.
b.If no, do not reject the null hypothesis.
10. State conclusion in terms of the orig.
question.
11-5 Testing a Hypothesis about a Population
Mean µ
Example illustrates the hypothesis testing
procedure.

A local school board member wants to know if


sophomore students at Lincoln High School
have approximately the same reading level as
the state average for tenth graders. The state
average is150 words per minute with a standard
deviation of 15. The level of the test is to be set
at .05. A random sample size of 100 tenth
graders has been drawn, and the resulting
average is 157 words per minute.

Step 1 (Define the hypothesis in plain English.)


The hypothesis is straight forward:
• Lincoln High School tenth graders are
reading at the state average.
• They are not.
Step 2 (select statistical measure) µ =average
of something. Number of words read per minute
by Lincoln Hi sophomores.
Step 3 (one or 2 sided) They want to know
either way, above or below average, so 2-sided.
Step 4 ( State H0 Ha ) H0 µ =150 the std. value
and Ha µ ≠ 150
Step 5 (Level of test?) The problem statement
says α = 0.05 or Type I error of 5%
Step 6 (select statistic) The choice depends on
answer to 3 questions. (i) Is σ known? (ii) Is
the distribution Normal? (iii) is n large?
Case 1: If σ is known, n is large, and
distribution is Normal, use Z=( y − µ 0 ) / σ ,
where σ =σ / √ n
Case 2: If σ is known, n is large, but the
distribution may not be Normal. Still, since
n>30 Central Limit Theorem suggests same Z
statistic.
Case 3: If σ is unknown, Normality not
assumed and n>30 we can simply use s instead
of σ in the above formula.
Step 7 (Determine critical value) For 5%, level
z=1.96. The rejection region is in Tails. “Fail to
Reject” region is at the center. Why?
Recall H0 µ =150 is the null. If true, the
difference ( y − µ ) should be near zero.
Since y is likely to be normal, its z-transform
is N(0,1), or the standard normal variable. Hence
from Normal Tables we can decide that if y >
1.96 or y < −1.96 it belongs in Tails where we
decide to reject the null of zero.
Step 8 (Compute the test statistic) If sample
mean y =157 we have Z= (157 − 150)/ SE,
where SE=σ / √ n = 15/10=1.5, Now Z=4.67.
This means that y is 4.67 standard deviations
away from hypothesized value of zero. This
seems too far away from the value of the null. Is
it too far from formal statistical viewpoint?
Step 9 (Make a decision) Critical value is 1.96
the test statistic is much larger than critical value
1.96 Decision must be to reject the null H0. The
difference between y and µ could not have
been caused by mere sampling variation. We
decide that the observed difference is
statistically significant. That is, the difference is
rather rare for a 5% test, so unlikely.
Step 10: (State conclusion) There is significant
evidence at the 5% level that tenth graders at
Lincoln Hi do not read at the state average. It
would be tempting to conclude that they read at
a higher level, but we did not set up our test as a
one-sided test. The conclusions must be
consistent with the hypothesis.

P-Values
Definition: The probability of observing a
value as extreme or more extreme than the
observed value.

It is computed as the tail area with reference


to the observed value. For one-sided test, it
is one tail, for two-sided test P-value is
computed as the sum of the two tail areas.

Decisions: If α > P-value Reject the Null.


For example, for a 95% test with α =0.05
we reject the null if the P-value tail is small,
say 0.04. If the tail is small, the observed
value is obviously in the rejection region.

Anda mungkin juga menyukai