The Observed Significance of a Test
The Observed Significance
The conceptual basis of our testing procedure is that we reject only if the data that we obtained would constitute a rare event if
were actually true. The level of significance
specifies what is meant by "rare". The observed significance of the test is a measure of how rare the value of the test statistic that we have just observed would be if the null hypothesis were true. That is, the observed significance of the test just performed is the probability that, if the test were repeated with a new sample, the result of the new test would be at least as contrary to
and in support of
as what was observed in the original test.
Definition
The observed significance or -value of a specific test of hypotheses is the probability, on the supposition that
is true, of obtaining a result at least as contrary to
and in favor of
as the result actually observed in the sample data.
Think back to Note 8.27 "Example 4- in Section 8.2 "Large Sample Tests for a Population Mean" concerning the effectiveness of a new pain reliever. This was a left-tailed test in which the value of the test statistic was . To be as contrary to
and in support of
as the result
actually observed means to obtain a value of the test statistic in the interval
. Rounding
to
, we can read directly from Figure 12.2 "Cumulative Normal Probability" that
. Thus the
-value or observed significance of the test in Note 8.27 "Example 4". is 0.0294 or about 3%. Under repeated sampling from this population, if
were true then only about
of all samples of size 50 would give a result as contrary to
and in favor of
as the sample we observed. Note that the probability 0.0294 is the area of the left tail cut off by the test statistic in this left-tailed test.
Analogous reasoning applies to a right-tailed or a two-tailed test, except that in the case of a twotailed test being as far from as the observed value of the test statistic but on the opposite side of
is just as contrary to
as being the same distance away and on the same side of
, hence the corresponding tail area is doubled.
Computational Definition of the Observed Significance of a Test of Hypotheses
The observed significance of a test of hypotheses is the area of the tail of the distribution cut off by the test statistic (times two in the case of a two-tailed test).
EXAMPLE 6
Compute the observed significance of the test performed in Note 8.28 "Example 5" in Section 8.2 "Large Sample Tests for a Population Mean".
Solution:
The value of the test statistic was , which by Figure 12.2 "Cumulative Normal Probability" cuts off a tail of area
, as shown in Figure 8.7 "Area of the Tail for". Since the test was two-tailed, the observed significance is
.
Figure 8.7
Area of the Tail for Note 8.34 "Example 6"