First, this section talks about the history of the normal distribution and the central limit theorem and the relation of normal distributions to errors. Then, it discusses how to compute the area under the normal curve. It then moves on to the normal distribution, the area under the standard normal curve, and how to translate from non-standard normal to standard normal. Finally, it addresses how to compute (cumulative) binomial probabilities using normal approximations.
Learning Objectives
- State the relationship between the normal distribution and the binomial distribution
- Use the normal distribution to approximate the binomial distribution
- State when the approximation is adequate
In the section on the history of the normal distribution, we saw that the normal distribution can be used to approximate the binomial distribution. This section shows how to compute these approximations.
Let's begin with an example. Assume you have a fair coin and wish to know the probability that you would get 8 heads out of 10 flips. The binomial distribution has a mean of and a variance of
. The standard deviation is therefore 1.5811. A total of 8 heads is
standard deviations above the mean of the distribution. The question then is, "What is the probability of getting a value exactly
standard deviations above the mean?" You may be surprised to learn that the answer is 0 : The probability of any one specific point is 0. The problem is that the binomial distribution is a discrete probability distribution, whereas the normal distribution is a continuous distribution.
The solution is to round off and consider any value from to
to represent an outcome of 8 heads. Using this approach, we figure out the area under a normal curve from
to
. The area in green in Figure 1 is an approximation of the probability of obtaining 8 heads.
Figure 1. Approximating the probability of 8 heads with the normal distribution.
The solution is therefore to compute this area. First we compute the area below 8.5 and then subtract the area below 7.5.
The results of using the normal area calculator to find the area below 8.5 are shown in Figure 2. The results for 7.5 are shown in Figure 3.
Figure 2. Area below 8.5.
Figure 3. Area below 7.5.
The difference between the areas is 0.044, which is the approximation of the binomial probability. For these parameters, the approximation is very accurate. The demonstration in the next section allows you to explore its accuracy with different parameters.
If you did not have the normal area calculator, you could find the solution using a table of the standard normal distribution (a Z table) as follows:
1. Find a score for
using the formula
.
3. Find a score for
using the formula
.
5. Subtract the value in step 4 from the value in step 2 to get .
The same logic applies when calculating the probability of a range of outcomes. For example, to calculate the probability of 8 to 10 flips, calculate the area from to
The accuracy of the approximation depends on the values of and
. A rule of thumb is that the approximation is good if both
and
are both greater than 10.