Continuous Probability Functions and the Uniform Distribution
Site: | Saylor Academy |
Course: | CS250: Python for Data Science |
Book: | Continuous Probability Functions and the Uniform Distribution |
Printed by: | Guest user |
Date: | Friday, 4 April 2025, 5:44 AM |
Description
Once you have grasped the notion of a discrete random variable, it should be clear that all random variables need not be discrete. For example, consider measuring the atmospheric temperature at some prescribed location. The measured temperature would be random and could take on a continuum of values (theoretically speaking). Under these circumstances, we say that the random variable is continuous. All the machinery developed for discrete random values (such as expected value, variance, and mean) must be elevated to continuous random variables to handle this situation. The uniform distribution (which you have programmed using the random module) is an example of a continuous probability distribution.
Continuous Probability Functions
We begin by defining a continuous probability density function. We use the function notation f(x). Intermediate algebra may have been your first formal introduction to functions. In the study of probability, the functions we study are special. We define the function f(x) so that the area between it and the x-axis is equal to a probability. Since the maximum probability is one, the maximum area is also one. For continuous probability distributions, PROBABILITY = AREA.
Example 5.1
Consider the function for 0 ≤ x ≤ 20. x = a real number. The graph of
is a horizontal line. However, since 0 ≤ x ≤ 20, f(x) is restricted to the portion between x = 0 and x = 20, inclusive.
Figure 5.5
for 0 ≤ x ≤ 20.
The graph of is a horizontal line segment when 0 ≤ x ≤ 20.
The area between where 0 ≤ x ≤ 20 and the x-axis is the area of a rectangle with base = 20 and height =
.
Suppose we want to find the area between and the x-axis where 0 < x < 2.
Figure 5.6
Reminder
The area corresponds to a probability. The probability that x is between zero and two is 0.1, which can be written mathematically as P(0 < x < 2) = P(x < 2) = 0.1.
Suppose we want to find the area between and the x-axis where 4 < x < 15.
Figure 5.7
(15 – 4) = 11 = the base of a rectangle
The area corresponds to the probability P(4 < x < 15) = 0.55.
Suppose we want to find P(x = 15). On an x-y graph, x = 15 is a vertical line. A vertical line has no width (or zero width). Therefore,
Figure 5.8
P(X <= x), which can also be written as P(X < x) for continuous distributions, is called the cumulative distribution function or CDF. Notice the less than or equal to symbol. We can also use the CDF to calculate P(X > x). The CDF gives area to the left and P(X > x) gives area to the right. We calculate P(X > x) for continuous distributions as follows: P(X > x) = 1 – P (X < x).
Figure 5.9
Label the graph with f(x) and x. Scale the x and y axes with the maximum x and y values. ,
.
To calculate the probability that x is between two values, look at the following graph. Shade the region between x = 2.3 and x = 12.7. Then calculate the shaded area of a rectangle.
Figure 5.10
Try It 5.1
Consider the function for
. Draw the graph of f(x) and find P(2.5 < x < 7.5).
Source: OpenStax, https://openstax.org/books/statistics/pages/5-introduction
This work is licensed under a Creative Commons Attribution 4.0 License.
The Uniform Distribution
The uniform distribution is a continuous probability distribution and is concerned with events that are equally likely to occur. When working out problems that have a uniform distribution, be careful to note if the data are inclusive or exclusive of endpoints.
Example 5.2
The data in Table 5.1 are 55 smiling times, in seconds, of an eight-week-old baby.
10.4 | 19.6 | 18.8 | 13.9 | 17.8 | 16.8 | 21.6 | 17.9 | 12.5 | 11.1 | 4.9 |
12.8 | 14.8 | 22.8 | 20.0 | 15.9 | 16.3 | 13.4 | 17.1 | 14.5 | 19.0 | 22.8 |
1.3 | 0.7 | 8.9 | 11.9 | 10.9 | 7.3 | 5.9 | 3.7 | 17.9 | 19.2 | 9.8 |
5.8 | 6.9 | 2.6 | 5.8 | 21.7 | 11.8 | 3.4 | 2.1 | 4.5 | 6.3 | 10.7 |
8.9 | 9.4 | 9.4 | 7.6 | 10.0 | 3.3 | 6.7 | 7.8 | 11.6 | 13.8 | 18.6 |
Table 5.1
The sample mean = 11.49 and the sample standard deviation = 6.23.
We will assume that the smiling times, in seconds, follow a uniform distribution between zero and 23 seconds, inclusive. This means that any smiling time from zero to and including 23 seconds is equally likely. The histogram that could be constructed from the sample is an empirical distribution that closely matches the theoretical uniform distribution.
Let X = length, in seconds, of an eight-week-old baby's smile.
The notation for the uniform distribution is
X ~ U(a, b) where a = the lowest value of x and b = the highest value of x.
The probability density function is
For this example, X ~ U(0, 23) and
Formulas for the theoretical mean and standard deviation are
Try It 5.2
1 | 12 | 4 | 10 | 4 | 14 | 11 |
7 | 11 | 4 | 13 | 2 | 4 | 6 |
3 | 10 | 0 | 12 | 6 | 9 | 10 |
5 | 13 | 4 | 10 | 14 | 12 | 11 |
6 | 10 | 11 | 0 | 11 | 13 | 2 |
Example 5.3
a. Refer to Example 5.2. What is the probability that a randomly chosen eight-week-old baby smiles between two and 18 seconds?Solution 1

b. Find the 90th percentile for an eight-week-old baby's smiling time.
Solution 2

c. Find the probability that a random eight-week-old baby smiles more than 12 seconds knowing that the baby smiles more than eight seconds.
Solution 3
Find P(x > 12|x > 8) There are two ways to do the problem. For the first way, use the fact that this is a conditional and changes the sample space. The graph illustrates the new sample space. You already know the baby smiled more than eight seconds.
for 8 < x < 23
Write a new f(x):
for 8 < x < 23.

For the second way, use the conditional formula from Probability Topics with the original distribution.
For this problem, A is (x > 12) and B is (x > 8).
So,

Try It 5.3
Example 5.4
The amount of time, in minutes, that a person must wait for a bus is uniformly distributed between zero and 15 minutes, inclusive.Solution 1
a. Let X = the number of minutes a person must wait for a bus. a = 0 and b = 15. X ~ U(0, 15). Write the probability density function.
Solution 2
b.Solution 32
k is sometimes called a critical value.

Try It 5.4
The total duration of baseball games in the major league in the 2011 season is uniformly distributed between 447 hours and 521 hours inclusive.- Find a and b and describe what they represent.
- Write the distribution.
- Find the mean and the standard deviation.
- What is the probability that the duration of games for a team for the 2011 season is between 480 and 500 hours?
- What is the 65th percentile for the duration of games for a team for the 2011 season?
Example 5.5
Suppose the time it takes a nine-year old to eat a donut is between 0.5 and 4 minutes, inclusive. Let X = the time, in minutes, it takes a nine-year-old child to eat a doughnut. Then X ~ U(0.5, 4).Solution 1
a. 0.5714b. Find the probability that a different nine-year-old child eats a doughnut in more than two minutes given that the child has already been eating the doughnut for more than 1.5 minutes.
The second question has a conditional probability. You are asked to find the probability that a nine-year-old child eats a doughnut in more than two minutes given that the child has already been eating the donut for more than 1.5 minutes. Solve the problem two different ways (see Example 5.3). You must reduce the sample space. First way: Since you know the child has already been eating the doughnut for more than 1.5 minutes, you are no longer starting at a = 0.5 minutes. Your starting point is 1.5 minutes.
Write a new f(x):

Solution 2
The probability that a nine-year-old child eats a donut in more than two minutes given that the child has already been eating the doughnut for more than 1.5 minutes is
Try It 5.5
Suppose the time it takes a student to finish a quiz is uniformly distributed between six and 15 minutes, inclusive. Let X = the time, in minutes, it takes a student to finish a quiz. Then X ~ U(6, 15).Example 5.6
Ace Heating and Air Conditioning Service finds that the amount of time a repairman needs to fix a furnace is uniformly distributed between 1.5 and four hours. Let x = the time needed to fix a furnace. Then x ~ U(1.5, 4).- Find the probability that a randomly selected furnace repair requires more than two hours.
- Find the probability that a randomly selected furnace repair requires less than three hours.
- Find the 30th percentile of furnace repair times.
- The longest 25 percent of furnace repair times take at least how long? (In other words: find the minimum time for the longest 25 percent of repair times.) What percentile does this represent?
- Find the mean and standard deviation
Solution 1

Solution 2
b. P(x < 3) = (base)(height) = (3 – 1.5)(0.4) = 0.6The graph of the rectangle showing the entire distribution would remain the same. However the graph should be shaded between x = 1.5 and x = 3. Note that the shaded area starts at x = 1.5 rather than at x = 0. Because X ~ U(1.5, 4), x cannot be less than 1.5.

Solution 3
c.

P (x < k) = 0.30
P(x < k) = (base)(height) = (k – 1.5)(0.4)
0.3 = (k – 1.5) (0.4); Solve to find k:
0.75 = k – 1.5, obtained by dividing both sides by 0.4
k = 2.25, obtained by adding 1.5 to both sides
Solution 4
d.
P(x > k) = 0.25
P(x > k) = (base)(height) = (4 – k)(0.4)
0.25 = (4 – k)(0.4); Solve for k:
0.625 = 4 − k,
obtained by dividing both sides by 0.4
−3.375 = −k,
Solution 5
Try It 5.6
The amount of time a service technician needs to change the oil in a car is uniformly distributed between 11 and 21 minutes. Let X = the time needed to change the oil on a car.- Write the random variable X in words. X = __________________.
- Write the distribution.
- Graph the distribution.
- Find P (x > 19).
- Find the 50th percentile.