Terms and definitions from descriptive statistics readily carry over to situations where values are countable. The set of outcomes for a coin flip, the roll of a dice, or a set of cards are all examples of discrete random variables. We can then put concepts such as mean and standard deviation on a firmer mathematical footing by defining the expected value and the variance of a discrete random variable.
Mean or Expected Value and Standard Deviation
The expected value of a discrete random variable X, symbolized as E(X), is often referred to as the long-term average or mean (symbolized as μ). This means that over the long term of doing an experiment over and over, you would expect this average. For example, let X = the number of heads you get when you toss three fair coins. If you repeat this experiment (toss three fair coins) a large number of times, the expected value of X is the number of heads you expect to get for each three tosses on average.
NOTE
To find the expected value, E(X), or mean μ of a discrete random variable X, simply multiply each value of the random variable by its probability and add the products. The formula is given as
Here x represents values of the random variable X, P(x) represents the corresponding probability, and symbol represents the sum of all products xP(x). Here we use symbol μ for the mean because it is a parameter. It represents the mean of a population.
Example 4.3
A men's soccer team plays soccer zero, one, or two days a week. The probability that they play zero days is .2, the probability that they play one day is .5, and the probability that they play two days is .3. Find the long-term average or expected value, μ, of the number of days per week the men's soccer team plays soccer.
To do the problem, first let the random variable X = the number of days the men's soccer team plays soccer per week. X takes on the values 0, 1, 2. Construct a PDF table adding a column x*P(x), the product of the value x with the corresponding probability P(x). In this column, you will multiply each x value by its probability.
x | P(x) | x*P(x) |
---|---|---|
0 | .2 | (0)(.2) = 0 |
1 | .5 | (1)(.5) = .5 |
2 | .3 | (2)(.3) = .6 |
As you learned, if you toss a fair coin, the probability that the result is heads is 0.5. This probability is a theoretical probability, which is what we expect to happen. This probability does not describe the short-term results of an experiment. If you flip a coin two times, the probability does not tell you that these flips will result in one head and one tail. Even if you flip a coin 10 times or 100 times, the probability does not tell you that you will get half tails and half heads. The probability gives information about what can be expected in the long term. To demonstrate this, Karl Pearson once tossed a fair coin 24,000 times! He recorded the results of each toss, obtaining heads 12,012 times. The relative frequency of heads is 12,012/24,000 = .5005, which is very close to the theoretical probability .5. In his experiment, Pearson illustrated the law of large numbers.
The law of large numbers states that, as the number of trials in a probability experiment increases, the difference between the theoretical probability of an event and the relative frequency approaches zero (the theoretical probability and the relative frequency get closer and closer together). The relative frequency is also called the experimental probability, a term that means what actually happens.
In the next example, we will demonstrate how to find the expected value and standard deviation of a discrete probability distribution by using relative frequency.
NOTE
The formula of the varianceExample 4.4
Try It 4.4
Example 4.5
Suppose you play a game of chance in which five numbers are chosen from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. A computer randomly selects five numbers from zero to nine with replacement. You pay $2 to play and could profit $100,000 if you match all five numbers in order (you get your $2 back plus $100,000). Over the long term, what is your expected profit of playing the game?To do this problem, set up a PDF table for the amount of money you can profit.
Let X = the amount of money you profit. If your five numbers match in order, you will win the game and will get your $2 back plus $100,000. That means your profit is $100,000. If your five numbers do not match in order, you will lose the game and lose your $2. That means your profit is -$2. Therefore, X takes on the values $100,000 and –$2. That is the second column x in the PDF table below.
To win, you must get all five numbers correct, in order. The probability of choosing the correct first number is
To get the fourth column xP(x) in the table, we simply multiply the value x with the corresponding probability P(x).
x |
P(x) | x*P(x) | |
---|---|---|---|
Loss |
–2 | .99999 | (–2)(.99999) = –1.99998 |
Profit
|
100,000 | .00001 | (100000)(.00001) = 1 |
Table 4.9
Try It 4.5
Example 4.6
Suppose you play a game with a biased coin. You play each game by tossing the coin once.Solution 1
a. X = amount of profitSolution 2
Solution 3
c. Add the last column of the table. The expected valueTry It 4.6
Example 4.7
Solution 1
(1, 1) | (1, 2) | (1, 3) | (1, 4) | (1, 5) | (1, 6) |
(2, 1) | (2, 2) | (2, 3) | (2, 4) | (2, 5) | (2, 6) |
(3, 1) | (3, 2) | (3, 3) | (3, 4) | (3, 5) | (3, 6) |
(4, 1) | (4, 2) | (4, 3) | (4, 4) | (4, 5) | (4, 6) |
(5, 1) | (5, 2) | (5, 3) | (5, 4) | (5, 5) | (5, 6) |
(6, 1) | (6, 2) | (6, 3) | (6, 4) | (6, 5) | (6, 6) |
Table 4.13
Add the values in the third column to find the expected value:
Add the values in the fourth column and take the square root of the sum:
Some of the more common discrete probability functions are binomial, geometric, hypergeometric, and Poisson. Most elementary courses do not cover the geometric, hypergeometric, and Poisson. Your instructor will let you know if he or she wishes to cover these distributions.
A probability distribution function is a pattern. You try to fit a probability problem into a pattern or distribution in order to perform the necessary calculations. These distributions are tools to make solving probability problems easier. Each distribution has its own special characteristics. Learning the characteristics enables you to distinguish among the different distributions.