The Mean Value Theorem and Its Consequences

Site: Saylor Academy
Course: MA005: Calculus I
Book: The Mean Value Theorem and Its Consequences
Printed by: Guest user
Date: Tuesday, May 7, 2024, 10:57 AM

Description

Read this section to learn about the Mean Value Theorem and its consequences. Work through practice problems 1-3.

Introduction

If you averaged 30 miles per hour during a trip, then at some instant during the trip you were traveling exactly 30 miles per hour.

That relatively obvious statement is the Mean Value Theorem as it applies to a particular trip. It may seem strange that such a simple statement would be important or useful to anyone, but the Mean Value Theorem is important and some of its consequences are very useful for people in a variety of areas. Many of the results in the rest of this chapter depend on the Mean Value Theorem, and one of the corollaries of the Mean Value Theorem will be used every time we calculate an "integral" in later chapters. A truly delightful aspect of mathematics is that an idea as simple and obvious as the Mean Value Theorem can be so powerful.

Before we state and prove the Mean Value Theorem and examine some of its consequences, we will consider a simplified version called Rolle's Theorem.


Source: Dale Hoffman, https://s3.amazonaws.com/saylordotorg-resources/wwwresources/site/wp-content/uploads/2012/12/MA005-4.2-Mean-Value-Theorem.pdf
Creative Commons License This work is licensed under a Creative Commons Attribution 3.0 License.

Rolle's Theorem

Suppose we pick any two points on the x-axis and think about all of the differentiable functions which go through those two points (Fig. 1).

Fig. 1

Since our functions are differentiable, they must be continuous and their graphs can not have any holes or breaks. Also, since these functions are differentiable, their derivatives are defined everywhere between our two points and their graphs can not have any "corners" or vertical tangents. The graphs of the functions in Fig. 1 can still have all sorts of shapes, and it may seem unlikely that they have any common properties other than the ones we have stated, but Michel Rolle found one. He noticed that every one of these functions has one or more points where the tangent line is horizontal (Fig. 2), and this result is named after him.

Fig. 2

Rolle's Theorem: If \mathrm{f}(\mathrm{a})=\mathrm{f}(\mathrm{b}), and \mathrm{f}(\mathrm{x}) is continuous for \mathrm{a} \leq \mathrm{x} \leq \mathrm{b} and differentiable for \mathrm{a} < \mathrm{x} < \mathrm{b}, then there is at least one number c, between a and b, so that f^{\prime}(c)=0.

Proof: We consider two cases: when f(x)=f(a) for all x in (a, b) and when f(x) \neq f(a) for some x in (a, b).

Case I, \mathrm{f}(\mathrm{x})=\mathrm{f}(\mathrm{a}) for all \mathrm{x} in (\mathrm{a}, \mathrm{b}): If \mathrm{f}(\mathrm{x})=\mathrm{f}(\mathrm{a}) for all \mathrm{x} between \mathrm{a} and \mathrm{b}, then \mathrm{f} is a horizontal line segment and \mathrm{f}^{\prime}(\mathrm{c})=0 for all values of \mathrm{c} strictly between \mathrm{a} and \mathrm{b}.

Case II, f(x) \neq f(a) for some x in (a, b): Since f is continuous on the closed interval [a, b], we know from the Extreme Value Theorem that \mathrm{f} must have a maximum value in the closed interval [\mathrm{a},
    \mathrm{b}] and a minimum value in the interval.

If f(x)>f(a) for some value of x in [a, b], then the maximum of f must occur at some value c strictly between a and b, a < c < b. (Why can't the maximum be at a or b?) Since f(c) is a local maximum of f, then c is a critical number of f and f^{\prime}(c)=0 or f^{\prime}(c) is undefined. But f is differentiable at all \mathrm{x} between \mathrm{a} and \mathrm{b}, so the only possibility left is that \mathrm{f}^{\prime}(\mathrm{c})=0.

If f(x) < f(a) for some value of x in [a, b], then f has a minimum at some value x=c strictly between a and b, and f^{\prime}(c)=0.

In either case, there is at least one value of \mathrm{c} between a and \mathrm{b} so that \mathrm{f}^{\prime}(\mathrm{c})=0.

Example 1: Show that f(x)=x^{3}-6 x^{2}+9 x+2 satisfies the hypotheses of Rolle's Theorem on the interval [0,3] and find the value of c which the theorem says exists.

Solution: \mathrm{f} is a polynomial so it is continuous and differentiable everywhere. \mathrm{f}(0)=2 and f(3)=2. f^{\prime}(x)=3 x^{2}-12 x+9=3(x-1)(x-3) so f^{\prime}(x)=0 at 1 and 3.
The value c=1 is between 0 and 3. Fig. 3 shows the graph of f.

Fig. 3

Practice 1: Find the value(s) of c for Rolle's Theorem for the functions in Fig. 4.

Fig. 4

The Mean Value Theorem

Geometrically, the Mean Value Theorem is a "tilted" version of Rolle's Theorem (Fig. 5). In each theorem we conclude that there is a number \mathrm{c} so that the slope of the tangent line to \mathrm{f} at \mathrm{x}=\mathrm{c} is the same as the slope of the line connecting the two ends of the graph of \mathrm{f} on the interval [\mathrm{a}, \mathrm{b}]. In Rolle's Theorem, the two ends of the graph of \mathrm{f} are at the same height, \mathrm{f}(\mathrm{a})=\mathrm{f}(\mathrm{b}), so the slope of the line connecting the ends is zero. In the Mean Value Theorem, the two ends of the graph of \mathrm{f} do not have to be at the same height so the line through the two ends does not have to have a slope of zero.

Fig. 5

Mean Value Theorem: If f(x) is continuous for a \leq x \leq b and differentiable for a < x < b,

then there is at least one number c, between a and b, so the tangent line at c is parallel to the secant line through the points (a, f(a)) and (b, f(b)): f^{\prime}(c)=\frac{f(b)-f(a)}{b-a}.

Proof: The proof of the Mean Value Theorem uses a tactic common in mathematics: introduce a new function which satisfies the hypotheses of some theorem we already know and then use the conclusion of that previously proven theorem. For the Mean Value Theorem we introduce a new function, \mathrm{h}(\mathrm{x}), which satisfies the hypotheses of Rolle's Theorem. Then we can be certain that the conclusion of Rolle's Theorem is true for \mathrm{h}(\mathrm{x}), and the Mean Value Theorem for \mathrm{f} follows from the conclusion of Rolle's Theorem for \mathrm{h}.

First, let \mathrm{g}(\mathrm{x}) be the straight line through the ends (\mathrm{a}, \mathrm{f}(\mathrm{a})) and (\mathrm{b}, \mathrm{f}(\mathrm{b})) of the graph of \mathrm{f}. The function \mathrm{g} goes through the point (\mathrm{a},
    \mathrm{f}(\mathrm{a})) so \mathrm{g}(\mathrm{a})=\mathrm{f}(\mathrm{a}). Similarly, \mathrm{g}(\mathrm{b})=\mathrm{f}(\mathrm{b}). The slope of the linear function g is \frac{f(b)-f(a)}{b-a} so g^{\prime}(x)=\frac{f(b)-f(a)}{b-a} for all x between a and b, and g is continuous and differentiable. (The formula for \mathrm{g} is \mathrm{g}(\mathrm{x})=\mathrm{f}(\mathrm{a})+\mathrm{m}(\mathrm{x}-\mathrm{a}) with \mathrm{m}=(\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a}))
    /(\mathrm{b}-\mathrm{a}).)

Define h(x)=f(x)-g(x) for a \leq x \leq b (Fig. 6). The function h satisfies the hypotheses of Rolle's theorem:
\mathrm{h}(\mathrm{a})=\mathrm{f}(\mathrm{a})-\mathrm{g}(\mathrm{a})=0 and \mathrm{h}(\mathrm{b})=\mathrm{f}(\mathrm{b})-\mathrm{g}(\mathrm{b})=0
\mathrm{h}(\mathrm{x}) is continuous for \mathrm{a} \leq \mathrm{x} \leq \mathrm{b} since both \mathrm{f} and \mathrm{g} are continuous there, and \mathrm{h}(\mathrm{x}) is differentiable for \mathrm{a} < \mathrm{x} < \mathrm{b} since both \mathrm{f} and \mathrm{g} are differentiable there, so the conclusion of Rolle's Theorem applies to \mathrm{h}: there is a c, between a and b, so that h^{\prime}(c)=0.

Fig. 6

The derivative of h(x)=f(x)-g(x) is h^{\prime}(x)=f^{\prime}(x)-g^{\prime}(x) so we know that there is a number c, between a and b, with h^{\prime}(c)=0. But 0=h^{\prime}(c)=f^{\prime}(c)-g^{\prime}(c) so f^{\prime}(c)=g^{\prime}(c)=\frac{f(b)-f(a)}{b-a}.

Graphically, the Mean Value Theorem says that there is at least one point \mathrm{c} where the slope of the tangent line, f^{\prime}(c), equals the slope of the line through the end points of the graph segment, (\mathrm{a}, \mathrm{f}(\mathrm{a})) and (\mathrm{b}, \mathrm{f}(\mathrm{b})). Fig. 7 shows the locations of the parallel tangent lines for several functions and intervals.

Fig. 7

The Mean Value Theorem also has a very natural interpretation if \mathrm{f}(\mathrm{x}) represents the position of an object at time x : f^{\prime}(x) represents the velocity of the object at the instant x, and \frac{\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})}{\mathrm{b}-\mathrm{a}}=\frac{\text
    { change in position }}{\text { change in time }} represents the average (mean) velocity of the object during the time interval from time a to time b. The Mean Value Theorem says that there is a time c, between a and b, when the instantaneous velocity, f^{\prime}(c), is equal to the average velocity for the entire trip, \frac{\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})}{\mathrm{b}-\mathrm{a}}. If your average velocity during a trip is 30 miles per hour, then at some instant during the trip you were traveling exactly 30 miles per hour.

Practice 2: For f(x)=5 x^{2}-4 x+3 on the interval [1,3], calculate m=\frac{f(b)-f(a)}{b-a} and find the value of c so that f^{\prime}(c)=m.

Some Consequences of the Mean Value Theorem

If the Mean Value Theorem was just an isolated result about the existence of a particular point \mathrm{c}, it would not be very important or useful. However, the Mean Value Theorem is the basis of several results about the behavior of functions over entire intervals, and it is these consequences which give it an important place in calculus for both theoretical and applied uses.

The next two corollaries are just the first of many results which follow from the Mean Value Theorem.

We already know, from the Main Differentiation Theorem, that the derivative of a constant function. f(x)=k is always 0, but can a nonconstant function have a derivative which is always 0? The first corollary says no.

Corollary 1: If \mathrm{f}^{\prime}(\mathrm{x})=0 for all \mathrm{x} in an interval \mathrm{I}, then \mathrm{f}(\mathrm{x})=\mathrm{K}, a constant, for all \mathrm{x} in \mathrm{I}.

Proof: Assume \mathrm{f}^{\prime}(\mathrm{x})=0 for all \mathrm{x} in an interval \mathrm{I}, and pick any two points a and \mathrm{b}(\mathrm{a} \neq \mathrm{b}) in the interval. Then, by the Mean Value Theorem, there is a number c between a and b so that \mathrm{f}^{\prime}(\mathrm{c})=\frac{\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})}{\mathrm{b}-\mathrm{a}}. By our assumption, \mathrm{f}^{\prime}(\mathrm{x})=0 for all \mathrm{x} in \mathrm{I} so we know that 0=\mathrm{f}^{\prime}(\mathrm{c})=\frac{\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})}{\mathrm{b}-\mathrm{a}} and we can conclude that \mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})=0 and \mathrm{f}(\mathrm{b})=\mathrm{f}(\mathrm{a}). But a and \mathrm{b} were any two points in I, so the value of f(x) is the same for any two values of x in I, and f is a constant function on the interval I.

We already know that if two functions are parallel (differ by a constant), then their derivatives are equal, but can two nonparallel functions have the same derivative? The second corollary says no.

Corollary 2: If f^{\prime}(x)=g^{\prime}(x) for all x in an interval I, then f(x)-g(x)=K, a constant, for all x in I, so the graphs of \mathrm{f} and \mathrm{g} are "parallel" on the interval I.

Proof: This corollary involves two functions instead of just one, but we can imitate the proof of the Mean Value Theorem and introduce a new function \mathrm{h}(\mathrm{x})=\mathrm{f}(\mathrm{x})-\mathrm{g}(\mathrm{x}). The function \mathrm{h} is differentiable, and h^{\prime}(x)=f^{\prime}(x)-g^{\prime}(x)=0 for all x in I, so, by Corollary 1, h(x) is a constant function and \mathrm{K}=\mathrm{h}(\mathrm{x})=\mathrm{f}(\mathrm{x})-\mathrm{g}(\mathrm{x}) for all \mathrm{x} in the interval. Then \mathrm{f}(\mathrm{x})=\mathrm{g}(\mathrm{x})+\mathrm{K}.

We will use Corollary 2 hundreds of times in Chapters 4 and 5 when we work with "integrals". Typically you will be given the derivative of a function, f^{\prime}(x), and asked to find all functions f which have that derivative. Corollary 2 tells us that if we can find one function \mathrm{f} which has the derivative we want, then the only other functions which have the same derivative are \mathrm{f}(\mathrm{x})+\mathrm{K}: once you find one function with the right derivative, you have essentially found all of them.

Example 2: (a) Find all functions whose derivatives equal 2 \mathrm{x}.
(b) Find a function g(x) with g^{\prime}(x)=2 x and g(3)=5.

Solution: (a) We can recognize that if \mathrm{f}(\mathrm{x})=\mathrm{x}^{2} then \mathrm{f}^{\prime}(\mathrm{x})=2 \mathrm{x} so one function with the derivative we want is f(x)=x^{2}. Corollary 2 guarantees that every function g whose derivative is 2 x has the form \mathrm{g}(\mathrm{x})=\mathrm{f}(\mathrm{x})+\mathrm{K}=\mathrm{x}^{2}+\mathrm{K}. The only functions with derivative 2 \mathrm{x} have the form \mathrm{x}^{2}+\mathrm{K}.

(b) Since g^{\prime}(x)=2 x, we know that g must have the form g(x)=x^{2}+K, but this is a whole "family" of functions (Fig. 8), and we want to find one member of the family . We know that g(3)=5 so we want to find the member of the family which goes through the point (3,5). All we need to do is replace the \mathrm{g}(\mathrm{x}) with 5 and the \mathrm{x} with 3 in the formula \mathrm{g}(\mathrm{x})=\mathrm{x}^{2}+\mathrm{K}, and then solve for the value of \mathrm{K}:
    5=\mathrm{g}(3)=(3)^{2}+\mathrm{K} so K=-4. The function we want is g(x)=x^{2}-4.

Fig. 8

Practice 3: Restate Corollary 2 as a statement about the positions and velocities of two cars.

Practice Answers

Practice 1: \mathrm{f}^{\prime}(\mathrm{x})=0 when \mathrm{x}=2 and 6 so \mathrm{c}=2 and \mathrm{c}=6.

\mathrm{g}^{\prime}(\mathrm{x})=0 when \mathrm{x}=2, 4, and 6 so \mathrm{c}=2, \mathrm{c}=4, and \mathrm{c}=6.


Practice 2: f(x)=5 x^{2}-4 x+3 on [1,3]. f(1)=4 and f(3)=36 so

\mathrm{m}=\frac{\mathrm{f}(\mathrm{b})-\mathrm{f}(\mathrm{a})}{\mathrm{b}-\mathrm{a}}=\frac{36-4}{3-1}=16

\mathrm{f}^{\prime}(\mathrm{x})=10 \mathrm{x}-4 so \mathrm{f}^{\prime}(\mathrm{c})=10 \mathrm{c}-4=16 if 10 \mathrm{c}=20 and \mathbf{c}=\mathbf{2}.
The graph of \mathrm{f} and the location of \mathrm{c} are shown in Fig. 16.

Fig. 16

Practice 3: If two cars have the same velocities during an interval of time (\mathrm{f}^{\prime}(\mathrm{t})=\mathrm{g}^{\prime}(\mathrm{t}) for \mathrm{t} in \mathrm{I}) then the cars are always a constant distance apart during that time interval.

(Note: The "same velocity" means same speed and same direction. If two cars are traveling at the same speed but in different directions, then the distance between them changes and is not constant)