The Unapologetic Mathematician

Mathematics for the interested outsider

Fermat’s Theorem

Okay, the Heine-Borel theorem tells us that a continuous real-valued function f on a compact space X takes a maximum and a minimum value. In particular, this holds for functions on closed intervals. But how can we recognize a maximum or a minimum when we see one?

First of all, what we get from the Heine-Borel theorem is a global maximum and minimum. That is, a point c\in X so that for any x\in X we have f(c)\geq f(x) (or f(c)\leq f(x)). We also can consider “local” maxima and minima. As you might guess from local connectedness and local compactness, a local maximum (minimum) c is a global maximum (minimum) in some neighborhood U\in\mathcal{N}(c). For example, if f is a function on some region in \mathbb{R} then having a local maximum at c means that there is some interval (a,b) with a<c<b, and for every x\in(a,b) we have f(c)\geq f(x).

So a function may have a number of local maxima and minima, but they’re not all global. Still, finding local maxima and minima is an important first step. In practice there’s only a finite number of them, and we can easily pick out which of them are global by just computing the function. So what do they look like?

For functions on regions in \mathbb{R}, the biggest part of the answer comes from Fermat’s theorem. The theorem itself actually talks about differentiable functions, so the first thing we’ll say is that an extremum may occur at a point where the function is not differentiable (though a point of nondifferentiability is not a sure sign of being an extremum).

Now, let’s say that we have a local maximum at c and that f is differentiable at c. We can set up the difference quotient \frac{f(x)-f(c)}{x-c}. When we take our limit as x goes to c, we can restrict to the neighborhood where c gives a global maximum, so f(x)-f(c)\leq0. To the right of c, x-c>0, so the difference quotient is negative here. To the left of c, x-c<0, so the difference quotient is positive here. Then since the limit must be a limit point of both of these regions, it must be {0}. That is, f'(c)=0. And the same thing happens for local minima.

So let’s define a “critical point” of a function to be one where either f isn’t differentiable or f'(c)=0. Then any local extremum must happen at a critical point. But not every critical point is a local extremum. The easiest example is f(x)=x^3, which has derivative f'(x)=3x^2. Then the only critical point is x=0, for which f(x)=0, but any neighborhood of x=0 has both positive and negative values of f(x), so it’s not a local maximum or minimum.

Geometrically, we should have expected as much as this. Remember that the derivative is the slope of the tangent line. At a local maximum, the function rises to the crest and falls again, and at the top the tangent line balances perfectly level with zero slope. We can see this when we draw the graph, and it provides the intuition behind Fermat’s theorem, but to speak with certainly we need the analytic definitions and the proof of the theorem.

January 21, 2008 Posted by | Analysis, Calculus | 7 Comments