Higher Differentials and Composite Functions
Last time we saw an example of what can go wrong when we try to translate higher differentials the way we did the first-order differential. Today I want to identify exactly what goes wrong, and I’ll make use of the summation convention to greatly simplify the process.
So, let’s take a function of
variables
and a collection of
functions
, each depending on
variables
. We can think of these as the components of a vector-valued function
which has continuous second partial derivatives on some region
. If the function
has continuous second partial derivatives on some region
containing the image
, then we can compose the two functions to give a single function
, and we’re going to investigate the second differential of
with respect to the variables
.
To that end, we want to calculate the second partial derivative
First, we take the derivative in terms of , and we use the chain rule to write
Now we have to take the derivative in terms of . Luckily, this operation is linear, so we don’t have to worry about the hidden summations in the notation. We do, however, have to use the product rule to handle the multiplications
where we’ve used the chain rule again to convert a derivative in terms of into one in terms of
.
And here we’ve come to the problem itself. For we can write out the second differential in terms of the
The first term here is the second differential in terms of the . If there were an analogue of Cauchy’s invariant rule, this would be all there is to the formula. But we’ve got another term — one due to the product rule — based on the second differentials of the functions
themselves. This is the term that ruins the nice transformation properties of higher differentials, and which makes them unsuitable for many of our purposes.
Notice, though, that we have not contradicted Clairaut’s theorem here. Indeed, as long as and all the
have continuous second partial derivatives, then so will
. Further, the formula we derived for the second partial derivatives of
is manifestly symmetric between the two derivatives, and so the mixed partials commute.
[…] Like I said yesterday, because of extraneous terms the higher differentials don’t transform well, and so […]
Pingback by Taylor’s Theorem « The Unapologetic Mathematician | October 20, 2009 |
[…] where I’ve used the physicists’ convention on the variables instead of the common one in multivariable calculus classes. Then we could plug these expressions for , , and into our function , and get a composite function of the variables and , which we can then attack with the tools from the last couple days, being careful about when we can and can’t trust Cauchy’s invariant rule, since the second differential can transform oddly. […]
Pingback by Extrema with Constraints I « The Unapologetic Mathematician | November 25, 2009 |