Okay, now let’s generalize away from partial derivatives. The conceptual problem there was picking a bunch of specific directions as our basis, and restricting ourselves to that basis. So instead, let’s pick any direction at all, or even more generally than that.
Given a vector , we define the directional derivative of the function in the direction of by
It’s common to omit the brackets I’ve written in here, but that doesn’t make it as clear that we have a new function , and we’re asking for its value at . Instead, can suggest that we’re applying to the value . It’s also common to restrict to be a unit complex number, which is then used as a representative vector for all of those pointing in the same direction. I find that to be a needless hindrance, but others may disagree.
Anyhow, this looks a lot like our familiar derivative. Indeed, if we’re working in and we set we recover our regular derivative. And we have the same sort of interpretation: if we move a little bit in the direction of then we can approximate the change in
Now, does the existence of these limits guarantee the continuity of at ? No, not even the existence of all directional derivatives at a point assures us that the function will be continuous at that point. Indeed, we can consider another of our pathological cases
and patch it by defining . We take the directional derivative at using the direction vector
If then we find , while if we find . But we know that this function can’t be continuous, since if we approach the origin along the parabola we get a limit of instead of .
Again, the problem is that directional derivatives imply continuity along straight lines in various directions, but even continuity along every straight line through the point isn’t enough to assure continuity as a function of two variables, let alone more. We need something even stronger than directional derivatives.
On the other hand, directional derivatives are definitely stronger than partial derivatives. First of all, we haven’t had to make any choice of an orthonormal basis. But if we do have an orthonormal basis at hand, we find that partial derivatives are just particular directional derivatives
Incidentally, I’ve done two things here worth noting. First of all, I’ve gone back to using superscript indices for vector components. This allows the second thing, which is the transition from writing a function as taking one vector variable to rewriting the vector in terms of the basis at hand to writing the function as taking real variables . I know that some people don’t like superscript indices and the summation convention, but they’ll be standard when we get to more general spaces later, so we may as well get used to them now. Luckily, when we really understand something we shouldn’t have to pick coordinates, and indices only come into play when we do pick coordinates. Thus all the really meaningful statements shouldn’t have many indices to confuse us.