Integrable Distributions Have Integral Submanifolds
Let’s say that we have a one-dimensional distribution on a manifold . Around any point
we can find a patch
and an everywhere-nonzero vector field
on
so that
spans
for every
. Then we know we can find a chart
around
such that
on
. Then the curve with coordinates
and
for all other
is a one-dimensional integral submanifold of
through
.
Now, this doesn’t always work for every distribution . It turns out that the key ingredient is that all one-dimensional distributions are integrable; we can show that any integrable
-dimensional distribution has an integrable submanifold through any point
.
To give an even more detailed statement, let be a
-dimensional integrable distribution on
. for every
there is a chart
with
,
, and so that for any
the set
is an integral submanifold of
. Further, any connected integral submanifold of
comes in this way.
Since this statement is purely local we can get away with working in some set of local coordinates to start with, although obviously not the one we’re trying to find. As such, we will assume that , that
, and that
is spanned by the
for
. We’ll also let
be the projection onto the first
components, so
is an isomorphism. By parallel translation,
for all
in a neighborhood
of
.
Now, like we did yesterday we can use these isomorphisms to build vector fields
on
belonging to
that are
-related to the
on
. Then
, but since
is integrable we know that
. Since
is an isomorphism on
, we conclude that
. Now we can find a coordinate patch
around
with
on
, just as in the one-dimensional case. It’s no loss of generality to tweak it until we have
. This gives us an integral submanifold through the origin.
But our assertion goes further! Let , where
is the complementary projection to
. This map has maximal rank everywhere, so we know that for each
the preimage
is an
-dimensional submanifold
. The tangent space to
consists of exactly those vectors in the kernel of
, but since
is a diffeomorphism these are exactly those vectors
such that
is in the kernel of
— those
.
Conversely, if is a connected integral manifold of
contained in
. If
, then
is in
, which is spanned by the
for
. Thus
. And so
for all
. Since
is connected,
is constant, and thus
comes from picking a value
for each
.
Integral Submanifolds
Given a -dimensional distribution
on an
-dimensional manifold
, we say that a
-dimensional submanifold
is an “integral submanifold” of
if
for every
. That is, if the subspace of
spanned by the images of vectors from
is exactly
.
This is a lot like an integral curve, with one slight distinction: in the case on an integral curve we also demand that the length of match that of
, not just the direction (up to sign).
Now, if for every there exists an integral submanifold
of
with
, then
is integrable. Indeed, let
and
belong to
. Since
is an isomorphism of vector spaces at every point, we can find
and
that are
-related to
and
, respectively. That is,
for all
, and similarly for
and
. But then we know that
, and so
.
Distributions
A vector field defines a one-dimensional subspace of
at any point
with
: the subspace spanned by
. If
is everywhere nonzero, then it defines a one-dimensional subspace of each tangent space. A distribution generalizes this sort of thing to higher dimensions.
To this end, we define a -dimensional distribution
on an
-dimensional manifold
to be a map
, where
is a
-dimensional subspace of
. Further, we require that this map be “smooth”, in the sense that for any
there exists some neighborhood
of
and
vector fields
such that the vectors
span
for each
.
Notice here that the don’t have to work for the whole manifold
. Indeed, we will see that in many cases there are no everywhere-nonzero vector fields on a manifold
. But over a small patch
we might more easily find
vector fields that are linearly independent at each point, and thus define a smooth
-dimensional distribution over
. Then more general smooth distributions come from patching these sorts of smooth distributions together.
A vector field on
“belongs to” a distribution
— which we write
— if
for all
. We say that
is “integrable” if
for all
and
belonging to
.
Every one-dimensional manifold is integrable. To see this, we note that if and
belong to
then
for some constant
, at least at those points
where
. Thus we see that
and so is proportional to
, and thus belongs to
. To handle points where
, we can put the scalar multiplier on the other side.
What Does the Bracket Measure? (part 2)
Today we’ll prove the assertions we made last time: if and
are vector fields with flows
and
, respectively in some neighborhood
of some point
, we define the curve
We assert that for any smooth we have
To show the first, we define the three “rectangles”
Notice that ,
, and
. The chain rule lets us then calculate:
as asserted. As for the other assertion, we start by observing
Using the fact that we can turn the first term on the right into
Now, similar tedious calculations that make the big one above look like idle doodling give us two more identities:
and we conclude that
as asserted.
What Does the Bracket Measure? (part 1)
We again pick up the question we posed earlier about what the bracket measures. Clearly it should have something to do with flows and their failure to commute. So we consider vector fields and
with flows
and
, respectively.
Now, starting with a point and given some small time displacement
we can flow along
and then
—
— or we can flow along
and then
—
— and get two (potentially different) points in
. Unfortunately, we can’t just take their difference as we could when measuring the failure of algebra elements to commute.
However, we do have something rather closely related: the commutator in the sense of a group. That is, we can flow forwards along , then along
, then backwards along
and then backwards along
to get the point
:
If and
were to commute this would just be the constant curve
, but in general it’s a smooth curve passing through
at
. The bracket
will indicate the direction in which
passes through
, and something about its speed.
Now, it turns out that the derivative always vanishes at
. In fact,
is more closely related to the “second derivative” of
, though it’s not immediately clear what this means. Indeed, we can define
for any
in an interval around
, but
, and there is in general no good way to compare these vectors for different values of
.
Instead, we will prove the following two facts for any smooth — here “smoothness” entails twice differentiability — where
is any neighborhood of
:
The first indicates that . Indeed, if
then we could surely find some
which changes in the direction it points, which would give a nonzero value to
. The second gives the result we’re really after, as Taylor’s theorem applied to
shows us that
And thus the bracket indeed is the second (and lowest) order term in describing the direction of as it passes through
.
Building Charts from Vector Fields
Sorry for the delay; I’ve been swamped at my actual job the last couple days.
Anyway, I left off last time by pointing out that coordinate vector fields commute:
Today I want to show a certain converse: if we have vector fields
on some open region
that are linearly independent at some
and which commute —
for all
and
— then we can find some coordinate chart
around
so that the
are the first
coordinate vector fields. That is,
In particular, if is a vector field with
then there is a coordinate chart
around
with
.
If is any chart, then we can describe the
th coordinate vector field by saying it’s the unique vector field on
that is
-related to the
th partial derivative in
. That is, we’re trying to prove that:
on
.
In fact, we can further simplify our claim by assuming that ,
, and
— that the vector fields agree at the point
. Indeed, if
is any coordinate map taking
to
then we can define the vector fields
on
. These must have vanishing brackets because we can calculate:
What’s more, if is a local diffeomorphism of
with
, then
is a coordinate map satisfying our assertion.
Now, let be the flow of
, and let
be a small enough neighborhood of
that we can define
by
The order that the flows come in here doesn’t matter, since we’re assuming that the — and thus their flows — commute. Anyway, given any smooth test function
on
we can check
That is, . To see this for any other
, simply swap around the flows to bring
to the front.
We can also check that
Thus is the identity transformation on
. The inverse function theorem now tells us that there is a chart
around
with
, which will then satisfy our assertions.
Brackets and Flows
Now, what does the Lie bracket of two vector fields really measure? We’ve gone through all this time defining and manipulating a bunch of algebraic expressions, but this is supposed to be geometry! What does the bracket actually mean? It turns out that the bracket of two vector fields measures the extent to which their flows fail to commute.
We won’t work this all out today, but we’ll start with an important first step: the bracket of two vector fields vanishes if and only if their flows commute. That is, if and
are vector fields with flows
and
, respectively, then
if and only if
for all
and
.
First we assume that the flows commute. As we just saw last time, the fact that for all
means that
is
-invariant. That is,
. But this implies that the Lie derivative
vanishes, and we know that
.
Conversely, let’s assume that . For any
we can define the curve
in the tangent space
by
. Since the Lie derivative vanishes, we know that
, and I say that
for all
, or (equivalently) that
.
Fixing any we can set
. Then we calculate
Now this means that is
-invariant for all
, meaning that
and
commute for all
and
, as asserted.
As a special case, if is a coordinate patch then we have the coordinate vector fields
. The fact that partial derivatives commute means that the brackets disappear:
This corresponds to the fact that adding to the
th coordinate and
to the
th coordinate can be done in either order. That is, their flows commute.
Invariance and Flows
When we discussed the Lie algebra of a Lie group we discussed “left-invariant” vector fields. More generally than this if is a diffeomorphism we say that a vector field
is “
-invariant” if it is
-related to itself. That is, a vector field on a Lie group
is left-invariant if it is
-invariant for all
.
Now we want a characterization of -invariance in terms of the flow
of
. I say that
is
-invariant if and only if
for all
. That is, the flow should commute with
.
We’ll show this by showing that the vector field has flow
. Then if
is
-related to itself we know that
, and so by uniqueness we conclude that the flows
and
are equal, as asserted.
So, what makes the flow of
? First of all, we have to check the initial condition that
, which is perfectly straightforward to check:
More involved is the differential condition. It will help if we rewrite a bit as a function of both
and
:
Now we can start on the differential condition:
And thus is indeed the flow of
.
Calculating the Lie Derivative
It’s all well and good to define the Lie derivative, but it’s not exactly straightforward to calculate it from the definition. For one thing, it requires knowing the flow of the vector field
, which requires solving a differential equation that might be difficult in practice. Luckily, there’s an easier way.
But first, a lemma: if is an interval containing
,
is an open set, and
is a differentiable function with
for all
, then there is another differentiable function
such that
This is basically just like a lemma we proved for functions on star-shaped neighborhoods. Indeed, it suffices to set
Now let ,
, and
is the local flow of a vector field
on a region
containing
. Consider the function
, which satisfies the condition of the above lemma. We can thus write
for some . Or if we write
we can write
We also can see that . And so we can write
So now we calculate
Last time we wrote the action of on a smooth function
as
which pattern we can recognize in our formula. We thus continue
That is, for any two vector fields the Lie derivative is actually the same as the bracket
.
The Lie Derivative
Let’s go back to the way a vector field on a manifold gives us a “derivative” of smooth functions
. If
is a smooth vector field it has a maximal flow
which gives a one-parameter family
of diffeomorphisms, which we can think of as “moving forward along
by
.
Now given a smooth function we use this as if we were taking a derivative from all the way back in single-variable calculus: measure
at
, flow forward by
and measure
at
, take the difference, divide by
, and take the limit as
approaches zero:
Note that even if is not complete we do always have some interval around
on which
is defined and this difference quotient makes sense.
So far this is just a complicated (but descriptive!) way of restating something we already knew about. But now we can take this same approach and apply it to other vector fields. So if is another smooth vector field, we define the “Lie derivative” of
by
as:
Again we evaluate at both
and
, but here’s where a trick comes in: we can’t compare these two vectors directly, since they live at different points on
, and thus in different tangent spaces. So in order to compensate we use the flow itself to move backwards from
back to
, and use the derivative
to carry along the vector
.
We can come up with an alternate version of this formula by using similar techniques to those above:
That is, if we define the curve in the tangent space
then
. This would seem to make it live in the tangent space to
— that is, in
— but remember that since
is a vector space we identify it with all of its tangent spaces. Thus
, just like
is.
