The Determinant of Unitary and Orthogonal Transformations
Okay, we’ve got groups of unitary and orthogonal transformations (and the latter we can generalize to groups of matrices over arbitrary fields. These are defined by certain relations involving transformations and their adjoints (transposes of matrices over more general fields). So now that we’ve got information about how the determinant and the adjoint interact, we can see what happens when we restrict the determinant homomorphism to these subgroups of .
First the orthogonal groups. This covers orthogonality with respect to general (nondegenerate) forms on an inner product space , the special case of orthogonality with respect to the underlying inner product
, and the orthogonal matrix group over arbitrary fields
. The general form describing all of these cases is
where is the adjoint or the matrix transpose, as appropriate. Now we can take the determinant of both sides of this equation, using the fact that the determinant is a homomorphism. We find
Next we can use the fact that . We can also divide out by
, since we know that
is invertible, and so its determinant is nonzero. We’re left with the observation that
And thus that the determinant of an orthogonal transformation must be a square root of
in our field. For both real and complex matrices, this says
, landing in the “sign group” (which is isomorphic to
).
What about unitary transformations? Here we just look at the unitarity condition
We take determinants
and use the fact that the determinant of the adjoint is the conjugate of the determinant
So the determinant of a unitary transformation must be a unit complex number in the circle group (which, incidentally, contains the sign group above).
It seems, then, that when we take determinants the analogy we’ve been pushing starts to come out. Unitary (and orthogonal) transformations are like complex numbers on the unit circle, and their determinants actually are complex numbers on the unit circle.
The Determinant of the Adjoint
It will be useful to know what happens to the determinant of a transformation when we pass to its adjoint. Since the determinant doesn’t depend on any particular choice of basis, we can just pick one arbitrarily and do our computations on matrices. And as we saw yesterday, adjoints are rather simple in terms of matrices: over real inner product spaces we take the transpose of our matrix, while over complex inner product spaces we take the conjugate transpose. So it will be convenient for us to just think in terms of matrices over any field for the moment, and see what happens to the determinant of a matrix when we take its transpose.
Okay, so let’s take a linear transformation and pick a basis
to get the matrix
and the formula for the determinant reads
and the determinant of the adjoint is
where we’ve now taken the transpose.
Now for the term corresponding to the permutation we can rearrange the multiplication. Instead of multiplying from
to
, we multiply from
to
. All we’re doing is rearranging factors, and our field multiplication is commutative, so this doesn’t change the result at all:
But as ranges over the symmetric group
, so does its inverse
. So we relabel to find
and we’re back to our formula for the determinant of itself! That is, when we take the transpose of a matrix we don’t change its determinant at all. And since the transpose of a real matrix corresponds to the adjoint of the transformation on a real inner product space, taking the adjoint doesn’t change the determinant of the transformation.
What about over complex inner product spaces, where adjoints correspond to conjugate transposes? Well, all we have to do is take the complex conjugate of each term in our calculation when we take the transpose of our matrix. Then carrying all these through to the end as we juggle indices around we’re left with
The determinant of the adjoint is, in this case, the complex conjugate of the determinant.
Unitary and Orthogonal Matrices
Let’s see what happens when we take a unitary or orthogonal transformation and turn it into a matrix by picking a basis for our vector space.
First a unitary transformation on a complex vector space. We pick a basis and set up the matrix
We can also set up the matrix for the adjoint
That is, the adjoint matrix is the conjugate transpose. This isn’t really anything new, since we essentially saw it when we considered Hermitian matrices.
But now we want to apply the unitarity condition that . It will make our lives easier here to just write out the sum over the basis in the middle and find
Now, this isn’t particularly useful on its face. I mean, what does that mess even mean? But if nothing else it tells us that we can describe unitary matrices in terms of (a lot of) equations involving only complex numbers. We can then pick out all the complex matrices which represent unitary transformations. They form the “unitary group”
.
What about orthogonal matrices? Again, we pick a basis to get a matrix
and also a matrix for the adjoint
Here the adjoint matrix is just the transpose, not the conjugate transpose, since we’re working over a real inner product space. Then we can write down the orthogonality condition
Again, this doesn’t really seem to tell us much, but we can use these equations to cut out the matrices which represent orthogonal transformations from all real matrices. They form the “orthogonal group”
.
But there’s something else we should notice here. The equations for the unitary group involved complex conjugation, so we need some structure like that to talk sensibly about unitarity. However, the orthogonality equations only involve basic field operations like addition and multiplication, and so these equations make sense over any field whatsoever. That is, given a field we can consider the collection of all
matrices with entries in
, and then impose the above orthogonality condition to cut out the matrices in the orthogonal group
, while the first orthogonal group is
.
One useful orthogonal group is . This is not the same as the unitary group
, though it can be confusing to keep the two separate at first. The unitary group consists of matrices whose inverses are their conjugate transposes, instead of just their transposes for the complex orthogonal group. The unitary group preserves a sesquilinear inner product, which has a clear geometric interpretation we’ve been talking about. The orthogonal group preserves a bilinear form, which doesn’t have such a clear visual interpretation. They are related in a way, but we’ll be coming back to that subject much later on.
Unitary Transformations
Unitary transformations are like orthogonal transformations, except we’re working with a complex inner product space. We’ll focus on just the transformations that are unitary with respect to the inner product itself. That is, we ask that
and so we must have just as we wrote for orthogonal transformations, but we have to use the adjoint that’s appropriate to our complex inner product. In this way, unitary and orthogonal transformations are related in a way similar to that in which Hermitian and symmetric forms are related.
Now, we’ve got this running analogy between endomorphisms on an inner product space and complex numbers. Taking the adjoint is like complex conjugation, so Hermitian transformations are like real numbers because they’re equal to their own adjoints. But here, we’re looking at transformations whose inverses are equal to their adjoints. What does this look like in terms of our analogy?
Well, we’ve noted that a transformation composed with its own adjoint is a sure way to get a positive-definite transformation . This is analogous to the way that a complex number times its own conjugate is always nonnegative:
. In fact, we use this to interpret
as the squared-length of a the complex number
. So what’s the analogue of the unitarity condition
? That’s like asking for
, and so
must be a unit-length complex number. Unitary transformations are like the complex numbers on the unit circle.
Orthogonal transformations
Given a form on a vector space represented by the transformation
and a linear map
, we’ve seen how to transform
by the action of
. That is, the space of all bilinear forms is a vector space which carries a representation of
. But given a particular form
, what is the stabilizer of
? That is, what transformations in
send
back to itself.
Before we answer this, let’s look at it in a slightly different way. Given a form we have a way of pairing vectors in
to get scalars. On the other hand, if we have a transformation
we could use it on the vectors before pairing them. We’re looking for those transformations so that for every pair of vectors the result of the pairing by
is the same before and after applying
.
So let’s look at the action we described last time: the form is sent to
. So we’re looking for all
so that
We say that such a transformation is -orthogonal, and the subgroup of all such transformations is the “orthogonal group”
. Sometimes, since the vector space
is sort of implicit in the form
, we abbreviate the group to
.
Now there’s one particular orthogonal group that’s particularly useful. If we’ve got an inner-product space (the setup for having our bra-ket notation) then the inner product itself is a form, and it’s described by the identity transformation. That is, the orthogonality condition in this case is that
A transformation is orthogonal if its adjoint is the same as its inverse. This is the version of orthogonality that we’re most familiar with. Commonly, when we say that a transformation is “orthogonal” with no qualification about what form we’re using, we just mean that this condition holds.
Let’s take a look at this last condition geometrically. We use the inner product to define a notion of (squared-)length and a notion of (the cosine of) angle
. So let’s transform the space by
and see what happens to our inner product, and thus to lengths and angles.
First off, note that no matter what we use, the transformation in the middle is self-adjoint and positive-definite, and so the new form is symmetric and positive-definite, and thus defines another inner product. But when is it the same inner product? When
, of course! For then we have
So orthogonal transformations are exactly those which preserve the notions of length and angle defined by the inner product. Geometrically, they correspond to rotations and reflections that change orientations, but leave lengths of vectors the same, and leave the angle between any pair of vectors the same.
Sunday Samples 131
One of the things I really liked about Ben Folds was that his vocal range is almost exactly the same as mine, and so it was always rather easy to sing along to. The general tenor of his lyrics didn’t hurt much either.
I don’t know if this one had any inspiration from last week’s Sample. Either way, it seems interesting that boxing seems to have this cultural resonance with the idea of a washed-up career. From the 1995 debut it’s “Boxing”. Not a bad waltz, either.
Read more »
Transformations of Bilinear Forms
Sorry for the delay, but at least now everything is in Maryland. Now I just need a job and then I’ll have to move back out of my parents’ house again :/
Anyway, now we want to see how linear maps between vector spaces affect forms on those spaces. We’ve seen a hint when we talked about the category of inner product spaces: if we have a bilinear form on a space
and a linear map
, then we can “pull back” the form along the map. That is, we take two vectors in
, apply
to both of them, and then stick them into the form on
.
When we write this out with our Dirac notation, we don’t really make much of a distinction between the vector spaces, relying on context to tell which vectors belong to which spaces, as well as which pairing we’re using. Luckily, when we write something sensible, it usually doesn’t matter how we parse things. Dirac notation is very robust! So, given vectors and
in
we have the form given by
as we’ve seen over and over. But now let’s take two vectors and
in
. First, we hit each of these vectors with
. The ket vector
clearly becomes
, while the bra vector
becomes
. Now using the form from before we get
This looks like a form on that’s described by the linear transformation
. Just as a sanity check, we can verify that
sends
to
,
then sends
to itself, and the adjoint
sends
back to
. So this is indeed a transformation from
to itself that can be used to represent a form.
As a particular case, we might consider an automorphism and consider it as a change of basis. When we did this to a linear map
we got a new linear map by conjugation
, and we say that these two transformations are similar. In one sense, this is “the same” linear map, but described with different “coordinates” on the vector space. But when we apply it to a bilinear form we turn
into
, and we say that these two are “congruent”. In the same sense as before, they describe “the same” form on
, but using different coordinates on the vector space.
Like similarity, congruence gives an action of on the space of bilinear forms on
. Indeed, if
and
are both automorphisms, we can act on
first by
to get
, then then by
to get
. But this is the same as
, which is the action by the product
.
Sunday Samples 130 (one day late)
I sort of which I had something more momentous to say in my thousandth post. Oh well…
What is it about moving that pushes Simon & Garfunkel into my mind? Maybe something about the way Art Garfunkel got a graduate degree in mathematics, but never did anything significant with it either.
Oh well. This time it’s “The Boxer”, from 1970s Bridge Over Troubled Water.
Read more »
Nondegenerate Forms II
Okay, we know what a nondegenerate form is, but what does this mean for the transformation that represents the form?
Remember that the form represented by the transformation is nondegenerate if for every nonzero ket vector
there is some bra vector
so that
. But before we go looking for such a bra vector, the transformation
has turned the ket vector
into a new ket vector
. If we find that
, then there can be no suitable vector
with which to pair it. So, at the very least, we must have
for every
. That is, the kernel of
is trivial. Since
is a transformation from the vector space
to itself, the rank-nullity theorem tells us that the image of
is all of
. That is,
must be an invertible transformation.
On the other hand, if is invertible, then every nonzero ket vector
becomes another nonzero ket vector
. Then we find that
where this last inequality holds because the bra-ket pairing is an inner product, and is thus positive-definite. Indeed, a positive-definite (or negative-definite) form must be nondegenerate. Thus it is sufficient for to be invertible.
Incidentally, this approach gives us a good way of constructing a lot of positive-definite transformations. Given an invertible transformation , we expand
Since the form defined by the bra-ket pairing is invertible, so is the form defined by . And this is a sensible concept, since
is self-adjoint. Indeed, we take its adjoint to find
This extends our analogy with the complex numbers. An invertible transformation composed with its adjoint is a self-adjoint, positive-definite transformation, just as a nonzero complex number multiplied by its conjugate is a real, positive number.
The Unapologetic… Book?
Yesterday some lonely voice on Twitter with the moniker @arthegall (which nickname I simply cannot parse) suggested turning this weblog into a book. The idea has come up before, but it usually has a spirit of “Turning weblogs into books is a Thing That People Do”, and so I might someday follow suit. But to look deeply at this particular weblog and say that it should be done? Clearly there are some problems here.
First of all, is there really a market for it? I flatter myself that there really is a “generally interested lay audience”, but how big is it? Unless one says something controversial — or that some crank thinks is controversial — math, physics, and computer science weblog readers tend to be a rather silent bunch. The obvious exceptions are those who have major corporate backing. Not that I disparage major corporate backing or anything, but it might reinforce my point that I don’t have it already. And even if that’s just the whims of chance, I really have no good way of gauging my current audience, let alone who might be willing to shell out for an honest-to-Gauß book. Without an audience, publishers aren’t going to be interested, and without a publisher I’m left with vanity presses. At which point, what’s really the difference between a vanity press and a weblog anyway?
Second of all.. well, just look at this mess. I can’t even keep my blogroll up-to-date. If you’re wondering why you don’t see your weblog there it may well be that I’m reading it and haven’t updated that thing in over a year. I’m worse with bibliographies. One of the nice things about the weblog format is that I don’t really have to do a lot in the way of referencing my sources, especially since nothing I’m saying is particularly new (unless I’m making it up myself). Ninety percent of what you read would get cited as “[stuff] I remember from graduate school [or earlier]“, with the rest filled in with occasional tokes from old textbooks or Wikipedia to make sure I’m getting the details and standard usage more or less correct. See, I really don’t think in terms of where ideas come from. Once they go into my head they’re churned into a sort of lumpy grey paste. Semantic leveling is great when it comes to pulling out oblique analogies (a useful tool in mathematics, actually), but it’s really horrible when it comes to giving credit where it’s due.
And yet, a book could lead to fame and fortune, and what American doesn’t want that? The tour might not get me on with my mancrush Keith, but Rachel tries to extend overtures to math/science/technology geeks from her policy-wonk side of the room. She even made mention of ∏ Day, which I’m sure could provide a great hook for an interview given my well-established stance. And Stewart and Colbert are wide open. Yesterday Stewart’s featured interview was with a guy who wrote a bio of Henry Hudson, of all things. So.. it might not be a bad idea.
But what would I write about. Just “turn blog into book” doesn’t feel right to me. Terry Tao can just make a tarball of a year’s worth of posts, slap an ex post facto theme on it, and ship it out. He can get away with this partially for the reason he’s got a built-in weblog audience in the first place: he’s got a Fields Medal. Me, I can’t get a proper postdoctoral position to save my life, so I have to work a bit harder on coherence. There’s way too much here, with no clear goal. I have local goals, and longer-term goals, but overall this project is very unstructured. I can hold what attention I do because it’s a nice little snack of whatever I happen to be talking about today. But a book needs to have a theme and, in a sense, a goal. Tangents are acceptable, but even they should clearly tie into the overall narrative.
And then I thought of one topic I’ve covered here, which could be stripped down a bit and rearranged into a single coherent thread: the progression from natural numbers to real numbers, and beyond. Natural numbers are, or at least seem, to be simple and intuitive, but by the time we’re talking about real numbers I don’t think most people have a real sense of what these things are. At the time most people first see them (even under the capable eyes of a good schoolteacher) they probably don’t realize what huge conceptual leaps they’re making. Even their teachers may not really know. Even up through the calculus sequence, which is where even most mathematically-able people stop (going on into science or engineering), it’s never really made explicit what’s going on. The fine details that are essential for making calculus work as it does are swept under the rug. By the time many people are ready to put real thought into what a number is, they’re beyond the point where there’s going to be a class telling them that the question is as deep as it really is.
Now, I don’t think I’d be interested in going in a philosophical direction with this, trying to really answer the question in a deep way, but I’d probably have to brush up on some of that sort of thing to make appropriate mention of it. My concern would be more to explore the progression of notions of “number”, and to explore the structures we encounter along the way. I wouldn’t want to go into nearly as much depth on group theory as I’ve done here, but some mention of it in parallel with the step from natural numbers to integers would be useful, for instance. But I’d need a collaborator — or at least an editor — who can understand what I’m doing and keep me honest about my references.
But given the right support I wouldn’t be against the idea…