The Multiplicity of an Eigenvalue
We would like to define the multiplicity of an eigenvalue of a linear transformation
as the number of independent associated eigenvectors. That is, as the dimension of the kernel of
. Unfortunately, we saw that when we have repeated eigenvalues, sometimes this doesn’t quite capture the right notion. In that example, the
-eigenspace has dimension
, but it seems from the upper-triangular matrix that the eigenvalue
should have multiplicity
.
Indeed, we saw that if the entries along the diagonal of an upper-triangular matrix are , then the characteristic polynomial is
Then we can use our definition of multiplicity for roots of polynomials to see that a given value of has multiplicity equal to the number of times it shows up on the diagonal of an upper-triangular matrix.
It turns out that generalized eigenspaces do capture this notion, and we have a way of calculating them to boot! That is, I’m asserting that the multiplicity of an eigenvalue is both the number of times that
shows up on the diagonal of any upper-triangular matrix for
, and the number of independent generalized eigenvectors with eigenvalue
— which is
.
So, let’s fix a vector space of finite dimension
over an algebraically closed field
. Pick a linear transformation
and a basis
with respect to which the basis of
is upper-triangular. We know such a matrix will exist because we’re working over an algebraically closed base field. I’ll prove the assertion for the eigenvalue
— that the number of copies of
on the diagonal of the matrix is the dimension of the kernel of
— since for other eigenvalues we just replace
with
and do the exact same thing.
We’ll prove this statement by induction on the dimension of . The base case is easy: if
then the kernel of
has dimension
if the upper triangular matrix is
, and has dimension
for anything else.
For the inductive step, we’re interested in the subspace spanned by the basis vectors through
. Let’s call this subspace
. Now we can write out the matrix of
:
We can see that every vector in — linear combinations of
through
— lands back in
. Meanwhile
, where the components of
are given in the last column. The fact that
is invariant under the action of
means that we can restrict
to that subspace, getting the transformation
. Its matrix with respect to the obvious basis is
The dimension of is less than that of
, so we can use our inductive hypothesis to conclude that
shows up on the diagonal of this matrix
times. But we saw yesterday that the sequence of kernels of powers of
has stabilized by this point (since
has dimension
), so this is also
. The last diagonal entry of
is either
or not. If
, we want to show that
On the other hand, if , we want to show that
The inclusion-exclusion principle tells us that
Since this dimension of a subspace has to be less than or equal to , the difference in dimensions on the right can only be either zero or one. And we also see that
So if we need to show that every vector in
actually lies in
, so the difference in dimensions is zero. On the other hand, if
we need to find a vector in
that’s not in
, so the difference in dimensions has to be one.
The first case is easier. Any vector in but not in
can be written uniquely as
for some nonzero scalar
and some vector
. When we apply the transformation
, we get
. Since
, the coefficient of
is again nonzero. No matter how many times we apply
, we’ll still have a nonzero vector left. Thus the kernel of
is completely contained in
, and so we conclude
.
In the second case, let’s look for a vector of the form . We want to choose
so that
. At the first application of
we find
. Thus
But the dimension of is
, and so by this point the sequence of images of powers of
has stabilized! That is,
and so we can find a so that
. This gives us a vector in the kernel of
that doesn’t lie in
, and the inductive step is complete.
As a final remark, notice that the only place we really used the fact that is algebraically closed is when we picked a basis that would make
upper-triangular. Everything still goes through as long as we have an upper-triangular matrix, but a given linear transformation may have no such matrix.
