Upper-Triangular Matrices
Until further notice, I’ll be assuming that the base field is algebraically closed, like the complex numbers
.
What does this assumption buy us? It says that the characteristic polynomial of a linear transformation is — like any polynomial over an algebraically closed field — guaranteed to have a root. Thus any linear transformation
has an eigenvalue
, as well as a corresponding eigenvector
satisfying
So let’s pick an eigenvector and take the subspace
it spans. We can take the quotient space
and restrict
to act on it. Why? Because if we take two representatives
of the same vector in the quotient space, then
. Then we find
which represents the same vector as .
Now the restriction of to
is another linear endomorphism over an algebraically closed field, so its characteristic polynomial must have a root, and it must have an eigenvalue
with associated eigenvector
. But let’s be careful. Does this mean that
is an eigenvector of
? Not quite. All we know is that
since vectors in the quotient space are only defined up to multiples of .
We can proceed like this, pulling off one vector after another. Each time we find
The image of in the
th quotient space is a constant times
itself, plus a linear combination of the earlier vectors. Further, each vector is linearly independent of the ones that came before, since if it weren’t, then it would be the zero vector in its quotient space. This procedure only grinds to a halt when the number of vectors equals the dimension of
, for then the quotient space is trivial, and the linearly independent collection
spans
. That is, we’ve come up with a basis.
So, what does look like in this basis? Look at the expansion above. We can set
for all
. When
we set
. And in the remaining cases, where
, we set
. That is, the matrix looks like
Where the star above the diagonal indicates unknown matrix entries, and the zero below the diagonal indicates that all the entries in that region are zero. We call such a matrix “upper-triangular”, since the only nonzero entries in the matrix are on or above the diagonal. What we’ve shown here is that over an algebraically-closed field, any linear transformation has a basis with respect to which the matrix of the transformation is upper-triangular. This is an important first step towards classifying these transformations.
