The notion of a tensor product also extends to modules, but the generalization is not quite as straightforward as it was for direct sums.
We start with a ring and consider a right -module and a left -modules . We consider functions which take an element of each module and return an element of some abelian group . Such a function is called “middle linear” if
That is, if it is bilinear as a function of abelian groups, and if we can pull the action of from the first argument to the second and back without changing the value of the function. This condition may seem a little artificial, but I’ll motivate it a bit more later.
The tensor product is the abelian group for the universal middle linear function, just as the tensor product for abelian groups was the abelian group for the universal bilinear function. We show that such an object exists by a similar construction. Take the free abelian group generated by all elements of and write a generator as . Then impose the relations , , and . Then given any middle linear map there is a unique homomorphism of abelian groups so that .
In general, the tensor product over is just an abelian group — it eats -module structures like does. That said, like plays well with extra module structures, so does tensor product. If is a right -module and a left -module, then carries the structure of a left -module. Indeed we can define and check that this action respects the relations we imposed. Similarly, if has an additional right -module structure commuting with the action of then is a right -module, and the same goes for extra modules structures on . Unlike in the case of , no structures get “flipped over” in this process.
If is a commutative ring, then every module is both a left and a right -module. Thus, the same is true of — the tensor product eats the left module structure on and the right module structure on , but leaves the other two structures.
Now for an example that should motivate the idea of a middle-linear map. Let , , and be right -modules with an extra left -module structure on . Then is a left -module and is a right -module. We consider and and calculate the two composites
so we can use the action on either factor. This means that composition of these homomorphisms is a middle-linear function, and so defines a linear function .
We’ve covered direct sums in the case of abelian groups — that is, -modules — but the concept extends to modules over arbitrary rings. This gives me a good chance to go back and clean up my coverage.
We build the direct sum of two -modules and on the product of the underlying sets. We define an abelian group structure by , and an action of by .
Now, here’s the diagram:
The direct sum of two left -modules and comes equipped with four module homomorphisms. For we have the pair and . These are defined as follows:
There is a similar pair with a similar definition for . These homomorphisms satisfy the identities
where is the identity homomorphism on the module , and is the homomorphism between two modules sending every element of the domain to the element in the codomain.
Now if we have any two homomorphisms and from a module to and respectively, then there is a unique homomorphism making the two triangles on the top commute. That is, , and similarly for . In fact, we can define , for then
and so on. Similarly, given two homomorphisms and from and to a module , then the homomorphism is the unique homomorphism making the lower two triangles commute.
The upshot of all this is that the direct sum behaves like both the direct product and the free product of two groups, since it satisfies both universal properties. For any finite number of modules we can build the direct sum and it also satisfies the analogous universal properties, and comes equipped with analogous injections and projections satisfying analogous relations to those above.
The upshot of all this is that the direct sum of a finite collection of -modules behaves like both the direct product and the free product of groups. In fact, we can take that as the definition, derive the relations between the injections and projections, and use the above construction to show such a thing actually exists. On the other hand, we can take the relations between injections and projections as the definitions, use the construction to show existence, and derive the universal property from the relations as above.
For an infinite index set the situation is a bit more complicated. Here we use the definition from the injections and projections with the specified relations. Then the underlying set of the infinite direct sum is not the infinite cartesian product of the underlying sets. It’s actually the list of all such “-tuples” where all but a finite number of the entries are the elements of the respective modules. This only satisfies the universal property for the — the bottom of the diagram above. For the top we really do need the infinite direct product of the modules, which uses the whole infinite cartesian product of the underlying sets. However, for most purposes the direct sum is all we need, and the relations between the injections and the projections are the most useful part of this definition.
There is an interesting preorder we can put on the nonzero elements of any commutative ring with unit. If and are nonzero elements of a ring , we say that divides — and write — if there is an so that . The identity trivially divides every other nonzero element of .
We can easily check that this defines a preorder. Any element divides itself, since . Further, if and then there exist and so that and , so and we have .
On the other hand, this preorder is almost never a partial order. In fact since and we see that and , and most of the time . In general, when both and we say that and are associates. Any unit comes with an inverse , so we have and . If for some unit , then and are associates because .
We can pull a partial order out of this preorder with a little trick that works for any preorder. Given a preorder we write if both and . Then we can check that defines an equivalence relation on , so we can form the set of its equivalence classes. Then descends to an honest partial order on .
One place that divisibility shows up a lot is in the ring of integers. Clearly and are associate. If and are positive integers with , then there is another positive integer so that . If then . Otherwise . Thus the only way two positive integers can be associate is if they are the same. The preorder of divisibility on induces a partial order of divisibility on .
We use cardinal numbers to count how many elements are in a set. Another thing we think of numbers for is listing elements. That is, we put things in order: first, second, third, and so on.
We identified a cardinal number as an isomorphism class of sets. Ordinal numbers work much the same way, but we use sets equipped with well-orders. Now we don’t allow all the functions between two sets. We just consider the order-preserving functions. If and are two well-ordered sets, a function preserves the order if whenever then . We consider two well-ordered sets to be equivalent if there is an order-preserving bijection between them, and define an ordinal number to be an equivalence class of well-ordered sets under this relation.
If two well-ordered sets are equivalent, they must have the same cardinality. Indeed, we can just forget the order structure and we have a bijection between the two sets. This means that two sets representing the same ordinal number also represent the same cardinal number.
Now let’s just look at finite sets for a moment. If two finite well-ordered sets have the same number of elements, then it turns out they are order-equivalent too. It can be a little tricky to do this straight through, so let’s sort of come at it from the side. We’ll use finite ordinal numbers to give a model of the natural numbers. Since the finite cardinals also give such a model there must be an isomorphism (as models of between finite ordinals and finite cardinals. We’ll see that the isomorphism required by the universal property sends each ordinal to its cardinality. If two ordinals had the same cardinality, then this couldn’t be an isomorphism, so distinct finite ordinals have distinct cardinalities. We’ll also blur the distinction between a well-ordered set and the ordinal number it represents.
So here’s the construction. We start with the empty set, which has exactly one order. It can seem a little weird, but if you just follow the definitions it makes sense: any relation from to itself is a subset of , and there’s only one of them. Reading the definitions carefully, it uses a lot of “for every”, but no “there exists”. Each time we say “for every” it’s trivially true, since there’s nothing that can make it false. Since we never require the existence of an element having a certain property, that’s not a problem. Anyhow, we call the empty set with this (trivial) well-ordering the ordinal . Notice that it has (cardinal number) zero elements.
Now given an ordinal number we define . That is, each new number has the set of all the ordinals that came before it as elements. We need to put a well-ordering on this set, which is just the order in which the ordinals showed up. In fact, we can say this a bit more concisely: if . More explicitly, each ordinal number is an element of every one that comes after it. Also notice that each time we make a new ordinal out of the ones that came before it, we add one new element. The successor function here adds one to the cardinality, meaning it corresponds to the successor in the cardinal number model of . This gives a function from the finite ordinals onto the finite cardinals.
What’s left to check is the universal property. Here we can leverage the cardinal number model and this surjection of finite ordinals onto finite cardinals. I’ll leave the details to you, but if you draw out the natural numbers diagram it should be pretty clear how to how that the universal property is satisfied.
The upshot of all of this is that finite ordinals, like finite cardinals, give another model of the natural numbers, which is why natural numbers seem to show up when we list things.
I’m talking tomorrow in the Geometry, Symmetry, and Physics seminar here at Yale about the work that spun out of my realization of March 16. This ties into knot colorings, but goes far beyond that starting point. I won’t be able to say this with just the tools I’ve developed in the main line of my writings, so like the Atlas stuff it may not be comprehensible (yet!) to anyone beyond professionals.
So here’s the most general statement. Let be an algebraic category and a co- object in the category of pointed topological pairs up to homotopy. Write for the plane with marked points, and for the cube. Then every tangle gives rise to a cospan in :
where the -objects are taken in the category of pointed pairs up to homotopy. This then gives rise to an anafunctor from the comma category to the comma category . This assignment is a monoidal functor from the category of tangles to the category of (categories, anafunctors). When is the category of quandles, this functor categorifies the extension to tangles of the coloring number invariant of knots and links.
I’ve complained before about the opacity of the application procedure. I’ve gotten to the point now that I’m asking all outstanding schools what my status is. Following is a list — which I will keep updating — of departments that have closed their application procedures without bothering to inform applicants that they have been rejected. Petty? Maybe. But I’m the one who’s gotten screwed here, so the least they can do is be publicly known as not acting in good faith towards their applicants.
- George Mason University
- Kansas State University
- Northwestern University
- Purdue University
- Stony Brook University
- University of Arizona
- University of California, Davis
- University of California, Irvine
- University of California, Los Angeles
- University of Michigan
- University of Regina
- University of Tennessee
- University of Toronto
Note: I am giving departments the benefit of the doubt if they say they did send word. Still I am shocked — shocked — at how many letters the post office has lost.
I’ve posted my notes for Zuckerman’s third and final talk. This should get us up to “what exactly does the Atlas Project do?”
Maybe we have enough pull around here to get Jeff Adams to come up and answer that question in a talk before I’m banished from the academy, but even if not I may be able to get time to sit down with him one-on-one once I head back to the Maryland area. Anyhow, my explanations are behind the jump, as usual.
Read more »
Many times we’ll be interested in an abelian group on which more than one ring has an action, or on which a ring has two different actions. Most interesting are the cases when these different actions commute with each other. That is, if carries an -module structure with action and an -module structure with action , then it’s really nice if . We’ll keep track of this sort of thing by hanging subscripts off of the module’s name to keep track of mutually commuting actions. In this case we’ll say to denote two commuting left actions, one by and one by .
Every module over a commutative ring has both a left and a right action, and these clearly commute with each other because the ring is commutative. That is, if is an -module for a commutative ring , we automatically get two commuting actions, written .
Another example we’ll see in more generality later is the tensor powers of an abelian group . The group itself is a module over . Now the tensor power also is a module over . We just define and extend linearly. But we also have a homomorphism from the symmetric group to the monoid of endomorphisms of . For a permutation we define , shuffling around the factors. This extends to a unique homomorphism , which gives another module structure on . These two actions, one of and one of , commute with each other.
Now that I have some examples down, I want to consider how these extra module structures play with — which tells us the homomorphisms between left -modules — and — which does the same for right modules. I’ll mostly treat the left module case because the other side is very similar.
The first thing to make clear is that eats up the -module structure. If we take left modules and then is just an abelian group with no -module structure at all. The interesting things happen when we’ve got extra module structures floating around.
If also has a right -module structure for another ring , then we get a right -module structure on the homomorphisms. We can define . On the right side of the equation we’re using the given action of on . It’s not too hard to verify that this defines a right action on .
On the other hand, if has a right -module structure, we get a left action on the homomorphisms. We define . Let’s verify this one carefully:
There are a number of similar cases, which you should check through:
- is a right -module.
- is a left -module.
- is a left -module.
- is a right -module.
- is a right -module.
- is a left -module.
- is a left -module.
- is a right -module.
In general, eats a left -module structure from each module we stick in. Extra module structures in the second slot carry through, while extra module structures in the first slot get flipped over from right to left and back. The same goes for , except it eats a right -modules structure from each slot.
One explicit example of this effect: over a commutative ring , every left module is also a right module and vice-versa. There’s really no difference between and here, so we’ll just write . Now we’re looking at , so one structure (say the left one, for now) on each module gets eaten, leaving a right -module structure on each slot. The second slot carries through and the first slot flips over, giving a left and a right action of on . This will come in very handy when we start considering modules over fields.
Again, once we have a new structure it’s important to understand what sort of functions connect different instancees of the structure. For modules we have module homomorphisms.
By now this should be pretty straightforward. A left -module is an abelian group equipped with an action of . A homomorphism of -modules is a homomorphism of abelian groups that also preserves the action of . That is, it’s a linear function satisfying for all . We call such a function -linear.
In terms of the tensor product picture, any linear function gives us a linear function . The condition of -linearity is that the following diagram commute:
where the horizontal arrows are the actions of on and , respectively.
Now if we pick two left -modules and we have a bunch of different -module homomorphisms from to . We’ll call the set of them , sometimes shortened to , or even just . The important thing about this set is that it inherits the structure of an abelian group from the one on .
There’s always a homomorphism sending every element of to the zero element of . Also, given homomorphisms and we can define . It’s straightforward to check that this preserves the action of as long as and both do. Finally, there’s a homomorphism defined by . We can easily see that these operations on satisfy the axioms of an abelian group.
In fact it gets even better. Not only are the homomorphism sets abelian groups, but composition is bilinear! Let’s consider four homomorphisms between three modules , , , and . Then we build up the composition . Let’s see what it does to an element .
This gives us a linear function for every three modules , , and .
What happens if we pick all three modules to be the same one? Each homomorphism set is , which we’ll call . Then we get a linear function . This is a ring structure! We call it the ring of -endomorphisms of . If the ring is the ring of integers and is an abelian group, then is just the endomorphism ring we considered earlier. This is an example of how the theory of modules naturally extends the theory of abelian groups.
The analogue in ring theory for the idea of a group action is that of a module. Again we want every element of the ring to behave like a function on a set and for multiplication to correspond to composition of functions, but now we have the addition floating around and we’d like to include it in the structure as well. We’ll handle this by letting the ring act on an abelian group.
So let’s take a ring and an abelian group . We say that has the structure of a left -module if each element acts as a linear function from to itself. We write for the effect of this function on an element . Linearity here means that . We also require that the ring structure play its part
- if the ring has an identity element
We can say all this another way. Since this action of on is linear in both the ring and the abelian group we get a linear function from the tensor product just like we had one for the multiplication of the ring: . Not every such function will work, though. We also need that the following diagram commutes:
Around the top of the diagram we use the module action twice, while around the bottom we first multiply in the ring and then use the action once. If the ring has an identity we also require that the following diagram commute
where on the top we use the unique linea function sending the integer to the identity in , and the diagonal is the canonical isomorphism from to .
And there’s another way to say it. Remember that every abelian group comes equipped with the endomorphism ring . An -module structure on is a ring homomorphism .
All three ways of defining a module — from the raw axioms, as a linear function , or as a homomorphism — are useful in various situations, and it’s important to be able to slide back and forth between the different pictures.
Of course, the fact that I said left -module above should immediately lead you to think about right -modules. These are the same, except that the module action is written and satisfies — the order in which the factors are applied is reversed. Over a commutative ring we often ignore this distinction, since we can always switch the order of any product.
Every ring immediately has the structure of a left and a right -module, using multiplication as the action. This is most clearly seen from the tensor-product definition, where the commuting square just expresses the associativity of ring multiplication. On the other hand, every abelian group immediately has the structure of an -module. This is trivial in the homomorphism picture: just pick the identity homomorphism.
Somewhat more importantly, every abelian group is naturally a -module. We define — adding up copies of the group element . Everything we say for modules in general will apply to abelian groups, and in fact much of what we’ve already said about abelian groups will extend to other modules.
At this point I want to make a point about language. The terms “abelian group” and “-module” are interchangeable, but there are cases in which I feel the context clearly indicates using one or the other. Later we’ll come to (and many reader have already seen) “homology groups”, which are more naturally modules than groups. Similarly, the one-dimensional circle, torus, and real projective space are all semantically very different, even though they happen to be equivalent in many situations. I think that this imprecision can lead to confusion on the part of a student, so I’ll try my best to use the most apropriate of equivalent terms.