David Corfield of The n-Category Café and Alexandre Borovik of Mathematics Under the Microscope have opened up
an infinite dialogue A Dialogue on Infinity. And they’ve chosen an excellent WordPress theme for it!
Basically, we consider each possible path from the source — which we’ll assume is at the origin of our lattice — to the target — which will be at lattice coordinates — as a series circuit, and the collection of all possible paths is a giant parallel circuit. If a given path has length it passes through resistors, and so its total resistance is . Thus we need to form the sum over all paths
and take the reciprocal of this sum. So we’ve got a combinatorial problem: how many lattice paths have length and go from the origin to ? Let’s call this number and build the “exponential generating function”
We choose the exponential generating function here rather than the ordinary generating function because of a manipulation we want to do later that looks nicer for exponential generating functions. We’ll go from to our desired sum a little later.
For now, let’s consider a simpler generating function. Consider the number of paths on a one-dimensional lattice with length that start at and end at . First off, if we have no paths that work. Similarly, if is not divisible by we won’t have any good paths. But when , there are generally many different paths. How do we get our hands on one? Choose steps to take left and the other to take right, and we’ll end up steps to the right. So we set to get the exponential generating function
where is a Bessel function of the first kind.
Now a path in the two-dimensional lattice is really a mixture of two one-dimensional paths. That is, we break our steps up into chunks of and steps, respectively, put a one-dimensional path on the chunk ending at , and put a one-dimensional path on the chunk ending at . And it turns out that generating functions are perfect for handling this!
This tells us how to find the number of paths of length that end at — it’s just the th derivative of this power series, evaluated at . That is
Now let’s consider the ordinary generating function for :
Here the coefficient of is the number of paths of length (and thus with resistance ) from the origin to . What we want to calculate is the sum . To get at this, we’ll divide our power series by and then integrate with respect to . This will give the series , which we can then evaluate at . Notice that we never have any paths of length zero unless we’re counting up the reciprocal resistance from the origin to itself, which should definitely be infinite anyway, so dividing by isn’t really a problem.
Now I admit there’s one glaring problem here: I don’t know offhand how to go from an exponential generating function to an ordinary generating function, much less how to integrate what we get from doing that to this product of Bessel functions.
[UPDATE]: I was just working on some numerics and there’s a problem here somewhere. The coefficients are off, but by a (sort of) predictable amount. I don’t really have time to fix this now, so I’ll come back with more later.
[UPDATE]: Okay, I see what’s wrong now. I’m adjusting what I said above.
[UPDATE]: Someone pointed out to me that after all of this I need to do a renormalization just like for the path integrals in quantum field theory. That is, in the above I need to restrict to paths that never hit the same edge twice, which is a harder combinatorial problem than I expected.
Still, it’s nifty to see the Bessel functions…
A metric space is a set equipped with a notion of “distance” in the set. That is, we have a set and a function , which assigns a real number to every pair of points in . This function will satisfy the following axioms:
- For all , .
- For all , if and only if .
- For all , .
- For all , .
The first says that distances are all nonnegative real numbers. The second says that any point is distance from itself, and only from itself. The third says that the distance between two points doesn’t depend on the order in which we take the points. The last is called the triangle inequality, because if we think of the points as the vertices of a triangle then it’s shorter to go from to along the leg connecting those two than to take the detour to .
Notice that these properties line up with those of absolute values. That is, the function defined by will be a distance function on .
Now any metric space is actually a uniform space. We define an entourage for each positive real number . This consists of all the pairs with . Since each of these will contain the diagonal. The intersection is the entourage corresponding to the smaller of and . Each is its own reflection by the symmetry of the distance function. And the triangle inequality gives our half-size entourages — if and then .
For the real numbers themselves we should verify that we get back the same uniform structure as we did before. Remember that the uniform structure we got from completing the uniform structure on the rational numbers had an entourage for each positive , with . Each one of these shows up in the entourages for the metric structure, by considering as a positive real number, but does every basic entourage from the metric structure show up as an entourage in the complete uniform structure? It does! The Archimedean property tells us that for any positive we can find a positive rational number . Then , and so is an entourage in the completion of the uniform structure on the rationals.
Let’s look at the neighborhood structure we get from the entourages of the metric structure. A subset is a neighborhood of if and only if it contains for some . That is, it must contain the “open ball” of all such that .
In this means that we have a neighborhood base for each point consisting of the intervals . Thus a subset of will be open if and only if it contains such a symmetric neighborhood of each of its points, and this will happen if and only if is the union of a collection of open intervals. Then we can take the intervals as a base for our topology.
As a final coup de grâce, let’s write down explicitly the condition that a function be a continuous map. We have a neighborhood base of our topology, and we know we only need to check the neighborhood definition of continuity on a neighborhood base.
So, a function will be continuous at if and only if for each neighborhood there is a neighborhood with . Translating this all into our explicit language for the real numbers and restricting to neighborhood bases says that a function is continuous at if and only if for each there is a so that implies . And we’re back to the old definition of continuity from calculus 1! Then, as usual, we say that is continuous if the above condition holds for all .
What about uniform continuity. We can again translate the statements to our special case and check them on the basic entourages. A function will be uniformly continuous if for every there is a so that for all , implies that .
Notice particularly the difference between uniform continuity and continuity. Continuity says that (for all ) (for all there exists a ) such that ( implies ). Uniform continuity says that (for all there exists a ) such that (for all ) ( implies ). The quantifier for shows up after the quantifier for in the latter definition. That is, for a uniformly continuous function we can pick the uniformly to apply to all points , while for a merely continuous function we may have to use a different for each point . At first it doesn’t seem to be that big a deal, which always causes a certain amount of confusion in an advanced calculus (undergraduate real analysis) class, but it turns out that being able to choose the same at every point makes a lot of nice things work out that don’t otherwise hold.
[UPDATE]: I’m feeling a little silly that I didn’t mention this before, but the last two definitions immediately port over to any function between metric spaces and by just using the local definitions of “distance” in place of that for . A function is continuous if for all and there is a so that implies . Similarly, is uniformly continuous if for all there is a so that for all implies .
Whether we use Dedekind cuts or Cauchy sequences to construct the ordered field of real numbers (and it doesn’t matter which), we are taking the ordered field of rational numbers and enlarging it to be “complete” in some sense or another. But we also aren’t making it too much bigger. The universality property we got from completing the uniform structure already gives evidence of that, but there’s another property which we can show is true of , and which shows that the real numbers aren’t too unwieldy.
In The Sand Reckoner, the ancient Greek mathematician Archimedes once set about the problem of should the number of grains of sand in existence to be finite. He does this by determining a (very weak) upper bound: the number of grains of sand it would take to fill up the entire universe, as he understood the latter term. He writes:
There are some … who think that the number of the sand is infinite in multitude; and I mean by the sand not only that which exists about Syracuse and the rest of Sicily but also that which is found in every region whether inhabited or uninhabited. Again there are some who, without regarding it as infinite, yet think that no number has been named which is great enough to exceed its multitude. And it is clear that they who hold this view, if they imagined a mass made up of sand in other respects as large as the mass of the earth filled up to a height equal to that of the highest of the mountains, would be many times further still from recognizing that any number could be expressed which exceeded the multitude of the sand so taken. But I will try to show you by means of geometrical proofs, which you will be able to follow, that, of the numbers named by me … some exceed not only the number of the mass of sand equal in magnitude to the earth filled up in the way described, but also that of a mass equal in magnitude to the universe
The deep fact here is a fundamental realization about numbers: the set of natural numbers has no upper bound in the real number system. That is, no matter how huge a real number we pick there’s always a natural number bigger than it. Equivalently, given any positive real number — even as small as the volume of a grain of sand — and another positive real number — even as large as the volume of (the ancient Greek conception of) the universe — there’s some natural number so that . When this happens in a given ordered field we say that the field is “Archimedean”.
So let’s show that is Archimedean. If there were positive real numbers and so that for all natural numbers , then would be an upper bound for the set of . Then Dedekind completeness gives us a least upper bound , and we can just take to be this least upper bound. Now , and also , and so . That is, is another upper bound for the set of multiples of . But since was chosen to be positive we see that , contradicting the assumption that was the least such upper bound. So such a pair of real numbers can’t exist.
In particular, we can take a positive real number and consider the set of natural numbers which are larger than it. Since the natural numbers are well-ordered, there is a least such number, and it can’t be because we assume . Subtracting one from this number will then give the largest natural number that is still below in the real number order, and we denote this number by . We can thus write any positive number uniquely as the sum of a natural number and some remainder with .
It turns out that the real numbers are actually the largest Archimedean field. That is, if is any ordered field satisfying the Archimedean property, there will be an monomorphism of ordered fields , making (the image of) a subfield of . I won’t prove this here, but I will note one thing about the meaning of this result: the Archimedean property essentially limits the size of an ordered field. That is, an ordered field can’t get too big without breaking this property. Dually, an ordered field can’t get too small without breaking Dedekind completeness or uniform completeness. Completeness pulls the field one way, while the Archimedean property pulls the other way, and the two reach a sort of equilibrium in the real numbers, living both at the top of one world and the bottom of the other.
Sorry to not get this posted until so late, but the end of the semester has been a bit hectic.
We’ve used Dedekind cuts to “complete” the order on the rational numbers — to make sure that every nonempty set of numbers with an upper bound has a least upper bound. We’ve also used Cauchy sequences to “complete” the uniform structure on the rational numbers — to make sure that every Cauchy sequence converges. But do we actually get the same thing in each case?
If we take a real number represented by a Cauchy sequence it’s easy to come up with a cut. Given a rational number we use the constant sequence and compare it to . If is eventually nonnegative then is less than x$, and should go into the left set . On the other hand, if it’s eventually nonpositive then is greater than and should go into the right set . It’s straightforward to show that this function from to the set of cuts preserves the order.
Now let’s start with the cut and write down a Cauchy sequence. Pick some and , and construct the sequence as follows. First write down and . Now set . This value will either be in or it won’t. If it is, replace by , and otherwise replace by . Then define as the midpoint between our two left and right points, and again replace either the left or the right point. Keep going, and we see that all future numbers in the sequence are closer to each other than the current and are to each other. And these two always keep moving closer and closer to each other, halving their distance at each step. So the sequence has to be Cauchy. If we picked a different and to start with, we’d get an equivalent sequence. I’ll leave this to you to show.
Notice here that the points in the sequence that lie in are moving steadily upwards towards the cut, and those in are moving steadily downwards towards it. Eventually, the sequence will rise above any point in and fall below any point in , and so if we take this sequence and build a cut from it we will get back the exact same cut we started with. Also, if we build a cut from a Cauchy sequence, and then a sequence from it, we get back an equivalent sequence. Thus we have set up a bijection between the set of cuts and the set of equivalence classes of Cauchy sequences, and we’ve already seen that it preserves the order structure.
Now let’s look at the map from sequences to cuts and verify that it preserves addition and multiplication of positive numbers. This will make the map into an isomorphism of ordered fields, and so both constructions are describing essentially the same thing. So if we have Cauchy sequences and , which give right sets and of rational numbers, then what’s the right set of the sequence ? It’s the set of rational numbers so that is eventually nonnegative. But any such can be broken up as , where and are both eventually nonnegative. That is, is the set of sums of elements of and , and so addition is preserved. The proof for multiplication is essentially the same.
So both methods of extending the real numbers give us essentially the same ordered field, which is thus both complete as a uniform space and Dedekind complete.
There’s another sense in which the rational numbers are lacking and the real numbers fix them up. This one is completely about the order structure on , and will lead to another construction of the real numbers.
Okay, so what’s wrong with the rational numbers now? In any partial order we can consider least upper or greatest lower bounds. That is, given a nonempty set of rational numbers the least upper bound or “supremum” is a rational number so that for all — it’s an upper bound — and if is any upper bound then . Similarly the “infimum” is the greatest lower bound — a lower bound that’s greater than all the other lower bounds.
There’s just one problem: there might not be a supremum of a given set. Even if the set is bounded above, there may be no least upper bound. For instance, the set of all rational numbers so that is bounded above, since is an example of an upper bound, and is another. But no matter what upper bound we have in hand, we can always find a lower number which is still an upper bound for this set. In fact, the upper bound should be , but this isn’t a rational number!
Now we define an ordered field to be “Dedekind complete” if every nonempty set with an upper bound has a least upper bound . Considering the set consisting of the negatives of elements of , every nonempty set with a lower bound will have a least lower bound. The flaw in the rational numbers is that they are not Dedekind complete.
So, in order to complete them, we will use the method of “Dedekind cuts”. Given a nonempty set with any upper bounds at all, the collection of all the upper bounds forms another set , and any element of gives a lower bound for . We then have the collection of all lower bounds of , which we call . Every rational number will be in either or . Given a rational number if there is an with then is a lower bound for , and is thus in . If not, then is an upper bound for and is thus in . However, one number may be in both and . If has a rational supremum then it is in both and . There can be no more overlap.
So we’ve come up with a way of cutting the rational numbers into a pair of sets — the “left set” and “right set” of the “cut” — with for every and . We define a new total order on the set of cuts by if . Every rational number corresponds to one of the cuts which contains a one-point overlap at that rational number, and clearly this inclusion preserves the order on the rational numbers.
Now if I take any nonempty collection of cuts with an upper bound (in the order on the set of cuts) we can take the union of all the and the intersection of all the to get a new cut. The left set contains all the , and it’s contained in any other left set which contains them, so it’s the least upper bound. Thus the collection of cuts is now Dedekind complete.
We can also add field structures to this completion of an ordered field. For this purpose, it will be useful to denote a generic element of by , and assume runs over all elements of wherever it appears. For instance, given cuts and , we define their sum to be . The negative of a cut will be . The product of two positive cuts and will have as its right set the collection of all products , and its left set defined to make this a cut. Finally, the reciprocal of a positive cut will have the set as its right set, and its left set defined to make this a cut.
This suffices to define all the field operations on the set of cuts, and if we start with we get another model of . I’ll leave verification of the field axioms as an exercise, and come back to prove that the method of cuts and the method of Cauchy sequences are equivalent. Once you play with cuts for a while, you may understand why I came at the real numbers with Cauchy sequences first. The cut approach seems to have a certain simplicity, and it’s less ontologically demanding since we’re only ever talking about pairs of subsets of the rational numbers rather than gigantic equivalence classes of sequences. But in the end I always find cuts to be extremely difficult to work with. Luckily, once we’ve shown them to be equivalent to Cauchy sequences it will establish that the real numbers we’ve been talking about are Dedekind complete, and we can put the messiness of this definition behind us.
We’ve defined the real numbers as a topological field by completing the rational numbers as a uniform space, and then extending the field operations to the new points by continuity. Now we extend the order on the rational numbers to make into an ordered field.
First off, we can simplify our work greatly by recognizing that we just need to determine the subset of positive real numbers — those with . Then we can say if . Now, each real number is represented by a Cauchy sequence of rational numbers, and so we say if has a representative sequence with each point .
What we need to check is that the positive numbers are closed under both addition and multiplication. But clearly if we pick and to be nonnegative Cauchy sequences representing and , respectively, then is represented by and is represented by , and these will be nonnegative since is an ordered field.
Now for each , , so . Also, if and , then and , so , and so . These show that defines a preorder on , since it is reflexive and transitive. Further, if and then and , so and thus . This shows that is a partial order. Clearly this order is total because any real number either has a nonnegative representative or it doesn’t.
One thing is a little hazy here. We asserted that if a number and its negative are both greater than or equal to zero, then it must be zero itself. Why is this? Well if is a nonnegative Cauchy sequence representing then represents . Now can we find a nonnegative Cauchy sequence equivalent to ? The lowest rational number that can be is, of course, zero, and so . But for and to be equivalent we must have for each positive rational an so that for . But this just says that converges to !
So is an ordered field, so what does this tell us? First off, we get an absolute value just like we did for the rationals. Secondly, we’ll get a uniform structure as we do for any ordered group. This uniform topology has a subbase consisting of all the half-infinite intervals and for all real . But this is also a subbase for the metric we got from completing the rationals, and so the two topologies coincide!
One more very important thing holds for all ordered fields. As a field is a kind of a ring with unit, and like any ring with unit there is a unique ring homomorphism . Now since in any ordered field, we have , and , and so on, to show that no nonzero integer can become zero under this map. Since we have an injective homomorphism of rings, the universal property of the field of fractions gives us a unique field homomorphism extending the ring homomorphism from the integers.
Now if is complete in the uniform structure defined by its order, this homomorphism will be uniformly complete. Therefore by the universal property of uniform completions, we will find a unique extension . That is, given any (uniformly) complete ordered field there is a unique uniformly continuous homomorphism of fields from the real numbers to the field in question. Thus is the universal such field, which characterizes it uniquely up to isomorphism!
So we can unambiguously speak of “the” real numbers, even if we use a different method of constructing them, or even no method at all. We can work out the rest of the theory of real numbers from these properties (though for the first few we might fall back on our construction) just as we could work out the theory of natural numbers from the Peano axioms.
We’ve defined the topological space we call the real number line as the completion of the rational numbers as a uniform space. But we want to be able to do things like arithmetic on it. That is, we want to put the structure of a field on this set. And because we’ve also got the structure of a topological space, we want the field operations to be continuous maps. Then we’ll have a topological field, or a “field object” (analogous to a group object) in the category of topological spaces.
Not only do we want the field operations to be continuous, we want them to agree with those on the rational numbers. And since is dense in (and similarly is dense in ), we will get unique continuous maps to extend our field operations. In fact the uniqueness is the easy part, due to the following general property of dense subsets.
Consider a topological space with a dense subset . Then every point has a sequence with . Now if and are two continuous functions which agree for every point in , then they agree for all points in . Indeed, picking a sequence in converging to we have
So if we can show the existence of a continuous extension of, say, addition of rational numbers to all real numbers, then the extension is unique. In fact, the continuity will be enough to tell us what the extension should look like. Let’s take real numbers and , and sequences of rational numbers and converging to and , respectively. We should have
but how do we know that the limit on the right exists? Well if we can show that the sequence is a Cauchy sequence of rational numbers, then it must converge because is complete.
Given a rational number we must show that there exists a natural number so that for all . But we know that there’s a number so that for , and a number so that for . Then we can choose to be the larger of and and find
So the sequence of sums is Cauchy, and thus converges.
What if we chose different sequences and converging to and ? Then we get another Cauchy sequence of rational numbers. To show that addition of real numbers is well-defined, we need to show that it’s equivalent to the sequence . So given a rational number does there exist an so that for all ? This is almost exactly the same as the above argument that each sequence is Cauchy! As such, I’ll leave it to you.
So we’ve got a continuous function taking two real numbers and giving back another one, and which agrees with addition of rational numbers. Does it define an Abelian group? The uniqueness property for functions defined on dense subspaces will come to our rescue! We can write down two functions from to defined by and . Since agrees with addition on rational numbers, and since triples of rational numbers are dense in the set of triples of real numbers, these two functions agree on a dense subset of their domains, and so must be equal. If we take the from as the additive identity we can also verify that it acts as an identity real number addition. We can also find the negative of a real number by negating each term of a Cauchy sequence converging to , and verify that this behaves as an additive inverse, and we can show this addition to be commutative, all using the same techniques as above. From here we’ll just write for the sum of real numbers and .
What about the multiplication? Again, we’ll want to choose rational sequences and converging to and , and define our function by
so it will be continuous and agree with rational number multiplication. Now we must show that for every rational number there is an so that for all . This will be a bit clearer if we start by noting that for each rational there is an so that for all . In particular, for sufficiently large we have , so the sequence is bounded above by some . Similarly, given we can pick so that for and get an upper bound for all . Then choosing to be the larger of and we will have
for . Now given a rational we can (with a little work) find and so that the expression on the right will be less than , and so the sequence is Cauchy, as desired.
Then, as for addition, it turns out that a similar proof will show that this definition doesn’t depend on the choice of sequences converging to and , so we get a multiplication. Again, we can use the density of the rational numbers to show that it’s associative and commutative, that serves as its unit, and that multiplication distributes over addition. We’ll just write for the product of real numbers and from here on.
To show that is a field we need a multiplicative inverse for each nonzero real number. That is, for each Cauchy sequence of rational numbers that doesn’t converge to , we would like to consider the sequence , but some of the might equal zero and thus throw us off. However, there can only be a finite number of zeroes in the sequence or else would be an accumulation point of the sequence and it would either converge to or fail to be Cauchy. So we can just change each of those to some nonzero rational number without breaking the Cauchy property or changing the real number it converges to. Then another argument similar to that for multiplication shows that this defines a function from the nonzero reals to themselves which acts as a multiplicative inverse.
The Secret Blogging Seminar is hosting the latest installment of the Carnival of Mathematics. They seem to have done a fair job of collecting links, with more (and more varied) entries than I remember seeing before. Enjoy.
Well, yesterday was given over to exam-writing, so today I’ll pick up a few scraps I mentioned in passing on Thursday.
First of all, the rational numbers are countable. To be explicit, in case I haven’t been before, this means that there is an injective function from the set of rational numbers to the set of natural numbers. Really, I’ll just handle the positive rationals, but it’s straightforward how to include the negatives as well. To every positive rational number we can get a pair of natural numbers — the numerator and the denominator. Then we can send the pair to the number , which is a bijection between the set of all pairs of natural numbers and all natural numbers. Clearly they contain the natural numbers, so the set of rational numbers is countably infinite.
Now, equivalence relations. Given any relation on a set we can build it up into an equivalence relation. First throw in the diagonal to make it reflexive. Then throw in all the points for to make it symmetric. For transitivity, we can similarly start throwing elements into the relation until this condition is satisfied.
But that’s all sort of ugly. Here’s a more elegant way of doing it: given any relation , consider all the relations on that contain — . Some of these will be equivalence relations. In particular, the whole product is an equivalence relation, so there is at least one such. It’s simple to verify that the intersection of any family of equivalence relations on is again an equivalence relation, so we can take the intersection of all equivalence relations on containing to get the smallest such relation. Notice, by the way, how this is similar to generating a topology from a subbase, or to taking the closure of a subset in a topological space.
Finally, absolute values. In any totally ordered group we have the positive “cone” of all elements with . and the negative “cone” of all elements with . In the latter case, we can multiply both sides by the inverse of to get in the positive cone. Notice that the identity is in both cones, but the reflection described leaves it fixed. So for every element in we get a well-defined element called its absolute value. Of course, we often assume that is abelian and write this all additively instead of multiplicatively.
This function has a number of nice properties. First of all, is always in . Secondly, is the identity in if and only if itself is the identity. Thirdly, . And finally, if is abelian we have the “triangle inequality” .
Okay, does that catch us up?