The Unapologetic Mathematician

Mathematics for the interested outsider

Products of Metric Spaces

Shortly we’re going to need a construction that’s sort of interesting in its own right.

We know about products of topological spaces. We can take products of metric spaces, too, and one method comes down to us all the way from Pythagoras.

The famous Pythagorean theorem tells us that in a right triangle the length c of the side opposite the right angle stands in a certain relation to the lengths a and b of the other two sides: c^2=a^2+b^2. So let’s say we’ve got metric spaces (M_1,d_1) and (M_2,d_2). For the moment we’ll think of them as being perpendicular and define a distance function d on M_1\times M_2 by

d((x_1,x_2),(y_1,y_2)=\sqrt{d_1(x_1,y_1)^2+d_2(x_2,y_2)^2}

The quantity inside the radical here must be nonnegative, since it’s the sum of two nonnegative numbers. Since the result needs to be nonnegative, we take the unique nonnegative square root.

Oops, I don’t think I mentioned this before. Since the function f(x)=x^2 has f'(x)=2x as its derivative, it’s always increasing where x is positive. And since we can eventually a square above any real number we choose, its values run from zero all the way up to infinity. Now the same sort of argument as we used to construct the exponential function gives us an inverse sending any nonnegative number to a unique nonnegative square root.

Okay, that taken care of, we’ve got a distance function. It’s clearly nonnegative and symmetric. The only way for it to be zero is for the quantity in the radical to be zero, and this only happens if each of the terms d_1(x_1,y_1) and d_2(x_2,y_2) are zero. But since these are distance functions, that means x_1=y_1 and x_2=y_2, so (x_1,x_2)=(y_1,y_2).

The last property we need is the triangle inequality. That is, for any three pairs (x_1,x_2), (y_1,y_2), (z_1,z_2) we have the inequality

d((x_1,x_2),(z_1,z_2))\leq d((x_1,x_2),(y_1,y_2))+d((y_1,y_2),(z_1,z_2))

Substituting from the definition of d we get the statement

\sqrt{d_1(x_1,z_1)^2+d_2(x_2,z_2)^2}\leq\sqrt{d_1(x_1,y_1)^2+d_2(x_2,y_2)^2}+\sqrt{d_1(y_1,z_1)^2+d_2(y_2,z_2)^2}

The triangle inequalities for d_1 and d_2 tell us that d_1(x_1,z_1)\leq d_1(x_1,y_1)+d_1(y_1,z_1) and d_2(x_2,z_2)\leq d_2(x_2,y_2)+d_1(y_2,z_2). So if we make these substitutions on the left, it increases the left side of the inequality we want. Thus if we can prove the stronger inequality

\begin{aligned}\sqrt{d_1(x_1,y_1)^2+2d_1(x_1,y_1)d_1(y_1,z_1)+d_1(y_1,z_1)^2+d_2(x_2,y_2)^2+2d_2(x_2,y_2)d_2(y_2,z_2)+d_2(y_2,z_2)^2}\\\leq\sqrt{d_1(x_1,y_1)^2+d_2(x_2,y_2)^2}+\sqrt{d_1(y_1,z_1)^2+d_2(y_2,z_2)^2}\end{aligned}

we’ll get the one we really want. Now since squaring preserves the order on the nonnegative reals, we can find this equivalent to

\begin{aligned}d_1(x_1,y_1)^2+2d_1(x_1,y_1)d_1(y_1,z_1)+d_1(y_1,z_1)^2+d_2(x_2,y_2)^2+2d_2(x_2,y_2)d_2(y_2,z_2)+d_2(y_2,z_2)^2\\\leq d_1(x_1,y_1)^2+d_2(x_2,y_2)^2+2\sqrt{d_1(x_1,y_1)^2+d_2(x_2,y_2)^2}\sqrt{d_1(y_1,z_1)^2+d_2(y_2,z_2)^2}+d_1(y_1,z_1)^2+d_2(y_2,z_2)^2\end{aligned}

Some cancellations later:

\begin{aligned}d_1(x_1,y_1)d_1(y_1,z_1)+d_2(x_2,y_2)d_2(y_2,z_2)\\\leq \sqrt{d_1(x_1,y_1)^2d_1(y_1,z_1)^2+d_1(x_1,y_1)^2d_2(y_2,z_2)^2+d_2(x_2,y_2)^2d_1(y_1,z_1)^2+d_2(x_2,y_2)^2d_2(y_2,z_2)^2}\end{aligned}

We square and cancel some more:

2d_1(x_1,y_1)d_1(y_1,z_1)d_2(x_2,y_2)d_2(y_2,z_2)\leq d_1(x_1,y_1)^2d_2(y_2,z_2)^2+d_2(x_2,y_2)^2d_1(y_1,z_1)^2

Moving these terms around we find

\begin{aligned}0\leq\left(d_1(x_1,y_1)d_2(y_2,z_2)\right)^2-2\left(d_1(x_1,y_1)d_2(y_2,z_2)\right)\left(d_2(x_2,y_2)d_1(y_1,z_1)\right)+\left(d_2(x_2,y_2)d_1(y_1,z_1)\right)^2\\=\left(d_1(x_1,y_1)d_2(y_2,z_2)-d_2(x_2,y_2)d_1(y_1,z_1)\right)^2\end{aligned}

So at the end of the day, our triangle inequality is equivalent to asking if a certain quantity squared is nonnegative, which it clearly is!

Now here’s the important thing at the end of all that calculation: this is just one way to get a metric on the product of two metric spaces. There are many other ones which give rise to different distance functions, but the same topology and the same uniform structure. And often it’s the topology that we’ll be most interested in.

In particular, this will give us a topology on any finite-dimensional vector space over the real numbers, but we don’t want to automatically equip that vector space with this norm unless we say so very explicitly. In fact, we don’t even want to make that same assumption about the two spaces being perpendicular to each other. The details of exactly why this is so I’ll leave until we get back to linear algebra, but I want to be clear right now that topology comes for free, but we may have good reason to use different “distances”.

August 19, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 3 Comments

Metric Spaces are Categories!

A guest post by Tom Leinster over at The n-Category Café reminded me of an interesting fact I haven’t mentioned yet: a metric space is actually an example of an enriched category!

First we’ll need to pick out our base category \mathcal{V}, in which we’ll find our hom-objects. Consider the set of nonnegative real numbers with their real-number order, and add in a point called \infty that’s above all the other points. This is a totally ordered set, and orders are categories. Let’s take the opposite of this category. That is, the objects of our category V are the points in the “interval” \left[0,\infty\right], and we have an arrow x\rightarrow y exactly when x\geq y.

This turns out to be a monoidal category, and the monoidal structure is just addition. Clearly this gives a monoid on the set of objects, but we need to check it on morphisms to see it’s functorial. But if x_1\geq y_1 and x_2\geq y_2 then x_1+x_2\geq y_1+y_2, and so we can see addition as a functor.

So we’ve got a monoidal category, and we can now use it to form enriched categories. Let’s keep out lives simple by considering a small \mathcal{V}-category \mathcal{C}. Here’s how the definition looks.

We have a set of objects \mathrm{Ob}(\mathcal{C}) that we’ll call “points” in a set X. Between any two points p_1 and p_2 we need a hom-object \hom_\mathcal{C}(p_1,p_2)\in\mathrm{Ob}(\mathcal{V}). That is, we have a function d:X\times X\rightarrow\left[0,\infty\right].

For a triple (p_1,p_2,p_3) of objects we need an arrow \hom_\mathcal{C}(p_2,p_3)\otimes\hom_\mathcal{C}(p_1,p_2)\rightarrow\hom_\mathcal{C}(p_1,p_3). In more quotidian terms, this means that d(p_2,p_3)+d(p_1,p_2)\geq d(p_1,p_3).

Also, for each point p there is an arrow from the identity object of \mathcal{V} to the hom-object \hom_\mathcal{C}(p,p). That is, 0\geq d(p,p), so d(p,p)=0.

These conditions are the first, fourth, and half of the second conditions in the definition of a metric space! In fact, there’s a weaker notion of a “pseudometric” space, wherein the second condition is simply that d(p,p)=0, and so we’re almost exactly giving the definition of a pseudometric space.

The only thing we’re missing is the requirement that d(p_1,p_2)=d(p_2,p_1). The case can be made (and has been, by Lawvere) that this requirement is actually extraneous, and that it’s in some sense more natural to work with “asymmetric” (pseudo)metric spaces that are exactly those given by this enriched categorical framework.

February 11, 2008 Posted by John Armstrong | Algebra, Category theory, Point-Set Topology, Topology | | 7 Comments

Some theorems about metric spaces

We need to get down a few facts about metric spaces before we can continue on our course. Firstly, as I alluded in an earlier comment, compact metric spaces are sequentially compact — every sequence has a convergent subsequence.

To see this fact, we’ll use the fact that compact spaces are the next best thing to finite. Specifically, in a finite set any infinite sequence would have to hit one point infinitely often. Here instead, we’ll have an accumulation point \xi in our compact metric space X so that for any \epsilon>0 and point x_m in our sequence there is some n\geq m with d_X(x_n,\xi)<\epsilon. That is, though the sequence may move away from \xi, it always comes back within \epsilon of it again. Once we have an accumulation point \xi, we can find a subsequence converging to \xi just as we found a subnet converging to any accumulation point of a net.

Let’s take our sequence and define F_N=\mathrm{Cl}(\{x_n, n\geq N\}) — the closure of the sequence from x_N onwards. Then these closed sets are nested F_1\supseteq F_2\supseteq...\supseteq F_N\supseteq..., and the intersection of any finite number of them is the smallest one, which is clearly nonempty since it contains a tail of the sequence. Then by the compactness of X we see that the intersection of all the F_N is again nonempty. Since the points in this intersection are in the closure of any tail of the sequence, they must be accumulation points.

Okay, that doesn’t quite work. See the comments for more details. Michael asks where I use the fact that we’re in a metric space, which was very astute. It turns out on reflection that I did use it, but it was hidden.

We can still say we’re looking for an accumulation point first and foremost, because if the sequence has an accumulation point there must be some subsequence converging to that point. Why not a subnet in general? Because metric spaces must be normal Hausdorff (using metric neighborhoods to separate
closed sets) and first-countable! And as long as we’re first-countable (or, weaker, “sequential”) we can find a sequence converging to any limit point of a net.

What I didn’t say before is that once we find an accumulation point there will be a subsequence converging to that point. My counterexample is compact, and any sequence in it has accumulation points, but we will only be able to find subnets of our sequence converging to them, not subsequences. Unless we add something to assure that our space is sequential, and metric spaces do that.

We should note in passing that the special case where X is a compact subspace of \mathbb{R}^n is referred to as the Bolzano-Weierstrass Theorem.

Next is the Heine-Cantor theorem, which says that any continuous function f:M\rightarrow N from a compact metric space M to any metric space N is uniformly continuous. In particular, we can use the interval \left[a,b\right] as our compact metric space M and the real numbers \mathbb{R} as our metric space N to see that any continuous function on a closed interval is uniformly continuous.

So let’s assume that f is continuous but not uniformly continuous. Then there is some \epsilon>0 so that for any \delta>0 there are points x and y in M with d_M(x,y)<\delta but d_N(f(x),f(y))\geq\epsilon. In particular, we can pick \frac{1}{n} as our \delta and get two sequences x_n and y_n with d_M(x_n,y_n)<\frac{1}{n} but d_N(f(x),f(y))\geq\epsilon. By the above theorem we can find subsequences x_{n_k} converging to \bar{x} and y_{n_k} converging to \bar{y}.

Now d_X(x_{n_k},y_{n_k})<\frac{1}{n_k}, which converges to {0}, and so \bar{x}=\bar{y}. Therefore we must have d_Y(f(x_{n_k}),f(y_{n_k}) also converging to {0} by the continuity of f. But this can’t happen, since each of these distances must be at least \epsilon! Thus f must have been uniformly continuous to begin with.

January 31, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 7 Comments

The Heine-Borel Theorem

We’ve talked about compact subspaces, particularly of compact spaces and Hausdorff spaces (and, of course, compact Hausdorff spaces). So how can we use this to understand the space \mathbb{R} of real numbers, or higher-dimensional versions like \mathbb{R}^n?

First off, \mathbb{R} is Hausdorff, which should be straightforward to prove. Unfortunately, it’s not compact. To see this, consider the open sets of the form (-x,x) for all positive real numbers x. Given any real number y we can find an x with |y|<x, so y\in(-x,x). Therefore the collection of these open intervals covers \mathbb{R}. But if we take any finite number of them, one will be the biggest, and so we must miss some real numbers. This open cover does not have a finite subcover, and \mathbb{R} is not compact. We can similarly show that \mathbb{R}^n is Hausdorff, but not compact.

So, since \mathbb{R}^n is Hausdorff, any compact subset of \mathbb{R}^n must be closed. But not every closed subset is compact. What else does compactness imply? Well, we can take the proof that \mathbb{R}^n isn’t compact and adapt it to any subset A\subseteq\mathbb{R}^n. We take the collection of all open “cubes” (-x,x)^n consisting of n-tuples of real numbers, each of which is between -x and x, and we form open subsets of A by the intersections U_x=(-x,x)^n\cap A. Now the only way for there to be a finite subcover of this open cover of A is for there to be some x so that U_x=A. That is, every component of every point of A has absolute value less than x, and so we say that A is “bounded”.

We see now that every compact subset of \mathbb{R}^n is closed and bounded. It turns out that being closed and bounded is not only necessary for compactness, but they’re also sufficient! To see this, we’ll show that the closed cube \left[-x,x\right]^n is compact. Then a bounded set A is contained in some such cube, and a closed subset of a compact space is compact. This is the Heine-Borel theorem.

In the n=1 case, we just need to see that the interval \left[-x,x\right] is compact. Take an open cover \{U_i\} of this interval, and define the set S to consist of all y\in\left[-x,x\right] so that a finite collection of the U_i cover \left[-x,y\right]. Then define t to be the least upper bound of S. Basically, t is as far along the interval as we can get with a finite number of sets, and we’re hoping to show that t=x. Clearly it can’t go past x, since S\subseteq\left[-x,x\right]. But can it be less than x?

In fact it can’t, because if it were, then we can find some open set U from the cover that contains t. As an open neighborhood of t, the set U contains some interval (t-\epsilon,t+\epsilon). Then t-\epsilon must be in S, and so there is some finite collection of the U_i which covers \left[-x,t-\epsilon\right]. But then we can just add in U to get a finite collection of the U_i which covers \left[-x,t+\frac{\epsilon}{2}\right], and this contradicts the fact that t is the supremum of S. Thus t=x and there is a finite subcover of \left[-x,x\right], making this closed interval compact!

Now Tychonoff’s Theorem tells us that products of closed intervals are also compact. In particular, the closed cube \left[-x,x\right]^n\subseteq\mathbb{R}^n is compact. And since any closed and bounded set is contained in some such cube, it will be compact as a closed subspace of a compact space. Incidentally, since n is finite, we don’t need to wave the Zorn talisman to get this invocation of the Tychonoff magic to work.

As a special case, we can look back at the one-dimensional case to see that a compact, connected space must be a closed interval \left[a,b\right]. Then we know that the image of a connected space is connected, and that the image of a compact space is compact, so the image of a closed interval under a continuous function f:\mathbb{R}\rightarrow\mathbb{R} is another closed interval.

The fact that this image is an interval gave us the intermediate value theorem. The fact that it’s closed now gives us the extreme value theorem: a continuous, real-valued function f on a closed interval \left[a,b\right] attains a maximum and a minimum. That is, there is some c\in\left[a,b\right] so that f(c)\geq f(x) for all x\in\left[a,b\right], and similarly there is some d\in\left[a,b\right] so that f(d)\leq f(x) for all x\in\left[a,b\right].

January 18, 2008 Posted by John Armstrong | Analysis, Calculus, Point-Set Topology, Topology | | 2 Comments

Tychonoff’s Theorem

One of the biggest results in point-set topology is Tychonoff’s Theorem: the fact that the product of any family \{X_i\}_{i\in\mathcal{I}} of compact spaces is again compact. Unsurprisingly, the really tough bit comes in when we look at an infinite product. Our approach will use the dual definition of compactness.

Let’s say that a collection \mathcal{F} of closed sets has the finite intersection hypothesis if all finite intersections of members of the collection are nonempty, so compactness says that any collection satisfying the finite intersection hypothesis has nonempty intersection. We can then form the collection \Omega=\{\mathcal{F}\} of all collections of sets satisfying the finite intersection hypothesis. This can be partially ordered by containment — \mathcal{F}'\leq\mathcal{F} if every set in \mathcal{F}' is also in \mathcal{F}.

Given any particular collection \mathcal{F} we can find a maximal collection containing it by finding the longest increasing chain in \Omega starting at \mathcal{F}. Then we simply take the union of all these collections to find the collection at its top. This is almost exactly the same thing as we did back when we showed that every vector space is a free module! And just like then, we need Zorn’s lemma to tell us that we can manage the trick in general, but if we look closely at how we’re going to use it we’ll see that we can get away without Zorn’s lemma for finite products.

Anyhow, this maximal collection \mathcal{F} has two nice properties: it contains all of its own finite intersections, and it contains any set which intersects each set in \mathcal{F}. These are both true because if \mathcal{F} didn’t contain one of these sets we could throw it in, make \mathcal{F} strictly larger, and still satisfy the finite intersection hypothesis.

Now let’s assume that \mathcal{F} is a collection of closed subsets of \prod\limits_{i\in\mathcal{I}}X_i satisfying the finite intersection hypothesis. We can then get a maximal collection \mathcal{G} containing \mathcal{F}. Then given an index i\in\mathcal{I} we can consider the collection \{\overline{\pi_i(G)}\}_{G\in\mathcal{G}} of closed subsets of X_i and see that it, too, satisfies the finite intersection hypothesis. Thus by compactness of X_i the intersection of this collection is nonempty. Letting U_i be a closed set containing one of these intersection points x_i, we see that the preimage \pi_i^{-1}(U_i) meets every G\in\mathcal{G}, and so must itself be in \mathcal{G}.

Okay, so let’s take the point x_i for each index and consider the point p in \prod\limits_{i\in\mathcal{I}}X_i with i-th coordinate x_i. Then pick some set V=\prod\limits_{i\in\mathcal{I}}V_i containing p from the base for the product topology. For all but a finite number of the i, V_i=X_i. For those finite number where it’s smaller, the closure of V_i contains the point x_i\in X_i, and so \pi_i^{-1}(V_i) is in \mathcal{G}. So their finite intersection must be nonempty, and so is V itself!

Now, since V is in \mathcal{G}, it must intersect each of the closed sets in the original collection \mathcal{F}. Since the only constraint on V is that it contain p, this point must be a limit point of each of the sets in \mathcal{F}. And because they’re closed, they must contain all of their limit points. Thus the intersection of all the sets in \mathcal{F} is nonempty, and the product space is compact!

January 17, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 3 Comments

The Image of a Compact Space

One of the nice things about connectedness is that it’s preserved under continuous maps. It turns out that compactness is the same way — the image of a compact space X under a continuous map f:X\rightarrow Y is compact.

Let’s take an open cover \{U_i\} of the image f(X). Since f is continuous, we can take the preimage of each of these open sets \{f^{-1}(U_i)\} to get a bunch of open sets in X. Clearly every point of X is the preimage of some point of f(X), so the f^{-1}(U_i) form an open cover of X. Then we can take a finite subcover by compactness of X, picking out some finite collection of indices. Then looking back at the U_i corresponding to these indices (instead of their preimages) we get a finite subcover of f(X). Thus any open cover of the image has a finite subcover, and the image is compact.

January 16, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 3 Comments

Some compact subspaces

Let’s say we have a compact space X. A subset C\subseteq X may not be itself compact, but there’s one useful case in which it will be. If C is closed, then C is compact.

Let’s take an open cover \{F_i\}_{i\in\mathcal{I}} of C. The sets F_i are open subsets of C, but they may not be open as subsets of X. But by the definition of the subspace topology, each one must be the intersection of C with an open subset of X. Let’s just say that each F_i is an open subset of X to begin with.

Now, we have one more open set floating around. The complement of C is open, since C is closed! So between the collection \{F_i\} and the extra set X\setminus C we’ve got an open cover of X. By compactness of X, this open cover has a finite subcover. We can throw out X\setminus C from the subcover if it’s in there, and we’re left with a finite open cover of C, and so C is compact.

In fact, if we restrict to Hausdorff spaces, C must be closed to be compact. Indeed, we proved that if C is compact and X is Hausdorff then any point x\in X\setminus C can be separated from C by a neighborhood U\subseteq X\setminus C. Since there is such an open neighborhood, x must be an interior point of X\setminus C. And since x was arbitrary, every point of X\setminus C is an interior point, and so X\setminus C must be open.

Putting these two sides together, we can see that if X is compact Hausdorff, then a subset C\subseteq X is compact exactly when it’s closed.

January 15, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 1 Comment

Compact Spaces

An amazingly useful property for a space X is that it be “compact”. We define this term by saying that if \{U_i\}_{i\in\mathcal{I}} is any collection of open subsets of X indexed by any (possibly infinite) set \mathcal{I} so that their union \bigcup\limits_{i\in\mathcal{I}}U_i is the whole of X — the sexy words are “open cover” — then there is some finite collection of the index set \mathcal{A}\subseteq\mathcal{I} so that the union of this finite number of open sets \bigcup\limits_{i\in\mathcal{A}}U_i still contains all of X — the sexy words are “has a finite subcover”.

So why does this matter? Well, let’s consider a Hausdorff space X, a point x\in X, and a finite collection of points A\subseteq X. Given any point a\in A, we can separate x and a by open neighborhoods x\in U_a and a\in V_a, precisely because X is Hausdorff. Then we can take the intersection U=\bigcap\limits_{a\in A}U_a and the union V=\bigcup\limits_{a\in A}V_a. The set U is a neighborhood of X, since it’s a finite intersection of neighborhoods, while the set V is a neighborhood of A. These two sets can’t intersect, and so we have separated x and A by neighborhoods.

But what if A is an infinite set? Then the infinite intersection \bigcap\limits_{a\in A}U_a may not be a neighborhood of x! Infinite operations sometimes cause problems in topology, but compactness can make them finite. If A is a compact subset of X, then we can proceed as before. For each a\in A we have open neighborhoods x\in U_a and a\in V_a, and so A\subseteq\bigcup\limits_{a\in A}V_a — the open sets V_a form a cover of A. Then compactness tells us that we can pick a finite collection A'\subseteq A so that the union V=\bigcup\limits_{a\in A'}V_a of that finite collection of sets still covers A — we only need a finite number of the V_a to cover A. The finite intersection U=\bigcap\limits_{a\in A'}U_a will then be a neighborhood of x which doesn’t touch V, and so we can separate any point x\in X and any compact set A\subseteq X by neighborhoods.

As an exercise, do the exact same thing again to show that in a Hausdorff space X we can separate any two compact sets A\subseteq X and B\subseteq X by neighborhoods.

In a sense, this shows that while compact spaces may be infinite, they sometimes behave as nicely as finite sets. This can make a lot of things simpler in the long run. And just like we saw for connectivity, we are often interested in things behaving nicely near a point. We thus define a space to be “locally compact” if every point has a neighborhood which is compact (in the subspace topology).

There’s an equivalent definition in terms of closed sets, which is dual to this one. Let’s say we have a collection \{F_i\}_{i\in\mathcal{I}} of closed subsets of X so that the intersection of any finite collection of the F_i is nonempty. Then I assert that the intersection of all of the F_i will be nonempty as well if X is compact. To see this, assume that the intersection is empty:
\bigcap\limits_{i\in\mathcal{I}}F_i=\varnothing
Then the complement of this intersection is all of X. We can rewrite this as the union of the complements of the F_i:
X=\bigcup\limits_{i\in\mathcal{I}}F_i^c
Since we’re assuming X to be compact, we can find some finite subcollection \mathcal{A}\subseteq\mathcal{I} so that
X=\bigcup\limits_{i\in\mathcal{A}}F_i^c
which, taking complements again, implies that
\bigcap\limits_{i\in\mathcal{A}}F_i=\varnothing
but we assumed that all of the finite intersections were nonempty!

Now turn this around and show that if we assume this “finite intersection property” — that if all finite intersections of a collection of closed sets F_i are nonempty, then the intersection of all the F_i are nonempty — then we can derive the first definition of compactness from it.

January 14, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 4 Comments

Separation Axioms

Now that we have some vocabulary about separation properties down we can talk about properties of spaces as a whole, called the separation axioms.

First off, we say that a space is T_0 if every two distinct points can be topologically distinguished. This fails, for example, in the trivial topology on a set X if X has at least two points, because every point has the same collection of neighborhoods — \mathcal{N}(x)=\{X\} for all points x\in X. As far as the topology is concerned, all the points are the same. This turns out to be particularly interesting in conjunction with other separation axioms, since we often will have one axiom saying that a property holds for all distinct points, and another saying that the property holds for all topologically distinguishable points. Adding T_0 turns the latter version into the former.

Next, we say that a space is R_0 if any two topologically distinguishable points are separated. That is, we never have a point x in the closure of the singleton set \{y\} without the point y being in the closure of \{x\}. Adding T_0 to this condition gives us T_1. A T_1 space is one in which any two distinct points are not only topologically distinguishable, but separated. In particular, we can see that the singleton set \{x\} is closed, since its closure can’t contain any other points than x itself.

A space is R_1 if any two topologically distinguishable points are separated by neighborhoods. If this also holds for any pair of distinct points we say that the space is T_2, or “Hausdorff”. This is where most topologists start to feel comfortable, though the topologies that arise in algebraic geometry are usually non-Hausdorff. To a certain extent (well, to me at least) Hausdorff spaces feel a lot more topologically natural and intuitive than non-Hausdorff spaces, and you almost have to try to construct pathological spaces to violate this property. Back in graduate school, some of us adapted the term to apply more generally, as in “That guy Steve is highly non-Hausdorff.”

One interesting and useful property of Hausdorff spaces is that the image of the diagonal map \Delta:X\rightarrow X\times X defined by \Delta(x)=(x,x) is closed. To see this, notice that it means the complement of the image is open. That is, if (x,y) is a pair of points of X with x\neq y then we can find an open neighborhood containing the point (x,y) consisting only of pairs (z,w) with z\neq w. In fact, we have a base for the product topology on X\times X consisting of products two open sets in X. That is, we can pick our open neighborhood of (x,y) to be the set of all pairs (z,w) with z\in Z and w\in W, where Z is an open subset of X containing x and W is an open subset containing y. To say that this product doesn’t touch the diagonal means that Z\cap W=\varnothing, which is just what it means for x and y to be separated by neighborhoods!

We can strengthen this by asking that any two distinct points are separated by closed neighborhoods. If this holds we say the space is T_{2\frac{1}{2}}. There’s no standard name for the weaker version discussing topologically distinguishable points. Stronger still is saying that a space is “completely Hausdorff” or completely T_2, which asks that any two distinct points be separated by a function.

A space X is “regular” if given a point x\in X and a closed subset C\subseteq X with x\notin C we can separate \{x\} and C by neighborhoods. This is a bit stronger than being Hausdorff, where we only asked that this hold for two singletons. For regular spaces, we allow one of the two sets we’re separating to be any closed set. If we add on the T_0 condition we’re above T_1, and so singletons are just special closed sets anyhow, but we’re strictly stronger than regularity now. We call this condition T_3.

As for Hausdorff, we say that a space is completely regular if we can actually separate \{x\} and C by a function. If we take a completely regular space and add T_0, we say it’s T_{3\frac{1}{2}}, or “completely regular Hausdorff”, or “Tychonoff”.

We say a space is “normal” if any two disjoint closed subsets are separated by neighborhoods. In fact, a theorem known as Urysohn’s Lemma tells us that we get for free that they’re separated by a function as well. If we add in T_1 (not T_0 this time) we say that it is “normal Hausdorff”, or T_4.

A space is “completely normal” if any two separated sets are separated by neighborhoods. Adding in T_1 we say that the space is “completely normal Hausdorff”, or T_5.

Finally, a space is “perfectly normal” if any two disjoint closed sets are precisely separated by a function. Adding T_1 makes the space “perfectly normal Hausdorff”, or T_6.

The Wikipedia entry here is rather informative, and has a great schematic showing which of the axioms imply which others. Most of these axioms I won’t be using, but it’s good to have them out here in case I need them.

January 11, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 5 Comments

Separation Properties

There’s a whole list of properties of topological spaces that we may want to refer to called the separation axioms. Even when two points are distinct elements of the underlying set of a topological space, we may not be able to tell them apart with topological techniques. Points are separated if we can tell them apart in some way using the topology. Today we’ll discuss various properties of separation, and tomorrow we’ll list some of the more useful separation axioms we can ask that a space satisfy.

First, and weakest, we say that points x and y in a topological space X are “topologically distinguishable” if they don’t have the same collection of neighborhoods — if \mathcal{N}(x)\neq\mathcal{N}(y). Now maybe one of the collections of neighborhoods strictly contains the other: \mathcal{N}(x)\subseteq\mathcal{N}(y). In this case, every neighborhood of x is a neighborhood of y. a forteriori it contains a neighborhood of y, and thus contains y itself. Thus the point x is in the closure of the set \{y\}. This is really close. The points are topologically distinguishable, but still a bit too close for comfort. So we define points to be “separated” if each has a neighborhood the other one doesn’t, or equivalently if neither is in the closure of the other. We can extend this to subsets larger than just points. We say that two subsets A and B are separated if neither one touches the closure of the other. That is, A\cap\bar{B}=\varnothing and \bar{A}\cap B=\varnothing.

We can go on and give stronger conditions, saying that two sets are “separated by neighborhoods” if they have disjoint neighborhoods. That is, there are neighborhoods U and V of A and B, respectively, and U\cap V=\varnothing. Being a neighborhood here means that U contains some open set S which contains A and V contains some open set T which contains B, and so the closure of A is contained in the open set, and thus in U. Similarly, the closure of B must be contained in V.. We see that the closure of B is contained in the complement of S, and similarly the closure of A is in the complement of T, so neither A nor B can touch the other’s closure. Stronger still is being “separated by closed neighborhoods”, which asks that U and V be disjoint closed neighborhoods. These keep A and B even further apart, since these neighborhoods themselves can’t touch each other’s closures.

The next step up is that sets be “separated by a function” if there is a continuous function f:X\rightarrow\mathbb{R} so that for every point a\in A we have f(a)=0, and for every point b\in B we have f(b)=1. In this case we can take the closed interval \left[-1,\frac{1}{3}\right] whose preimage must be a closed neighborhood of A by continuity. Similarly we can take the closed interval \left[\frac{2}{3},2\right] whose preimage is a closed neighborhood of B. Since these preimages can’t touch each other, we have separated A and B by closed neighborhoods. Stronger still is that A and B are “precisely separated by a function”, which adds the requirement that only points from A go to {0} and only points from B go to 1.

This list of separation

January 10, 2008 Posted by John Armstrong | Point-Set Topology, Topology | | 5 Comments