The Unapologetic Mathematician

Mathematics for the interested outsider

Coordinate Vectors Span Tangent Spaces

Given a point p in an n-dimensional manifold M, we have the vector space \mathcal{T}_pM of tangent vectors at p. Given a coordinate patch (U,x) around p, we’ve constructed n coordinate vectors at p, and shown that they’re linearly independent in \mathcal{T}_pM. I say that they also span the space, and thus constitute a basis.

To see this, we’ll need a couple lemmas. First off, if f is constant in a neighborhood of p, then v(f)=0 for any tangent vector v. Indeed, since all that matters is the germ of f, we may as well assume that f is the constant function with value c. By linearity we know that v(c)=cv(1). But now since 1=1\cdot1 we use the derivation property to find

\displaystyle v(1)=v(1\cdot1)=v(1)1(p)+1(p)v(1)=2v(1)

and so we conclude that v(1)=v(c)=0.

In a slightly more technical vein, let U be a “star-shaped” neighborhood of 0\in\mathbb{R}^n. That is, not only does U contain 0 itself, but for every point x\in U it contains the whole segment of points tx for 0\leq t\leq1. An open ball, for example, is star-shaped, so you can just think of that to be a little simpler.

Anyway, given such a U and a differentiable function f on it we can find n functions \psi_i with \psi_i(0)=D_if(0), and such that we can write

\displaystyle f=f(0)+\sum\limits_{i=1}^nu^i\psi_i

where u^i is the ith component function.

If we pick a point x\in U we can parameterize the segment c(t)=tx, and set \phi=f\circ c to get a function on the unit interval \phi:[0,1]\to\mathbb{R}. This function is clearly differentiable, and we can calculate

\displaystyle\phi'(t)=\sum\limits_{i=1}^n\frac{\partial f}{\partial x^i}\frac{dc^i}{dt}=\sum\limits_{i=1}^nD_if(tx)x^i

using the multivariable chain rule. We find

\displaystyle\begin{aligned}f(x)-f(0)&=\phi(1)-\phi(0)\\&=\int\limits_0^1\phi'(t)\,dt\\&=\sum\limits_{i=1}^nx^i\int\limits_0^1D_if(tx)\,dt\end{aligned}

We can thus find the desired functions \psi_i by setting

\displaystyle\psi_i(x)=\int\limits_0^1D_if(tx)\,dt

Now if we have a differentiable function f defined on a neighborhood U of a point p\in M, we can find a coordinate patch (U,x) — possibly by shrinking U — with x(p)=0 and x(U) star-shaped. Then we can apply the previous lemma to f\circ x^{-1} to get

\displaystyle f\circ x^{-1}=f(p)+\sum\limits_{i=1}^nu^i\psi_i

with \psi_i(0)=\left[\frac{\partial}{\partial x^i}(p)\right](f). Moving the coordinate map to the other side we find

\displaystyle f=f(p)+\sum\limits_{i=1}^nx^i\left(\psi_i\circ x\right)

Now we can hit this with a tangent vector v

\displaystyle\begin{aligned}v(f)&=v\left(f(p)+\sum\limits_{i=1}^nx^i\left(\psi_i\circ x\right)\right)\\&=v\left(f(p)\right)+\sum\limits_{i=1}^nv\left(x^i\left(\psi_i\circ x\right)\right)\\&=\sum\limits_{i=1}^n\left(v(x^i)\left[\psi_i\circ x\right](p)+x^i(p)v\left(\psi_i\circ x\right)\right)\\&=\sum\limits_{i=1}^nv(x^i)\psi_i(0)\\&=\sum\limits_{i=1}^nv(x^i)\left[\frac{\partial}{\partial x^i}(p)\right](f)\end{aligned}

where we have used linearity, the derivation property, and the first lemma above. Thus we can write

\displaystyle v=\sum\limits_{i=1}^nv(x^i)\frac{\partial}{\partial x^i}(p)

and the coordinate vectors span the space of tangent vectors at p.

As a consequence, we conclude that \mathcal{T}_pM always has dimension n — exactly the same dimension as the manifold itself. And this is exactly what we should expect; if M is n-dimensional, then in some sense there are n independent directions to move in near any point p, and these “directions to move” are the core of our geometric notion of a tangent vector. Ironically, if we start from a more geometric definition of tangent vectors, it’s actually somewhat harder to establish this fact, which is partly why we’re starting with the more algebraic definition.

March 31, 2011 Posted by | Differential Topology, Topology | 8 Comments

Tangent Vectors and Coordinates

Let’s say we have a coordinate patch (U,x) around a point p in an n-dimensional manifold M. We can use the function f to give us some tangent vectors at p called the “coordinate vectors”.

We define the coordinate vector \frac{\partial}{\partial x^i}(p)\in\mathcal{T}_pM as follows: given a smooth function f\in\mathcal{O}_p, we define

\displaystyle\left[\frac{\partial}{\partial x^i}(p)\right](f)=\left[D_i(f\circ x^{-1})\right](x(p))

Okay, I know that that’s confusing. But all we mean is this: start with a function f:U\to\mathbb{R}. We compose it with the inverse of the coordinate map to get f\circ x^{-1}:x(U)\to\mathbb{R}, where x(U) is some open neighborhood of the point x(p)\in\mathbb{R}^n. Now we can take that ith partial derivative of this function and evaluate it at the point x(p).

The first thing we really should check is that it doesn’t matter which representative f we pick. That is, if f=g in some neighborhood of p, do we get the same answer? Indeed, in that case f\circ x^{-1}=g\circ x^{-1} in some neighborhood of x(p), and so their partial derivatives are identical. Thus this operation only depends on the germ f\in\mathcal{O}_p.

But is it a tangent vector? It’s easy to see that it’s a linear functional, so we just have to check that it’s a derivation at p:

\displaystyle\begin{aligned}\left[\frac{\partial}{\partial x^i}(p)\right](fg)=&\left[D_i((fg)\circ x^{-1})\right](x(p))\\=&\left[D_i((f\circ x^{-1})(g\circ x^{-1}))\right](x(p))\\=&\left[D_i(f\circ x^{-1})\right](x(p))\left[g\circ x^{-1}\right](x(p))\\&+\left[f\circ x^{-1}\right](x(p))\left[D_i(g\circ x^{-1})\right](x(p))\\=&\left[\frac{\partial}{\partial x^i}(p)\right](f)g(p)+f(p)\left[\frac{\partial}{\partial x^i}(p)\right](g)\end{aligned}

And so we have at least these n vectors at each point p\in M. We can even tell that they much be distinct — and even linearly independent — since we can calculate

\displaystyle\begin{aligned}\left[\frac{\partial}{\partial x^i}(p)\right](x^j)&=\left[\frac{\partial}{\partial x^i}(p)\right](\pi^j\circ x)\\&=\left[D_i(\pi^j\circ x\circ x^{-1})\right](x(p))\\&=\left[D_i(\pi_j)\right](x(p))\end{aligned}

where \pi^j is the jth coordinate projection \pi^j:\mathbb{R}^n\to\mathbb{R}. But we know that D_i(\pi^j) is always and everywhere \delta_i^j — it takes the value 1 if i=j and 0 otherwise.

Thus \frac{\partial}{\partial x^i}(p) takes a different value on x^i than on all the other x^j. Further, any linear combination of the \frac{\partial}{\partial x^j}(p) for j\neq i must take the value 0 on x^i, while \frac{\partial}{\partial x^i}(p) takes the value 1; we see that none of the coordinate vectors can be written as a linear combination of the rest, and conclude that the dimension of \mathcal{T}_pM is at least n.

March 30, 2011 Posted by | Differential Topology, Topology | 8 Comments

Tangent Vectors at a Point

Tangent vectors are a very important concept in differential geometry, and they’re one of the biggest stumbling blocks in comprehension. There are two major approaches: one more geometric, and one more algebraic. I find the algebraic approach a bit more satisfying, since it gets straight into the important properties of tangent vectors and how they are used, and it helps set the stage for tangent vectors in other contexts like algebraic geometry. Unfortunately, it’s not at all clear at first what this definition means geometrically, and why these things deserve being called “tangent vectors”. So I have to ask a little patience.

Now, we take a manifold M with structure sheaf \mathcal{O}. We pick some point p\in M and get the stalk \mathcal{O}_p of germs of functions at p. This is a real algebra, and we define a “tangent vector at p” to be a “derivation at p” of this algebra. That is, v is a function v:\mathcal{O}_p\to\mathbb{R} satisfying

\displaystyle\begin{aligned}v(cf+dg)&=cv(f)+dv(g)\\v(fg)&=v(f)g(p)+f(p)v(g)\end{aligned}

The first of these conditions says that v is a linear functional on \mathcal{O}_p. It’s the second that’s special: it tells us that v obeys something like the product rule.

Indeed, let’s take a point x\in\mathbb{R} and consider the operation D_x defined by D_x(f)=f'(x) for any function f that is differentiable at x. This is linear, since both the derivative and evaluation operations are linear. The product rule tells us that

\displaystyle\begin{aligned}D_x(fg)&=\left[fg\right]'(x)\\&=f'(x)g(x)+f(x)g'(x)\\&=D_x(f)g(x)+f(x)D_x(g)\end{aligned}

So D_x satisfies the definition of a “tangent vector at x“. Indeed, as it turns out D_x corresponds to what we might normally consider the vector based at x pointing one unit in the positive direction.

It should immediately be clear that the tangent vectors at p form a vector space. Indeed, the sum of two tangent vectors at p is firstly the sum of two linear functionals, which is again a linear functional. To see that it also satisfies the “derivation” condition, let v and w be tangent vectors at p and check

\displaystyle\begin{aligned}\left[v+w\right](fg)&=v(fg)+w(fg)\\&=v(f)g(p)+f(p)v(g)+w(f)g(p)+f(p)w(g)\\&=\left(v(f)+w(f)\right)g(p)+f(p)\left(v(g)+w(g)\right)\\&=\left[v+w\right](f)g(p)+f(p)\left[v+w\right](g)\end{aligned}

Checking that scalar multiples of tangent vectors at p are again tangent vectors at p is similar. We write \mathcal{T}_pM to denote this vector space of tangent vectors at p to the manifold M.

I want to call attention to one point of notation here, and I won’t really bother with it again. We seem to be using each of f and g to refer to two different things: a germ in \mathcal{O}_p — which is an equivalence class of sorts — and some actual function in \mathcal{O}(U) for some neighborhood U of p which represents the germ. To an extent we are, and the usual excuse is that since we only ever evaluate the function at p itself, it doesn’t really matter which representative of the germ we pick.

However, a more nuanced view will see that we’ve actually overloaded the notation f(p). Normally this would mean evaluating a function at a point, yes, but here we interpret it in terms of the local ring structure of \mathcal{O}_p. Given a germ f\in\mathcal{O}_p there is a projection \mathcal{O}_p\to\mathbb{R}, which we write as f\mapsto f(p).

If all this seems complicated, don’t really worry about it. You can forget the whole last paragraph and get by on “sometimes we use a germ as if it’s an actual function defined in a neighborhood of p, and it will never matter which specific representative function we use because we only ever ask what happens at p itself.”

March 29, 2011 Posted by | Differential Topology, Topology | 13 Comments

Local Rings

Sorry for the break last Friday.

As long as we’re in the neighborhood — so to speak — we may as well define the concept of a “local ring”. This is a commutative ring which contains a unique maximal ideal. Equivalently, it’s one in which the sum of any two noninvertible elements is again noninvertible.

Why are these conditions equivalent? Well, if we have noninvertible elements r_1 and r_2 with r_1+r_2 invertible, then these elements generate principal ideals (r_1) and (r_2). If we add these two ideals, we must get the whole ring, for the sum contains r_1+r_2, and so must contain 1, and thus the whole ring. Thus (r_1) and (r_2) cannot both be contained within the same maximal ideal, and thus we would have to have two distinct maximal ideals.

Conversely, if the sum of any two noninvertible elements is itself noninvertible, then the noninvertible elements form an ideal. And this ideal must be maximal, for if we throw in any other (invertible) element, it would suddenly contain the entire ring.

Why do we care? Well, it turns out that for any manifold M and point p\in M the algebra \mathcal{O}_p of germs of functions at p is a local ring. And in fact this is pretty much the reason for the name “local” ring: it is a ring of functions that’s completely localized to a single point.

To see that this is true, let’s consider which germs are invertible. I say that a germ represented by a function f:U\to\mathbb{R} is invertible if and only if f(p)\neq0. Indeed, if f(p)=0, then f is certainly not invertible. On the other hand, if f(p)\neq0, then continuity tells us that there is some neighborhood V of p where f(p)\neq0. Restricting f to this neighborhood if necessary, we have a representative of the germ which never takes the value zero. And thus we can define a function g(q)=\frac{1}{f(q)} for q\in V, which represents the multiplicative inverse to the germ of f.

With this characterization of the invertible germs in hand, it should be clear that any two noninvertible germs represented by f_1 and f_2 must have f_1(p)=f_2(p)=0. Thus f_1(p)+f_2(p)=0, and the germ of f_1+f_2 is again noninvertible. Since the sum of any two noninvertible germs is itself noninvertible, the algebra \mathcal{O}_p of germs is local, and its unique maximal ideal \mathfrak{m}_p consists of those functions which vanish at p.

Incidentally, we once characterized maximal ideals as those for which the quotient R/I is a field. So which field is it in this case? It’s not hard to see that \mathcal{O}_p/\mathfrak{m}_p\cong\mathbb{R} — any germ is sent to its value at p, which is just a real number.

March 28, 2011 Posted by | Differential Topology, Ring theory, Structure of Rings, Topology | 3 Comments

Germs of Functions

Let’s take the structure sheaves we defined last time and consider the stalks at a point p\in M. It turns out that since we’re working with sheaves of \mathbb{R}-algebras, we can sort of shortcut the messy limit process.

As before, given some open neighborhood U of p, we let \mathcal{O}_M(U) be the algebra of smooth functions — as smooth as M is itself — on U. Now we define \mathcal{Z}_{M,p}(U) to be the ideal of those functions which vanish on some neighborhood of p. Then we define the quotient

\displaystyle\mathcal{O}_{M,p}(U)=\mathcal{O}_M(U)/\mathcal{Z}_{M,p}(U)

Notice that we have effectively pushed our limiting process into the definition of the ideal \mathcal{Z}_{M,p}(U), where for each open neighborhood V\subseteq U of p we get an ideal of functions vanishing on V. The ideal we care about is the union over all such neighborhoods V, and the process of taking this union is effectively a limit.

Anyhow, there’s still the possibility that this depends on the U from which we started. But this is actually not the case; we get a uniquely defined algebra \mathcal{O}_{M,p}=\mathcal{O}_{M,p}(U) no matter which neighborhood U of p we start from.

Indeed, I say that there is an isomorphism \mathcal{O}_{M,p}(M)\to\mathcal{O}_{M,p}(U). In the one direction, this is simply induced by the restriction map \mathcal{O}_M(M)\to\mathcal{O}_M(U) — if two functions are equal on some neighborhood of p in M, then they’re certainly equal on some neighborhood of p in U. And this restriction is just as clearly injective, since if two functions are equivalent in \mathcal{O}_{M,p}(U) then they must agree on some neighborhood of p, which means they were already equivalent in \mathcal{O}_{M,p}(M).

The harder part is showing that this map is surjective, and thus an isomorphism. But given U, let V be an open neighborhood of p whose closure is contained in U — we can find one since U must contain a neighborhood of p homeomorphic to a ball in \mathbb{R}^n, and we can certainly find V within such a neighborhood. Anyhow, we know that there exists a bump function \phi which is identically 1 on V and supported within U. We can thus define a smooth function g\in\mathcal{O}_M(M) on all of M by setting g(q)=\phi(q)f(q) inside U and g(q)=0 elsewhere. Since f and g agree on the neighborhood V of p, they are equivalent in \mathcal{O}_{M,p}(U), and thus every equivalence class in \mathcal{O}_{M,p}(U) has a representative coming from \mathcal{O}_{M,p}(M).

We write the stalk as \mathcal{O}_{M,p}, or sometimes \mathcal{O}_p if the manifold M is clear from context, and we call the equivalence classes of functions in this algebra “germs” of functions. Thus a germ subsumes not just the value of a function at a point p, but is behavior in an “infinitesimal neighborhood” around p. Some authors even call the structure sheaf of a manifold — especially a complex analytic manifold (which we haven’t really discussed yet) — the “sheaf of germs” of functions on the manifold, which is a little misleading since the germs properly belong to the stalks of the sheaf. Luckily, this language is somewhat outmoded.

March 24, 2011 Posted by | Differential Topology, Topology | 5 Comments

Sheaves of Functions on Manifolds

Now that we’ve talked a bunch about presheaves and sheaves in general, let’s talk about some particular sheaves of use in differential topology. Given a smooth manifold — for whatever we choose smooth to mean — we can define sheaves of real algebras of real-valued functions for every less-stringent definition of smoothness.

In the first case of a bare topological manifold M, we have no real sense of differentiability at all, and so it only makes sense to talk about continuous real-valued functions f:U\to\mathbb{R}. Given an open set U\subseteq M we let \mathcal{O}^0_M(U) be the \mathbb{R}-algebra of real-valued functions that are defined and continuous on U.

Next, if M is a C^1 manifold, then it not only makes sense to talk about continuous real-valued functions — we can define \mathcal{O}^0_M just as above — but we can also talk about differentiable real-valued functions. Given an open set U\subseteq M, we let \mathcal{O}^1_M(U) be the \mathbb{R}-algebra of continuously-differentiable real valued functions f:U\to\mathbb{R}.

As we increase the smoothness of M, we can consider smoother and smoother functions. If M is a C^k manifold, we can define \mathcal{O}^j_M — the sheaf of j-times continuously-differentiable functions. Given an open set U, we let \mathcal{O}^j_M(U) be the \mathbb{R}-algebra of real-valued functions on U with j continuous derivatives.

Continuing up the latter, if M is a C^\infty manifold, then we can define all of the above sheaves, along with the sheaf \mathcal{O}^\infty_M of infinitely-differentiable functions. And if M is analytic, we can also define the sheaf of analytic functions.

In each case, I’m not going to bother going through the proof that we actually do get sheaves. The core idea is that continuity, differentiability, and analyticity are notions defined locally, point-by-point. Thus if we restrict the domain of such a function we get another function of the same kind, and pasting together functions that agree on their overlaps preserves smoothness. This doesn’t hold, however, for global notions like boundedness — it’s easy to define a collection of functions on an open cover of \mathbb{R}, each of which is bounded, which define an unbounded function when pasted together.

For each class of manifolds, the sheaf of the smoothest functions we can define has a special place. If M is in class C^k — where k can be 0, any finite whole number, \infty, or \omega — then the sheaf \mathcal{O}^k_M is often just written \mathcal{O}_M, and is called the “structure sheaf” of M. It turns out that most, if not all, of the geometrical properties of M are actually bound up within its structure sheaf, and so this is a very important object of study indeed.

March 23, 2011 Posted by | Differential Topology, Topology | 2 Comments

The Stalks of a Presheaf

One more construction we’ll be interested in is finding the “stalk” \mathcal{F}_x of a presheaf \mathcal{F} over a point x. We want to talk about how a presheaf behaves at a single point, but a single point is almost never an open set, so we need to be a bit creative.

The other thing to be careful is that we’re actually not concerned about behavior at a single point. Indeed, considering the sheaf of continuous functions on a space X, we see that at any one point the function is just a real number. What’s interesting is how the function behaves in an infinitesimal neighborhood around the point.

The answer is to use the categorical definition of a limit. Given a point x\in X the collection \mathrm{Subset}(X)_x of open neighborhoods of x form a directed set, and we can take the limit \varinjlim_{\mathrm{Subset}(X)_x}\mathcal{F}.

Again, we’d like to understand this in more concrete terms, for when \mathcal{F} is a set, or a set with some algebraic structure attatched. It turns out that if we unpack all the category theory — basically using the existence theorem — it’s not really that bad.

An element of the stalk \mathcal{F}_x is an element of \mathcal{F}(U) for some neighborhood U of x. Two elements are considered equivalent if they agree on some common neighborhood of x. That is, if we have f\in\mathcal{F}(U) and g\in\mathcal{F}(V), and if there is some W\subseteq U\cap V so that f\vert_W=g\vert W, then we consider f and g to be the same element of \mathcal{F}_x. They don’t have to be the same everywhere, but so long as they become the same when restricted to a sufficiently small neighborhood of x, they’re effectively the same.

Our usual category-theoretical juggling can now reassure us that the stalks of a sheaf of groups are groups, the stalks of a sheaf of rings are rings, and so on, all using this same set-theoretic definition.

March 22, 2011 Posted by | Topology | 3 Comments

The Direct Image Functor

So far our morphisms only let us compare presheaves and sheaves on a single topological space X. In fact, we have a category \mathrm{Sh}(X) of sheaves (of sets, by default) on X. But there are also constructions that involve more than one space. The direct image functor is a way of pushing forward a sheaf structure along a continuous map. It’s relatively simple and we may find it useful, so let’s just get it out of the way now.

So, let’s say we have two topological spaces X and Y, and a continuous function f:X\to Y. I say that if \mathcal{F}\in\mathrm{Sh}(X) is a sheaf on X, then we can define a “direct image” sheaf f_*\mathcal{F} on Y in a natural way. Indeed, given an open set V\subseteq Y, we know that its preimage f^{-1}(V)\subseteq X is an open subset of X. And so it only makes sense to define f_*\mathcal{F}(V)=\mathcal{F}(f^{-1}(V)).

Now you might be thinking, “wait, if the canonical example of a sheaf is a sheaf of functions, shouldn’t we be pulling back?” And this does make a certain amount of sense: given a function on all of Y we can define a function on all of X by composing it with f, so it seems that “functions pull back” naturally. It would seem to make sense for us to take a function g defined on an open subset V\subseteq Y, compose it with f, and put the resulting g\circ f into the set corresponding to f^{-1}(V).

But while this defines sets for all these preimages, not all preimages are of this form! Indeed, we have no idea how to define the elements of an “inverse image” sheaf over a set U where f(U) is not itself open, and there is no guarantee at all that it will be. There is a way to remedy this problem, using a method called “sheafification”, but that’s a more involved subject I’d rather not dig into quite yet.

March 21, 2011 Posted by | Topology | 2 Comments

Mappings Between Presheaves

As ever, we want our objects of study to be objects in some category, and presheaves (and sheaves) are no exception. But, luckily, this much is straightforward.

Remember that we ended up defining a presheaf as a functor. Given our topological space X we set up the partial order category \mathrm{Subset}(X), flipped it around to \mathrm{Subset}(X)^\mathrm{op} so the arrows pointed the opposite way, and then said a presheaf of sets is a functor \mathcal{F}:\mathrm{Subset}(X)^\mathrm{op}\to\mathbf{Set}. So the natural home for them is the functor category \mathbf{Set}^{\mathrm{Subset}(X)^\mathrm{op}}, where the morphisms are natural transformations.

So what does this mean for our usual case where we consider presheaves of sets, or of sets equipped with some algebraic structure? Well, it means that we map from one presheaf \mathcal{F} to another one \mathcal{G} by picking a map for each and every open set: \phi_U:\mathcal{F}(U)\to\mathcal{G}(U). But these maps must be compatible with the restrictions: if V\subseteq U then we must have \phi_V\circ\cdot\vert_V=\cdot\vert_V\circ\phi_U. That is, given an element in \mathcal{F}(U), we can either first restrict it to V and then map it by \phi_V to \mathcal{G}(V), or we can first map it by \phi_U to \mathcal{G}_U and then restrict the result to V. In either case, we should get the same answer.

March 19, 2011 Posted by | Topology | 1 Comment

Sheaves

For the moment we will be more concerned with presheaves, but we may as well go ahead and define sheaves. These embody the way that not only can we restrict functions to localize them to smaller regions, but we can “glue together” local functions on small domains to define functions on larger domains. This time, let’s start with the fancy category-theoretic definition.

For any open cover \{U_i\}_{i\in I} of an open set U, we can set up the following diagram:

\displaystyle\mathcal{F}(U)\to\prod\limits_{i\in I}\mathcal{F}(U_i)\rightrightarrows\prod\limits_{(i,j)\in I\times I}\mathcal{F}(U_i\cap U_j)

Let’s talk about this as if we’re dealing with a sheaf of sets, to make more sense of it. Usually our sheaves will be of sets with extra structure, anyway. The first arrow on the left just takes an element of \mathcal{F}(U), restricts it to each of the U_i, and takes the product of all these restrictions. The upper arrow on the right takes an element of \mathcal{F}(U_i) and restricts it to each intersection U_i\cap U_j. Doing this for each U_i we get a map from the product over i\in I to the product over all pairs (i,j). The lower arrow is similar, but it takes an element in \mathcal{F}(U_j) and restricts it to each intersection U_i\cap U_j. This may look the same, but the difference in whether the original set was the first or the second in the intersection makes a difference, as we shall see.

Now we say that a presheaf \mathcal{F} is a sheaf if and only if this diagram is an equalizer for every open cover U_i. For it to be an equalizer, first the arrow on the left must be a monomorphism. In terms of sets, this means that if we take two elements s\in\mathcal{F}(U) and t\in\mathcal{F}(U) so that s\vert_{U_i}=t\vert_{U_i} for all $latex U_i, then s=t. That is, elements over U are uniquely determined by their restrictions to any open cover.

The other side of the equalizer condition is that the image of the arrow on the left consists of exactly those products in the middle for which the two arrows on the right give the same answer. More explicitly, let’s say we have an s_i\in\mathcal{F}(U_i) for each U_i, and let’s further assume that these elements agree on their restrictions. That is, we ask that s_i\vert_{U_i\cap U_j}=s_j\vert_{U_i\cap U_j}. If this is true for all pairs (i,j), then the product \left(s_i\right)_{i\in I} takes the same value under either arrow on the right. Thus it must be in the image of the arrow on the left — there must be some s\in\mathcal{F}(U) so that s\vert_{U_i}=s_i. In other words, as long as the local elements s_i\in\mathcal{F}(U_i) “agree” where their domains overlap, we can “glue them together” to give an element s\in\mathcal{F}(U).

Again, the example to keep in mind is that of continuous real-valued functions. If we have a continuous function f_U:U\to\mathbb{R} and another continuous function f_V:V\to\mathbb{R}, and if f_U(x)=f_V(x) for all x\in U\cap V, then we can define f:U\cup V\to\mathbb{R} by “gluing” these functions together over their common overlap: f(x)=f_U(x) if x\in U, f(x)=f_V(x) if x\in V, and it doesn’t matter which we choose when x\in U\cap V because both functions give the same value there.

So, a sheaf is a presheaf where we can glue together elements over small domains so long as they agree when restricted to their intersections, and where this process defines a unique element over the larger, “glued-together” domain.

March 17, 2011 Posted by | Topology | 12 Comments