Indefinite Integrals and Convergence I
Let’s see how the notion of an indefinite integral plays with sequences of simple functions in the norm.
If is a mean Cauchy sequence of integrable simple functions, and if each
has indefinite integral
, then the limit
exists for all measurable sets
. Indeed, for each
we have a sequence of real numbers
. We compare
and find that since the sequence of simple functions is mean Cauchy the sequence of real numbers
is Cauchy. And thus it must converge to a limiting value, which we define to be
. In fact, the convergence is uniform, since the last step of our inequality had nothing to do with the particular set
!
Now, this set function is finite-valued as the uniform limit of a sequence of finite-valued functions. Since limits commute with finite sums, and since each
is finitely additive, we see that
is finitely additive as well; it turns out that it’s actually countable additive.
If is a disjoint sequence of measurable sets whose (countable) union is
, then for every pair of positive integers
and
the triangle inequality tells us that
Choosing a large enough we can make the first and third terms arbitrarily small, and then we can choose a large enough
to make the second term arbitrarily small. And thus we establish that
We can say something about the sequence of set functions : each of them is — as an indefinite integral — absolutely continuous, but in fact the sequence is uniformly absolutely continuous. That is, for every
there is a
independent of
so that
for every measurable set
with
.
Let be a sufficiently large integer so that for
we have
which exists by the fact that is mean Cauchy. Then we can pick a
so that
for all and
with
. We know that such a
exists for each
by absolute continuity, and so we just pick the smallest of them for
.
This will then work for all
, but what if
? Well, then we can write
and so the same works for all
as well.
The L¹ Norm
We can now introduce a norm to our space of integrable simple functions, making it into a normed vector space. We define
Don’t worry about that little dangling off of the norm, or why we call this the “
norm”. That will become clear later when we generalize.
We can easily verify that and that
, using our properties of integrals. The catch is that
doesn’t imply that
is identically zero, but only that
almost everywhere. But really throughout our treatment of integration we’re considering two functions that are equal a.e. to be equivalent, and so this isn’t really a problem —
implies that
is equivalent to the constant zero function for our purposes.
Of course, a norm gives rise to a metric:
and this gives us a topology on the space of integrable simple functions. And with a topology comes a notion of convergence!
We say that a sequence of integrable functions is “Cauchy in the mean” or is “mean Cauchy” if
as
and
get arbitrarily large. We won’t talk quite yet about convergence because our situation is sort of like the one with rational numbers; we have a sense of when functions are getting close to each other, but most of these mean Cauchy sequences actually don’t converge within our space. That is, the normed vector space is not a Banach space.
However we can say some things about this notion of convergence. For one, a sequence that is Cauchy in the mean is Cauchy in measure as well. Indeed, for any
we can define the sets
And then we find that
As and
get arbitrarily large, the fact that the sequence is mean Cauchy tells us that the left hand side of this inequality gets pushed down to zero, and so the right hand side must as well.
This notion of convergence will play a major role in our study of integration.
Indefinite Integrals
We can use integrals to make a set function out of any integrable function
.
We call this set function the “indefinite integral” of , and it is defined for all measurable subsets
. This isn’t quite the same indefinite integral that we’ve worked with before. In that case we only considered functions
, picked a base-point
, and defined a new function
on the domain. In our new language, we’d write
, so the two concepts are related but they’re not quite the same.
Anyhow, the indefinite integral is “absolutely continuous”. That is, for every
there is a
so that
for all measurable
with
. Indeed, if
is an upper bound for
, then we can show that
And so if we make small enough we can keep
small.
Further, an indefinite integral is countably additive. Indeed, if
is a characteristic function then countable additivity of
follows immediately from countable additivity of
. And countable additivity for general simple functions
is straightforward by writing each such function as a finite linear combination of characteristic functions.
With the exception of this last step, nothing we’ve said today depends on the function being simple, and so once we generalize our basic linearity and order properties we will immediately have absolutely continuous indefinite integrals.
More Properties of Integrals
Today we will show more properties of integrals of simple functions. But the neat thing is that they will follow from the last two properties we showed yesterday. And so their proofs really have nothing to do with simple functions. We will be able to point back to this post once we establish the same basic linearity and order properties for the integrals of wider and wider classes of functions.
First up: if and
are integrable simple functions with
a.e. then
Indeed, the function is nonnegative a.e., and so we conclude that
Next, if and
are integrable simple functions then
Here we use the triangle inequality and invoke the previous result.
Now, if is an integrable simple function then
The absolute value is greater than both
and
, and so we find
which implies the inequality we asserted.
As a heuristic, this last result is sort of like the triangle inequality to the extent that the integral is like a sum; adding inside the absolute value gives a smaller result than adding outside the absolute value. However, we have to be careful here; the integral we’re working with is not the limit of a sum like the Riemann integral was. In fact, we have no reason yet to believe that this integral and the Riemann integral have all that much to do with each other. But that shouldn’t stop us from using this analogy to remember the result.
Finally, if is an integrable simple function,
is a measurable set, and
and
are real numbers so that
for almost all
, then
Indeed, the assumed inequality is equivalent to the assertion that a.e., and so — as long as
— we conclude that
which is equivalent to the above. On the other hand, if , then
must be zero on all but a portion of
of finite measure or else it wouldn’t be integrable. Thus, in order for the assumed inequalities to hold, we must have
and
. The asserted inequalities are then all but tautological.
Basic Properties of Integrable Simple Functions
We want to nail down a few basic properties of integrable simple functions. We define two simple functions and
to work with.
First of all, if is simple then the scalar multiple
is simple for any real number
. Indeed,
, and so the exact same sets
must have finite measure for both
and
to be integrable. It’s similarly easy to see that if
and
are both integrable, then
is integrable. Thus the integrable simple functions form a linear subspace of the vector space of all simple functions.
Now if is integrable then the product
is integrable whether or not
is. We can write
If each has finite measure, then so does each
, whether each
does or not. Thus we see that the integrable simple functions form an ideal of the algebra of all simple functions.
We can use this to define the integral of a function over some range other than all of . If
is an integrable simple function and
is a measurable set, then
is again an integrable simple function. We define the integral of
over
as
This has the effect of leaving the same on
and zeroing it out away from
. Thus the integral over the rest of the space
contributes nothing.
The next two properties are easy to prove for integrable simple functions, but they’re powerful. Other properties of integration will be proven in terms of these properties, and so when we widen the class of functions under consideration we’ll just have to reprove these two. The ones we will soon consider will immediately have proofs parallel to those for simple functions.
Not only is the function integrable, but we know its integral:
Indeed, if you were paying attention yesterday you’d have noticed that we said we wanted integration to be linear, but we never really showed that it was. But it’s not really complicated: the expression represents
as a simple function, and it’s clear that the formula holds as stated.
Almost as obvious is the fact that if is nonnegative a.e., then
. Indeed in any representation of
as a simple function, any term
corresponding to a negative
must have
or else
wouldn’t be nonnegative almost everywhere. But then the term
contributes nothing to the integral! Every other term has a nonnegative
and a nonnegative measure
, and thus every term in the integral is nonnegative. This is the basis for all the nice order properties we will find for the integral.
Integrating Simple Functions
We start our turn from measure in the abstract to applying it to integration, and we start with simple functions. In fact, we start a bit further back than that even; the simple functions are exactly the finite linear combinations of characteristic functions, and so we’ll start there.
Given a measurable set , there’s an obvious way to define the integral of the characteristic function
: the measure
! In fact, if you go back to the “area under the curve” definition of the Riemann integral, this makes sense: the graph of
is a “rectangle” (possibly in many pieces) with one side a line of length
and the other “side” the set
. Since
is our notion of the “size” of
, the “area” will be the product of
and
. And so we define
That is, the integral of the characteristic function with respect to the measure
is
. Of course, this only really makes sense if
.
Now, we’re going to want our integral to be linear, and so given a linear combination we define the integral
Again, this only really makes sense if all the associated to nonzero
have finite measure. When this happens, we call our function
“integrable”.
Since every simple function is a finite linear combination of characteristic functions, we can always use this to define the integral of any simple function. But there might be a problem: what if we have two different representations of a simple function as linear combinations of characteristic functions? Do we always get the same integral?
Well, first off we can always choose an expression for so that the
are disjoint. As an example, say that we write
, where
and
overlap. We can rewrite this as
. If
is integrable, then
and
both have finite measure, and so
is subtractive. Thus we can verify
Thus given any representation the corresponding disjoint representation gives us the same integral.
But what if we have two different disjoint representations and
? Our function can only take a finite number of nonzero values
. We can define
to be the (measurable) set where
takes the value
. For any given
, we can consider all the
so that
. The corresponding sets
must be a disjoint partition of
, and additivity tells us that the sum of these
is equal to
. But the same goes for the
corresponding to values
. And so both our representations give the same integral as
. Everything in sight is linear, so this is all very straightforward.
At the end of the day, the integral of any simple function is well-defined so long as all the preimage
of each nonzero value
has a finite measure. Again, we call these simple functions “integrable”.
Convergence in Measure and Algebra
Unlike our other methods of convergence, it’s not necessarily apparent that convergence in measure plays nicely with algebraic operations on the algebra of measurable functions. All our other forms are basically derived from pointwise convergence, and so the limit laws clearly hold; but it takes some work to see that the same is true for convergence in measure. So, for the rest of this post assume that and
are sequences of finite-valued measurable functions converging in measure to
and
, respectively.
First up: if and
are real constants, then
converges in measure to
. Indeed, we find that
Thus if and
, then
. That is,
Since and
converge in measure to
and
, we can control the size of each of these sets by choosing a sufficiently large
, and thus
converges in measure to
.
Next, if a.e., the sequence
converges in measure to
. Indeed, the set
differs negligibly from the set
. This, in turn, is exactly the same as the set
, which differs negligibly from the set
. Thus control on the measure of one of these sets is control on all of them.
Now we’ll add the assumption that the whole space is measurable, and that
is finite (that is, the measure space is “totally finite”). This will let us conclude that the sequence
converges in measure to
. As the constant
increases, the measurable set
gets larger and larger, while its complement
gets smaller and smaller; this complement is measurable because
is measurable.
In fact, the measure of the complement must decrease to zero, or else we’d have some set of positive measure on which is larger than any finite
, and thus
on a set of positive measure. But then
couldn’t converge to
in measure. Since
is totally finite, the measure
must start at some finite value and decrease to zero; if
were infinite, these measures might all be infinite. And so for every
there is some
so that
.
In particular, we can pick a so that
. On
, then, we have
. Convergence in measure tells us that we can pick a large enough
so that
has measure less than as well. The set
must be contained between these two sets, and thus will have measure less than
for sufficiently large
.
Now we can show that converges in measure to
for any
, not just ones that are a.e. zero. We can expand
, and thus rewrite
. Our first result shows that
converges to
, and our second result then shows that
also converges to
. Our third result shows that
converges to
. We use our first result to put everything together again and conclude that
converges to
as we asserted.
And finally we can show that converges in measure to
. We can use the same polarization trick as we’ve used before. Write
; we’ve just verified that the squares converge to squares, and we know that linear combinations also converge to linear combinations, and so
converges in measure to
.
Convergence in Measure II
Sorry, I forgot to post this before I left this morning.
The proposition we started with yesterday shows us that on a set of finite measure, a.e. convergence is equivalent to convergence in measure, and a sequence is Cauchy a.e. if and only if it’s Cauchy in measure. We can strengthen it slightly by removing the finiteness assumption, but changing from a.e. convergence to almost uniform convergence: almost uniform convergence implies convergence in measure. Indeed, if
converges to
almost uniformly then for any two positive real numbers
and
there is a measurable set
with
and
for all
and sufficiently large
. Thus we can make the set
where
and
are separated as small as we like, as required by convergence in measure.
We also can show some common-sense facts about sequences converging and Cauchy in measure. First, if converges in measure to
, then
is Cauchy in measure. We find that
because if both and
are within
of the same number
, then they’re surely within
of each other. And so if we have control on the measures of the sets on the right, we have control of the measure of the set on the left.
Secondly, if also converges in measure to
, then it only makes sense that
and
should be “the same”. It wouldn’t do for a convergence method to have many limits for a convergent sequence. Of course, this being measure theory, “the same” means a.e. we have
. But this uses almost the same relation:
Since we can make each of the sets on the right arbitrarily small by choosing a large enough , we must have
for every
; this implies that
almost everywhere.
Slightly deeper, if is a sequence of measurable functions that is Cauchy in measure, then there is some subsequence
which is almost uniformly Cauchy. For every positive integer
we find some integer
so that if
We define , and
to be the larger of
or
, to make sure that
is a strictly increasing sequence of natural numbers. We also define
If then for every x not in
we have
That is, the subsequence is uniformly Cauchy on the set
. But we also know that
and so is almost uniformly Cauchy, as asserted.
Finally, we can take this subsequence that is almost uniformly Cauchy, and see that it must be a.e. Cauchy. We write at all
where this sequence converges. And then for every
,
The measure of the first set on the right is small for sufficiently large and
by the assumption that
is Cauchy in measure. The measure of the second approaches zero because almost uniform convergence implies convergence in measure.
And thus we conclude that if is Cauchy in measure, then there is some measurable function
to which
converges in measure. The topology of convergence in measure may not come from a norm, but it is still complete.
Convergence in Measure I
Suppose that and all the
(for positive integers
) are real-valued measurable functions on a set
of finite measure. For every
we define
That is, is the set where the value of
is at least
away from the value of
. I say that
converges a.e. to
if and only if
for every .
Given a point , the sequence
fails to converge to
if and only if there is some positive number
so that
for infinitely many values of
. That is, if
is the set of points where
doesn’t converge to
, then
Of course, if is to converge to
a.e., we need
. A necessary and sufficient condition is that
for all
. Then we can calculate
Our necessary and sufficient condition is thus equivalent to the one we stated at the outset.
We’ve shown that over a set of finite measure, a.e. convergence is equivalent to this other condition. Extracting it a bit, we get a new notion of convergence which will (as we just showed) be equivalent to a.e. convergence over sets of finite measure, but may not be in general. We say that a sequence of a.e. finite-valued measurable functions “converges in measure” to a measurable function
if for every
we have
Now, it turns out that there is no metric which gives this sense of convergence, but we still refer to a sequence as being “Cauchy in measure” if for every we have
Almost Uniform Convergence
From the conclusion of Egoroff’s Theorem we draw a new kind of convergence. We say that a sequence of a.e. finite-valued measurable functions converges to the measurable function
“almost uniformly” if for every
there is a measurable set
with
so that the sequence
converges uniformly to
on
.
We have to caution ourselves that this is not almost everywhere uniform convergence, which would be a sequence that converges uniformly once we cut out a subset of measure zero. Our example sequence yesterday converges almost uniformly to the constant zero function, but it doesn’t converge almost everywhere uniformly. Maybe “nearly uniform” would be better, but “almost uniformly” has become standard.
Now we can restate Egoroff’s Theorem to say that on a set of finite measure, a.e. convergence implies almost uniform convergence. Conversely, if is a sequence of functions (on any measure space) that converges to
almost uniformly, then it converges pointwise almost everywhere.
We start by picking a set for every positive integer
so that
, and so that
converges uniformly to
on
. We set
and conclude that , since
for all
. If
, then there must be some
for which
. Since
converges to
uniformly on
, we conclude that
converges to
. Thus
converges pointwise to
except on the set
of measure zero.
