Integrals are Additive Over Regions
Before I state today’s proposition, I need to define what I mean by saying that two sets in are “nonoverlapping”. Intuitively, we might think that this means they have no intersection, but that’s not quite it. We’ll allow some intersection, but only at boundary points. Since the regions we’re interested in for integrals are Jordan measurable, and their boundaries have zero Jordan content, so we know changing things along these boundaries in an integral will make no difference.
Let be a collection of bounded regions in
, so that any two of these regions are nonoverlapping. We define their union
and let be a bounded function defined on this union. Then
is integrable on
if and only if it’s integrable on each
, and we find
Indeed, if is some
-dimensional interval containing
, then it will also contain each of the smaller regions
, and we can define
and
We can use Lebesgue’s condition to establish our assertion about integrability. On the one hand, the discontinuities of in
must be contained within those of
, so if the latter set has measure zero then so must the former. On the other hand, the discontinuities of
consist of those within each of the
, and maybe some along the boundaries. Since the boundaries have measure zero, and we assume that the discontinuities in each
are of measure zero, their (countable) union will also have measure zero. And then so must the set of discontinuities of
in
have measure zero as well.
The inclusion-exclusion principle tells us that we can rewrite the characteristic function of :
We can put this into our integral and use the fact that integrals are additive with respect to finite sums in the integrand
But we assumed that all the are nonoverlapping, so any intersection of two or more of them must lie only along their boundaries. And since these boundaries all have Jordan content zero the integrals over them must come out to zero. We’re left with only the sum over each subregion, as we wanted.
The Mean Value Theorem for Multiple Integrals
As in the single variable case, multiple integrals satisfy a mean value property.
First of all, we should note that, like one-dimensional Riemann-Stieltjes integrals with increasing integrators, integration preserves order. That is, if and
are both integrable over a Jordan-measurable set
, and if
at each point
, then we have
This is a simple consequence of the definition of a multiple integral as the limit of Riemann sums, since every Riemann sum for will be smaller than the corresponding sum for
.
Now if and
are integrable on
and
for every
, then we set
and
— the infimum and supremum of the values attained by
on
. I assert that there is some
in the interval
so that
In particular, we can set and find
giving bounds on the integral in terms of the Jordan content of . Incidentally,
here is serving a similar role to the integrator
in the integral mean value theorem for Riemann-Stieltjes integrals.
Okay, so since we have
for every
. Since integration preserves order, this yields
If the integral of is zero, then our result automatically holds for any value of
. Otherwise we can divide through by this integral and set
which will be between and
.
One particularly useful case is when has Jordan content zero. In this case, we find that any integral over
is itself automatically zero.
Integrals Over More General Sets
To this point we’ve only discussed multiple integrals over -dimensional intervals. But often we’re interested in more general regions, like circles or ellipsoids or even more rectangular solids that are just tilted with respect to the coordinate axes. How can we handle integrating over these more general sets?
One attempt might be to fill up the region from inside. We can chop up regions into smaller pieces, each of which is an -dimensional interval. But overall this requires an incredibly involved limiting process, and we’ll never get anything calculated that way.
Instead, we come at it from outside. Given a bounded region , we can put it into an
-dimensional interval
. Then if
is a function, we can try to define some sort of integral of
over
in terms of its integral over
.
The obvious problem with is that it includes all the points in
that aren’t in
, and we don’t want to include the integral of
over that region. Worse, what if
has a big cluster of discontinuities within
but outside of
? Clearly that shouldn’t make
fail to be integrable over
. What we need is to mask off
, like a stencil or masking tape does when painting.
The mask we’ll use is the characteristic function of , which I’ve mentioned before. I’ll actually go more deeply into them in a bit, but for now we’ll recall that the characteristic function of a set
is written
, and it’s defined as
Now look what happens when we multiply our function by this mask:
Now our function has been redefined outside by setting it equal to zero there. Then we proceed to define
The integral over should be the “integral” over
plus the “integral” over the region
outside
. I put these in quotes because we haven’t really defined what these integrals mean (that’s what we’re trying to do!), but we can make reasonable assertions about properties that they should have, whatever they are.
Now is zero on the outer region, so the second integral is zero. And
is just equal to
inside
, so when “integrating” over
itself we may as well just drop the
factor. This justifies the definition of integrating over
.
It should be clear that this definition doesn’t depend on the interval at all. Indeed, if we have two different intervals
and
which both contain
, then any region the one contains which the other does not must fall outside of
, and
will be zero there anyway, and so will the difference between their integrals.
Now, this definition isn’t without its problems. The clearest of which is that we’ve almost certainly introduced new discontinuities. All around the boundary of , if
wasn’t already zero it suddenly and discontinuously becomes zero when we add our mask. This could cause trouble when trying to integrate the masked function over
. We must ask that
be Jordan measurable, because this will happen if and only if the boundary
has zero Jordan content, and thus zero outer Lebesgue measure. Since the collection of new discontinuities must be contained in this boundary, it will also have measure zero.
This leads us to an integrability criterion over a Jordan measurable set :
will be integrable over
if and only if the discontinuities of
in
form a set of measure zero. Indeed, Lebesgue’s condition tells us that the discontinuities of
must have measure zero. These discontinuities either come from those of
inside
, or from the characteristic function
at the boundary
. By assuming that
is Jordan measurable, we force the second kind to have measure zero, and so the total collection of discontinuities will have measure zero and satisfy Lebesgue’s condition if and only if the discontinuities of
inside
do.
Iterated Integrals IV
So we’ve established that as long as a double integral exists, we can use an iterated integral to evaluate it. What happens when the dimension of our space is even bigger?
In this case, we’re considering integrating an integrable function over some
-dimensional interval
. We want something like iterated integrals to allow us to evaluate this multiple integral. We’ll do this by peeling off a single integral from the outside and leaving an integral over an
-dimensional integral inside.
Specifically, we can project the interval onto the coordinate hyperplane defined by
just by leaving the coordinates
of each point
the same if
and setting
. We’ll call the resulting interval
where the wide hat means that we just leave out that one factor in the product. We’ll also write to mean the remaining coordinates on
.
Essentially, we want to integrate first over , and then let
run from
to
. We have a collection of assertions that parallel those from the two-dimensional case
- If
exists, then we have
with a copy of these three for each index between
and
. The proofs of these are pretty much identical to the proofs in the two-dimensional case, and so I’ll just skip them.
Anyhow, once we’ve picked one of the variables and split it off as the outermost integral, we’re left with an
-dimensional integral on the inside. We can pick any one of these variables and split it off, leaving an
-dimensional integral on the inside, and so on. For each of the
orderings of the original
variables, we get a way of writing the
-dimensional integral over
as a sequence of
integrals, each over a one-dimensional interval. Now, we may find some of these iterated integrals easier to evaluate than others, but in principle, if each of the
-dimensional integrals in the sequence exists it doesn’t matter which of the orderings we use.
So, for example, if we’re considering a bounded function defined on a three-dimensional interval
, we can write (up to) six different iterated integrals, assuming that all the integrals in sight exist.
Iterated Integrals III
I recently heard a characterization (if someone remembers the source, please let me know) of the situation in analysis as being that there are no theorems — only conjectures that don’t yet have counterexamples. Today’s counterexample is adapted from one in Apostol’s book, but it’s far simpler than his seems to be.
We might guess that we can always evaluate double integrals by iterated integrals as we’ve been discussing. After all, that’s exactly what we do in multivariable calculus courses as soon as we introduce iterated integrals, never looking back to all those messy double and triple Riemann sums again. Unfortunately, the existence of the iterated integrals — even if both of them exist and their values agree — is not enough to guarantee that the double integral exists. Today, we will see a counterexample.
Let be the set of points
in the unit square
so that
and
, where
and
are two fractions with the same denominator, each of which are in lowest terms. That is, it contains the point
, but not the point
, since when we write these latter two fractions in lowest terms they are no longer over a common denominator. We will consider the characteristic function
, which is
on points in
and
elsewhere in the unit square.
First, I assert that both iterated integrals exist and have the same value. That is
Indeed, the set is symmetric between the two coordinates, so we only need to evaluate one of these iterated integrals and the other one will automatically have the same value.
If is an irrational number in
, then there is no
at all so that
. Thus we can easily calculate the inner integral
On the other hand, if is a rational number, we can write it in lowest terms as
. Then there are only a finite number of points
having the same denominator at all. Thus we can break the interval
into a finite number of pieces, on each of which the characteristic function has the constant value zero. Thus we can calculate the inner integral
And so we see that for any the inner integral evaluates to
. Then it’s easy to calculate the outer integral
and, as we said before, the other iterated integral also has the value .
On the other hand, the double integral does not exist. Yes, is countable, and so it has measure zero. However, it’s also dense, which means
is discontinuous everywhere in the unit square.
Saying that is dense in the square means that every neighborhood of every point of the square contains some point of
. Indeed, consider a point
in the square and some radius
. Since the real numbers are Archimedean, we can pick some
, and as many people on Nick‘s Twitter experiment (remember to follow @DrMathochist!) reminded us, there are infinitely many prime numbers. Thus we can pick a prime
. Then we can round
up to the next larger fraction of the form
, which will be in lowest terms unless
, in which case we round
down to
. Similarly, we can round
up (or down) to a fraction
in lowest terms. This gives us a new point
, and we can calculate the distance
So there is a point in within any radius
of
.
But now when we try to set up the upper and lower integrals to check Riemann’s condition we find that every subinterval of any partition must contain some points in
and some points not in
. The points within
tell us that the upper sum gets a sample value of
for each subinterval, giving a total upper sum of
. Meanwhile the points outside of
tell us that the lower sum gets a sample value of
for each subinterval, giving a total lower sum of
. Clearly Riemann’s condition fails to hold, and thus the double integral
fails to exist, despite the iterated integrals existing and agreeing.
Iterated Integrals II
Let’s get to proving the assertions we made last time, starting with
where is a bounded function defined on the rectangle
.
We can start by defining
And we easily see that , where
is the supremum of
on the rectangle
, so this is a bounded function as well. Thus the upper integral
and the lower integral
are both well-defined.
Now if is a partition of
, and
is a partition of
, then
is a partition of
into
subrectangles
. We will define
Clearly, we have
and so we find
That is
and, similarly
We also define and
to be the infimum and supremum of
over the rectangle
, which gives us the inequalities
and from here we find
Summing on both and
, and sing the above inequalities, we get
and since this holds for all partitions , the assertion that we’re trying to prove follows.
The second assertion from last time can be proven similarly, just replacing by the lower integral over
. And then the third and fourth assertions are just the same, but interchanging the roles of
and
. Finally, the last assertion is a consequence of the first four. Indeed, if the integral over
exists, then the upper and lower integrals are equal, which collapses all of the inequalities into equalities.
Iterated Integrals I
We may remember from a multivariable calculus class that we can evaluate multiple integrals by using iterated integrals. For example, if is a continuous, nonnegative function on a two-dimensional rectangle
then the integral
measures the volume contained between the graph of the function and the
-
plane within the rectangle. If we fix some constant
between
and
we can calculate the single integral
which describes the area that the plane cuts out of this volume. It exists because because the integrand is continuous as a function of
. In such classes, we make the reasonable assumption that as we vary
this area varies continuously. This gives us a continuous function on
, which will then be integrable:
This is an “iterated integral”, since we perform more than one integral in sequence. We usually leave out the big parens and trust in the notation to tell us when the inner integral is closed. Our handwaving argument then justifies the belief that this iterated integral is the same as the double integral above. And this is true:
but we haven’t really proven it.
Besides, we’re interested in more general situations. What if, say, is discontinuous along the whole line
for some fixed
? This line can be contained in an arbitrarily thin rectangle, so it has outer Lebesgue measure zero in the rectangle
. If these are the only discontinuities, then
is integrable on
, but we can’t follow the above prescription anymore, even if it were actually rigorous. We need some method of handling this sort of thing.
To this end, we have five assertions relating the upper and lower single and double integrals involving a function which is defined and bounded on the rectangle
above. Unfortunately, our notation for upper and lower integrals gets a little cumbersome here, and the
support on WordPress isn’t the most elegant. Still, we soldier on and write
and similarly for upper integrals, and for lower and upper double integrals. Now, our assertions:
- If
exists, then we have
Okay, as ugly as all those are, they’re what we’ll prove next time.
Lebesgue’s Condition
At last we come to Lebesgue’s condition for Riemann-integrability in terms of Lebesgue measure. It asserts, simply enough, that a bounded function defined on an
-dimensional interval
is Riemann integrable on that interval if and only if the set
of discontinuities of
has measure zero. Our proof will go proceed by way of our condition in terms of Jordan content.
As in our proof of this latter condition, we define
and by our earlier condition we know that for all
. In particular, it holds for
for all natural numbers
.
If is a point where
is discontinuous, then the oscillation
must be nonzero, and so
for some
. That is
Since , we also have
, and therefore have
as well.
Conversely, let’s assume that . Given an
, we know that
is a closed set contained in
. From this, we conclude that
. Since this is true for all
, the Jordan content condition holds, and
is Riemann integrable.
Jordan Content Integrability Condition
We wanted a better necessary and sufficient condition for integrability than Riemann’s condition, and now we can give a result halfway to our goal. We let be a bounded function defined on the
-dimensional interval
.
The condition hinges on defining a certain collection of sets. For every we define the set
of points where the oscillation exceeds the threshold value . The first thing to note about
is that it’s a closed set. That is, it should contain all its accumulation points. So let
be such an accumulation point and assume that is isn’t in
, so
. So there must exist a neighborhood
of
so that
and this means that
for any point
, and so none of these points can be in
. But if this is the case, then
can’t be an accumulation point of
, and so we must have
.
And now for our condition. The function is integrable if and only if the Jordan content
for every
.
We start by assuming that for some
, and we’ll show that Riemann’s condition can’t hold. Given a partition
of
we calculate the difference between the upper and lower sums
where is the part of the sum involving subintervals which contain points of
and
is the rest. The intervals in
have total length
, and in these intervals we must have
because if the difference were less then the subinterval would be a neighborhood with oscillation less than and thus couldn’t contain any points in
. Thus we conclude that
, and the difference between the upper and lower sums is at least as big that. This happens no matter what partition we pick, and so the upper and lower integrals must also differ by at least this much, violating Riemann’s condition. Thus if the function is integrable, we must have
.
Conversely, take an arbitrary and assume that
. For this to hold, there must exist a partition
so that
. In each of the subintervals not containing points of
, we have
for all
in the subinterval. Then we know there exists a
so that we can subdivide the subinterval into smaller subintervals each with a diameter less than
, and the oscillation on each of these subintervals will be less than
. We will call this refined partition
.
Now, if is finer than
, we can again write
where again contains the terms from subintervals containing points in
and
is the remainder. In all of these latter subintervals we know the difference between the maximum and minimum values of
is less than
, and so
For , on the other hand, we let
and
be the supremum and infimum of
on all of
, and we find
Thus we conclude that
Since this inequality is valid for any , we see that Riemann’s condition must hold.
From Local Oscillation to Neighborhoods
When we defined oscillation, we took a limit to find the oscillation “at a point”. This is how much the function changes due to its behavior within every neighborhood of a point, no matter how small. If the function has a jump discontinuity at
, for instance, it shows up as an oscillation in
. We now want to investigate to what extent such localized oscillations contribute to the oscillation of
over a spread-out neighborhood of a point.
To this end, let be some bounded function on a compact region
. Given an
, assume that
for every point
. Then there exists a
so that for every closed neighborhood
we have
whenever the diameter of
is less than
. The diameter, incidentally, is defined by
in a metric space with distance function . That is, it’s the supremum of the distance between any two points in
.
Anyhow, for each we have some metric ball
so that
because by picking a small enough neighborhood of we can bring the oscillation over the neighborhood within any positive distance of
. This is where we use the assumption that
.
The collection of all the half-size balls forms an open cover of
. Thus, since
is compact, we have a finite subcover. That is, the half-size balls at some finite collection of points
still covers
. We let
be the smallest of these radii
.
Now if is some closed neighborhood with diameter less than
, it will be partially covered by at least one of these half-size balls, say
. The corresponding full-size ball
then fully covers
. Further, we chose this ball so that the
, and so we have
and we’re done.
