
Roman numerals


The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
In previous posts, we reduced the problem to showing that if , then
is always positive, where
is the cumulative distribution function of the standard normal distribution. If we can prove this, then the original problem will be true.
Motivated by the graph of , I thought of a two-step method for showing
must be positive: show that
is an increasing function, and show that
. If I could prove both of these claims, then that would prove that
must always be positive.
I was able to show the second step by demonstrating that, if ,
.
As discussed in the last post, the limit follows from this equality. However, I just couldn’t figure out the first step.
So I kept trying.
And trying.
And trying.
Until it finally hit me: I’m working too hard! The goal is to show that is positive. Clearly, clearly, the right-hand side of the last equation is positive! So that’s the entire proof for
… there was no need to prove that
is increasing!
For , it’s even easier. If
is non-negative, then
.
So, in either case, must be positive. Following the logical thread in the previous posts, this demonstrates that
, so that
, thus concluding the solution.
And I was really annoyed at myself that I stumbled over the last step for so long, when the solution was literally right in front of me.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
In previous posts, we reduced the problem to showing that if , then
is always positive, where
is the cumulative distribution function of the standard normal distribution. If we can prove this, then the original problem will be true.
When I was solving this problem for the first time, my progress through the first few steps was hindered by algebra mistakes and the like, but I didn’t doubt that I was progressing toward the answer. At this point in the solution, however, I was genuinely stuck: nothing immediately popped to mind for showing that must be greater than
.
So I turned to Mathematica, just to make sure I was on the right track. Based on the graph, the function of certainly looks positive.
What’s more, the graph suggests attempting to prove a couple of things: is an increasing function, and
or, equivalently,
. If I could prove both of these claims, then that would prove that
must always be positive.
I started by trying to show
.
I vaguely remembered something about the asymptotic expansion of the above integral from a course decades ago, and so I consulted that course’s textbook, by Bender and Orszag, to refresh my memory. To derive the behavior of as
, we integrate by parts. (This is permissible: the integrands below are well-behaved if
, so that
is not in the range of integration.)
.
This is agonizingly close: the leading term is as expected. However, I was stuck for the longest time trying to show that the second term goes to zero as
.
So, once again, I consulted Bender and Orszag, which outlined how to show this. We note that
.
Therefore,
,
so that
.
Therefore,
,
or
.
So (I thought) I was halfway home with the solution, and all that remained was to show that was an increasing function.
And I was completely stuck at this point for a long time.
Until I realized — much to my utter embarrassment — that showing was increasing was completely unnecessary, as discussed in the next post.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
We suppose that ,
,
, and
. With these definitions, we may write
and
, where
and
are independent standard normal random variables.
The goal is to show that . In previous posts, we showed that it will be sufficient to show that
, where
and
. We also showed that
, where
and
is the cumulative distribution function of the standard normal distribution.
To compute
,
we showed in the two previous posts that
and
.
Therefore,
.
To show that , it suffices to show that the second term must be positive. Furthermore, since the denominator of the second term is positive, it suffices to show that
must also be positive.
And, to be honest, I was stuck here for the longest time.
At some point, I decided to plot this function in Mathematica to see if I get some ideas flowing:

The function certainly looks like it’s always positive. What’s more, the graph suggests attempting to prove a couple of things: is an increasing function, and
. If I could prove both of these claims, then that would prove that
must always be positive.
Spoiler alert: this was almost a dead-end approach to the problem. I managed to prove one of them, but not the other. (I don’t doubt it’s true, but I didn’t find a proof.) I’ll discuss in the next post.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
We suppose that ,
,
, and
. With these definitions, we may write
and
, where
and
are independent standard normal random variables.
The goal is to show that . In previous posts, we showed that it will be sufficient to show that
, where
and
. We also showed that
, where
and
is the cumulative distribution function of the standard normal distribution.
To compute
,
we showed in the previous post that
.
We now turn to the second conditional expectation:
.
The expected value in the numerator is a double integral:
,
where is the joint probability density function of
and
. Since
and
are independent,
is the product of the individual probability density functions:
.
Therefore, we must compute
,
where I wrote for the event
.
I’m not above admitting that I first stuck this into Mathematica to make sure that this was doable. To begin, we compute the inner integral:
we begin by using integration by parts on the inner integral:
Therefore,
.
The second term is equal to since the double integral is
. For the first integral, we complete the square as before:
.
I now rewrite the integrand so that has the form of the probability density function of a normal distribution, writing and multiplying and dividing by
in the denominator:
.
This is an example of making a problem easier by apparently making it harder. The integrand has the probability density function of a normally distributed random variable with
and
. Therefore, the integral is equal to
, so that
,
.
Therefore,
.
We note that this reduces to what we found in the second special case: if , then
and
, so that
, matching what we found earlier.
In the next post, we consider the calculation of .
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
We suppose that ,
,
, and
. With these definitions, we may write
and
, where
and
are independent standard normal random variables.
The goal is to show that . In the previous two posts, we showed that it will be sufficient to show that
, where
and
. We also showed that
, where
and
is the cumulative distribution function of the standard normal distribution.
To compute
,
we begin with
.
The expected value in the numerator is a double integral:
,
where is the joint probability density function of
and
. Since
and
are independent,
is the product of the individual probability density functions:
.
Therefore, we must compute
,
where I wrote for the event
.
I’m not above admitting that I first stuck this into Mathematica to make sure that this was doable. To begin, we compute the inner integral:
.
At this point, I used a standard technique/trick of completing the square to rewrite the integrand as a common pdf.
.
I now rewrite the integrand so that has the form of the probability density function of a normal distribution, writing and multiplying and dividing by
in the denominator:
.
This is an example of making a problem easier by apparently making it harder. The integrand is equal to , where
is a normally distributed random variable with
and
. Since
, we have
,
and so
.
We note that this reduces to what we found in the second special case: if ,
, and
, then
,
, and
. Since
, we have
,
matching what we found earlier.
In the next post, we consider the calculation of .
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
We suppose that ,
,
, and
. With these definitions, we may write
and
, where
and
are independent standard normal random variables.
Based on the experience of the special cases, it seems likely that I’ll eventually need to integrate over the joint probability density function of and
. However, it’s a bit easier to work with standard normal random variables than general ones, and so I’d like to rewrite in terms of
and
to whatever extent is possible.
As it turns out, the usual scaling and shifting properties of variance apply to a conditional variance on any event . The event that we have in mind, of course, is
. As discussed in the previous post, this can be rewritten as
, where
and
.
We are now ready to derive the scaling and shift properties for . We begin by using the definition
.
Let’s examine the unconditional expectations and
. First,
,
and so
.
Next,
,
and so
.
Therefore,
.
So, not surprisingly, .
Also, the ultimate goal is to show that is less than
, where
is the event
or, equivalently,
. We see that it will be sufficient to show that
.
We start the calculations of this conditional variance in the next post.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
Not quite knowing how to start, I decided to begin by simplifying the problem and assume that both and
follow a standard normal distribution, so that
and
. After solving this special case, I then made a small generalization by allowing
to be arbitrary. Solving these two special cases boosted my confidence that I would eventually be able to tackle the general case, which I start to consider with this post.
We suppose that ,
,
, and
. The goal is to show that
.
Based on the experience of the special cases, it seems likely that I’ll eventually need to integrate over the joint probability density function of and
. However, it’s a bit easier to work with standard normal random variables than general ones, and so I wrote
and
, where
and
are independent standard normal random variables.
We recall that
,
and so we’ll have to compute . We switch to
and
:
,
where we define and
for the sake of simplicity. Since
and
are independent, we know that
will also be a normal random variable with
and
.
Therefore, converting to standard units,
,
where and
is the cumulative distribution function of the standard normal distribution.
We already see that the general case is more complicated than the two special cases we previously considered, for which was simply equal to
.
In future posts, we take up the computation of and
.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
Not quite knowing how to start, I decided to begin by simplifying the problem and assume that both and
follow a standard normal distribution, so that
and
. This doesn’t solve the original problem, of course, but I hoped that solving this simpler case might give me some guidance about tackling the general case. I solved this special case in the previous post.
Next, to work on a special case that was somewhere between the general case and the first special case, I kept as a standard normal distribution but changed
to have a nonzero mean. As it turned out, this significantly complicated the problem (as we’ll see in the next post), and I got stuck.
So I changed course: for a second attempt, I kept as a standard normal distribution but changed
so that
and
, where
could be something other than 1. The goal is to show that
.
We begin by computing . The denominator is straightforward: since
and
are independent normal random variables, we also know that
is normally distributed with
. (Also,
, but that’s really not needed for this problem.) Therefore,
since the distribution of
is symmetric about its mean of
.
Next, I wrote , where
has a standard normal distribution. Then
,
where we have used the joint probability density function for the independent random variables and
. The region of integration is
, matching the requirement
. The inner integral can be directly evaluated:
.
At this point, I rewrote the integrand to be the probability density function of a random variable:
.
The integrand is the probability density function of a normal random variable with mean 0 and variance , and so the integral must be equal to 1. We conclude
.
Next, we compute the other conditional expectation:
.
The inner integral can be computed using integration by parts:
.
Therefore,
.
We could calculate the first integral, but we can immediately see that it’s going to be equal to 0 since the integrand is an odd function. The double integral is equal to
, which we’ve already shown is equal to
. Therefore,
.
We conclude that
,
which is indeed less than 1. If , we recover the conditional variance found in the first special case.
After tackling these two special cases, we start the general case with the next post.
The following problem appeared in Volume 131, Issue 9 (2024) of The American Mathematical Monthly.
Let
and
be independent normally distributed random variables, each with its own mean and variance. Show that the variance of
conditioned on the event
is smaller than the variance of
alone.
I admit I did a double-take when I first read this problem. If and
are independent, then the event
contains almost no information. How then, so I thought, could the conditional distribution of
given
be narrower than the unconditional distribution of
?
Then I thought: I can believe that is greater than
: if we’re given that
, then we know that
must be larger than something. So maybe it’s possible for
to be less than
.
Still, not quite knowing how to start, I decided to begin by simplifying the problem and assume that both and
follow a standard normal distribution, so that
and
. This doesn’t solve the original problem, of course, but I hoped that solving this simpler case might give me some guidance about tackling the general case. I also hoped that solving this special case might give me some psychological confidence that I would eventually be able to solve the general case.
For the special case, the goal is to show that
.
We begin by computing . The denominator is straightforward: since
and
are independent normal random variables, we also know that
is normally distributed with
. (Also,
, but that’s really not needed for this problem.) Therefore,
since the distribution of
is symmetric about its mean of
.
Next,
,
where we have used the joint probability density function for and
. The region of integration is
, taking care of the requirement
. The inner integral can be directly evaluated:
.
At this point, I used a standard technique/trick of integration by rewriting the integrand to be the probability density function of a random variable. In this case, the random variable is normally distributed with mean 0 and variance :
.
The integral must be equal to 1, and so we conclude
.
We parenthetically note that , matching my initial intuition.
Next, we compute the other conditional expectation:
.
The inner integral can be computed using integration by parts:
.
Therefore,
.
We could calculate the first integral, but we can immediately see that it’s going to be equal to 0 since the integrand is an odd function. The double integral is equal to
, which we’ve already shown is equal to
. Therefore,
.
We conclude that
,
which is indeed less than 1.
This solves the problem for the special case of two independent standard normal random variables. This of course does not yet solve the general case, but my hope was that solving this problem might give me some intuition about the general case, which I’ll develop as this series progresses.