We know from the definition of expectation that it exists
finitely if and only if the sum defining the expectation converges absolutely. It
sometimes helps to know some sufficient conditions for this that
are easier to check.
Proof:
No infinite series involved here at all. Just a finite sum!
[QED]
We shall use a concept from real analysis
called the comparison test
repeatedly below. Click on the link to learn about it.
Proof:
Let $X$ take values $x_1,x_2,...$ with
probabilities $p_1,p_2,...$ We are assumin $\exists
B\in{\mathbb R}~~\forall i~~|x_i|\leq B.$
So
$$
\sum_n |x_n|p_n\leq \sum_n B p_n = B\sum p_n.
$$
We know the $\sum p_n$ converges (to $1$).
So, by comparison test, $\sum |x_n|p_n$ must converge.
[QED]
Proof:
Use comparison test between $\sum |f(x_n)|p_n$ and $\sum |g(x_n)|p_n.$
[QED]
Proof:
Apply comparison test between $\sum \max\{f(x_n)p_n,g(x_n)p_n\}$ and $\sum (f(x_n)+g(x_n))p_n.$
[QED]
Proof:
Use the fact that $|X^m| \leq \max\{1,|X^n|\}.$ Now use the
last theorem.
[QED]
Of course, a random variable is random, and so may differ from
its expectation. By how much? A lot or a little? We can use
expectation to find that out.
Proof:
We shall not do the proof here. But here is the main idea:
$$
e^{tX} = 1 + \frac{tX}{1!} + \frac{t^2X^2}{2!} + \frac{t^3X^3}{3!} + \cdots.
$$
From this we want to write
$$
E(e^{tX}) = 1 + \frac{tE(X)}{1!} + \frac{t^2E(X^2)}{2!} + \frac{t^3E(X^3)}{3!} + \cdots.
$$
This is not a precise statement, because we do not know if all
raw moments of $X$ exist finitely. Also, even if they do, is
it valid to "distribute" expectation over an infinite sum?
Answers to these questions require deeper real analysis results
than we know at this point.
However, assuming that this is valid, we may try to differentiate
both sides to get
$$
\frac{d}{dt} E(e^{tX}) = E(X) + \frac{2tE(X^2)}{2!} + \frac{3t^2E(X^3)}{3!} + \cdots.
$$
Again this step needs justification. Can we "distribute"
differentiation over an infinite sum?
Assuming that we can, puting $t=0$ indeed gives us $E(X).$
SImilarly, differentiating once again, and putting $t=0$
gives us $E(X^2),$ and so on.
[QED]
We shall not spend much time with MGFs, because there is a better
alternative called the characteristic function (CF).
Don't be nervous to see expectation of a complex random
variable. It is simply
$$
E(\cos tX) + i E(\sin tX).
$$
CFs are better than MGFs because of two reasons, that we give as
theorems below.
Proof:
This is obvious, since $\sin tX$ and $\cos tX$ are both
bounded random variables, and hence have finite expectations.
[QED]
Proof:
Not in this course.
[QED]
Indeed, this property has earned characteristic functions their name.
MGFs do not have this proprty. It is possible to get (rather
ugly) counter-examples of random variables $X$ and $Y$
that both have the same MGF (in particluar both have the same
domain $D\subseteq{\mathbb R}$), but still $X$ and $Y$ have
different distributions. However, if the domain includes a
neighbourhood of $0,$ then $X,Y$ must have the same
distribution. This is stated in the following theorem.
Proof:
Too difficult for this course.
[QED]
We shall not spend proving any result on MGF here. We shall
learn the proofs for CFs in the next semester.
A box has 6 red balls an 4 black balls. An SRSWR of
size $n$ is selected. If $X$ is the number of red
balls selectrd, then find PMF and $E(X).$ Also solve the
problem in the case of SRSWOR.
Let $N$ be a positive integer. Let
$$
f(x) = \left\{\begin{array}{ll}c 2^x &\text{if }x=1,2,...,N\\0&\text{otherwise.}\end{array}\right.
$$
be a PMF. Find $c.$ Find $E(X)$ and $V(X)$ if $X$ has this PMF.
An SRSWR of size 2 is drawn from $\{1,2,...,12\}.$
Let $X$ be the maximum of the two numbers
selected. Find $E(X).$
An SRSWR of size $n$ is selected
from $\{1,2,...,12\}.$ Let $a_n $ be the expected
value of the maximum of the sample. Show that $a_n \leq
a_{n+1}$ without explicily finding $a_n$ in terms of $n.$