这篇 blog 的中文版本

Foreword

This blog serves as a collection of interesting results and exercises about conditional expectation and martingale. It does not aim to be a comprehensive tutorial on the topic. The content is collected from various textbooks and lecture notes, and the difficulty level should be not higher than PhD level measure theory or stochastic process courses. Currently, this blog is still under construction. If you have any suggestions or want to contribute, please feel free to contact me!

Conditional Expectation

Regular Conditional Probability

Based on the definition of conditional expectation, we can naturally define conditional probability. For an arbitrary measure space $\left(\Omega ,\mathscr{F},\mu \right)$, a sub-$\sigma$-algebra $\mathscr{G}$ of $\mathscr{F}$, and a set $A$ in $\mathscr{F}$, the conditional probability of $A$ given $\mathscr{G}$ is defined as:

\[   \mathbb{P}\left(A\left\lvert\right. \mathscr{G}\right):=\mathbb{E}\left[\mathbf{1}_{A}\left\lvert\right. \mathscr{G}\right]. \]

According to the monotonicity and linearity of conditional expectation, we have:
(1). For any $A,B \in \mathscr{F}$ with $A \subset B$:

\[   0=\mathbb{P}\left(\varnothing \left\lvert\right. \mathscr{G}\right)\le \mathbb{P}\left(A \left\lvert\right. \mathscr{G}\right)\le \mathbb{P}\left(B \left\lvert\right. \mathscr{G}\right)\le \mathbb{P}\left(\Omega \left\lvert\right. \mathscr{G}\right)=1, a.s. \]
(2). For any countable measurable partition $\left\{A _{n}\right\} _{n=1}^{\infty}$ of $\left(\Omega ,\mathscr{F}\right)$, we have: \[   \mathbb{P}\left(\bigcup\limits _{n=1}^{\infty }A _{n}\left\lvert\right. \mathscr{G}\right)=\sum\limits _{n=1}^{\infty }\mathbb{P}\left(A _{n}\left\lvert\right. \mathscr{G}\right), a.s. \]

We observe that although conditional probability is a random variable (since conditional expectation is a random variable), apart from the restriction of “a.s.” equivalence classes, its properties seem very similar to those of a general probability measure. This leads us to wonder: for any sub-$\sigma$-algebra $\mathscr{G}$ of a general measure space $\left(\Omega ,\mathscr{F}\right)$, can we always define a bivariate function $P _{\mathscr{G}}(\cdot ,\cdot ):\left(\Omega \times \mathscr{F}\right)\rightarrow \mathbb{R}$ such that:
(1). For any set $A$ in $\mathscr{F}$, $P _{\mathscr{G}}(\cdot ,A)$ is the conditional probability of $A$ given $\mathscr{G}$.
(2). For any $\omega$ in $\Omega$, $P _{\mathscr{G}}(\omega ,\cdot )$ is a probability measure on $\left(\Omega ,\mathscr{F}\right)$.

If such a bivariate function $P _{\mathscr{G}}(\cdot ,\cdot )$ can be defined, then we could treat conditional probability as a random measure. Furthermore, if we want to compute the conditional expectation $\mathbb{E}\left[g(f)\left\lvert\right. \mathscr{G}\right]$ of $g(f)$, we could directly integrate over the regular conditional probability of $f$ given $\mathscr{G}$, just like computing a standard expectation.

Naturally, we might think: would it be sufficient to directly define $P _{\mathscr{G}}(\omega ,A):=\mathbb{P}\left(A\left\lvert\right. \mathscr{G}\right)(\omega )$? For condition (1), this definition naturally satisfies the requirement. For condition (2), if we fix a countable measurable partition $\left\{A _{n}\right\} _{n=1}^{\infty}$, according to the properties of conditional probability mentioned above, except for at most a null set $N$, we have:

\[ \mathbb{P}\left(\bigcup\limits _{n=1}^{\infty }A _{n}\left\lvert\right. \mathscr{G}\right)(\omega )=\sum\limits _{n=1}^{\infty }\mathbb{P}\left(A _{n}\left\lvert\right. \mathscr{G}\right)(\omega ). \]

If $\omega $ happens to fall within $N$, we can simply choose an arbitrary probability measure $\mathbb{P} _{0}$. Wouldn’t this exactly conform to the properties of a measure? However, the problem arises with this countable measurable partition $\left\{A _{n}\right\} _{n=1}^{\infty}$. For a general measure, we require countable additivity to hold for any countable measurable partition, but the null set $N=N _{\left\{A _{n}\right\} _{n=1}^{\infty}}$ chosen above depends on the specific countable measurable partition selected. If we take the union of all such null sets $N _{\left\{A _{n}\right\} _{n=1}^{\infty}}$:

\[   \tilde{N}:=\bigcup\limits _{\left\{A _{n}\right\} _{n=1}^{\infty }}^{}N _{\left\{A _{n}\right\} _{n=1}^{\infty }}, \]

then $\tilde{N}$ could be the union of uncountably many null sets, and thus might not be a null set itself! Therefore, the $P _{\mathscr{G}}(\cdot ,\cdot )$ defined in this way may not satisfy the properties of a measure. The existence of a regular conditional probability depends on the properties of the space where the image of $f$ lies. In fact, there exist measurable mappings $f$ for which the regular conditional probability given some sub-$\sigma$-algebra $\mathscr{G}$ does not exist. Fortunately, because the properties of the $\sigma$-algebra $(\mathbb{R},\mathcal{B} _{\mathbb{R}})$ are nice enough (it is separable, meaning it has a countable dense subset), the existence of a regular conditional probability is guaranteed when the image of a random variable is the set of real numbers.

Let $f$ be a real-valued random variable on the probability space $(\Omega ,\mathscr{F},\mu )$, and let $\mathscr{G}$ be a sub-$\sigma$-algebra. A bivariate function $F _{f|\mathscr{G}}(\cdot ,\cdot ):(\Omega ,\mathbb{R})\rightarrow \mathbb{R}$ is called the regular conditional distribution function of $f$ given $\mathscr{G}$, if it satisfies:
(1). For each $\omega \in \Omega$, $F _{\mathscr{G}}(\omega ,\cdot )$ is a distribution function.
(2). For each $A \in \mathcal{B} _{\mathbb{R}}$, $F _{\mathscr{G}}(\cdot ,A)$ is the conditional probability of the event $\left\{f \in A\right\}$ given $\mathscr{G}$.

Since $\sigma (f)=f ^{-1}(\mathcal{B} _{\mathbb{R}})\subset \mathscr{F}$, the definition of the regular conditional distribution function actually weakens the definition of regular conditional probability. The following theorem guarantees the existence of a regular conditional distribution function for a real-valued random variable $f$:

  (Existence of the regular conditional distribution function) Let $f$ be a real-valued random variable on the probability space $(\Omega ,\mathscr{F},\mu )$, and let $\mathscr{G}$ be a sub-$\sigma$-algebra. Then the regular conditional distribution function $F _{\mathscr{G}}(\cdot ,\cdot )$ of $f$ given $\mathscr{G}$ exists.
 

Let $\mathbb{Q}$ be the set of rational numbers. For any rational number $r \in \mathbb{Q}$, let \begin{equation*} G(\cdot ,r):=\mathbb{P}(\left\{f \le r\right\}\left\lvert\right. \mathscr{G})(\cdot ) a.s. \end{equation*} Furthermore, let \begin{equation*} \begin{aligned} & N _{1,r}:=\bigcup\limits _{r _{1},r _{2}\in \mathbb{Q}, r _{1}\le r _{2}}^{}\left\{\omega \in \Omega :G(\omega ,r _{1})> G(\omega ,r _{2})\right\},\\ & N _{2,r}:=\bigcup\limits _{r \in \mathbb{Q}}^{}\left\{\omega \in \Omega :\lim\limits _{n\rightarrow \infty } G(\omega ,r+\frac{1}{n})\neq G(\omega ,r)\right\},\\ & N _{3,r}:=\bigcup\limits _{r \in \mathbb{Q}}^{}\left\{\omega \in \Omega :\lim\limits _{n\rightarrow \infty }G(\omega ,-n)\neq 0\right\}\cup \left\{\omega \in \Omega :\lim\limits _{n\rightarrow \infty }G(\omega ,n)\neq 1\right\},\\ & \tilde{N} _{r}:=\bigcup\limits _{i=1}^{3}N _{i,r},\\ & \tilde{N}:=\bigcup\limits _{r \in \mathbb{Q}}^{}\tilde{N} _{r}. \end{aligned} \end{equation*} Since $\mathbb{Q}$ is countable, we have $\mathbb{P}(\tilde{N})=0$. Choose an arbitrary distribution function $H(\cdot )$. For $a \in \mathbb{R}$, let \begin{equation*} F _{\mathscr{G}}(\omega ,\left\{f \le a\right\})=\left\{ \begin{aligned} &\inf\limits _{}\left\{G(\omega ,r):r \in \mathbb{Q}, r > a\right\}, \omega \in \Omega \setminus \tilde{N},\\ &H(a), \omega \in \tilde{N}. \end{aligned} \right. \end{equation*} We now prove that $F _{\mathscr{G}}(\cdot ,\cdot )$ is a regular conditional distribution function of $f$ given $\mathscr{G}$.

It is easy to see that for any $\omega \in \Omega $, $F _{\mathscr{G}}(\omega ,\cdot )$ is a distribution function. Next, we verify that for any $A \in \mathcal{B} _{\mathbb{R}}$, $F _{\mathscr{G}}(\cdot ,A)$ is the conditional probability of the event $\left\{f \in A\right\}$ given $\mathscr{G}$.

First, for any $A=\left(-\infty ,a\right]$, by definition we have that $F _{\mathscr{G}}(\cdot ,A)$ is $\mathscr{G}$-measurable. Let $m _{\mathscr{G}}:=\left\{A:A \in \mathcal{B}_{\mathbb{R}},F _{\mathscr{G}}(\cdot ,A) \text{ is } \mathscr{G}\text{-measurable}\right\}$, then it can be verified that $m _{\mathscr{G}}$ is a $\lambda$-system containing the $\pi$-system $\left\{(-\infty ,a]:a \in \mathbb{R}\right\}$. By the $\pi$-$\lambda$ theorem, we know $\mathcal{B} _{\mathbb{R}} \subset \mathscr{G}$. That is, for any $A \in \mathcal{B}_{\mathbb{R}}$, $F _{\mathscr{G}}(A,\cdot )$ is $\mathscr{G}$-measurable.

Secondly, for any fixed $G \in \mathscr{G}$, denote \begin{equation*} c _{G}:=\left\{A:A \in \mathcal{B}_{\mathbb{R}},\displaystyle\int _{G}^{}F(A,\omega )d \mathbb{P}(\omega )=\mathbb{E}\left[\mathbf{1}_{\left\{f \in A\right\}}\cdot \mathbf{1}_{\left\{G\right\}}\right]\right\}. \end{equation*} From the definition of $F _{\mathscr{G}}(\cdot ,\cdot )$ and the monotone convergence theorem for conditional expectations, we know that $c _{G}$ is a $\lambda$-system containing the $\pi$-system $\left\{(-\infty ,a]:a \in \mathbb{R}\right\}$. By the $\pi$-$\lambda$ theorem, we know $\mathcal{B} _{\mathbb{R}} \subset c _{G}$. Due to the arbitrariness of $G$, we conclude that $F _{\mathscr{G}}(\cdot , A)$ is the conditional probability of $\left\{f \in A\right\}$ given $\mathscr{G}$. This completes the proof.

Note: The proof of the above theorem relies heavily on the properties of real numbers—the countable set of rational numbers is its dense subset. Otherwise, we would not be able to well-define $F _{\mathscr{G}}(\omega ,\left\{f \le a\right\})$ from $G(\omega, r)$.

Martingale

Suppose $(X _{n},\mathcal{F}_{n})$ and $(Y _{n},\mathcal{F}_{n})$ are square-integrable martingales.

(a). Show that the corresponding mrtingale differences $D _{n}$ are uncorrelated and zero-mean.

(b). Show that for any $l \ge n \ge 0$, we have \begin{equation*} \begin{aligned} \mathbb{E}\left[X _{l}Y _{l}\left\lvert\right. \mathcal{F}_{n}\right]-X _{n}Y _{n}&=\mathbb{E}\left[(X _{l}-X _{n})(Y _{l}-Y _{n})\left\lvert\right. \mathcal{F}_{n}\right]\\ &=\sum\limits _{k=n+1}^{l}\mathbb{E}\left[(X _{k}-X _{k-1})(Y _{k}-Y _{k-1})\left\lvert\right. \mathcal{F}_{n}\right]. \end{aligned} \end{equation*}

(c). Using the results in (a) and (b), show that if there exists a constant $C$ such that $\sup\limits _{k}\left|X _{k}\right|\le C$, then for any $l \ge 1$, we have: \begin{equation*} \mathbb{E}\left[\left(\sum\limits _{i=1}^{l}D _{k}^{2}\right)^{2}\right]\le 6C ^{4}. \end{equation*}

Note: The result in (c) is non-trivial. The right-hand side of the inequality to be proved is independent of $l$.
Let $\left(X _{n},\mathscr{F}_{n}\right)$ be a martingale with $\sup\limits _{n}\mathbb{E}\left[\left|X _{n}\right|\right]< \infty $. Show that there exists a representation $X _{n}=Y _{n}-Z _{n}$ with $\left(Y _{n},\mathscr{F}_{n}\right)$ and $\left(Z _{n},\mathscr{F}_{n}\right)$ being non-negative martingales such that $\sup\limits _{n}\mathbb{E}\left[\left|Y _{n}\right|\right]<\infty $ and $\sup\limits _{n}\mathbb{E}\left[\left|Z _{n}\right|\right]<\infty $.

Note: this decomposition is analogous to that any measurable function can be decomposed into its positive and negative parts.

Suppose $a _{n}\in (0,1]$ are non-random and $Z _{n}$ are independent random variables such that $\mathbb{E}Z _{n}=0, \mathbb{E}\left[\left|Z _{n}\right|\right]=1$ for all $n \ge 1$, while $\mathbb{P}\left(A _{n}\right)\rightarrow 0$ for $A _{n}:=\left\{\omega :Z _{n}(\omega )\neq 0\right\}$. Let \begin{equation*} Y _{n}=\left|Z _{n}\right|Y _{n-1}+a _{n}Z _{n}\mathbf{1}_{\left\{Y _{n-1}=0\right\}}, \forall n \ge 1, Y _{0}=0. \end{equation*}

(a). Show that $\left\{Y _{n}\right\}$ is a martingale and that for any $x>0$ and $n \ge 1$, \begin{equation*} \mathbb{P}\left(\max\limits _{k=1}^{n} Y _{k}\ge x\right)\le \frac{1}{2x}\left[a _{1}+\sum\limits _{k=2}^{n}a _{k}\left(1-\mathbb{P}(A _{k-1})\right)\right]. \end{equation*}

Note: Note that the denominator on the right-hand side of the inequality is $2x$ instead of $x$.

(b). Show that $Y _{n}\overset{p}{\rightarrow }0$ as $n\to\infty$, but $\left\{Y _{n}\right\}$ does not converge in $L _{1}$ to $0$.

(c). Show that $Y _{n}\overset{a.s.}{\rightarrow }0$ if and only if $\sum\limits _{k=1}^{\infty }\mathbb{P}(A _{k})<\infty $.

(d). Show that $\left\{Y _{n}\right\}$ is $L _{1}$-bounded if and only if $\sum\limits _{k=1}^{\infty }a _{k}<\infty $.

(e). Suppose that in addition $\left|Z _{k}\right|\ge \frac{1}{a _{k}}$ on $A _{k}$ for all $k \ge 1$. Using the Vitali convergence theorem, what can you deduce?

Categories:

Updated: