Maximal function theory
Below is an outline of the notes I wrote up on the basic theory Hardy—Littlewood maximal function and its variants for a seminar. The notes assume familiarity with measure theory.
Download the notes here: MaximalFunctionTheory.pdf (29 pages)
Let $f:[a,b] \to \mathbb{R}$ be continuous. The fundamental theorem of calculus states that
$$F(x) = \int_a^x f(y) \ dy$$
is differentiable in $(a,b)$ and $F’(x) = f(x)$ for all $x \in (a,b)$.
Could we generalize this result to a larger class of functions? Note that the above result is equivalent to
for all continuous functions $f$. In this form, the fundamental theorem of calculus is a statement about the behavior of the integral mean value
$$(2) \hspace{2em} (\mathcal{A_{\mathnormal{r}}} f)(x) = \frac{1}{2r} \int_{x-r}^{x+r} f(y) \, dy$$
as the length of the interval $(x-r,x+r)$ centered at $x$ decreases to 0. Since $(\mathcal{A_{\mathnormal{r}}}f)(x)$ is well-defined for all $f \in L^1([x-r,x+r])$, it makes sense to try and prove (1.1) for all $f \in L^1_{\text{loc}}(\mathbb{R})$, the space of measurable functions on $\mathbb{R}$ whose integral is finite on each compact subset of $\mathbb{R}$.
We also consider $d$-dimensional generalizations of (1). To this end, we must determine what we wish to take as an $n$-dimensional generalization of intervals. We abstract three properties of intervals: compactness, convexity, and symmetry.
Definition. A nonempty set $B \subseteq \mathbb{R}^d$ is convex if, for each pair of points $x_1$ and $x_2$ in $B$, the convex combination $(1-\lambda) x_1 + \lambda x_2$ is in $B$ for all $0 \leq \lambda \leq 1$.
Definition. A nonempty set $B \subseteq \mathbb{R}^d$ is centrally symmetric with respect to $p \in B$ if $B$ is invariant under the affine transform $x \mapsto 2p - x$. This is equivalent to saying that $p + h \in B$ if and only if $p - h \in B$.
We write centrally symmetric convex body to refer to a subset of $\mathbb{R}^d$ that is compact, convex, and centrally symmetric with its center $p$ at the origin.
Since we can rewrite (2) as
$$(\mathcal{A_{\mathnormal{r}}} f)(x) = \frac{1}{2r} \int_{-r}^r f(x+y) \, dy,$$
it suffices to consider centrally symmetric convex bodies with respect to the origin in asking the following question:
Question. Given a centrally symmetric convex body $B \subseteq \mathbb{R}^d$, does the integral mean value
$$(\mathcal{A_{\mathnormal{rB}}}f)(x) = \frac{1}{m(rB)} \int_{rB} f(x+y) \, dy$$
of $f \in L^1_{\text{loc}}(\mathbb{R}^d)$ converge pointwise to $f(x)$ as $r \to 0$? Here we have used $rB$ to denote the scaled set $rB = \lbrace ry : y \in B\rbrace$.
Whenever $f$ is continuous, the argument for the one-dimensional fundamental theorem of calculus can be applied with minor modification to answer Question 1.5 in the affirmative. If $f$ is merely $L^1(\mathbb{R})$, then, for each $\varepsilon > 0$, we can find a $g \in \mathscr{C_{\mathnormal{c}}}(\mathbb{R}^d)$ such that $\lVert f-g \rVert_1 < \varepsilon$. We can then rewrite $(\mathcal{A_{\mathnormal{rB}}}f)(x) - f(x)$ as
$$\mathcal{A{\mathnormal{rB}}}(f-g)(x) + (\mathcal{A_{\mathnormal{rB}}}g)(x) - g(x) + g(x) - f(x).$$
By continuity, we have $(\mathcal{A_{\mathnormal{rB}}}g)(x) \to g(x)$ as $r \to 0$, whence we have the estimate
$$\begin{align*} &\limsup_{r \to 0} \vert (\mathcal{A_{\mathnormal{rB}}}f)(x) - f(x) \vert \\ \leq& \limsup_{r \to 0} \vert \mathcal{A_{\mathnormal{rB}}}(f-g)(x) \vert \\ +& \limsup_{r \to 0} \vert(\mathcal{A_{\mathnormal{rB}}}g)(x) - g(x) \vert \\ +& \vert f(x) - g(x) \vert \\ =& \limsup_{r \to 0} \mathcal{A_{\mathnormal{rB}}}(\vert f - g \vert)(x) + \vert f(x) + g(x) \vert. \end{align*}$$
Therefore, the study of the integral mean values of $f$ at $x \in \mathbb{R}^d$ depends crucially on the quantity
$$\limsup_{r \to 0} \mathcal{A{\mathnormal{rB}}}(\vert f - g \vert)(x),$$
which is bounded from above by the maximal function
$$\sup_{r > 0} \mathcal{A{\mathnormal{rB}}}(\vert f - g \vert)(x).$$
This motivates us to introduce our main object of study:
Definition. The Hardy—Littlewood maximal function of $h \in L^1_{\textrm{loc}}(\mathbb{R}^d)$ over a centrally symmetric convex body $B \subseteq \mathbb{R}^d$ is
$$\begin{align*}(\mathcal{M_{\mathnormal{B}}}f)(x) &= \sup_{r > 0}(\mathcal{A_{\mathnormal{rB}}}\vert f \vert)(x) \\ &= \sup_{r > 0} \frac{1}{(m(rB)} \int_{rB} \vert f(x+y) \vert \, dy.\end{align*}$$
What we have discussed so far assures us that if $f \in L^1(\mathbb{R}^d)$, then, for each $\varepsilon > 0$, there exists a $g \in L^1(\mathbb{R}^d)$ that yields the estimate
$$(3) \hspace{3em} \limsup_{r \to 0} \vert (\mathcal{A}_{rB}f)(x) - f(x) \vert \leq \mathcal{M}_B(f-g)(x) + \vert f(x) - g(x)\vert.$$
Ideally, we would have liked to show that
$$\mathcal{M}_B(f-g)(x) + \vert f(x) - g(x) \vert C \varepsilon$$
for some constant $C$ independent of $\varepsilon$, thus proving that $\mathcal{A_{\mathnormal{rB}}} f \to f$ pointwise everywhere. But this is too much to hope for, as
$$(\mathcal{A_{\mathnormal{(-r,r)}}}\chi_{(0,1)}(0) = \frac{1}{2r} \int_0^r \, dy = \frac{1}{2}$$
for all $0 < r < 1$, which does not converge to $\chi_{(0,1)}(0) = 0$ as $r \to 0$.
So then, if we hope to obtain an affirmative answer to the above Question, then we must settle for an almost-everywhere statement. This equivalent to the statement that the set
$$\left\lbrace x \in \mathbb{R}^d : \limsup_{r \to 0} \vert (\mathcal{A_{\mathnormal{rB}}}f)(x) - f(x) \vert > 0 \right\rbrace$$
is of Lebesgue measure zero. Since the above set is the intersection of the sets
$$(4) \hspace{3em} E_k = \left\lbrace x \in \mathbb{R}^d : \limsup_{r \to 0} \vert (\mathcal{A_{\mathnormal{rB}}}f)(x) - f(x)\vert > \frac{1}{k}\right\rbrace,$$
it suffices to show that $m(E_k) = 0$ for all $k \in \mathbb{N}$.
Now, Estimate (3) implies that
$$(5) \hspace{3em} \begin{align*} m(E_k) &\leq m \left( \left\lbrace x : \mathcal{M_{\mathnormal{B}}}(f-g)(x) > \frac{1}{2k}\right\rbrace\right) \\ &+ m \left( \left\lbrace x : \vert f(x) - g(x) \vert > \frac{1}{2k} \right\rbrace\right) \end{align*}$$
Observe that
$$\begin{align*} m\left(\left\lbrace x : \vert f(x) - g(x) \vert > \frac{1}{2k} \right\rbrace\right) &= \int_{\lbrace x : \vert f(x) - g(x) \vert > \frac{1}{2k}\rbrace} 1 \, dy \\ &\leq \int_{\lbrace x : \vert f(x) - g(x) \vert > \frac{1}{2k}\rbrace} \frac{\vert f(y) - g(y)\vert}{1/2k} \, dy \\ &\leq \int_{\mathbb{R}^d} \frac{\vert f(y) - g(y)\vert}{1/2k} \, dy \\ &= 2k \lVert f-g\rVert_1. \end{align*}$$
It is therefore natural to hope for a bound of the form
$$(6) \hspace{3em} m \left(\lbrace x : \mathcal{M_{\mathnormal{B}}}(f-g)(x) > \frac{1}{2k}\rbrace\right) \leq 2kA \lVert f-g\rVert_1$$
for some constant $A$ independent of $(f-g)$, so that (5) can be written as
$$m(E_k) \leq 2k(A+1) \lVert f-g \rVert_1 < 2k(A+1)\varepsilon.$$
Since $\varepsilon$ was arbitrary, we can then conclude that $m(E_k) = 0$.
(6) is established by the following foundational result in maximal function theory:
Theorem (Weak-type $(1,1)$-bound on the maximal function). If $B \subseteq \mathbb{R}^d$ is a centrally symmetric convex body, then there exists a constant $A_{d,1,B}$ such that
$$m(\lbrace x \in \mathbb{R}^d : \mathcal{M_{\mathnormal{B}}}f(x) > \alpha\rbrace) \leq \frac{A_{d,1,B}}{\alpha}\lVert f\rVert_1$$
for each $\alpha > 0$ and every $f \in L^1(\mathbb{R}^d)$. The constant $A_{d,1,B}$ depends only on the dimension $d$ and the body $B$.
The weak-type bound now implies the below pointwise (almost everywhere) convergence result for the $L^1$ case. The general case of the theorem follows easily from the $L^1$ case by considering the compact cutoff function $f\chi_{B(0;k)}$ with respect to closed balls $B(0;k)$ of radius $k \in \mathbb{N}$ centered at the origin.
Theorem (Lebesgue differentiation theorem). If $f \in L^1_{\textrm{loc}}(\mathbb{R}^d)$, and if $B \subseteq \mathbb{R}^d$ is a centrally symmetric convex body, then
$$\lim_{r \to 0}(\mathcal{A_{\mathnormal{rB}}}f)(x) = f(x)$$
for almost every $x \in \mathbb{R}^d$.
The method of establishing a weak-type bound to prove a pointwise convergence result turns out to be extremely powerful. In fact, we cannot do better:
Theorem (Stein’s $L^1$ maximal principle). Let $G$ be a compact, Hausdorff, abelian topological group equipped with the Haar measure $\mu$. If $(\varphi_n)_{n=1}^\infty$ is a sequence of operators in $L^\infty(G)$ such that, for each $f \in L^1(G)$, we have the “pointwise convergence criterion”
$$\limsup_{n \to \infty} \vert (f \ast \varphi_n)(x) \vert < \infty$$
on a set $E_f$ of positive measure, then the maximal operator
$$Mf(x) = \sup_{n \in \mathbb{N}} \vert (f \ast \varphi_n)(x) \vert$$
satisfies the weak-type $(1,1)$-bound.
The weak-type bound of a general maximal operator is typically established through a judicious use of the Hardy–Littlewood maximal operator. It is thus of interest to tighten the bound
$$m(\lbrace x \in \mathbb{R}^d : \mathcal{M_{\mathnormal{B}}}f(x) > \alpha\rbrace) \leq \frac{A_{d,1,B}}{\alpha}\lVert f\rVert_1$$
by reducing the size of the constant $A_{d, 1, B}$ as much as possible.
The classical proof yields $A_{d,1,B} = O(5^d)$ when $B$ is the Euclidean ball. This is a consequence of the following lemma:
Lemma (Infinitary Vitali covering lemma). If ${B(x_\beta,r_\beta)}_{\beta}$ is a collection of Euclidean balls in $\mathbb{R}^d$ whose radii are uniformly bounded, then there exists a pairwise-disjoint countable subcollection ${B(x_n,r_n)}_n$ such that
$$\bigcup_{\beta} B(x_\beta,r_\beta) \subseteq \bigcup_n B(x_n,5r_n).$$
An 1988 result of Stein and Strömberg reduces the constant to $O(d)$ for the Euclidean ball and $O(d \log d)$ for an arbitrary centrally symmetric convex body. Naor and Tao showed in 2011 that $d \log d$ is, in fact, optimal for a broad class of metric measure spaces.
The question of finding a tight bound for the Euclidean-space case remains open. In 2003, Melas showed that $\frac{11+\sqrt{61}}{21}$ is the optimal constant for the weak-type bound of the Hardy–Littlewood maximal function on the one-dimensional Euclidean ball. This remains the only known tight bound.
Stein and Strömberg conjectured that the bound may, in fact, be $O(1)$. While the conjecture has not been settled for the Euclidean ball case, Aldaz showed in 2011 that the constant grows without bound on the $l^\infty$ ball as the dimension increase.
Much work has been done on establishing dimension-independent bound for $L^p$ when $p > 1$. A classical proof using real interpolation methods gives us the bound
$$\lVert \mathcal{M_{\mathnormal{B}}}f \rVert_p \leq A_{d, p, B} \lVert f \rVert_p$$
for all $p > 1$ with constant
$$A_{d, p, B} = 2^{\frac{p-1}{p}} A^{\frac{1}{p}}_{d, 1, B} \left( \frac{p}{p-1} \right)^{1/p}.$$
Stein showed in 1982 that, given a fixed centrally symmetric convex body $B$, the constant can be made $O(1)$ with respect to the dimension $d$. Moreover, Bourgain’s far-reaching generalization in 1986 shows that, for each $p > 3/2$,
$$\sup_{d, p, B} A_{d, p, B} < \infty$$
where the supremum is taken over all $d \geq 1$ and centrally symmetric convex bodies $B \subseteq \mathbb{R}^d$. In 2014, Bourgain brought $p$ down to $p > 1$ for the $l^\infty$ ball. The question of establishing the above uniform bound for a large class of centrally symmetric convex bodies in the range $p > 1$ remains open.