Inequalities

Jensen’s Inequality. Let XX be a convex set in a real vector space, φ:XR\varphi : X \to \R a convex function, {xi}i=1nX\{x_i\}_{i=1}^n \subseteq X and {λi}i=1n[0,)\{\lambda_i\}_{i=1}^n \subseteq [0, \infty) such that i=1nλi=1\sum_{i=1}^n \lambda_i = 1. Then

φ(i=1nλixi)i=1nλiφ(xi)\varphi\left(\sum_{i=1}^n \lambda_i x_i\right) \le \sum_{i=1}^n \lambda_i \varphi(x_i)

If φ\varphi is otherwise concave, then

φ(i=1nλixi)i=1nλiφ(xi)\varphi\left(\sum_{i=1}^n \lambda_i x_i\right) \ge \sum_{i=1}^n \lambda_i \varphi(x_i)

Proof. It suffices to only prove the result for convex functions, as if it is concave, then φ-\varphi is convex and

φ(i=1nλixi)i=1nλiφ(xi)-\varphi\left(\sum_{i=1}^n \lambda_i x_i\right) \le -\sum_{i=1}^n \lambda_i \varphi(x_i)

so now just multiply both sides by -1. The proof follows by induction. n=1n=1 is trivial since φ(λ1x1)=φ(x1)=λ1φ(x1)\varphi(\lambda_1 x_1) = \varphi(x_1) = \lambda_1 \varphi(x_1), and n=2n=2 follows directly from the definition of convexity.

φ(λ1x1+λ2x2)λ1φ(x1)+λ2φ(x2)\varphi(\lambda_1 x_1 + \lambda_2 x_2) \le \lambda_1 \varphi(x_1) + \lambda_2 \varphi(x_2)

If true for nn, let {xi}i=1n+1\{x_i\}_{i=1}^{n+1} and {λi}i=1n+1\{\lambda_i\}_{i=1}^{n+1}. If needed, reorder these terms so that λn+1<1\lambda_{n+1} < 1. Then

y=i=1nλixi1λn+1    i=1n+1λixi=(1λn+1)y+λn+1xn+1y = \sum_{i=1}^{n} \frac{\lambda_i x_i}{1 - \lambda_{n+1}} \implies \sum_{i=1}^{n+1} \lambda_i x_i = (1 - \lambda_{n+1})y + \lambda_{n+1}x_{n+1}

By convexity, yXy \in X and by the inductive step,

φ(i=1n+1λixi)(1λn+1)φ(y)+λn+1φ(xn+1)i=1nλiφ(xi)+λn+1φ(xn+1)=i=1n+1λixi\varphi\left(\sum_{i=1}^{n+1} \lambda_i x_i\right) \le (1 - \lambda_{n+1}) \varphi(y) + \lambda_{n+1} \varphi(x_{n+1}) \le \sum_{i=1}^n \lambda_i\varphi(x_i) + \lambda_{n+1} \varphi(x_{n+1}) = \sum_{i=1}^{n+1} \lambda_i x_i \quad \square

AM-GM inequality. For any positive real numbers x1,x2,,xnx_1,x_2,\dots,x_n,

x1+x2++xnnx1x2xnn\frac{x_1 + x_2 + \cdots + x_n}{n} \ge \sqrt[n]{x_1 x_2 \cdots x_n}

Proof. Note that log(x)\log(x) is increasing and d2dx2log(x)=1x2<0\frac{d^2}{dx^2} \log(x) = \frac{-1}{x^2} < 0 implies it is strictly concave. Therefore, if we apply the Jensen’s inequality to λi=1/n\lambda_i = 1/n, and then apply exe^x on both sides (which doesn’t change the direction of the inequality)

log(i=1nxin)i=1nlog(xi)n    xin=(i=1nxi)1/n\log\left(\sum_{i=1}^n \frac{x_i}{n}\right) \ge \sum_{i=1}^n \frac{\log(x_i)}{n} \implies \frac{x_i}{n} = \left(\prod_{i=1}^n x_i\right)^{1/n} \quad \square

Young’s inequality. If a,b0a,b \ge 0 and p,q>1p,q > 1 such that p1+q1=1p^{-1} + q^{-1} = 1, then

abapp+bqqab \le \frac{a^p}{p} + \frac{b^q}{q}

Proof. Using the log\log trick as above, Jensen’s inequality states

log(app+bqq)log(ap)p+log(bq)q=log(ab)\log\left(\frac{a^p}{p} + \frac{b^q}{q}\right) \ge \frac{\log(a^p)}{p} + \frac{\log(b^q)}{q} = \log(ab) \quad \square

Hölder’s inequality. Let (X,Σ,μ)(X, \Sigma, \mu) be any measure space and p,q>1p,q > 1 with p1+q1=1p^{-1} + q^{-1} = 1. For any measurable functions f,g:XCf,g : X \to \C,

f(x)g(x)dx(f(x)pdx)1/p(g(x)qdx)1/q\int \abs{f(x)g(x)} \: dx \le \left(\int \abs{f(x)}^p \: dx\right)^{1/p}\left(\int \abs{g(x)}^q \: dx\right)^{1/q}

Proof. Denote fp=(f(x)pdx)1/p\norm{f}_p = (\int \abs{f(x)}^p \: dx)^{1/p}. We wish to prove fg1fpgq\norm{fg}_1 \le \norm{f}_p \norm{g}_q. By Young’s inequality,

f(x)fpg(x)gqf(x)ppfpp+g(x)qqgqq\frac{\abs{f(x)}}{\norm{f}_p} \frac{\abs{g(x)}}{\norm{g}_q} \le \frac{\abs{f(x)}^p}{p\norm{f}_p^p} + \frac{\abs{g(x)}^q}{q\norm{g}_q^q}

Since these are both positive for all xx, we can integrate both sides and get

fg1fpgqfpppfpp+gqqqgqq=1p+1q=1    fg1fpgq\frac{\norm{fg}_1}{\norm{f}_p\norm{g}_q} \le \frac{\norm{f}_p^p}{p\norm{f}_p^p} + \frac{\norm{g}_q^q}{q\norm{g}_q^q} = \frac{1}{p} + \frac{1}{q} = 1 \implies \norm{fg}_1 \le \norm{f}_p\norm{g}_q \quad \square

Minkowski inequality. Let (X,Σ,μ)(X, \Sigma, \mu) be any measure space and p1p \ge 1. For any measurable functions f,g:XCf,g : X \to \C,

(f(x)+g(x)pdx)1/p(f(x)pdx)1/p+(g(x)pdx)1/p\left(\int \abs{f(x) + g(x)}^p \: dx\right)^{1/p} \le \left(\int \abs{f(x)}^p \: dx\right)^{1/p} + \left(\int \abs{g(x)}^p \: dx\right)^{1/p}

Proof. For p=1p=1, this is just the triangle inequality for C\C applies at every xx and then integrating both sides. Once again, if we use the fp\norm{f}_p notation as above, we wish to prove f+gpfp+gp\norm{f + g}_p \le \norm{f}_p + \norm{g}_p. Assuming p>1p > 1, compute

f+gpp=f(x)+g(x)f(x)+g(x)p1dxf(x)f(x)+g(x)p1dx+g(x)f(x)+g(x)p1dx\norm{f + g}_p^p = \int \abs{f(x) + g(x)}\abs{f(x) + g(x)}^{p-1} \: dx \le \int \abs{f(x)}\abs{f(x) + g(x)}^{p-1} \: dx + \int \abs{g(x)}\abs{f(x) + g(x)}^{p-1} \: dx

We can apply Hölder’s inequality to the functions f(x)f(x) and (f(x)g(x))p1(f(x)g(x))^{p-1}, with q=p/(p1)q = p/(p-1), to get

f(x)f(x)+g(x)p1dx(f(x)pdx)1/p(f(x)+g(x)p)11p=fpf+gppf+gp\int \abs{f(x)}\abs{f(x) + g(x)}^{p-1} \: dx \le \left(\int \abs{f(x)}^p \: dx\right)^{1/p}\left(\int \abs{f(x) + g(x)}^p\right)^{1 - \frac{1}{p}} = \norm{f}_p \frac{\norm{f + g}_p^p}{\norm{f + g}_p}

By doing the same thing for the second term with g(x)g(x), we get

f+gppfpf+gppf+gp+gpf+gppf+gp    f+gpfp+gp\norm{f + g}_p^p \le \norm{f}_p \frac{\norm{f + g}_p^p}{\norm{f + g}_p} + \norm{g}_p \frac{\norm{f + g}_p^p}{\norm{f + g}_p} \implies \norm{f + g}_p \le \norm{f}_p + \norm{g}_p \quad \square

Cauchy-Schwarz inequality for LpL^p spaces. For any f,gLpf,g \in L^p,

f,g2f,fg,g\norm{\inner{f, g}}^2 \le \inner{f, f} \inner{g, g}

Proof. The inner product in LpL^p space is defined as

f,g(f(x)g(x)dx)1/2\inner{f, g} \coloneqq \left(\int f(x)\cl{g(x)} \: dx\right)^{1/2}

The Hölder inequality immediatelly tells us that for p=q=2p = q = 2,

f(x)g(x)dxf(x)g(x)dx(f(x)2dx)1/2(g(x)2dx)1/2=f,fg,g\int f(x)\cl{g(x)} \: dx \le \int \abs{f(x)g(x)} \: dx \le \left(\int \abs{f(x)}^2 \: dx\right)^{1/2}\left(\int \abs{g(x)}^2 \: dx\right)^{1/2} = \inner{f, f} \inner{g, g} \quad \square