Brascamp–Lieb inequality

In mathematics, the Brascamp–Lieb inequality is either of two inequalities. The first is a result in geometry concerning integrable functions on n-dimensional Euclidean space {\displaystyle \mathbb {R} ^{n}}. It generalizes the Loomis–Whitney inequality and Hölder's inequality. The second is a result of probability theory which gives a concentration inequality for log-concave probability distributions. Both are named after Herm Jan Brascamp and Elliott H. Lieb.

The geometric inequality

Fix natural numbers m and n. For 1 ≤ i ≤ m, let ni ∈ N and let ci > 0 so that

{\displaystyle \sum _{i=1}^{m}c_{i}n_{i}=n.}

Choose non-negative, integrable functions

{\displaystyle f_{i}\in L^{1}\left(\mathbb {R} ^{n_{i}};[0,+\infty ]\right)}

and surjective linear maps

{\displaystyle B_{i}:\mathbb {R} ^{n}\to \mathbb {R} ^{n_{i}}.}

Then the following inequality holds:

{\displaystyle \int _{\mathbb {R} ^{n}}\prod _{i=1}^{m}f_{i}\left(B_{i}x\right)^{c_{i}}\,\mathrm {d} x\leq D^{-1/2}\prod _{i=1}^{m}\left(\int _{\mathbb {R} ^{n_{i}}}f_{i}(y)\,\mathrm {d} y\right)^{c_{i}},}

where D is given by

{\displaystyle D=\inf \left\{\left.{\frac {\det \left(\sum _{i=1}^{m}c_{i}B_{i}^{*}A_{i}B_{i}\right)}{\prod _{i=1}^{m}(\det A_{i})^{c_{i}}}}\right|A_{i}{\text{ is a positive-definite }}n_{i}\times n_{i}{\text{ matrix}}\right\}.}

Another way to state this is that the constant D is what one would obtain by restricting attention to the case in which each {\displaystyle f_{i}} is a centered Gaussian function, namely {\displaystyle f_{i}(y)=\exp\{-(y,\,A_{i}\,y)\}}.[1]
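
As a concrete illustration, the infimum defining D can be approximated numerically by minimizing the determinant ratio over the Gaussian parameters A_i. The following Python sketch (assuming NumPy and SciPy are available) does this for an illustrative rank-one datum not taken from the text, namely B_i x = u_i · x for three unit vectors at 120 degrees with weights c_i = 2/3; the minimum comes out close to 1, matching the geometric case discussed further below.

    import numpy as np
    from scipy.optimize import minimize

    # Illustrative datum (not from the text): n = 2, m = 3, B_i x = u_i . x with three unit
    # vectors at 120 degrees and weights c_i = 2/3, so that sum_i c_i n_i = 2 = n.
    angles = np.array([0.0, 2 * np.pi / 3, 4 * np.pi / 3])
    U = np.stack([np.cos(angles), np.sin(angles)], axis=1)   # rows are the u_i
    c = np.full(3, 2.0 / 3.0)

    def ratio(log_a):
        # det(sum_i c_i B_i^* A_i B_i) / prod_i (det A_i)^{c_i} with 1x1 matrices A_i = a_i > 0
        a = np.exp(log_a)                                    # parametrize a_i > 0
        M = sum(c[i] * a[i] * np.outer(U[i], U[i]) for i in range(3))
        return np.linalg.det(M) / np.prod(a ** c)

    result = minimize(ratio, x0=np.array([1.0, -0.5, 0.3]))  # numerical infimum over the a_i
    print(result.fun)                                        # close to 1, the value of D here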

Alternative forms

Consider a probability density function {\displaystyle p(x)=\exp(-\phi (x))}. This probability density function is said to be a log-concave measure if the function {\displaystyle \phi (x)} is convex. Such probability density functions have tails which decay exponentially fast, so most of the probability mass resides in a small region around the mode of {\displaystyle p(x)}. The Brascamp–Lieb inequality gives another characterization of the concentration of {\displaystyle p(x)} by bounding the variance of any statistic {\displaystyle S(x)}.

Formally, let {\displaystyle S(x)} be any differentiable function. The Brascamp–Lieb inequality reads:

{\displaystyle \operatorname {var} _{p}(S(x))\leq E_{p}(\nabla ^{T}S(x)[H\phi (x)]^{-1}\nabla S(x))}

where {\displaystyle H\phi (x)} is the Hessian of {\displaystyle \phi (x)} and {\displaystyle \nabla } is the gradient (nabla) operator.[2]
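
As a quick sanity check of the variance bound, one can compare the two sides numerically for a simple one-dimensional log-concave density. The sketch below (a minimal example of my own, assuming NumPy) uses phi(x) = x^4/4 + x^2/2 and the statistic S(x) = x^2, evaluating both sides by integration on a grid.

    import numpy as np

    # Illustrative 1-D example: phi(x) = x^4/4 + x^2/2 is convex, so p(x) ~ exp(-phi(x))
    # is log-concave; take the statistic S(x) = x^2.
    x = np.linspace(-6.0, 6.0, 200001)
    dx = x[1] - x[0]
    integrate = lambda f: np.sum(f) * dx    # simple Riemann sum on the grid

    phi = x**4 / 4 + x**2 / 2
    p = np.exp(-phi)
    p /= integrate(p)                       # normalize the density

    S = x**2
    dS = 2 * x                              # S'(x)
    Hphi = 3 * x**2 + 1                     # phi''(x) > 0, the (1x1) Hessian

    var_S = integrate((S - integrate(S * p))**2 * p)    # var_p(S(x))
    bound = integrate(dS**2 / Hphi * p)                 # E_p[ S'(x)^2 / phi''(x) ]
    print(var_S, bound)                     # numerically, var_S <= bound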

BCCT inequality

The inequality was generalized in 2008[3] to account for both continuous and discrete cases, and for all linear maps, with precise estimates on the constant.

Definition: the Brascamp–Lieb datum (BL datum)

  • {\displaystyle d,n\geq 1}.
  • {\displaystyle d_{1},...,d_{n}\in \{1,2,...,d\}}.
  • {\displaystyle p_{1},...,p_{n}\in [0,\infty )}.
  • {\displaystyle B_{i}:\mathbb {R} ^{d}\to \mathbb {R} ^{d_{i}}} are linear surjections, with zero common kernel: {\displaystyle \cap _{i}\ker(B_{i})=\{0\}}.
  • Call {\displaystyle (B,p)=(B_{1},...,B_{n},p_{1},...,p_{n})} a Brascamp–Lieb datum (BL datum).

For any {\displaystyle f_{i}\in L^{1}(\mathbb {R} ^{d_{i}})} with {\displaystyle f_{i}\geq 0}, define

{\displaystyle BL(B,p,f):={\frac {\int _{\mathbb {R} ^{d}}\prod _{j=1}^{n}\left(f_{j}\circ B_{j}\right)^{p_{j}}}{\prod _{j=1}^{n}\left(\int _{\mathbb {R} ^{d_{j}}}f_{j}\right)^{p_{j}}}}}


Now define the Brascamp–Lieb constant for the BL datum:

{\displaystyle BL(B,p)=\sup _{f}BL(B,p,f)}

Theorem — (BCCT, 2008)

{\displaystyle BL(B,p)} is finite if and only if {\displaystyle d=\sum _{i}p_{i}d_{i}} and, for every subspace {\displaystyle V} of {\displaystyle \mathbb {R} ^{d}},

{\displaystyle \dim(V)\leq \sum _{i}p_{i}\dim(B_{i}(V))}

{\displaystyle BL(B,p)} is attained by Gaussians:

  • If {\displaystyle BL(B,p)} is finite, then there exist linear operators {\displaystyle A_{i}:\mathbb {R} ^{d_{i}}\to \mathbb {R} ^{d_{i}}} such that {\displaystyle f_{i}=e^{-\langle A_{i}x,x\rangle }} achieves the upper bound.
  • If {\displaystyle BL(B,p)} is infinite, then there exists a sequence of Gaussians for which

{\displaystyle {\frac {\int _{\mathbb {R} ^{d}}\prod _{j=1}^{n}\left(f_{j}\circ B_{j}\right)^{p_{j}}}{\prod _{j=1}^{n}\left(\int _{\mathbb {R} ^{d_{j}}}f_{j}\right)^{p_{j}}}}\to \infty }
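
Because Gaussian inputs suffice to compute the constant, the ratio BL(B, p, f) can be evaluated in closed form for centered Gaussians f_j using the standard Gaussian integral. The following Python sketch (assuming NumPy; the Loomis–Whitney-type datum and the helper name gaussian_bl_ratio are my own illustrative choices) evaluates the ratio for the coordinate projections of R^2 with p_1 = p_2 = 1, where it is identically 1.

    import numpy as np

    # Closed-form evaluation of BL(B, p, f) when each f_j(y) = exp(-<A_j y, y>) is a centered
    # Gaussian, using int_{R^k} exp(-<M x, x>) dx = pi^{k/2} / sqrt(det M) for M positive definite.
    def gaussian_bl_ratio(B_list, p_list, A_list):
        d = B_list[0].shape[1]
        M = sum(p * B.T @ A @ B for B, p, A in zip(B_list, p_list, A_list))
        numerator = np.pi ** (d / 2) / np.sqrt(np.linalg.det(M))
        denominator = 1.0
        for B, p, A in zip(B_list, p_list, A_list):
            dj = B.shape[0]
            denominator *= (np.pi ** (dj / 2) / np.sqrt(np.linalg.det(A))) ** p
        return numerator / denominator

    # Loomis-Whitney-type datum in R^2: coordinate projections with p_1 = p_2 = 1.
    B1, B2 = np.array([[1.0, 0.0]]), np.array([[0.0, 1.0]])
    rng = np.random.default_rng(0)
    for _ in range(3):
        A1 = np.array([[rng.uniform(0.1, 5.0)]])
        A2 = np.array([[rng.uniform(0.1, 5.0)]])
        # equals 1 for every Gaussian choice; by the theorem above, BL(B, p) = 1 for this datum
        print(gaussian_bl_ratio([B1, B2], [1.0, 1.0], [A1, A2]))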

Discrete case

Setup:

  • Group homomorphisms {\displaystyle \phi _{j}:G\to G_{j}}.
  • The BL datum is defined as {\displaystyle (G,G_{1},...,G_{n},\phi _{1},...,\phi _{n})}.
  • {\displaystyle T(G)} is the torsion subgroup of {\displaystyle G}, that is, the subgroup of finite-order elements.

With this setup, we have (Theorem 2.4,[4] Theorem 3.12 [5])

Theorem — If there exist {\displaystyle s_{1},...,s_{n}\in [0,1]} such that

{\displaystyle \operatorname {rank} (H)\leq \sum _{j}s_{j}\operatorname {rank} (\phi _{j}(H))\quad \forall H\leq G}

then for all {\displaystyle 0\leq f_{j}\in \ell ^{1/s_{j}}(G_{j})},

{\displaystyle \left\|\prod _{j}f_{j}\circ \phi _{j}\right\|_{1}\leq |T(G)|\prod _{j}\|f_{j}\|_{1/s_{j}}}
and in particular,

{\displaystyle |E|\leq |T(G)|\prod _{j}|\phi _{j}(E)|^{s_{j}}\quad \forall E\subset G}

Note that the constant {\displaystyle |T(G)|} is not always tight.
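
When G is torsion-free the constant is 1, and the inequality for sets reduces to a discrete Loomis–Whitney bound. A minimal check of my own in Python: take G = Z^2, the coordinate projections phi_1, phi_2, and s_1 = s_2 = 1 (which satisfy the rank condition), and compare |E| with |phi_1(E)| · |phi_2(E)| for a random finite set E.

    import random

    # G = Z^2 is torsion-free, so |T(G)| = 1; phi_1, phi_2 are the coordinate projections and
    # s_1 = s_2 = 1 satisfy rank(H) <= rank(phi_1(H)) + rank(phi_2(H)) for every subgroup H.
    random.seed(0)
    E = {(random.randint(-5, 5), random.randint(-5, 5)) for _ in range(60)}
    proj1 = {x for (x, y) in E}
    proj2 = {y for (x, y) in E}
    print(len(E), "<=", len(proj1) * len(proj2))   # |E| <= |phi_1(E)| * |phi_2(E)|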

BL polytope

Given a BL datum {\displaystyle (B,p)}, the conditions for {\displaystyle BL(B,p)<\infty } are

  • {\displaystyle d=\sum _{i}p_{i}d_{i}}, and
  • for every subspace {\displaystyle V} of {\displaystyle \mathbb {R} ^{d}},
    {\displaystyle \dim(V)\leq \sum _{i}p_{i}\dim(B_{i}(V))}

Thus, the subset of {\displaystyle p\in [0,\infty )^{n}} that satisfies the above two conditions is a closed convex polytope defined by linear inequalities. This is the BL polytope.

Note that while there are infinitely many possible choices of subspace {\displaystyle V} of {\displaystyle \mathbb {R} ^{d}}, there are only finitely many distinct inequalities of the form {\displaystyle \dim(V)\leq \sum _{i}p_{i}\dim(B_{i}(V))}, so the subset is a closed convex polytope.
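
As a small worked example (my own, not from the text), take d = 2 with the three rank-one maps B_1(x, y) = x, B_2(x, y) = y and B_3(x, y) = x + y. In this particular example the only subspaces that produce non-trivial constraints are the three kernels, so membership of p in the BL polytope can be tested with a handful of linear conditions; a Python sketch:

    # Illustrative datum: d = 2, three rank-one maps with d_1 = d_2 = d_3 = 1.
    # Only the three kernels give non-trivial constraints here, so the BL polytope
    # is cut out by finitely many linear conditions, as noted above.
    def in_bl_polytope(p, tol=1e-9):
        p1, p2, p3 = p
        if abs(p1 + p2 + p3 - 2.0) > tol:          # scaling condition: d = sum_i p_i d_i
            return False
        # dim(V) <= sum_i p_i dim(B_i(V)) for V = ker(B_1), ker(B_2), ker(B_3)
        return (p2 + p3 >= 1 - tol and
                p1 + p3 >= 1 - tol and
                p1 + p2 >= 1 - tol and
                min(p) >= -tol)

    print(in_bl_polytope((2/3, 2/3, 2/3)))    # True: the "geometric" weights lie in the polytope
    print(in_bl_polytope((1.5, 0.25, 0.25)))  # False: the constraint p2 + p3 >= 1 fails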

Similarly we can define the BL polytope for the discrete case.

Relationships to other inequalities

The geometric Brascamp–Lieb inequality

The geometric Brascamp–Lieb inequality, first derived in 1976,[6] is a special case of the general inequality. It was used by Keith Ball, in 1989, to provide upper bounds for volumes of central sections of cubes.[7]

For i = 1, ..., m, let ci > 0 and let ui ∈ Sn−1 be a unit vector; suppose that ci and ui satisfy

{\displaystyle x=\sum _{i=1}^{m}c_{i}(x\cdot u_{i})u_{i}}

for all x in Rn. Let fi ∈ L1(R; [0, +∞]) for each i = 1, ..., m. Then

{\displaystyle \int _{\mathbb {R} ^{n}}\prod _{i=1}^{m}f_{i}(x\cdot u_{i})^{c_{i}}\,\mathrm {d} x\leq \prod _{i=1}^{m}\left(\int _{\mathbb {R} }f_{i}(y)\,\mathrm {d} y\right)^{c_{i}}.}

The geometric Brascamp–Lieb inequality follows from the Brascamp–Lieb inequality as stated above by taking ni = 1 and Bi(x) = x · ui. Then, for zi ∈ R,

{\displaystyle B_{i}^{*}(z_{i})=z_{i}u_{i}.}

It follows that D = 1 in this case.
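
To see the geometric inequality in action numerically, the sketch below (an illustrative example of my own, assuming NumPy) uses the three unit vectors at 120 degrees in R^2 with weights c_i = 2/3, first checking that sum_i c_i u_i u_i^T is the identity, so that x = sum_i c_i (x · u_i) u_i, and then comparing both sides of the inequality on a grid for f_i(t) = exp(-|t|).

    import numpy as np

    # Three unit vectors at 120 degrees with c_i = 2/3: sum_i c_i u_i u_i^T = I, so the
    # decomposition hypothesis x = sum_i c_i (x . u_i) u_i holds.
    angles = np.array([0.0, 2 * np.pi / 3, 4 * np.pi / 3])
    U = np.stack([np.cos(angles), np.sin(angles)], axis=1)
    c = np.full(3, 2.0 / 3.0)
    print(sum(c[i] * np.outer(U[i], U[i]) for i in range(3)))   # approximately the 2x2 identity

    # Compare both sides of the inequality on a grid for f_i(t) = exp(-|t|), whose integral is 2.
    t = np.linspace(-20.0, 20.0, 1201)
    dx = t[1] - t[0]
    X, Y = np.meshgrid(t, t, indexing="ij")
    integrand = np.ones_like(X)
    for i in range(3):
        integrand *= np.exp(-np.abs(X * U[i, 0] + Y * U[i, 1])) ** c[i]
    lhs = integrand.sum() * dx * dx                   # int_{R^2} prod_i f_i(x . u_i)^{c_i} dx
    rhs = np.prod(np.full(3, 2.0) ** c)               # prod_i (int f_i)^{c_i} = 2^2 = 4
    print(lhs, "<=", rhs)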

Hölder's inequality

Take ni = n, Bi = id, the identity map on {\displaystyle \mathbb {R} ^{n}}, replace fi by {\displaystyle f_{i}^{1/c_{i}}}, and let ci = 1/pi for 1 ≤ i ≤ m. Then

{\displaystyle \sum _{i=1}^{m}{\frac {1}{p_{i}}}=1}

and the log-concavity of the determinant of a positive-definite matrix implies that D = 1. This yields Hölder's inequality in {\displaystyle \mathbb {R} ^{n}}:

{\displaystyle \int _{\mathbb {R} ^{n}}\prod _{i=1}^{m}f_{i}(x)\,\mathrm {d} x\leq \prod _{i=1}^{m}\|f_{i}\|_{p_{i}}.}
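
The determinant step can be checked numerically: since log det is concave on the cone of positive-definite matrices, sum_i c_i = 1 gives det(sum_i c_i A_i) >= prod_i (det A_i)^{c_i}, with equality when all A_i coincide, which forces D = 1 in this case. A small Python sketch of my own (assuming NumPy):

    import numpy as np

    # log det is concave on positive-definite matrices, so with sum_i c_i = 1 we get
    # det(sum_i c_i A_i) >= prod_i (det A_i)^{c_i}; equality when all A_i are equal, hence D = 1.
    rng = np.random.default_rng(0)
    n, m = 4, 3
    c = rng.random(m)
    c /= c.sum()                                          # c_i > 0 with sum_i c_i = 1
    A = []
    for _ in range(m):
        G = rng.standard_normal((n, n))
        A.append(G @ G.T + np.eye(n))                     # random positive-definite matrix
    lhs = np.linalg.det(sum(ci * Ai for ci, Ai in zip(c, A)))
    rhs = np.prod([np.linalg.det(Ai) ** ci for ci, Ai in zip(c, A)])
    print(lhs, ">=", rhs)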

Poincaré inequality

The Brascamp–Lieb inequality, in the alternative form above, is an extension of the Poincaré inequality, which only concerns Gaussian probability distributions.[8]

Cramér–Rao bound

The Brascamp–Lieb inequality is also related to the Cramér–Rao bound.[8] While Brascamp–Lieb gives an upper bound, the Cramér–Rao bound lower-bounds the variance {\displaystyle \operatorname {var} _{p}(S(x))}. The Cramér–Rao bound states

{\displaystyle \operatorname {var} _{p}(S(x))\geq E_{p}(\nabla ^{T}S(x))[E_{p}(H\phi (x))]^{-1}E_{p}(\nabla S(x)),}

which is very similar to the Brascamp–Lieb inequality in the alternative form shown above.
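
For a concrete comparison (my own one-dimensional example, assuming NumPy), both bounds can be evaluated on a grid for p(x) proportional to exp(-phi(x)) with phi(x) = x^4/4 + x^2/2 and the linear statistic S(x) = x; the Cramér–Rao value sits below the variance and the Brascamp–Lieb value above it.

    import numpy as np

    # p(x) ~ exp(-phi(x)) with phi(x) = x^4/4 + x^2/2 (log-concave), statistic S(x) = x.
    x = np.linspace(-6.0, 6.0, 200001)
    dx = x[1] - x[0]
    integrate = lambda f: np.sum(f) * dx                  # simple Riemann sum

    phi = x**4 / 4 + x**2 / 2
    p = np.exp(-phi)
    p /= integrate(p)
    Hphi = 3 * x**2 + 1                                   # phi''(x), the 1x1 Hessian

    S, dS = x, np.ones_like(x)                            # S(x) = x, S'(x) = 1
    var_S = integrate((S - integrate(S * p)) ** 2 * p)
    cramer_rao = integrate(dS * p) ** 2 / integrate(Hphi * p)    # lower bound on the variance
    brascamp_lieb = integrate(dS ** 2 / Hphi * p)                # upper bound on the variance
    print(cramer_rao, "<=", var_S, "<=", brascamp_lieb)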

References

  1. ^ This inequality is in Lieb, Elliott H. (1990). "Gaussian Kernels have only Gaussian Maximizers". Inventiones Mathematicae. 102: 179–208. Bibcode:1990InMat.102..179L. doi:10.1007/bf01233426.
  2. ^ This theorem was originally derived in Brascamp, Herm J.; Lieb, Elliott H. (1976). "On Extensions of the Brunn–Minkowski and Prékopa–Leindler theorems, including inequalities for log concave functions, and with an application to the diffusion equation". Journal of Functional Analysis. 22 (4): 366–389. doi:10.1016/0022-1236(76)90004-5. Extensions of the inequality can be found in Hargé, Gilles (2008). "Reinforcement of an Inequality due to Brascamp and Lieb". Journal of Functional Analysis. 254 (2): 267–300. doi:10.1016/j.jfa.2007.07.019. and Carlen, Eric A.; Cordero-Erausquin, Dario; Lieb, Elliott H. (2013). "Asymmetric Covariance Estimates of Brascamp-Lieb Type and Related Inequalities for Log-concave Measures". Annales de l'Institut Henri Poincaré B. 49 (1): 1–12. arXiv:1106.0709. Bibcode:2013AIHPB..49....1C. doi:10.1214/11-aihp462.
  3. ^ Bennett, Jonathan; Carbery, Anthony; Christ, Michael; Tao, Terence (2008-01-01). "The Brascamp–Lieb Inequalities: Finiteness, Structure and Extremals". Geometric and Functional Analysis. 17 (5): 1343–1415. doi:10.1007/s00039-007-0619-6. hdl:20.500.11820/b13abfca-453c-4aea-adf6-d7d421cec7a4. ISSN 1420-8970. S2CID 10193995.
  4. ^ Bennett, Jonathan; Carbery, Anthony; Christ, Michael; Tao, Terence (2005-05-31). "Finite bounds for Holder-Brascamp-Lieb multilinear inequalities". arXiv:math/0505691.
  5. ^ Christ, Michael; Demmel, James; Knight, Nicholas; Scanlon, Thomas; Yelick, Katherine (2013-07-31). "Communication lower bounds and optimal algorithms for programs that reference arrays -- Part 1". arXiv:1308.0068 [math.CA].
  6. ^ This was derived first in Brascamp, H. J.; Lieb, E. H. (1976). "Best Constants in Young's Inequality, Its Converse and Its Generalization to More Than Three Functions". Advances in Mathematics. 20 (2): 151–172. doi:10.1016/0001-8708(76)90184-5.
  7. ^ Ball, Keith M. (1989). "Volumes of Sections of Cubes and Related Problems". In Lindenstrauss, Joram; Milman, Vitali D. (eds.). Geometric Aspects of Functional Analysis. Lecture Notes in Mathematics. Vol. 1376. Berlin: Springer. pp. 251–260. doi:10.1007/BFb0090058. ISBN 978-3-540-51303-2.
  8. ^ a b Saumard, Adrien; Wellner, Jon A. (2014). "Log-concavity and strong log-concavity: a review". Statistics Surveys. 8: 45–114. doi:10.1214/14-SS107. PMC 4847755. PMID 27134693.