Karamata's inequality

Algebra theorem about convex functions

In mathematics, Karamata's inequality,[1] named after Jovan Karamata,[2] also known as the majorization inequality, is a theorem in elementary algebra for convex and concave real-valued functions, defined on an interval of the real line. It generalizes the discrete form of Jensen's inequality, and generalizes in turn to the concept of Schur-convex functions.

Statement of the inequality

Let I be an interval of the real line and let f denote a real-valued, convex function defined on I. If x1, …, xn and y1, …, yn are numbers in I such that (x1, …, xn) majorizes (y1, …, yn), then

f ( x 1 ) + + f ( x n ) f ( y 1 ) + + f ( y n ) . {\displaystyle f(x_{1})+\cdots +f(x_{n})\geq f(y_{1})+\cdots +f(y_{n}).}
(1)

Here majorization means that x1, …, xn and y1, …, yn satisfies

x 1 x 2 x n {\displaystyle x_{1}\geq x_{2}\geq \cdots \geq x_{n}}     and     y 1 y 2 y n , {\displaystyle y_{1}\geq y_{2}\geq \cdots \geq y_{n},}
(2)

and we have the inequalities

x 1 + + x i y 1 + + y i {\displaystyle x_{1}+\cdots +x_{i}\geq y_{1}+\cdots +y_{i}}      for all i ∈ {1, …, n − 1}.
(3)

and the equality

x 1 + + x n = y 1 + + y n {\displaystyle x_{1}+\cdots +x_{n}=y_{1}+\cdots +y_{n}}
(4)

If f  is a strictly convex function, then the inequality (1) holds with equality if and only if we have xi = yi for all i ∈ {1, …, n}.

Remarks

  • If the convex function f  is non-decreasing, then the proof of (1) below and the discussion of equality in case of strict convexity shows that the equality (4) can be relaxed to
    x 1 + + x n y 1 + + y n . {\displaystyle x_{1}+\cdots +x_{n}\geq y_{1}+\cdots +y_{n}.}
    (5)
  • The inequality (1) is reversed if f  is concave, since in this case the function f  is convex.

Example

The finite form of Jensen's inequality is a special case of this result. Consider the real numbers x1, …, xnI and let

a := x 1 + x 2 + + x n n {\displaystyle a:={\frac {x_{1}+x_{2}+\cdots +x_{n}}{n}}}

denote their arithmetic mean. Then (x1, …, xn) majorizes the n-tuple (a, a, …, a), since the arithmetic mean of the i largest numbers of (x1, …, xn) is at least as large as the arithmetic mean a of all the n numbers, for every i ∈ {1, …, n − 1}. By Karamata's inequality (1) for the convex function f,

f ( x 1 ) + f ( x 2 ) + + f ( x n ) f ( a ) + f ( a ) + + f ( a ) = n f ( a ) . {\displaystyle f(x_{1})+f(x_{2})+\cdots +f(x_{n})\geq f(a)+f(a)+\cdots +f(a)=nf(a).}

Dividing by n gives Jensen's inequality. The sign is reversed if f  is concave.

Proof of the inequality

We may assume that the numbers are in decreasing order as specified in (2).

If xi = yi for all i ∈ {1, …, n}, then the inequality (1) holds with equality, hence we may assume in the following that xiyi for at least one i.

If xi = yi for an i ∈ {1, …, n}, then the inequality (1) and the majorization properties (3) and (4) are not affected if we remove xi and yi. Hence we may assume that xiyi for all i ∈ {1, …, n}.

It is a property of convex functions that for two numbers xy in the interval I the slope

f ( x ) f ( y ) x y {\displaystyle {\frac {f(x)-f(y)}{x-y}}}

of the secant line through the points (x, f (x)) and (y, f (y)) of the graph of f  is a monotonically non-decreasing function in x for y fixed (and vice versa). This implies that

c i + 1 := f ( x i + 1 ) f ( y i + 1 ) x i + 1 y i + 1 f ( x i ) f ( y i ) x i y i =: c i {\displaystyle c_{i+1}:={\frac {f(x_{i+1})-f(y_{i+1})}{x_{i+1}-y_{i+1}}}\leq {\frac {f(x_{i})-f(y_{i})}{x_{i}-y_{i}}}=:c_{i}}
(6)

for all i ∈ {1, …, n − 1}. Define A0 = B0 = 0 and

A i = x 1 + + x i , B i = y 1 + + y i {\displaystyle A_{i}=x_{1}+\cdots +x_{i},\qquad B_{i}=y_{1}+\cdots +y_{i}}

for all i ∈ {1, …, n}. By the majorization property (3), AiBi for all i ∈ {1, …, n − 1} and by (4), An = Bn. Hence,

i = 1 n ( f ( x i ) f ( y i ) ) = i = 1 n c i ( x i y i ) = i = 1 n c i ( A i A i 1 = x i ( B i B i 1 = y i ) ) = i = 1 n c i ( A i B i ) i = 1 n c i ( A i 1 B i 1 ) = c n ( A n B n = 0 ) + i = 1 n 1 ( c i c i + 1 0 ) ( A i B i 0 ) c 1 ( A 0 B 0 = 0 ) 0 , {\displaystyle {\begin{aligned}\sum _{i=1}^{n}{\bigl (}f(x_{i})-f(y_{i}){\bigr )}&=\sum _{i=1}^{n}c_{i}(x_{i}-y_{i})\\&=\sum _{i=1}^{n}c_{i}{\bigl (}\underbrace {A_{i}-A_{i-1}} _{=\,x_{i}}{}-(\underbrace {B_{i}-B_{i-1}} _{=\,y_{i}}){\bigr )}\\&=\sum _{i=1}^{n}c_{i}(A_{i}-B_{i})-\sum _{i=1}^{n}c_{i}(A_{i-1}-B_{i-1})\\&=c_{n}(\underbrace {A_{n}-B_{n}} _{=\,0})+\sum _{i=1}^{n-1}(\underbrace {c_{i}-c_{i+1}} _{\geq \,0})(\underbrace {A_{i}-B_{i}} _{\geq \,0})-c_{1}(\underbrace {A_{0}-B_{0}} _{=\,0})\\&\geq 0,\end{aligned}}}
(7)

which proves Karamata's inequality (1).

To discuss the case of equality in (1), note that x1 > y1 by (3) and our assumption xiyi for all i ∈ {1, …, n − 1}. Let i be the smallest index such that (xi, yi) ≠ (xi+1, yi+1), which exists due to (4). Then Ai > Bi. If f  is strictly convex, then there is strict inequality in (6), meaning that ci+1 < ci. Hence there is a strictly positive term in the sum on the right hand side of (7) and equality in (1) cannot hold.

If the convex function f  is non-decreasing, then cn ≥ 0. The relaxed condition (5) means that AnBn, which is enough to conclude that cn(AnBn) ≥ 0 in the last step of (7).

If the function f  is strictly convex and non-decreasing, then cn > 0. It only remains to discuss the case An > Bn. However, then there is a strictly positive term on the right hand side of (7) and equality in (1) cannot hold.

References

  1. ^ Kadelburg, Zoran; Đukić, Dušan; Lukić, Milivoje; Matić, Ivan (2005), "Inequalities of Karamata, Schur and Muirhead, and some applications" (PDF), The Teaching of Mathematics, 8 (1): 31–45, ISSN 1451-4966
  2. ^ Karamata, Jovan (1932), "Sur une inégalité relative aux fonctions convexes" (PDF), Publ. Math. Univ. Belgrade (in French), 1: 145–148, Zbl 0005.20101

External links

An explanation of Karamata's inequality and majorization theory can be found here.