Karamata's inequality
In mathematics, Karamata's inequality,[1] named after Jovan Karamata,[2] also known as the majorization inequality, is a theorem in elementary algebra for convex and concave real-valued functions, defined on an interval of the real line. It generalizes the discrete form of Jensen's inequality, and generalizes in turn to the concept of Schur-convex functions.
Statement of the inequality
Let I be an interval of the real line and let f denote a real-valued, convex function defined on I. If x1, …, xn and y1, …, yn are numbers in I such that (x1, …, xn) majorizes (y1, …, yn), then
-
[math]\displaystyle{ f(x_1)+\cdots+f(x_n) \ge f(y_1)+\cdots+f(y_n). }[/math]
(
)
Here majorization means that x1, …, xn and y1, …, yn satisfies
-
[math]\displaystyle{ x_1\ge x_2\ge\cdots\ge x_n }[/math] and [math]\displaystyle{ y_1\ge y_2\ge\cdots\ge y_n, }[/math]
(
)
and we have the inequalities
-
[math]\displaystyle{ x_1+\cdots+x_i\ge y_1+\cdots+y_i }[/math] for all i ∈ {1, …, n − 1}.
(
)
and the equality
-
[math]\displaystyle{ x_1+\cdots+x_n=y_1+\cdots+y_n }[/math]
(
)
If f is a strictly convex function, then the inequality (1) holds with equality if and only if we have xi = yi for all i ∈ {1, …, n}.
Remarks
- If the convex function f is non-decreasing, then the proof of (1) below and the discussion of equality in case of strict convexity shows that the equality (4) can be relaxed to
-
[math]\displaystyle{ x_1+\cdots+x_n\ge y_1+\cdots+y_n. }[/math]
(
)
-
- The inequality (1) is reversed if f is concave, since in this case the function −f is convex.
Example
The finite form of Jensen's inequality is a special case of this result. Consider the real numbers x1, …, xn ∈ I and let
- [math]\displaystyle{ a := \frac{x_1+x_2+\cdots+x_n}{n} }[/math]
denote their arithmetic mean. Then (x1, …, xn) majorizes the n-tuple (a, a, …, a), since the arithmetic mean of the i largest numbers of (x1, …, xn) is at least as large as the arithmetic mean a of all the n numbers, for every i ∈ {1, …, n − 1}. By Karamata's inequality (1) for the convex function f,
- [math]\displaystyle{ f(x_1)+f(x_2)+ \cdots +f(x_n) \ge f(a)+f(a)+\cdots+f(a) = nf(a). }[/math]
Dividing by n gives Jensen's inequality. The sign is reversed if f is concave.
Proof of the inequality
We may assume that the numbers are in decreasing order as specified in (2).
If xi = yi for all i ∈ {1, …, n}, then the inequality (1) holds with equality, hence we may assume in the following that xi ≠ yi for at least one i.
If xi = yi for an i ∈ {1, …, n}, then the inequality (1) and the majorization properties (3) and (4) are not affected if we remove xi and yi. Hence we may assume that xi ≠ yi for all i ∈ {1, …, n}.
It is a property of convex functions that for two numbers x ≠ y in the interval I the slope
- [math]\displaystyle{ \frac{f(x)-f(y)}{x-y} }[/math]
of the secant line through the points (x, f (x)) and (y, f (y)) of the graph of f is a monotonically non-decreasing function in x for y fixed (and vice versa). This implies that
-
[math]\displaystyle{ c_{i+1}:=\frac{f(x_{i+1})-f(y_{i+1})}{x_{i+1}-y_{i+1}}\le\frac{f(x_i)-f(y_i)}{x_i-y_i}=:c_i }[/math]
(
)
for all i ∈ {1, …, n − 1}. Define A0 = B0 = 0 and
- [math]\displaystyle{ A_i=x_1+\cdots+x_i,\qquad B_i=y_1+\cdots+y_i }[/math]
for all i ∈ {1, …, n}. By the majorization property (3), Ai ≥ Bi for all i ∈ {1, …, n − 1} and by (4), An = Bn. Hence,
-
[math]\displaystyle{ \begin{align} \sum_{i=1}^n \bigl(f(x_i) - f(y_i)\bigr) &=\sum_{i=1}^n c_i (x_i - y_i)\\ &=\sum_{i=1}^n c_i \bigl(\underbrace{A_i - A_{i-1}}_{=\,x_i}{} - (\underbrace{B_i - B_{i-1}}_{=\,y_i})\bigr)\\ &=\sum_{i=1}^n c_i (A_i - B_i) - \sum_{i=1}^n c_i (A_{i-1} - B_{i-1})\\ &=c_n (\underbrace{A_n-B_n}_{=\,0}) + \sum_{i=1}^{n-1}(\underbrace{c_i - c_{i + 1}}_{\ge\,0})(\underbrace{A_i - B_i}_{\ge\,0}) - c_1(\underbrace{A_0-B_0}_{=\,0})\\ &\ge0, \end{align} }[/math]
(
)
which proves Karamata's inequality (1).
To discuss the case of equality in (1), note that x1 > y1 by (3) and our assumption xi ≠ yi for all i ∈ {1, …, n − 1}. Let i be the smallest index such that (xi, yi) ≠ (xi+1, yi+1), which exists due to (4). Then Ai > Bi. If f is strictly convex, then there is strict inequality in (6), meaning that ci+1 < ci. Hence there is a strictly positive term in the sum on the right hand side of (7) and equality in (1) cannot hold.
If the convex function f is non-decreasing, then cn ≥ 0. The relaxed condition (5) means that An ≥ Bn, which is enough to conclude that cn(An−Bn) ≥ 0 in the last step of (7).
If the function f is strictly convex and non-decreasing, then cn > 0. It only remains to discuss the case An > Bn. However, then there is a strictly positive term on the right hand side of (7) and equality in (1) cannot hold.
References
- ↑ Kadelburg, Zoran; Đukić, Dušan; Lukić, Milivoje; Matić, Ivan (2005), "Inequalities of Karamata, Schur and Muirhead, and some applications", The Teaching of Mathematics 8 (1): 31–45, ISSN 1451-4966, http://elib.mi.sanu.ac.rs/files/journals/tm/14/tm813.pdf
- ↑ Karamata, Jovan (1932), "Sur une inégalité relative aux fonctions convexes" (in French), Publ. Math. Univ. Belgrade 1: 145–148, http://elib.mi.sanu.ac.rs/files/journals/publ/1/11.pdf
External links
An explanation of Karamata's inequality and majorization theory can be found here.
Original source: https://en.wikipedia.org/wiki/Karamata's inequality.
Read more |