Lebesgue constant (interpolation)

From HandWiki

In mathematics, the Lebesgue constants (depending on a set of nodes and of its size) give an idea of how good the interpolant of a function (at the given nodes) is in comparison with the best polynomial approximation of the function (the degree of the polynomials are obviously fixed). The Lebesgue constant for polynomials of degree at most n and for the set of n + 1 nodes T is generally denoted by Λn(T ). These constants are named after Henri Lebesgue.

Definition

We fix the interpolation nodes [math]\displaystyle{ x_0, ..., x_n }[/math]and an interval [math]\displaystyle{ [a,\,b] }[/math]containing all the interpolation nodes. The process of interpolation maps the function [math]\displaystyle{ f }[/math] to a polynomial [math]\displaystyle{ p }[/math]. This defines a mapping [math]\displaystyle{ X }[/math] from the space C([a, b]) of all continuous functions on [a, b] to itself. The map X is linear and it is a projection on the subspace Πn of polynomials of degree n or less.

The Lebesgue constant [math]\displaystyle{ \Lambda_n(T) }[/math] is defined as the operator norm of X. This definition requires us to specify a norm on C([a, b]). The uniform norm is usually the most convenient.

Properties

The Lebesgue constant bounds the interpolation error: let p denote the best approximation of f among the polynomials of degree n or less. In other words, p minimizes || p −  f || among all p in Πn. Then

[math]\displaystyle{ \|f-X(f)\| \le (\Lambda_n(T)+1) \left \|f-p^* \right \|. }[/math]

We will here prove this statement with the maximum norm.

[math]\displaystyle{ \| f-X(f) \| \le \| f-p^* \| + \| p^* - X(f) \| }[/math]

by the triangle inequality. But X is a projection on Πn, so

pX( f ) = X(p) − X( f ) = X(pf ).

This finishes the proof since [math]\displaystyle{ \|X(p^*-f)\| \le \|X\| \|p^*-f\|=\|X\| \|f-p^*\| }[/math]. Note that this relation comes also as a special case of Lebesgue's lemma.

In other words, the interpolation polynomial is at most a factor Λn(T ) + 1 worse than the best possible approximation. This suggests that we look for a set of interpolation nodes with a small Lebesgue constant.

The Lebesgue constant can be expressed in terms of the Lagrange basis polynomials:

[math]\displaystyle{ l_j(x) := \prod_{\begin{smallmatrix}i=0\\ j\neq i\end{smallmatrix}}^{n} \frac{x-x_i}{x_j-x_i}. }[/math]

In fact, we have the Lebesgue function

[math]\displaystyle{ \lambda_n(x) = \sum_{j=0}^n |l_j(x)|. }[/math]

and the Lebesgue constant (or Lebesgue number) for the grid is its maximum value

[math]\displaystyle{ \Lambda_n(T)=\max_{x\in[a,b]} \lambda_n(x) }[/math]

Nevertheless, it is not easy to find an explicit expression for Λn(T ).

Minimal Lebesgue constants

In the case of equidistant nodes, the Lebesgue constant grows exponentially. More precisely, we have the following asymptotic estimate

[math]\displaystyle{ \Lambda_n(T) \sim \frac{2^{n+1}}{e \, n \log n} \qquad \text{ as } n \to \infty. }[/math]

On the other hand, the Lebesgue constant grows only logarithmically if Chebyshev nodes are used, since we have

[math]\displaystyle{ \tfrac{2}{\pi} \log(n+1)+a \lt \Lambda_n(T) \lt \tfrac{2}{\pi} \log(n+1) + 1, \qquad a = 0.9625\cdots }[/math]

We conclude again that Chebyshev nodes are a very good choice for polynomial interpolation. However, there is an easy (linear) transformation of Chebyshev nodes that gives a better Lebesgue constant. Let ti denote the i-th Chebyshev node. Then, define

[math]\displaystyle{ s_i = \frac{t_i}{\cos \left ( \frac{\pi}{2(n+1)} \right)}. }[/math]

For such nodes:

[math]\displaystyle{ \Lambda_n(S)\lt \tfrac{2}{\pi} \log(n+1)+b, \qquad b = 0.7219\cdots }[/math]

Those nodes are, however, not optimal (i.e. they do not minimize the Lebesgue constants) and the search for an optimal set of nodes (which has already been proved to be unique under some assumptions) is still an intriguing topic in mathematics today. However, this set of nodes is optimal for interpolation over [math]\displaystyle{ C_M^n[-1,1] }[/math] the set of n times differentiable functions whose n-th derivatives are bounded in absolute values by a constant M as shown by N. S. Hoang. Using a computer, one can approximate the values of the minimal Lebesgue constants, here for the canonical interval [−1, 1]:

n 1 2 3 4 5 6 7 8 9
Λn(T) 1.0000 1.2500 1.4229 1.5595 1.6722 1.7681 1.8516 1.9255 1.9917

There are uncountable infinitely many sets of nodes in [-1,1] that minimize, for fixed n > 1, the Lebesgue constant. Though if we assume that we always take −1 and 1 as nodes for interpolation (which is called a canonical node configuration), then such a set is unique and zero-symmetric. To illustrate this property, we shall see what happens when n = 2 (i.e. we consider 3 interpolation nodes in which case the property is not trivial). One can check that each set of (zero-symmetric) nodes of type (−a, 0, a) is optimal when 8/3a ≤ 1 (we consider only nodes in [−1, 1]). If we force the set of nodes to be of the type (−1, b, 1), then b must equal 0 (look at the Lebesgue function, whose maximum is the Lebesgue constant). All arbitrary (i.e. zero-symmetric or zero-asymmetric) optimal sets of nodes in [-1,1] when n = 2 have been determined by F. Schurer, and in an alternative fashion by H.-J. Rack and R. Vajda (2014).

If we assume that we take −1 and 1 as nodes for interpolation, then as shown by H.-J. Rack (1984 and 2013), for the case n = 3, the explicit values of the optimal (unique and zero-symmetric) 4 interpolation nodes and the explicit value of the minimal Lebesgue constant are known. All arbitrary optimal sets of 4 interpolation nodes in [-1,1] when n = 3 have been explicitly determined, in two different but equivalent fashions, by H.-J. Rack and R. Vajda (2015).

The Padua points provide another set of nodes with slow growth (although not as slow as the Chebyshev nodes) and with the additional property of being a unisolvent point set.

Sensitivity of the values of a polynomial

The Lebesgue constants also arise in another problem. Let p(x) be a polynomial of degree n expressed in the Lagrangian form associated with the points in the vector t (i.e. the vector u of its coefficients is the vector containing the values [math]\displaystyle{ p(t_i) }[/math]). Let [math]\displaystyle{ \hat{p}(x) }[/math] be a polynomial obtained by slightly changing the coefficients u of the original polynomial p(x) to [math]\displaystyle{ \hat{u} }[/math]. Consider the inequality:

[math]\displaystyle{ \frac{\|p-\hat{p}\|}{\|p\|}\leq \Lambda_n(T)\frac{\|u-\hat{u}\|}{\|u\|} }[/math]

This means that the (relative) error in the values of [math]\displaystyle{ \hat{p}(x) }[/math] will not be higher than the appropriate Lebesgue constant times the relative error in the coefficients. In this sense, the Lebesgue constant can be viewed as the relative condition number of the operator mapping each coefficient vector u to the set of the values of the polynomial with coefficients u in the Lagrange form. We can actually define such an operator for each polynomial basis but its condition number is greater than the optimal Lebesgue constant for most convenient bases.

References