Multilinear polynomial

From HandWiki

In algebra, a multilinear polynomial[1] is a multivariate polynomial that is linear (meaning affine) in each of its variables separately, but not necessarily simultaneously. It is a polynomial in which no variable occurs to a power of 2 or higher; that is, each monomial is a constant times a product of distinct variables. For example f(x,y,z) = 3xy + 2.5 y - 7z is a multilinear polynomial of degree 2 (because of the monomial 3xy) whereas f(x,y,z) = x² +4y is not. The degree of a multilinear polynomial is the maximum number of distinct variables occurring in any monomial.

Definition

Multilinear polynomials can be understood as a multilinear map (specifically, a multilinear form) applied to the vectors [1 x], [1 y], etc. The general form can be written as a tensor contraction:[math]\displaystyle{ f(x) = \sum_{i_1=0}^1\sum_{i_2=0}^1\cdots\sum_{i_n=0}^1 a_{i_1i_2\cdots i_n}x_1^{i_1}x_2^{i_2}\cdots x_n^{i_n} }[/math]

For example, in two variables:[math]\displaystyle{ f(x,y) = \sum_{i=0}^1\sum_{j=0}^1 a_{ij}x^{i}y^{j} = a_{00} + a_{10}x + a_{01}y + a_{11}xy = \begin{pmatrix}1&x\end{pmatrix}\begin{pmatrix}a_{00}&a_{01}\\a_{10}&a_{11}\end{pmatrix}\begin{pmatrix}1\\y\end{pmatrix} }[/math]

Properties

A multilinear polynomial [math]\displaystyle{ f }[/math] is linear (affine) when varying only one variable, [math]\displaystyle{ x_k }[/math]:[math]\displaystyle{ f(x_1, x_2, ..., x_k, ..., x_n) = a x_k + b }[/math]where [math]\displaystyle{ a }[/math] and [math]\displaystyle{ b }[/math] do not depend on [math]\displaystyle{ x_k }[/math]. Note that [math]\displaystyle{ b }[/math] is generally not zero, so [math]\displaystyle{ f }[/math] is linear in the "shaped like a line" sense, but not in the "directly proportional" sense of a multilinear map.

All repeated second partial derivatives are zero:[math]\displaystyle{ \frac{\partial^2 f}{\partial x_k^2} = 0 }[/math]In other words, its Hessian matrix is a symmetric hollow matrix.

In particular, the Laplacian [math]\displaystyle{ \nabla^2f=0 }[/math], so [math]\displaystyle{ f }[/math] is a harmonic function. This implies [math]\displaystyle{ f }[/math] has maxima and minima only on the boundary of the domain.

More generally, every restriction of [math]\displaystyle{ f }[/math] to a subset of its coordinates is also multilinear, so [math]\displaystyle{ \nabla^2f=0 }[/math] still holds when one or more variables are fixed. In other words, [math]\displaystyle{ f }[/math] is harmonic on every "slice" of the domain along coordinate axes.

On a rectangular domain

When the domain is rectangular in the coordinate axes (e.g. a hypercube), [math]\displaystyle{ f }[/math] will have maxima and minima only on the vertices of the domain, i.e. the finite set of [math]\displaystyle{ 2^n }[/math] points with minimal and maximal coordinate values. The value of the function on these points completely determines the function, since the value on the edges of the boundary can be found by linear interpolation, and the value on the rest of the boundary and the interior is fixed by Laplace's equation, [math]\displaystyle{ \nabla^2f=0 }[/math].[1]

The value of the polynomial at an arbitrary point can be found by repeated linear interpolation along each coordinate axis. Equivalently, it is a weighted mean of the vertex values, where the weights are the Lagrange interpolation polynomials. These weights also constitute a set of generalized barycentric coordinates for the hyperrectangle. Geometrically, the point divides the domain into [math]\displaystyle{ 2^n }[/math] smaller hyperrectangles, and the weight of each vertex is the (fractional) volume of the hyperrectangle opposite it.

Algebraically, the multilinear interpolant on the hyperrectangle [math]\displaystyle{ [a_i, b_i]_{i=1}^n }[/math] is:[math]\displaystyle{ f(x) = \sum_v f(v) \prod_{i|v_i=b_i} \frac{x_i-a_i}{b_i-a_i} \prod_{i|v_i=a_i} \frac{b_i-x_i}{b_i-a_i} }[/math]where the sum is taken over the vertices [math]\displaystyle{ v }[/math]. Equivalently,[math]\displaystyle{ f(x) = \frac{1}{V} \sum_v f(v) \prod_{i|v_i=b_i} (x_i-a_i) \prod_{i|v_i=a_i} (b_i-x_i) }[/math]where V is the volume of the hyperrectangle.

The value at the center is the arithmetic mean of the value at the vertices, which is also the mean over the domain boundary, and the mean over the interior. The components of the gradient at the center are proportional to the balance of the vertex values along each coordinate axis.

The vertex values and the coefficients of the polynomial are related by a linear transformation (specifically, a Möbius transform if the domain is the unit hypercube [math]\displaystyle{ \{0, 1\}^n }[/math], and a Walsh-Hadamard-Fourier transform if the domain is the symmetric hypercube [math]\displaystyle{ \{-1, 1\}^n }[/math]).

Applications

Multilinear polynomials are the interpolants of multilinear or n-linear interpolation on a rectangular grid, a generalization of linear interpolation, bilinear interpolation and trilinear interpolation to an arbitrary number of variables. This is a specific form of multivariate interpolation, not to be confused with piecewise linear interpolation. The resulting polynomial is not a linear function of the coordinates (its degree can be higher than 1), but it is a linear function of the fitted data values.

The determinant, permanent and other immanants of a matrix are homogeneous multilinear polynomials in the elements of the matrix (and also multilinear forms in the rows or columns).

The multilinear polynomials in [math]\displaystyle{ n }[/math] variables form a [math]\displaystyle{ 2^n }[/math]-dimensional vector space, which is also the basis used in the Fourier analysis of (pseudo-)Boolean functions. Every (pseudo-)Boolean function can be uniquely expressed as a multilinear polynomial (up to a choice of domain and codomain).

Multilinear polynomials are important in the study of polynomial identity testing.[2]

See also

References

  1. 1.0 1.1 Laneve, Cosimo; Lascu, Tudor A.; Sordoni, Vania (2010-10-01). "The Interval Analysis of Multilinear Expressions" (in en). Electronic Notes in Theoretical Computer Science 267 (2): 43–53. doi:10.1016/j.entcs.2010.09.017. ISSN 1571-0661. 
  2. A. Giambruno, Mikhail Zaicev. Polynomial Identities and Asymptotic Methods. AMS Bookstore, 2005 ISBN:978-0-8218-3829-7. Section 1.3.