Schur polynomial

From HandWiki
Revision as of 14:18, 6 February 2024 by John Stpola (talk | contribs) (fix)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

In mathematics, Schur polynomials, named after Issai Schur, are certain symmetric polynomials in n variables, indexed by partitions, that generalize the elementary symmetric polynomials and the complete homogeneous symmetric polynomials. In representation theory they are the characters of polynomial irreducible representations of the general linear groups. The Schur polynomials form a linear basis for the space of all symmetric polynomials. Any product of Schur polynomials can be written as a linear combination of Schur polynomials with non-negative integral coefficients; the values of these coefficients is given combinatorially by the Littlewood–Richardson rule. More generally, skew Schur polynomials are associated with pairs of partitions and have similar properties to Schur polynomials.

Definition (Jacobi's bialternant formula)

Schur polynomials are indexed by integer partitions. Given a partition λ = (λ1, λ2, ...,λn), where λ1λ2 ≥ ... ≥ λn, and each λj is a non-negative integer, the functions

[math]\displaystyle{ a_{(\lambda_1+n-1, \lambda_2+n-2, \dots , \lambda_n)} (x_1, x_2, \dots , x_n) = \det \left[ \begin{matrix} x_1^{\lambda_1+n-1} & x_2^{\lambda_1+n-1} & \dots & x_n^{\lambda_1+n-1} \\ x_1^{\lambda_2+n-2} & x_2^{\lambda_2+n-2} & \dots & x_n^{\lambda_2+n-2} \\ \vdots & \vdots & \ddots & \vdots \\ x_1^{\lambda_n} & x_2^{\lambda_n} & \dots & x_n^{\lambda_n} \end{matrix} \right] }[/math]

are alternating polynomials by properties of the determinant. A polynomial is alternating if it changes sign under any transposition of the variables.

Since they are alternating, they are all divisible by the Vandermonde determinant [math]\displaystyle{ a_{(n-1, n-2, \dots , 0)} (x_1, x_2, \dots , x_n) = \det \left[ \begin{matrix} x_1^{n-1} & x_2^{n-1} & \dots & x_n^{n-1} \\ x_1^{n-2} & x_2^{n-2} & \dots & x_n^{n-2} \\ \vdots & \vdots & \ddots & \vdots \\ 1 & 1 & \dots & 1 \end{matrix} \right] = \prod_{1 \leq j \lt k \leq n} (x_j-x_k). }[/math] The Schur polynomials are defined as the ratio

[math]\displaystyle{ s_{\lambda} (x_1, x_2, \dots , x_n) = \frac{ a_{(\lambda_1+n-1, \lambda_2+n-2, \dots , \lambda_n+0)} (x_1, x_2, \dots , x_n)} {a_{(n-1, n-2, \dots , 0)} (x_1, x_2, \dots , x_n) }. }[/math]

This is known as the bialternant formula of Jacobi. It is a special case of the Weyl character formula.

This is a symmetric function because the numerator and denominator are both alternating, and a polynomial since all alternating polynomials are divisible by the Vandermonde determinant.

Properties

The degree d Schur polynomials in n variables are a linear basis for the space of homogeneous degree d symmetric polynomials in n variables. For a partition λ = (λ1, λ2, ..., λn), the Schur polynomial is a sum of monomials,

[math]\displaystyle{ s_\lambda(x_1,x_2,\ldots,x_n)=\sum_T x^T = \sum_T x_1^{t_1}\cdots x_n^{t_n} }[/math]

where the summation is over all semistandard Young tableaux T of shape λ. The exponents t1, ..., tn give the weight of T, in other words each ti counts the occurrences of the number i in T. This can be shown to be equivalent to the definition from the first Giambelli formula using the Lindström–Gessel–Viennot lemma (as outlined on that page).

Schur polynomials can be expressed as linear combinations of monomial symmetric functions mμ with non-negative integer coefficients Kλμ called Kostka numbers,

[math]\displaystyle{ s_\lambda= \sum_\mu K_{\lambda\mu}m_\mu.\ }[/math]

The Kostka numbers Kλμ are given by the number of semi-standard Young tableaux of shape λ and weight μ.

Jacobi−Trudi identities

The first Jacobi−Trudi formula expresses the Schur polynomial as a determinant in terms of the complete homogeneous symmetric polynomials,

[math]\displaystyle{ s_{\lambda} = \det(h_{\lambda_{i} + j - i})_{i,j = 1}^{l(\lambda)} = \det\left[ \begin{matrix} h_{\lambda_1} & h_{\lambda_1 + 1} & \dots & h_{\lambda_1 + n - 1} \\ h_{\lambda_2-1} & h_{\lambda_2} & \dots & h_{\lambda_2+n-2} \\ \vdots & \vdots & \ddots & \vdots \\ h_{\lambda_n-n+1} & h_{\lambda_n-n+2} & \dots & h_{\lambda_n} \end{matrix} \right], }[/math]

where hi := s(i).[1]

The second Jacobi-Trudi formula expresses the Schur polynomial as a determinant in terms of the elementary symmetric polynomials,

[math]\displaystyle{ s_{\lambda} = \det(e_{\lambda'_{i} + j - i})_{i,j = 1}^{l(\lambda')} = \det\left[ \begin{matrix} e_{\lambda'_1} & e_{\lambda'_1 + 1} & \dots & e_{\lambda'_1 + l - 1} \\ e_{\lambda'_2-1} & e_{\lambda'_2} & \dots & e_{\lambda'_2+ l-2} \\ \vdots & \vdots & \ddots & \vdots \\ e_{\lambda'_l-l+1} & e_{\lambda'_l-l+2} & \dots & e_{\lambda'_l} \end{matrix} \right], }[/math]

where ei := s(1i) and λ' is the conjugate partition to λ.[2]

In both identities, functions with negative subscripts are defined to be zero.

The Giambelli identity

Another determinantal identity is Giambelli's formula, which expresses the Schur function for an arbitrary partition in terms of those for the hook partitions contained within the Young diagram. In Frobenius' notation, the partition is denoted

[math]\displaystyle{ (a_1, \ldots, a_r\mid b_1, \ldots, b_r) }[/math]

where, for each diagonal element in position ii, ai denotes the number of boxes to the right in the same row and bi denotes the number of boxes beneath it in the same column (the arm and leg lengths, respectively).

The Giambelli identity expresses the Schur function corresponding to this partition as the determinant

[math]\displaystyle{ s_{ (a_1, \ldots, a_r\mid b_1, \ldots, b_r)} = \det ( s_{(a_i \mid b_j)}) }[/math]

of those for hook partitions.

The Cauchy identity

The Cauchy identity for Schur functions (now in infinitely many variables), and its dual state that

[math]\displaystyle{ \sum_\lambda s_\lambda(x) s_{\lambda}(y) = \sum_\lambda m_\lambda(x) h_{\lambda}(y)= \prod_{i,j} (1-x_i y_j)^{-1}, }[/math]

and

[math]\displaystyle{ \sum_\lambda s_\lambda(x) s_{\lambda'}(y) = \sum_\lambda m_\lambda(x) e_{\lambda}(y) = \prod_{i,j} (1+x_i y_j), }[/math]

where the sum is taken over all partitions λ, and [math]\displaystyle{ h_{\lambda}(x) }[/math], [math]\displaystyle{ e_{\lambda}(x) }[/math] denote the complete symmetric functions and elementary symmetric functions, respectively. If the sum is taken over products of Schur polynomials in [math]\displaystyle{ n }[/math] variables [math]\displaystyle{ (x_1, \dots, x_n) }[/math], the sum includes only partitions of length [math]\displaystyle{ \ell(\lambda) \le n }[/math] since otherwise the Schur polynomials vanish.

There are many generalizations of these identities to other families of symmetric functions. For example, Macdonald polynomials, Schubert polynomials and Grothendieck polynomials admit Cauchy-like identities.

Further identities

The Schur polynomial can also be computed via a specialization of a formula for Hall–Littlewood polynomials,

[math]\displaystyle{ s_{\lambda}(x_1,\dotsc,x_n) = \sum_{w \in S_n / S^{\lambda}_n} w\left( x^\lambda \prod_{\lambda_i \gt \lambda_j} \frac{x_i}{x_i-x_j} \right) }[/math]

where [math]\displaystyle{ S^{\lambda}_n }[/math] is the subgroup of permutations such that [math]\displaystyle{ \lambda_{w(i)}=\lambda_i }[/math] for all i, and w acts on variables by permuting indices.

The Murnaghan−Nakayama rule

The Murnaghan–Nakayama rule expresses a product of a power-sum symmetric function with a Schur polynomial, in terms of Schur polynomials:

[math]\displaystyle{ p_r \cdot s_\lambda = \sum_{\mu} (-1)^{ht(\mu/\lambda)+1}s_\mu }[/math]

where the sum is over all partitions μ such that μ/λ is a rim-hook of size r and ht(μ/λ) is the number of rows in the diagram μ/λ.

The Littlewood–Richardson rule and Pieri's formula

The Littlewood–Richardson coefficients depend on three partitions, say [math]\displaystyle{ \lambda,\mu,\nu }[/math], of which [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \mu }[/math] describe the Schur functions being multiplied, and [math]\displaystyle{ \nu }[/math] gives the Schur function of which this is the coefficient in the linear combination; in other words they are the coefficients [math]\displaystyle{ c_{\lambda,\mu}^\nu }[/math] such that

[math]\displaystyle{ s_\lambda s_\mu=\sum_\nu c_{\lambda,\mu}^\nu s_\nu. }[/math]

The Littlewood–Richardson rule states that [math]\displaystyle{ c_{\lambda,\mu}^\nu }[/math] is equal to the number of Littlewood–Richardson tableaux of skew shape [math]\displaystyle{ \nu/\lambda }[/math] and of weight [math]\displaystyle{ \mu }[/math].

Pieri's formula is a special case of the Littlewood-Richardson rule, which expresses the product [math]\displaystyle{ h_r s_{\lambda} }[/math] in terms of Schur polynomials. The dual version expresses [math]\displaystyle{ e_r s_{\lambda} }[/math] in terms of Schur polynomials.

Specializations

Evaluating the Schur polynomial sλ in (1, 1, ..., 1) gives the number of semi-standard Young tableaux of shape λ with entries in 1, 2, ..., n. One can show, by using the Weyl character formula for example, that [math]\displaystyle{ s_\lambda(1,1,\dots,1) = \prod_{1\leq i \lt j \leq n} \frac{\lambda_i - \lambda_j + j-i}{j-i}. }[/math] In this formula, λ, the tuple indicating the width of each row of the Young diagram, is implicitly extended with zeros until it has length n. The sum of the elements λi is d. See also the Hook length formula which computes the same quantity for fixed λ.

Example

The following extended example should help clarify these ideas. Consider the case n = 3, d = 4. Using Ferrers diagrams or some other method, we find that there are just four partitions of 4 into at most three parts. We have

[math]\displaystyle{ s_{(2,1,1)} (x_1, x_2, x_3) = \frac{1}{\Delta} \; \det \left[ \begin{matrix} x_1^4 & x_2^4 & x_3^4 \\ x_1^2 & x_2^2 & x_3^2 \\ x_1 & x_2 & x_3 \end{matrix} \right] = x_1 \, x_2 \, x_3 \, (x_1 + x_2 + x_3) }[/math]
[math]\displaystyle{ s_{(2,2,0)} (x_1, x_2, x_3) = \frac{1}{\Delta} \; \det \left[ \begin{matrix} x_1^4 & x_2^4 & x_3^4 \\ x_1^3 & x_2^3 & x_3^3 \\ 1 & 1 & 1 \end{matrix} \right]= x_1^2 \, x_2^2 + x_1^2 \, x_3^2 + x_2^2 \, x_3^2 + x_1^2 \, x_2 \, x_3 + x_1 \, x_2^2 \, x_3 + x_1 \, x_2 \, x_3^2 }[/math]

and so on, where [math]\displaystyle{ \Delta }[/math] is the Vandermonde determinant [math]\displaystyle{ a_{(2,1,0)}(x_1,x_2,x_3) }[/math]. Summarizing:

  1. [math]\displaystyle{ s_{(2,1,1)} = e_1 \, e_3 }[/math]
  2. [math]\displaystyle{ s_{(2,2,0)} = e_2^2 - e_1 \, e_3 }[/math]
  3. [math]\displaystyle{ s_{(3,1,0)} = e_1^2 \, e_2 - e_2^2 - e_1 \, e_3 }[/math]
  4. [math]\displaystyle{ s_{(4,0,0)} = e_1^4 - 3 \, e_1^2 \, e_2 + 2 \, e_1 \, e_3 + e_2^2. }[/math]

Every homogeneous degree-four symmetric polynomial in three variables can be expressed as a unique linear combination of these four Schur polynomials, and this combination can again be found using a Gröbner basis for an appropriate elimination order. For example,

[math]\displaystyle{ \phi(x_1, x_2, x_3) = x_1^4 + x_2^4 + x_3^4 }[/math]

is obviously a symmetric polynomial which is homogeneous of degree four, and we have

[math]\displaystyle{ \phi = s_{(2,1,1)} - s_{(3,1,0)} + s_{(4,0,0)}.\,\! }[/math]

Relation to representation theory

The Schur polynomials occur in the representation theory of the symmetric groups, general linear groups, and unitary groups. The Weyl character formula implies that the Schur polynomials are the characters of finite-dimensional irreducible representations of the general linear groups, and helps to generalize Schur's work to other compact and semisimple Lie groups.

Several expressions arise for this relation, one of the most important being the expansion of the Schur functions sλ in terms of the symmetric power functions [math]\displaystyle{ p_k=\sum_i x_i^k }[/math]. If we write χλρ for the character of the representation of the symmetric group indexed by the partition λ evaluated at elements of cycle type indexed by the partition ρ, then

[math]\displaystyle{ s_\lambda = \sum_{\nu} \frac{\chi^\lambda_\nu}{z_\nu} p_\nu = \sum_{\rho=(1^{r_1},2^{r_2},3^{r_3},\dots)}\chi^\lambda_\rho \prod_k \frac{p^{r_k}_k}{r_k! k^{r_k} }, }[/math]

where ρ = (1r1, 2r2, 3r3, ...) means that the partition ρ has rk parts of length k.

A proof of this can be found in R. Stanley's Enumerative Combinatorics Volume 2, Corollary 7.17.5.

The integers χλρ can be computed using the Murnaghan–Nakayama rule.

Schur positivity

Due to the connection with representation theory, a symmetric function which expands positively in Schur functions are of particular interest. For example, the skew Schur functions expand positively in the ordinary Schur functions, and the coefficients are Littlewood–Richardson coefficients.

A special case of this is the expansion of the complete homogeneous symmetric functions hλ in Schur functions. This decomposition reflects how a permutation module is decomposed into irreducible representations.

Methods for proving Schur positivity

There are several approaches to prove Schur positivity of a given symmetric function F. If F is described in a combinatorial manner, a direct approach is to produce a bijection with semi-standard Young tableaux. The Edelman–Greene correspondence and the Robinson–Schensted–Knuth correspondence are examples of such bijections.

A bijection with more structure is a proof using so called crystals. This method can be described as defining a certain graph structure described with local rules on the underlying combinatorial objects.

A similar idea is the notion of dual equivalence. This approach also uses a graph structure, but on the objects representing the expansion in the fundamental quasisymmetric basis. It is closely related to the RSK-correspondence.

Generalizations

Skew Schur functions

Skew Schur functions sλ/μ depend on two partitions λ and μ, and can be defined by the property

[math]\displaystyle{ \langle s_{\lambda/\mu},s_\nu\rangle = \langle s_{\lambda},s_\mu s_\nu\rangle. }[/math]

Here, the inner product is the Hall inner product, for which the Schur polynomials form an orthonormal basis.

Similar to the ordinary Schur polynomials, there are numerous ways to compute these. The corresponding Jacobi-Trudi identities are

[math]\displaystyle{ s_{\lambda/\mu} = \det(h_{\lambda_i - \mu_j -i + j})_{i,j = 1}^{l(\lambda)} }[/math]
[math]\displaystyle{ s_{\lambda'/\mu'} = \det(e_{\lambda_i - \mu_j -i + j})_{i,j = 1}^{l(\lambda)} }[/math]

There is also a combinatorial interpretation of the skew Schur polynomials, namely it is a sum over all semi-standard Young tableaux (or column-strict tableaux) of the skew shape [math]\displaystyle{ \lambda/\mu }[/math].

The skew Schur polynomials expands positively in Schur polynomials. A rule for the coefficients is given by the Littlewood-Richardson rule.

Double Schur polynomials

The double Schur polynomials[3] can be seen as a generalization of the shifted Schur polynomials. These polynomials are also closely related to the factorial Schur polynomials. Given a partition λ, and a sequence a1, a2,... one can define the double Schur polynomial sλ(x || a) as [math]\displaystyle{ s_\lambda(x||a) = \sum_T \prod_{\alpha \in \lambda}(x_{T(\alpha)} - a_{T(\alpha)-c(\alpha)}) }[/math] where the sum is taken over all reverse semi-standard Young tableaux T of shape λ, and integer entries in 1, ..., n. Here T(α) denotes the value in the box α in T and c(α) is the content of the box.

A combinatorial rule for the Littlewood-Richardson coefficients (depending on the sequence a) was given by A.I Molev.[3] In particular, this implies that the shifted Schur polynomials have non-negative Littlewood-Richardson coefficients.

The shifted Schur polynomials s*λ(y) can be obtained from the double Schur polynomials by specializing ai = −i and yi = xi + i.

The double Schur polynomials are special cases of the double Schubert polynomials.

Factorial Schur polynomials

The factorial Schur polynomials may be defined as follows. Given a partition λ, and a doubly infinite sequence ...,a−1, a0, a1, ... one can define the factorial Schur polynomial sλ(x|a) as [math]\displaystyle{ s_\lambda(x|a) = \sum_T \prod_{\alpha \in \lambda}(x_{T(\alpha)} - a_{T(\alpha)+c(\alpha)}) }[/math] where the sum is taken over all semi-standard Young tableaux T of shape λ, and integer entries in 1, ..., n. Here T(α) denotes the value in the box α in T and c(α) is the content of the box.

There is also a determinant formula, [math]\displaystyle{ s_\lambda(x|a) = \frac{\det[(x_j|a)^{\lambda_i+n-i}]_{i,j = 1}^{l(\lambda)}}{\prod_{i\lt j}(x_i-x_j)} }[/math] where (y|a)k = (ya1) ... (yak). It is clear that if we let ai = 0 for all i, we recover the usual Schur polynomial sλ.

The double Schur polynomials and the factorial Schur polynomials in n variables are related via the identity sλ(x||a) = sλ(x|u) where ani+1 = ui.

Other generalizations

There are numerous generalizations of Schur polynomials:

  • Hall–Littlewood polynomials
  • Shifted Schur polynomials
  • Flagged Schur polynomials
  • Schubert polynomials
  • Stanley symmetric functions (also known as stable Schubert polynomials)
  • Key polynomials (also known as Demazure characters)
  • Quasi-symmetric Schur polynomials
  • Row-strict Schur polynomials
  • Jack polynomials
  • Modular Schur polynomials
  • Loop Schur functions
  • Macdonald polynomials
  • Schur polynomials for the symplectic and orthogonal group.
  • k-Schur functions
  • Grothendieck polynomials (K-theoretical analogue of Schur polynomials)
  • LLT polynomials

See also

References

  1. Fulton & Harris 1991, Formula A.5
  2. Fulton & Harris 1991, Formula A.6
  3. 3.0 3.1 Molev, A.I. (June 2009). "Littlewood–Richardson polynomials". Journal of Algebra 321 (11): 3450–68. doi:10.1016/j.jalgebra.2008.02.034.