Berlekamp's algorithm

From HandWiki

In mathematics, particularly computational algebra, Berlekamp's algorithm is a well-known method for factoring polynomials over finite fields (also known as Galois fields). The algorithm consists mainly of matrix reduction and polynomial GCD computations. It was invented by Elwyn Berlekamp in 1967. It was the dominant algorithm for solving the problem until the Cantor–Zassenhaus algorithm of 1981. It is currently implemented in many well-known computer algebra systems.

Overview

Berlekamp's algorithm takes as input a square-free polynomial [math]\displaystyle{ f(x) }[/math] (i.e. one with no repeated factors) of degree [math]\displaystyle{ n }[/math] with coefficients in a finite field [math]\displaystyle{ \mathbb{F}_q }[/math] and gives as output a polynomial [math]\displaystyle{ g(x) }[/math] with coefficients in the same field such that [math]\displaystyle{ g(x) }[/math] divides [math]\displaystyle{ f(x) }[/math]. The algorithm may then be applied recursively to these and subsequent divisors, until we find the decomposition of [math]\displaystyle{ f(x) }[/math] into powers of irreducible polynomials (recalling that the ring of polynomials over a finite field is a unique factorization domain).

All possible factors of [math]\displaystyle{ f(x) }[/math] are contained within the factor ring

[math]\displaystyle{ R = \frac{\mathbb{F}_q[x]}{\langle f(x) \rangle}. }[/math]

The algorithm focuses on polynomials [math]\displaystyle{ g(x) \in R }[/math] which satisfy the congruence:

[math]\displaystyle{ g(x)^q \equiv g(x) \pmod{f(x)}.\, }[/math]

These polynomials form a subalgebra of R (which can be considered as an [math]\displaystyle{ n }[/math]-dimensional vector space over [math]\displaystyle{ \mathbb{F}_q }[/math]), called the Berlekamp subalgebra. The Berlekamp subalgebra is of interest because the polynomials [math]\displaystyle{ g(x) }[/math] it contains satisfy

[math]\displaystyle{ f(x) = \prod_{s \in \mathbb{F}_q} \gcd(f(x),g(x)-s). }[/math]

In general, not every GCD in the above product will be a non-trivial factor of [math]\displaystyle{ f(x) }[/math], but some are, providing the factors we seek.

Berlekamp's algorithm finds polynomials [math]\displaystyle{ g(x) }[/math] suitable for use with the above result by computing a basis for the Berlekamp subalgebra. This is achieved via the observation that Berlekamp subalgebra is in fact the kernel of a certain [math]\displaystyle{ n \times n }[/math] matrix over [math]\displaystyle{ \mathbb{F}_q }[/math], which is derived from the so-called Berlekamp matrix of the polynomial, denoted [math]\displaystyle{ \mathcal{Q} }[/math]. If [math]\displaystyle{ \mathcal{Q}=[q_{i,j}] }[/math] then [math]\displaystyle{ q_{i,j} }[/math] is the coefficient of the [math]\displaystyle{ j }[/math]-th power term in the reduction of [math]\displaystyle{ x^{iq} }[/math] modulo [math]\displaystyle{ f(x) }[/math], i.e.:

[math]\displaystyle{ x^{iq} \equiv q_{i,n-1}x^{n-1} + q_{i,n-2}x^{n-2} + \ldots + q_{i,0} \pmod{f(x)}.\, }[/math]

With a certain polynomial [math]\displaystyle{ g(x) \in R }[/math], say:

[math]\displaystyle{ g(x) = g_{n-1}x^{n-1}+g_{n-2}x^{n-2} + \ldots + g_0,\, }[/math]

we may associate the row vector:

[math]\displaystyle{ g = (g_0, g_1, \ldots, g_{n-1}).\, }[/math]

It is relatively straightforward to see that the row vector [math]\displaystyle{ g\mathcal{Q} }[/math] corresponds, in the same way, to the reduction of [math]\displaystyle{ g(x)^q }[/math] modulo [math]\displaystyle{ f(x) }[/math]. Consequently, a polynomial [math]\displaystyle{ g(x) \in R }[/math] is in the Berlekamp subalgebra if and only if [math]\displaystyle{ g(\mathcal{Q}-I)=0 }[/math] (where [math]\displaystyle{ I }[/math] is the [math]\displaystyle{ n \times n }[/math] identity matrix), i.e. if and only if it is in the null space of [math]\displaystyle{ \mathcal{Q}-I }[/math].

By computing the matrix [math]\displaystyle{ \mathcal{Q}-I }[/math] and reducing it to reduced row echelon form and then easily reading off a basis for the null space, we may find a basis for the Berlekamp subalgebra and hence construct polynomials [math]\displaystyle{ g(x) }[/math] in it. We then need to successively compute GCDs of the form above until we find a non-trivial factor. Since the ring of polynomials over a field is a Euclidean domain, we may compute these GCDs using the Euclidean algorithm.

Conceptual algebraic explanation

With some abstract algebra, the idea behind Berlekamp's algorithm becomes conceptually clear. We represent a finite field [math]\displaystyle{ \mathbb{F}_q }[/math], where [math]\displaystyle{ q = p^m }[/math] for some prime [math]\displaystyle{ p }[/math], as [math]\displaystyle{ \mathbb{F}_p[y]/(g(y)) }[/math]. We can assume that [math]\displaystyle{ f(x) \in \mathbb{F}_q[x] }[/math] is square free, by taking all possible pth roots and then computing the gcd with its derivative.

Now, suppose that [math]\displaystyle{ f(x) = f_1(x) \ldots f_n(x) }[/math] is the factorization into irreducibles. Then we have a ring isomorphism, [math]\displaystyle{ \sigma: \mathbb{F}_q[x]/(f(x)) \to \prod_i \mathbb{F}_q[x]/(f_i(x)) }[/math], given by the Chinese remainder theorem. The crucial observation is that the Frobenius automorphism [math]\displaystyle{ x \to x^p }[/math] commutes with [math]\displaystyle{ \sigma }[/math], so that if we denote [math]\displaystyle{ \text{Fix}_p(R) = \{ f \in R : f^p = f \} }[/math], then [math]\displaystyle{ \sigma }[/math] restricts to an isomorphism [math]\displaystyle{ \text{Fix}_p( \mathbb{F}_q[x]/(f(x)) ) \to \prod_{i = 1}^n \text{Fix}_p( \mathbb{F}_q[x]/(f_i(x)) ) }[/math]. By finite field theory, [math]\displaystyle{ \text{Fix}_p( \mathbb{F}_q[x]/(f_i(x)) ) }[/math] is always the prime subfield of that field extension. Thus, [math]\displaystyle{ \text{Fix}_p( \mathbb{F}_q[x]/(f(x)) ) }[/math] has [math]\displaystyle{ p }[/math] elements if and only if [math]\displaystyle{ f(x) }[/math] is irreducible.

Moreover, we can use the fact that the Frobenius automorphism is [math]\displaystyle{ \mathbb{F}_p }[/math]-linear to calculate the fixed set. That is, we note that [math]\displaystyle{ \text{Fix}_p( \mathbb{F}_q[x]/(f(x)) ) }[/math] is a [math]\displaystyle{ \mathbb{F}_p }[/math]-subspace, and an explicit basis for it can be calculated in the polynomial ring [math]\displaystyle{ \mathbb{F}_p[x,y]/(f,g) }[/math] by computing [math]\displaystyle{ (x^i y^j)^p }[/math] and establishing the linear equations on the coefficients of [math]\displaystyle{ x,y }[/math] polynomials that are satisfied iff it is fixed by Frobenius. We note that at this point we have an efficiently computable irreducibility criterion, and the remaining analysis shows how to use this to find factors.

The algorithm now breaks down into two cases:

  • In the case of small [math]\displaystyle{ p }[/math] we can construct any [math]\displaystyle{ g \in \text{Fix}_p( \mathbb{F}_q[x]/(f(x)) ) \setminus \mathbb{F}_p }[/math], and then observe that for some [math]\displaystyle{ a \in \mathbb{F}_p }[/math] there are [math]\displaystyle{ i,j }[/math] so that [math]\displaystyle{ g - a = 0 \mod f_i }[/math] and [math]\displaystyle{ g - a \not = 0 \mod f_j }[/math]. Such a [math]\displaystyle{ g - a }[/math] has a nontrivial factor in common with [math]\displaystyle{ f(x) }[/math], which can be computed via the gcd. As [math]\displaystyle{ p }[/math] is small, we can cycle through all possible [math]\displaystyle{ a }[/math].
  • For the case of large primes, which are necessarily odd, one can exploit the fact that a random nonzero element of [math]\displaystyle{ \mathbb{F}_p^* }[/math] is a square with probability [math]\displaystyle{ 1/2 }[/math], and that the map [math]\displaystyle{ x \to x^{ \frac{ p -1}{2}} }[/math] maps the set of non-zero squares to [math]\displaystyle{ 1 }[/math], and the set of non-squares to [math]\displaystyle{ -1 }[/math]. Thus, if we take a random element [math]\displaystyle{ g \in \text{Fix}_p( \mathbb{F}_q[x]/f(x)) }[/math], then with good probability [math]\displaystyle{ g^{ \frac{ p - 1}{2}} - 1 }[/math] will have a non-trivial factor in common with [math]\displaystyle{ f(x) }[/math].

For further details one can consult.[1]

Applications

One important application of Berlekamp's algorithm is in computing discrete logarithms over finite fields [math]\displaystyle{ \mathbb{F}_{p^n} }[/math], where [math]\displaystyle{ p }[/math] is prime and [math]\displaystyle{ n\geq 2 }[/math]. Computing discrete logarithms is an important problem in public key cryptography and error-control coding. For a finite field, the fastest known method is the index calculus method, which involves the factorisation of field elements. If we represent the field [math]\displaystyle{ \mathbb{F}_{p^n} }[/math] in the usual way - that is, as polynomials over the base field [math]\displaystyle{ \mathbb{F}_{p} }[/math], reduced modulo an irreducible polynomial of degree [math]\displaystyle{ n }[/math] - then this is simply polynomial factorisation, as provided by Berlekamp's algorithm.

Implementation in computer algebra systems

Berlekamp's algorithm may be accessed in the PARI/GP package using the factormod command, and the WolframAlpha [1] website.

See also

References