Parameterized complexity

From HandWiki
(Redirected from W hierarchy)
Short description: Branch of computational complexity theory

In computer science, parameterized complexity is a branch of computational complexity theory that focuses on classifying computational problems according to their inherent difficulty with respect to multiple parameters of the input or output. The complexity of a problem is then measured as a function of those parameters. This allows the classification of NP-hard problems on a finer scale than in the classical setting, where the complexity of a problem is only measured as a function of the number of bits in the input. This appears to have been first demonstrated in (Gurevich Stockmeyer). The first systematic work on parameterized complexity was done by (Downey Fellows).

Under the assumption that P ≠ NP, there exist many natural problems that require superpolynomial running time when complexity is measured in terms of the input size only but that are computable in a time that is polynomial in the input size and exponential or worse in a parameter k. Hence, if k is fixed at a small value and the growth of the function over k is relatively small then such problems can still be considered "tractable" despite their traditional classification as "intractable".

The existence of efficient, exact, and deterministic solving algorithms for NP-complete, or otherwise NP-hard, problems is considered unlikely, if input parameters are not fixed; all known solving algorithms for these problems require time that is exponential (so in particular superpolynomial) in the total size of the input. However, some problems can be solved by algorithms that are exponential only in the size of a fixed parameter while polynomial in the size of the input. Such an algorithm is called a fixed-parameter tractable (FPT) algorithm, because the problem can be solved efficiently (i.e., in polynomial time) for constant values of the fixed parameter.

Problems in which some parameter k is fixed are called parameterized problems. A parameterized problem that allows for such an FPT algorithm is said to be a fixed-parameter tractable problem and belongs to the class FPT, and the early name of the theory of parameterized complexity was fixed-parameter tractability.

Many problems have the following form: given an object x and a nonnegative integer k, does x have some property that depends on k? For instance, for the vertex cover problem, the parameter can be the number of vertices in the cover. In many applications, for example when modelling error correction, one can assume the parameter to be "small" compared to the total input size. Then it is challenging to find an algorithm that is exponential only in k, and not in the input size.

In this way, parameterized complexity can be seen as two-dimensional complexity theory. This concept is formalized as follows:

A parameterized problem is a language [math]\displaystyle{ L \subseteq \Sigma^* \times \N }[/math], where [math]\displaystyle{ \Sigma }[/math] is a finite alphabet. The second component is called the parameter of the problem.
A parameterized problem L is fixed-parameter tractable if the question "[math]\displaystyle{ (x, k) \in L }[/math]?" can be decided in running time [math]\displaystyle{ f(k) \cdot |x|^{O(1)} }[/math], where f is an arbitrary function depending only on k. The corresponding complexity class is called FPT.

For example, there is an algorithm that solves the vertex cover problem in [math]\displaystyle{ O(kn + 1.274^k) }[/math] time,[1] where n is the number of vertices and k is the size of the vertex cover. This means that vertex cover is fixed-parameter tractable with the size of the solution as the parameter.

Complexity classes

FPT

FPT contains the fixed parameter tractable problems, which are those that can be solved in time [math]\displaystyle{ f(k) \cdot {|x|}^{O(1)} }[/math] for some computable function f. Typically, this function is thought of as single exponential, such as [math]\displaystyle{ 2^{O(k)} }[/math], but the definition admits functions that grow even faster. This is essential for a large part of the early history of this class. The crucial part of the definition is to exclude functions of the form [math]\displaystyle{ f(n,k) }[/math], such as [math]\displaystyle{ k^n }[/math].

The class FPL (fixed parameter linear) is the class of problems solvable in time [math]\displaystyle{ f(k) \cdot |x| }[/math] for some computable function f.[2] FPL is thus a subclass of FPT. An example is the Boolean satisfiability problem, parameterised by the number of variables. A given formula of size m with k variables can be checked by brute force in time [math]\displaystyle{ O(2^km) }[/math]. A vertex cover of size k in a graph of order n can be found in time [math]\displaystyle{ O(2^kn) }[/math], so the vertex cover problem is also in FPL.

An example of a problem that is thought not to be in FPT is graph coloring parameterised by the number of colors. It is known that 3-coloring is NP-hard, and an algorithm for graph k-coloring in time [math]\displaystyle{ f(k)n^{O(1)} }[/math] for [math]\displaystyle{ k=3 }[/math] would run in polynomial time in the size of the input. Thus, if graph coloring parameterised by the number of colors were in FPT, then P = NP.

There are a number of alternative definitions of FPT. For example, the running-time requirement can be replaced by [math]\displaystyle{ f(k) + |x|^{O(1)} }[/math]. Also, a parameterised problem is in FPT if it has a so-called kernel. Kernelization is a preprocessing technique that reduces the original instance to its "hard kernel", a possibly much smaller instance that is equivalent to the original instance but has a size that is bounded by a function in the parameter.

FPT is closed under a parameterised notion of reductions called fpt-reductions. Such reductions transform an instance [math]\displaystyle{ (x,k) }[/math] of some problem into an equivalent instance [math]\displaystyle{ (x',k') }[/math] of another problem (with [math]\displaystyle{ k' \leq g(k) }[/math]) and can be computed in time [math]\displaystyle{ f(k)\cdot p(|x|) }[/math] where [math]\displaystyle{ p }[/math] is a polynomial.

Obviously, FPT contains all polynomial-time computable problems. Moreover, it contains all optimisation problems in NP that allow an efficient polynomial-time approximation scheme (EPTAS).

W hierarchy

The W hierarchy is a collection of computational complexity classes. A parameterized problem is in the class W[i], if every instance [math]\displaystyle{ (x, k) }[/math] can be transformed (in fpt-time) to a combinatorial circuit that has weft at most i, such that [math]\displaystyle{ (x, k)\in L }[/math] if and only if there is a satisfying assignment to the inputs that assigns 1 to exactly k inputs. The weft is the largest number of logical units with fan-in greater than two on any path from an input to the output. The total number of logical units on the paths (known as depth) must be limited by a constant that holds for all instances of the problem.

Note that [math]\displaystyle{ \mathsf{FPT} = W[0] }[/math] and [math]\displaystyle{ W[i] \subseteq W[j] }[/math] for all [math]\displaystyle{ i\le j }[/math]. The classes in the W hierarchy are also closed under fpt-reduction.

A complete problem for W[i] is Weighted i-Normalized Satisfiability:[3] given a Boolean formula written as an AND of ORs of ANDs of ... of possibly negated variables, with [math]\displaystyle{ i+1 }[/math] layers of ANDs or ORs (and i alternations between AND and OR), can it be satisfied by setting exactly k variables to 1?

Many natural computational problems occupy the lower levels, W[1] and W[2].

W[1]

Examples of W[1]-complete problems include

  • deciding if a given graph contains a clique of size k
  • deciding if a given graph contains an independent set of size k
  • deciding if a given nondeterministic single-tape Turing machine accepts within k steps ("short Turing machine acceptance" problem). This also applies to nondeterministic Turing machines with f(k) tapes and even f(k) of f(k)-dimensional tapes, but even with this extension, the restriction to f(k) tape alphabet size is fixed-parameter tractable. Crucially, the branching of the Turing machine at each step is allowed to depend on n, the size of the input. In this way, the Turing machine may explore nO(k) computation paths.

W[2]

Examples of W[2]-complete problems include

  • deciding if a given graph contains a dominating set of size k
  • deciding if a given nondeterministic multi-tape Turing machine accepts within k steps ("short multi-tape Turing machine acceptance" problem). Crucially, the branching is allowed to depend on n (like the W[1] variant), as is the number of tapes. An alternate W[2]-complete formulation allows only single-tape Turing machines, but the alphabet size may depend on n.

W[t]

[math]\displaystyle{ W[t] }[/math] can be defined using the family of Weighted Weft-t-Depth-d SAT problems for [math]\displaystyle{ d\geq t }[/math]: [math]\displaystyle{ W[t,d] }[/math] is the class of parameterized problems that fpt-reduce to this problem, and [math]\displaystyle{ W[t] = \bigcup_{d\geq t} W[t,d] }[/math].

Here, Weighted Weft-t-Depth-d SAT is the following problem:

  • Input: A Boolean formula of depth at most d and weft at most t, and a number k. The depth is the maximal number of gates on any path from the root to a leaf, and the weft is the maximal number of gates of fan-in at least three on any path from the root to a leaf.
  • Question: Does the formula have a satisfying assignment of Hamming weight exactly k?

It can be shown that for [math]\displaystyle{ t\geq2 }[/math] the problem Weighted t-Normalize SAT is complete for [math]\displaystyle{ W[t] }[/math] under fpt-reductions.[4] Here, Weighted t-Normalize SAT is the following problem:

  • Input: A Boolean formula of depth at most t with an AND-gate on top, and a number k.
  • Question: Does the formula have a satisfying assignment of Hamming weight exactly k?

W[P]

W[P] is the class of problems that can be decided by a nondeterministic [math]\displaystyle{ h(k) \cdot {|x|}^{O(1)} }[/math]-time Turing machine that makes at most [math]\displaystyle{ O(f(k)\cdot \log n) }[/math] nondeterministic choices in the computation on [math]\displaystyle{ (x,k) }[/math] (a k-restricted Turing machine). (Flum Grohe)

It is known that FPT is contained in W[P], and the inclusion is believed to be strict. However, resolving this issue would imply a solution to the P versus NP problem.

Other connections to unparameterised computational complexity are that FPT equals W[P] if and only if circuit satisfiability can be decided in time [math]\displaystyle{ \exp(o(n))m^{O(1)} }[/math], or if and only if there is a computable, nondecreasing, unbounded function f such that all languages recognised by a nondeterministic polynomial-time Turing machine using [math]\displaystyle{ f(n)\log n }[/math] nondeterministic choices are in P.

W[P] can be loosely thought of as the class of problems where we have a set S of n items, and we want to find a subset [math]\displaystyle{ T \subset S }[/math] of size k such that a certain property holds. We can encode a choice as a list of k integers, stored in binary. Since the highest any of these numbers can be is n, [math]\displaystyle{ \lceil\log_2 n\rceil }[/math] bits are needed for each number. Therefore [math]\displaystyle{ k \cdot \lceil\log_2 n\rceil }[/math] total bits are needed to encode a choice. Therefore we can select a subset [math]\displaystyle{ T\subset S }[/math] with [math]\displaystyle{ O(k\cdot \log n) }[/math] nondeterministic choices.

XP

XP is the class of parameterized problems that can be solved in time [math]\displaystyle{ n^{f(k)} }[/math] for some computable function f. These problems are called slicewise polynomial, in the sense that each "slice" of fixed k has a polynomial algorithm, although possibly with a different exponent for each k. Compare this with FPT, which merely allows a different constant prefactor for each value of k. XP contains FPT, and it is known that this containment is strict by diagonalization.


para-NP

para-NP is the class of parameterized problems that can be solved by a nondeterministic algorithm in time [math]\displaystyle{ f(k) \cdot |x|^{O(1)} }[/math] for some computable function f. It is known that [math]\displaystyle{ \textsf{FPT}=\textsf{para-NP} }[/math] if and only if [math]\displaystyle{ \textsf{P}=\textsf{NP} }[/math].[5]

A problem is para-NP-hard if it is [math]\displaystyle{ \textsf{NP} }[/math]-hard already for a constant value of the parameter. That is, there is a "slice" of fixed k that is [math]\displaystyle{ \textsf{NP} }[/math]-hard. A parameterized problem that is [math]\displaystyle{ \textsf{para-NP} }[/math]-hard cannot belong to the class [math]\displaystyle{ \textsf{XP} }[/math], unless [math]\displaystyle{ \textsf{P}=\textsf{NP} }[/math]. A classic example of a [math]\displaystyle{ \textsf{para-NP} }[/math]-hard parameterized problem is graph coloring, parameterized by the number k of colors, which is already [math]\displaystyle{ \textsf{NP} }[/math]-hard for [math]\displaystyle{ k=3 }[/math] (see Graph coloring).

A hierarchy

The A hierarchy is a collection of computational complexity classes similar to the W hierarchy. However, while the W hierarchy is a hierarchy contained in NP, the A hierarchy more closely mimics the polynomial-time hierarchy from classical complexity. It is known that A[1] = W[1] holds.

See also

Notes

  1. Chen, Kanj & Xia 2006
  2. (Grohe 1999)
  3. Downey, Rod G.; Fellows, Michael R. (August 1995). "Fixed-Parameter Tractability and Completeness I: Basic Results" (in en). SIAM Journal on Computing 24 (4): 873–921. doi:10.1137/S0097539792228228. ISSN 0097-5397. https://doi.org/10.1137/S0097539792228228. 
  4. Buss, Jonathan F; Islam, Tarique (2006). "Simplifying the weft hierarchy". Theoretical Computer Science 351 (3): 303–313. doi:10.1016/j.tcs.2005.10.002. 
  5. Flum & Grohe (2006), p. 39.

References

External links