Isolation lemma

From HandWiki
Short description: Technique for reducing number of solutions

In theoretical computer science, the term isolation lemma (or isolating lemma) refers to randomized algorithms that reduce the number of solutions to a problem to one, should a solution exist. This is achieved by constructing random constraints such that, with non-negligible probability, exactly one solution satisfies these additional constraints if the solution space is not empty. Isolation lemmas have important applications in computer science, such as the Valiant–Vazirani theorem and Toda's theorem in computational complexity theory.

The first isolation lemma was introduced by (Valiant Vazirani), albeit not under that name. Their isolation lemma chooses a random number of random hyperplanes, and has the property that, with non-negligible probability, the intersection of any fixed non-empty solution space with the chosen hyperplanes contains exactly one element. This suffices to show the Valiant–Vazirani theorem: there exists a randomized polynomial-time reduction from the satisfiability problem for Boolean formulas to the problem of detecting whether a Boolean formula has a unique solution. (Mulmuley Vazirani) introduced an isolation lemma of a slightly different kind: Here every coordinate of the solution space gets assigned a random weight in a certain range of integers, and the property is that, with non-negligible probability, there is exactly one element in the solution space that has minimum weight. This can be used to obtain a randomized parallel algorithm for the maximum matching problem.

Stronger isolation lemmas have been introduced in the literature to fit different needs in various settings. For example, the isolation lemma of (Chari Rohatgi) has similar guarantees as that of Mulmuley et al., but it uses fewer random bits. In the context of the exponential time hypothesis, (Calabro Impagliazzo) prove an isolation lemma for k-CNF formulas. Noam Ta-Shma[1] gives an isolation lemma with slightly stronger parameters, and gives non-trivial results even when the size of the weight domain is smaller than the number of variables.

The isolation lemma of Mulmuley, Vazirani, and Vazirani

Any linear program with a randomly chosen linear cost function has a unique optimum with high probability. The isolation lemma of Mulmuley, Vazirani, and Vazirani extends this fact to arbitrary sets and a random cost function that is sampled using few random bits.
Lemma. Let [math]\displaystyle{ n }[/math] and [math]\displaystyle{ N }[/math] be positive integers, and let [math]\displaystyle{ \mathcal F }[/math] be an arbitrary nonempty family of subsets of the universe [math]\displaystyle{ \{1,\dots,n\} }[/math]. Suppose each element [math]\displaystyle{ x\in\{1,\dots,n\} }[/math] in the universe receives an integer weight [math]\displaystyle{ w(x) }[/math], each of which is chosen independently and uniformly at random from [math]\displaystyle{ \{1,\dots,N\} }[/math]. The weight of a set S in [math]\displaystyle{ \mathcal F }[/math] is defined as
[math]\displaystyle{ w(S) = \sum_{x \in S} w(x)\,. }[/math]
Then, with probability at least [math]\displaystyle{ 1-n/N }[/math], there is a unique set in [math]\displaystyle{ \mathcal F }[/math] that has the minimum weight among all sets of [math]\displaystyle{ \mathcal F }[/math].

It is remarkable that the lemma assumes nothing about the nature of the family [math]\displaystyle{ \mathcal F }[/math]: for instance [math]\displaystyle{ \mathcal F }[/math] may include all [math]\displaystyle{ 2^n-1 }[/math] nonempty subsets. Since the weight of each set in [math]\displaystyle{ \mathcal F }[/math] is between [math]\displaystyle{ 1 }[/math] and [math]\displaystyle{ nN }[/math] on average there will be [math]\displaystyle{ (2^n-1) / (nN) }[/math] sets of each possible weight. Still, with high probability, there is a unique set that has minimum weight.

Mulmuley, Vazirani, and Vazirani's proof

Suppose we have fixed the weights of all elements except an element x. Then x has a threshold weight α, such that if the weight w(x) of x is greater than α, then it is not contained in any minimum-weight subset, and if [math]\displaystyle{ w(x) \le \alpha }[/math], then it is contained in some sets of minimum weight. Further, observe that if [math]\displaystyle{ w(x) \lt \alpha }[/math], then every minimum-weight subset must contain x (since, when we decrease w(x) from α, sets that do not contain x do not decrease in weight, while those that contain x do). Thus, ambiguity about whether a minimum-weight subset contains x or not can happen only when the weight of x is exactly equal to its threshold; in this case we will call x "singular". Now, as the threshold of x was defined only in terms of the weights of the other elements, it is independent of w(x), and therefore, as w(x) is chosen uniformly from {1, …, N},

[math]\displaystyle{ \Pr[x\text{ is singular}] = \Pr[w(x) = \alpha] \le 1/N }[/math]

and the probability that some x is singular is at most n/N. As there is a unique minimum-weight subset iff no element is singular, the lemma follows.

Remark: The lemma holds with [math]\displaystyle{ \le }[/math] (rather than =) since it is possible that some x has no threshold value (i.e., x will not be in any minimum-weight subset even if w(x) gets the minimum possible value, 1).

Joel Spencer's proof

This is a restatement version of the above proof, due to Joel Spencer (1995).[2]

For any element x in the set, define

[math]\displaystyle{ \alpha(x) = \min_{S \in \mathcal F, x \not\in S}w(S) - \min_{S\in\mathcal F, x\in S}w(S\setminus\{x\}). }[/math]

Observe that [math]\displaystyle{ \alpha(x) }[/math] depends only on the weights of elements other than x, and not on w(x) itself. So whatever the value of [math]\displaystyle{ \alpha(x) }[/math], as w(x) is chosen uniformly from {1, …, N}, the probability that it is equal to [math]\displaystyle{ \alpha(x) }[/math] is at most 1/N. Thus the probability that [math]\displaystyle{ w(x) = \alpha(x) }[/math] for some x is at most n/N.

Now if there are two sets A and B in [math]\displaystyle{ \mathcal F }[/math] with minimum weight, then, taking any x in A\B, we have

[math]\displaystyle{ \begin{align} \alpha(x) &= \min_{S \in \mathcal F, x \not\in S}w(S) - \min_{S\in\mathcal F, x\in S}w(S\setminus\{x\}) \\ &= w(B) - (w(A)-w(x)) \\ &= w(x), \end{align} }[/math]

and as we have seen, this event happens with probability at most n/N.

Examples/applications

  • The original application was to minimum-weight (or maximum-weight) perfect matchings in a graph. Each edge is assigned a random weight in {1, …, 2m}, and [math]\displaystyle{ \mathcal F }[/math] is the set of perfect matchings, so that with probability at least 1/2, there exists a unique perfect matching. When each indeterminate [math]\displaystyle{ x_{ij} }[/math] in the Tutte matrix of the graph is replaced with [math]\displaystyle{ 2^{w_{ij}} }[/math] where [math]\displaystyle{ w_{ij} }[/math] is the random weight of the edge, we can show that the determinant of the matrix is nonzero, and further use this to find the matching.
  • More generally, the paper also observed that any search problem of the form "Given a set system [math]\displaystyle{ (S,\mathcal F) }[/math], find a set in [math]\displaystyle{ \mathcal F }[/math]" could be reduced to a decision problem of the form "Is there a set in [math]\displaystyle{ \mathcal F }[/math] with total weight at most k?". For instance, it showed how to solve the following problem posed by Papadimitriou and Yannakakis, for which (as of the time the paper was written) no deterministic polynomial-time algorithm is known: given a graph and a subset of the edges marked as "red", find a perfect matching with exactly k red edges.
  • The Valiant–Vazirani theorem, concerning unique solutions to NP-complete problems, has a simpler proof using the isolation lemma. This is proved by giving a randomized reduction from CLIQUE to UNIQUE-CLIQUE.[3]
  • (Ben-David Chor) use the proof of Valiant-Vazirani in their search-to-decision reduction for average-case complexity.
  • Avi Wigderson used the isolation lemma in 1994 to give a randomized reduction from NL to UL, and thereby prove that NL/poly ⊆ ⊕L/poly.[4] Reinhardt and Allender later used the isolation lemma again to prove that NL/poly = UL/poly.[5]
  • The book by Hemaspaandra and Ogihara has a chapter on the isolation technique, including generalizations.[6]
  • The isolation lemma has been proposed as the basis of a scheme for digital watermarking.[7]
  • There is ongoing work on derandomizing the isolation lemma in specific cases[8] and on using it for identity testing.[9]

Notes

  1. Noam Ta-Shma (2015); A simple proof of the Isolation Lemma, in eccc
  2. (Jukna 2001)
  3. (Mulmuley Vazirani)
  4. (Wigderson 1994)
  5. (Reinhardt Allender)
  6. (Hemaspaandra Ogihara)
  7. (Majumdar Wong)
  8. (Arvind Mukhopadhyay)
  9. (Arvind Mukhopadhyay)

References

External links