# Philosophy:RecycleUnits

In mathematical logic, proof compression by RecycleUnits[1] is a method for compressing propositional logic resolution proofs. Its main idea is to make use of intermediate (e.g. non input) proof results being unit clauses, i.e. clauses containing only one literal. Certain proof nodes can be replaced with the nodes representing these unit clauses. After this operation the obtained graph is transformed into a valid proof. The output proof is shorter than the original while being equivalent or stronger.

## Algorithms

The algorithms treat resolution proofs as directed acyclic graphs, where each node is labeled by a clause and each node has either one or two predecessors called parents. If a node has two parents it is also labeled with a propositional variable called the pivot, which was used to compute the nodes clause using resolution.
The following algorithm describes the replacement of nodes.
It is assumed that in the resolution proof for all non leaf nodes with two parent nodes, the left parent node contains the positive and the right parent node the negative pivot variable. The algorithm first iterates over all non leaf unit clauses and then over all non ancestor nodes of the proof. If the node's pivot element is the variable of the present unit clause's literal, one of the parent nodes can be replaced by the node corresponding to the unit clause. Because of the above assumption, if the literal is equal to the pivot, the left parent contains the literal and can be replaced by the unit clause node. If the literal is equal to the negation of the pivot the right parent is replaced.

1  function RecycleUnits(Proof $\displaystyle{ P }$):
2      Let $\displaystyle{ U }$ be the set of non leaf nodes representing unit clauses
3      for each $\displaystyle{ u \in U }$ do
4          Mark the ancestors of u
5          for each unmarked $\displaystyle{ n \in P }$ do
6              let $\displaystyle{ p }$ be the pivot variable of $\displaystyle{ n }$
7              let $\displaystyle{ l }$ be the literal contained in the clause of $\displaystyle{ u }$
8              if $\displaystyle{ p == l }$ then
9                  replace the left parent of $\displaystyle{ n }$ with $\displaystyle{ u }$
10             else if $\displaystyle{ \neg p == l }$ then
11                 replace the right parent of $\displaystyle{ n }$ with $\displaystyle{ u }$


In general after execution of this function the proof won't be a legal proof anymore. The following algorithm takes the root node of a proof and constructs a legal proof out of it. The computation begins with recursively calls to the children nodes. In order to minimize the algorithm calls, it is beingt kept track of which nodes were already visited. Note that a resolution proof can be seen as a general directed acyclic graph as opposed to a tree. After the recursive call the clause of the present node is updated. While doing so four different cases can occur. The present pivot variable can occur in both, the left, the right or in none of the parent nodes. If it occurs in both parent nodes the clause is calculated as resolvent of the parent clauses. If it is not present in one of the parent nodes the clause of this parent can be copied. If it misses in both parents one has to choose heuristically.

1  function ReconstructProof(Node $\displaystyle{ n }$):
3      if $\displaystyle{ n }$ is visited return
4      mark $\displaystyle{ n }$ as visited
5      if $\displaystyle{ n }$ has no parents return
6      else if $\displaystyle{ n }$ has only one parent $\displaystyle{ x }$ then
7          ReconstructProof($\displaystyle{ x }$)
8          $\displaystyle{ n }$.Clause = $\displaystyle{ x }$.Clause
9      else
10         let $\displaystyle{ l }$ be the left and $\displaystyle{ r }$ the right parent node
11         let $\displaystyle{ p }$ be the pivot variable used to compute $\displaystyle{ n }$
12         ReconstructProof($\displaystyle{ l }$)
13         ReconstructProof($\displaystyle{ r }$)
14         if $\displaystyle{ p \in l.Clause }$ and $\displaystyle{ p \in r.Clause }$
15             $\displaystyle{ n }$.Clause = Resolve($\displaystyle{ l }$,$\displaystyle{ r }$,$\displaystyle{ p }$)
16         else if $\displaystyle{ p \in l.Clause }$ and $\displaystyle{ p \notin r.Clause }$
17             $\displaystyle{ n }$.Clause = $\displaystyle{ r }$.Clause
18             delete reference to $\displaystyle{ l }$
19         else if $\displaystyle{ p \in r.Clause }$ and $\displaystyle{ p \notin l.Clause }$
20             $\displaystyle{ n }$.Clause = $\displaystyle{ l }$.Clause
21             delete reference to $\displaystyle{ r }$
22         else
23             let $\displaystyle{ x \in \{l,r\} }$ and $\displaystyle{ y \in \{l,r\} \setminus \{x\} }$ //choose x heuristically
24             $\displaystyle{ n }$.Clause = $\displaystyle{ x }$.Clause
25             delete reference to $\displaystyle{ y }$


## Example

Consider the following resolution proof.
One intermediate result is $\displaystyle{ C_8 }$ which is representing the unit clause (-1).

$\displaystyle{ (1)\cfrac{ (2)\cfrac{ (1)\cfrac{C_1 (1,3)\qquad C_2 (-1,2,5)}{C_3 (2,3,5)} \qquad C_4 (1,-2) } {C_7 (1,3,5)} \qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{\color{red}C_8 (-1)} } { C_9 (3,5) } }$

There is one non-ancestor node using the variable 1 as a pivot element: $\displaystyle{ C_3 }$.

$\displaystyle{ (1)\cfrac{ (2)\cfrac{ {\color{red}(1)}\cfrac{C_1 (1,3)\qquad C_2 (-1,2,5)}{C_3 (2,3,5)} \qquad C_4 (1,-2) } {C_7 (1,3,5)} \qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{C_8 (-1)} } { C_9 (3,5) } }$

The literal -1 is contained in the right parent of this node and therefore this parent is replaced by $\displaystyle{ C_8 }$. The string $\displaystyle{ {C_8}^* }$ denotes a reference to the clause $\displaystyle{ C_8 }$ (the structure is now a directed acyclic graph rather than a tree).

$\displaystyle{ (1)\cfrac{ (2)\cfrac{ (1)\cfrac{C_1 (1,3)\qquad {\color{red} {C_8}^*}}{C_3 (2,3,5)} \qquad C_4 (1,-2) } {C_7 (1,3,5)} \qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{C_8 (-1)} } { C_9 (3,5) } }$

This structure is not a legal proof anymore, because $\displaystyle{ C_3 }$ is not the resolvent of $\displaystyle{ C_1 }$ and $\displaystyle{ C_8 }$. Therefore it has to be transformed into one again.
The first step is to update $\displaystyle{ C_3 }$. As the pivot variable 1 appears in both parent nodes, $\displaystyle{ C_3 }$ is computed as the resolvent of them.

$\displaystyle{ (1)\cfrac{ (2)\cfrac{ (1)\cfrac{C_1 (1,3)\qquad {C_8}^*}{C_3 {\color{red}(3)}} \qquad C_4 (1,-2) } {C_7 (1,3,5)} \qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{C_8 (-1)} } { C_9 (3,5) } }$

The left parent node of $\displaystyle{ C_7 }$ does not contain the pivot variable and therefore the clause of this parent is copied into the clause of $\displaystyle{ C_7 }$. The link between $\displaystyle{ C_7 }$ and $\displaystyle{ C_4 }$ is removed and since there are no other links to $\displaystyle{ C_4 }$ this node can be deleted.

$\displaystyle{ (1)\cfrac{ \cfrac{ (1)\cfrac{C_1 (1,3)\qquad {C_8}^*}{C_3 (3)} } {C_7 {\color{red}(3)}} \qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{C_8 (-1)} } { C_9 (3,5) } }$

Again the left parent of $\displaystyle{ C_9 }$ does not contain the pivot variable and the same operation is performed as before.

$\displaystyle{ \cfrac{ \cfrac{ (1)\cfrac{C_1 (1,3)\qquad (4)\cfrac{C_5 (-1,4) \qquad C_6 (-1,-4)}{C_8 (-1)}}{C_3 (3)} } {C_7 (3)} } { C_9 {\color{red}(3)} } }$

Note: the reference $\displaystyle{ {C_8}^* }$ was replaced by the actual proof node $\displaystyle{ C_8 }$.
The result of this proof is the unit clause (3) which is a stronger result than the clause (3,5) of the original proof.

## Notes

1. Bar-Ilan, O.; Fuhrmann, O.; Hoory, S.; Shacham, O.; Strichman, O. Linear-time Reductions of Resolution Proofs. Hardware and Software: Verification and Testing, p. 114–128, Springer, 2011.