Packing in a hypergraph

From HandWiki

In mathematics, a packing in a hypergraph is a partition of the set of the hypergraph's edges into a number of disjoint subsets such that no pair of edges in each subset share any vertex. There are two famous algorithms to achieve asymptotically optimal packing in k-uniform hypergraphs. One of them is a random greedy algorithm which was proposed by Joel Spencer. He used a branching process to formally prove the optimal achievable bound under some side conditions. The other algorithm is called the Rödl nibble and was proposed by Vojtěch Rödl et al. They showed that the achievable packing by the Rödl nibble is in some sense close to that of the random greedy algorithm.

History

The problem of finding the number of such subsets in a k-uniform hypergraph was originally motivated through a conjecture by Paul Erdős and Haim Hanani in 1963. Vojtěch Rödl proved their conjecture asymptotically under certain conditions in 1985. Pippenger and Joel Spencer generalized Rödl's results using a random greedy algorithm in 1989.

Definition and terminology

In the following definitions, the hypergraph is denoted by H=(V,E). H is called a k-uniform hypergraph if every edge in E consists of exactly k vertices.

[math]\displaystyle{ P }[/math] is a hypergraph packing if it is a subset of edges in H such that there is no pair of distinct edges with a common vertex.

[math]\displaystyle{ H }[/math] is a ([math]\displaystyle{ D_0 }[/math],[math]\displaystyle{ \epsilon }[/math])-good hypergraph if there exists a [math]\displaystyle{ D_0 }[/math] such that for all [math]\displaystyle{ x,y \in V }[/math] and [math]\displaystyle{ D\geq D_0 }[/math] and both of the following conditions hold.

[math]\displaystyle{ D(1-\epsilon)\leq \text{deg}(x)\leq D(1+\epsilon) }[/math]
[math]\displaystyle{ \text{codeg}(x,y)\leq \epsilon D }[/math]

where the degree [math]\displaystyle{ \text{deg} (x) }[/math] of a vertex [math]\displaystyle{ x }[/math] is the number of edges that contain [math]\displaystyle{ x }[/math] and the codegree [math]\displaystyle{ \text{codeg} (x,y) }[/math] of two distinct vertices [math]\displaystyle{ x }[/math] and [math]\displaystyle{ y }[/math] is the number of edges that contain both vertices.

Theorem

There exists an asymptotic packing P of size at least [math]\displaystyle{ \frac{n}{K+1}(1-o(1)) }[/math] for a [math]\displaystyle{ (K+1) }[/math]-uniform hypergraph under the following two conditions,

  1. All vertices have the degree of [math]\displaystyle{ D(1+o(1)) }[/math] in which [math]\displaystyle{ D }[/math] tends to infinity.
  2. For every pair of vertices shares only [math]\displaystyle{ o(D) }[/math] common edges.

where [math]\displaystyle{ n }[/math] is the total number of vertices. This result was shown by Pippenger and was later proved by Joel Spencer. To address the asymptotic hypergraph packing problem, Joel Spencer proposed a random greedy algorithm. In this algorithm, a branching process is used as the basis and it was shown that it almost always achieves an asymptotically optimal packing under the above side conditions.

Asymptotic packing algorithms

There are two famous algorithms for asymptotic packing of k-uniform hypergraphs: the random greedy algorithm via branching process, and the Rödl nibble.

Random greedy algorithm via branching process

Every edge [math]\displaystyle{ E \in H }[/math] is independently and uniformly assigned a distinct real "birthtime" [math]\displaystyle{ t_E \in [0,D ] }[/math]. The edges are taken one by one in the order of their birthtimes. The edge [math]\displaystyle{ E }[/math] is accepted and included in [math]\displaystyle{ P }[/math] if it does not overlap any previously accepted edges. Obviously, the subset [math]\displaystyle{ P }[/math] is a packing and it can be shown that its size is [math]\displaystyle{ |P|=\frac{n}{K+1} }[/math] almost surely. To show that, let stop the process of adding new edges at time [math]\displaystyle{ c }[/math]. For an arbitrary [math]\displaystyle{ \gamma \gt 0 }[/math], pick [math]\displaystyle{ c, D_0, \epsilon }[/math] such that for any [math]\displaystyle{ (D_0,\epsilon) }[/math]-good hypergraph [math]\displaystyle{ f_{x,H}(c)\lt \gamma^2 }[/math] where [math]\displaystyle{ f_{x,H}(c) }[/math] denotes the probability of vertex [math]\displaystyle{ x }[/math] survival (a vertex survives if it is not in any edges in [math]\displaystyle{ P }[/math]) until time [math]\displaystyle{ c }[/math]. Obviously, in such a situation the expected number of [math]\displaystyle{ x }[/math] surviving at time [math]\displaystyle{ c }[/math] is less than [math]\displaystyle{ \gamma^2 n }[/math]. As a result, the probability of [math]\displaystyle{ x }[/math] surviving being less than [math]\displaystyle{ \gamma n }[/math] is higher than [math]\displaystyle{ 1-\gamma }[/math]. In other words, [math]\displaystyle{ P_c }[/math] must include at least [math]\displaystyle{ (1-\gamma)n }[/math] vertices which means that [math]\displaystyle{ |P|\geq (1-\gamma)\frac{n}{K+1} }[/math].

To complete the proof, it must be shown that [math]\displaystyle{ \lim_{c\rightarrow \infty} \lim_{x,H} f_{x,H}(c)=0 }[/math]. For that, the asymptotic behavior of [math]\displaystyle{ x }[/math] surviving is modeled by a continuous branching process. Fix [math]\displaystyle{ c\gt 0 }[/math] and begin with Eve with the birthdate of [math]\displaystyle{ c }[/math]. Assume time goes backward so Eve gives birth in the interval of [math]\displaystyle{ [0,c) }[/math] with a unit density Poisson distribution. The probability of Eve having [math]\displaystyle{ k }[/math] birth is [math]\displaystyle{ \frac{e^{-c}c^k}{k!} }[/math]. By conditioning on [math]\displaystyle{ k }[/math] the birthtimes [math]\displaystyle{ x_1,...,x_k }[/math] are independently and uniformly distributed on [math]\displaystyle{ [0,c) }[/math]. Every birth given by Eve consists of [math]\displaystyle{ Q }[/math] offspring all with the same birth time say [math]\displaystyle{ a }[/math]. The process is iterated for each offspring. It can be shown that for all [math]\displaystyle{ \epsilon \gt 0 }[/math] there exists a [math]\displaystyle{ K }[/math] so that with a probability higher than [math]\displaystyle{ (1-\epsilon) }[/math], Eve has at most [math]\displaystyle{ K }[/math] descendants.

A rooted tree with the notions of parent, child, root, birthorder and wombmate shall be called a broodtree. Given a finite broodtree [math]\displaystyle{ T }[/math] we say for each vertex that it survives or dies. A childless vertex survives. A vertex dies if and only if it has at least one brood all of whom survive. Let [math]\displaystyle{ f(c) }[/math] denote the probability that Eve survives in the broodtree [math]\displaystyle{ T }[/math] given by the above process. The objective is to show [math]\displaystyle{ \lim_{c\rightarrow \infty} f(c)=0 }[/math] and then for any fixed [math]\displaystyle{ c }[/math], it can be shown that [math]\displaystyle{ \lim^* f_{x,H}(c)=f(c) }[/math]. These two relations complete our argument.

To show [math]\displaystyle{ f(c)=0 }[/math], let [math]\displaystyle{ c\geq 0, \Delta c\gt 0 }[/math]. For [math]\displaystyle{ \Delta c }[/math] small, [math]\displaystyle{ f(c+\Delta c)-f(c) \approx -(\Delta c)f(c)^{Q+1} }[/math] as, roughly, an Eve starting at time [math]\displaystyle{ c+\Delta c }[/math] might have a birth in time interval [math]\displaystyle{ [c, c+\Delta c) }[/math] all of whose children survive while Eve has no births in [math]\displaystyle{ [0, c) }[/math] all of whose children survive. Letting [math]\displaystyle{ \Delta c\rightarrow 0 }[/math] yields the differential equation [math]\displaystyle{ f'(c)=-f(c)^{Q+1} }[/math]. The initial value [math]\displaystyle{ f(0)=1 }[/math] gives a unique solution [math]\displaystyle{ f(c)=(1+Qc)^{-1/Q} }[/math]. Note that indeed [math]\displaystyle{ \lim_{c\rightarrow \infty} f(c)=0 }[/math].

To prove [math]\displaystyle{ \lim^* f_{x,H}(c)=f(c) }[/math], consider a procedure we call History which either aborts or produces a broodtree. History contains a set [math]\displaystyle{ T }[/math] of vertices, initially [math]\displaystyle{ T=\{x\} }[/math]. [math]\displaystyle{ T }[/math] will have a broodtree structure with [math]\displaystyle{ x }[/math] the root. The [math]\displaystyle{ y\in T }[/math] are either processed or unprocessed, [math]\displaystyle{ x }[/math] is initially unprocessed. To each [math]\displaystyle{ y\in T }[/math] is assigned a birthtime [math]\displaystyle{ t_y }[/math], we initialize [math]\displaystyle{ t_x=c }[/math]. History is to take an unprocessed [math]\displaystyle{ y\in T }[/math] and process it as follows. For the value of all [math]\displaystyle{ t_E }[/math] with [math]\displaystyle{ y\in E }[/math] but with no [math]\displaystyle{ x\in E }[/math] that has already been processed, if either some [math]\displaystyle{ E }[/math] has [math]\displaystyle{ t_E\lt t_y }[/math] and [math]\displaystyle{ y,z\in E }[/math] with [math]\displaystyle{ z\in T }[/math] or some [math]\displaystyle{ E, E' }[/math] have [math]\displaystyle{ t_E,t_{E'}\lt t_y }[/math] with [math]\displaystyle{ y\in E,E' }[/math] and [math]\displaystyle{ |E\cup E'|\gt 1 }[/math], then History is aborted. Otherwise for each [math]\displaystyle{ E }[/math] with [math]\displaystyle{ t_E\lt t_y }[/math] add all [math]\displaystyle{ z\in E-\{y\} }[/math] to [math]\displaystyle{ T }[/math] as wombmates with parent [math]\displaystyle{ y }[/math] and common birthdate [math]\displaystyle{ t_E }[/math]. Now [math]\displaystyle{ y }[/math] is considered processed. History halts, if not aborted, when all [math]\displaystyle{ y\in T }[/math] are processed. If History does not abort then root [math]\displaystyle{ x }[/math] survives broodtree [math]\displaystyle{ T }[/math] if and only if [math]\displaystyle{ x }[/math] survives at time [math]\displaystyle{ c }[/math]. For a fixed broodtree, let [math]\displaystyle{ f(T,c) }[/math] denote the probability that the branching process yields broodtree [math]\displaystyle{ T }[/math]. Then the probability that History does not abort is [math]\displaystyle{ f(T,c) }[/math]. By the finiteness of the branching process, [math]\displaystyle{ \sum f(T,c)=1 }[/math], the summation over all broodtrees [math]\displaystyle{ T }[/math] and History does not abort. The [math]\displaystyle{ lim^* }[/math] distribution of its broodtree approaches the branching process distribution. Thus [math]\displaystyle{ \lim^* f_{x,H}(c)=f(c) }[/math].

The Rödl nibble

In 1985, Rödl proved Paul Erdős’s conjecture by a method called the Rödl nibble. Rödl's result can be formulated in form of either packing or covering problem. For [math]\displaystyle{ 2\leq l\lt k\lt n }[/math] the covering number denoted by [math]\displaystyle{ M(n,k,l) }[/math] shows the minimal size of a family [math]\displaystyle{ \kappa }[/math] of [math]\displaystyle{ k }[/math]-element subsets of [math]\displaystyle{ \{1,...,n\} }[/math] which have the property that every [math]\displaystyle{ l }[/math]-element set is contained in at least one [math]\displaystyle{ A \in \kappa }[/math]. Paul Erdős et al. conjecture was

[math]\displaystyle{ \lim_{n\rightarrow \infty} \frac{M(n,k,l)}{{n \choose l}/{k \choose l}}=1 }[/math].

where [math]\displaystyle{ 2\leq l\lt k }[/math]. This conjecture roughly means that a tactical configuration is asymptotically achievable. One may similarly define the packing number [math]\displaystyle{ m(n,k,l) }[/math] as the maximal size of a family [math]\displaystyle{ \kappa }[/math] of [math]\displaystyle{ k }[/math]-element subsets of [math]\displaystyle{ \{1,...,n\} }[/math] having the property that every [math]\displaystyle{ l }[/math]-element set is contained in at most one [math]\displaystyle{ A \in \kappa }[/math].

Packing under the stronger condition

In 1997, Noga Alon, Jeong Han Kim, and Joel Spencer also supply a good bound for [math]\displaystyle{ \gamma }[/math] under the stronger codegree condition that every distinct pair [math]\displaystyle{ v, v'\in V }[/math] has at most one edge in common.

For a k-uniform, D-regular hypergraph on n vertices, if k > 3, there exists a packing P covering all vertices but at most [math]\displaystyle{ O(nD^{-1/(k-1)}) }[/math]. If k = 3 there exists a packing P covering all vertices but at most [math]\displaystyle{ O(nD^{-1/2}\ln^{3/2}D) }[/math].

This bound is desirable in various applications, such as Steiner triple system. A Steiner Triple System is a 3-uniform, simple hypergraph in which every pair of vertices is contained in precisely one edge. Since a Steiner Triple System is clearly d=(n-1)/2-regular, the above bound supplies the following asymptotic improvement.

Any Steiner Triple System on n vertices contains a packing covering all vertices but at most [math]\displaystyle{ O(n^{1/2}\ln^{3/2}n) }[/math].

This has subsequently improved to [math]\displaystyle{ n/3-O(\frac{\log n}{\log \log n}) }[/math][1] and [math]\displaystyle{ \frac{n-4}{3} }[/math][2]

See also

References

  1. Keevash, Peter; Pokrovskiy, Alexey; Sudakov, Benny; Yepremyan, Liana (2022-04-15). "New bounds for Ryser's conjecture and related problems" (in en). Transactions of the American Mathematical Society, Series B 9 (8): 288–321. doi:10.1090/btran/92. ISSN 2330-0000. 
  2. Montgomery, Richard (2023). "A proof of the Ryser-Brualdi-Stein conjecture for large even n". arXiv:2310.19779 [math.CO].