Graph removal lemma

From HandWiki

In graph theory, the graph removal lemma states that when a graph contains few copies of a given subgraph, then all of the copies can be eliminated by removing a small number of edges.[1] The special case in which the subgraph is a triangle is known as the triangle removal lemma.[2]

The graph removal lemma can be used to prove Roth's theorem on 3-term arithmetic progressions,[3] and a generalization of it, the hypergraph removal lemma, can be used to prove Szemerédi's theorem.[4] It also has applications to property testing.[5]

Formulation

Let H be a graph with h vertices. The graph removal lemma states that for any ϵ>0, there exists a constant δ=δ(ϵ,H)>0 such that for any n-vertex graph G with fewer than δnh subgraphs isomorphic to H, it is possible to eliminate all copies of H by removing at most ϵn2 edges from G.[1]

An alternative way to state this is to say that for any n-vertex graph G with o(nh) subgraphs isomorphic to H, it is possible to eliminate all copies of H by removing o(n2) edges from G. Here, the o indicates the use of little o notation.

In the case when H is a triangle, resulting lemma is called triangle removal lemma.

History

The original motivation for the study of triangle removal lemma was Ruzsa–Szemerédi problem. Initial formulation due to Imre Z. Ruzsa and Szemerédi from 1978 was slightly weaker than the triangle removal lemma used nowadays and can be roughly stated as follows: every locally linear graph on n vertices contains o(n2) edges.[6] This statement can be quickly deduced from a modern triangle removal lemma. Ruzsa and Szemerédi provided also an alternative proof of Roth's theorem on arithmetic progressions as a simple corollary.[6]

In 1986 during their work on generalizations of Ruzsa–Szemerédi problem to arbitrary r-uniform graphs, Erdős, Frankl, and Rödl provided statement for general graphs very close to the modern graph removal lemma: if graph H2 is a homomorphic image of H2, then any H1-free graph G on n vertices can be made H2-free by removing o(n2) edges.[7]

The modern formulation of graph removal lemma was first stated by Füredi in 1994.[8] The proof generalized earlier approaches by Ruzsa and Szemerédi and Erdős, Frankl, and Rödl, also utilizing Szemerédi regularity lemma.

Graph counting lemma

A key component of the proof of graph removal lemma is the graph counting lemma about counting subgraphs in systems of regular pairs. Graph counting lemma is also very useful on its own. According to Füredi, it is used "in most applications of regularity lemma".[8]

Heuristic argument

Let H be a graph on h vertices, whose vertex set is V={1,2,,h} and edge set is E. Let X1,X2,,Xh be sets of vertices of some graph G such that for all ijE pair (Xi,Xj) is ϵ-regular (in the sense of regularity lemma). Let also dij be the density between sets Xi and Xj. Intuitively, regular pair (X,Y) with density d should behave like a random Erdős–Rényi-like graph, where every pair of vertices (x,y)(X×Y) is selected to be an edge independently with probability d. This suggests that the number of copies of H on vertices x1,x2,,xh such that xiXi should be close to the expected number from Erdős–Rényi model: ijE(H)dijiV(H)|Xi| where E(H) and V(H) are the edge set and the vertex set of H.

Precise statement

The straightforward formalization of above heuristic claim is as follows. Let H be a graph on h vertices, whose vertex set is V={1,2,,h} and edge set is E. Let δ>0 be arbitrary. Then there exists ϵ>0 such that for any X1,X2,,Xh as above, satisfying dij>δ for all ijE, the number of graph homomorphisms from H to G such that vertex iV(H) is mapped to Xi is not smaller than (1δ)ijE(dijδ)iV|Xi|

Blow-up Lemma

One can even find bounded degree subgraphs of blow-ups of H in a similar setting. The following claim appears in the literature under name of the blow-up lemma and was first proven by Komlós, Sárközy and Szemerédi.[9] Precise statement here is a slightly simplified version due to Komlós, who referred to it also as the key lemma, as it is used in numerous regularity-based proofs.[10]

Let H1 be an arbitrary graph and t+. Construct H(t) by replacing each vertex i of H by independent set Vi of size t and replacing every edge ij of H by complete bipartite graph on (Vi,Vj). Let ϵ,δ>0 be arbitrary reals, N be a positive integer and let H2 be a subgraph of H(t) with h vertices and with maximum degree Δ. Define ϵ0=δΔ/(2+Δ). Finally, let G be a graph and X1,X2,,Xh be disjoint sets of vertices of G such that whenever ijE(H2) then (Xi,Xj) is a ϵ-regular pair with density at least ϵ+δ. Then if ϵϵ0 and 1tNϵ0, the number of injective graph homomorphisms from H2 to G is at least (ϵ0N)h.

In fact, one can only restrict to counting homomorphisms such that any vertex k[h] of H2 such that kVi is mapped to a vertex in Xi.

Proof

We will provide proof of the counting lemma in the case when H is a triangle (triangle counting lemma). The proof of the general case, as well as the proof of the blow-up lemma, are very similar and do not require different techniques.

Take ϵ=δ/2. Let X1X1 be the set of those vertices in X1 which have at least (d12ϵ)|X2| neighbors in X2 and at least (d13ϵ)|X3| neighbors in X3. Note that if there were more than ϵ|X1| vertices in X1 with less than (d12ϵ)|X2| neighbors in X2, then these vertices together with whole X2 would witness ϵ-irregularity of the pair (X1,X2). Repeating this argument for X3 shows that we must have |X1|>(12ϵ)|X1|. Now take arbitrary xX1 and define X2 and X3 as neighbors of x in X2 and X3 respectively. By definition |X2|(d12ϵ)|X2|ϵ|X2| and |X3|ϵ|X3| so by regularity of (X2,X3) we obtain existence of at least (d23ϵ)|X2||X3|(d12ϵ)(d23ϵ)(d13ϵ)|X2||X3| triangles containing x. Since x was chosen arbitrarily from the set X1 of size at least (12ϵ)|X1|, we obtain a total of at least (12ϵ)(d23ϵ)|X2||X3|(d12ϵ)(d23ϵ)(d13ϵ)|X1||X2||X3| which finishes the proof as ϵ=δ/2.

Proof

Proof of the triangle removal lemma

To prove the triangle removal lemma, consider an ϵ/4-regular partition V1VM of the vertex set of G. This exists by the Szemerédi regularity lemma. The idea is to remove all edges between irregular pairs, low-density pairs, and small parts, and prove that if at least one triangle still remains, then many triangles remain. Specifically, remove all edges between parts Vi and Vj if

  • (Vi,Vj) is not ϵ/4-regular
  • the density d(Vi,Vj) is less than ϵ/2, or
  • either Vi or Vj has at most (ϵ/4M)n vertices.

This procedure removes at most ϵn2 edges. If there exists a triangle with vertices in Vi,Vj,Vk after these edges are removed, then the triangle counting lemma tells us there are at least (1ϵ2)(ϵ4)3(ϵ4M)3n3 triples in Vi×Vj×Vk which form a triangle. Thus, we may take δ<16(1ϵ2)(ϵ4)3(ϵ4M)3.

Proof of the graph removal lemma

The proof of the case of general H is analogous to the triangle case, and uses graph counting lemma instead of triangle counting lemma.

Induced Graph Removal Lemma

A natural generalization of the Graph Removal Lemma is to consider induced subgraphs. In property testing it is often useful to consider how far a graph is from being induced H-free.[11] A graph G is considered to contain an induced subgraph H if there is an injective map f:V(H)V(G) such that (f(u),f(v)) is an edge of G if and only if (u,v) is an edge of H. Notice that non-edges are considered as well. G is induced H-free if there is no induced subgraph G. We define G as ϵ-far from being induced H-free if we cannot add or delete ϵn2 edges to make G induced H-free.

Formulation

A version of the Graph Removal for induced subgraphs was proved by Alon, Fischer, Krivelevich, and Szegedy in 2000.[12] It states that for any graph H with h vertices and ϵ>0, there exists a constant δ>0 such that if an n-vertex graph G has fewer than δnh induced subgraphs isomorphic to H, then it is possible to eliminate all induced copies of H by adding or removing fewer than ϵn2 edges.

The problem can be reformulated as follows: Given a red-blue coloring H of the complete graph Kh (Analogous to the graph H on the same h vertices where non-edges are blue, edges are red), and a constant ϵ>0, then there exists a constant δ>0 such that for any red-blue colorings of Kn has fewer than δnh subgraphs isomorphic to H, then it is possible to eliminate all copies of H by changing the colors of fewer than ϵn2 edges. Notice that our previous "cleaning" process, where we remove all edges between irregular pairs, low-density pairs, and small parts, only involves removing edges. Removing edges only corresponds to changing edge colors from red to blue. However, there are situations in the induced case where the optimal edit distance involves changing edge colors from blue to red as well. Thus, the Regularity Lemma is insufficient to prove Induced Graph Removal Lemma. The proof of the Induced Graph Removal Lemma must take advantage of the strong regularity lemma.[12]

Proof

Strong Regularity Lemma

The strong regularity lemma[12] is a strengthened version of Szemerédi's Regularity Lemma. For any infinite sequence of constants ϵ0ϵ1...>0, there exists an integer M such that for any graph G, we can obtain two (equitable) partitions 𝒫 and 𝒬 such that the following properties are satisfied:

  • 𝒬 refines 𝒫, that is every part of 𝒫 is the union of some collection of parts in 𝒬.
  • 𝒫 is ϵ0-regular and 𝒬 is ϵ|𝒫|-regular.
  • q(𝒬)<q(𝒫)+ϵ0
  • |𝒬|M

The function q is defined to be the energy function defined in Szemerédi regularity lemma. Essentially, we can find a pair of partitions 𝒫,𝒬 where 𝒬 is extremely regular compared to 𝒫, and at the same time 𝒫,𝒬 are close to each other. (This property is captured in the third condition)

Corollary of the Strong Regularity Lemma

The following corollary of the strong regularity lemma is used in the proof of the Induced Graph Removal Lemma.[12] For any infinite sequence of constants ϵ0ϵ1...>0, there exists δ>0 such that there exists a partition 𝒫=V1,...,Vk and subsets WiVi for each i where the following properties are satisfied:

  • |Wi|>δn
  • (Wi,Wj) is ϵ|𝒫|-regular for each pair i,j
  • |d(Wi,Wj)d(Vi,Vj)|ϵ0 for all but ϵ0|𝒫|2 pairs i,j

The main idea of the proof of this corollary is to start with two partitions 𝒫 and 𝒬 that satisfy the Strong Regularity Lemma where q(𝒬)<q(𝒫)+ϵ03/8. Then for each part Vi𝒫, we uniformly at random choose some part WiVi that is a part in 𝒬. The expected number of irregular pairs (Wi,Wj) is less than 1. Thus, there exists some collection of Wi such that every pair is ϵ|𝒫|-regular!

The important aspect of this corollary is that every pair of Wi,Wj are ϵ|𝒫|-regular! This allows us to consider edges and non-edges when we perform our cleaning argument.

Proof of Sketch of the Induced Graph Removal Lemma

With these results, we are able to prove the Induced Graph Removal Lemma. Take any graph G with n vertices that has less than δnv(H) copies of H. The idea is to start with a collection of vertex sets Wi which satisfy the conditions of the Corollary of the Strong Regularity Lemma.[12] We then can perform a "cleaning" process where we remove all edges between pairs of parts (Wi,Wj) with low density, and we can add all edges between pairs of parts (Wi,Wj) with high density. We choose the density requirements such that we added/deleted at most ϵn2 edges.

If the new graph has no copies of H, then we are done. Suppose the new graph has a copy of H. Suppose the vertex viv(H) is embedded in Wf(i). Then if there is an edge connecting vi,vj in H, then Wi,Wj does not have low density. (Edges between Wi,Wj were not removed in the cleaning process) Similarly, if there is not an edge connecting vi,vj in H, then Wi,Wj does not have high density. (Edges between Wi,Wj were not added in the cleaning process)

Thus, by a similar counting argument to the proof of the triangle counting lemma, that is the graph counting lemma, we can show that G has more than δnv(H) copies of H.

Generalizations

The graph removal lemma was later extended to directed graphs[5] and to hypergraphs.[4]

Quantitative bounds

Usage of regularity lemma in the proof of graph removal lemma forces δ to be extremely small, bounded by tower function of height polynomial in ϵ1 that is δ=1/tower(ϵO(1)) (here tower(k) is the tower of twos of height k). Tower function of height ϵO(1) is necessary in all regularity proofs as is implied by results of Gowers on lower bounds in regularity lemma.[13] However, in 2011 Fox provided a new proof of graph removal lemma which does not use regularity lemma, improving the bound to δ=1/tower(5h2logϵ1) (here h is number of vertices of removed graph H).[1] His proof, however, uses regularity-related ideas such as energy increment, but with different notion of energy, related to entropy. This proof can be also rephrased using Frieze-Kannan weak regularity lemma as noted by Conlon and Fox.[11] In the special case of bipartite H it was shown that δ=ϵO(1) is sufficient.

There is a large gap between upper and lower bounds for δ in the general case. The current best result true for all graphs H is due to Alon and states that for each nonbipartite H there exists constant c>0 such that δ<(ϵ/c)clog(c/ϵ) is necessary for the graph removal lemma to hold while for bipartite H the optimal δ has polynomial dependence on ϵ, which matches the lower bound. Construction for nonbipartite case is a consequence of Behrend construction of large Salem-Spencer set. Indeed, as triangle removal lemma implies Roth's theorem, existence of large Salem-Spencer set may be translated to an upper bound for δ in the triangle removal lemma. This method can be leveraged for arbitrary nonbipartite H to give aforementioned bound.

Applications

Additive combinatorics

Graph theory

  • Graph counting/removal lemma can be used to provide quick and transparent proof of Erdős–Stone theorem starting from Turán's theorem and extend the result to Simonovits stability: for any graph H and any ϵ>0 there exists δ such that any H-free graph on n vertices with at least (11χ(H)1)(n2)δn2 edges can be transformed into complete (χ(H)1)-partite Turán graph Tn,χ(H)1 by adding and/or deleting at most ϵn2 edges (here χ(H) is the chromatic number of H).[15] Although both results had been proven earlier using more elementary techniques (Erdős–Stone theorem was proved in 1966[16] by Erdős and Stone while Simonovits stability was shown in the same year by various authors[16][17][18][19]), the regularity proof provides different viewpoint and elucidates connection with other modern proofs.
  • Graph removal lemma together with Erdős–Stone theorem may be used to show that the number of non-isomorphic H-free graphs on n vertices is equal to

    2(π(H)+o(1))(n2)

    where π(H)=11χ(H)1 is the Turán density of H.[7]

Property testing

  • The graph removal lemma has applications for property testing, because it implies that for every graph, either the graph is near an H-free graph, or random sampling will easily find a copy of H in the graph.[5] One result is that for any fixed ϵ>0, there is a constant time algorithm that returns with high probability whether or not a given n-vertex graph G is ϵ-far from being H-free.[20] Here, ϵ-far from being H-free means that at least ϵn2 edges must be removed to eliminate all copies of H in G.
  • The induced graph removal lemma was formulated by Alon, Fischer, Krivelevich, and Szegedy to characterize testable graph properties.[12]

See also

References

  1. 1.0 1.1 1.2 "A new proof of the graph removal lemma", Annals of Mathematics, Second Series 174 (1): 561–579, 2011, doi:10.4007/annals.2011.174.1.17 
  2. Trevisan, Luca (May 13, 2009), "The Triangle Removal Lemma", in theory, https://lucatrevisan.wordpress.com/2009/05/13/the-triangle-removal-lemma/ 
  3. 3.0 3.1 "On certain sets of integers", Journal of the London Mathematical Society 28 (1): 104–109, 1953, doi:10.1112/jlms/s1-28.1.104 
  4. 4.0 4.1 4.2 "A variant of the hypergraph removal lemma", Journal of Combinatorial Theory, Series A 113 (7): 1257–1280, 2006, doi:10.1016/j.jcta.2005.11.006 
  5. 5.0 5.1 5.2 "Testing subgraphs in directed graphs", Journal of Computer and System Sciences 69 (3): 353–382, 2004, doi:10.1016/j.jcss.2004.04.008 
  6. 6.0 6.1 "Triple systems with no six points carrying three triangles", Combinatorics (Proc. Fifth Hungarian Colloq., Keszthely, 1976), Vol. II, Colloq. Math. Soc. János Bolyai, 18, Amsterdam and New York: North-Holland, 1978, pp. 939–945 
  7. 7.0 7.1 "The asymptotic number of graphs not containing a fixed subgraph and a problem for hypergraphs having no exponent", Graphs and Combinatorics 2 (2): 113–121, 1986, doi:10.1007/BF01788085 
  8. 8.0 8.1 Füredi, Zoltán (1995). "Extremal Hypergraphs and Combinatorial Geometry". in Chatterji, S. D. (in en). Proceedings of the International Congress of Mathematicians. Basel: Birkhäuser. pp. 1343–1352. doi:10.1007/978-3-0348-9078-6_129. ISBN 978-3-0348-9078-6. https://link.springer.com/chapter/10.1007/978-3-0348-9078-6_129. 
  9. Komlós, János; Sárközy, Gábor N.; Szemerédi, Endre (1997-03-01). "Blow-up Lemma" (in en). Combinatorica 17 (1): 109–123. doi:10.1007/BF01196135. ISSN 1439-6912. https://doi.org/10.1007/BF01196135. 
  10. Komlós, János (1999). "The Blow-up Lemma" (in en). Combinatorics, Probability and Computing 8 (1–2): 161–176. doi:10.1017/S0963548398003502. ISSN 1469-2163. https://www.cambridge.org/core/journals/combinatorics-probability-and-computing/article/abs/blowup-lemma/1F094FCD0A272B5FBE5B1E5D0583E8D3. 
  11. 11.0 11.1 Blackburn, Simon R.; Gerke, Stefanie; Wildon, Mark, eds. (2013), "Graph removal lemmas", Surveys in Combinatorics 2013, London Mathematical Society Lecture Note Series, 409, Cambridge, UK: Cambridge University Press, pp. 1–49, doi:10.1017/CBO9781139506748.002 
  12. 12.0 12.1 12.2 12.3 12.4 12.5 "Efficient testing of large graphs", Combinatorica 20 (4): 451–476, 2000, doi:10.1007/s004930070001 
  13. Gowers, W. T. (1997). "Lower bounds of tower type for Szemerédi's uniformity lemma". Geometric and Functional Analysis 7 (2): 322–337. doi:10.1007/PL00001621. 
  14. "Note on a generalization of Roth's theorem", Discrete and Computational Geometry, Algorithms and Combinatorics 25: 825–827, 2003, doi:10.1007/978-3-642-55566-4_39, ISBN 978-3-642-62442-1 
  15. Alon, N. (2001-10-14). "Testing subgraphs in large graphs". Proceedings 42nd IEEE Symposium on Foundations of Computer Science. FOCS '01. USA: IEEE Computer Society. pp. 434–441. doi:10.1109/SFCS.2001.959918. ISBN 978-0-7695-1390-4. https://dl.acm.org/doi/10.5555/874063.875540. 
  16. 16.0 16.1 "A limit theorem in graph theory". Studia Sci. Math. Hung. 1: 51–57. 1966. 
  17. "Some recent results on extremal problems in graph theory". Theory of Graphs, International Symp. Rome: 118–123. 1966. 
  18. "On some new inequalities concerning extremal properties of graphs". Theory of Graphs, Proc. Coll. Tihany, Hungary: 77–81. 1966. 
  19. "A method for solving extremal problems in graph theory". Theory of Graphs, Proc. Coll. Tihany: 279–319. 1966. 
  20. "A characterization of the (natural) graph properties testable with one-sided error", SIAM Journal on Computing 37 (6): 1703–1727, 2008, doi:10.1137/06064888X