Dense subgraph

From HandWiki
Revision as of 15:59, 6 February 2024 by AstroAI (talk | contribs) (url)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Highly connected subgraph
An example of a graph G with density dG = 1.375 and its densest subgraph induced by the vertices b, c, d, e and h in red with density 1.4 .

In graph theory and computer science, a dense subgraph is a subgraph with many edges per vertex. This is formalized as follows: let G = (V, E) be an undirected graph and let S = (VS, ES) be a subgraph of G. Then the density of S is defined to be:

[math]\displaystyle{ d(S) = {|E_S|\over|V_S|} }[/math]

The densest subgraph problem is that of finding a subgraph of maximum density. The density of the maximally dense subgraph of a graph is sometimes referred to as its subgraph density. In 1984, Andrew V. Goldberg developed a polynomial time algorithm to find the maximum density subgraph using a max flow technique. This has been improved by Gallo, Grigoriadis and Tarjan in 1989[1] to run in O(nm log(n2/m)) time. A simple LP for finding the optimal solution was given by Charikar in 2000.[2]

Subgraph density is asymptotic to the related notion of arboricity and to graph degeneracy.

Densest k subgraph

There are many variations on the densest subgraph problem. One of them is the densest k subgraph problem, where the objective is to find the maximum density subgraph on exactly k vertices. This problem generalizes the clique problem and is thus NP-hard in general graphs. There exists a polynomial algorithm approximating the densest k subgraph within a ratio of [math]\displaystyle{ n^{1/4 \,+\, \epsilon} }[/math] for every [math]\displaystyle{ \epsilon \gt 0 }[/math],[3] while it does not admit an [math]\displaystyle{ n^{1/\!\operatorname{polyloglog} n} }[/math]-approximation in polynomial time unless the exponential time hypothesis is false.[4] Under a weaker assumption that [math]\displaystyle{ \mathsf{NP} \nsubseteq \bigcap_{\epsilon \gt 0} \mathsf{BPTIME}(2^{n^\epsilon}) }[/math], no PTAS exists for the problem.[5]

The problem remains NP-hard in bipartite graphs and chordal graphs but is polynomial for trees and split graphs.[6] It is open whether the problem is NP-hard or polynomial in (proper) interval graphs and planar graphs; however, a variation of the problem in which the subgraph is required to be connected is NP-hard in planar graphs.[7]

Densest at most k subgraph

The objective of the densest at most [math]\displaystyle{ k }[/math] problem is to find the maximum density subgraph on at most [math]\displaystyle{ k }[/math] vertices. Andersen and Chellapilla showed that if there exists an [math]\displaystyle{ \alpha }[/math]-approximation for this problem then that will lead to an [math]\displaystyle{ \Theta(\alpha^2) }[/math]-approximation for the densest [math]\displaystyle{ k }[/math] subgraph problem.[8] Later, this was improved by Khuller and Saha who showed that an [math]\displaystyle{ \alpha }[/math]-approximation for densest at most [math]\displaystyle{ k }[/math] subgraph implies a [math]\displaystyle{ 4 \alpha }[/math]-approximation for the densest [math]\displaystyle{ k }[/math] subgraph problem.[9]

Densest at least k subgraph

The densest at least [math]\displaystyle{ k }[/math] problem is defined similarly to the densest at most [math]\displaystyle{ k }[/math] subgraph problem. The problem is NP-complete,[9] but admits 2-approximation in polynomial time.[10] Moreover, there is some evidence that this approximation algorithm is essentially the best possible: assuming the small set expansion hypothesis (a computational complexity assumption closely related to the unique games conjecture), then it is NP-hard to approximate the problem to within [math]\displaystyle{ (2 - \epsilon) }[/math] factor for every constant [math]\displaystyle{ \epsilon \gt 0 }[/math].[11]

K-clique densest subgraph

Charalampos Tsourakakis introduced the [math]\displaystyle{ k }[/math]-clique densest subgraph problem. This variation of the densest subgraph problem aims to maximize the average number of induced [math]\displaystyle{ k }[/math] cliques [math]\displaystyle{ d_{k}(S) = {|C_k(S)|\over|V_S|} }[/math], where [math]\displaystyle{ C_k(S) }[/math] is the set of [math]\displaystyle{ k }[/math]-cliques induced by [math]\displaystyle{ S }[/math]. Notice that the densest subgraph problem is obtained as a special case for [math]\displaystyle{ k=2 }[/math]. This generalization provides an empirically successful poly-time approach for extracting large near-cliques from large-scale real-world networks.

Locally top-k densest subgraph

Qin et al. introduced the problem of top-k locally densest subgraphs discovery in a graph, each of which achieves the highest density in its local region in the graph: it is neither contained in any supergraph with the same or larger density, nor it contains subgraphs with density being loosely connected with the rest of the local densest subgraph. Note that the densest subgraph problem is obtained as a special case for [math]\displaystyle{ k=1 }[/math]. The set of locally densest subgraphs in a graph can be computed in polynomial time.

References

  1. Gallo, Giorgio; Grigoriadis, Michael D.; Tarjan, Robert E. (1989), "A fast parametric maximum flow algorithm and applications", SIAM Journal on Computing 18 (1): 30–55, doi:10.1137/0218003 
  2. Charikar, Moses (2000), "Greedy approximation algorithms for finding dense components in a graph", in Jansen, Klaus; Khuller, Samir, Approximation Algorithms for Combinatorial Optimization, Third International Workshop, APPROX 2000, Saarbrücken, Germany, September 5-8, 2000, Proceedings, Lecture Notes in Computer Science, 1913, Springer, pp. 84–95, doi:10.1007/3-540-44436-X_10, ISBN 978-3-540-67996-7 
  3. Bhaskara, Aditya (2010), "Detecting high log-densities—an O(n1/4) approximation for densest k-subgraph", STOC'10—Proceedings of the 2010 ACM International Symposium on Theory of Computing, ACM, New York, pp. 201–210, doi:10.1145/1806689.1806719, ISBN 9781450300506 .
  4. Manurangsi, Pasin (2017), "Almost-polynomial ratio ETH-hardness of approximating densest k-subgraph", STOC'17—Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, ACM, pp. 954–961, doi:10.1145/3055399.3055412, ISBN 9781450345286 .
  5. "Ruling out PTAS for graph min-bisection, dense k-subgraph, and bipartite clique", SIAM Journal on Computing 36 (4): 1025–1071, 2006, doi:10.1137/S0097539705447037 .
  6. "Clustering and domination in perfect graphs", Discrete Applied Mathematics 9 (1): 27–39, 1984, doi:10.1016/0166-218X(84)90088-X .
  7. Keil, J. Mark; Brecht, Timothy B. (1991), "The complexity of clustering in planar graphs", Journal of Combinatorial Mathematics and Combinatorial Computing 9: 155–159, https://cs.uwaterloo.ca/~brecht/papers/jcmcc-1991.pdf .
  8. Andersen, Reid; Chellapilla, Kumar (2009). Avrachenkov, Konstantin; Donato, Debora; Litvak, Nelly. eds. "Finding Dense Subgraphs with Size Bounds" (in en). Algorithms and Models for the Web-Graph. Lecture Notes in Computer Science (Berlin, Heidelberg: Springer): 25–37. doi:10.1007/978-3-540-95995-3_3. ISBN 978-3-540-95995-3. https://link.springer.com/chapter/10.1007/978-3-540-95995-3_3. 
  9. 9.0 9.1 "On finding dense subgraphs", Automata, Languages and Programming: 36th International Colloquium, ICALP 2009, Rhodes, Greece, July 5-12, 2009, Proceedings, Part I, Lecture Notes in Computer Science, 5555, Berlin: Springer-Verlag, 2009, pp. 597–608, doi:10.1007/978-3-642-02927-1_50, ISBN 978-3-642-02926-4, http://www.cs.umd.edu/~samir/grant/ICALP09.pdf 
  10. Andersen, Reid (2007), Finding large and small dense subgraphs, Bibcode2007cs........2032A 
  11. Manurangsi, Pasin (2018), "Inapproximability of maximum biclique problems, minimum k-cut and densest at-least-k-subgraph from the small set expansion hypothesis", Algorithms 11 (1): 10, doi:10.3390/a11010010 

Further reading

  • Andersen, R.; Chellapilla, K. (2009), "Finding dense subgraphs with size bounds", WAW: 25–36 .
  • Feige, U.; Kortsarz, G.; Peleg, D. (1997), "The dense k-subgraph problem", Algorithmica 29 (3): 410–421, doi:10.1007/s004530010050 .
  • Goldberg, A. V. (1984), "Finding a maximum density subgraph", Technical Report .
  • Tsourakakis, C. (2015), "The K-clique Densest Subgraph Problem", Proceedings of the 24th International Conference on World Wide Web, pp. 1122–1132, doi:10.1145/2736277.2741098, ISBN 9781450334693 .
  • Qin, Lu; Li, Rong{-}Hua; Chang, Lijun; Zhang, Chengqi (2015), "Locally densest subgraph discovery", in Cao, Longbing; Zhang, Chengqi; Joachims, Thorsten et al., Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, NSW, Australia, August 10-13, 2015, {ACM}, pp. 965–974, doi:10.1145/2783258.2783299, ISBN 9781450336642 .