Hyperbolic geometric graph

From HandWiki
Revision as of 16:57, 6 February 2024 by Jport (talk | contribs) (url)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

A hyperbolic geometric graph (HGG) or hyperbolic geometric network (HGN) is a special type of spatial network where (1) latent coordinates of nodes are sprinkled according to a probability density function into a hyperbolic space of constant negative curvature and (2) an edge between two nodes is present if they are close according to a function of the metric[1][2] (typically either a Heaviside step function resulting in deterministic connections between vertices closer than a certain threshold distance, or a decaying function of hyperbolic distance yielding the connection probability). A HGG generalizes a random geometric graph (RGG) whose embedding space is Euclidean.

Mathematical formulation

Mathematically, a HGG is a graph [math]\displaystyle{ G(V,E) }[/math] with a vertex set V (cardinality [math]\displaystyle{ N=|V| }[/math]) and an edge set E constructed by considering the nodes as points placed onto a 2-dimensional hyperbolic space [math]\displaystyle{ \mathbb{H}^2_\zeta }[/math] of constant negative Gaussian curvature, [math]\displaystyle{ -\zeta^2 }[/math] and cut-off radius [math]\displaystyle{ R }[/math], i.e. the radius of the Poincaré disk which can be visualized using a hyperboloid model. Each point [math]\displaystyle{ i }[/math] has hyperbolic polar coordinates [math]\displaystyle{ (r_i,\theta_i) }[/math] with [math]\displaystyle{ 0\leq r_i\leq R }[/math] and [math]\displaystyle{ 0\leq \theta_i \lt 2\pi }[/math].

The hyperbolic law of cosines allows to measure the distance [math]\displaystyle{ d_{ij} }[/math] between two points [math]\displaystyle{ i }[/math] and [math]\displaystyle{ j }[/math],[2]

[math]\displaystyle{ \cosh(\zeta d_{ij})=\cosh(\zeta r_i)\cosh(\zeta r_j) }[/math][math]\displaystyle{ -\sinh(\zeta r_i)\sinh(\zeta r_j)\cos\bigg(\underbrace{\pi\!-\!\bigg|\pi-|\theta_i\!-\!\theta_j|\bigg|}_{\Delta}\bigg). }[/math]

The angle [math]\displaystyle{ \Delta }[/math] is the (smallest) angle between the two position vectors.

In the simplest case, an edge [math]\displaystyle{ (i,j) }[/math] is established iff (if and only if) two nodes are within a certain neighborhood radius [math]\displaystyle{ r }[/math], [math]\displaystyle{ d_{ij}\leq r }[/math], this corresponds to an influence threshold.

Connectivity decay function

In general, a link will be established with a probability depending on the distance [math]\displaystyle{ d_{ij} }[/math]. A connectivity decay function [math]\displaystyle{ \gamma(s): \mathbb{R}^+\to[0,1] }[/math] represents the probability of assigning an edge to a pair of nodes at distance [math]\displaystyle{ s }[/math]. In this framework, the simple case of hard-code neighborhood like in random geometric graphs is referred to as truncation decay function.[3]

Generating hyperbolic geometric graphs

Krioukov et al.[2] describe how to generate hyperbolic geometric graphs with uniformly random node distribution (as well as generalized versions) on a disk of radius [math]\displaystyle{ R }[/math] in [math]\displaystyle{ \mathbb{H}_\zeta^2 }[/math]. These graphs yield a power-law distribution for the node degrees. The angular coordinate [math]\displaystyle{ \theta }[/math] of each point/node is chosen uniformly random from [math]\displaystyle{ [0, 2\pi] }[/math], while the density function for the radial coordinate r is chosen according to the probability distribution [math]\displaystyle{ \rho }[/math]:

[math]\displaystyle{ \rho(r) = \alpha \frac{\sinh(\alpha r)}{\cosh(\alpha R) - 1} }[/math]

The growth parameter [math]\displaystyle{ \alpha \gt 0 }[/math] controls the distribution: For [math]\displaystyle{ \alpha = \zeta }[/math], the distribution is uniform in [math]\displaystyle{ \mathbb{H}_\zeta^2 }[/math], for smaller values the nodes are distributed more towards the center of the disk and for bigger values more towards the border. In this model, edges between nodes [math]\displaystyle{ u }[/math] and [math]\displaystyle{ v }[/math] exist iff [math]\displaystyle{ d_{uv} \lt R }[/math] or with probability [math]\displaystyle{ \gamma(d_{uv}) }[/math] if a more general connectivity decay function is used. The average degree is controlled by the radius [math]\displaystyle{ R }[/math] of the hyperbolic disk. It can be shown, that for [math]\displaystyle{ \alpha / \zeta \gt 1/2 }[/math] the node degrees follow a power law distribution with exponent [math]\displaystyle{ \gamma = 1 + 2\alpha / \zeta }[/math].

The image depicts randomly generated graphs for different values of [math]\displaystyle{ \alpha }[/math] and [math]\displaystyle{ R }[/math] in [math]\displaystyle{ \mathbb{H}_1^2 }[/math]. It can be seen how [math]\displaystyle{ \alpha }[/math] has an effect on the distribution of the nodes and [math]\displaystyle{ R }[/math] on the connectivity of the graph. The native representation where the distance variables have their true hyperbolic values is used for the visualization of the graph, therefore edges are straight lines.

Random hyperbolic geometric graphs with N=100 nodes each for different values of alpha and R

Quadratic complexity generator[4]

The naive algorithm for the generation of hyperbolic geometric graphs distributes the nodes on the hyperbolic disk by choosing the angular and radial coordinates of each point are sampled randomly. For every pair of nodes an edge is then inserted with the probability of the value of the connectivity decay function of their respective distance. The pseudocode looks as follows:

[math]\displaystyle{  V = \{ \}, E = \{ \}  }[/math]
for [math]\displaystyle{  i \gets 0  }[/math] to [math]\displaystyle{  N - 1  }[/math] do
    [math]\displaystyle{  \theta \gets U[0, 2\pi]  }[/math]
    [math]\displaystyle{  r \gets \frac{1}{\alpha} \text{acosh} (1 + (\cosh \alpha R - 1) U[0, 1])  }[/math]
    [math]\displaystyle{  V = V \cup \{ (r, \theta)\}  }[/math]
for every pair  [math]\displaystyle{  (u, v) \in V \times V, u \neq v }[/math] do
    if [math]\displaystyle{  U[0, 1] \leq \gamma(d_{uv})  }[/math] then
        [math]\displaystyle{  E = E \cup \{ (u,v) \}  }[/math]
return [math]\displaystyle{  V, E  }[/math]

[math]\displaystyle{ N }[/math] is the number of nodes to generate, the distribution of the radial coordinate by the probability density function [math]\displaystyle{ \rho }[/math] is achieved by using inverse transform sampling. [math]\displaystyle{ U }[/math]denotes the uniform sampling of a value in the given interval. Because the algorithm checks for edges for all pairs of nodes, the runtime is quadratic. For applications where [math]\displaystyle{ N }[/math] is big, this is not viable any more and algorithms with subquadratic runtime are needed.

Sub-quadratic generation

To avoid checking for edges between every pair of nodes, modern generators use additional data structures that partition the graph into bands.[5][6] A visualization of this shows a hyperbolic graph with the boundary of the bands drawn in orange. In this case, the partitioning is done along the radial axis. Points are stored sorted by their angular coordinate in their respective band. For each point [math]\displaystyle{ u }[/math], the limits of its hyperbolic circle of radius [math]\displaystyle{ R }[/math] can be (over-)estimated and used to only perform the edge-check for points that lie in a band that intersects the circle. Additionally, the sorting within each band can be used to further reduce the number of points to look at by only considering points whose angular coordinate lie in a certain range around the one of [math]\displaystyle{ u }[/math] (this range is also computed by over-estimating the hyperbolic circle around [math]\displaystyle{ u }[/math]).

Using this and other extensions of the algorithm, time complexities of [math]\displaystyle{ \mathcal{O}(n \log \log n + m) }[/math](where [math]\displaystyle{ n }[/math] is the number of nodes and [math]\displaystyle{ m }[/math] the number of edges) are possible with high probability.[7]

The hyperbolic graph is partitioned into bands such that each of them holds approximately the same number of points.

Findings

For [math]\displaystyle{ \zeta=1 }[/math] (Gaussian curvature [math]\displaystyle{ K=-1 }[/math]), HGGs form an ensemble of networks for which is possible to express the degree distribution analytically as closed form for the limiting case of large number of nodes.[2] This is worth mentioning since this is not true for many ensembles of graphs.

Applications

HGGs have been suggested as promising model for social networks where the hyperbolicity appears through a competition between similarity and popularity of an individual.[8]

References

  1. Barthélemy, Marc (2011). "Spatial networks". Physics Reports 499 (1–3): 1–101. doi:10.1016/j.physrep.2010.11.002. Bibcode2011PhR...499....1B. 
  2. 2.0 2.1 2.2 2.3 Krioukov, Dmitri; Papadopoulos, Fragkiskos; Kitsak, Maksim; Vahdat, Amin; Boguñá, Marián (2010). "Hyperbolic geometry of complex networks". Physical Review E 82 (3): 036106. doi:10.1103/PhysRevE.82.036106. PMID 21230138. Bibcode2010PhRvE..82c6106K. 
  3. Barnett, L.; Di Paolo, E.; Bullock, S. (2007). "Spatially embedded random networks". Physical Review E 76 (5): 056115. doi:10.1103/PhysRevE.76.056115. PMID 18233726. Bibcode2007PhRvE..76e6115B. https://eprints.soton.ac.uk/266764/1/sern_physreve.pdf. Retrieved 2023-02-04. 
  4. Krioukov, Dmitri; Orsini, Chiara; Aldecoa, Rodrigo (2015-03-17). "Hyperbolic Graph Generator" (in en). Computer Physics Communications 196: 492–496. doi:10.1016/j.cpc.2015.05.028. Bibcode2015CoPhC.196..492A. 
  5. von Looz, Moritz; Meyerhenke, Henning; Prutkin, Roman (2015). "Generating Random Hyperbolic Graphs in Subquadratic Time". in Elbassioni, Khaled; Makino, Kazuhisa (in en). Algorithms and Computation. Lecture Notes in Computer Science. 9472. Springer Berlin Heidelberg. pp. 467–478. doi:10.1007/978-3-662-48971-0_40. ISBN 9783662489710. 
  6. Meyerhenke, Henning; Laue, Sören; Özdayi, Mustafa; von Looz, Moritz (2016-06-30) (in en). Generating massive complex networks with hyperbolic geometry faster in practice. Bibcode2016arXiv160609481V. https://archive.org/details/arxiv-1606.09481. 
  7. Penschuck, Manuel (2017). "Generating Practical Random Hyperbolic Graphs in Near-Linear Time and with Sub-Linear Memory" (in en). Schloss Dagstuhl - Leibniz-Zentrum für Informatik GMBH, Wadern/Saarbruecken, Germany. Leibniz International Proceedings in Informatics (LIPIcs) 75: 26:1–26:21. doi:10.4230/lipics.sea.2017.26. ISBN 9783959770361. http://drops.dagstuhl.de/opus/volltexte/2017/7621/. Retrieved 2023-02-04. 
  8. Papadopoulos, Fragkiskos; Kitsak, Maksim; Serrano, M. Ángeles; Boguñá, Marián; Krioukov, Dmitri (12 September 2012). "Popularity versus similarity in growing networks". Nature 489 (7417): 537–540. doi:10.1038/nature11459. PMID 22972194. Bibcode2012Natur.489..537P.