Betweenness centrality

Short description: Measure of a graph's centrality, based on shortest paths

In graph theory, betweenness centrality is a measure of centrality in a graph based on shortest paths. Betweenness centrality measures how frequently a node appears on the shortest path between other nodes in the graph. For every pair of vertices in a connected graph, there exists at least one shortest path between the vertices, that is, there exists at least one path such that either the number of edges that the path passes through (for unweighted graphs) or the sum of the weights of the edges (for weighted graphs) is minimized.

Betweenness centrality was devised as a general measure of centrality:^[1]. It applies to a wide range of problems in network theory, including problems related to social networks, biology, transport, and scientific cooperation. Although earlier authors had intuitively described centrality in terms of betweenness, Freeman (1977) gave the first formal definition of betweenness centrality.

Betweenness centrality has wide application in network theory; it measures the extent to which nodes lie between one another. For example, in a telecommunications network, a node with higher betweenness centrality would have more control over the network because more information flows through it.

Definition

The betweenness centrality of a node $v$ is given by the expression:

g (v) = \sum_{s \neq v \neq t} \frac{σ_{s t} (v)}{σ_{s t}}

where $σ_{s t}$ is the total number of shortest paths from node $s$ to node $t$ and $σ_{s t} (v)$ is the number of those paths that pass through $v$ (not where $v$ is an end point).^[2]

The betweenness centrality of a node scales with the number of pairs of nodes as suggested by the summation indices. Therefore, the calculation may be rescaled by dividing through by the number of pairs of nodes not including $v$ , so that $g \in [0, 1]$ . The division is done by $(N - 1) (N - 2)$ for directed graphs and $(N - 1) (N - 2) / 2$ for undirected graphs, where $N$ is the number of nodes in the giant component. Note that this scales for the highest possible value, where one node is crossed by every single shortest path. This is often not the case, and a normalization can be performed without a loss of precision

normal (g (v)) = \frac{g (v) - \min (g)}{\max (g) - \min (g)}

which results in:

\max (normal) = 1

\min (normal) = 0

Note that this will always be a scaling from a smaller range into a larger range, so no precision is lost.

Weighted networks

In a weighted network the links connecting the nodes are no longer treated as binary interactions, but are weighted in proportion to their capacity, influence, frequency, etc., which adds another dimension of heterogeneity within the network beyond the topological effects. A node's strength in a weighted network is given by the sum of the weights of its adjacent edges.

s_{i} = \sum_{j = 1}^{N} a_{i j} w_{i j}

With $a_{i j}$ and $w_{i j}$ being adjacency and weight matrices between nodes $i$ and $j$ , respectively. Analogous to the power law distribution of degree found in scale free networks, the strength of a given node follows a power law distribution as well.

s (k) \approx k^{β}

A study of the average value $s (b)$ of the strength for vertices with betweenness $b$ shows that the functional behavior can be approximated by a scaling form:^[3]

$s (b) \approx b^{α}$

Percolation centrality

Percolation centrality is a version of weighted betweenness centrality, but it considers the 'state' of the source and target nodes of each shortest path in calculating this weight. Percolation of a 'contagion' occurs in complex networks in a number of scenarios. For example, viral or bacterial infection can spread over social networks of people, known as contact networks. The spread of disease can also be considered at a higher level of abstraction, by contemplating a network of towns or population centres, connected by road, rail or air links. Computer viruses can spread over computer networks. Rumours or news about business offers and deals can also spread via social networks of people. In all of these scenarios, a 'contagion' spreads over the links of a complex network, altering the 'states' of the nodes as it spreads, either recoverably or otherwise. For example, in an epidemiological scenario, individuals go from 'susceptible' to 'infected' state as the infection spreads. The states the individual nodes can take in the above examples could be binary (such as received/not received a piece of news), discrete (susceptible/infected/recovered), or even continuous (such as the proportion of infected people in a town), as the contagion spreads. The common feature in all these scenarios is that the spread of contagion results in the change of node states in networks. Percolation centrality (PC) was proposed with this in mind, which specifically measures the importance of nodes in terms of aiding the percolation through the network. This measure was proposed by Piraveenan, Prokopenko & Hossain (2013).^[4]

Percolation centrality is defined for a given node, at a given time, as the proportion of 'percolated paths' that go through that node. A 'percolated path' is a shortest path between a pair of nodes, where the source node is percolated (e.g., infected). The target node can be percolated or non-percolated, or in a partially percolated state.

P C^{t} (v) = \frac{1}{N - 2} \sum_{s \neq v \neq r} \frac{σ_{s r} (v)}{σ_{s r}} \frac{{x^{t}}_{s}}{\sum [{x^{t}}_{i}] - {x^{t}}_{v}}

where $σ_{s r}$ is total number of shortest paths from node $s$ to node $r$ and $σ_{s r} (v)$ is the number of those paths that pass through $v$ . The percolation state of the node $i$ at time $t$ is denoted by ${x^{t}}_{i}$ and two special cases are when ${x^{t}}_{i} = 0$ which indicates a non-percolated state at time $t$ whereas when ${x^{t}}_{i} = 1$ which indicates a fully percolated state at time $t$ . The values in between indicate partially percolated states ( e.g., in a network of townships, this would be the percentage of people infected in that town).

The attached weights to the percolation paths depend on the percolation levels assigned to the source nodes, based on the premise that the higher the percolation level of a source node is, the more important are the paths that originate from that node. Nodes which lie on shortest paths originating from highly percolated nodes are therefore potentially more important to the percolation. The definition of PC may also be extended to include target node weights as well. Percolation centrality calculations run in $O (| V | | E |)$ time with an efficient implementation adapted from Brandes' algorithm. If the calculation needs to consider target node weights, the worst case time is $O (| V |^{3})$ .

Algorithms

Calculating the betweenness and closeness centralities of all the vertices in a graph involves calculating the shortest paths between all pairs of vertices on a graph, which takes $Θ (| V |^{3})$ time with the Floyd–Warshall algorithm, modified to not only find one but count all shortest paths between two nodes. On a sparse graph, Johnson's algorithm or Brandes' algorithm may be more efficient, both taking $O (| V |^{2} \log | V | + | V | | E |)$ time. On unweighted graphs, calculating betweenness centrality takes $O (| V | | E |)$ time using Brandes' algorithm.^[5]

In calculating betweenness and closeness centralities of all vertices in a graph, it is assumed that graphs are undirected and connected with the allowance of loops and multiple edges. When specifically dealing with network graphs, often graphs are without loops or multiple edges to maintain simple relationships (where edges represent connections between two people or vertices). In this case, using Brandes' algorithm will divide final centrality scores by 2 to account for each shortest path being counted twice.^[6]

Another algorithm generalizes the Freeman's betweenness computed on geodesics and Newman's betweenness computed on all paths, by introducing a hyper-parameter controlling the trade-off between exploration and exploitation. The time complexity is the number of edges times the number of nodes in the graph.^[7]

Approximations

Because exact computation of betweenness centrality can be expensive on large graphs, a number of approximation algorithms have been proposed. Many methods estimate betweenness by sampling shortest paths between randomly selected pairs of vertices instead of enumerating all shortest paths.^[8]^[9]

Riondato and Kornaropoulos proposed a shortest-path sampling algorithm with probabilistic error guarantees based on Vapnik–Chervonenkis theory.^[10] Later methods such as ABRA and SILVAN used progressive sampling strategies and Rademacher averages to adaptively determine the number of sampled shortest paths needed to achieve a target accuracy.^[11]^[12]

KADABRA is an adaptive approximation algorithm that combines shortest-path sampling with bidirectional breadth-first search and confidence interval estimation.^[13]

Local heuristics have also been proposed as computationally inexpensive alternatives. Ego betweenness computes the betweenness of a vertex within its ego network, consisting only of the vertex, its neighbors, and the edges among those neighbors.^[14] Other lightweight proxy measures, such as the local clustering coefficient-dependent degree centrality (LCCDC), use only local structural properties such as degree and the clustering coefficient to estimate the relative importance of vertices.^[15]

Applications

Social networks

In social network analysis, betweenness centrality can have different implications. From a macroscopic perspective, bridging positions or "structural holes" (indicated by high betweenness centrality) reflect power, because they allow the person on the bridging position to exercise control (e.g., decide whether to share information or not) over the persons it connects between.^[16] From the microscopic perspective of ego networks (i.e., only considering first-degree connections), in online social networks a high betweenness centrality coincides with nominations of closest friends (i.e., strong interpersonal ties), because it reflects social capital investments into the relationship when distant social circles (e.g., family and university) are bridged (often resulting from an introduction by ego).^[17]

River networks

Betweenness centrality has been used to analyze the topological complexity of river networks as well as their use in maritime trade.^[18]^[19]

Related concepts

Betweenness centrality is related to a network's connectivity, in so much as high betweenness vertices have the potential to disconnect graphs if removed (see cut set).

References

↑ Freeman (1977), p. 39.
↑ "Calculating the Betweenness Centrality in Gephi". https://www.youtube.com/watch?v=PuWNYB0u_gM.
↑ Barrat et al. (2004).
↑ Piraveenan, Prokopenko & Hossain (2013).
↑ Brandes (2001), p. 1.
↑ Brandes (2001), p. 9.
↑ Mantrach et al. (2010).
↑ Bader, David A.; Kintali, Shiva; Madduri, Kamesh; Mihail, Milena (2007). "Approximating betweenness centrality". 4863. Springer. pp. 124–137. doi:10.1007/978-3-540-77004-6_10.
↑ Brandes, Ulrik; Pich, Christian (2007). "Centrality estimation in large networks". International Journal of Bifurcation and Chaos 17 (7): 2303–2318. doi:10.1142/S0218127407018403.
↑ Riondato, Matteo; Kornaropoulos, Evgenios M. (2016). "Fast approximation of betweenness centrality through sampling". Data Mining and Knowledge Discovery 30: 438–475. doi:10.1007/s10618-015-0423-0.
↑ Riondato, Matteo; Upfal, Eli (2018). "ABRA: Approximating betweenness centrality in static and dynamic graphs with Rademacher averages". ACM Transactions on Knowledge Discovery from Data 12 (5): 61:1–61:38. doi:10.1145/3230636.
↑ Pellegrina, Leonardo; Vandin, Fabio (2023). "SILVAN: Estimating betweenness centralities with progressive sampling and non-uniform Rademacher bounds". ACM Transactions on Knowledge Discovery from Data 18 (3). doi:10.1145/3628601.
↑ Borassi, Michele; Natale, Emanuele (2019). "KADABRA is an ADaptive Algorithm for Betweenness via Random Approximation". ACM Journal of Experimental Algorithmics 24: 1.2:1–1.2:35. doi:10.1145/3284359.
↑ Everett, Martin; Borgatti, Stephen P. (2005). "Ego network betweenness". Social Networks 27 (1): 31–38. doi:10.1016/j.socnet.2004.11.007.
↑ Meghanathan, Natarajan (2017). "A computationally lightweight and localized centrality metric in lieu of betweenness centrality for complex network analysis". Vietnam Journal of Computer Science 4: 23–38. doi:10.1007/s40595-016-0073-1.
↑ Burt (2009).
↑ Stolz & Schlereth (2021).
↑ Sarker et al. (2019).
↑ Eiland, Murray (2020). Interview with Johannes Preiser-Kapeller. "Networks of Rome, Byzantium, and China". Antiqvvs 4 (1): 41–45. https://www.academia.edu/88994886.

Bibliography

Barrat, A. et al. (2004). "The architecture of complex weighted networks" (in en). Proceedings of the National Academy of Sciences of the United States of America 101 (11): 3747–3752. doi:10.1073/pnas.0400087101. ISSN 0027-8424. PMID 15007165. Bibcode: 2004PNAS..101.3747B.
Borassi, Michele; Natale, Emanuele (2019). "KADABRA is an ADaptive Algorithm for Betweenness via Random Approximation" (in en). ACM Journal of Experimental Algorithmics 24: 1.2:1–1.2:35. doi:10.1145/3284359. ISSN 1084-6654. https://drops.dagstuhl.de/opus/volltexte/2016/6371/.
Brandes, Ulrik (2001). "A faster algorithm for betweenness centrality" (in en). Journal of Mathematical Sociology 25 (2): 163–177. doi:10.1080/0022250x.2001.9990249. http://www.uvm.edu/pdodds/research/papers/others/2001/brandes2001a.pdf. Retrieved March 29, 2021.
Burt, Ronald (2009) (in English). Structural holes: The social structure of competition. Cambridge: Harvard University Press. ISBN 978-0-674-02909-5. OCLC 1041149426. https://books.google.com/books?id=FAhiz9FWDzMC. Retrieved March 29, 2021.
Freeman, Linton (1977). "A set of measures of centrality based on betweenness". Sociometry 40 (1): 35–41. doi:10.2307/3033543.
Goh, K.-I.; Kahng, B.; Kim, D. (2001). "Universal Behavior of Load Distribution in Scale-Free Networks" (in en). Physical Review Letters 87 (27). doi:10.1103/PhysRevLett.87.278701. ISSN 0031-9007. PMID 11800921. Bibcode: 2001PhRvL..87A8701G.
Mantrach, Amin et al. (2010). "The Sum-over-Paths Covariance Kernel: A Novel Covariance Measure between Nodes of a Directed Graph". IEEE Transactions on Pattern Analysis and Machine Intelligence 32 (6): 1112–1126. doi:10.1109/tpami.2009.78. PMID 20431135.
Moxley, Robert L.; Moxley, Nancy F. (1974). "Determining Point-Centrality in Uncontrived Social Networks". Sociometry 37 (1): 122–130. doi:10.2307/2786472.
Newman, Mark E. J. (2010) (in en). Networks: An Introduction. Oxford: Oxford University Press. ISBN 978-0-19-920665-0. OCLC 964511577.
Piraveenan, Mahendra; Prokopenko, Mikhail; Hossain, Liaquat (2013). Holme, Petter. ed. "Percolation Centrality: Quantifying Graph-Theoretic Impact of Nodes during Percolation in Networks" (in en). PLOS ONE 8 (1). doi:10.1371/journal.pone.0053095. ISSN 1932-6203. PMID 23349699. Bibcode: 2013PLoSO...853095P.
Sarker, Shiblu et al. (2019). "Critical Nodes in River Networks" (in en). Scientific Reports 9 (1): 11178. doi:10.1038/s41598-019-47292-4. ISSN 2045-2322. PMID 31371735. Bibcode: 2019NatSR...911178S.
Stolz, Simon; Schlereth, Christian (2021). "Predicting tie strength with ego network structures". Journal of Interactive Marketing 54 (May): 40–52. doi:10.1016/j.intmar.2020.10.001.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Betweenness centrality. Read more

[FOOTNOTEFreeman197739-1] Freeman (1977), p. 39.

[2] "Calculating the Betweenness Centrality in Gephi". https://www.youtube.com/watch?v=PuWNYB0u_gM.

[FOOTNOTEBarratBarthelemyPastor-SatorrasVespignani2004-3] Barrat et al. (2004).

[FOOTNOTEPiraveenanProkopenkoHossain2013-4] Piraveenan, Prokopenko & Hossain (2013).

[FOOTNOTEBrandes20011-5] Brandes (2001), p. 1.

[FOOTNOTEBrandes20019-6] Brandes (2001), p. 9.

[FOOTNOTEMantrachYenCallutFrancoisse2010-7] Mantrach et al. (2010).

[Bader2007-8] Bader, David A.; Kintali, Shiva; Madduri, Kamesh; Mihail, Milena (2007). "Approximating betweenness centrality". 4863. Springer. pp. 124–137. doi:10.1007/978-3-540-77004-6_10.

[BrandesPich2007-9] Brandes, Ulrik; Pich, Christian (2007). "Centrality estimation in large networks". International Journal of Bifurcation and Chaos 17 (7): 2303–2318. doi:10.1142/S0218127407018403.

[RiondatoKornaropoulos2016-10] Riondato, Matteo; Kornaropoulos, Evgenios M. (2016). "Fast approximation of betweenness centrality through sampling". Data Mining and Knowledge Discovery 30: 438–475. doi:10.1007/s10618-015-0423-0.

[RiondatoUpfal2018-11] Riondato, Matteo; Upfal, Eli (2018). "ABRA: Approximating betweenness centrality in static and dynamic graphs with Rademacher averages". ACM Transactions on Knowledge Discovery from Data 12 (5): 61:1–61:38. doi:10.1145/3230636.

[PellegrinaVandin2023-12] Pellegrina, Leonardo; Vandin, Fabio (2023). "SILVAN: Estimating betweenness centralities with progressive sampling and non-uniform Rademacher bounds". ACM Transactions on Knowledge Discovery from Data 18 (3). doi:10.1145/3628601.

[BorassiNatale2019-13] Borassi, Michele; Natale, Emanuele (2019). "KADABRA is an ADaptive Algorithm for Betweenness via Random Approximation". ACM Journal of Experimental Algorithmics 24: 1.2:1–1.2:35. doi:10.1145/3284359.

[EverettBorgatti2005-14] Everett, Martin; Borgatti, Stephen P. (2005). "Ego network betweenness". Social Networks 27 (1): 31–38. doi:10.1016/j.socnet.2004.11.007.

[Meghanathan2017-15] Meghanathan, Natarajan (2017). "A computationally lightweight and localized centrality metric in lieu of betweenness centrality for complex network analysis". Vietnam Journal of Computer Science 4: 23–38. doi:10.1007/s40595-016-0073-1.

[FOOTNOTEBurt2009-16] Burt (2009).

[FOOTNOTEStolzSchlereth2021-17] Stolz & Schlereth (2021).

[FOOTNOTESarkerVeremyevBoginskiSingh2019-18] Sarker et al. (2019).

[19] Eiland, Murray (2020). Interview with Johannes Preiser-Kapeller. "Networks of Rome, Byzantium, and China". Antiqvvs 4 (1): 41–45. https://www.academia.edu/88994886.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

Anonymous

Search

Betweenness centrality

Namespaces

More

Page actions

Contents

Definition

Weighted networks

Percolation centrality

Algorithms

Approximations

Applications

Social networks

River networks

Related concepts

See also

References

Bibliography

Navigation

Navigation

Resources

Help

googletranslator

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Betweenness centrality

Definition

Weighted networks

Percolation centrality

Algorithms

Approximations

Applications

Social networks

River networks

Related concepts

See also

References

Bibliography

Navigation

Wiki tools

Page tools

Other projects

Categories