Hierarchical closeness

From HandWiki

Hierarchical closeness (HC) is a structural centrality measure used in network theory or graph theory. It is extended from closeness centrality to rank how centrally located a node is in a directed network. While the original closeness centrality of a directed network considers the most important node to be that with the least total distance from all other nodes, hierarchical closeness evaluates the most important node as the one which reaches the most nodes by the shortest paths. The hierarchical closeness explicitly includes information about the range of other nodes that can be affected by the given node. In a directed network [math]\displaystyle{ G(V, A) }[/math] where [math]\displaystyle{ V }[/math] is the set of nodes and [math]\displaystyle{ A }[/math] is the set of interactions, hierarchical closeness of a node [math]\displaystyle{ i }[/math][math]\displaystyle{ V }[/math] called [math]\displaystyle{ C_{hc}(i) }[/math] was proposed by Tran and Kwon[1] as follows:

[math]\displaystyle{ C_{hc}(i) = N_R(i) + C_{(clo-i)}(i) }[/math]

where:

  • [math]\displaystyle{ N_R(i) \in [0, |V| - 1] }[/math] is the reachability of a node [math]\displaystyle{ i }[/math] defined by [math]\displaystyle{ N_R(i) = |\{j \in V: \exists }[/math] a path from [math]\displaystyle{ i }[/math] to [math]\displaystyle{ j\}| }[/math], and
  • [math]\displaystyle{ C_{clo}(i) }[/math] is the normalized form of original closeness (Sabidussi, 1966).[2] It can use a variant definition of closeness[3] as follows: [math]\displaystyle{ C_{clo-i}(i)=\frac{1}{|V|-1} \sum_{j \in V\setminus\{i\}} \frac{1}{d(i,j)} }[/math] where [math]\displaystyle{ d(i, j) }[/math] is the distance of the shortest path, if any, from [math]\displaystyle{ i }[/math] to [math]\displaystyle{ j }[/math]; otherwise, [math]\displaystyle{ d(i, j) }[/math] is specified as an infinite value.

In the formula, [math]\displaystyle{ N_R(i) }[/math] represents the number of nodes in [math]\displaystyle{ V }[/math] that can be reachable from [math]\displaystyle{ i }[/math]. It can also represent the hierarchical position of a node in a directed network. It notes that if [math]\displaystyle{ N_R(i) = 0 }[/math], then [math]\displaystyle{ C_{hc}(i) = 0 }[/math] because [math]\displaystyle{ C_{(clo-i)}(i) }[/math] is [math]\displaystyle{ 0 }[/math]. In cases where [math]\displaystyle{ N_R(i) \gt 0 }[/math], the reachability is a dominant factor because [math]\displaystyle{ N_R(i) \geq 1 }[/math] but [math]\displaystyle{ C_{(clo-i)}(i) \lt 1 }[/math]. In other words, the first term indicates the level of the global hierarchy and the second term presents the level of the local centrality.

Application

Hierarchical closeness can be used in biological networks to rank the risk of genes to carry diseases.[1]

References

  1. Tran, T.-D. and Kwon, Y.-K. Hierarchical closeness efficiently predicts disease genes in a directed signaling network, Computational biology and chemistry.
  2. Sabidussi, G. (1966) The centrality index of a graph, Psychometrika, 31, 581-603 %G English
  3. Opsahl, T., Agneessens, F. and Skvoretz, J. (2010) Node centrality in weighted networks: Generalizing degree and shortest paths, Social networks, 32, 245-251.