Wiener index

From HandWiki
Short description: Topological index of a molecule

In chemical graph theory, the Wiener index (also Wiener number) introduced by Harry Wiener, is a topological index of a molecule, defined as the sum of the lengths of the shortest paths between all pairs of vertices in the chemical graph representing the non-hydrogen atoms in the molecule.[1]

Wiener index can be used for the representation of computer networks and enhancing lattice hardware security.

History

The Wiener index is named after Harry Wiener, who introduced it in 1947; at the time, Wiener called it the "path number".[2] It is the oldest topological index related to molecular branching.[3] Based on its success, many other topological indexes of chemical graphs, based on information in the distance matrix of the graph, have been developed subsequently to Wiener's work.[4]

The same quantity has also been studied in pure mathematics, under various names including the gross status,[5] the distance of a graph,[6] and the transmission.[7] The Wiener index is also closely related to the closeness centrality of a vertex in a graph, a quantity inversely proportional to the sum of all distances between the given vertex and all other vertices that has been frequently used in sociometry and the theory of social networks.[6]

Example

Butane (C4H10) has two different structural isomers: n-butane, with a linear structure of four carbon atoms, and isobutane, with a branched structure. The chemical graph for n-butane is a four-vertex path graph, and the chemical graph for isobutane is a tree with one central vertex connected to three leaves.

The n-butane molecule has three pairs of vertices at distance one from each other, two pairs at distance two, and one pair at distance three, so its Wiener index is

[math]\displaystyle{ 3\times 1 + 2\times 2 + 1\times 3 = 10. }[/math]

The isobutane molecule has three pairs of vertices at distances one from each other (the three leaf-center pairs), and three pairs at distance two (the leaf-leaf pairs). Therefore, its Wiener index is

[math]\displaystyle{ 3\times 1 + 3\times 2 = 9. }[/math]

These numbers are instances of formulas for special cases of the Wiener index: it is [math]\displaystyle{ (n^3-n)/6 }[/math] for any [math]\displaystyle{ n }[/math]-vertex path graph such as the graph of n-butane,[8] and [math]\displaystyle{ (n-1)^2 }[/math] for any [math]\displaystyle{ n }[/math]-vertex star such as the graph of isobutane.[9]

Thus, even though these two molecules have the same chemical formula, and the same numbers of carbon-carbon and carbon-hydrogen bonds, their different structures give rise to different Wiener indices.

Relation to chemical properties

Wiener showed that the Wiener index number is closely correlated with the boiling points of alkane molecules.[2] Later work on quantitative structure–activity relationships showed that it is also correlated with other quantities including the parameters of its critical point,[10] the density, surface tension, and viscosity of its liquid phase,[11] and the van der Waals surface area of the molecule.[12]

Calculation in arbitrary graphs

The Wiener index may be calculated directly using an algorithm for computing all pairwise distances in the graph. When the graph is unweighted (so the length of a path is just its number of edges), these distances may be calculated by repeating a breadth-first search algorithm, once for each starting vertex.[13] The total time for this approach is O(nm), where n is the number of vertices in the graph and m is its number of edges.

For weighted graphs, one may instead use the Floyd–Warshall algorithm[13][14][15] or Johnson's algorithm,[16] with running time O(n3) or O(nm + n2 log n) respectively. Alternative but less efficient algorithms based on repeated matrix multiplication have also been developed within the chemical informatics literature.[17][18]

Calculation in special types of graph

When the underlying graph is a tree (as is true for instance for the alkanes originally studied by Wiener), the Wiener index may be calculated more efficiently. If the graph is partitioned into two subtrees by removing a single edge e, then its Wiener index is the sum of the Wiener indices of the two subtrees, together with a third term representing the paths that pass through e. This third term may be calculated in linear time by computing the sum of distances of all vertices from e within each subtree and multiplying the two sums.[19] This divide and conquer algorithm can be generalized from trees to graphs of bounded treewidth, and leads to near-linear-time algorithms for such graphs.[20]

An alternative method for calculating the Wiener index of a tree, by Bojan Mohar and Tomaž Pisanski, works by generalizing the problem to graphs with weighted vertices, where the weight of a path is the product of its length with the weights of its two endpoints. If v is a leaf vertex of the tree then the Wiener index of the tree may be calculated by merging v with its parent (adding their weights together), computing the index of the resulting smaller tree, and adding a simple correction term for the paths that pass through the edge from v to its parent. By repeatedly removing leaves in this way, the Wiener index may be calculated in linear time.[13]

For graphs that are constructed as products of simpler graphs, the Wiener index of the product graph can often be computed by a simple formula that combines the indices of its factors.[21] Benzenoids (graphs formed by gluing regular hexagons edge-to-edge) can be embedded isometrically into the Cartesian product of three trees, allowing their Wiener indices to be computed in linear time by using the product formula together with the linear time tree algorithm.[22]

Inverse problem

(Gutman Yeh) considered the problem of determining which numbers can be represented as the Wiener index of a graph.[23] They showed that all but two positive integers have such a representation; the two exceptions are the numbers 2 and 5, which are not the Wiener index of any graph. For graphs that must be bipartite, they found that again almost all integers can be represented, with a larger set of exceptions: none of the numbers in the set

{2, 3, 5, 6, 7, 11, 12, 13, 15, 17, 19, 33, 37, 39}

can be represented as the Wiener index of a bipartite graph.

Gutman and Yeh conjectured, but were unable to prove, a similar description of the numbers that can be represented as Wiener indices of trees, with a set of 49 exceptional values:

2, 3, 5, 6, 7, 8, 11, 12, 13, 14, 15, 17, 19, 21, 22, 23, 24, 26, 27, 30, 33, 34, 37, 38, 39, 41, 43, 45, 47, 51, 53, 55, 60, 61, 69, 73, 77, 78, 83, 85, 87, 89, 91, 99, 101, 106, 113, 147, 159 (sequence A122686 in the OEIS)

The conjecture was later proven by Wagner, Wang, and Yu.[24][25]

References

  1. Rouvray, Dennis H. (2002), "The rich legacy of half a century of the Wiener index", in Rouvray, Dennis H.; King, Robert Bruce, Topology in Chemistry: Discrete Mathematics of Molecules, Horwood Publishing, pp. 16–37, ISBN 9781898563761, https://books.google.com/books?id=a-ZkUeVLdO8C&pg=PA16 .
  2. 2.0 2.1 Wiener, H. (1947), "Structural determination of paraffin boiling points", Journal of the American Chemical Society 1 (69): 17–20, doi:10.1021/ja01193a005, PMID 20291038 .
  3. Todeschini, Roberto; Consonni, Viviana (2000), Handbook of Molecular Descriptors, Wiley, ISBN 3-527-29913-0 .
  4. (Rouvray 2002). See in particular Table 2 on p. 32.
  5. "Status and contrastatus", Sociometry 22 (1): 23–43, 1959, doi:10.2307/2785610 
  6. 6.0 6.1 Entringer, R. C.; Jackson, D. E.; Snyder, D. A. (1976), "Distance in graphs", Czechoslovak Mathematical Journal 26 (101): 283–296, doi:10.21136/CMJ.1976.101401 .
  7. Šoltés, Ľubomír (1991), "Transmission in graphs: a bound and vertex removing", Mathematica Slovaca 41 (1): 11–16 .
  8. As (Rouvray 2002) describes, this formula was already given by (Wiener 1947).
  9. Bonchev, D.; Trinajstić, N. (1977), "Information theory, distance matrix, and molecular branching", Journal of Chemical Physics 67 (10): 4517–4533, doi:10.1063/1.434593, Bibcode1977JChPh..67.4517B .
  10. Stiel, Leonard I.; Thodos, George (1962), "The normal boiling points and critical constants of saturated aliphatic hydrocarbons", AIChE Journal 8 (4): 527–529, doi:10.1002/aic.690080421 .
  11. Rouvray, D. H.; Crafford, B. C. (1976), "The dependence of physical-chemical properties on topological factors", South African Journal of Science 72: 47 .
  12. Gutman, Ivan; Körtvélyesi, T. (1995), "Wiener indices and molecular surfaces", Zeitschrift für Naturforschung 50a (7): 669–671, doi:10.1515/zna-1995-0707, Bibcode1995ZNatA..50..669G .
  13. 13.0 13.1 13.2 "How to compute the Wiener index of a graph", Journal of Mathematical Chemistry 2 (3): 267–277, 1988, doi:10.1007/BF01167206 .
  14. "Algorithm 97: Shortest Path", Communications of the ACM 5 (6): 345, June 1962, doi:10.1145/367766.368168 .
  15. Warshall, Stephen (January 1962), "A theorem on Boolean matrices", Journal of the ACM 9 (1): 11–12, doi:10.1145/321105.321107 .
  16. "Efficient algorithms for shortest paths in sparse networks", Journal of the ACM 24 (1): 1–13, 1977, doi:10.1145/321992.321993 .
  17. Bersohn, Malcom (1983), "A fast algorithm for calculation of the distance matrix of a molecule", Journal of Computational Chemistry 4 (1): 110–113, doi:10.1002/jcc.540040115 .
  18. Müller, W. R.; Szymanski, K.; Knop, J. V.; Trinajstić, N. (1987), "An algorithm for construction of the molecular distance matrix", Journal of Computational Chemistry 8 (2): 170–173, doi:10.1002/jcc.540080209 .
  19. Canfield, E. R.; Robinson, R. W.; Rouvray, D. H. (1985), "Determination of the Wiener molecular branching index for the general tree", Journal of Computational Chemistry 6 (6): 598–609, doi:10.1002/jcc.540060613 .
  20. Cabello, Sergio; Knauer, Christian (2009), "Algorithms for graphs of bounded treewidth via orthogonal range searching", Computational Geometry 42 (9): 815–824, doi:10.1016/j.comgeo.2009.02.001 .
  21. Yeh, Yeong Nan; Gutman, Ivan (1994), "On the sum of all distances in composite graphs", Discrete Mathematics 135 (1–3): 359–365, doi:10.1016/0012-365X(93)E0092-I .
  22. Chepoi, Victor; Klavžar, Sandi (1997), "The Wiener index and the Szeged index of benzenoid systems in linear time", Journal of Chemical Information and Computer Sciences 37 (4): 752–755, doi:10.1021/ci9700079 . For earlier algorithms for benzenoids based on their partial cube structure, see Klavžar, Sandi; Gutman, Ivan (1995), "Labeling of benzenoid systems which reflects the vertex-distance relations", Journal of Chemical Information and Computer Sciences 35 (3): 590–593, doi:10.1021/ci00025a030, http://www.fmf.uni-lj.si/~klavzar/preprints/labeling-benzi.pdf .
  23. Gutman, Ivan; Yeh, Yeong-Nan (1995), "The sum of all distances in bipartite graphs", Mathematica Slovaca 45 (4): 327–334 .
  24. Wagner, Stephan G. (2006), "A class of trees and its Wiener index", Acta Applicandae Mathematicae 91 (2): 119–132, doi:10.1007/s10440-006-9026-5 .
  25. Wang, Hua; Yu, Guang (2006), "All but 49 numbers are Wiener indices of trees", Acta Applicandae Mathematicae 92 (1): 15–20, doi:10.1007/s10440-006-9037-2 .

External links