Kirkwood approximation

From HandWiki
Revision as of 04:29, 9 May 2022 by imported>Len Stevenson (over-write)

The Kirkwood superposition approximation was introduced in 1935 by John G. Kirkwood as a means of approximating a joint discrete probability distribution by its lower-order marginal distributions.[1] The Kirkwood approximation for a joint probability mass function [math]\displaystyle{ P(x_{1},x_{2},\ldots ,x_{n}) }[/math] is given by

[math]\displaystyle{ P^{\prime }(x_1,x_2,\ldots ,x_n) = \prod_{i = 1}^{n -1}\left[\prod_{\mathcal{T}_i\subseteq \mathcal{V}}p(\mathcal{T}_i)\right]^{(-1)^{n-1-i}} = \frac{\prod_{\mathcal{T} _{n-1}\subseteq \mathcal{V}}p(\mathcal{T}_{n-1})}{\frac{\prod_{\mathcal{T} _{n-2}\subseteq \mathcal{V}}p(\mathcal{T}_{n-2})}{\frac{\vdots }{\prod_{\mathcal{ T}_1\subseteq \mathcal{V}}p(\mathcal{T}_1)}}} }[/math]

where

[math]\displaystyle{ \prod_{\mathcal{T}_i\subseteq \mathcal{V}}p(\mathcal{T}_i) }[/math]

is the product of the marginal probabilities over all subsets of variables of size [math]\displaystyle{ i }[/math] in the variable set [math]\displaystyle{ \scriptstyle\mathcal{V} }[/math]. This kind of formula has been considered by Watanabe (1960) and, according to Watanabe, also by Robert Fano. For the three-variable case, it reduces to

[math]\displaystyle{ P^\prime(x_1,x_2,x_3)=\frac{p(x_1,x_2)p(x_2,x_3)p(x_1,x_3)}{p(x_1)p(x_{2})p(x_3)} }[/math]
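The general formula and its three-variable special case can be sketched in plain Python as follows. This is a minimal illustration, not a reference implementation: the helper names `marginal` and `kirkwood`, the dictionary representation of the joint distribution, and the example weights are all assumptions made here, and the code assumes every listed outcome has strictly positive marginals.

```python
from itertools import combinations, product


def marginal(joint, keep):
    """Marginalize a joint pmf (dict: outcome tuple -> probability)
    onto the variable indices listed in `keep`."""
    out = {}
    for outcome, prob in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + prob
    return out


def kirkwood(joint, n):
    """Kirkwood approximation P' for an n-variable joint pmf.

    Marginals over subsets of size i enter with exponent (-1)^(n-1-i),
    for i = 1 .. n-1. Assumes the marginals of every listed outcome
    are strictly positive (otherwise the division fails).
    """
    tables = {
        subset: marginal(joint, subset)
        for size in range(1, n)
        for subset in combinations(range(n), size)
    }
    approx = {}
    for outcome in joint:
        value = 1.0
        for subset, table in tables.items():
            p = table[tuple(outcome[j] for j in subset)]
            if (n - 1 - len(subset)) % 2 == 0:
                value *= p   # exponent +1: numerator
            else:
                value /= p   # exponent -1: denominator
        approx[outcome] = value
    return approx


# Hypothetical joint pmf over three binary variables (weights 1..8, normalized).
joint = {
    outcome: weight / 36
    for outcome, weight in zip(product((0, 1), repeat=3), range(1, 9))
}
approx = kirkwood(joint, 3)
total = sum(approx.values())  # need not equal 1
```

For `n = 3` the loop multiplies the three pairwise marginals and divides by the three single-variable marginals, which is exactly the expression above.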

The Kirkwood approximation does not generally produce a valid probability distribution, because its values need not sum to 1 (the normalization condition is violated). Watanabe claims that for this reason informational expressions of this type are not meaningful, and indeed very little has been written about the properties of this measure. The Kirkwood approximation is the probabilistic counterpart of the interaction information.
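
As a small illustration of the normalization failure, consider a hypothetical case (chosen here for simplicity, not taken from the cited sources) of three binary variables that are always equal, with [math]\displaystyle{ P(0,0,0) = P(1,1,1) = \tfrac{1}{2} }[/math]. Every single-variable marginal equals 1/2, and every pairwise marginal assigns probability 1/2 to each of (0,0) and (1,1) and zero to the mixed pairs. The three-variable formula then gives

[math]\displaystyle{ P^\prime(0,0,0) = P^\prime(1,1,1) = \frac{\left(\tfrac{1}{2}\right)^3}{\left(\tfrac{1}{2}\right)^3} = 1 }[/math]

and zero for the other six outcomes, since each of them contains at least one mixed pair. The values of [math]\displaystyle{ P^\prime }[/math] therefore sum to 2 rather than 1.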

Judea Pearl (1988, §3.2.4) indicates that an expression of this type can be exact in the case of a decomposable model, that is, a probability distribution that admits a graph structure whose cliques form a tree. In such cases, the numerator contains the product of the intra-clique joint distributions and the denominator contains the product of the clique intersection distributions.
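
The decomposable case can be checked numerically on the simplest example, a three-variable chain whose cliques are {x1, x2} and {x2, x3} with intersection {x2}. The sketch below is an illustration under assumptions made here: the conditional probability tables are hypothetical values, and `marginal` is a helper defined only for this example. It compares the clique product divided by the intersection marginal against the true joint distribution.

```python
from itertools import product


def marginal(joint, keep):
    """Marginalize a joint pmf (dict: outcome tuple -> probability)
    onto the variable indices listed in `keep`."""
    out = {}
    for outcome, prob in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + prob
    return out


# Hypothetical chain x1 -> x2 -> x3 over binary variables.
p_x1 = {0: 0.6, 1: 0.4}
p_x2_given_x1 = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}
p_x3_given_x2 = {0: {0: 0.7, 1: 0.3}, 1: {0: 0.5, 1: 0.5}}

joint = {
    (a, b, c): p_x1[a] * p_x2_given_x1[a][b] * p_x3_given_x2[b][c]
    for a, b, c in product((0, 1), repeat=3)
}

p12 = marginal(joint, (0, 1))   # clique {x1, x2}
p23 = marginal(joint, (1, 2))   # clique {x2, x3}
p2 = marginal(joint, (1,))      # clique intersection {x2}

# Clique marginals in the numerator, intersection marginal in the denominator.
clique_form = {
    (a, b, c): p12[(a, b)] * p23[(b, c)] / p2[(b,)]
    for (a, b, c) in joint
}

# Largest pointwise deviation from the true joint; zero up to rounding.
max_error = max(abs(clique_form[k] - joint[k]) for k in joint)
```

Note that this clique-based expression omits the pairwise term for (x1, x3), which is not a clique of the chain, so it differs from the full three-variable Kirkwood formula unless x1 and x3 are also marginally independent.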

References

  1. Kirkwood, John G. (1935). "Statistical Mechanics of Fluid Mixtures". The Journal of Chemical Physics (AIP Publishing) 3 (5): 300–313. doi:10.1063/1.1749657. ISSN 0021-9606. Bibcode: 1935JChPh...3..300K.
  • Jakulin, A. & Bratko, I. (2004), Quantifying and visualizing attribute interactions: An approach based on entropy, Journal of Machine Learning Research, (submitted) pp. 38–43.
  • Matsuda, Hiroyuki (2000-09-01). "Physical nature of higher-order mutual information: Intrinsic correlations and frustration". Physical Review E (American Physical Society (APS)) 62 (3): 3096–3102. doi:10.1103/physreve.62.3096. ISSN 1063-651X. PMID 11088803. Bibcode: 2000PhRvE..62.3096M.
  • Pearl, J. (1988). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. San Mateo, CA: Morgan Kaufmann/Elsevier. doi:10.1016/c2009-0-27609-4. ISBN 978-0-08-051489-5. 
  • Watanabe, Satosi (1960). "Information Theoretical Analysis of Multivariate Correlation". IBM Journal of Research and Development (IBM) 4 (1): 66–82. doi:10.1147/rd.41.0066. ISSN 0018-8646.