Exponential dispersion model

From HandWiki
Short description: Set of probability distributions

In probability and statistics, the class of exponential dispersion models (EDM), also called exponential dispersion family (EDF), is a set of probability distributions that represents a generalisation of the natural exponential family.[1][2][3] Exponential dispersion models play an important role in statistical theory, in particular in generalized linear models because they have a special structure which enables deductions to be made about appropriate statistical inference.

Definition

Univariate case

There are two versions to formulate an exponential dispersion model.

Additive exponential dispersion model

In the univariate case, a real-valued random variable [math]\displaystyle{ X }[/math] belongs to the additive exponential dispersion model with canonical parameter [math]\displaystyle{ \theta }[/math] and index parameter [math]\displaystyle{ \lambda }[/math], [math]\displaystyle{ X \sim \mathrm{ED}^*(\theta, \lambda) }[/math], if its probability density function can be written as

[math]\displaystyle{ f_X(x\mid\theta, \lambda) = h^*(\lambda,x) \exp\left(\theta x - \lambda A(\theta)\right) \,\! . }[/math]

Reproductive exponential dispersion model

The distribution of the transformed random variable [math]\displaystyle{ Y=\frac{X}{\lambda} }[/math] is called reproductive exponential dispersion model, [math]\displaystyle{ Y \sim \mathrm{ED}(\mu, \sigma^2) }[/math], and is given by

[math]\displaystyle{ f_Y(y\mid\mu, \sigma^2) = h(\sigma^2,y) \exp\left(\frac{\theta y - A(\theta)}{\sigma^2}\right) \,\! , }[/math]

with [math]\displaystyle{ \sigma^2 = \frac{1}{\lambda} }[/math] and [math]\displaystyle{ \mu = A'(\theta) }[/math], implying [math]\displaystyle{ \theta = (A')^{-1}(\mu) }[/math]. The terminology dispersion model stems from interpreting [math]\displaystyle{ \sigma^2 }[/math] as dispersion parameter. For fixed parameter [math]\displaystyle{ \sigma^2 }[/math], the [math]\displaystyle{ \mathrm{ED}(\mu, \sigma^2) }[/math] is a natural exponential family.

Multivariate case

In the multivariate case, the n-dimensional random variable [math]\displaystyle{ \mathbf{X} }[/math] has a probability density function of the following form[1]

[math]\displaystyle{ f_{\mathbf{X}}(\mathbf{x}|\boldsymbol{\theta}, \lambda) = h(\lambda,\mathbf{x}) \exp\left(\lambda(\boldsymbol\theta^\top \mathbf{x} - A(\boldsymbol\theta))\right) \,\!, }[/math]

where the parameter [math]\displaystyle{ \boldsymbol\theta }[/math] has the same dimension as [math]\displaystyle{ \mathbf{X} }[/math].

Properties

Cumulant-generating function

The cumulant-generating function of [math]\displaystyle{ Y\sim\mathrm{ED}(\mu,\sigma^2) }[/math] is given by

[math]\displaystyle{ K(t;\mu,\sigma^2) = \log\operatorname{E}[e^{tY}] = \frac{A(\theta+\sigma^2 t)-A(\theta)}{\sigma^2}\,\! , }[/math]

with [math]\displaystyle{ \theta = (A')^{-1}(\mu) }[/math]

Mean and variance

Mean and variance of [math]\displaystyle{ Y\sim\mathrm{ED}(\mu,\sigma^2) }[/math] are given by

[math]\displaystyle{ \operatorname{E}[Y]= \mu = A'(\theta) \,, \quad \operatorname{Var}[Y] = \sigma^2 A''(\theta) = \sigma^2 V(\mu)\,\! , }[/math]

with unit variance function [math]\displaystyle{ V(\mu) = A''((A')^{-1}(\mu)) }[/math].

Reproductive

If [math]\displaystyle{ Y_1,\ldots, Y_n }[/math] are i.i.d. with [math]\displaystyle{ Y_i\sim\mathrm{ED}\left(\mu,\frac{\sigma^2}{w_i}\right) }[/math], i.e. same mean [math]\displaystyle{ \mu }[/math] and different weights [math]\displaystyle{ w_i }[/math], the weighted mean is again an [math]\displaystyle{ \mathrm{ED} }[/math] with

[math]\displaystyle{ \sum_{i=1}^n \frac{w_i Y_i}{w_{\bullet}} \sim \mathrm{ED}\left(\mu, \frac{\sigma^2}{w_\bullet}\right) \,\! , }[/math]

with [math]\displaystyle{ w_\bullet = \sum_{i=1}^n w_i }[/math]. Therefore [math]\displaystyle{ Y_i }[/math] are called reproductive.

Unit deviance

The probability density function of an [math]\displaystyle{ \mathrm{ED}(\mu, \sigma^2) }[/math] can also be expressed in terms of the unit deviance [math]\displaystyle{ d(y,\mu) }[/math] as

[math]\displaystyle{ f_Y(y\mid\mu, \sigma^2) = \tilde{h}(\sigma^2,y) \exp\left(-\frac{d(y,\mu)}{2\sigma^2}\right) \,\! , }[/math]

where the unit deviance takes the special form [math]\displaystyle{ d(y,\mu) = y f(\mu) + g(\mu) + h(y) }[/math] or in terms of the unit variance function as [math]\displaystyle{ d(y,\mu) = 2 \int_\mu^y\! \frac{y-t}{V(t)} \,dt }[/math].

Examples

Many very common probability distributions belong to the class of EDMs, among them are: normal distribution, binomial distribution, Poisson distribution, negative binomial distribution, gamma distribution, inverse Gaussian distribution, and Tweedie distribution.

References

  1. 1.0 1.1 Jørgensen, B. (1987). Exponential dispersion models (with discussion). Journal of the Royal Statistical Society, Series B, 49 (2), 127–162.
  2. Jørgensen, B. (1992). The theory of exponential dispersion models and analysis of deviance. Monografias de matemática, no. 51.
  3. Marriott, P. (2005) "Local Mixtures and Exponential Dispersion Models" pdf