Skew normal distribution

From HandWiki
Short description: Probability distribution
Skew Normal
Probability density function
Probability density plots of skew normal distributions
Cumulative distribution function
Cumulative distribution function plots of skew normal distributions
Parameters [math]\displaystyle{ \xi \, }[/math] location (real)
[math]\displaystyle{ \omega \, }[/math] scale (positive, real)
[math]\displaystyle{ \alpha \, }[/math] shape (real)
Support [math]\displaystyle{ x \in (-\infty; +\infty)\! }[/math]
PDF [math]\displaystyle{ \frac{2}{\omega \sqrt{2 \pi}} e^{-\frac{(x-\xi)^2}{2\omega^2}} \int_{-\infty}^{\alpha\left(\frac{x-\xi}{\omega}\right)} \frac{1}{\sqrt{2 \pi}} e^{-\frac{t^2}{2}}\ dt }[/math]
CDF [math]\displaystyle{ \Phi\left(\frac{x-\xi}{\omega}\right)-2T\left(\frac{x-\xi}{\omega},\alpha\right) }[/math]
[math]\displaystyle{ T(h,a) }[/math] is Owen's T function
Mean [math]\displaystyle{ \xi + \omega\delta\sqrt{\frac{2}{\pi}} }[/math] where [math]\displaystyle{ \delta = \frac{\alpha}{\sqrt{1+\alpha^2}} }[/math]
Mode [math]\displaystyle{ \xi + \omega m_o(\alpha) }[/math]
Variance [math]\displaystyle{ \omega^2\left(1 - \frac{2\delta^2}{\pi}\right) }[/math]
Skewness [math]\displaystyle{ \gamma_1 = \frac{4-\pi}{2} \frac{\left(\delta\sqrt{2/\pi}\right)^3}{ \left(1-2\delta^2/\pi\right)^{3/2}} }[/math]
Kurtosis [math]\displaystyle{ 2(\pi - 3)\frac{\left(\delta\sqrt{2/\pi}\right)^4}{\left(1-2\delta^2/\pi\right)^2} }[/math]
MGF [math]\displaystyle{ M_X\left(t\right)=2\exp\left(\xi t+\frac{\omega^2t^2}{2}\right)\Phi\left(\omega\delta t\right) }[/math]
CF [math]\displaystyle{ e^{i t \xi -t^2\omega^2/2}\left(1+i\, \textrm{Erfi}\left(\frac{\delta\omega t}{\sqrt{2}}\right)\right) }[/math]

In probability theory and statistics, the skew normal distribution is a continuous probability distribution that generalises the normal distribution to allow for non-zero skewness.

Definition

Let [math]\displaystyle{ \phi(x) }[/math] denote the standard normal probability density function

[math]\displaystyle{ \phi(x)=\frac{1}{\sqrt{2\pi}}e^{-\frac{x^2}{2}} }[/math]

with the cumulative distribution function given by

[math]\displaystyle{ \Phi(x) = \int_{-\infty}^{x} \phi(t)\ dt = \frac{1}{2} \left[ 1 + \operatorname{erf} \left(\frac{x}{\sqrt{2}}\right)\right] }[/math],

where "erf" is the error function. Then the probability density function (pdf) of the skew-normal distribution with parameter [math]\displaystyle{ \alpha }[/math] is given by

[math]\displaystyle{ f(x) = 2\phi(x)\Phi(\alpha x). \, }[/math]

This distribution was first introduced by O'Hagan and Leonard (1976).[1] Alternative forms to this distribution, with the corresponding quantile function, have been given by Ashour and Abdel-Hamid[2] and by Mudholkar and Hutson.[3]

A stochastic process that underpins the distribution was described by Andel, Netuka and Zvara (1984).[4] Both the distribution and its stochastic process underpinnings were consequences of the symmetry argument developed in Chan and Tong (1986),[5] which applies to multivariate cases beyond normality, e.g. skew multivariate t distribution and others. The distribution is a particular case of a general class of distributions with probability density functions of the form [math]\displaystyle{ f(x) = 2 \phi(x) \Phi(x) }[/math] where [math]\displaystyle{ \phi(\cdot) }[/math] is any PDF symmetric about zero and [math]\displaystyle{ \Phi(\cdot) }[/math] is any CDF whose PDF is symmetric about zero.[6]

To add location and scale parameters to this, one makes the usual transform [math]\displaystyle{ x\rightarrow\frac{x-\xi}{\omega} }[/math]. One can verify that the normal distribution is recovered when [math]\displaystyle{ \alpha = 0 }[/math], and that the absolute value of the skewness increases as the absolute value of [math]\displaystyle{ \alpha }[/math] increases. The distribution is right skewed if [math]\displaystyle{ \alpha\gt 0 }[/math] and is left skewed if [math]\displaystyle{ \alpha\lt 0 }[/math]. The probability density function with location [math]\displaystyle{ \xi }[/math], scale [math]\displaystyle{ \omega }[/math], and parameter [math]\displaystyle{ \alpha }[/math] becomes

[math]\displaystyle{ f(x) = \frac{2}{\omega}\phi\left(\frac{x-\xi}{\omega}\right)\Phi\left(\alpha \left(\frac{x-\xi}{\omega}\right)\right). \, }[/math]

The skewness ([math]\displaystyle{ \gamma_1 }[/math]) of the distribution is limited to slightly less than the interval [math]\displaystyle{ (-1,1) }[/math] (see Estimation).

As has been shown,[7] the mode (maximum) [math]\displaystyle{ m_o }[/math] of the distribution is unique. For general [math]\displaystyle{ \alpha }[/math] there is no analytic expression for [math]\displaystyle{ m_o }[/math], but a quite accurate (numerical) approximation is:

[math]\displaystyle{ \begin{align} \delta &= \frac{\alpha}{\sqrt{1+\alpha^2}} \\ m_o (\alpha) &\approx \sqrt{\frac{2}{\pi}}\delta - \left(1-\frac{\pi}{4}\right) \frac{\left(\sqrt{\frac{2}{\pi}}\delta\right)^3}{1-\frac{2}{\pi}\delta^2} - \frac{\mathrm{sgn}(\alpha)}{2} e^{\left(-\frac{2 \pi}{|\alpha |}\right)} \\ \end{align} }[/math]

Estimation

Maximum likelihood estimates for [math]\displaystyle{ \xi }[/math], [math]\displaystyle{ \omega }[/math], and [math]\displaystyle{ \alpha }[/math] can be computed numerically, but no closed-form expression for the estimates is available unless [math]\displaystyle{ \alpha=0 }[/math]. In contrast, the method of moments has a closed-form expression since the skewness equation can be inverted with

[math]\displaystyle{ |\delta| = \sqrt{\frac{\pi}{2} \frac{ |\gamma_1|^{\frac{2}{3}} }{|\gamma_1|^{\frac{2}{3}}+((4-\pi)/2)^\frac{2}{3}}} }[/math]

where [math]\displaystyle{ \delta = \frac{\alpha}{\sqrt{1+\alpha^2}} }[/math] and the sign of [math]\displaystyle{ \delta }[/math] is the same as the sign of [math]\displaystyle{ \gamma_1 }[/math]. Consequently, [math]\displaystyle{ \alpha = \frac{\delta}{\sqrt{1-\delta^2}} }[/math], [math]\displaystyle{ \omega = \frac{\sigma}{\sqrt{1-2\delta^2/\pi}} }[/math], and [math]\displaystyle{ \xi=\mu-\omega\delta\sqrt{\frac{2}{\pi}} }[/math] where [math]\displaystyle{ \mu }[/math] and [math]\displaystyle{ \sigma }[/math] are the mean and standard deviation. As long as the sample skewness [math]\displaystyle{ \hat{\gamma}_1 }[/math] is not too large, these formulas provide method of moments estimates [math]\displaystyle{ \hat\alpha }[/math], [math]\displaystyle{ \hat\omega }[/math], and [math]\displaystyle{ \hat\xi }[/math] based on a sample's [math]\displaystyle{ \hat\mu }[/math], [math]\displaystyle{ \hat\sigma }[/math], and [math]\displaystyle{ \hat\gamma_1 }[/math].

The maximum (theoretical) skewness is obtained by setting [math]\displaystyle{ {\delta = 1} }[/math] in the skewness equation, giving [math]\displaystyle{ \gamma_1 \approx 0.9952717 }[/math]. However it is possible that the sample skewness is larger, and then [math]\displaystyle{ \alpha }[/math] cannot be determined from these equations. When using the method of moments in an automatic fashion, for example to give starting values for maximum likelihood iteration, one should therefore let (for example) [math]\displaystyle{ |\hat{\gamma}_1| = \min(0.99, |(1/n)\sum{((x_i-\hat\mu)/\hat\sigma)^3}|) }[/math].

Concern has been expressed about the impact of skew normal methods on the reliability of inferences based upon them.[8]

Related distributions

The exponentially modified normal distribution is another 3-parameter distribution that is a generalization of the normal distribution to skewed cases. The skew normal still has a normal-like tail in the direction of the skew, with a shorter tail in the other direction; that is, its density is asymptotically proportional to [math]\displaystyle{ e^{-kx^2} }[/math] for some positive [math]\displaystyle{ k }[/math]. Thus, in terms of the seven states of randomness, it shows "proper mild randomness". In contrast, the exponentially modified normal has an exponential tail in the direction of the skew; its density is asymptotically proportional to [math]\displaystyle{ e^{-k|x|} }[/math]. In the same terms, it shows "borderline mild randomness".

Thus, the skew normal is useful for modeling skewed distributions which nevertheless have no more outliers than the normal, while the exponentially modified normal is useful for cases with an increased incidence of outliers in (just) one direction.

See also

References

  1. O'Hagan, A.; Leonard, Tom (1976). "Bayes estimation subject to uncertainty about parameter constraints". Biometrika 63 (1): 201–203. doi:10.1093/biomet/63.1.201. ISSN 0006-3444. 
  2. Ashour, Samir K.; Abdel-hameed, Mahmood A. (October 2010). "Approximate skew normal distribution". Journal of Advanced Research 1 (4): 341–350. doi:10.1016/j.jare.2010.06.004. ISSN 2090-1232. 
  3. Mudholkar, Govind S.; Hutson, Alan D. (February 2000). "The epsilon–skew–normal distribution for analyzing near-normal data". Journal of Statistical Planning and Inference 83 (2): 291–309. doi:10.1016/s0378-3758(99)00096-8. ISSN 0378-3758. 
  4. Andel, J., Netuka, I. and Zvara, K. (1984) On threshold autoregressive processes. Kybernetika, 20, 89-106
  5. Chan, K. S.; Tong, H. (March 1986). "A note on certain integral equations associated with non-linear time series analysis". Probability Theory and Related Fields 73 (1): 153–158. doi:10.1007/bf01845999. ISSN 0178-8051. 
  6. Azzalini, A. (1985). "A class of distributions which includes the normal ones". Scandinavian Journal of Statistics 12: 171–178. 
  7. Azzalini, Adelchi; Capitanio, Antonella (2014). The skew-normal and related families. pp. 32–33. ISBN 978-1-107-02927-9. 
  8. Pewsey, Arthur. "Problems of inference for Azzalini's skewnormal distribution." Journal of Applied Statistics 27.7 (2000): 859-870

External links