Wiener–Khinchin theorem

From HandWiki
Short description: Spectral decomposition theorem of stationary processes' autocorrelations

In applied mathematics, the Wiener–Khinchin theorem or Wiener–Khintchine theorem, also known as the Wiener–Khinchin–Einstein theorem or the Khinchin–Kolmogorov theorem, states that the autocorrelation function of a wide-sense-stationary random process has a spectral decomposition given by the power spectral density of that process.[1][2][3][4][5][6][7]

History

Norbert Wiener proved this theorem for the case of a deterministic function in 1930;[8] Aleksandr Khinchin later formulated an analogous result for stationary stochastic processes and published that probabilistic analogue in 1934.[9][10] Albert Einstein explained, without proofs, the idea in a brief two-page memo in 1914.[11][12]

Continuous-time process

For continuous time, the Wiener–Khinchin theorem says that if [math]\displaystyle{ x }[/math] is a wide-sense-stationary random process whose autocorrelation function (sometimes called autocovariance), defined in terms of the statistical expected value as [math]\displaystyle{ r_{xx}(\tau) = \mathbb{E}\big[x(t)^*\cdot x(t - \tau)\big] }[/math], exists and is finite at every lag [math]\displaystyle{ \tau }[/math], then there exists a monotone function [math]\displaystyle{ F(f) }[/math] in the frequency domain [math]\displaystyle{ -\infty \lt f \lt \infty }[/math], or equivalently a non-negative Radon measure [math]\displaystyle{ \mu }[/math] on the frequency domain, such that

[math]\displaystyle{ r_{xx} (\tau) = \int_{-\infty}^\infty e^{2\pi i\tau f}\mu(df) = \int_{-\infty}^\infty e^{2\pi i\tau f} dF(f) , }[/math]

where the integral is a Riemann–Stieltjes integral.[1][13] The asterisk denotes the complex conjugate and can be omitted if the random process is real-valued. This is a kind of spectral decomposition of the autocorrelation function. F is called the power spectral distribution function and is a statistical distribution function. It is sometimes called the integrated spectrum.
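As an illustration of a spectral distribution function that has jumps rather than a density, consider a sinusoid with a uniformly distributed random phase. This example is not taken from the cited sources; the amplitude, frequency and function names are assumptions chosen for the sketch. For [math]\displaystyle{ x(t) = A\cos(2\pi f_0 t + \varphi) }[/math] the autocorrelation is [math]\displaystyle{ \tfrac{A^2}{2}\cos(2\pi f_0 \tau) }[/math], and F has two jumps of size [math]\displaystyle{ A^2/4 }[/math] at [math]\displaystyle{ \pm f_0 }[/math]. The following minimal Python sketch checks this numerically:

```python
import numpy as np


def autocorr_random_phase(t, tau, amplitude=2.0, f0=3.0, n_phases=64):
    """Ensemble average E[x(t) x(t - tau)] for x(t) = A cos(2*pi*f0*t + phi),
    with phi uniform on [0, 2*pi).

    The expectation over phi is evaluated on a uniform phase grid, which is
    exact for this trigonometric integrand when n_phases > 2.
    """
    phi = 2 * np.pi * np.arange(n_phases) / n_phases
    x_t = amplitude * np.cos(2 * np.pi * f0 * t + phi)
    x_lag = amplitude * np.cos(2 * np.pi * f0 * (t - tau) + phi)
    return np.mean(x_t * x_lag)


def autocorr_from_spectrum(tau, amplitude=2.0, f0=3.0):
    """Stieltjes integral of exp(2*pi*i*tau*f) against F, where F is assumed
    to have two jumps of size A^2 / 4, at f = -f0 and at f = +f0."""
    jump = amplitude**2 / 4
    value = jump * np.exp(2j * np.pi * tau * f0) + jump * np.exp(-2j * np.pi * tau * f0)
    return value.real


if __name__ == "__main__":
    # The ensemble average does not depend on t (wide-sense stationarity)
    # and agrees with the spectral representation at every lag tried here.
    for t in (0.0, 0.37, 1.9):
        for tau in (0.0, 0.05, 0.4):
            lhs = autocorr_random_phase(t, tau)
            rhs = autocorr_from_spectrum(tau)
            print(f"t={t:4.2f} tau={tau:4.2f}  {lhs:+.6f}  {rhs:+.6f}")
```

Because F is a step function in this example, the measure is not absolutely continuous and no ordinary power spectral density exists.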

The Fourier transform of [math]\displaystyle{ x(t) }[/math] does not exist in general, because random functions are generally neither square-integrable nor absolutely integrable. Nor is [math]\displaystyle{ r_{xx} }[/math] assumed to be absolutely integrable, so it need not have a Fourier transform either.

However, if the measure [math]\displaystyle{ \mu(df) = dF(f) }[/math] is absolutely continuous, for example, if the process is purely indeterministic, then [math]\displaystyle{ F }[/math] is differentiable almost everywhere and we can write [math]\displaystyle{ \mu(df) = S(f) df }[/math]. In this case, one can determine [math]\displaystyle{ S(f) }[/math], the power spectral density of [math]\displaystyle{ x(t) }[/math], by taking the averaged derivative of [math]\displaystyle{ F }[/math]. Because the left and right derivatives of [math]\displaystyle{ F }[/math] exist everywhere, we can put [math]\displaystyle{ S(f) = \frac12 \left(\lim_{\varepsilon \downarrow 0} \frac1\varepsilon \big(F(f + \varepsilon) - F(f)\big) + \lim_{\varepsilon \uparrow 0} \frac1\varepsilon \big(F(f + \varepsilon) - F(f)\big)\right) }[/math] everywhere[14] (so that F is the integral of its averaged derivative[15]), and the theorem simplifies to

[math]\displaystyle{ r_{xx} (\tau) = \int_{-\infty}^\infty e^{2\pi i\tau f} \, S(f)df. }[/math]

If now one assumes that r and S satisfy the necessary conditions for Fourier inversion to be valid, the Wiener–Khinchin theorem takes the simple form of saying that r and S are a Fourier-transform pair, and

[math]\displaystyle{ S(f) = \int_{-\infty}^\infty r_{xx} (\tau) e^{-2\pi if\tau} \,d\tau. }[/math]
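As a worked example of such a Fourier-transform pair, take the exponentially decaying autocorrelation [math]\displaystyle{ r_{xx}(\tau) = e^{-a|\tau|} }[/math], whose transform is the Lorentzian [math]\displaystyle{ S(f) = 2a / (a^2 + (2\pi f)^2) }[/math]. The choice of this autocorrelation, the decay rate and the function names are assumptions made for illustration. The sketch below approximates the integral by a Riemann sum on a truncated lag grid and compares it with the closed form:

```python
import numpy as np


def psd_from_autocorr(f, a=1.5, tau_max=40.0, n=400001):
    """Approximate S(f) = integral of r(tau) * exp(-2*pi*i*f*tau) d tau
    for r(tau) = exp(-a * |tau|), using a Riemann sum on [-tau_max, tau_max].

    The truncation is harmless here because r decays exponentially.
    """
    tau = np.linspace(-tau_max, tau_max, n)
    dtau = tau[1] - tau[0]
    r = np.exp(-a * np.abs(tau))
    integrand = r * np.exp(-2j * np.pi * f * tau)
    # r is real and even, so the imaginary part cancels up to rounding.
    return (np.sum(integrand) * dtau).real


def psd_closed_form(f, a=1.5):
    """Closed-form Fourier transform of exp(-a * |tau|)."""
    return 2 * a / (a**2 + (2 * np.pi * f) ** 2)


if __name__ == "__main__":
    for f in (0.0, 0.1, 0.5, 2.0):
        print(f, psd_from_autocorr(f), psd_closed_form(f))
```

Here the autocorrelation is absolutely integrable, so the conditions for Fourier inversion hold and the simple form of the theorem applies directly.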

Discrete-time process

For the discrete-time case, the power spectral density of the function with discrete values [math]\displaystyle{ x_n }[/math] is

[math]\displaystyle{ S(\omega)=\frac{1}{2\pi} \sum_{k=-\infty}^\infty r_{xx}(k)e^{-i \omega k} }[/math]

where [math]\displaystyle{ \omega = 2 \pi f }[/math] is the angular frequency, [math]\displaystyle{ i }[/math] is used to denote the imaginary unit (in engineering, sometimes the letter [math]\displaystyle{ j }[/math] is used instead) and [math]\displaystyle{ r_{xx}(k) }[/math] is the discrete autocorrelation function of [math]\displaystyle{ x_n }[/math], defined in its deterministic or stochastic formulation.

Provided [math]\displaystyle{ r_{xx} }[/math] is absolutely summable, i.e.

[math]\displaystyle{ \sum_{k=-\infty}^\infty |r_{xx}(k)| \lt +\infty }[/math]

the result of the theorem then can be written as

[math]\displaystyle{ r_{xx}(k) = \int_{-\pi}^{\pi} e^{i k \omega} S(\omega) d\omega }[/math]

Because [math]\displaystyle{ x_n }[/math] is a discrete-time sequence, its spectral density is periodic in the frequency domain. For this reason, the domain of the function [math]\displaystyle{ S }[/math] is usually restricted to [math]\displaystyle{ ]-\pi, \pi] }[/math] (note that the interval is open on one side).
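The discrete-time relations can be checked on a simple case. The sketch below assumes the geometric autocorrelation [math]\displaystyle{ r_{xx}(k) = \phi^{|k|} }[/math] with [math]\displaystyle{ |\phi| \lt 1 }[/math] (the form that arises, up to scaling, for a first-order autoregressive process); this choice and all function names are illustrative assumptions, not part of the theorem. For this sequence the sum has the closed form [math]\displaystyle{ S(\omega) = \frac{1}{2\pi}\,\frac{1 - \phi^2}{1 - 2\phi\cos\omega + \phi^2} }[/math].

```python
import numpy as np


def spectral_density(omega, phi=0.6, k_max=200):
    """S(omega) = (1 / 2pi) * sum_k r(k) exp(-i omega k) for r(k) = phi**|k|,
    with the sum truncated at |k| <= k_max (the tail is negligible)."""
    k = np.arange(-k_max, k_max + 1)
    r = phi ** np.abs(k)
    return (np.sum(r * np.exp(-1j * omega * k)) / (2 * np.pi)).real


def spectral_density_closed_form(omega, phi=0.6):
    """Closed form of the same sum (a geometric series in exp(-i omega))."""
    return (1 - phi**2) / (2 * np.pi * (1 - 2 * phi * np.cos(omega) + phi**2))


def autocorr_from_density(k, phi=0.6, n=4096):
    """r(k) = integral over (-pi, pi] of exp(i k omega) S(omega) d omega,
    evaluated with the rectangle rule on a uniform grid of one period."""
    omega = -np.pi + 2 * np.pi * np.arange(1, n + 1) / n
    s = spectral_density_closed_form(omega, phi)
    return (np.sum(np.exp(1j * k * omega) * s) * (2 * np.pi / n)).real


if __name__ == "__main__":
    for k in range(4):
        print(k, autocorr_from_density(k), 0.6**k)
```

The density is also periodic with period [math]\displaystyle{ 2\pi }[/math], so evaluating it at [math]\displaystyle{ \omega }[/math] and at [math]\displaystyle{ \omega + 2\pi }[/math] gives the same value.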

Application

The theorem is useful for analyzing linear time-invariant systems (LTI systems) when the inputs and outputs are not square-integrable, so their Fourier transforms do not exist. A corollary is that the Fourier transform of the autocorrelation function of the output of an LTI system equals the Fourier transform of the autocorrelation function of the input multiplied by the squared magnitude of the Fourier transform of the system impulse response.[16] This holds even when the Fourier transforms of the input and output signals do not exist because these signals are not square-integrable, in which case the system inputs and outputs cannot be directly related by the Fourier transform of the impulse response.

Since the Fourier transform of the autocorrelation function of a signal is the power spectrum of the signal, this corollary is equivalent to saying that the power spectrum of the output is equal to the power spectrum of the input times the energy transfer function.

This corollary is used in the parametric method for power spectrum estimation.
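The corollary can be illustrated in discrete time with a short finite impulse response filter driven by white noise. The filter taps, the noise variance and the function names below are assumptions chosen for the sketch. For white noise of variance [math]\displaystyle{ \sigma^2 }[/math] the input spectral density is the constant [math]\displaystyle{ \sigma^2 / 2\pi }[/math], and the output autocorrelation is [math]\displaystyle{ \sigma^2 \sum_n h_n h_{n+k} }[/math]; the code checks that its transform equals the input density times [math]\displaystyle{ |H(\omega)|^2 }[/math].

```python
import numpy as np


def output_autocorr(h, sigma2=1.0):
    """Autocorrelation of the output of a real FIR filter h driven by white
    noise of variance sigma2: r_y(k) = sigma2 * sum_n h[n] h[n + k].

    Returns the lags -(L-1)..(L-1) and the corresponding values.
    """
    h = np.asarray(h, dtype=float)
    lags = np.arange(-(len(h) - 1), len(h))
    return lags, sigma2 * np.correlate(h, h, mode="full")


def psd_from_autocorr(lags, r, omega):
    """S(omega) = (1 / 2pi) * sum_k r(k) exp(-i omega k)."""
    return (np.sum(r * np.exp(-1j * omega * lags)) / (2 * np.pi)).real


def psd_via_transfer(h, omega, sigma2=1.0):
    """Input density sigma2 / 2pi times the squared magnitude of the
    frequency response H(omega) = sum_n h[n] exp(-i omega n)."""
    h = np.asarray(h, dtype=float)
    n = np.arange(len(h))
    H = np.sum(h * np.exp(-1j * omega * n))
    return sigma2 / (2 * np.pi) * np.abs(H) ** 2


if __name__ == "__main__":
    taps = [1.0, -0.5, 0.25]  # hypothetical filter taps
    lags, r_y = output_autocorr(taps, sigma2=2.0)
    for omega in (0.0, 0.8, 2.5):
        print(omega, psd_from_autocorr(lags, r_y, omega), psd_via_transfer(taps, omega, sigma2=2.0))
```

Neither function ever takes the Fourier transform of a noise realization; only the autocorrelation and the impulse response are transformed.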

Discrepancies in terminology

In many textbooks and in much of the technical literature, it is tacitly assumed that Fourier inversion of the autocorrelation function and the power spectral density is valid, and the Wiener–Khinchin theorem is stated, very simply, as if it said that the Fourier transform of the autocorrelation function was equal to the power spectral density, ignoring all questions of convergence[17] (similar to Einstein's paper[11]). But the theorem (as stated here) was applied by Norbert Wiener and Aleksandr Khinchin to the sample functions (signals) of wide-sense-stationary random processes, signals whose Fourier transforms do not exist. Wiener's contribution was to make sense of the spectral decomposition of the autocorrelation function of a sample function of a wide-sense-stationary random process even when the integrals for the Fourier transform and Fourier inversion do not make sense.

Further complicating the issue is that the discrete Fourier transform always exists for digital, finite-length sequences, meaning that the theorem can be blindly applied to calculate autocorrelations of numerical sequences. The relation of such discrete sampled data to a mathematical model is often misleading, however, and related errors can show up as a divergence when the sequence length is modified.
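One common pitfall of this kind can be made concrete. Inverting the squared magnitude of the discrete Fourier transform of a length-N sequence yields its circular autocorrelation, in which the sequence wraps around, rather than the linear (aperiodic) one. Zero-padding to at least 2N − 1 samples before transforming removes the wrap-around. The sketch below uses NumPy's FFT routines; the function names and the sample data are assumptions for illustration, and the sums are left unnormalized (not divided by N).

```python
import numpy as np


def autocorr_direct(x):
    """Linear autocorrelation sums for lags 0..N-1: sum_n x[n + k] * x[n]."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    return np.array([np.dot(x[k:], x[: n - k]) for k in range(n)])


def autocorr_fft(x, pad=True):
    """Autocorrelation sums via the inverse FFT of |FFT(x)|^2.

    With pad=True the sequence is zero-padded to length 2N, which gives the
    linear autocorrelation; with pad=False the result is circular.
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    size = 2 * n if pad else n
    spectrum = np.fft.fft(x, size)
    return np.fft.ifft(np.abs(spectrum) ** 2).real[:n]


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0]
    print("direct  :", autocorr_direct(data))
    print("padded  :", autocorr_fft(data, pad=True))
    print("circular:", autocorr_fft(data, pad=False))
```

For the four-sample sequence used here, the padded result matches the direct sums, while the unpadded result differs at every non-zero lag because each circular lag k also picks up the linear lag N − k.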

Some authors refer to [math]\displaystyle{ r_{xx} }[/math] as the autocovariance function. They then normalize it by dividing by its value at zero lag, [math]\displaystyle{ r_{xx}(0) }[/math], to obtain what they refer to as the autocorrelation function.

References

  1. C. Chatfield (1989). The Analysis of Time Series—An Introduction (fourth ed.). Chapman and Hall, London. pp. 94–95. ISBN 0-412-31820-2. 
  2. Norbert Wiener (1964). Time Series. M.I.T. Press, Cambridge, Massachusetts. p. 42. 
  3. Hannan, E.J., "Stationary Time Series", in: John Eatwell, Murray Milgate, and Peter Newman, editors, The New Palgrave: A Dictionary of Economics. Time Series and Statistics, Macmillan, London, 1990, p. 271.
  4. Dennis Ward Ricker (2003). Echo Signal Processing. Springer. ISBN 1-4020-7395-X. https://books.google.com/books?id=NF2Tmty9nugC&pg=PA23. 
  5. Leon W. Couch II (2001). Digital and Analog Communications Systems (sixth ed.). Prentice Hall, New Jersey. pp. 406–409. ISBN 0-13-522583-3. 
  6. Krzysztof Iniewski (2007). Wireless Technologies: Circuits, Systems, and Devices. CRC Press. ISBN 978-0-8493-7996-3. https://books.google.com/books?id=JJXrpazX9FkC&pg=PA390. 
  7. Joseph W. Goodman (1985). Statistical Optics. Wiley-Interscience. ISBN 0-471-01502-4. 
  8. Wiener, Norbert (1930). "Generalized Harmonic Analysis". Acta Mathematica 55: 117–258. doi:10.1007/bf02546511. 
  9. D.C. Champeney (1987). "Power spectra and Wiener's theorems". A Handbook of Fourier Theorems. Cambridge University Press. p. 102. ISBN 9780521265034. https://archive.org/details/handbookoffourie00cham_0. "Wiener's basic theory of 'generalised harmonic analysis' is in no way probabilistic, and the theorems apply to single well defined functions rather than to ensembles of functions [...] A further development of these ideas occurs in the work of A. I. Khintchine (1894–1959) on stationary random processes (or stochastic processes) [...] in contexts in which it is not important to distinguish the two approaches the theory is often referred to as the Wiener—Khintchine theory." 
  10. Khintchine, Alexander (1934). "Korrelationstheorie der stationären stochastischen Prozesse". Mathematische Annalen 109 (1): 604–615. doi:10.1007/BF01449156. 
  11. Einstein, Albert (1914). "Méthode pour la détermination de valeurs statistiques d'observations concernant des grandeurs soumises à des fluctuations irrégulières". Archives des Sciences 37: 254–256. Bibcode: 1914ArS....37..254E. https://gallica.bnf.fr/ark:/12148/bpt6k2991413. 
  12. Jerison, David; Singer, Isadore Manuel; Stroock, Daniel W. (1997). The Legacy of Norbert Wiener: A Centennial Symposium (Proceedings of Symposia in Pure Mathematics). American Mathematical Society. p. 95. ISBN 0-8218-0415-4. 
  13. Hannan, E. J. (1990). "Stationary Time Series". in Eatwell, John; Milgate, Murray; Newman, Peter. The New Palgrave: A Dictionary of Economics. Time Series and Statistics. London: Macmillan. p. 271. ISBN 9781349208654. https://books.google.com/books?id=CUevCwAAQBAJ&pg=PA271. 
  14. Chatfield, C. (1989). The Analysis of Time Series—An Introduction (Fourth ed.). London: Chapman and Hall. p. 96. ISBN 0-412-31820-2. https://books.google.com/books?id=qKzyAbdaDFAC. 
  15. Champeney, D. C. (1987). A Handbook of Fourier Theorems. Cambridge Univ. Press. pp. 20–22. ISBN 9780521366885. https://books.google.com/books?id=IkQoC1ava5sC&pg=PA20. 
  16. Shlomo Engelberg (2007). Random signals and noise: a mathematical introduction. CRC Press. p. 130. ISBN 978-0-8493-7554-5. https://books.google.com/books?id=Zl51JGnoww4C&pg=PA130. 
  17. C. Chatfield (1989). The Analysis of Time Series—An Introduction (fourth ed.). Chapman and Hall, London. p. 98. ISBN 0-412-31820-2. 

Further reading

  • Brockwell, Peter J.; Davis, Richard A. (2002). Introduction to Time Series and Forecasting (Second ed.). New York: Springer-Verlag. ISBN 038721657X. 
  • Chatfield, C. (1989). The Analysis of Time Series—An Introduction (Fourth ed.). London: Chapman and Hall. ISBN 0412318202. 
  • Fuller, Wayne (1996). Introduction to Statistical Time Series. Wiley Series in Probability and Statistics (Second ed.). New York: Wiley. ISBN 0471552399. 
  • Wiener, Norbert (1949). Extrapolation, Interpolation, and Smoothing of Stationary Time Series. Cambridge, Massachusetts: Technology Press and Johns Hopkins Univ. Press.  (a classified document written for the Dept. of War in 1943).
  • Yaglom, A. M. (1962). An Introduction to the Theory of Stationary Random Functions. Englewood Cliffs, New Jersey: Prentice–Hall.