Nyquist–Shannon sampling theorem
The Nyquist–Shannon sampling theorem is a theorem in the field of signal processing which serves as a fundamental bridge between continuoustime signals and discretetime signals. It establishes a sufficient condition for a sample rate that permits a discrete sequence of samples to capture all the information from a continuoustime signal of finite bandwidth.
Strictly speaking, the theorem only applies to a class of mathematical functions having a Fourier transform that is zero outside of a finite region of frequencies. Intuitively we expect that when one reduces a continuous function to a discrete sequence and interpolates back to a continuous function, the fidelity of the result depends on the density (or sample rate) of the original samples. The sampling theorem introduces the concept of a sample rate that is sufficient for perfect fidelity for the class of functions that are bandlimited to a given bandwidth, such that no actual information is lost in the sampling process. It expresses the sufficient sample rate in terms of the bandwidth for the class of functions. The theorem also leads to a formula for perfectly reconstructing the original continuoustime function from the samples.
Perfect reconstruction may still be possible when the samplerate criterion is not satisfied, provided other constraints on the signal are known (see § Sampling of nonbaseband signals below and compressed sensing). In some cases (when the samplerate criterion is not satisfied), utilizing additional constraints allows for approximate reconstructions. The fidelity of these reconstructions can be verified and quantified utilizing Bochner's theorem.^{[1]}
The name Nyquist–Shannon sampling theorem honours Harry Nyquist and Claude Shannon, but the theorem was also previously discovered by E. T. Whittaker (published in 1915) and Shannon cited Whittaker's paper in his work. The theorem is thus also known by the names Whittaker–Shannon sampling theorem, Whittaker–Shannon, and Whittaker–Nyquist–Shannon, and may also be referred to as the cardinal theorem of interpolation.
Introduction
Sampling is a process of converting a signal (for example, a function of continuous time or space) into a sequence of values (a function of discrete time or space). Shannon's version of the theorem states:^{[2]}
If a function [math]\displaystyle{ x(t) }[/math] contains no frequencies higher than B hertz, it is completely determined by giving its ordinates at a series of points spaced [math]\displaystyle{ 1/(2B) }[/math] seconds apart.
A sufficient samplerate is therefore anything larger than [math]\displaystyle{ 2B }[/math] samples per second. Equivalently, for a given sample rate [math]\displaystyle{ f_s }[/math], perfect reconstruction is guaranteed possible for a bandlimit [math]\displaystyle{ B \lt f_s/2 }[/math].
When the bandlimit is too high (or there is no bandlimit), the reconstruction exhibits imperfections known as aliasing. Modern statements of the theorem are sometimes careful to explicitly state that [math]\displaystyle{ x(t) }[/math] must contain no sinusoidal component at exactly frequency [math]\displaystyle{ B, }[/math] or that [math]\displaystyle{ B }[/math] must be strictly less than ½ the sample rate. The threshold [math]\displaystyle{ 2B }[/math] is called the Nyquist rate and is an attribute of the continuoustime input [math]\displaystyle{ x(t) }[/math] to be sampled. The sample rate must exceed the Nyquist rate for the samples to suffice to represent [math]\displaystyle{ x(t). }[/math] The threshold [math]\displaystyle{ f_s/2 }[/math] is called the Nyquist frequency and is an attribute of the sampling equipment. All meaningful frequency components of the properly sampled [math]\displaystyle{ x(t) }[/math] exist below the Nyquist frequency. The condition described by these inequalities is called the Nyquist criterion, or sometimes the Raabe condition. The theorem is also applicable to functions of other domains, such as space, in the case of a digitized image. The only change, in the case of other domains, is the units of measure attributed to [math]\displaystyle{ t, }[/math] [math]\displaystyle{ f_s, }[/math] and [math]\displaystyle{ B. }[/math]
The symbol [math]\displaystyle{ T \triangleq 1/f_s }[/math] is customarily used to represent the interval between samples and is called the sample period or sampling interval. The samples of function [math]\displaystyle{ x(t) }[/math] are commonly denoted by [math]\displaystyle{ x[n] \triangleq x(nT) }[/math] (alternatively [math]\displaystyle{ x_n }[/math] in older signal processing literature), for all integer values of [math]\displaystyle{ n. }[/math] Another convenient definition is [math]\displaystyle{ x[n] \triangleq T\cdot x(nT), }[/math] which preserves the energy of the signal as [math]\displaystyle{ T }[/math] varies.^{[3]}
A mathematically ideal way to interpolate the sequence involves the use of sinc functions. Each sample in the sequence is replaced by a sinc function, centered on the time axis at the original location of the sample [math]\displaystyle{ nT, }[/math] with the amplitude of the sinc function scaled to the sample value, [math]\displaystyle{ x[n]. }[/math] Subsequently, the sinc functions are summed into a continuous function. A mathematically equivalent method is to convolve one sinc function with a series of Dirac delta pulses, weighted by the sample values. Neither method is numerically practical. Instead, some type of approximation of the sinc functions, finite in length, is used. The imperfections attributable to the approximation are known as interpolation error.
Practical digitaltoanalog converters produce neither scaled and delayed sinc functions, nor ideal Dirac pulses. Instead they produce a piecewiseconstant sequence of scaled and delayed rectangular pulses (the zeroorder hold), usually followed by a lowpass filter (called an "antiimaging filter") to remove spurious highfrequency replicas (images) of the original baseband signal.
Aliasing
When [math]\displaystyle{ x(t) }[/math] is a function with a Fourier transform [math]\displaystyle{ X(f) }[/math]:
 [math]\displaystyle{ X(f)\ \triangleq\ \int_{\infty}^{\infty} x(t) \ e^{ i 2 \pi f t} \ {\rm d}t, }[/math]
the Poisson summation formula indicates that the samples, [math]\displaystyle{ x(nT) }[/math], of [math]\displaystyle{ x(t) }[/math] are sufficient to create a periodic summation of [math]\displaystyle{ X(f) }[/math]. The result is:

[math]\displaystyle{ X_s(f)\ \triangleq \sum_{k=\infty}^{\infty} X\left(f  k f_s\right) = \sum_{n=\infty}^{\infty} T\cdot x(nT)\ e^{i 2\pi n T f}, }[/math]
(
)
which is a periodic function and its equivalent representation as a Fourier series, whose coefficients are [math]\displaystyle{ T\cdot x(nT). }[/math] This function is also known as the discretetime Fourier transform (DTFT) of the sample sequence.
As depicted, copies of [math]\displaystyle{ X(f) }[/math] are shifted by multiples of the sampling rate [math]\displaystyle{ f_s }[/math] and combined by addition. For a bandlimited function [math]\displaystyle{ (X(f) = 0, \text{ for all } f \ge B) }[/math] and sufficiently large [math]\displaystyle{ f_s, }[/math] it is possible for the copies to remain distinct from each other. But if the Nyquist criterion is not satisfied, adjacent copies overlap, and it is not possible in general to discern an unambiguous [math]\displaystyle{ X(f). }[/math] Any frequency component above [math]\displaystyle{ f_s/2 }[/math] is indistinguishable from a lowerfrequency component, called an alias, associated with one of the copies. In such cases, the customary interpolation techniques produce the alias, rather than the original component. When the samplerate is predetermined by other considerations (such as an industry standard), [math]\displaystyle{ x(t) }[/math] is usually filtered to reduce its high frequencies to acceptable levels before it is sampled. The type of filter required is a lowpass filter, and in this application it is called an antialiasing filter.
Derivation as a special case of Poisson summation
When there is no overlap of the copies (also known as "images") of [math]\displaystyle{ X(f) }[/math], the [math]\displaystyle{ k=0 }[/math] term of Eq.1 can be recovered by the product:
 [math]\displaystyle{ X(f) = H(f) \cdot X_s(f), }[/math] where:
 [math]\displaystyle{ H(f)\ \triangleq\ \begin{cases}1 & f \lt B \\ 0 & f \gt f_s  B. \end{cases} }[/math]
The sampling theorem is proved since [math]\displaystyle{ X(f) }[/math] uniquely determines [math]\displaystyle{ x(t). }[/math]
All that remains is to derive the formula for reconstruction. [math]\displaystyle{ H(f) }[/math] need not be precisely defined in the region [math]\displaystyle{ [B,\ f_sB] }[/math] because [math]\displaystyle{ X_s(f) }[/math] is zero in that region. However, the worst case is when [math]\displaystyle{ B=f_s/2, }[/math] the Nyquist frequency. A function that is sufficient for that and all less severe cases is:
 [math]\displaystyle{ H(f) = \mathrm{rect} \left(\frac{f}{f_s} \right) = \begin{cases}1 & f \lt \frac{f_s}{2} \\ 0 & f \gt \frac{f_s}{2}, \end{cases} }[/math]
where rect(•) is the rectangular function. Therefore:
 [math]\displaystyle{ X(f) = \mathrm{rect} \left(\frac{f}{f_s} \right) \cdot X_s(f) }[/math]
 [math]\displaystyle{ = \mathrm{rect}(Tf)\cdot \sum_{n=\infty}^{\infty} T\cdot x(nT)\ e^{i 2\pi n T f} }[/math] (from Eq.1, above).
 [math]\displaystyle{ = \sum_{n=\infty}^{\infty} x(nT)\cdot \underbrace{T\cdot \mathrm{rect} (Tf) \cdot e^{i 2\pi n T f}}_{ \mathcal{F}\left \{ \mathrm{sinc} \left( \frac{t  nT}{T} \right) \right \}}. }[/math] ^{[upperalpha 1]}
The inverse transform of both sides produces the Whittaker–Shannon interpolation formula:
 [math]\displaystyle{ x(t) = \sum_{n=\infty}^{\infty} x(nT)\cdot \mathrm{sinc} \left( \frac{t  nT}{T}\right), }[/math]
which shows how the samples, [math]\displaystyle{ x(nT), }[/math] can be combined to reconstruct [math]\displaystyle{ x(t). }[/math]
 Largerthannecessary values of f_{s} (smaller values of T), called oversampling, have no effect on the outcome of the reconstruction and have the benefit of leaving room for a transition band in which H(f) is free to take intermediate values. Undersampling, which causes aliasing, is not in general a reversible operation.
 Theoretically, the interpolation formula can be implemented as a lowpass filter, whose impulse response is sinc(t/T) and whose input is [math]\displaystyle{ \textstyle\sum_{n=\infty}^{\infty} x(nT)\cdot \delta(t  nT), }[/math] which is a Dirac comb function modulated by the signal samples. Practical digitaltoanalog converters (DAC) implement an approximation like the zeroorder hold. In that case, oversampling can reduce the approximation error.
Shannon's original proof
Poisson shows that the Fourier series in Eq.1 produces the periodic summation of [math]\displaystyle{ X(f) }[/math], regardless of [math]\displaystyle{ f_s }[/math] and [math]\displaystyle{ B }[/math]. Shannon, however, only derives the series coefficients for the case [math]\displaystyle{ f_s=2B }[/math]. Virtually quoting Shannon's original paper:
 Let [math]\displaystyle{ X(\omega) }[/math] be the spectrum of [math]\displaystyle{ x(t). }[/math] Then
 [math]\displaystyle{ x(t) = {1 \over 2\pi} \int_{\infty}^{\infty} X(\omega) e^{i\omega t}\;{\rm d}\omega = {1 \over 2\pi} \int_{2\pi B}^{2\pi B} X(\omega) e^{i\omega t}\;{\rm d}\omega, }[/math]
 because [math]\displaystyle{ X(\omega) }[/math] is assumed to be zero outside the band [math]\displaystyle{ \left\tfrac{\omega}{2\pi}\right \lt B. }[/math] If we let [math]\displaystyle{ t = \tfrac{n}{2B}, }[/math] where [math]\displaystyle{ n }[/math] is any positive or negative integer, we obtain:

[math]\displaystyle{ x \left(\tfrac{n}{2B} \right) = {1 \over 2\pi} \int_{2\pi B}^{2\pi B} X(\omega) e^{i\omega {n \over {2B}}}\;{\rm d}\omega. }[/math]
(
)

 On the left are values of [math]\displaystyle{ x(t) }[/math] at the sampling points. The integral on the right will be recognized as essentially^{[loweralpha 1]} the n^{th} coefficient in a Fourierseries expansion of the function [math]\displaystyle{ X(\omega), }[/math] taking the interval [math]\displaystyle{ B }[/math] to [math]\displaystyle{ B }[/math] as a fundamental period. This means that the values of the samples [math]\displaystyle{ x(n/2B) }[/math] determine the Fourier coefficients in the series expansion of [math]\displaystyle{ X(\omega). }[/math] Thus they determine [math]\displaystyle{ X(\omega), }[/math] since [math]\displaystyle{ X(\omega) }[/math] is zero for frequencies greater than B, and for lower frequencies [math]\displaystyle{ X(\omega) }[/math] is determined if its Fourier coefficients are determined. But [math]\displaystyle{ X(\omega) }[/math] determines the original function [math]\displaystyle{ x(t) }[/math] completely, since a function is determined if its spectrum is known. Therefore the original samples determine the function [math]\displaystyle{ x(t) }[/math] completely.
Shannon's proof of the theorem is complete at that point, but he goes on to discuss reconstruction via sinc functions, what we now call the Whittaker–Shannon interpolation formula as discussed above. He does not derive or prove the properties of the sinc function, but these would have been^{[weasel words]} familiar to engineers reading his works at the time, since the Fourier pair relationship between rect (the rectangular function) and sinc was well known.
 Let [math]\displaystyle{ x_n }[/math] be the n^{th} sample. Then the function [math]\displaystyle{ x(t) }[/math] is represented by:
 [math]\displaystyle{ x(t) = \sum_{n=\infty}^{\infty}x_n{\sin(\pi(2Btn)) \over \pi(2Btn)}. }[/math]
As in the other proof, the existence of the Fourier transform of the original signal is assumed, so the proof does not say whether the sampling theorem extends to bandlimited stationary random processes.
Notes
Application to multivariable signals and images
The sampling theorem is usually formulated for functions of a single variable. Consequently, the theorem is directly applicable to timedependent signals and is normally formulated in that context. However, the sampling theorem can be extended in a straightforward way to functions of arbitrarily many variables. Grayscale images, for example, are often represented as twodimensional arrays (or matrices) of real numbers representing the relative intensities of pixels (picture elements) located at the intersections of row and column sample locations. As a result, images require two independent variables, or indices, to specify each pixel uniquely—one for the row, and one for the column.
Color images typically consist of a composite of three separate grayscale images, one to represent each of the three primary colors—red, green, and blue, or RGB for short. Other colorspaces using 3vectors for colors include HSV, CIELAB, XYZ, etc. Some colorspaces such as cyan, magenta, yellow, and black (CMYK) may represent color by four dimensions. All of these are treated as vectorvalued functions over a twodimensional sampled domain.
Similar to onedimensional discretetime signals, images can also suffer from aliasing if the sampling resolution, or pixel density, is inadequate. For example, a digital photograph of a striped shirt with high frequencies (in other words, the distance between the stripes is small), can cause aliasing of the shirt when it is sampled by the camera's image sensor. The aliasing appears as a moiré pattern. The "solution" to higher sampling in the spatial domain for this case would be to move closer to the shirt, use a higher resolution sensor, or to optically blur the image before acquiring it with the sensor using an optical lowpass filter.
Another example is shown to the right in the brick patterns. The top image shows the effects when the sampling theorem's condition is not satisfied. When software rescales an image (the same process that creates the thumbnail shown in the lower image) it, in effect, runs the image through a lowpass filter first and then downsamples the image to result in a smaller image that does not exhibit the moiré pattern. The top image is what happens when the image is downsampled without lowpass filtering: aliasing results.
The sampling theorem applies to camera systems, where the scene and lens constitute an analog spatial signal source, and the image sensor is a spatial sampling device. Each of these components is characterized by a modulation transfer function (MTF), representing the precise resolution (spatial bandwidth) available in that component. Effects of aliasing or blurring can occur when the lens MTF and sensor MTF are mismatched. When the optical image which is sampled by the sensor device contains higher spatial frequencies than the sensor, the under sampling acts as a lowpass filter to reduce or eliminate aliasing. When the area of the sampling spot (the size of the pixel sensor) is not large enough to provide sufficient spatial antialiasing, a separate antialiasing filter (optical lowpass filter) may be included in a camera system to reduce the MTF of the optical image. Instead of requiring an optical filter, the graphics processing unit of smartphone cameras performs digital signal processing to remove aliasing with a digital filter. Digital filters also apply sharpening to amplify the contrast from the lens at high spatial frequencies, which otherwise falls off rapidly at diffraction limits.
The sampling theorem also applies to postprocessing digital images, such as to up or down sampling. Effects of aliasing, blurring, and sharpening may be adjusted with digital filtering implemented in software, which necessarily follows the theoretical principles.
Critical frequency
To illustrate the necessity of [math]\displaystyle{ f_s\gt 2B }[/math], consider the family of sinusoids generated by different values of [math]\displaystyle{ \theta }[/math] in this formula:
 [math]\displaystyle{ x(t) = \frac{\cos(2 \pi B t + \theta )}{\cos(\theta )}\ = \ \cos(2 \pi B t)  \sin(2 \pi B t)\tan(\theta ), \quad \pi/2 \lt \theta \lt \pi/2. }[/math]
With [math]\displaystyle{ f_s=2B }[/math] or equivalently [math]\displaystyle{ T=1/2B }[/math], the samples are given by:
 [math]\displaystyle{ x(nT) = \cos(\pi n)  \underbrace{\sin(\pi n)}_{0}\tan(\theta ) = (1)^n }[/math]
regardless of the value of [math]\displaystyle{ \theta }[/math]. That sort of ambiguity is the reason for the strict inequality of the sampling theorem's condition.
Sampling of nonbaseband signals
As discussed by Shannon:^{[2]}
A similar result is true if the band does not start at zero frequency but at some higher value, and can be proved by a linear translation (corresponding physically to singlesideband modulation) of the zerofrequency case. In this case the elementary pulse is obtained from sin(x)/x by singlesideband modulation.
That is, a sufficient noloss condition for sampling signals that do not have baseband components exists that involves the width of the nonzero frequency interval as opposed to its highest frequency component. See Sampling (signal processing) for more details and examples.
For example, in order to sample the FM radio signals in the frequency range of 100–102 MHz, it is not necessary to sample at 204 MHz (twice the upper frequency), but rather it is sufficient to sample at 4 MHz (twice the width of the frequency interval).
A bandpass condition is that X(f) = 0, for all nonnegative f outside the open band of frequencies: [math]\displaystyle{ \left(\frac{N}2 f_\mathrm{s}, \frac{N+1}2 f_\mathrm{s}\right), }[/math] for some nonnegative integer N. This formulation includes the normal baseband condition as the case N=0.
The corresponding interpolation function is the impulse response of an ideal brickwall bandpass filter (as opposed to the ideal brickwall lowpass filter used above) with cutoffs at the upper and lower edges of the specified band, which is the difference between a pair of lowpass impulse responses: [math]\displaystyle{ (N+1)\,\operatorname{sinc} \left(\frac{(N+1)t}T\right)  N\,\operatorname{sinc}\left( \frac{Nt}T \right). }[/math]
Other generalizations, for example to signals occupying multiple noncontiguous bands, are possible as well. Even the most generalized form of the sampling theorem does not have a provably true converse. That is, one cannot conclude that information is necessarily lost just because the conditions of the sampling theorem are not satisfied; from an engineering perspective, however, it is generally safe to assume that if the sampling theorem is not satisfied then information will most likely be lost.
Nonuniform sampling
The sampling theory of Shannon can be generalized for the case of nonuniform sampling, that is, samples not taken equally spaced in time. The Shannon sampling theory for nonuniform sampling states that a bandlimited signal can be perfectly reconstructed from its samples if the average sampling rate satisfies the Nyquist condition.^{[4]} Therefore, although uniformly spaced samples may result in easier reconstruction algorithms, it is not a necessary condition for perfect reconstruction.
The general theory for nonbaseband and nonuniform samples was developed in 1967 by Henry Landau.^{[5]} He proved that the average sampling rate (uniform or otherwise) must be twice the occupied bandwidth of the signal, assuming it is a priori known what portion of the spectrum was occupied. In the late 1990s, this work was partially extended to cover signals whose amount of occupied bandwidth was known, but the actual occupied portion of the spectrum was unknown.^{[6]} In the 2000s, a complete theory was developed (see the section Sampling below the Nyquist rate under additional restrictions below) using compressed sensing. In particular, the theory, using signal processing language, is described in this 2009 paper.^{[7]} They show, among other things, that if the frequency locations are unknown, then it is necessary to sample at least at twice the Nyquist criteria; in other words, you must pay at least a factor of 2 for not knowing the location of the spectrum. Note that minimum sampling requirements do not necessarily guarantee stability.
Sampling below the Nyquist rate under additional restrictions
The Nyquist–Shannon sampling theorem provides a sufficient condition for the sampling and reconstruction of a bandlimited signal. When reconstruction is done via the Whittaker–Shannon interpolation formula, the Nyquist criterion is also a necessary condition to avoid aliasing, in the sense that if samples are taken at a slower rate than twice the band limit, then there are some signals that will not be correctly reconstructed. However, if further restrictions are imposed on the signal, then the Nyquist criterion may no longer be a necessary condition.
A nontrivial example of exploiting extra assumptions about the signal is given by the recent field of compressed sensing, which allows for full reconstruction with a subNyquist sampling rate. Specifically, this applies to signals that are sparse (or compressible) in some domain. As an example, compressed sensing deals with signals that may have a low overall bandwidth (say, the effective bandwidth EB), but the frequency locations are unknown, rather than all together in a single band, so that the passband technique does not apply. In other words, the frequency spectrum is sparse. Traditionally, the necessary sampling rate is thus 2B. Using compressed sensing techniques, the signal could be perfectly reconstructed if it is sampled at a rate slightly lower than 2EB. With this approach, reconstruction is no longer given by a formula, but instead by the solution to a linear optimization program.
Another example where subNyquist sampling is optimal arises under the additional constraint that the samples are quantized in an optimal manner, as in a combined system of sampling and optimal lossy compression.^{[8]} This setting is relevant in cases where the joint effect of sampling and quantization is to be considered, and can provide a lower bound for the minimal reconstruction error that can be attained in sampling and quantizing a random signal. For stationary Gaussian random signals, this lower bound is usually attained at a subNyquist sampling rate, indicating that subNyquist sampling is optimal for this signal model under optimal quantization.^{[9]}
Historical background
The sampling theorem was implied by the work of Harry Nyquist in 1928,^{[10]} in which he showed that up to 2B independent pulse samples could be sent through a system of bandwidth B; but he did not explicitly consider the problem of sampling and reconstruction of continuous signals. About the same time, Karl Küpfmüller showed a similar result^{[11]} and discussed the sincfunction impulse response of a bandlimiting filter, via its integral, the stepresponse sine integral; this bandlimiting and reconstruction filter that is so central to the sampling theorem is sometimes referred to as a Küpfmüller filter (but seldom so in English).
The sampling theorem, essentially a dual of Nyquist's result, was proved by Claude E. Shannon.^{[2]} V. A. Kotelnikov published similar results in 1933,^{[12]} as did the mathematician E. T. Whittaker in 1915,^{[13]} J. M. Whittaker in 1935,^{[14]} and Gabor in 1946 ("Theory of communication"). In 1999, the Eduard Rhein Foundation awarded Kotelnikov their Basic Research Award "for the first theoretically exact formulation of the sampling theorem".
In 1948 and 1949, Claude E. Shannon published – 16 years after Vladimir Kotelnikov – the two revolutionary articles in which he founded the information theory.^{[15]}^{[16]}^{[2]} In Shannon 1948 the sampling theorem is formulated as “Theorem 13”: Let f(t) contain no frequencies over W. Then
 [math]\displaystyle{ f(t) = \sum_{n=\infty}^\infty X_n \frac{\sin \pi(2Wt  n)}{\pi(2Wt  n)}, }[/math] where [math]\displaystyle{ X_n = f\left(\frac n {2W} \right) }[/math].
It was not until these articles were published that the theorem known as “Shannon’s sampling theorem” became common property among communication engineers, although Shannon himself writes that this is a fact which is common knowledge in the communication art.^{[upperalpha 2]} A few lines further on, however, he adds: "but in spite of its evident importance, [it] seems not to have appeared explicitly in the literature of communication theory".
Other discoverers
Others who have independently discovered or played roles in the development of the sampling theorem have been discussed in several historical articles, for example, by Jerri^{[17]} and by Lüke.^{[18]} For example, Lüke points out that H. Raabe, an assistant to Küpfmüller, proved the theorem in his 1939 Ph.D. dissertation; the term Raabe condition came to be associated with the criterion for unambiguous representation (sampling rate greater than twice the bandwidth). Meijering^{[19]} mentions several other discoverers and names in a paragraph and pair of footnotes:
As pointed out by Higgins [135], the sampling theorem should really be considered in two parts, as done above: the first stating the fact that a bandlimited function is completely determined by its samples, the second describing how to reconstruct the function using its samples. Both parts of the sampling theorem were given in a somewhat different form by J. M. Whittaker [350, 351, 353] and before him also by Ogura [241, 242]. They were probably not aware of the fact that the first part of the theorem had been stated as early as 1897 by Borel [25].^{27} As we have seen, Borel also used around that time what became known as the cardinal series. However, he appears not to have made the link [135]. In later years it became known that the sampling theorem had been presented before Shannon to the Russian communication community by Kotel'nikov [173]. In more implicit, verbal form, it had also been described in the German literature by Raabe [257]. Several authors [33, 205] have mentioned that Someya [296] introduced the theorem in the Japanese literature parallel to Shannon. In the English literature, Weston [347] introduced it independently of Shannon around the same time.^{28}
^{27} Several authors, following Black [16], have claimed that this first part of the sampling theorem was stated even earlier by Cauchy, in a paper [41] published in 1841. However, the paper of Cauchy does not contain such a statement, as has been pointed out by Higgins [135].
^{28} As a consequence of the discovery of the several independent introductions of the sampling theorem, people started to refer to the theorem by including the names of the aforementioned authors, resulting in such catchphrases as “the Whittaker–Kotel’nikov–Shannon (WKS) sampling theorem" [155] or even "the Whittaker–Kotel'nikov–Raabe–Shannon–Someya sampling theorem" [33]. To avoid confusion, perhaps the best thing to do is to refer to it as the sampling theorem, "rather than trying to find a title that does justice to all claimants" [136].
Why Nyquist?
Exactly how, when, or why Harry Nyquist had his name attached to the sampling theorem remains obscure. The term Nyquist Sampling Theorem (capitalized thus) appeared as early as 1959 in a book from his former employer, Bell Labs,^{[20]} and appeared again in 1963,^{[21]} and not capitalized in 1965.^{[22]} It had been called the Shannon Sampling Theorem as early as 1954,^{[23]} but also just the sampling theorem by several other books in the early 1950s.
In 1958, Blackman and Tukey cited Nyquist's 1928 article as a reference for the sampling theorem of information theory,^{[24]} even though that article does not treat sampling and reconstruction of continuous signals as others did. Their glossary of terms includes these entries:
 Sampling theorem (of information theory)
 Nyquist's result that equispaced data, with two or more points per cycle of highest frequency, allows reconstruction of bandlimited functions. (See Cardinal theorem.)
 Cardinal theorem (of interpolation theory)
 A precise statement of the conditions under which values given at a doubly infinite set of equally spaced points can be interpolated to yield a continuous bandlimited function with the aid of the function [math]\displaystyle{ \frac{\sin (x  x_i)}{x  x_i}. }[/math]
Exactly what "Nyquist's result" they are referring to remains mysterious.
When Shannon stated and proved the sampling theorem in his 1949 article, according to Meijering,^{[19]} "he referred to the critical sampling interval [math]\displaystyle{ T = \frac 1 {2W} }[/math] as the Nyquist interval corresponding to the band W, in recognition of Nyquist’s discovery of the fundamental importance of this interval in connection with telegraphy". This explains Nyquist's name on the critical interval, but not on the theorem.
Similarly, Nyquist's name was attached to Nyquist rate in 1953 by Harold S. Black:
"If the essential frequency range is limited to B cycles per second, 2B was given by Nyquist as the maximum number of code elements per second that could be unambiguously resolved, assuming the peak interference is less half a quantum step. This rate is generally referred to as signaling at the Nyquist rate and [math]\displaystyle{ \frac 1 {2B} }[/math] has been termed a Nyquist interval."^{[25]} (bold added for emphasis; italics as in the original)
According to the OED, this may be the origin of the term Nyquist rate. In Black's usage, it is not a sampling rate, but a signaling rate.
See also
 44,100 Hz, a customary rate used to sample audible frequencies is based on the limits of human hearing and the sampling theorem
 Balian–Low theorem, a similar theoretical lower bound on sampling rates, but which applies to time–frequency transforms
 Cheung–Marks theorem, which specifies conditions where restoration of a signal by the sampling theorem can become illposed
 Shannon–Hartley theorem
 Nyquist ISI criterion
 Reconstruction from zero crossings
 Zeroorder hold
Notes
References
 ↑ Nemirovsky, Jonathan; Shimron, Efrat (2015). "Utilizing Bochners Theorem for Constrained Evaluation of Missing Fourier Data". arXiv:1506.03300 [physics.medph].
 ↑ ^{2.0} ^{2.1} ^{2.2} ^{2.3} Shannon, Claude E. (January 1949). "Communication in the presence of noise". Proceedings of the Institute of Radio Engineers 37 (1): 10–21. doi:10.1109/jrproc.1949.232969. Reprint as classic paper in: Proc. IEEE, Vol. 86, No. 2, (Feb 1998)
 ↑ Ahmed, N.; Rao, K.R. (July 10, 1975) (in English). Orthogonal Transforms for Digital Signal Processing (1 ed.). Berlin Heidelberg New York: SpringerVerlag. doi:10.1007/9783642454509. ISBN 9783540065562. https://www.google.com/books/edition/Orthogonal_Transforms_for_Digital_Signal/FnvCAAAQBAJ?hl=en&gbpv=1&printsec=frontcover.
 ↑ Marvasti, F., ed (2000). Nonuniform Sampling, Theory and Practice. New York: Kluwer Academic/Plenum Publishers.
 ↑ Landau, H. J. (1967). "Necessary density conditions for sampling and interpolation of certain entire functions". Acta Math. 117 (1): 37–52. doi:10.1007/BF02395039.
 ↑ see, e.g., Feng, P. (1997). Universal minimumrate sampling and spectrumblind reconstruction for multiband signals. Ph.D. dissertation, University of Illinois at UrbanaChampaign.
 ↑ Mishali, Moshe; Eldar, Yonina C. (March 2009). "Blind Multiband Signal Reconstruction: Compressed Sensing for Analog Signals". IEEE Trans. Signal Process. 57 (3): 993–1009. doi:10.1109/TSP.2009.2012791. Bibcode: 2009ITSP...57..993M.
 ↑ Kipnis, Alon; Goldsmith, Andrea J.; Eldar, Yonina C.; Weissman, Tsachy (January 2016). "Distortion rate function of subNyquist sampled Gaussian sources". IEEE Transactions on Information Theory 62: 401–429. doi:10.1109/tit.2015.2485271.
 ↑ Kipnis, Alon; Eldar, Yonina; Goldsmith, Andrea (26 April 2018). "AnalogtoDigital Compression: A New Paradigm for Converting Signals to Bits". IEEE Signal Processing Magazine 35 (3): 16–39. doi:10.1109/MSP.2017.2774249. Bibcode: 2018ISPM...35...16K.
 ↑ Nyquist, Harry (April 1928). "Certain topics in telegraph transmission theory". Trans. AIEE 47 (2): 617–644. doi:10.1109/taiee.1928.5055024. Bibcode: 1928TAIEE..47..617N. Reprint as classic paper in: Proc. IEEE, Vol. 90, No. 2, Feb 2002
 ↑ Küpfmüller, Karl (1928). "Über die Dynamik der selbsttätigen Verstärkungsregler" (in de). Elektrische Nachrichtentechnik 5 (11): 459–467. (English translation 2005).
 ↑ Kotelnikov, V. A. (1933). "On the carrying capacity of the ether and wire in telecommunications" (in ru). Material for the First AllUnion Conference on Questions of Communication, Izd. Red. Upr. Svyazi RKKA. (English translation, PDF).
 ↑ Whittaker, E. T. (1915). "On the Functions Which are Represented by the Expansions of the Interpolation Theory". Proc. Royal Soc. Edinburgh 35: 181–194. doi:10.1017/s0370164600017806. https://zenodo.org/record/1428702. ("Theorie der Kardinalfunktionen").
 ↑ Whittaker, J. M. (1935). Interpolatory Function Theory. Cambridge, England: Cambridge Univ. Press. https://archive.org/details/in.ernet.dli.2015.219870..
 ↑ Shannon, Claude E. (July 1948). "A Mathematical Theory of Communication". Bell System Technical Journal 27 (3): 379–423. doi:10.1002/j.15387305.1948.tb01338.x..
 ↑ Shannon, Claude E. (October 1948). "A Mathematical Theory of Communication". Bell System Technical Journal 27 (4): 623–666. doi:10.1002/j.15387305.1948.tb00917.x.
 ↑ Jerri, Abdul (November 1977). "The Shannon Sampling Theorem—Its Various Extensions and Applications: A Tutorial Review". Proceedings of the IEEE 65 (11): 1565–1596. doi:10.1109/proc.1977.10771. Bibcode: 1977IEEEP..65.1565J. See also Jerri, Abdul (April 1979). "Correction to "The Shannon sampling theorem—Its various extensions and applications: A tutorial review"". Proceedings of the IEEE 67 (4): 695. doi:10.1109/proc.1979.11307.
 ↑ Lüke, Hans Dieter (April 1999). "The Origins of the Sampling Theorem". IEEE Communications Magazine 37 (4): 106–108. doi:10.1109/35.755459. http://www.hit.bme.hu/people/papay/edu/Conv/pdf/origins.pdf.
 ↑ ^{19.0} ^{19.1} Meijering, Erik (March 2002). "A Chronology of Interpolation From Ancient Astronomy to Modern Signal and Image Processing". Proc. IEEE 90 (3): 319–342. doi:10.1109/5.993400. http://bigwww.epfl.ch/publications/meijering0201.pdf.
 ↑ Members of the Technical Staff of Bell Telephone Lababoratories (1959). Transmission Systems for Communications. AT&T. pp. 26–4 (Vol.2).
 ↑ Guillemin, Ernst Adolph (1963). Theory of Linear Physical Systems. Wiley. ISBN 9780471330707. https://books.google.com/books?id=jtIAAAAIAAJ.
 ↑ Roberts, Richard A.; Barton, Ben F. (1965). Theory of Signal Detectability: Composite Deferred Decision Theory.
 ↑ Gray, Truman S. (1954). "Applied Electronics: A First Course in Electronics, Electron Tubes, and Associated Circuits". Physics Today 7 (11): 17. doi:10.1063/1.3061438. Bibcode: 1954PhT.....7k..17G.
 ↑ Blackman, R. B.; Tukey, J. W. (1958). The Measurement of Power Spectra : From the Point of View of Communications Engineering. New York: Dover. http://alcatellucent.com/bstj/vol371958/articles/bstj371185.pdf.
 ↑ Black, Harold S. (1953). Modulation Theory.
Further reading
 Higgins, J.R.: Five short stories about the cardinal series, Bulletin of the AMS 12(1985)
 Küpfmüller, Karl, "Utjämningsförlopp inom Telegraf och Telefontekniken", ("Transients in telegraph and telephone engineering"), Teknisk Tidskrift, no. 9 pp. 153–160 and 10 pp. 178–182, 1931. [1] [2]
 Marks, R.J.(II): Introduction to Shannon Sampling and Interpolation Theory, SpringerVerlag, 1991.
 Marks, R.J.(II), Editor: Advanced Topics in Shannon Sampling and Interpolation Theory, SpringerVerlag, 1993.
 Marks, R.J.(II), Handbook of Fourier Analysis and Its Applications, Oxford University Press, (2009), Chapters 5–8. Google books
 Press, WH; Teukolsky, SA; Vetterling, WT; Flannery, BP (2007), "Section 13.11. Numerical Use of the Sampling Theorem", Numerical Recipes: The Art of Scientific Computing (3rd ed.), New York: Cambridge University Press, ISBN 9780521880688, http://apps.nrbook.com/empanel/index.html#pg=717
 Unser, Michael: Sampling50 Years after Shannon, Proc. IEEE, vol. 88, no. 4, pp. 569–587, April 2000
External links
 Learning by Simulations Interactive simulation of the effects of inadequate sampling
 Interactive presentation of the sampling and reconstruction in a webdemo Institute of Telecommunications, University of Stuttgart
 Undersampling and an application of it
 Sampling Theory For Digital Audio
 Journal devoted to Sampling Theory
 Sampling Theorem with Constant Amplitude Variable Width Pulse
 Lüke, Hans Dieter (April 1999). "The Origins of the Sampling Theorem". IEEE Communications Magazine 37 (4): 106–108. doi:10.1109/35.755459. http://www.hit.bme.hu/people/papay/edu/Conv/pdf/origins.pdf.
Original source: https://en.wikipedia.org/wiki/Nyquist–Shannon sampling theorem.
Read more 