Gabor transform

From HandWiki

The Gabor transform, named after Dennis Gabor, is a special case of the short-time Fourier transform. It is used to determine the sinusoidal frequency and phase content of local sections of a signal as it changes over time. The function to be transformed is first multiplied by a Gaussian function, which can be regarded as a window function, and the resulting function is then transformed with a Fourier transform to derive the time-frequency analysis.[1] The window function means that the signal near the time being analyzed will have higher weight. The Gabor transform of a signal x(t) is defined by this formula:

[math]\displaystyle{ G_x(\tau,\omega) = \int_{-\infty}^\infty x(t)e^{-\pi(t-\tau)^2}e^{-j \omega t}\,dt }[/math]
Magnitude of Gaussian function.

The Gaussian function has infinite range and it is impractical for implementation. However, a level of significance can be chosen (for instance 0.00001) for the distribution of the Gaussian function.

[math]\displaystyle{ \begin{cases} e^{-{\pi}a^2} \ge 0.00001; & \left| a \right| \le 1.9143 \\ e^{-{\pi}a^2} \lt 0.00001; & \left| a \right| \gt 1.9143 \end{cases} }[/math]

Outside these limits of integration ([math]\displaystyle{ \left| a \right| \gt 1.9143 }[/math]) the Gaussian function is small enough to be ignored. Thus the Gabor transform can be satisfactorily approximated as

[math]\displaystyle{ G_x(\tau,\omega) = \int_{-1.9143+\tau}^{1.9143+\tau} x(t) e^{-\pi(t-\tau)^2} e^{-j \omega t} \, dt }[/math]

This simplification makes the Gabor transform practical and realizable.

The window function width can also be varied to optimize the time-frequency resolution tradeoff for a particular application by replacing the [math]\displaystyle{ {-{\pi}(t-\tau)^2} }[/math] with [math]\displaystyle{ {-{\pi}\alpha (t-\tau)^2} }[/math] for some chosen [math]\displaystyle{ \alpha }[/math].

Inverse Gabor transform

The Gabor transform is invertible. Because it is over-complete, the original signal can be recovered in a variety of ways. For example, the "unwindowing" approach can be used for any [math]\displaystyle{ \tau_0 \in (-\infty,\infty) }[/math]:

[math]\displaystyle{ x(t) = e^{\pi(t-\tau_0)^2}\frac{1}{2\pi}\int_{-\infty}^\infty G_x(\tau_0,\omega) e^{j \omega t }\,d\omega }[/math]

Alternatively, all of the time components can be combined:

[math]\displaystyle{ x(t) = \int_{-\infty}^\infty \frac{1}{2\pi} \int_{-\infty}^\infty G_x(\tau,\omega) e^{j \omega t }\,d\omega\,d\tau }[/math]

Properties of the Gabor transform

The Gabor transform has many properties like those of the Fourier transform. These properties are listed in the following tables.

Signal Gabor transform Remarks
[math]\displaystyle{ x(t)\, }[/math] [math]\displaystyle{ G_x(\tau,\omega)=\int_{-\infty}^\infty x(t)e^{-\pi(t-\tau)^2}e^{-j \omega t}\,dt }[/math]
1 [math]\displaystyle{ a\cdot x(t) + b\cdot y(t)\, }[/math] [math]\displaystyle{ a\cdot G_x(\tau,\omega) + b\cdot G_y(\tau,\omega)\, }[/math] Linearity property
2 [math]\displaystyle{ x(t-t_0)\, }[/math] [math]\displaystyle{ G_x(\tau-t_0,\omega)e^{-j \omega t_0}\, }[/math] Shifting property
3 [math]\displaystyle{ x(t)e^{j\omega_0 t}\, }[/math] [math]\displaystyle{ G_x(\tau,\omega-\omega_0)\, }[/math] Modulation property
Remarks
1 [math]\displaystyle{ \int_{-\infty}^\infty \left| G_x(\tau,\omega) \right|^2\,d\omega = \int_{-\infty}^\infty \left| x(t) \right|^2 e^{-2\pi (t-\tau)^2} dt \approx \int_{\tau - 1.9143}^{\tau + 1.9143}\left| x(t) \right|^2 e^{-2\pi (t - \tau)^2} dt }[/math] Power integration property
2 [math]\displaystyle{ \int_{-\infty}^\infty \int_{-\infty}^\infty G_x(\tau ,\omega)G_y^*(\tau ,\omega)\,d\omega\,d\tau = \int_{-\infty}^\infty x(t)y^*(t)\, d\tau }[/math] Energy sum property
3 [math]\displaystyle{ \begin{cases} \displaystyle \int_{-\infty}^\infty \left| G_x(\tau,\omega) \right|^2d\omega \lt e^{-2\pi(t-t_0)^2}\int_{-\infty}^\infty \left| G_x(\tau_0,\omega) \right|^2\,d\omega; & \text{if } x(t) =0 \text{ for }t\gt t_0 \\[12pt] \displaystyle \int_{-\infty}^\infty \left| G_x(\tau,\omega) \right|^2\,d\tau \lt e^{-(\omega-\omega_0)^2}\int_{-\infty}^\infty \left| G_x(\tau,\omega_0) \right|^2\,d\tau; & \text{if } X(\omega) =FT[x(t)] = 0 \text{ for }\omega\gt \omega_0 \end{cases} }[/math] Power decay property
4 [math]\displaystyle{ \int_{-\infty}^\infty G_x(\tau,\omega) e^{j\omega t}\,d\omega = 2\pi e^{-\pi\tau^2} x(0) }[/math] Recovery property

Application and example

Time/frequency distribution.

The main application of the Gabor transform is used in time–frequency analysis. Take the following function as an example. The input signal has 1 Hz frequency component when t ≤ 0 and has 2 Hz frequency component when t > 0

[math]\displaystyle{ x(t) = \begin{cases} \cos(2\pi t) & \text{for } t \le 0, \\ \cos(4\pi t) & \text{for } t\gt 0. \end{cases} }[/math]

But if the total bandwidth available is 5 Hz, other frequency bands except x(t) are wasted. Through time–frequency analysis by applying the Gabor transform, the available bandwidth can be known and those frequency bands can be used for other applications and bandwidth is saved. The right side picture shows the input signal x(t) and the output of the Gabor transform. As was our expectation, the frequency distribution can be separated into two parts. One is t ≤ 0 and the other is t > 0. The white part is the frequency band occupied by x(t) and the black part is not used. Note that for each point in time there is both a negative (upper white part) and a positive (lower white part) frequency component.

Discrete Gabor-transformation

A discrete version of Gabor representation

[math]\displaystyle{ y(t)= \sum_{m = - \infty}^ \infty \sum_{n=- \infty}^ \infty C_{nm} \cdot g_{nm} (t) }[/math]

with [math]\displaystyle{ g_{nm} (t) = s(t-m \tau_0 ) \cdot e^{j\Omega nt} }[/math]

can be derived easily by discretizing the Gabor-basis-function in these equations. Hereby the continuous parameter t is replaced by the discrete time k. Furthermore, the now finite summation limit in Gabor representation has to be considered. In this way, the sampled signal y(k) is split into M time frames of length N. According to [math]\displaystyle{ \Omega \le \tfrac{2\pi}{\tau_0} }[/math], the factor Ω for critical sampling is [math]\displaystyle{ \Omega = \tfrac{2\pi}{N} }[/math].

Similar to the DFT (discrete Fourier transformation) a frequency domain split into N discrete partitions is obtained. An inverse transformation of these N spectral partitions then leads to N values y(k) for the time window, which consists of N sample values. For overall M time windows with N sample values, each signal y(k) contains K = N [math]\displaystyle{ \cdot }[/math] M sample values: (the discrete Gabor representation)

[math]\displaystyle{ y(k) = \sum_{m=0}^ {M-1} \sum_{n=0}^{N-1} C_{nm} \cdot g_{nm} (k) }[/math]

with [math]\displaystyle{ g_{nm} (k) = s(k-mN) \cdot e^{j\Omega nk} }[/math]

According to the equation above, the N [math]\displaystyle{ \cdot }[/math] M coefficients [math]\displaystyle{ C_{nm} }[/math] correspond to the number of sample values K of the signal.

For over-sampling [math]\displaystyle{ \Omega }[/math] is set to [math]\displaystyle{ \Omega \le \tfrac{2\pi}{N} = \tfrac{2\pi}{N^\prime} }[/math] with N′ > N, which results in N′ > N summation coefficients in the second sum of the discrete Gabor representation. In this case, the number of obtained Gabor-coefficients would be M[math]\displaystyle{ \cdot }[/math]N′ > K. Hence, more coefficients than sample values are available and therefore a redundant representation would be achieved.

Scaled Gabor transform

As in short time Fourier transform, the resolution in time and frequency domain can be adjusted by choosing different window function width. In Gabor transform cases, by adding variance [math]\displaystyle{ \sigma }[/math], as following equation:

The scaled (normalized) Gaussian window denotes as:

[math]\displaystyle{ W_{\text{gaussian}}(t) = e^{-\sigma \pi t^2} }[/math]

So the Scaled Gabor transform can be written as:

[math]\displaystyle{ G_x(t,f) = \sqrt[4]{\sigma}\textstyle \int_{-\infty}^{\infty} \displaystyle e^{-\sigma \pi(\tau -t)^2} e^{-j2\pi f\tau}x(\tau)d\tau \qquad }[/math]

With a large [math]\displaystyle{ \sigma }[/math], the window function will be narrow, causing higher resolution in time domain but lower resolution in frequency domain. Similarly, a small [math]\displaystyle{ \sigma }[/math] will lead to a wide window, with higher resolution in frequency domain but lower resolution in time domain.

Scale gabor simulation.png

Time-causal analogue of the Gabor transform

When processing temporal signals, data from the future cannot be accessed, which leads to problems if attempting to use Gabor functions for processing real-time signals. A time-causal analogue of the Gabor filter has been developed in [2] based on replacing the Gaussian kernel in the Gabor function with a time-causal and time-recursive kernel referred to as the time-causal limit kernel. In this way, time-frequency analysis based on the resulting complex-valued extension of the time-causal limit kernel makes it possible to capture essentially similar transformations of a temporal signal as the Gabor function can, and corresponding to the Heisenberg group, see [2] for further details.

See also

References

  1. E. Sejdić, I. Djurović, J. Jiang, “Time-frequency feature representation using energy concentration: An overview of recent advances,” Digital Signal Processing, vol. 19, no. 1, pp. 153-183, January 2009.
  2. 2.0 2.1 Lindeberg, T. (23 January 2023). "A time-causal and time-recursive scale-covariant scale-space representation of temporal signals and past time". Biological Cybernetics: 1–39. doi:10.1007/s00422-022-00953-6. 
  • D. Gabor, Theory of Communication, Part 1, J. Inst. of Elect. Eng. Part III, Radio and Communication, vol 93, p. 429 1946 (http://genesis.eecg.toronto.edu/gabor1946.pdf)
  • Jian-Jiun Ding, Time frequency analysis and wavelet transform class note, the Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan, 2007.