Rasta filtering

From HandWiki
Revision as of 21:31, 6 February 2024 by NBrushPhys (talk | contribs) (linkage)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

RASTA filtering and mean subtraction was introduced to support perceptual linear prediction (PLP) preprocessing. It uses bandpass filtering in the log spectral domain. Rasta filtering then removes slow channel variations. It has also been applied to cepstrum feature-based preprocessing with both log spectral and cepstral domain filtering.

In general a RASTA filter is defined by

[math]\displaystyle{ T(z) = ( k * \sum (n-(N-1) / 2) * z^{-n}) / (1-\rho/x) \,\! }[/math]

The numerator is a regression filter with N being the order (must be odd) and the denominator is an integrator with time decay. The pole controls the lower limit of frequency and is normally around 0.9. RASTA-filtering can be changed to use mean subtraction, implementing a moving average filter. Filtering is normally performed in the cepstral domain. The mean becomes the long term cepstrum and is typically computed on the speech part for each separate utterance. A silence is necessary to detect each utterance.

References