Block transform

From HandWiki
Revision as of 19:32, 6 March 2023 by OrgMain (talk | contribs) (simplify)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Wavelet packet bases are designed by dividing the frequency axis in intervals of varying sizes. These bases are particularly well adapted to decomposing signals that have different behavior in different frequency intervals. If [math]\displaystyle{ f }[/math] has properties that vary in time, it is then more appropriate to decompose [math]\displaystyle{ f }[/math] in a block basis that segments the time axis in intervals with sizes that are adapted to the signal structures.

Block Bases

Block orthonormal bases are obtained by dividing the time axis in consecutive intervals [math]\displaystyle{ [ a_p,a_{p+1}] }[/math] with

[math]\displaystyle{ \lim_{p \to -\infty}a_p=-\infty }[/math] and [math]\displaystyle{ \lim_{p \to \infty}a_p=\infty }[/math].

The size [math]\displaystyle{ l_p =a_{p+1}-a_p }[/math] of each interval is arbitrary. Let [math]\displaystyle{ g=1_{[0,1]} }[/math]. An interval is covered by the dilated rectangular window

[math]\displaystyle{ g_p(t)=1_{[a_p,a_{p+1}]}(t)=g({t-a_p \over l_p}). }[/math]

Theorem 1. constructs a block orthogonal basis of [math]\displaystyle{ L^2(\mathbb{R}) }[/math] from a single orthonormal basis of [math]\displaystyle{ L^2[0,1] }[/math].

Theorem 1.

if [math]\displaystyle{ \{e_k\}_{k\in \mathbb{Z}} }[/math] is an orthonormal basis of [math]\displaystyle{ L^2[0,1] }[/math], then

[math]\displaystyle{ \{g_{p,k}(t)=g_p(t)\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}} }[/math]

is a block orthonormal basis of [math]\displaystyle{ L^2(\mathbb{R}) }[/math]

Proof

One can verify that the dilated and translated family

[math]\displaystyle{ \{\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}} }[/math]

is an orthonormal basis of [math]\displaystyle{ L^2[ a_p,a_{p+1}] }[/math]. If [math]\displaystyle{ p\neq q }[/math], then [math]\displaystyle{ \langle g_{p,k},g_{q,k}\rangle =0 }[/math] since their supports do not overlap. Thus, the family [math]\displaystyle{ \{g_{p,k}(t)=g_p(t)\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}} }[/math] is orthonormal. To expand a signal [math]\displaystyle{ f }[/math] in this family, it is decomposed as a sum of separate blocks

[math]\displaystyle{ f(t)=\sum_{p=-\infty}^{+\infty}f(t)g_p(t), }[/math]

and each block [math]\displaystyle{ f(t)g_p(t) }[/math] is decomposed in the basis [math]\displaystyle{ \{\frac{1}{\sqrt{l_p}}e_k(\frac{t-a_p}{l_p})\}_{(p,k)\in \mathbb{Z}} }[/math]

Block Fourier Basis

A block basis is constructed with the Fourier basis of [math]\displaystyle{ L^2[0,1] }[/math]:

[math]\displaystyle{ \{e_k(t)=exp(i2k\pi t)\}_{k\in \mathbb{Z}} }[/math]

The time support of each block Fourier vector [math]\displaystyle{ g_{p,k} }[/math] is [math]\displaystyle{ [ a_p,a_{p+1}] }[/math] of size [math]\displaystyle{ l_p }[/math]. The Fourier transform of [math]\displaystyle{ g = 1_{[0,1]} }[/math] is

[math]\displaystyle{ \hat{g}(w)=\frac{\sin(w/2)}{w/2}exp(\frac{iw}{2}) }[/math]

and

[math]\displaystyle{ \hat{g}_{p,k}(w)=\sqrt{l_p}\hat{g}(l_pw-2k\pi)exp(\frac{-i2\pi ka_p}{l_P}). }[/math]

It is centered at [math]\displaystyle{ 2k\pi l_p^{-1} }[/math] and has a slow asymptotic decay proportional to [math]\displaystyle{ l_p^-1\left\vert w \right\vert^{-1}. }[/math] Because of this poor frequency localization, even though a signal [math]\displaystyle{ f }[/math] is smooth, its decomposition in a block Fourier basis may include large high-frequency coefficients. This can also be interpreted as an effect of periodization.

Discrete Block Bases

For all [math]\displaystyle{ p \in \mathbb{Z} }[/math], suppose that [math]\displaystyle{ a_p \in \mathbb{Z} }[/math]. Discrete block bases are built with discrete rectangular windows having supports on intervals [math]\displaystyle{ [ a_p,a_{p-1}] }[/math]:

[math]\displaystyle{ g_p[n]=1_{[a_p,a_{p+1}-1]}(n) }[/math].

Since dilations are not defined in a discrete framework, bases of intervals of varying sizes from a single basis cannot generally be derived. Thus, Theorem 2 supposes an orthonormal basis of [math]\displaystyle{ \mathbb{C}^l }[/math] for any [math]\displaystyle{ l\gt 0 }[/math] can be constructed. The proof is:

Theorem 2.

Suppose that [math]\displaystyle{ \{e_{k,l}\}_{0\leqslant k\lt l} }[/math] is an orthogonal basis of [math]\displaystyle{ \mathbb{C}^l }[/math] for any [math]\displaystyle{ l\gt 0 }[/math]. The family

[math]\displaystyle{ \{g_{p,k}[n]=g_p[n]e_{k,l_p}[n-a_p]\}_{0\leqslant k\lt l_p,p\in \mathbb{Z}} }[/math]

is a block orthonormal basis of [math]\displaystyle{ l^2(\mathbb{Z}) }[/math].

A discrete block basis is constructed with discrete Fourier bases

[math]\displaystyle{ \{e_{k,l[n]}=\frac{1}{\sqrt{l}}exp(\frac{i2\pi kn}{l})\}_{0\leqslant k\lt l} }[/math]

The resulting block Fourier vectors [math]\displaystyle{ g_{p,k} }[/math] have sharp transitions at the window border, and thus are not well localized in frequency. As in the continuous case, the decomposition of smooth signals [math]\displaystyle{ f }[/math] may produce large-amplitude, high-frequency coefficients because of border effects.

Block Bases of Images

General block bases of images are constructed by partitioning the plane [math]\displaystyle{ \mathbb{R}^2 }[/math] into rectangles [math]\displaystyle{ \{[a_p,b_p]\times [c_p,d_p]\}_{p\in \mathbb{Z}} }[/math] of arbitrary length [math]\displaystyle{ l_p = b_p-a_p }[/math] and width [math]\displaystyle{ w_p=d_p-c_p }[/math]. Let [math]\displaystyle{ \{e_k\}_{k\in\mathbb{Z}} }[/math] be an orthonormal basis of [math]\displaystyle{ L^2[0,1] }[/math] and [math]\displaystyle{ g = 1_{[0,1]} }[/math]. The following can be denoted:

[math]\displaystyle{ g_{p,k,j}(x,y)=g(\frac{x-a_p}{l_p})g(\frac{y-c_p}{w_p})\frac{1}{\sqrt{l_pw_p}}e_k(\frac{x-a_p}{l_p})e_j(\frac{y-c_p}{w_p}) }[/math].

The family [math]\displaystyle{ \{g_{p,k,j}\}_{(p,k,j)\in\mathbb{Z}^3} }[/math] is an orthonormal basis of [math]\displaystyle{ L^2(\mathbb{R}^2) }[/math].

For discrete images, discrete windows that cover each rectangle can be defined

[math]\displaystyle{ g_p=1_{[a_p,b_{p}-1]\times [c_p,d_p-1]} }[/math].

If [math]\displaystyle{ \{e_{k,l}\}_{0\leqslant k\lt l} }[/math] is an orthogonal basis of [math]\displaystyle{ \mathbb{C}^l }[/math] for any [math]\displaystyle{ l\gt 0 }[/math], then

[math]\displaystyle{ \{g_{p,k,j}[n_1,n_2]=g_p[n_1,n_2]e_{k,l_p}[n_1-a_p]e_{j,w_p}[n_2-c_p]\}_{(k,j,p)\in\mathbb(Z)^3} }[/math]

is a block basis of [math]\displaystyle{ l^2(\mathbb{R}^2) }[/math]

References

  1. St´ephane Mallat, A Wavelet Tour of Signal Processing, 3rd