Shrinkage Fields (image restoration)

From HandWiki

Shrinkage fields is a random field-based machine learning technique that aims to perform high quality image restoration (denoising and deblurring) using low computational overhead.

Method

The restored image x is predicted from a corrupted observation y after training on a set of sample images S.

A shrinkage (mapping) function fπi(v)=j=1Mπi,jexp(γ2(vμj)2) is directly modeled as a linear combination of radial basis function kernels, where γ is the shared precision parameter, μ denotes the (equidistant) kernel positions, and M is the number of Gaussian kernels.

Because the shrinkage function is directly modeled, the optimization procedure is reduced to a single quadratic minimization per iteration, denoted as the prediction of a shrinkage field gΘ(x)=1[(λKTy+i=1NFiTfπi(Fix))λKˇ*Kˇ+i=1NFˇi*Fˇi]=Ω1η where denotes the discrete Fourier transform and Fx is the 2D convolution fx with point spread function filter, F˘ is an optical transfer function defined as the discrete Fourier transform of f, and F˘* is the complex conjugate of F˘.

x^t is learned as x^t=gΘt(x^t1) for each iteration t with the initial case x^0=y, this forms a cascade of Gaussian conditional random fields (or cascade of shrinkage fields (CSF)). Loss-minimization is used to learn the model parameters Θt={λt,πti,fti}i=1N.

The learning objective function is defined as J(Θt)=s=1Sl(x^t(s);xgt(s)), where l is a differentiable loss function which is greedily minimized using training data {xgt(s),y(s),k(s)}s=1S and x^t(s).

Performance

Preliminary tests by the author suggest that RTF5[1] obtains slightly better denoising performance than CSF7×7{3,4,5}, followed by CSF5×55, CSF7×72, CSF5×5{3,4}, and BM3D.

BM3D denoising speed falls between that of CSF5×54 and CSF7×74, RTF being an order of magnitude slower.

Advantages

  • Results are comparable to those obtained by BM3D (reference in state of the art denoising since its inception in 2007)
  • Minimal runtime compared to other high-performance methods (potentially applicable within embedded devices)
  • Parallelizable (e.g.: possible GPU implementation)
  • Predictability: O(DlogD) runtime where D is the number of pixels
  • Fast training even with CPU

Implementations

  • A reference implementation has been written in MATLAB and released under the BSD 2-Clause license: shrinkage-fields

See also

References

  1. Jancsary, Jeremy; Nowozin, Sebastian; Sharp, Toby; Rother, Carsten (10 April 2012). "Regression Tree Fields – An Efficient, Non-parametric Approach to Image Labeling Problems". IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). Providence, RI, USA: IEEE Computer Society. doi:10.1109/CVPR.2012.6247950.