Rank SIFT

From HandWiki

Rank SIFT algorithm is the revised SIFT (Scale-invariant feature transform) algorithm which uses ranking techniques to improve the performance of the SIFT algorithm. In fact, ranking techniques can be used in key point localization or descriptor generation of the original SIFT algorithm.

SIFT With Ranking Techniques

Ranking the Key Point

Ranking techniques can be used to keep certain number of key points which are detected by SIFT detector.[1]

Suppose [math]\displaystyle{ \left \{ I_m, m=0,1,...M \right \} }[/math] is a training image sequence and [math]\displaystyle{ p }[/math] is a key point obtained by SIFT detector. The following equation determines the rank of [math]\displaystyle{ p }[/math] in the key point set. Larger value of [math]\displaystyle{ R(p) }[/math] corresponds to the higher rank of [math]\displaystyle{ p }[/math].

[math]\displaystyle{ R( p \in I_0)=\sum_m I(\min_{ q \in I_m}{\lVert H_m(p)-q \rVert}_2 \lt \epsilon), }[/math]

where [math]\displaystyle{ I(.) }[/math] is the indicator function, [math]\displaystyle{ H_m }[/math] is the homography transformation from [math]\displaystyle{ I_0 }[/math] to [math]\displaystyle{ I_m }[/math], and [math]\displaystyle{ \epsilon }[/math] is the threshold.

Suppose [math]\displaystyle{ x_i }[/math] is the feature descriptor of key point [math]\displaystyle{ p_i }[/math] defined above. So [math]\displaystyle{ x_i }[/math] can be labeled with the rank of [math]\displaystyle{ p_i }[/math] in the feature vector space. Then the vector set [math]\displaystyle{ X_{feature space}=\left \{ \vec x_1, \vec x_2,...\right \} }[/math] containing labeled elements can be used as a training set for the Ranking SVM[2] problem.

The learning process can be represented as follows:

[math]\displaystyle{ \begin{array}{lcl} minimize: V(\vec w) = {1 \over 2} \vec w \cdot \vec w \\ s.t.\\ \begin{array}{lcl} \forall\ \vec x_i\ and\ \vec x_j \in X_{feature space},\\ \vec w^T(\vec x_i -\vec x_j)\geqq 1\quad if\ R(p_i \in I_0 )\gt R(p_j \in I_0). \end{array} \end{array} }[/math]

The obtained optimal [math]\displaystyle{ \vec w^* }[/math] can be used to order the future key points.

Ranking the Elements of Descriptor

Ranking techniques also can be used to generate the key point descriptor.[3]

Suppose [math]\displaystyle{ {\vec X} = \left \{ x_1,...,x_N \right \} }[/math] is the feature vector of a key point and the elements of [math]\displaystyle{ {R} = \left \{r_1,...r_N \right \} }[/math] is the corresponding rank of [math]\displaystyle{ x_i }[/math] in [math]\displaystyle{ X }[/math]. [math]\displaystyle{ r_i }[/math] is defined as follows:

[math]\displaystyle{ r_i = \left\vert \left \{x_k:x_k \geqq x_i \right \} \right \vert . }[/math]

After transforming original feature vector [math]\displaystyle{ \vec X }[/math] to the ordinal descriptor [math]\displaystyle{ \vec R }[/math], the difference between two ordinal descriptors can be evaluated in the following two measurements.

  • The Spearman correlation coefficient

The Spearman correlation coefficient also refers to Spearman's rank correlation coefficient. For two ordinal descriptors [math]\displaystyle{ \vec R }[/math] and [math]\displaystyle{ \vec R^' }[/math], it can be proved that

[math]\displaystyle{ \rho(\vec R, \vec R^') = 1- {6\sum_{i=1}^N(r_i-r_i^')^2 \over N(N^2-1)} }[/math]

  • The Kendall's Tau

The Kendall's Tau also refers to Kendall tau rank correlation coefficient. In the above case, the Kendall's Tau between [math]\displaystyle{ R }[/math] and [math]\displaystyle{ R^' }[/math] is

[math]\displaystyle{ \tau(\vec R, \vec R^')= {2\sum_{i=1}^N\sum_{j=i+1}^Ns(r_i-r_j, r_i^'-r_j^')\over N(N-1)}, }[/math]

[math]\displaystyle{ where \quad s(a,b) = \begin{cases} 1, & \text{if } sign(a) = sign(b) \\ -1, & o.w. \end{cases} }[/math]

References

  1. Bing Li; Rong Xiao; Zhiwei Li; Rui Cai; Bao-Liang Lu; Lei Zhang; "Rank-SIFT: Learning to rank repeatable local interest points", Computer Vision and Pattern Recognition (CVPR), 2011
  2. Joachims, T. (2003), "Optimizing Search Engines using Clickthrough Data", Proceedings of the ACM Conference on Knowledge Discovery and Data Mining
  3. Toews, M.; Wells, W."SIFT-Rank: Ordinal Description for Invariant Feature Correspondence", Computer Vision and Pattern Recognition, 2009.