Concordance correlation coefficient

From HandWiki

In statistics, the concordance correlation coefficient measures the agreement between two variables, e.g., to evaluate reproducibility or for inter-rater reliability.

Definition

The form of the concordance correlation coefficient [math]\displaystyle{ \rho_c }[/math] as[1]

[math]\displaystyle{ \rho_c = \frac{2\rho\sigma_x\sigma_y}{\sigma_x^2 + \sigma_y^2 + (\mu_x - \mu_y)^2}, }[/math]

where [math]\displaystyle{ \mu_x }[/math] and [math]\displaystyle{ \mu_y }[/math] are the means for the two variables and [math]\displaystyle{ \sigma^2_x }[/math] and [math]\displaystyle{ \sigma^2_y }[/math] are the corresponding variances. [math]\displaystyle{ \rho }[/math] is the correlation coefficient between the two variables.

This follows from its definition[1] as

[math]\displaystyle{ \rho_c = 1 - \frac{{\rm Expected\ orthogonal\ squared\ distance\ from\ the\ diagonal\ }x=y} {{\rm Expected\ orthogonal\ squared\ distance\ from\ the\ diagonal\ }x=y{\rm \ assuming\ independence}}. }[/math]

When the concordance correlation coefficient is computed on a [math]\displaystyle{ N }[/math]-length data set (i.e., [math]\displaystyle{ N }[/math] paired data values [math]\displaystyle{ (x_n, y_n) }[/math], for [math]\displaystyle{ n=1,...,N }[/math]), the form is

[math]\displaystyle{ \hat{\rho}_c = \frac{2 s_{xy}}{s_x^2 + s_y^2 + (\bar{x} - \bar{y})^2}, }[/math]

where the mean is computed as

[math]\displaystyle{ \bar{x} = \frac{1}{N} \sum_{n=1}^N x_n }[/math]

and the variance

[math]\displaystyle{ s_x^2 = \frac{1}{N} \sum_{n=1}^N (x_n - \bar{x})^2 }[/math]

and the covariance

[math]\displaystyle{ s_{xy} = \frac{1}{N} \sum_{n=1}^N (x_n - \bar{x})(y_n - \bar{y}) . }[/math]

Whereas the ordinary correlation coefficient (Pearson's) is immune to whether the biased or unbiased versions for estimation of the variance is used, the concordance correlation coefficient is not. In the original article Lin suggested the 1/N normalization,[1] while in another article Nickerson appears to have used the 1/(N-1),[2] i.e., the concordance correlation coefficient may be computed slightly differently between implementations.

Relation to other measures of correlation

The concordance correlation coefficient is nearly identical to some of the measures called intra-class correlations. Comparisons of the concordance correlation coefficient with an "ordinary" intraclass correlation on different data sets found only small differences between the two correlations, in one case on the third decimal.[2] It has also been stated[3] that the ideas for concordance correlation coefficient "are quite similar to results already published by Krippendorff[4] in 1970".

In the original article[1] Lin suggested a form for multiple classes (not just 2). Over ten years later a correction to this form was issued.[5]

One example of the use of the concordance correlation coefficient is in a comparison of analysis method for functional magnetic resonance imaging brain scans.[6]

References

  1. 1.0 1.1 1.2 1.3 "A concordance correlation coefficient to evaluate reproducibility". Biometrics 45 (1): 255–268. March 1989. doi:10.2307/2532051. PMID 2720055. 
  2. 2.0 2.1 Carol A. E. Nickerson (December 1997). "A Note on "A Concordance Correlation Coefficient to Evaluate Reproducibility". Biometrics 53 (4): 1503–1507. doi:10.2307/2533516. 
  3. Reinhold Müller; Petra Büttner (December 1994). "A critical discussion of intraclass correlation coefficients". Statistics in Medicine 13 (23–24): 2465–2476. doi:10.1002/sim.4780132310. PMID 7701147. 
  4. Klaus Krippendorff (1970). E. F. Borgatta. ed. "Bivariate Agreement Coefficients for Reliability of Data". Sociological Methodology (San Francisco: Jossey-Bass) 2: 139–150. doi:10.2307/270787. 
  5. Lawrence I-Kuei Lin (March 2000). "A Note on the Concordance Correlation Coefficient". Biometrics 56: 324–325. doi:10.1111/j.0006-341X.2000.00324.x. 
  6.  , Wikidata Q21012624

For a small Excel and VBA implementation by Peter Urbani see here