Trimean

From HandWiki

In statistics the trimean (TM), or Tukey's trimean, is a measure of a probability distribution's location defined as a weighted average of the distribution's median and its two quartiles:

[math]\displaystyle{ TM= \frac{Q_1 + 2Q_2 + Q_3}{4} }[/math]

This is equivalent to the average of the median and the midhinge:

[math]\displaystyle{ TM= \frac{1}{2}\left(Q_2 + \frac{Q_1 + Q_3}{2}\right) }[/math]

The foundations of the trimean were part of Arthur Bowley's teachings, and later popularized by statistician John Tukey in his 1977 book[1] which has given its name to a set of techniques called exploratory data analysis.

Like the median and the midhinge, but unlike the sample mean, it is a statistically resistant L-estimator with a breakdown point of 25%. This beneficial property has been described as follows:

An advantage of the trimean as a measure of the center (of a distribution) is that it combines the median's emphasis on center values with the midhinge's attention to the extremes.
—Herbert F. Weisberg, Central Tendency and Variability[2]

Efficiency

Despite its simplicity, the trimean is a remarkably efficient estimator of population mean. More precisely, for a large data set (over 100 points) from a symmetric population, the average of the 20th, 50th, and 80th percentile is the most efficient 3-point L-estimator, with 88% efficiency.[3] For context, the best single point estimate by L-estimators is the median, with an efficiency of 64% or better (for all n), while using two points (for a large data set of over 100 points from a symmetric population), the most efficient estimate is the 29% midsummary (mean of 29th and 71st percentiles), which has an efficiency of about 81%. Using quartiles, these optimal estimators can be approximated by the midhinge and the trimean. Using further points yield higher efficiency, though it is notable that only three points are needed for very high efficiency.

See also

References

  1. Tukey, John Wilder (1977). Exploratory Data Analysis. Addison-Wesley. ISBN 0-201-07616-0. https://archive.org/details/exploratorydataa00tuke_0. 
  2. Weisberg, H. F. (1992). Central Tendency and Variability. Sage University. ISBN:0-8039-4007-6 (p. 39)
  3. Evans 1955, Appendix G: Inefficient statistics, pp. 902–904.

External links