Pareto principle

From HandWiki
Short description: Statistical principle about ratio of effects to causes
The Pareto principle may apply to fundraising, i.e. 20% of the donors contributing towards 80% of the total.

The Pareto principle states that for many outcomes, roughly 80% of consequences come from 20% of causes (the "vital few").[1] Other names for this principle are the 80/20 rule, the law of the vital few or the principle of factor sparsity.[1][2]

In 1941, management consultant Joseph M. Juran developed the concept in the context of quality control and improvement after reading the works of Italian sociologist and economist Vilfredo Pareto, who wrote in 1906 about the 80/20 connection while teaching at the University of Lausanne.[3] In his first work, Cours d'économie politique, Pareto showed that approximately 80% of the land in the Kingdom of Italy was owned by 20% of the population. The Pareto principle is only tangentially related to the Pareto efficiency.

Mathematically, the 80/20 rule is roughly described by a power law distribution (also known as a Pareto distribution) for a particular set of parameters. Many natural phenomena distribute according to power law statistics.[4] It is an adage of business management that "80% of sales come from 20% of clients."[5]

History

In 1941, Joseph M. Juran, a Romanian-born American engineer, came across the work of Italian polymath Vilfredo Pareto. Pareto noted that approximately 80% of Italy's land was owned by 20% of the population.[6][4] Juran applied the observation that 80% of an issue is caused by 20% of the causes to quality issues. Later during his career, Juran preferred to describe this as "the vital few and the useful many" to highlight that the contribution of the remaining 80% should not be discarded entirely.[7]

Mathematical explanation

The demonstration of the Pareto principle is explained by a large proportion of process variation being associated with a small proportion of process variables.[2] This is a special case of the wider phenomenon of Pareto distributions. If the Pareto index α, which is one of the parameters characterizing a Pareto distribution, is chosen as α = log45 ≈ 1.16, then one has 80% of effects coming from 20% of causes.[8]

The term 80/20 is only a shorthand for the general principle at work. In individual cases, the distribution could be nearer to 90/10 or 70/30. There is also no need for the two numbers to add up to the number 100, as they are measures of different things. The Pareto principle is an illustration of a "power law" relationship, which also occurs in phenomena such as bush fires and earthquakes.[9] Because it is self-similar over a wide range of magnitudes, it produces outcomes completely different from Normal or Gaussian distribution phenomena. This fact explains the frequent breakdowns of sophisticated financial instruments, which are modeled on the assumption that a Gaussian relationship is appropriate to something like stock price movements.[10]

Gini coefficient and Hoover index

Using the "A:B" notation (for example, 0.8:0.2) and with A + B = 1, inequality measures like the Gini index (G) and the Hoover index (H) can be computed. In this case both are the same:

[math]\displaystyle{ H=G=|2A-1|=|1-2B| \, }[/math]
[math]\displaystyle{ A:B = \left( \frac{1+H} 2 \right): \left( \frac{1-H} 2 \right) }[/math]

In economics

Pareto's observation was in connection with population and wealth. Pareto noticed that approximately 80% of Italy's land was owned by 20% of the population.[6] He then carried out surveys on a variety of other countries and found to his surprise that a similar distribution applied.[citation needed]

A chart that demonstrated the effect appeared in the 1992 United Nations Development Program Report, which showed that the richest 20% of the world's population receives 82.7% of the world's income.[11] However, among nations, the Gini index shows that wealth distributions vary substantially around this norm.[12]

Distribution of world GDP, 1989[13]
Quintile of population Income
Richest 20% 82.70%
Second 20% 11.75%
Third 20% 2.30%
Fourth 20% 1.85%
Poorest 20% 1.40%

The principle also holds within the tails of the distribution. The physicist Victor Yakovenko of the University of Maryland, College Park and AC Silva analyzed income data from the US Internal Revenue Service from 1983 to 2001 and found that the income distribution of the richest 1–3% of the population also follows Pareto's principle.[14]

In computing

In computer science the Pareto principle can be applied to optimization efforts.[15] For example, Microsoft noted that by fixing the top 20% of the most-reported bugs, 80% of the related errors and crashes in a given system would be eliminated.[16] Lowell Arthur expressed that "20% of the code has 80% of the errors. Find them, fix them!"[17] It was also discovered that, in general, 80% of a piece of software can be written in 20% of the total allocated time. Conversely, the hardest 20% of the code takes 80% of the time. This factor is usually a part of COCOMO estimating for software coding.[citation needed]

Occupational health and safety

Occupational health and safety professionals use the Pareto principle to underline the importance of hazard prioritization. Assuming 20% of the hazards account for 80% of the injuries, and by categorizing hazards, safety professionals can target those 20% of the hazards that cause 80% of the injuries or accidents. Alternatively, if hazards are addressed in random order, a safety professional is more likely to fix one of the 80% of hazards that account only for some fraction of the remaining 20% of injuries.[18]

Aside from ensuring efficient accident prevention practices, the Pareto principle also ensures hazards are addressed in an economical order, because the technique ensures the utilized resources are best used to prevent the most accidents.[19]

Other applications

Engineering and quality control

The Pareto principle is the basis for the Pareto chart, one of the key tools used in total quality control and Six Sigma techniques. The Pareto principle serves as a baseline for ABC-analysis and XYZ-analysis, widely used in logistics and procurement for the purpose of optimizing stock of goods, as well as costs of keeping and replenishing that stock.[20] In engineering control theory, such as for electromechanical energy converters, the 80/20 principle applies to optimization efforts.[15]

The remarkable success of statistically based searches for root causes is based upon a combination of an empirical principle and mathematical logic. The empirical principle is usually known as the Pareto principle.[21] With regard to variation causality, this principle states that there is a non-random distribution of the slopes of the numerous (theoretically infinite) terms in the general equation.

All of the terms are independent of each other by definition. Interdependent factors appear as multiplication terms. The Pareto principle states that the effect of the dominant term is very much greater than the second-largest effect term, which in turn is very much greater than the third, and so on.[22] There is no explanation for this phenomenon; that is why we refer to it as an empirical principle.

The mathematical logic is known as the square-root-of-the-sum-of-the-squares axiom. This states that the variation caused by the steepest slope must be squared, and then the result added to the square of the variation caused by the second-steepest slope, and so on. The total observed variation is then the square root of the total sum of the variation caused by individual slopes squared. This derives from the probability density function for multiple variables or the multivariate distribution (we are treating each term as an independent variable).

The combination of the Pareto principle and the square-root-of-the-sum-of-the-squares axiom means that the strongest term in the general equation totally dominates the observed variation of effect. Thus, the strongest term will dominate the data collected for hypothesis testing.

In the systems science discipline, Joshua M. Epstein and Robert Axtell created an agent-based simulation model called Sugarscape, from a decentralized modeling approach, based on individual behavior rules defined for each agent in the economy. Wealth distribution and Pareto's 80/20 principle emerged in their results, which suggests the principle is a collective consequence of these individual rules.[23]

Health and social outcomes

In 2009, the Agency for Healthcare Research and Quality said 20% of patients incurred 80% of healthcare expenses due to chronic conditions.[24] A 2021 analysis showed unequal distribution of healthcare costs, with older patients and those with poorer health incurring more costs.[25] The 20/80 rule has been proposed as a rule of thumb for the infection distribution in superspreading events.[26][27] However, the degree of infectiousness has been found to be distributed continuously in the population.[27] In epidemics with super-spreading, the majority of individuals infect relatively few secondary contacts.

See also


References

  1. 1.0 1.1 Bunkley, Nick (March 3, 2008). "Joseph Juran, 103, Pioneer in Quality Control, Dies". The New York Times. https://www.nytimes.com/2008/03/03/business/03juran.html. 
  2. 2.0 2.1 Box, George E.P.; Meyer, R. Daniel (1986). "An Analysis for Unreplicated Fractional Factorials". Technometrics 28 (1): 11–18. doi:10.1080/00401706.1986.10488093. 
  3. Pareto, Vilfredo (1896–1897). Cours d'Économie Politique (in two volumes). F. Rouge (Lausanne) & F. Pichon (Paris).  Volume 1 Volume 2
  4. 4.0 4.1 Newman, MEJ (2005). "Power laws, Pareto Distributions, and Zipf's law". Contemporary Physics 46 (5): 323–351. doi:10.1080/00107510500052444. Bibcode2005ConPh..46..323N. https://arxiv.org/PS_cache/cond-mat/pdf/0412/0412004v3.pdf. Retrieved 10 April 2011. 
  5. Marshall, Perry (2013-10-09). "The 80/20 Rule of Sales: How to Find Your Best Customers" (in en). Entrepreneur. https://www.entrepreneur.com/article/229294. 
  6. 6.0 6.1 Pareto, Vilfredo; Page, Alfred N. (1971), Translation of Manuale di economia politica ("Manual of political economy"), A.M. Kelley, ISBN 978-0-678-00881-2 
  7. "Pareto Principle (80/20 Rule) & Pareto Analysis Guide" (in en-US). 2019-03-12. https://www.juran.com/blog/a-guide-to-the-pareto-principle-80-20-rule-pareto-analysis/. 
  8. Dunford (2014), "The Pareto Principle", The Plymouth Student Scientist, https://pearl.plymouth.ac.uk/bitstream/handle/10026.1/14054/TPSS-2014-Vol7n1_140-148Dunford.pdf 
  9. Bak, Per (1999), How Nature Works: the science of self-organized criticality, Springer, p. 89, ISBN 0-387-94791-4 
  10. Taleb, Nassim (2007), The Black Swan, pp. 229–252, 274–285 
  11. United Nations Development Program (1992), 1992 Human Development Report, New York: Oxford University Press 
  12. Hillebrand, Evan (June 2009). "Poverty, Growth, and Inequality over the Next 50 Years". FAO, United Nations – Economic and Social Development Department. ftp://ftp.fao.org/docrep/fao/012/ak968e/ak968e00.pdf. 
  13. Human Development Report 1992, Chapter 3, http://hdr.undp.org/en/reports/global/hdr1992/chapters/, retrieved 2007-07-08 
  14. Yakovenko, Victor M.; Silva, A. Christian (2005), Chatterjee, Arnab; Yarlagadda, Sudhakar; Chakrabarti, Bikas K., eds., "Two-class Structure of Income Distribution in the USA: Exponential Bulk and Power-law Tail" (in en), Econophysics of Wealth Distributions: Econophys-Kolkata I, New Economic Windows (Springer Milan): pp. 15–23, doi:10.1007/88-470-0389-x_2, ISBN 978-88-470-0389-7 
  15. 15.0 15.1 Gen, M.; Cheng, R. (2002), Genetic Algorithms and Engineering Optimization, New York: Wiley 
  16. Rooney, Paula (October 3, 2002), Microsoft's CEO: 80–20 Rule Applies To Bugs, Not Just Features, ChannelWeb, http://www.crn.com/news/security/18821726/microsofts-ceo-80-20-rule-applies-to-bugs-not-just-features.htm 
  17. Pressman, Roger S. (2010). Software Engineering: A Practitioner's Approach (7th ed.). Boston, Mass: McGraw-Hill, 2010. ISBN:978-0-07-337597-7.
  18. Woodcock, Kathryn (2010). Safety Evaluation Techniques. Toronto, ON: Ryerson University. pp. 86. http://www.ryerson.ca/woodcock/. 
  19. "Introduction to Risk-based Decision-Making". USCG Safety Program. United States Coast Guard. http://www.uscg.mil/hq/cg5/cg5211/docs/RBDM_Files/PDF/RBDM_Guidelines/Volume%202/Volume%202-Chapter%206.pdf. 
  20. (Rushton Oxley), pp. 107–108.
  21. Juran, Joseph M., Frank M. Gryna, and Richard S. Bingham. Quality control handbook. Vol. 3. New York: McGraw-Hill, 1974.
  22. Shainin, Richard D. “Strategies for Technical Problem Solving.” 1992, Quality Engineering, 5:3, 433-448
  23. Epstein, Joshua; Axtell, Robert (1996), Growing Artificial Societies: Social Science from the Bottom-Up, MIT Press, p. 208, ISBN 0-262-55025-3, https://books.google.com/books?id=xXvelSs2caQC 
  24. Weinberg, Myrl (July 27, 2009). "Myrl Weinberg: In health-care reform, the 20-80 solution". http://www.projo.com/opinion/contributors/content/CT_weinberg27_07-27-09_HQF0P1E_v15.3f89889.html. 
  25. Sawyer, Bradley; Claxton, Gary. "How do health expenditures vary across the population?". Peterson Center on Healthcare and the Kaiser Family Foundation. https://www.healthsystemtracker.org/chart-collection/health-expenditures-vary-across-population/#item-discussion-of-health-spending-often-focus-on-averages-but-a-small-share-of-the-population-incurs-most-of-the-cost_2016. 
  26. Galvani, Alison P.; May, Robert M. (2005). "Epidemiology: Dimensions of superspreading". Nature 438 (7066): 293–295. doi:10.1038/438293a. PMID 16292292. Bibcode2005Natur.438..293G. 
  27. 27.0 27.1 Lloyd-Smith, JO; Schreiber, SJ; Kopp, PE; Getz, WM (2005). "Superspreading and the effect of individual variation on disease emergence". Nature 438 (7066): 355–359. doi:10.1038/nature04153. PMID 16292310. Bibcode2005Natur.438..355L. 

Further reading

External links