Probability of direction

From HandWiki
Short description: Mathematical index used in Bayesian statistics

In Bayesian statistics, the probability of direction (pd) is a measure of effect existence representing the certainty with which an effect is positive or negative.[1] This index is numerically similar to the frequentist p-value.[2][3]

Definition

It is mathematically defined as the proportion of the posterior distribution that is of the median's sign. It typically varies between 50% and 100%.[4]

History

The original formulation of this index and its usage in Bayesian statistics can be found in the psycho software documentation by Dominique Makowski under the appellation Maximum Probability of Effect (MPE).[5][6] It was later renamed Probability of Direction and implemented in the easystats collection of software. Similar formulations have also been described in the context of bootstrapped parameters interpretation.[citation needed]

Properties

The probability of direction is typically independent of the statistical model, as it is solely based on the posterior distribution and does not require any additional information from the data or the model. Contrary to indices related to the Region of Practical Interest (ROPE), it is robust to the scale of both the response variable and the predictors. However, similarly to its frequentist counterpart - the p-value, this index is not able to quantify evidence in favor of the null hypothesis.[2][7] Advantages and limitations of the probability of direction have been studied by comparing it to other indices including the Bayes factor or Bayesian Equivalence test.[2][8][4][9]

Relationship with p-value

The probability of direction has a direct correspondence with the frequentist one-sided p-value through the formula [math]\displaystyle{ p_\text{one-sided} = 1 - pd }[/math] and to the two-sided p-value through the formula [math]\displaystyle{ p_\text{two-sided} = 2 \left(1 - pd\right) }[/math]. Thus, a two-sided p-value of respectively .1, .05, .01 and .001 would correspond approximately to a pd of 95%, 97.5%, 99.5% and 99.95%.[10] The proximity between the pd and the p-value is in line with the interpretation of the former as an index of effect existence, as it follows the original definition of the p-value.[11][12]

Interpretation

The bayestestR package for R suggests the following rule of thumb guidelines:[13]

! pd !! p-value equivalence !! Interpretation |- | [math]\displaystyle{ \leq 95\% }[/math] || [math]\displaystyle{ p \gt .1 }[/math] || Uncertain |- | [math]\displaystyle{ \gt 95\% }[/math] || [math]\displaystyle{ p \lt .1 }[/math] || Possibly existing |- | [math]\displaystyle{ \gt 97\% }[/math] || [math]\displaystyle{ p \lt .06 }[/math] || Likely existing |- | [math]\displaystyle{ \gt 99\% }[/math] || [math]\displaystyle{ p \lt .02 }[/math] || Probably existing |- | [math]\displaystyle{ \gt 99.9\% }[/math] || [math]\displaystyle{ p \lt .002 }[/math] || Certainly existing |- |}

See also

References

  1. Makowski, Dominique; Ben-Shachar, Mattan; Lüdecke, Daniel (13 August 2019). "bayestestR: Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework". Journal of Open Source Software 4 (40): 1541. doi:10.21105/joss.01541. Bibcode2019JOSS....4.1541M. 
  2. 2.0 2.1 2.2 Makowski, Dominique; Ben-Shachar, Mattan S.; Chen, S. H. Annabel; Lüdecke, Daniel (10 December 2019). "Indices of Effect Existence and Significance in the Bayesian Framework". Frontiers in Psychology 10: 2767. doi:10.3389/fpsyg.2019.02767. PMID 31920819. 
  3. Heiss, Andrew. "Bayesian statistics resources" (in en-us). https://evalf21.classes.andrewheiss.com/resource/bayes/#probability-of-direction. 
  4. 4.0 4.1 Kelter, Riko (December 2020). "Analysis of Bayesian posterior significance and effect size indices for the two-sample t-test to support reproducible medical research". BMC Medical Research Methodology 20 (1): 88. doi:10.1186/s12874-020-00968-2. PMID 32321438. 
  5. Makowski, Dominique (5 February 2018). "The psycho Package: an Efficient and Publishing-Oriented Workflow for Psychological Science". The Journal of Open Source Software 3 (22): 470. doi:10.21105/joss.00470. Bibcode2018JOSS....3..470M. 
  6. Makowski, Dominique. "psycho - The Bayesian Framework". https://cran.r-hub.io/web/packages/psycho/vignettes/bayesian.html. 
  7. Kelter, Riko (28 September 2021). "How to Choose between Different Bayesian Posterior Indices for Hypothesis Testing in Practice". Multivariate Behavioral Research 58 (1): 160–188. doi:10.1080/00273171.2021.1967716. PMID 34582284. 
  8. Baig, Sabeeh A (22 October 2021). "Bayesian Inference: Parameter Estimation for Inference in Small Samples". Nicotine & Tobacco Research 24 (6): 937–941. doi:10.1093/ntr/ntab221. PMID 34679175. 
  9. Kelter, Riko (2021). "Bayesian and frequentist testing for differences between two groups with parametric and nonparametric two-sample tests". WIREs Computational Statistics 13 (6): e1523. doi:10.1002/wics.1523. 
  10. "BayestestR - Probability of Direction" (in en). https://easystats.github.io/bayestestR/reference/p_direction.html. 
  11. Fisher, R. A. (1925). Statistical methods for research workers (11th ed. rev.). Edinburgh: Oliver and Boyd. 
  12. Cohen, Jacob (1994). "The earth is round (p < .05).". American Psychologist 49 (12): 997–1003. doi:10.1037/0003-066X.49.12.997. 
  13. "Bayesian Reporting Guidelines" (in en). https://easystats.github.io/bayestestR/articles/guidelines.html. 

External links

  • bayestestR — an R package for computing Bayesian indices