# Mediation (statistics)

__: Statistical model__

**Short description**In statistics, a **mediation** model seeks to identify and explain the mechanism or process that underlies an observed relationship between an independent variable and a dependent variable via the inclusion of a third hypothetical variable, known as a **mediator variable** (also a **mediating variable**, **intermediary variable**, or **intervening variable**).^{[1]} Rather than a direct causal relationship between the independent variable and the dependent variable, which is often false, a mediation model proposes that the independent variable influences the mediator variable, which in turn influences the dependent variable. Thus, the mediator variable serves to clarify the nature of the relationship between the independent and dependent variables.^{[2]}^{[3]}

Mediation analyses are employed to understand a known relationship by exploring the underlying mechanism or process by which one variable influences another variable through a mediator variable.^{[4]} In particular, mediation analysis can contribute to better understanding the relationship between an independent variable and a dependent variable when these variables do not have an obvious direct connection.

## Baron and Kenny's (1986) steps for mediation analysis

Baron and Kenny (1986) laid out several requirements that must be met to form a true mediation relationship.^{[5]} They are outlined below using a real-world example. See the diagram above for a visual representation of the overall mediating relationship to be explained. The original steps are as follows.

### Step 1

- Regress the dependent variable on the independent variable to confirm that the independent variable is a significant predictor of the dependent variable.

- Independent variable [math]\displaystyle{ \to }[/math] dependent variable
- [math]\displaystyle{ Y=\beta_{10} +\beta_{11}X + \varepsilon_1 }[/math]

*β*_{11}is significant

### Step 2

- Regress the mediator on the independent variable to confirm that the independent variable is a significant predictor of the mediator. If the mediator is not associated with the independent variable, then it couldn’t possibly mediate anything.

- Independent variable [math]\displaystyle{ \to }[/math] mediator
- [math]\displaystyle{ Me=\beta_{20} +\beta_{21}X + \varepsilon_2 }[/math]

*β*_{21}is significant

### Step 3

- Regress the dependent variable on both the mediator and independent variable to confirm that a) the mediator is a significant predictor of the dependent variable, and b) the strength of the coefficient of the previously significant independent variable in Step #1 is now greatly reduced, if not rendered nonsignificant.

- [math]\displaystyle{ Y=\beta_{30} +\beta_{31}X +\beta_{32}Me + \varepsilon_3 }[/math]

*β*_{32}is significant*β*_{31}should be smaller in absolute value than the original effect for the independent variable (β_{11}above)

### Example

The following example, drawn from Howell (2009),^{[6]} explains each step of Baron and Kenny's requirements to understand further how a mediation effect is characterized. Step 1 and step 2 use simple regression analysis, whereas step 3 uses multiple regression analysis.

- How you were parented (i.e., independent variable) predicts how confident you feel about parenting your own children (i.e., dependent variable).
- How you were parented (i.e., independent variable) predicts your feelings of competence and self-esteem (i.e., mediator).
- Your feelings of competence and self-esteem (i.e., mediator) predict how confident you feel about parenting your own children (i.e., dependent variable), while controlling for how you were parented (i.e., independent variable).

Such findings would lead to the conclusion implying that your feelings of competence and self-esteem mediate the relationship between how you were parented and how confident you feel about parenting your own children.

If step 1 does not yield a significant result, one may still have grounds to move to step 2. Sometimes there is actually a significant relationship between independent and dependent variables but because of small sample sizes, or other extraneous factors, there could not be enough power to predict the effect that actually exists.^{[7]}

## Direct versus indirect effects

In the diagram shown above, the indirect effect is the product of path coefficients "A" and "B". The direct effect is the coefficient " C' ".
The direct effect measures the extent to which the dependent variable changes when the independent variable increases by one unit and the mediator variable remains unaltered. In contrast, the indirect effect measures the extent to which the dependent variable changes when the independent variable is held constant and the mediator variable changes by the amount it would have changed had the independent variable increased by one unit.^{[8]}^{[9]}

In linear systems, the total effect is equal to the sum of the direct and indirect (*C' + AB* in the model above). In nonlinear models, the total effect is not generally equal to the sum of the direct and indirect effects, but to a modified combination of the two.^{[9]}

## Full mediation versus partial mediation

A mediator variable can either account for all or some of the observed relationship between two variables.

### Full mediation

Maximum evidence for mediation, also called full mediation, would occur if inclusion of the mediation variable drops the relationship between the independent variable and dependent variable (see pathway c*′* in diagram above) to zero.

### Partial mediation

Partial mediation maintains that the mediating variable accounts for some, but not all, of the relationship between the independent variable and dependent variable. Partial mediation implies that there is not only a significant relationship between the mediator and the dependent variable, but also some direct relationship between the independent and dependent variable.

In order for either full or partial mediation to be established, the reduction in variance explained by the independent variable must be significant as determined by one of several tests, such as the Sobel test.^{[10]} The effect of an independent variable on the dependent variable can become nonsignificant when the mediator is introduced simply because a trivial amount of variance is explained (i.e., not true mediation). Thus, it is imperative to show a significant reduction in variance explained by the independent variable before asserting either full or partial mediation. It is possible to have statistically significant indirect effects in the absence of a total effect.^{[11]} This can be explained by the presence of several mediating paths that cancel each other out, and become noticeable when one of the cancelling mediators is controlled for. This implies that the terms 'partial' and 'full' mediation should always be interpreted relative to the set of variables that are present in the model. In all cases, the operation of "fixing a variable" must be distinguished from that of "controlling for a variable," which has been inappropriately used in the literature.^{[8]}^{[12]} The former stands for physically fixing, while the latter stands for conditioning on, adjusting for, or adding to the regression model. The two notions coincide only when all error terms (not shown in the diagram) are statistically uncorrelated. When errors are correlated, adjustments must be made to neutralize those correlations before embarking on mediation analysis (see Bayesian network).

## Sobel's test

Sobel's test^{[10]} is performed to determine if the relationship between the independent variable and dependent variable has been significantly reduced after inclusion of the mediator variable. In other words, this test assesses whether a mediation effect is significant. It examines the relationship between the independent variable and the dependent variable compared to the relationship between the independent variable and dependent variable including the mediation factor.

The Sobel test is more accurate than the Baron and Kenny steps explained above; however, it does have low statistical power. As such, large sample sizes are required in order to have sufficient power to detect significant effects. This is because the key assumption of Sobel's test is the assumption of normality. Because Sobel's test evaluates a given sample on the normal distribution, small sample sizes and skewness of the sampling distribution can be problematic (see Normal distribution for more details). Thus, the rule of thumb as suggested by MacKinnon et al., (2002) ^{[13]} is that a sample size of 1000 is required to detect a small effect, a sample size of 100 is sufficient in detecting a medium effect, and a sample size of 50 is required to detect a large effect.

The equation for Sobel is:^{[14]}

- [math]\displaystyle{ z= \frac{ab}{\sqrt{b^2s^2_a + a^2s^2_b}} }[/math]

## Preacher–Hayes bootstrap method

The bootstrapping method provides some advantages to the Sobel's test, primarily an increase in power. The Preacher and Hayes bootstrapping method is a non-parametric test and does not impose the assumption of normality. Therefore, if the raw data is available, the bootstrap method is recommended.^{[14]} Bootstrapping involves repeatedly randomly sampling observations with replacement from the data set to compute the desired statistic in each resample. Computing over hundreds, or thousands, of bootstrap resamples provide an approximation of the sampling distribution of the statistic of interest. The Preacher–Hayes method provides point estimates and confidence intervals by which one can assess the significance or nonsignificance of a mediation effect. Point estimates reveal the mean over the number of bootstrapped samples and if zero does not fall between the resulting confidence intervals of the bootstrapping method, one can confidently conclude that there is a significant mediation effect to report.

## Significance of mediation

As outlined above, there are a few different options one can choose from to evaluate a mediation model.

Bootstrapping^{[15]}^{[16]} is becoming the most popular method of testing mediation because it does not require the normality assumption to be met, and because it can be effectively utilized with smaller sample sizes (*N* < 25). However, mediation continues to be most frequently determined using the logic of Baron and Kenny^{[17]} or the Sobel test. It is becoming increasingly more difficult to publish tests of mediation based purely on the Baron and Kenny method or tests that make distributional assumptions such as the Sobel test. Thus, it is important to consider your options when choosing which test to conduct.^{[11]}

## Approaches to mediation

While the concept of mediation as defined within psychology is theoretically appealing, the methods used to study mediation empirically have been challenged by statisticians and epidemiologists^{[8]}^{[12]}^{[18]} and interpreted formally.^{[9]}

- Experimental-causal-chain design
- An experimental-causal-chain design is used when the proposed mediator is experimentally manipulated. Such a design implies that one manipulates some controlled third variable that they have reason to believe could be the underlying mechanism of a given relationship.
- Measurement-of-mediation design
- A measurement-of-mediation design can be conceptualized as a statistical approach. Such a design implies that one measures the proposed intervening variable and then uses statistical analyses to establish mediation. This approach does not involve manipulation of the hypothesized mediating variable, but only involves measurement.
^{[19]}

## Criticisms of mediation measurement

### Potentially unnecessary step

Hayes (2009) critiqued Baron and Kenny's mediation steps approach,^{[11]} and as of 2019, David A. Kenny on his website stated that mediation can exist in the absence of a 'significant' total effect (sometimes referred to as "inconsistent mediation"), and therefore step 1 of the original 1986 approach may not be needed. Later publications by Hayes questioned the concepts of full mediation and partial mediation, and advocated for the abandonment of these terms and of the steps in classical (1986) mediation.

### Importance of caution

Experimental approaches to mediation must be carried out with caution. First, it is important to have strong theoretical support for the exploratory investigation of a potential mediating variable.
A criticism of a mediation approach rests on the ability to manipulate and measure a mediating variable. Thus, one must be able to manipulate the proposed mediator in an acceptable and ethical fashion. As such, one must be able to measure the intervening process without interfering with the outcome. The mediator must also be able to establish construct validity of manipulation.
One of the most common criticisms of the measurement-of-mediation approach is that it is ultimately a correlational design. Consequently, it is possible that some other third variable, independent from the proposed mediator, could be responsible for the proposed effect. However, researchers have worked hard to provide counter-evidence to this disparagement. Specifically, the following counter-arguments have been put forward:^{[4]}

- Temporal precedence
- For example, if the independent variable precedes the dependent variable in time, this would provide evidence suggesting a directional, and potentially causal, link from the independent variable to the dependent variable.
- Nonspuriousness and/or no confounds
- For example, should one identify other third variables and prove that they do not alter the relationship between the independent variable and the dependent variable he/she would have a stronger argument for their mediation effect. See other 3rd variables below.

Mediation can be an extremely useful and powerful statistical test; however, it must be used properly. It is important that the measures used to assess the mediator and the dependent variable are theoretically distinct and that the independent variable and mediator cannot interact. Should there be an interaction between the independent variable and the mediator one would have grounds to investigate moderation.

## Other third variables

### Confounding

Another model that is often tested is one in which competing variables in the model are alternative potential mediators or an unmeasured cause of the dependent variable. An additional variable in a causal model may obscure or confound the relationship between the independent and dependent variables. Potential confounders are variables that may have a causal impact on both the independent variable and dependent variable. They include common sources of measurement error (as discussed above) as well as other influences shared by both the independent and dependent variables.

In experimental studies, there is a special concern about aspects of the experimental manipulation or setting that may account for study effects, rather than the motivating theoretical factor. Any of these problems may produce spurious relationships between the independent and dependent variables as measured. Ignoring a confounding variable may bias empirical estimates of the causal effect of the independent variable.

### Suppression

A suppressor variable increases the predictive validity of another variable when included in a regression equation. Suppression can occur when a single causal variable is related to an outcome variable through two separate mediator variables, and when one of those mediated effects is positive and one is negative. In such a case, each mediator variable suppresses or conceals the effect that is carried through the other mediator variable. For example, higher intelligence scores (a causal variable, *A*) may cause an increase in error detection (a mediator variable, *B*) which in turn may cause a decrease in errors made at work on an assembly line (an outcome variable, *X*); at the same time, intelligence could also cause an increase in boredom (*C*), which in turn may cause an *increase* in errors (*X*). Thus, in one causal path intelligence decreases errors, and in the other it increases them. When neither mediator is included in the analysis, intelligence appears to have no effect or a weak effect on errors. However, when boredom is controlled intelligence will appear to decrease errors, and when error detection is controlled intelligence will appear to increase errors. If intelligence could be increased while only boredom was held constant, errors would decrease; if intelligence could be increased while holding only error detection constant, errors would increase.

In general, the omission of suppressors or confounders will lead to either an underestimation or an overestimation of the effect of *A* on *X*, thereby either reducing or artificially inflating the magnitude of a relationship between two variables.

### Moderators

Other important third variables are moderators. Moderators are variables that can make the relationship between two variables either stronger or weaker. Such variables further characterize interactions in regression by affecting the direction and/or strength of the relationship between *X* and *Y*. A moderating relationship can be thought of as an interaction. It occurs when the relationship between variables A and B depends on the level of C. See moderation for further discussion.

## Moderated mediation

Mediation and moderation can co-occur in statistical models. It is possible to mediate moderation and moderate mediation.

Moderated mediation is when the effect of the treatment *A* on the mediator and/or the partial effect *B* on the dependent variable depend in turn on levels of another variable (moderator). Essentially, in moderated mediation, mediation is first established, and then one investigates if the mediation effect that describes the relationship between the independent variable and dependent variable is moderated by different levels of another variable (i.e., a moderator). This definition has been outlined by Muller, Judd, and Yzerbyt (2005)^{[20]} and Preacher, Rucker, and Hayes (2007).^{[21]}

### Models of moderated mediation

There are five possible models of moderated mediation, as illustrated in the diagrams below.^{[20]}

- In the first model the independent variable also moderates the relationship between the mediator and the dependent variable.
- The second possible model of moderated mediation involves a new variable which moderates the relationship between the independent variable and the mediator (the
*A*path). - The third model of moderated mediation involves a new moderator variable which moderates the relationship between the mediator and the dependent variable (the
*B*path). - Moderated mediation can also occur when one moderating variable affects both the relationship between the independent variable and the mediator (the
*A*path) and the relationship between the mediator and the dependent variable (the*B*path). - The fifth and final possible model of moderated mediation involves two new moderator variables, one moderating the
*A*path and the other moderating the*B*path.

In addition to the models mentioned above, a new variable can also exist which moderates the relationship between the independent variable and mediator (the A path) while at the same time have the new variable moderate the relationship between the independent variable and dependent variable (the C Path).^{[1]}

## Mediated moderation

Mediated moderation is a variant of both moderation and mediation. This is where there is initially overall moderation and the direct effect of the moderator variable on the outcome is mediated. The main difference between mediated moderation and moderated mediation is that for the former there is initial (overall) moderation and this effect is mediated and for the latter there is no moderation but the effect of either the treatment on the mediator (path *A*) is moderated or the effect of the mediator on the outcome (path *B*) is moderated.^{[20]}

In order to establish mediated moderation, one must first establish moderation, meaning that the direction and/or the strength of the relationship between the independent and dependent variables (path *C*) differs depending on the level of a third variable (the moderator variable). Researchers next look for the presence of mediated moderation when they have a theoretical reason to believe that there is a fourth variable that acts as the mechanism or process that causes the relationship between the independent variable and the moderator (path *A*) or between the moderator and the dependent variable (path *C*).

### Example

The following is a published example of mediated moderation in psychological research.^{[22]}
Participants were presented with an initial stimulus (a prime) that made them think of morality or made them think of might. They then participated in the Prisoner's Dilemma Game (PDG), in which participants pretend that they and their partner in crime have been arrested, and they must decide whether to remain loyal to their partner or to compete with their partner and cooperate with the authorities. The researchers found that prosocial individuals were affected by the morality and might primes, whereas proself individuals were not. Thus, social value orientation (proself vs. prosocial) moderated the relationship between the prime (independent variable: morality vs. might) and the behaviour chosen in the PDG (dependent variable: competitive vs. cooperative).

The researchers next looked for the presence of a mediated moderation effect. Regression analyses revealed that the type of prime (morality vs. might) mediated the moderating relationship of participants’ social value orientation on PDG behaviour. Prosocial participants who experienced the morality prime expected their partner to cooperate with them, so they chose to cooperate themselves. Prosocial participants who experienced the might prime expected their partner to compete with them, which made them more likely to compete with their partner and cooperate with the authorities. In contrast, participants with a pro-self social value orientation always acted competitively.

## Regression equations for moderated mediation and mediated moderation

Muller, Judd, and Yzerbyt (2005)^{[20]} outline three fundamental models that underlie both moderated mediation and mediated moderation. *Mo* represents the moderator variable(s), *Me* represents the mediator variable(s), and *ε _{i}* represents the measurement error of each regression equation.

### Step 1

Moderation of the relationship between the independent variable (X) and the dependent variable (Y), also called the overall treatment effect (path *C* in the diagram).

- [math]\displaystyle{ Y=\beta_{40} +\beta_{41}X +\beta_{42}Mo +\beta_{43}XMo + \varepsilon_4 }[/math]

- To establish overall moderation, the
*β*_{43}regression weight must be significant (first step for establishing mediated moderation). - Establishing moderated mediation requires that there be no moderation effect, so the
*β*_{43}regression weight must not be significant.

### Step 2

Moderation of the relationship between the independent variable and the mediator (path *A*).

- [math]\displaystyle{ Me=\beta_{50} +\beta_{51}X +\beta_{52}Mo +\beta_{53}XMo + \varepsilon_5 }[/math]

- If the
*β*_{53}regression weight is significant, the moderator affects the relationship between the independent variable and the mediator.

### Step 3

Moderation of both the relationship between the independent and dependent variables (path *A*) and the relationship between the mediator and the dependent variable (path *B*).

- [math]\displaystyle{ Y=\beta_{60} +\beta_{61}X +\beta_{62}Mo +\beta_{63}XMo +\beta_{64}Me +\beta_{65}MeMo + \varepsilon_6 }[/math]

- If both
*β*_{53}in step 2 and*β*_{63}in step 3 are significant, the moderator affects the relationship between the independent variable and the mediator (path*A*). - If both
*β*_{53}in step 2 and*β*in step 3 are significant, the moderator affects the relationship between the mediator and the dependent variable (path_{65}*B*). - Either or both of the conditions above may be true.

## Causal mediation analysis

### Fixing versus conditioning

Mediation analysis quantifies the extent to which a variable participates in the transmittance of change from a cause to its effect. It is inherently a causal notion, hence it cannot be defined in statistical terms. Traditionally, however, the bulk of mediation analysis has been conducted within the confines of linear regression, with statistical terminology masking the causal character of the relationships involved. This led to difficulties, biases, and limitations that have been alleviated by modern methods of causal analysis, based on causal diagrams and counterfactual logic.

The source of these difficulties lies in defining mediation in terms of changes induced by adding a third variables into a regression equation. Such statistical changes are epiphenomena which sometimes accompany mediation but, in general, fail to capture the causal relationships that mediation analysis aims to quantify.

The basic premise of the causal approach is that it is
not always appropriate to "control" for the mediator *M*
when we seek to estimate the direct effect of *X* on *Y*
(see the Figure above).
The classical rationale for "controlling" for *M*"
is that, if we succeed in preventing *M* from changing, then
whatever changes we measure in Y are attributable solely
to variations in *X* and we are justified then in proclaiming the
effect observed as "direct effect of *X* on *Y*." Unfortunately,
"controlling for *M*" does not physically prevent *M* from changing;
it merely narrows the analyst's attention to cases
of equal *M* values. Moreover, the language of probability
theory does not possess the notation to express the idea
of "preventing *M* from changing" or "physically holding *M* constant".
The only operator probability provides is "Conditioning"
which is what we do when we "control" for *M*,
or add *M* as a regressor in the equation for *Y*.
The result is that, instead of physically holding *M" constant *
(say at *M* = *m*) and comparing *Y* for units under *X* = 1' to those under
*X* = 0, we allow *M* to vary but ignore all units except those in
which *M* achieves the value *M* = *m*. These two operations are
fundamentally different, and yield different results,^{[23]}^{[24]} except in the case of no omitted variables. Improperly conditioning mediated effects can be a type of bad control.

To illustrate, assume that the error terms of *M* and *Y*
are correlated. Under such conditions, the
structural coefficient *B* and *A* (between *M* and *Y* and between *Y* and *X*)
can no longer be estimated by regressing *Y* on *X* and *M*.
In fact, the regression slopes may both be nonzero even when *C* is zero.^{[25]} This has two consequences. First, new strategies must be devised for estimating the structural coefficients *A, B* and *C*. Second, the basic definitions of direct and indirect effects must go beyond regression analysis, and should invoke an operation that mimics "fixing *M*", rather than "conditioning on *M*."

### Definitions

Such an operator, denoted do(*M* = *m*), was defined in Pearl (1994)^{[24]} and it operates by removing the equation of *M* and replacing it by a constant *m*. For example, if the basic mediation model consists of the equations:

- [math]\displaystyle{ X=f(\varepsilon_1),~~M=g(X,\varepsilon_2),~~Y=h(X,M,\varepsilon_3) , }[/math]

then after applying the operator do(*M* = *m*) the model becomes:

- [math]\displaystyle{ X=f(\varepsilon_1),~~M=m,~~Y=h(X,m,\varepsilon_3) }[/math]

and after applying the operator do(*X* = *x*) the model becomes:

- [math]\displaystyle{ X=x, M=g(x, \varepsilon_2), Y=h(x,M,\varepsilon_3) }[/math]

where the functions *f* and *g*, as well as the
distributions of the error terms ε_{1} and ε_{3} remain
unaltered. If we further rename the variables *M* and *Y* resulting from do(*X* = *x*)
as *M*(*x*) and *Y*(*x*), respectively, we obtain what
came to be known as "potential
outcomes"^{[26]} or "structural counterfactuals".^{[27]}
These new variables provide convenient notation
for defining direct and indirect effects. In particular,
four types of effects have been defined for the
transition from *X* = 0 to *X* = 1:

(a) Total effect –

- [math]\displaystyle{ TE = E [Y(1) - Y(0)] }[/math]

(b) Controlled direct effect -

- [math]\displaystyle{ CDE(m) = E [Y(1,m) - Y(0,m) ] }[/math]

(c) Natural direct effect -

- [math]\displaystyle{ NDE = E [Y(1,M(0)) - Y(0,M(0))] }[/math]

(d) Natural indirect effect

- [math]\displaystyle{ NIE = E [Y(0,M(1)) - Y(0,M(0))] }[/math]

Where *E*[ ] stands for expectation taken over the error terms.

These effects have the following interpretations:

*TE*measures the expected increase in the outcome*Y*as*X*changes from*X=0*to*X*=1*, while the mediator is allowed to track the change in*X*as dictated by the function*M = g(X, ε_{2})*.*- CDE measures the expected increase in the outcome
*Y*as*X*changes from*X*= 0 to*X*= 1, while the mediator is fixed at a pre-specified level*M = m*uniformly over the entire population *NDE*measures the expected increase in*Y*as*X*changes from*X*= 0 to*X*= 1, while setting the mediator variable to whatever value it*would have obtained*under*X*= 0, i.e., before the change.*NIE*measures the expected increase in*Y*when the*X*is held constant, at*X*= 1, and*M*changes to whatever value it would have attained (for each individual) under*X*= 1.- The difference
*TE-NDE*measures the extent to which mediation is*necessary*for explaining the effect, while the*NIE*measures the extent to which mediation is*sufficient*for sustaining it.

A controlled version of the indirect effect does not exist because there is no way of disabling the direct effect by fixing a variable to a constant.

According to these definitions the total effect can be decomposed as a sum

- [math]\displaystyle{ TE = NDE - NIE_r }[/math]

where *NIE _{r}* stands for the reverse transition, from

*X*= 1 to

*X*= 0; it becomes additive in linear systems, where reversal of transitions entails sign reversal.

The power of these definitions lies in their generality; they are applicable to models with arbitrary nonlinear interactions, arbitrary dependencies among the disturbances, and both continuous and categorical variables.

### The mediation formula

In linear analysis, all effects are determined by sums of products of structural coefficients, giving

- [math]\displaystyle{ \begin{align} TE & = C + AB \\ CDE(m) & = NDE = C, \text{ independent of } m\\ NIE & = AB. \end{align} }[/math]

Therefore, all effects are estimable whenever the model is identified. In non-linear systems, more stringent conditions are needed for estimating the direct and indirect effects.^{[9]}^{[28]}
^{[29]} For example, if no confounding exists,
(i.e., ε_{1}, ε_{2}, and ε_{3} are mutually independent) the following formulas can be derived:^{[9]}

- [math]\displaystyle{ \begin{align} TE & = E(Y\mid X=1)- E(Y\mid X=0)\\ CDE(m) & = E(Y\mid X=1, M=m) - E(Y\mid X=0, M=m) \\ NDE & = \sum_m [E(Y|X=1, M=m) - E(Y\mid X=0, M=m) ] P(M=m\mid X=0) \\ NIE & = \sum_m [P(M=m\mid X=1) - P(M=m\mid X=0)] E(Y\mid X=0, M=m). \end{align} }[/math]

The last two equations are called *Mediation Formulas*^{[30]}^{[31]}^{[32]} and have become the target of estimation in many studies of mediation.^{[28]}^{[29]}^{[31]}^{[32]} They give distribution-free expressions for direct and indirect effects and demonstrate that, despite the arbitrary nature of the error distributions and the functions *f*, *g*, and *h*, mediated effects can nevertheless be estimated from data using regression. The analyses of *moderated mediation* and *mediating moderators* fall as special cases of the causal mediation analysis, and the mediation formulas identify how various interactions coefficients contribute to the necessary and sufficient components of mediation.^{[29]}^{[30]}

### Example

Assume the model takes the form

- [math]\displaystyle{ \begin{align} X & = \varepsilon_1 \\ M & = b_0 + b_1X + \varepsilon_2 \\ Y & = c_0 + c_1X + c_2M + c_3XM + \varepsilon_3 \end{align} }[/math]

where the parameter [math]\displaystyle{ c_3 }[/math] quantifies the degree to which *M* modifies the effect of *X* on *Y*. Even when all parameters are estimated from data, it is still not obvious what combinations of parameters measure the direct and indirect effect of *X* on *Y*, or, more practically, how to assess the fraction of the total effect [math]\displaystyle{ TE }[/math] that is *explained* by mediation and the fraction of [math]\displaystyle{ TE }[/math] that is *owed* to mediation. In linear analysis, the former fraction is captured by the product [math]\displaystyle{ b_1 c_2 / TE }[/math], the latter by the difference [math]\displaystyle{ (TE - c_1)/TE }[/math], and the two quantities coincide. In the presence of interaction, however, each fraction demands a separate analysis, as dictated by the Mediation Formula, which yields:

- [math]\displaystyle{ \begin{align} NDE & = c_1 + b_0 c_3 \\ NIE & = b_1 c_2 \\ TE & = c_1 + b_0 c_3 + b_1(c_2 + c_3) \\ & = NDE + NIE + b_1 c_3. \end{align} }[/math]

Thus, the fraction of output response for which mediation would be *sufficient* is

- [math]\displaystyle{ \frac{NIE}{TE} = \frac{b_1 c_2}{c_1 + b_0 c_3 + b_1 (c_2 + c_3)}, }[/math]

while the fraction for which mediation would be *necessary* is

- [math]\displaystyle{ 1- \frac{NDE}{TE} = \frac{b_1 (c_2 +c_3)}{c_1 + b_0c_3 + b_1 (c_2 + c_3)}. }[/math]

These fractions involve non-obvious combinations of the model's parameters, and can be constructed mechanically with the help of the Mediation Formula. Significantly, due to interaction, a direct effect can be sustained even when the parameter [math]\displaystyle{ c_1 }[/math] vanishes and, moreover, a total effect can be sustained even when both the direct and indirect effects vanish. This illustrates that estimating parameters in isolation tells us little about the effect of mediation and, more generally, mediation and moderation are intertwined and cannot be assessed separately.

## References

- Notes

- ↑
^{1.0}^{1.1}"Types of Variables". http://www.indiana.edu/~educy520/sec5982/week_2/variable_types.pdf. - ↑ MacKinnon, D. P. (2008).
*Introduction to Statistical Mediation Analysis*. New York: Erlbaum. - ↑ VanderWeele, T.J. (2016). "Mediation analysis: a practitioner's guide".
*Annual Review of Public Health***37**: 17–32. doi:10.1146/annurev-publhealth-032315-021402. PMID 26653405. - ↑
^{4.0}^{4.1}Cohen, J.; Cohen, P.; West, S. G.; Aiken, L. S. (2003)*Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences*(3rd ed.). Mahwah, NJ: Erlbaum. - ↑ Baron, R. M.; Kenny, D. A. (1986). "The Moderator-Mediator Variable Distinction in Social Psychological Research : Conceptual, Strategic, and Statistical Considerations".
*Journal of Personality and Social Psychology***51**(6): 1173–1182. doi:10.1037/0022-3514.51.6.1173. PMID 3806354. - ↑ Howell, D. C. (2009).
*Statistical Methods for Psychology*(7th ed.). Belmot, CA: Cengage Learning. ISBN 978-0-495-59785-8. - ↑ Shrout, P. E.; Bolger, N. (2002). "Mediation in experimental and nonexperimental studies: New procedures and recommendations".
*Psychological Methods***7**(4): 422–445. doi:10.1037/1082-989x.7.4.422. PMID 12530702. - ↑
^{8.0}^{8.1}^{8.2}Robins, J. M.; Greenland, S. (1992). "Identifiability and exchangeability for direct and indirect effects".*Epidemiology***3**(2): 143–55. doi:10.1097/00001648-199203000-00013. PMID 1576220. - ↑
^{9.0}^{9.1}^{9.2}^{9.3}^{9.4}Pearl, J. (2001) "Direct and indirect effects". Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence, Morgan Kaufmann, 411–420. - ↑
^{10.0}^{10.1}Sobel, M. E. (1982). "Asymptotic confidence intervals for indirect effects in structural equation models".*Sociological Methodology***13**: 290–312. doi:10.2307/270723. - ↑
^{11.0}^{11.1}^{11.2}Hayes, A. F. (2009). "Beyond Baron and Kenny: Statistical mediation analysis in the new millennium".*Communication Monographs***76**(4): 408–420. doi:10.1080/03637750903310360. - ↑
^{12.0}^{12.1}Kaufman, J. S.; MacLehose, R. F.; Kaufman, S (2004). "A further critique of the analytic strategy of adjusting for covariates to identify biologic mediation".*Epidemiologic Perspectives & Innovations***1**(1): 4. doi:10.1186/1742-5573-1-4. PMID 15507130. - ↑ MacKinnon, D. P.; Lockwood, C. M.; Lockwood, J. M.; West, S. G.; Sheets, V. (2002). "A comparison of methods to test mediation and other intervening variable effects".
*Psychological Methods***7**(1): 83–104. doi:10.1037/1082-989x.7.1.83. PMID 11928892. - ↑
^{14.0}^{14.1}"Interactive Mediation Tests". http://quantpsy.org/sobel/sobel.htm. - ↑ "Testing of Mediation Models in SPSS and SAS". Comm.ohio-state.edu. http://www.comm.ohio-state.edu/ahayes/sobel.htm.
- ↑ "SPSS and SAS Macro for Bootstrapping Specific Indirect Effects in Multiple Mediation Models". Comm.ohio-state.edu. http://www.comm.ohio-state.edu/ahayes/SPSS%20programs/indirect.htm.
- ↑ "Mediation".
*davidakenny.net*. Retrieved April 25, 2012. - ↑ Bullock, J. G.; Green, D. P.; Ha, S. E. (2010). "Yes, but what's the mechanism? (don't expect an easy answer)".
*Journal of Personality and Social Psychology***98**(4): 550–8. doi:10.1037/a0018933. PMID 20307128. http://www2.psych.ubc.ca/~schaller/528Readings/BullockGreenHa2010.pdf. - ↑ Spencer, S. J.; Zanna, M. P.; Fong, G. T. (2005). "Establishing a causal chain: Why experiments are often more effective than mediational analyses in examining psychological processes".
*Journal of Personality and Social Psychology***89**(6): 845–51. doi:10.1037/0022-3514.89.6.845. PMID 16393019. http://www2.psych.ubc.ca/~schaller/528Readings/SpencerZannaFong2005.pdf. - ↑
^{20.0}^{20.1}^{20.2}^{20.3}Muller, D.; Judd, C. M.; Yzerbyt, V. Y. (2005). "When moderation is mediated and mediation is moderated".*Journal of Personality and Social Psychology***89**(6): 852–863. doi:10.1037/0022-3514.89.6.852. PMID 16393020. - ↑ Preacher, K. J., Rucker, D. D. & Hayes, A. F. (2007). Assessing moderated mediation hypotheses: Strategies, methods, and prescriptions. Multivariate Behavioral Research, 42, 185–227.
- ↑ Smeesters, D.; Warlop, L.; Avermaet, E. V.; Corneille, O.; Yzerbyt, V. (2003). "Do not prime hawks with doves: The interplay of construct activation and consistency of social value orientation on cooperative behavior".
*Journal of Personality and Social Psychology***84**(5): 972–987. doi:10.1037/0022-3514.84.5.972. PMID 12757142. - ↑ Robins, J.M.; Greenland, S. (1992). "Identifiability and exchangeability for direct and indirect effects".
*Epidemiology***3**(2): 143–155. doi:10.1097/00001648-199203000-00013. PMID 1576220. - ↑
^{24.0}^{24.1}Pearl, Judea (1994). Lopez de Mantaras, R.; Poole, D.. eds. "A probabilistic calculus of actions".*Uncertainty in Artificial Intelligence 10*(San Mateo, CA: Morgan Kaufmann)**1302**: 454–462. Bibcode: 2013arXiv1302.6835P. - ↑ Pearl, J (2014). "Interpretation and identification of causal mediation".
*Psychological Methods***19**(4): 459–81. doi:10.1037/a0036434. PMID 24885338. http://ftp.cs.ucla.edu/pub/stat_ser/r389-imai-etal-commentary-r421-reprint.pdf. - ↑ Rubin, D.B. (1974). "Estimating causal effects of treatments in randomized and nonrandomized studies".
*Journal of Educational Psychology***66**(5): 688–701. doi:10.1037/h0037350. - ↑ Balke, A.; Pearl, J. (1995). Besnard, P.; Hanks, S.. eds. "Counterfactuals and Policy Analysis in Structural Models".
*Uncertainty in Artificial Intelligence 11*(San Francisco, CA: Morgan Kaufmann)**1302**: 11–18. Bibcode: 2013arXiv1302.4929B. - ↑
^{28.0}^{28.1}Imai, K.; Keele, L.; Yamamoto, T. (2010). "Identification, inference, and sensitivity analysis for causal mediation effects".*Statistical Science***25**(1): 51–71. doi:10.1214/10-sts321. Bibcode: 2010arXiv1011.1079I. - ↑
^{29.0}^{29.1}^{29.2}VanderWeele, T.J. (2009). "Marginal structural models for the estimation of direct and indirect effects".*Epidemiology***20**(1): 18–26. doi:10.1097/ede.0b013e31818f69ce. PMID 19234398. - ↑
^{30.0}^{30.1}Pearl, Judea (2009). "Causal inference in statistics: An overview".*Statistics Surveys***3**: 96–146. doi:10.1214/09-ss057. http://ftp.cs.ucla.edu/pub/stat_ser/r350.pdf. - ↑
^{31.0}^{31.1}Vansteelandt, Stijn; Bekaert, Maarten; Lange, Theis (2012). "Imputation strategies for the estimation of natural direct and indirect effects".*Epidemiologic Methods***1**(1, Article 7). doi:10.1515/2161-962X.1014. - ↑
^{32.0}^{32.1}Albert, Jeffrey (2012). "Distribution-Free Mediation Analysis for Nonlinear Models with Confounding".*Epidemiology***23**(6): 879–888. doi:10.1097/ede.0b013e31826c2bb9. PMID 23007042.

- Bibliography

- Preacher, Kristopher J.; Hayes, Andrew F. (2004). "SPSS and SAS procedures for estimating indirect effects in simple mediation models".
*Behavior Research Methods, Instruments, and Computers***36**(4): 717–731. doi:10.3758/BF03206553. PMID 15641418. - Preacher, Kristopher J.; Hayes, Andrew F. (2008). "Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models".
*Behavior Research Methods***40**(3): 879–891. doi:10.3758/BRM.40.3.879. PMID 18697684. - Preacher, K. J.; Zyphur, M. J.; Zhang, Z. (2010). "A general multilevel SEM framework for assessing multilevel mediation".
*Psychological Methods***15**(3): 209–233. doi:10.1037/a0020141. PMID 20822249. - Baron, R. M. and Kenny, D. A. (1986) "The Moderator-Mediator Variable Distinction in Social Psychological Research – Conceptual, Strategic, and Statistical Considerations", Journal of Personality and Social Psychology, Vol. 51(6), pp. 1173–1182.
- Cohen, J. (1988).
*Statistical power analysis for the behavioral sciences*(2nd ed.). New York, NY: Academic Press. - Hayes, A. F. (2009). "Beyond Baron and Kenny: Statistical mediation analysis in the new millennium".
*Communication Monographs***76**(4): 408–420. doi:10.1080/03637750903310360. - Howell, D. C. (2009).
*Statistical methods for psychology*(7th ed.). Belmot, CA: Cengage Learning. - MacKinnon, D. P.; Lockwood, C. M. (2003). "Advances in statistical methods for substance abuse prevention research".
*Prevention Science***4**(3): 155–171. doi:10.1023/A:1024649822872. PMID 12940467. - Preacher, K. J.; Kelley, K. (2011). "Effect sizes measures for mediation models: Quantitative strategies for communicating indirect effects".
*Psychological Methods***16**(2): 93–115. doi:10.1037/a0022658. PMID 21500915. - Rucker, D.D., Preacher, K.J., Tormala, Z.L. & Petty, R.E. (2011). "Mediation analysis in social psychology: Current practices and new recommendations".
*Social and Personality Psychology Compass*, 5/6, 359–371. - Sobel, M. E. (1982). "Asymptotic confidence intervals for indirect effects in structural equation models".
*Sociological Methodology***13**: 290–312. doi:10.2307/270723. - Spencer, S. J.; Zanna, M. P.; Fong, G. T. (2005). "Establishing a causal chain: why experiments are often more effective than mediational analyses in examining psychological processes".
*Journal of Personality and Social Psychology***89**(6): 845–851. doi:10.1037/0022-3514.89.6.845. PMID 16393019. - Pearl, Judea (2012). "The Mediation Formula: A guide to the assessment of causal pathways in nonlinear models". in Berzuini, C.; Dawid, P.; Bernardinelli, L..
*Causality: Statistical Perspectives and Applications*. Chichester, UK: John Wiley and Sons, Ltd.. pp. 151–179. - Shaughnessy J.J., Zechmeister E. & Zechmeister J. (2006).
*Research Methods in Psychology*(7th ed., pp. 51–52). New York: McGraw Hill. - Tolman, E. C. (1938). "The Determiners of Behavior at a Choice Point".
*Psychological Review***45**: 1–41. doi:10.1037/h0062733. - Tolman, E. C.; Honzik, C. H. (1930). "Degrees of hunger, reward and nonreward, and maze learning in rats".
*University of California Publications in Psychology***4**: 241–275. - Vanderweele, Tyler J. (2015).
*Explanation in Causal Inference*.

## External links

- Summary of mediation methods at PsychWiki
- Example of Causal Mediation Using Propensity Scores The Methodology Center, Penn State University
- Book on moderation and mediation analysis, including an introduction to the PROCESS macro for SPSS and SAS Andrew F. Hayes, Ohio State University
- Online text of "The Determiner of Behavior at a Choice Point"
- Kenneth MacCorquodale and Paul E. Meehl (1948) ON A DISTINCTION BETWEEN HYPOTHETICAL CONSTRUCTS AND INTERVENING VARIABLES
*Classics in the History of Psychology*, retr. 22 Aug 2011.

Original source: https://en.wikipedia.org/wiki/Mediation (statistics).
Read more |