Main effect

From HandWiki

In the design of experiments and analysis of variance, a main effect is the effect of an independent variable on a dependent variable averaged across the levels of any other independent variables. The term is frequently used in the context of factorial designs and regression models to distinguish main effects from interaction effects. Relative to a factorial design, under an analysis of variance, a main effect test will test the hypotheses expected such as H0, the null hypothesis. Running a hypothesis for a main effect will test whether there is evidence of an effect of different treatments. However, a main effect test is nonspecific and will not allow for a localization of specific mean pairwise comparisons (simple effects). A main effect test will merely look at whether overall there is something about a particular factor that is making a difference. In other words, it is a test examining differences amongst the levels of a single factor (averaging over the other factor and/or factors). Main effects are essentially the overall effect of a factor.

Definition

A factor averaged over all other levels of the effects of other factors is termed as main effect (also known as marginal effect). The contrast of a factor between levels over all levels of other factors is the main effect. The difference between the marginal means of all the levels of a factor is the main effect of the response variable on that factor .[1] Main effects are the primary independent variables or factors tested in the experiment.[2] Main effect is the specific effect of a factor or independent variable regardless of other parameters in the experiment.[3] In design of experiment, it is referred to as a factor but in regression analysis it is referred to as the independent variable.

Estimating Main Effects

In factorial designs, thus two levels each of factor A and B in a factorial design, the main effects of two factors say A and B be can be calculated. The main effect of A is given by

[math]\displaystyle{ A={1 \over 2n}[ab+a-b-1] }[/math]

The main effect of B is given by

[math]\displaystyle{ B={1 \over 2n}[ab+b-a-1] }[/math]

Where n is total number of replicates. We use factor level 1 to denote the low level, and level 2 to denote the high level. The letter "a" represent the factor combination of level 2 of A and level 1 of B and "b" represents the factor combination of level 1 of A and level 2 of B. "ab" is the represents both factors at level 2. Finally, 1 represents when both factors are set to level 1. [2]

Hypothesis Testing for Two Way Factorial Design.

Consider a two-way factorial design in which factor A has 3 levels and factor B has 2 levels with only 1 replicate. There are 6 treatments with 5 degrees of freedom. in this example, we have two null hypotheses. The first for Factor A is: [math]\displaystyle{ H_0 : \alpha_1=\alpha_2=\alpha_3=0 }[/math] and the second for Factor B is: [math]\displaystyle{ H_0 : \beta_1=\beta_2=0 }[/math].[4] The main effect for factor A can be computed with 2 degrees of freedom. This variation is summarized by the sum of squares denoted by the term SSA. Likewise the variation from factor B can be computed as SSB with 1 degree of freedom. The expected value for the mean of the responses in column i is [math]\displaystyle{ \mu + \beta_j }[/math]while the expected value for the mean of the responses in row j is [math]\displaystyle{ \mu + \alpha_i }[/math]where i corresponds to the level of factor in factor A and j corresponds to the level of factor in factor B. [math]\displaystyle{ \alpha_i }[/math]and [math]\displaystyle{ \beta_j }[/math]are main effects. SSA and SSB are main-effects sums of squares. The two remaining degrees of freedom can be used to describe the variation that comes from the interaction between the two factors and can be denoted as SSAB.[4] A table can show the layout of this particular design with the main effects (where [math]\displaystyle{ x_{ij} }[/math] is the observation of the ith level of factor B and the jth level of factor A):

3x2 Factorial Experiment
Factor/Levels [math]\displaystyle{ \alpha_1 }[/math] [math]\displaystyle{ \alpha_2 }[/math] [math]\displaystyle{ \alpha_3 }[/math]
[math]\displaystyle{ \beta_1 }[/math] [math]\displaystyle{ x_{11} }[/math] [math]\displaystyle{ x_{12} }[/math] [math]\displaystyle{ x_{13} }[/math]
[math]\displaystyle{ \beta_2 }[/math] [math]\displaystyle{ x_{21} }[/math] [math]\displaystyle{ x_{22} }[/math] [math]\displaystyle{ x_{23} }[/math]

Example

Take a [math]\displaystyle{ 2^2 }[/math]factorial design (2 levels of two factors) testing the taste ranking of fried chicken at two fast food restaurants. Let taste testers rank the chicken from 1 to 10 (best tasting), for factor X: "spiciness" and factor Y: "crispiness." Level X1 is for "not spicy" chicken and X2 is for "spicy" chicken. Level Y1 is for "not crispy" and level Y2 is for "crispy" chicken. Suppose that five people (5 replicates) tasted all four kinds of chicken and gave a ranking of 1-10 for each. The hypotheses of interest would be: Factor X is: [math]\displaystyle{ H_0 : X_1=X_2=0 }[/math] and for Factor Y is: [math]\displaystyle{ H_0 : Y_1=Y_2=0 }[/math]. The table of hypothetical results is given here:

(Replicates)
Factor Combination I II III IV V Total
Not Spicy, Not Crispy (X1,Y1) 3 2 6 1 9 21
Not Spicy, Crispy (X1, Y2) 7 2 4 2 8 23
Spicy, Not Crispy (X2, Y1) 5 5 6 1 8 25
Spicy, Crispy (X2, Y2) 9 10 8 6 8 41

The "Main Effect" of X (spiciness) when we are at Y1 (not crunchy) is given as:

[math]\displaystyle{ \frac{[X_2 Y_1]-[X_1 Y_1]}{n} }[/math] where n is the number of replicates. Likewise, the "Main Effect" of X at Y2 (crunchy) is given as:

[math]\displaystyle{ \frac{[X_2 Y_2]-[X_1 Y_2]}{n} }[/math], upon which we can take the simple average of these two to determine the overall main effect of the Factor X, which results as the above

formula, written here as:

[math]\displaystyle{ A=X={1 \over 2n}[ab+a-b-1] }[/math] = [math]\displaystyle{ \frac{[X_2 Y_2]+[X_2 Y_1] - [X_1 Y_2]-[X_1 Y_1]}{2n} }[/math]

Likewise, for Y, the overall main effect will be:[5]

[math]\displaystyle{ B=Y={1 \over 2n}[ab+b-a-1] }[/math]= [math]\displaystyle{ \frac{[X_2 Y_2]+[X_1 Y_2]-[X_2 Y_1] - [X_1 Y_1]}{2n} }[/math]

For the Chicken tasting experiment, we would have the resulting main effects:

[math]\displaystyle{ X: \frac{[25]-[21]+[41] - [23]}{2*5}= 2.2 }[/math]

[math]\displaystyle{ Y: \frac{[41]-[25]+[23] - [21]}{2*5} = 1.8 }[/math]

References

  • McBurney, D.M., White, T.L. (2004). Research Methods. CA: Wadsworth Learning.
  • Mook, Douglas G. (2001). Psychological Research: The Ideas Behind the Methods. NY: W. W. Norton & Company.
  1. Kuehl, Robert (1999). Design of Experiment: Statistical Principles of Research Design and Analysis.. Cengage Learning. pp. 178. ISBN 9780534368340. 
  2. 2.0 2.1 Montgomery, Douglas C. (1976). Design and Analysis of Experiments. Wiley, 1976. pp. 180. ISBN 9780471614210. 
  3. kotz, johnson (2005). encyclopedia of statistical sciences. pp. 181. ISBN 978-0-471-15044-2. 
  4. 4.0 4.1 Oehlert, Gary (2010). A First Course in Design and Analysis of Experiments. pp. 181. ISBN 978-0-7167-3510-6. 
  5. Montgomery, Douglas (2005). DESIGN AND ANALYSIS OF EXPERIMENTS. 6th: Wiley and Sons. pp. 205–206.