Introduction to Mediation Analysis

This post intends to introduce the basics of mediation analysis and does not explain statistical details. For details, please refer to the articles at the end of this post.

What is mediation?

Let’s say previous studies have suggested that higher grades predict higher happiness: X (grades) → Y (happiness). (This research example is made up for illustration purposes. Please don’t consider it a scientific statement.)

I think, however, grades are not the real reason that happiness increases. I hypothesize that good grades boost one’s self-esteem and then high self-esteem boosts one’s happiness: X (grades) → M (self-esteem) → Y (happiness).

This is a typical case of mediation analysis. Self-esteem is a mediator that explains the underlying mechanism of the relationship between grades (IV) and happiness (DV).

How to analyze mediation analysis effects?

Before we start, please keep in mind that, as any other regression analysis, mediation analysis does not imply causal relationships unless it is based on experimental design.

To analyze mediation:
1. Follow Baron & Kenny’s steps
2. Use either the Sobel test or bootstrapping for significance testing.

The following shows the basic steps for mediation analysis suggested by Baron & Kenny (1986). A mediation analysis is comprised of three sets of regression: X → Y, X → M, and X + M → Y. This post will show examples using R, but you can use any statistical software. They are just three regression analyses!

# Download data online. This is a simulated dataset for this post.
myData <- read.csv('http://static.lib.virginia.edu/statlab/materials/data/mediationData.csv')

Step 1.

Y = b 0 + b 1 X + e

Is $b_{1}$ significant? We want X to affect Y. If there is no relationship between X and Y, there is nothing to mediate.

Although this is what Baron and Kenny originally suggested, this step is controversial. Even if we don’t find a significant association between X and Y, we could move forward to the next step if we have a good theoretical background about their relationship. See Shrout & Bolger (2002) for details.

model.0 <- lm(Y ~ X, myData)
summary(model.0)
# Coefficients:
#             Estimate Std. Error t value Pr(>|t|)    
# (Intercept)   2.8572     0.6932   4.122 7.88e-05 ***
# X             0.3961     0.1112   3.564 0.000567 ***

### b1 = 0.3961, p < .001  # significant!

Step 2.

M = b 0 + b 2 X + e

Is $b_{2}$ significant? We want X to affect M. If X and M have no relationship, M is just a third variable that may or may not be associated with Y. A mediation makes sense only if X affects M.

model.M <- lm(M ~ X, myData)
summary(model.M)
# Coefficients:
#             Estimate Std. Error t value Pr(>|t|)    
# (Intercept)  1.49952    0.58920   2.545   0.0125 *  
# X            0.56102    0.09448   5.938 4.39e-08 ***

### b2 = 0.5610, p < .001  # significant!

Step 3.

Y = b 0 + b 4 X + b 3 M + e

Is $b_{4}$ non-significant or smaller than before? We want M to affect Y, but X to no longer affect Y (or X to still affect Y but in a smaller magnitude). If a mediation effect exists, the effect of X on Y will disappear (or at least weaken) when M is included in the regression. The effect of X on Y goes through M.

model.Y <- lm(Y ~ X + M, myData)
summary(model.Y)
# Coefficients:
#             Estimate Std. Error t value Pr(>|t|)    
# (Intercept)   1.9043     0.6055   3.145   0.0022 ** 
# X             0.0396     0.1096   0.361   0.7187    
# M             0.6355     0.1005   6.321 7.92e-09 ***

### b4 = 0.0396, p = 0.719   # the effect of X on Y disappeared!
### b3 = 0.6355, p < 0.001

library(mediation) results <- mediate(model.M, model.Y, treat='X', mediator='M', boot=TRUE, sims=500) summary(results) # Estimate 95% CI Lower 95% CI Upper p-value # ACME 0.3565 0.2155 0.5291 0.00 # ADE 0.0396 -0.1761 0.2598 0.66 # Total Effect 0.3961 0.1563 0.5794 0.00 # Prop. Mediated 0.9000 0.5254 1.8820 0.00 ### ACME = 0.3565, 95% CI [0.2155, 0.5291] # significant! ### ACME stands for Average Causal Mediation Effects ### ADE stands for Average Direct Effects ### Total Effect is a sum of a mediation (indirect) effect and a direct effect

Introduction to Mediation Analysis

Introduction to Mediation Analysis

Get Help with Data Analysis, Research, Thesis, Dissertation and Assignments.

Data Analytics Services

Need Our Services?

Econometrics & Statistics Modelling Services

Stuck with Your Research or Data Analysis Project?

Let Our Experts Help You:

Whatsapp Us:

Email Us:

We Make Sense out of your Data

CONTACT US

NAVIGATION

PRIVACY & TOS

Introduction to Mediation Analysis

Get Help with Data Analysis, Research, Thesis, Dissertation and Assignments.

Data Analytics Services

Need Our Services?

Econometrics & Statistics Modelling Services

Stuck with Your Research or Data Analysis Project?Let Our Experts Help You:

Whatsapp Us:

Email Us:

We Make Sense out of your Data

CONTACT US

NAVIGATION

PRIVACY & TOS

Stuck with Your Research or Data Analysis Project?

Let Our Experts Help You: