Presentation Slides - Kosuke Imai
Transcription
Presentation Slides - Kosuke Imai
Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University Talk at the Center for Lifespan Psychology Max Planck Institute for Human Development April 28, 2015 Joint work with Luke Keele Dustin Tingley Teppei Yamamoto Project References (click the article titles) General: Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies. American Political Science Review Theory: Identification, Inference, and Sensitivity Analysis for Causal Mediation Effects. Statistical Science Extensions: A General Approach to Causal Mediation Analysis. Psychological Methods Experimental Designs for Identifying Causal Mechanisms. Journal of the Royal Statistical Society, Series A (with discussions) Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments. Political Analysis Software: mediation: R Package for Causal Mediation Analysis. Journal of Statistical Software Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 2 / 49 Identification of Causal Mechanisms Causal inference is a central goal of scientific research Scientists care about causal mechanisms, not just about causal effects Randomized experiments often only determine whether the treatment causes changes in the outcome Not how and why the treatment affects the outcome Common criticism of experiments and statistics: black box view of causality Question: How can we learn about causal mechanisms from experimental and observational studies? Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 3 / 49 Overview of the Talk Present a general framework for statistical analysis and research design strategies to understand causal mechanisms 1 Show that the sequential ignorability assumption is required to identify mechanisms even in experiments 2 Offer a flexible estimation strategy under this assumption 3 Introduce a sensitivity analysis to probe this assumption 4 Illustrate how to use statistical software mediation 5 Consider research designs that relax sequential ignorability Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 4 / 49 Causal Mediation Analysis Graphical representation Mediator, M Treatment, T Outcome, Y Goal is to decompose total effect into direct and indirect effects Alternative approach: decompose the treatment into different components Causal mediation analysis as quantitative process tracing Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 5 / 49 Decomposition of Incumbency Advantage Incumbency effects: one of the most studied topics in American politics Consensus emerged in 1980s: incumbency advantage is positive and growing in magnitude New direction in 1990s: Where does incumbency advantage come from? Scare-off/quality effect (Cox and Katz): the ability of incumbents to deter high-quality challengers from entering the race Alternative causal mechanisms: name recognition, campaign spending, personal vote, television, etc. Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 6 / 49 Causal Mediation Analysis in Cox and Katz Quality of challenger, M Incumbency, T Electoral outcome, Y How much of incumbency advantage can be explained by scare-off/quality effect? How large is the mediation effect relative to the total effect? Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 7 / 49 Psychological Study of Media Effects Large literature on how media influences public opinion A media framing experiment of Brader et al.: 1 (White) Subjects read a mock news story about immigration: Treatment: Hispanic immigrant in the story Control: European immigrant in the story 2 Measure attitudinal and behavioral outcome variables: Opinions about increasing or decrease immigration Contact legislator about the issue Send anti-immigration message to legislator Why is group-based media framing effective?: role of emotion Hypothesis: Hispanic immigrant increases anxiety, leading to greater opposition to immigration The primary goal is to examine how, not whether, media framing shapes public opinion Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 8 / 49 Causal Mediation Analysis in Brader et al. Anxiety, M Media Cue, T Immigration Attitudes, Y Does the media framing shape public opinion by making people anxious? An alternative causal mechanism: change in beliefs Can we identify mediation effects from randomized experiments? Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 9 / 49 The Standard Estimation Method Linear models for mediator and outcome: Yi = α1 + β1 Ti + ξ1> Xi + 1i Mi = α2 + β2 Ti + ξ2> Xi + 2i Yi = α3 + β3 Ti + γMi + ξ3> Xi + 3i where Xi is a set of pre-treatment or control variables 1 2 3 4 Total effect (ATE) is β1 Direct effect is β3 Indirect or mediation effect is β2 γ Effect decomposition: β1 = β3 + β2 γ. Some motivating questions: 1 2 3 4 What should we do when we have interaction or nonlinear terms? What about other models such as logit? In general, under what conditions can we interpret β1 and β2 γ as causal effects? What do we really mean by causal mediation effect anyway? Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 10 / 49 Potential Outcomes Framework of Causal Inference Observed data: Binary treatment: Ti ∈ {0, 1} Mediator: Mi ∈ M Outcome: Yi ∈ Y Observed pre-treatment covariates: Xi ∈ X Potential outcomes model (Neyman, Rubin): Potential mediators: Mi (t) where Mi = Mi (Ti ) Potential outcomes: Yi (t, m) where Yi = Yi (Ti , Mi (Ti )) Total causal effect: τi ≡ Yi (1, Mi (1)) − Yi (0, Mi (0)) Fundamental problem of causal inference: only one potential outcome can be observed for each i Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 11 / 49 Back to the Examples Mi (1): 1 2 Quality of her challenger if politician i is an incumbent Level of anxiety individual i would report if he reads the story with Hispanic immigrant Yi (1, Mi (1)): 1 2 Election outcome that would result if politician i is an incumbent and faces a challenger whose quality is Mi (1) Immigration attitude individual i would report if he reads the story with Hispanic immigrant and reports the anxiety level Mi (1) Mi (0) and Yi (0, Mi (0)) are the converse Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 12 / 49 Causal Mediation Effects Causal mediation (Indirect) effects: δi (t) ≡ Yi (t, Mi (1)) − Yi (t, Mi (0)) Causal effect of the change in Mi on Yi that would be induced by treatment Change the mediator from Mi (0) to Mi (1) while holding the treatment constant at t Represents the mechanism through Mi Zero treatment effect on mediator =⇒ Zero mediation effect Examples: 1 2 Part of incumbency advantage that is due to the difference in challenger quality induced by incumbency status Difference in immigration attitudes that is due to the change in anxiety induced by the treatment news story Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 13 / 49 Total Effect = Indirect Effect + Direct Effect Direct effects: ζi (t) ≡ Yi (1, Mi (t)) − Yi (0, Mi (t)) Causal effect of Ti on Yi , holding mediator constant at its potential value that would realize when Ti = t Change the treatment from 0 to 1 while holding the mediator constant at Mi (t) Represents all mechanisms other than through Mi Total effect = mediation (indirect) effect + direct effect: τi = δi (t) + ζi (1 − t) = Kosuke Imai (Princeton) 1 {(δi (0) + ζi (0)) + (δi (1) + ζi (1))} 2 Causal Mechanisms Max Planck (April, 2015) 14 / 49 Mechanisms, Manipulations, and Interactions Mechanisms Indirect effects: δi (t) ≡ Yi (t, Mi (1)) − Yi (t, Mi (0)) Counterfactuals about treatment-induced mediator values Manipulations Controlled direct effects: ξi (t, m, m0 ) ≡ Yi (t, m) − Yi (t, m0 ) Causal effect of directly manipulating the mediator under Ti = t Interactions Interaction effects: ξ(1, m, m0 ) − ξ(0, m, m0 ) The extent to which controlled direct effects vary by the treatment Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 15 / 49 What Does the Observed Data Tell Us? Recall the Brader et al. experimental design: 1 2 randomize Ti measure Mi and then Yi Among observations with Ti = t, we observe Yi (t, Mi (t)) but not Yi (t, Mi (1 − t)) unless Mi (t) = Mi (1 − t) But we want to estimate δi (t) ≡ Yi (t, Mi (1)) − Yi (t, Mi (0)) For t = 1, we observe Yi (1, Mi (1)) but not Yi (1, Mi (0)) Similarly, for t = 0, we observe Yi (0, Mi (0)) but not Yi (0, Mi (1)) We have the identification problem =⇒ Need assumptions or better research designs Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 16 / 49 Counterfactuals in the Examples 1 Incumbency advantage: An incumbent (Ti = 1) faces a challenger with quality Mi (1) We observe the electoral outcome Yi = Yi (1, Mi (1)) We also want Yi (1, Mi (0)) where Mi (0) is the quality of challenger this incumbent politician would face if she is not an incumbent 2 Media framing effects: A subject viewed the news story with Hispanic immigrant (Ti = 1) For this person, Yi (1, Mi (1)) is the observed immigration opinion Yi (1, Mi (0)) is his immigration opinion in the counterfactual world where he still views the story with Hispanic immigrant but his anxiety is at the same level as if he viewed the control news story In both cases, we can’t observe Yi (1, Mi (0)) because Mi (0) is not realized when Ti = 1 Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 17 / 49 Sequential Ignorability Assumption Proposed identification assumption: Sequential Ignorability (SI) {Yi (t 0 , m), Mi (t)} ⊥ ⊥ Ti | Xi = x, (1) Yi (t 0 , m) ⊥ ⊥ Mi (t) | Ti = t, Xi = x (2) In words, 1 2 Ti is (as-if) randomized conditional on Xi = x Mi (t) is (as-if) randomized conditional on Xi = x and Ti = t Important limitations: 1 2 3 4 In a standard experiment, (1) holds but (2) may not Xi needs to include all confounders Xi must be pre-treatment confounders =⇒ post-treatment confounder is not allowed Randomizing Mi via manipulation is not the same as assuming Mi (t) is as-if randomized Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 18 / 49 Sequential Ignorability in the Standard Experiment Back to Brader et al.: Treatment is randomized =⇒ (1) is satisfied But (2) may not hold: 1 2 Pre-treatment confounder or Xi : state of residence those who live in AZ tend to have higher levels of perceived harm and be opposed to immigration Post-treatment confounder: alternative mechanism beliefs about the likely negative impact of immigration makes people anxious Pre-treatment confounders =⇒ measure and adjust for them Post-treatment confounders =⇒ adjusting is not sufficient Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 19 / 49 Nonparametric Identification Under SI, both ACME and average direct effects are nonparametrically identified (can be consistently estimated without modeling assumption) ¯ ACME δ(t) Z Z E(Yi | Mi , Ti = t, Xi ) {dP(Mi | Ti = 1, Xi ) − dP(Mi | Ti = 0, Xi )} dP(Xi ) ¯ Average direct effects ζ(t) Z Z {E(Yi | Mi , Ti = 1, Xi ) − E(Yi | Mi , Ti = 0, Xi )} dP(Mi | Ti = t, Xi ) dP(Xi ) Implies the general mediation formula under any statistical model Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 20 / 49 Traditional Estimation Methods: LSEM Linear structural equation model (LSEM): Mi = α2 + β2 Ti + ξ2> Xi + i2 , Yi = α3 + β3 Ti + γMi + ξ3> Xi + i3 . Fit two least squares regressions separately Use product of coefficients (βˆ2 γˆ ) to estimate ACME Use asymptotic variance to test significance (Sobel test) ¯ ¯ Under SI and the no-interaction assumption (δ(1) 6= δ(0)), βˆ2 γˆ consistently estimates ACME Can be extended to LSEM with interaction terms Problem: Only valid for the simplest LSEM Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 21 / 49 Popular Baron-Kenny Procedure The procedure: 1 2 3 Regress Y on T and show a significant relationship Regress M on T and show a significant relationship Regress Y on M and T , and show a significant relationship between Y and M The problems: 1 2 3 First step can lead to false negatives especially if indirect and direct effects in opposite directions The procedure only anticipates simplest linear models Don’t do star-gazing. Report quantities of interest Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 22 / 49 Proposed General Estimation Algorithm 1 Model outcome and mediator Outcome model: p(Yi | Ti , Mi , Xi ) Mediator model: p(Mi | Ti , Xi ) These models can be of any form (linear or nonlinear, semi- or nonparametric, with or without interactions) 2 Predict mediator for both treatment values (Mi (1), Mi (0)) 3 Predict outcome by first setting Ti = 1 and Mi = Mi (0), and then Ti = 1 and Mi = Mi (1) 4 Compute the average difference between two outcomes to obtain a consistent estimate of ACME 5 Monte-Carlo or bootstrap to estimate uncertainty Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 23 / 49 Example: Binary Mediator and Outcome Two logistic regression models: Pr(Mi = 1 | Ti , Xi ) = logit−1 (α2 + β2 Ti + ξ2> Xi ) Pr(Yi = 1 | Ti , Mi , Xi ) = logit−1 (α3 + β3 Ti + γMi + ξ3> Xi ) Can’t multiply β2 by γ Difference of coefficients β1 − β3 doesn’t work either Pr(Yi = 1 | Ti , Xi ) = logit−1 (α1 + β1 Ti + ξ1> Xi ) Can use our algorithm (example: E{Yi (1, Mi (0))}) 1 2 Predict Mi (0) given Ti = 0 using the first model b i (0), Xi ) using the Compute Pr(Yi (1, Mi (0)) = 1 | Ti = 1, Mi = M second model Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 24 / 49 Sensitivity Analysis Standard experiments require sequential ignorability to identify mechanisms The sequential ignorability assumption is often too strong Need to assess the robustness of findings via sensitivity analysis Question: How large a departure from the key assumption must occur for the conclusions to no longer hold? Parametric sensitivity analysis by assuming {Yi (t 0 , m), Mi (t)} ⊥ ⊥ Ti | Xi = x but not Yi (t 0 , m) ⊥ ⊥ Mi (t) | Ti = t, Xi = x Possible existence of unobserved pre-treatment confounder Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 25 / 49 Parametric Sensitivity Analysis Sensitivity parameter: ρ ≡ Corr(i2 , i3 ) Sequential ignorability implies ρ = 0 Set ρ to different values and see how ACME changes Result: β 2 σ1 ¯ ¯ δ(0) = δ(1) = σ2 q 2 2 ρ˜ − ρ (1 − ρ˜ )/(1 − ρ ) , where σj2 ≡ var(ij ) for j = 1, 2 and ρ˜ ≡ Corr(i1 , i2 ). When do my results go away completely? ¯ = 0 if and only if ρ = ρ˜ δ(t) Easy to estimate from the regression of Yi on Ti : Yi = α1 + β1 Ti + i1 Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 26 / 49 Interpreting Sensitivity Analysis with R squares Interpreting ρ: how small is too small? An unobserved (pre-treatment) confounder formulation: i2 = λ2 Ui + 0i2 and i3 = λ3 Ui + 0i3 How much does Ui have to explain for our results to go away? Sensitivity parameters: R squares 1 Proportion of previously unexplained variance explained by Ui 2∗ RM ≡ 1− 2 var(0i2 ) var(i2 ) and RY2∗ ≡ 1 − var(0i3 ) var(i3 ) Proportion of original variance explained by Ui 0 e 2 ≡ var(i2 ) − var(i2 ) R M var(Mi ) Kosuke Imai (Princeton) and Causal Mechanisms 0 e 2 ≡ var(i3 ) − var(i3 ) R Y var(Yi ) Max Planck (April, 2015) 27 / 49 2∗ , R 2∗ ) (or (R e2 , R e 2 )): Then reparameterize ρ using (RM Y M Y eY eM R sgn(λ2 λ3 )R ∗ ∗ ρ = sgn(λ2 λ3 )RM , RY = q 2 )(1 − R 2 ) (1 − RM Y 2 and R 2 are from the original mediator and outcome where RM Y models sgn(λ2 λ3 ) indicates the direction of the effects of Ui on Yi and Mi 2∗ , R 2∗ ) (or (R e2 , R e 2 )) to different values and see how Set (RM Y M Y mediation effects change Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 28 / 49 Reanalysis: Estimates under Sequential Ignorability Original method: Product of coefficients with the Sobel test — Valid only when both models are linear w/o T –M interaction (which they are not) Our method: Calculate ACME using our general algorithm Outcome variables Decrease Immigration ¯ δ(1) Support English Only Laws ¯ δ(1) Request Anti-Immigration Information ¯ δ(1) Send Anti-Immigration Message ¯ δ(1) Kosuke Imai (Princeton) Product of Coefficients Average Causal Mediation Effect (δ) .347 [0.146, 0.548] .204 [0.069, 0.339] .277 [0.084, 0.469] .276 [0.102, 0.450] .105 [0.048, 0.170] .074 [0.027, 0.132] .029 [0.007, 0.063] .086 [0.035, 0.144] Causal Mechanisms Max Planck (April, 2015) 29 / 49 0.4 0.2 0.0 −0.2 −0.4 Average Mediation Effect: δ(1) Reanalysis: Sensitivity Analysis w.r.t. ρ −1.0 −0.5 0.0 0.5 1.0 Sensitivity Parameter: ρ ACME > 0 as long as the error correlation is less than 0.39 (0.30 with 95% CI) Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 30 / 49 0.5 0.4 −0 .15 0.3 −0. 1 5 0.1 0.2 −0.0 0 0.0 Proportion of Total Variance in Y Explained by Confounder ˜ 2 and R ˜2 Reanalysis: Sensitivity Analysis w.r.t. R M Y 0.05 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Proportion of Total Variance in M Explained by Confounder 0.8 An unobserved confounder can account for up to 26.5% of the variation in both Yi and Mi before ACME becomes zero Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 31 / 49 Open-Source Software “Mediation” Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 32 / 49 Implementation Examples 1 Fit models for the mediator and outcome variable and store these models > m <- lm(Mediator ~ Treat + X) > y <- lm(Y ~ Treat + Mediator + X) 2 Mediation analysis: Feed model objects into the mediate() function. Call a summary of results > m.out<-mediate(m, y, treat = "Treat", mediator = "Mediator") > summary(m.out) 3 Sensitivity analysis: Feed the output into the medsens() function. Summarize and plot > > > > s.out <- medsens(m.out) summary(s.out) plot(s.out, "rho") plot(s.out, "R2") Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 33 / 49 Data Types Available via mediation Outcome Model Types Mediator Model Types Linear GLM Ordered Censored Quantile GAM Survival Linear (lm/lmer) X X X∗ X X X∗ X GLM (glm/bayesglm/glmer) X X X∗ X X X∗ X Ordered (polr/bayespolr) X X X∗ X X X∗ X - - - - - - - Quantile (rq) X∗ X∗ X∗ X∗ X∗ X∗ X GAM (gam) X∗ X∗ X∗ X∗ X∗ X∗ X∗ X X X∗ X X X∗ X Censored (tobit via vglm) Survival (survreg) Types of Models That Can be Handled by mediate. Stars (∗ ) indicate the model combinations that can only be estimated using the nonparametric bootstrap (i.e. with boot = TRUE). Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 34 / 49 Additional Features Treatment/mediator interactions, with formal statistical tests Treatment/mediator/pre-treatment interactions and reporting of quantities by pre-treatment values Factoral, continuous treatment variables Cluster standard errors/adjustable CI reporting/p-values Support for multiple imputation Multiple mediators Multilevel mediation (NEW!) Please read our vignette file here. Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 35 / 49 Causal Mediation Analysis in Stata Based on the same algorithm Hicks, R, Tingley D. 2011. Causal Mediation Analysis. Stata Journal. 11(4):609-615. ssc install mediation More limited coverage of models (just bc. of time though!) Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 36 / 49 Syntax: medeff medeff (equation 1) (equation 2) [if] [in] [[weight]] , [sims(integer) seed(integer) vce(vcetype) Level(#) interact(varname)] mediate(varname) treat(varname) Where “equation 1” or “equation 2” are of the form (For equation 1, the mediator equation): probit M T x or regress M T x Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 37 / 49 Beyond Sequential Ignorability Without sequential ignorability, standard experimental design lacks identification power Even the sign of ACME is not identified Need to develop alternative experimental designs for more credible inference Possible when the mediator can be directly or indirectly manipulated All proposed designs preserve the ability to estimate the ACME under the SI assumption Trade-off: statistical power These experimental designs can then be extended to natural experiments in observational studies Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 38 / 49 Parallel Design Must assume no direct effect of manipulation on outcome More informative than standard single experiment If we assume no T –M interaction, ACME is point identified Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 39 / 49 Why Do We Need No-Interaction Assumption? Numerical Example: Prop. 0.3 0.3 0.1 0.3 Mi (1) 1 0 0 1 Mi (0) 0 0 1 1 Yi (t, 1) 0 1 0 1 Yi (t, 0) 1 0 1 0 δi (t) −1 0 1 0 ¯ = −0.2 E(Mi (1) − Mi (0)) = E(Yi (t, 1) − Yi (t, 0)) = 0.2, but δ(t) The Problem: Causal effect heterogeneity T increases M only on average M increases Y only on average T − M interaction: Many of those who have a positive effect of T on M have a negative effect of M on Y (first row) A solution: sensitivity analysis (see Imai and Yamamoto, 2013) Pitfall of “mechanism experiments” or “causal chain approach” Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 40 / 49 Encouragement Design Direct manipulation of mediator is difficult in most situations Use an instrumental variable approach: Advantage: allows for unobserved confounder between M and Y Key Assumptions: 1 2 Z is randomized or as-if random No direct effect of Z on Y (a.k.a. exclusion restriction) Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 41 / 49 Example: Social Norm Experiment on Property Taxes Lucia Del Carpio. “Are Neighbors Cheating?” Treatment: informing average rate of compliance Outcome: compliance rate obtained from administrative records Large positive effect on compliance rate ≈ 20 percentage points Mediators: 1 2 3 social norm (not measured; direct effect) M1 : beliefs about compliance (measured) M2 : beliefs about enforcement (measured) Instruments: 1 2 Z1 : informing average rate of enforcement Z2 : payment-reminder Assumptions: 1 2 Z1 affects Y only through M1 and M2 Z2 affects Y only through M1 Results: Average direct effect is estimated to be large The author interprets this effect as the effect of social norm Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 42 / 49 Crossover Design Recall ACME can be identified if we observe Yi (t 0 , Mi (t)) Get Mi (t), then switch Ti to t 0 while holding Mi = Mi (t) Crossover design: 1 2 Round 1: Conduct a standard experiment Round 2: Change the treatment to the opposite status but fix the mediator to the value observed in the first round Very powerful – identifies mediation effects for each subject Must assume no carryover effect: Round 1 must not affect Round 2 Can be made plausible by design Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 43 / 49 Example: Labor Market Discrimination E XAMPLE Bertrand & Mullainathan (2004, AER) Treatment: Black vs. White names on CVs Mediator: Perceived qualifications of applicants Outcome: Callback from employers Quantity of interest: Direct effects of (perceived) race Would Jamal get a callback if his name were Greg but his qualifications stayed the same? Round 1: Send Jamal’s actual CV and record the outcome Round 2: Send his CV as Greg and record the outcome Assumption: their different names do not change the perceived qualifications of applicants Under this assumption, the direct effect can be interpreted as blunt racial discrimination Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 44 / 49 Designing Observational Studies Key difference between experimental and observational studies: treatment assignment Sequential ignorability: 1 2 Ignorability of treatment given covariates Ignorability of mediator given treatment and covariates Both (1) and (2) are suspect in observational studies Statistical control: matching, propensity scores, etc. Search for quasi-randomized treatments: “natural” experiments How can we design observational studies? Experiments can serve as templates for observational studies Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 45 / 49 Cross-over Design in Observational Studies E XAMPLE Back to incumbency advantage Use of cross-over design (Levitt and Wolfram) 1 2 1st Round: two non-incumbents in an open seat 2nd Round: same candidates with one being an incumbent Assume challenger quality (mediator) stays the same Estimation of direct effect is possible Redistricting as natural experiments (Ansolabehere et al.) 1 2 1st Round: incumbent in the old part of the district 2nd Round: incumbent in the new part of the district Challenger quality is the same but treatment is different Estimation of direct effect is possible Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 46 / 49 Multiple Mediators Quantity of interest = The average indirect effect with respect to M W represents the alternative observed mediators Left: Assumes independence between the two mechanisms Right: Allows M to be affected by the other mediators W Applied work often assumes the independence of mechanisms Under this independence assumption, one can apply the same analysis as in the single mediator case For causally dependent mediators, we must deal with the heterogeneity in the T × M interaction as done under the parallel design =⇒ sensitivity analysis Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 47 / 49 Concluding Remarks Even in a randomized experiment, a strong assumption is needed to identify causal mechanisms However, progress can be made toward this fundamental goal of scientific research with modern statistical tools A general, flexible estimation method is available once we assume sequential ignorability Sequential ignorability can be probed via sensitivity analysis More credible inferences are possible using clever experimental designs Insights from new experimental designs can be directly applied when designing observational studies Multiple mediators require additional care when they are causally dependent Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 48 / 49 The project website for papers and software: http://imai.princeton.edu/projects/mechanisms.html Email for questions and suggestions: kimai@princeton.edu Kosuke Imai (Princeton) Causal Mechanisms Max Planck (April, 2015) 49 / 49