An observational study where causal inference comes from the design that draws on randomization of treatment.
experiment: because it has randomization
natural: because it occurs without researcher intervention in real-world
solution to confounding
no complex estimands (hopefully!), simple and transparent statistical models
Piccardi et al use simple \(t\) tests (in a regression context) to compare differences in means
“The power of multiple regression analysis is that it allows us to do in non-experimental environments what natural scientists are able to do in a controlled laboratory setting: keep other factors fixed” (Wooldridge 2009: 77)
A matter of degree
Statistical evidence for causality combines observed data and a mathematical model of the world.
Causal evidence varies in terms of complexity of math/restrictiveness of assumptions: a matter of degree
Model-based inferences about causality involve many choices in complex statistical models with many difficult-to-assess assumptions
Design-based inferences about causality use carefully controlled comparisons with simple models and transparent assumptions
How do we know this is random?
“Standard natural experiments”: treatment \(D\) is randomized for some units
Instrumental Variables: some instrument \(Z\) is randomized and affects treatment \(D\)
Regression Discontinuity: treatment assignment of \(D\) takes place at cutoff \(c\), random close to \(c\)
What is it? Extends logic of \(CACE\) to
How?
Assumptions:
Issues:
Recent work by Lal et al (2024) finds two major issues:
Lal et al show we must:
all in ivDiag package.
If exclusion restriction is violated, weak instruments \(\to\) amplified bias
Classic example:
What is the effect of electing a criminal vs. non-criminal politician on provision of government handouts?
Classic example:
When criminal either won or came in second…
Sharp RD: Move from all untreated to all treated at the cutoff
\(\tau_{SRD} = E[Y_i(1) - Y_i(0) | X_i = c]\)
Fuzzy RD: Some units shift from untreated to treated at cutoff \(c\)
Key assumption:
continuity at cutoff:
Only need to estimate \(CEF\) at the cutoff, \(c\)…
How?
Choices to make:
Choosing polynomial order
Choosing kernel (not super consequential)
Choosing bandwidth (most consequential)
Standard Errors (consequential)
Check continuity
Check sorting/manipulation:
Logic: if \(D\) (or \(Z\)) is random, then independent of other causes of \(Y\). Should not observed differences in pre-treatment variables across \(D\)/\(Z\).
Nellis et al appendix:
But which covariates should you examine?
Variables plausibly linked to:
This requires theoretical knowledge and qualitative contextual knowledge about your cases
Dunning highlights importance of causal process observation of treatment assignment.
Key questions to ask are:
1.) which actors/processes are involved to assigning / receiving treatment?
For actors:
Paper Dialogue:
Things that seem arbitrary/beyond human control not necessarily random: really make the case for HOW the random assignment process would work (if it is ‘as-if’ random)