November 27, 2025
Every way of using correlation as evidence for causality makes assumptions
| Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
|---|---|---|---|---|---|
| Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
High | Low |
| Conditioning | Hold confounders constant |
Only confounders conditioned on |
1. Condition on all confounders 2. Low measurement error 3. Cases similar in \(W\) |
Low | High |
Yesterday
Back in 2017…
Did Trump’s rhetoric affect anti-Muslim hate crimes?
We can make Before and After comparisons: What is it?
before and after: examine the change in \(Y\) in a single case (or group of cases) where \(X\) changes over time (from before to after)
Taking the same data from Feinberg, Branton, and Martinez-Ebers…
Conditioning removes confounding by:
Design-based solutions (like Before and After) remove confounding by:
All confounding variables (affect whether a rally occurs; affect hate crimes) that are unchanging over time (before to) are held constant.
| Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
|---|---|---|---|---|---|
| Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
High | Low |
| Conditioning | Hold confounders constant |
Only confounders conditioned on |
1. Condition on all confounders 2. Low measurement error 3. Cases similar in \(W\) |
Low | High |
| Before and After | Hold confounders constant |
variables unchanging over time |
? | ? | ? |
Just like experiments and confounding, Before and After comparisons plug in for MISSING potential outcomes.
| County | Time | \(Y:\) HC(Yes) | \(Y:\) HC(No) | \(X:\) Rally |
|---|---|---|---|---|
| \(c\) | Before | \(\color{red}{\text{Hate Crimes}_{c,Before}[\text{Rally}]}\) | \(\color{black}{\text{Hate Crimes}_{c,Before}[\text{No Rally}]}\) | No |
| \(\Downarrow\) | ||||
| \(c\) | After | \(\color{black}{\text{Hate Crimes}_{c,After}[\text{Rally}]}\) | \(\color{red}{\text{Hate Crimes}_{c,After}[\text{No Rally}]}\) | Yes |
| County | Time | \(Y:\) HC(Yes) | \(Y:\) HC(No) | \(X:\) Rally |
|---|---|---|---|---|
| \(c\) | Before | \(\color{red}{\text{Hate Crimes}_{c,Before}[\text{Rally}]}\) | \(\color{black}{\text{Hate Crimes}_{c,Before}[\text{No Rally}]}\) | No |
| \(\Downarrow\) | ||||
| \(c\) | After | \(\color{black}{\text{Hate Crimes}_{c,After}[\text{Rally}]}\) | \(\boxed{\color{black}{\text{Hate Crimes}_{c,Before}[\text{No Rally}]}}\) | Yes |
We assume \(\color{red}{\text{Hate Crimes}_{c,After}[\text{No Rally}]} = \\ \color{black}{\text{Hate Crimes}_{c,Before}[\text{No Rally}]}\)
That is: if \(X\) had not changed, \(Y\) would not have changed.
Not quite as widely applicable as confounding… \(\to\) Higher, but not Highest external validity
Any \(W\) that affects \(Y\) and changes with \(X\) will produce confounding even if it does not cause \(X\).
Must assume there are no variables \(W\) that affect \(Y\) and change over time with \(X\).
Mostly flat trend; change when rallies occur
Mostly constant upward trend; no change when rallies occur
Over-time comparison, we can create confounding from variables that do not cause \(X\) to change, if they also change with \(X\) over time…
Does rally change measurement, but not actual number of hate crimes? (Measurement bias)
Are there are other changes over the same time-frame (change at the same time as \(X\), rallies)?
Looking at the 2017 Truck attack: by DAY
Looking at the 2017 Truck attack: by MONTH
FBI data shows no change, compared to Anti-Defamation League \(\to\) “effect” is changing measurement of hate crimes?
| Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
|---|---|---|---|---|---|
| Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
High | Low |
| Conditioning | Hold confounders constant |
Only confounders conditioned on |
1. Condition on all confounders 2. Low measurement error 3. Cases similar in \(W\) |
Low | High |
| Before and After | Hold confounders constant |
variables unchanging over time |
No causes of \(Y\) change w/ \(X\) |
Lower | Higher |

Webster et al (2014) investigate:
Did the repeal CAUSE a change in murders using guns?
Holds all unique, unchanging characteristics of Missouri constant…
But, we have to assume that there is nothing else about Missouri that
(or more technically, assume that \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} = \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\))
No long-term trends, no effects on measurement,
no changes in crimes \(\to\) PTP repeal
Does this plot make it easier/harder to believe PTP repeal caused more murders? (DISCUSS)
Could be that other things were changing between 2007-2008 that confound relationship between PTP and Murders?
| State | Time | Murder(Yes) | Murder(No) | Repeal |
|---|---|---|---|---|
| Missouri | Before | \(\color{red}{\text{Murders}_{MO,Before}[\text{Repeal}]}\) | \(\color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\) | No |
| \(\neq\not\Downarrow\) | ||||
| Missouri | After | \(\color{black}{\text{Murders}_{MO,After}[\text{Repeal}]}\) | \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]}\) | Yes |
It appears that \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} \neq \\ \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\)
Because other factors changing murders, regardless of repeal
What can we do to remove confounding from other variables that change over time, like…
We want to compare the actual trend in Missouri:
\(\begin{equation}\begin{split}\text{Trend}_{MO} ={} & \color{black}{\text{Murders}_{MO,After}[\text{Repeal}]} - \\ & \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\end{split}\end{equation}\)
against the counterfactual trend in Missouri:
\(\begin{equation}\begin{split}\color{red}{\text{CF Trend}_{MO}} ={} & \color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} - \\ & \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\end{split}\end{equation}\)
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \underbrace{\{\color{red}{\text{Murders}_{MO,After}(\text{No Repeal})} - \text{Murders}_{MO,Before}(\text{No Repeal})\}}_{\color{red}{\text{Missouri counterfactual trend}}}\end{split}\end{equation}}\)
Many possible counterfactual trends…
Which counterfactual is right?
Which counterfactual is right?
We can’t know the counterfactual trend in Missouri…
but we can observe the trends in other states that did not change their gun purchasing laws (no change in Gun Control, \(X\)).

Arkansas has a different history that Missouri, so there are differences that are unchanging between them.
But, if Arkansas experiences same regional economic, political, cultural, weather trends as Missouri, they likely share the same trends over time.
Then, we can plug in
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \{\underbrace{\text{Murders}_{AR,After}(\text{No Repeal}) - \text{Murders}_{AR,Before}(\text{No Repeal})\}}_{\text{Arkansas observed trend}}\end{split}\end{equation}}\)
Missouri/Arkansas different in 2007, but if Missouri had same trend (counterfactually) as Arkansas, what would we expect Murders to have done in 2008 w/out the repeal?
Missouri’s counterfactual trend is parallel to / same as Arkansas’s factual trend
With your neighbors, discuss: Do you believe this is evidence of causality? What confounding does this address? What confounding does it not address?
Like before and after, difference in differences comparisons are design based
What is it?
Why is it called difference in differences?
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \{\underbrace{\text{Murders}_{AR,After}(\text{No Repeal}) - \text{Murders}_{AR,Before}(\text{No Repeal})\}}_{\text{Arkansas observed trend}}\end{split}\end{equation}}\)
So:
| \(Murder_{Before}\) | \(Murder_{After}\) | First Difference | |
|---|---|---|---|
| \(\mathrm{Missouri}\) | \(4.6\) | \(6.2\) | \(1.6\) |
| \(\mathrm{Arkansas}\) | \(5.6\) | \(5.4\) | \(-0.2\) |
| Second Difference | \(1.8\) |
How does it work?
We don’t need to know/measure what these variables are!
All confounding variables (affect whether a PTP repealed; affect firearms homicides) that are unchanging over time are held constant
All confounding variables that change the similarly in “treated” and “untreated” case are held constant.
In order to infer \(X\) causes \(Y\) if \(X,Y\) correlated in difference-in-differences comparison…
Do you believe assumption of “parallel trends”? (Counterfactual Missouri trend same as factual Arkansas trend)
Perhaps there are some things that affect murder rates that change differently in these two states.
When is the assumption plausible?
| Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
|---|---|---|---|---|---|
| Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
Highest | Lowest |
| Conditioning | Hold confounders constant |
Only variables conditioned on |
see above | Lowest | Highest |
| Before and After | Hold confounders constant |
variables unchanging over time |
No causes of \(Y\) change w/ \(X\) |
Lower | Higher |
| Diff in Diff | Hold confounders constant |
unchanging and similarly changing |
Parallel trends | Higher | Lower |