December 4, 2024
Every way of using correlation as evidence for causality makes assumptions
Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
---|---|---|---|---|---|
Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
Highest | Lowest |
Conditioning | Hold confounders constant |
Only variables conditioned on |
1. Condition on all confounders 2. Low measurement error 3. Cases similar in \(W\) |
Lowest | Highest |
Before and After | Hold confounders constant |
variables unchanging over time |
No causes of \(Y\) change w/ \(X\) |
Lower | Higher |
Webster et al (2014) investigate:
Did the repeal CAUSE a change in murders using guns?
Holds all unique, unchanging characteristics of Missouri constant…
But, we have to assume that there is nothing else about Missouri that
(or more technically, assume that \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} = \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\))
No long-term trends, no effects on measurement, no changes in crimes \(\to\) PTP repeal
Does this plot make it easier/harder to believe PTP repeal caused more murders? (DISCUSS)
Could be that other things were changing between 2007-2008 that confound relationship between PTP and Murders?
State | Time | Murder(Yes) | Murder(No) | Repeal |
---|---|---|---|---|
Missouri | Before | \(\color{red}{\text{Murders}_{MO,Before}[\text{Repeal}]}\) | \(\color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\) | No |
\(\neq\not\Downarrow\) | ||||
Missouri | After | \(\color{black}{\text{Murders}_{MO,After}[\text{Repeal}]}\) | \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]}\) | Yes |
It appears that \(\color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} \neq \\ \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\)
Because other factors changing murders, regardless of repeal
What can we do to remove confounding from other variables that change over time, like…
We want to compare the actual trend in Missouri:
\(\begin{equation}\begin{split}\text{Trend}_{MO} ={} & \color{black}{\text{Murders}_{MO,After}[\text{Repeal}]} - \\ & \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\end{split}\end{equation}\)
against the counterfactual trend in Missouri:
\(\begin{equation}\begin{split}\color{red}{\text{CF Trend}_{MO}} ={} & \color{red}{\text{Murders}_{MO,After}[\text{No Repeal}]} - \\ & \color{black}{\text{Murders}_{MO,Before}[\text{No Repeal}]}\end{split}\end{equation}\)
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \underbrace{\{\color{red}{\text{Murders}_{MO,After}(\text{No Repeal})} - \text{Murders}_{MO,Before}(\text{No Repeal})\}}_{\color{red}{\text{Missouri counterfactual trend}}}\end{split}\end{equation}}\)
Many possible counterfactual trends…
Which counterfactual is right?
Which counterfactual is right?
We can’t know the counterfactual trend in Missouri…
but we can observe the trends in other states that did not change their gun purchasing laws (no change in Gun Control, \(X\)).
Arkansas has a different history that Missouri, so there are differences that are unchanging between them.
But, if Arkansas experiences same regional economic, political, cultural, weather trends as Missouri, they likely share the same trends over time.
Then, we can plug in
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \{\underbrace{\text{Murders}_{AR,After}(\text{No Repeal}) - \text{Murders}_{AR,Before}(\text{No Repeal})\}}_{\text{Arkansas observed trend}}\end{split}\end{equation}}\)
Missouri/Arkansas different in 2007, but if Missouri had same trend (counterfactually) as Arkansas, what would we expect Murders to have done in 2008 w/out the repeal?
Missouri’s counterfactual trend is parallel to / same as Arkansas’s factual trend
With your neighbors, discuss: Do you believe this is evidence of causality? What confounding does this address? What confounding does it not address?
Like before and after, difference in differences comparisons are design based:
By comparing changes over time in “treated” (\(X\) changes) and “untreated” (\(X\) does not change) cases:
Regardless of whether we have thought of those variables, whether we can measure those variables.
What is it?
How does it work?
Why is it called difference in differences?
\(\small{\begin{equation}\begin{split} = {} & \overbrace{\{\text{Murders}_{MO,After}(\text{Repeal}) - \text{Murders}_{MO,Before}(\text{No Repeal})\}}^{\text{Missouri observed trend}} - \\ & \{\underbrace{\text{Murders}_{AR,After}(\text{No Repeal}) - \text{Murders}_{AR,Before}(\text{No Repeal})\}}_{\text{Arkansas observed trend}}\end{split}\end{equation}}\)
So:
\(Murder_{Before}\) | \(Murder_{After}\) | First Difference | |
---|---|---|---|
\(\mathrm{Missouri}\) | \(4.6\) | \(6.2\) | \(1.6\) |
\(\mathrm{Arkansas}\) | \(5.6\) | \(5.4\) | \(-0.2\) |
Second Difference | \(1.8\) |
All confounding variables (affect whether a PTP repealed; affect firearms homicides) that are unchanging over time are held constant
All confounding variables that change the similarly in “treated” and “untreated” case are held constant.
In order to infer \(X\) causes \(Y\) if \(X,Y\) correlated in difference-in-differences comparison…
Do you believe assumption of “parallel trends”? (Counterfactual Missouri trend same as factual Arkansas trend)
Perhaps there are some things that affect murder rates that change differently in these two states.
When is the assumption plausible?
Solution | How Bias Solved |
Which Bias Removed |
Assumes | Internal Validity |
External Validity |
---|---|---|---|---|---|
Experiment | Randomization Breaks \(W \rightarrow X\) link |
All confounding variables | 1. \(X\) is random 2. Change only \(X\) |
Highest | Lowest |
Conditioning | Hold confounders constant |
Only variables conditioned on |
see above | Lowest | Highest |
Before and After | Hold confounders constant |
variables unchanging over time |
No causes of \(Y\) change w/ \(X\) |
Lower | Higher |
Diff in Diff | Hold confounders constant |
unchanging and similarly changing |
Parallel trends | Higher | Lower |
Acemoglu and Restrepo (2022) investigate:
Has automation of work helped or hurt workers?
Data:
“Cases”:
What might be some confounding variables if we just compared wages for workers with exposure/no exposure to automation?
What might be some confounding variables if we just compared wages in the US before and after rise of automation?
Rather than looking at wages in industries with more automation, or change in wages in the US over time, use a difference in differences:
They compare:
Change in real wages for demographic groups with high exposure to automation between 1980 and 2016 (change in \(Y\) for group where \(X\) changes)
Change in real wages for demographic groups with low/no exposure to automation between 1980 and 2016 (change in \(Y\) for group where \(X\) does not change)
Assume that counterfactual trend in wages for workers exposed to automation SAME as factual trend in wages for workers not exposed to automation.
For groups with greater increase in automation exposure, greater decline in wages
Correlation suggestions Automation \(\xrightarrow{causes}\) declining wages
For this to be the causal effect of automation, need to believe that wages for workers exposed / not exposed to automation would have been similar without automation…
No differences in wage trends before automation.
It still could be that other things that affect wages changed differently for workers exposed to automation than for those who were not.
“capital takes what it will in the absence of constraints and technology is a tool that can be used for good or for ill… Yes, [during the Industrial Revolution of the 19th Century] you got progress, but you also had costs that were huge and very long-lasting. A hundred years of much harsher conditions for working people, lower real wages, much worse health and living conditions, less autonomy, greater hierarchy. And the reason that we came out of it wasn’t some law of economics, but rather a grass roots social struggle in which unions, more progressive politics and, ultimately, better institutions played a key role — and a redirection of technological change away from pure automation also contributed importantly.”
So… Luddites?