November 18, 2024
Midterm Grades
Recap:
Recognizing Confounding
“BC’s face-mask mandate reduced COVID mortality in the province over the course of the pandemic.”
Karaivanov et al (2020), economists at SFU, investigate:
Did indoor mask mandates reduced COVID cases, on average? If masks reduce COVID cases, could reduce mortality.
They compare COVID cases in Ontario Public Health Units (PHU) with and without mask mandates
They find that PHUs with mask mandates have slower COVID case growth than PHUs without mask mandates…
Correlation suffers from two sources of error:
random error: we observe patterns in \(X\) (independent variable) and \(Y\) (dependent variable) by chance, when there is in fact no relationship.
confounding (bias): the systematic observed pattern between \(X\) and \(Y\) is not the true causal relationship between \(X\) and \(Y\).
confounding is when there is a systematic observed correlation between \(X\) and \(Y\) that does NOT reflect the true causal effect of \(X\) on \(Y\).
It’s summer of 2020… the correlation Karaivanov et al use effectively compares…
COVID caseload in PHUs in Toronto area (with mask mandates)…
… with COVID caseload in PHUs like North Bay (without a mask mandate)
Correlation means we “plug in” missing counterfactuals from other cases:
Case | \(\overbrace{\text{Caseload (Mandate)}}^{Y}\) | \(\overbrace{\text{Caseload (No Mandate)}}^{Y}\) | \(\overbrace{\text{Mandate}}^{X}\) |
---|---|---|---|
Toronto | \(\mathrm{Cases_{Toronto}(Mandate)}\) | \(\color{red}{\mathrm{Cases_{Toronto}(No \ Mandate)}}\) | Yes |
\(\Downarrow{=}\) | \(\Uparrow{=}\) | ||
North Bay | \(\color{red}{\mathrm{Cases_{North \ Bay}(Mandate)}}\) | \(\mathrm{Cases_{North \ Bay}(No \ Mandate)}\) | No |
If this equivalence \((=)\) is false, then confounding (bias)
Confounding occurs because:
We use outcome in case B (North Bay) to stand in for counterfactual outcome from case A (Toronto). But outcomes in case B are not the same as in counterfactual case A.
Two ways of thinking about why this produces a bias; unifying idea is that case A and B are different in ways other than X. These other differences affect X and Y.
Explanation 1: visual
Confounding occurs when these other differences between cases (third variables, e.g. \(W\)) causally affect \(X\) and \(Y\).
This can be seen visually
Causal graphs represent a model of the true causal relationships between variables.
the nodes or dots correspond to variables
the arrows convey the direction of causality
A model of true causal relationships, because:
PHUs in Toronto (that had a mask mandate) may have a larger population of university educated adults than PHUs in North Bay (no mask mandate).
In a causal graph, there is confounding of correlation of \(X\) and \(Y\) if…
Confounding?
Why do backdoor paths produce confounding? Why is it systematic?
Explanation 2: Other differences, Different potential outcomes
Case | \(\overbrace{\text{Caseload (Yes)}}^{Y}\) | \(\overbrace{\text{Caseload (No)}}^{Y}\) | \(\overbrace{\text{Mandate}}^{X}\) | \(\overbrace{\text{Work from Home}}^{W}\) |
---|---|---|---|---|
Toronto | \(\text{Fewer Cases}\) | \(\color{red}{\text{Fewer Cases}}\) | Yes | More |
\(\Updownarrow{\neq}\) | \(\Updownarrow{\neq}\) | |||
North Bay | \(\color{red}{\text{More Cases}}\) | \(\text{More Cases}\) | No | Less |
antecedent variable: a variable that affects \(X\)
e.g. in this path, \(W \xrightarrow{} X \xrightarrow{} Y\), \(W\) is an antecedent variable.
antecedent variables (\(W\)) do not produce confounding if the only causal path from \(W\) to \(Y\) passes through \(X\).
antecedent variables do produce confounding if there is another causal path from \(W\) to \(Y\) that does NOT include \(X\).
intervening variable: a variable that affects \(Y\) and is affected by \(X\).
e.g. in this path, \(X \xrightarrow{} M \xrightarrow{} Y\), \(M\) is an intervening variable.
intervening variables (\(M\)) do not produce confounding because they are on the causal path from \(X\) to \(Y\). They do not produce backdoor path.
reverse causality describes the situation where the dependent variable \(Y\) actually causes the independent variable \(X\).
So while we use the correlation to describe the effect of \(X\) on \(Y\): \(X \to Y\), the correlation in fact is the result of the effect of \(Y\) on \(X\): \(Y \to X\).
This is a special case of bias or confounding.
Third Variable? | Key Attribute | Confounding? | |
---|---|---|---|
Antecedent Variables \((W)\) |
Yes | \(W \to X\) | If only causal path from \(W\) to \(Y\) contains \(X\): No If a causal path from \(W\) to \(Y\) excludes \(X\): Yes |
Intervening Variables \((M)\) | Yes | \(X \to M \to Y\) | No |
Reverse Causality | No | \(Y \to X\) | Yes |
Gun violence in Miami Beach has led to state of emergency and curfew during Spring Break.
“We haven’t been able to figure out how to stop spring break from coming,” Mr. Gelber said. “We don’t want spring break here, but they keep coming.”
Does gun ownership slightly reduce firearms deaths?
Let’s say Miami Beach is considering imposing gun control policies to reduce gun deaths during Spring Break. To make their decision, they look at the correlation on the previous slide…
If that correlation suffered from confounding (bias), how might it affect the policy decision and its consequences…
As with measurement bias, we need to apply weak severity principle to judge whether bias would present a problem.
Product of signs on causal path from \(W \to X\) and \(W \to Y\) gives us direction of bias created by confounding
\(W \xrightarrow{+} X\) | \(W \xrightarrow{-} X\) | |
---|---|---|
\(W \xrightarrow{+} Y\) | \(Correlation(X,Y)\) Biased (+) |
\(Correlation(X,Y)\) Biased (-) |
\(W \xrightarrow{-} Y\) | \(Correlation(X,Y)\) Biased (-) |
\(Correlation(X,Y)\) Biased (+) |
\(^*:\) this only works for a single backdoor paths; not the total of multiple backdoor paths. Then we would need more information.
We now know what confounding is and how it arises…
Consider the case of gun-control laws and gun violence::
Causal graphs point us to possible solutions:
If something else (\(W\)) changes \(X\) and \(Y\), leading to confounding…
We can use comparisons that hold \(W\) constant.
We can use comparisons that break the connection between \(W\) and \(X\).
Confounding