(1) Measurement Error
- What is it?
- Histograms
- bias/systematic error
- random measurement error
October 7, 2025
Light et al 2020 investigate claim: “undocumented migrants are prone to violent crime”
concepts: undocumented, violent criminals
variable: conviction rates for violent crime (homicide, assault, robbery, sexual assault) for US-born citizens, legal immigrants, undocumented immigrants
measure: individual crimes listed in arrests in the Texas Computerized Criminal History database, individual immigration status as determined by DHS and ICE using biometrics database, numbers of undocumented migrants using Census data
Not the end of the story: Kennedy et al at Center for Immigration Studies dispute these findings:
Complaints focus on:
Kennedy et al, argue:
It takes time for undocumented immigrants in custody to be identified.
Only people in custody for longer periods of time for serious crimes likely to be thoroughly checked:
is a difference between the observed value of a variable for a case (produced by the measurement procedure) and the true value of the variable for that case.
\[\mathrm{Value}_{observed} - \mathrm{Value}_{true} \neq 0 \xrightarrow{then} \mathrm{measurement \ error}\]
If what we observe is different from the true value for a case (difference is not 0), then there is measurement ERROR
(note: we don’t usually know the true value!)
Kennedy et al are saying that using the incorrect procedures leads to:
\[\mathrm{Migrant \ Crime \ Rate}_{observed} < \mathrm{Migrant \ Crime \ Rate}_{true} \\\xrightarrow{then} \\ \mathrm{measurement \ error}\]
What is the incidence of sexual misconduct defined here at UBC?
Let’s say a variable is the number of breaches of Sexual Misconduct Policy in a given year.
Measure: Reporting from the UBC Investigations Office.
What is the incidence of sexual misconduct defined here at UBC?
Let’s say a variable is the number of breaches of Sexual Misconduct Policy in a given year.
Measure: Reporting from the UBC Investigations Office.
That implies \(13\) incidents in 2024-2025 Academic Year (last available data).
39 reports \(\to\) 21 investigations \(\to\) 19 completed investigations \(\to\) 13 breaches found
\[\mathrm{Sexual \ Misconduct }_{observed} - \mathrm{Sexual \ Misconduct}_{true} \neq 0\]
\[\xrightarrow{then} \mathrm{measurement \ error}\]
Different the patterns of \(\mathrm{Value}_{observed} - \mathrm{Value}_{true}\) that we see.
Different implications for severity of evidence.
Measures may suffer from both.
You need to:
bias or systematic measurement error: error produced when our measurement procedure obtains values that are, on average, too high or too low (or incorrectly labelled) compared to the truth.
Kennedy et al argue that Light et al’s measurement procedures lead to, on average:
\[\mathrm{Migrant \ Homicide \ Rate }_{observed} - \mathrm{Migrant \ Homicide \ Rate}_{true} < 0\]
\[\xrightarrow{then} \mathrm{measurement \ bias}\]
bias different in different subgroups
random measurement error: errors that occur due to random features of measurement process. Even if observed values are sometimes wrong, they are, on average, correct
Variable: relative change in COVID-19 infections
Measure: “Composite wastewater influent is collected over a 24-hour period from wastewater treatment plants (WWTPs). Samples are collected 2-3x per week at each WWTP and are transported by the BCCDC PHL for analysis. Wastewater samples are concentrated by ultracentrifugal filtration, nucleic acids extracted and SARS-CoV-2 envelope gene (E gene) is detected by real-time quantitative polymerase chain reaction (RT-qPCR).”
Day-to-day variation in:
can lead to errors in measurement, but these errors…
We don’t usually know for sure whether measurement errors are bias or random:
About how many cases among people between 70 and 80?
About how many cases among people between 20 and 30?
Histograms of measurement errors for many cases help us tell us what type of measurement error
…if there is measurement error?
We only observe what we observe, how do we know procedure does not return the true value?
…if there is measurement error?
Measurement error is everywhere. Does not mean we can say nothing about descriptive claims.
Need two concepts/variables/measures:
For each one, what are possible kinds of measurement error?
concept: Anti-refugee Violence
variable: Number of attacks against refugee persons and property
measure: (for each week)
concept: Anti-refugee speech on social media
variable: Number of anti-refugee posts on Facebook per week
measure:
Example Facebook posts:
What pattern do we see here between anti-refugee Facebook posts and anti-refugee violence over time?
Could the sources of measurement error we lead us to observe shared trends between anti-refugee posts and anti-refugee violence when they in fact don’t?
Measurement Bias arises from observations made by and of people:
(\(1\)) Subjectivity/Perspective: Researcher/data collector systematically perceives and evaluates cases incorrectly
Examples:
(\(2\)) Motives/Incentives to mis-represent: beyond researchers, people generating the data
If we surveyed Canadians and asked them:
“And would you oppose stopping all immigration into Canada?”
They can choose “oppose”, “support”, “neither support nor oppose”
Do you think this survey response would suffer from measurement bias?
List experiments
(board)
List experiments in US vs Canada
How many people are opposed to stopping immigration?
When discussing crime rates for natural-born citizens, legal immigrants, and undocumented immigrants, need to get the number of undocumented immigrants.
Why might it be difficult to correctly count?
(\(3\)) Use of data beyond its intended purposes: without knowing how data is produced, unanticipated errors can arise.
Kennedy et al, argue:
It takes time for undocumented immigrants in custody to be identified.
Only people in custody for longer periods of time for serious crimes likely to be thoroughly checked:
Alex Nowsrateh shows that these conclusions came from misunderstanding of the Texas data:
Kennedy et al takes count of all incidents where people labelled as undocumented from DHS and TDCJ. May double count individuals.
“We can supply the number uniquely identified by TDCJ (Prison category) and the total number of Illegals identified through PEP (this can include illegals also identified by TDCJ). Please note, if someone was uniquely identified through TDCJ, but at a later time is identified through PEP, the individual would no longer be in the Prison category and would reflect the PEP identification” [emphasis added].
Anything that is unrelated to the actual values for the cases we want to observe and may equally over-/under-estimate. Unless the error is literally generated by e.g. rolling dice, we only assume/argue that it is random
We need to distinguish between random and systematic errors. Does the source of the error suggest a systematic direction to the error?
Is the magnitude of the error likely to be large or small? Is it possible to assess how wrong it could be?
If the error is a bias, what is the systematic pattern that is produced? (upward?, downward?)
These answers can inform us whether measurement error affects whether evidence has weak severity. (next lecture)
Measurement Error