2024-08-08
observe an \(X\), want to know what to expect for \(Y\)
1. X = patient caughs, Y = patient has lung cancer
2. X = ECG, Y = patient has heart attack
3. X = CT-scan, Y = patient dies within 2 years
Let \(f\) depend on parameter \(\theta\), prediction typically aims for:
\[f_{\theta}(x) \to E[Y|X=x]\]
typical estimand:
\[E[Y|\text{do}(T=1)] - E[Y|\text{do}(T=0)]\]
prediction
causal inference
A treatment policy \(\pi\) is a procedure for determining the treatment
Assuming \(T\) is binary, \(\pi\) can be:
\[\pi(blood pressure) = \begin{cases} 1, &blood pressure > 140mmHg\\ 0, &\text{otherwise} \end{cases}\]
\[\pi(X) = \begin{cases} 1, &f(X) > 0.1\\ 0, &\text{otherwise} \end{cases}\]
For example:
TRIPOD+AI on prediction models (Collins et al. 2024)
“Their primary use is to support clinical decision making, such as … initiate treatment or lifestyle changes.”
This may lead to bad situations when:
The question is not “is my model accurate before / after deployment”,
but did deploying the model improve patient outcomes?
\[\begin{align} E[Y|X] \class{fragment}{= E[E_{t~\sim \pi_0(X)}[Y|X,t]]} \end{align}\]
Is this obvious?
What to do?
What to do?
With causal inference!
Using cluster RCTs to evaluated models for decision making is not a new idea (Cooper et al. 1997)
“As one possibility, suppose that a trial is performed in which clinicians are randomized either to have or not to have access to such a decision aid in making decisions about where to treat patients who present with pneumonia.”
What we don’t learn
was the model predicting anything sensible?
How to build prediction models for decision support?
Let \(f: \mathbb{X} \to \mathbb{Y}\) be a prediction model for outcome \(Y\) using features \(X\)
\[f(x) = E[Y|\text{do}(X=x)]\]
\[f(t,x) = E[Y|\text{do}(T=t),X=x]\]
Let \(f: \mathbb{X} \to \mathbb{Y}\) be a prediction model for outcome \(Y\) using features \(X\)
\[f(x) = E[Y|\text{do}(X=x)]\]
Qrisk3: a risk prediction model for cardiovascular events in the coming 10-years. Widely used in the United Kingdom for deciding which patients should get statins
can go wrong when:
\[f(t,x) = E[Y|\text{do}(T=t),X=x]\]
using treatment naive prediction models for decision support
prediction-under-intervention
What is the estimand?
note:
requires causal inference assumptions or RCTs
single RCTs often not big enough, or did not measure the right \(X\)s
when \(X\) is not a sufficient adjustment set, but \(X+L\) is, can use e.g. propensity score methods
assumption of no unobserved confounding often hard to justify in observational data
but there’s more between heaven (RCT) and earth (confounder adjustment)
not covered now: formulating correct estimands (and getting the right data) becomes much more complicated when considering dynamic treatment decision processes (e.g. blood pressure control with multiple follow-up visits)
assuming \(x\) is discrete, otherwise replace sums with integrals for continuous \(x\)
want to compute the expected value of \(g(x)\) over distribution \(p\), but we have samples from another distribution \(x \sim q\)
\[E_{x \sim q} \left[ \frac{p(x)}{q(x)} g(x) \right] = \sum_x q(x) \left( \frac{p(x)}{q(x)} g(x) \right) = \sum_x p(x) g(x) = E_{x \sim p} \left[g(x) \right]\]
this assumes \(q(x)>0\) whenever \(p(x)>0\) for the ratio \(p/q\) to be defined