Probabilities, Conditional Independence and Assumption Parlance
2025-07-07
Probability statements about random events \(A\) and \(B\)
Say we have 100 patients, we can tabulate them according to their cancer status and whether they died or not.
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 5 | 10 |
has no cancer | 10 | 80 | 90 | |
15 | 85 | 100 |
statement | interpretation |
---|---|
\(P(A=1)\) | marginal probability that event \(A\) occurs |
\(P(B=1)\) | marginal probability that event \(B\) occurs |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 5 | 10 |
has no cancer | 10 | 80 | 90 | |
15 | 85 | 100 |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | |||
has no cancer | ||||
15 | 85 | 100 |
\(P(A=1) = 15 / 100\)
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 10 | ||
has no cancer | 90 | |||
100 |
\(P(B=1) = 10 / 100\)
statement | interpretation |
---|---|
\(P(A)\) | marginal probability that event \(A\) occurs |
\(P(A=1,B=1)\) | joint probability of \(A\) and \(B\) |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 5 | 10 |
has no cancer | 10 | 80 | 90 | |
15 | 85 | 100 |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | ||
has no cancer | ||||
100 |
\(P(B=1,A=1) = 5 / 100\)
statement | interpretation |
---|---|
\(P(A)\) | marginal probability that event \(A\) occurs |
\(P(A,B)\) | joint probability of \(A\) and \(B\) |
\(P(A=1|B=1)\) | conditional probability of \(A\) given \(B\) |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 5 | 10 |
has no cancer | 10 | 80 | 90 | |
15 | 85 | 100 |
- marginal \(P(A=1) = 15/100\)
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 5 | 10 |
has no cancer | ||||
- marginal \(P(A=1) = 15/100\)
- conditional \(P(A=1|B=1) = 5 / 10\)
A marginal probability can be computed by summing over the joint probabilities of all possible values of the other random event.
statement | interpretation |
---|---|
\(P(A) = \sum_{b} P(A,B=b)\) | marginal is sum over joint |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | ||
has no cancer | 10 | |||
15 | 100 |
\[\begin{align} P(A=1) &= P(A=1,B=1) + P(A=1,B=0) \\ &= 5/100 + 10/100 \\ & = 15/100 \end{align}\]
A joint probability can be computed by multiplying the conditional probability of one random event given the other random event with the marginal probability of the other random event.
statement | interpretation |
---|---|
\(P(A) = \sum_{b} P(A,B=b)\) | marginal is sum over joint |
\(P(A,B) = P(A|B)P(B)\) | product rule |
A | ||||
---|---|---|---|---|
dies | lives | |||
B | has cancer | 5 | 10 | |
has no cancer | ||||
100 |
\[\begin{align} P(A=1,B=1) &= P(A=1|B=1)P(B=1) \\ &= 5/10 * 10/100 \\ & = 5/100 \end{align}\]
With these two rules and basic algebra, we can derive more identities
statement | interpretation |
---|---|
\(P(A) = \sum_{b} P(A,B=b)\) | marginal is sum over joint |
\(P(A,B) = P(A|B)P(B)\) | product rule |
a conditional probability can be computed by dividing the joint probability of the two random events by the marginal probability of the other random event, since1
\[ x = y * z \implies y = \frac{x}{z} \]
statement | interpretation |
---|---|
\(P(A) = \sum_{b} P(A,B=b)\) | marginal is sum over joint |
\(P(A,B) = P(A|B)P(B)\) | product rule |
\(P(A|B) = \frac{P(A,B)}{P(B)}\) | conditional is joint over marginal (follows from product rule) |
statement | interpretation |
---|---|
\(P(A) = \sum_{b} P(A,B=b)\) | marginal is sum over joint |
\(P(A,B) = P(A|B)P(B)\) | product rule |
\(P(A|B) = \frac{P(A,B)}{P(B)}\) | conditional is joint over marginal (follows from product rule) |
\(P(A|C) = \sum_{b} P(A|B=b,C)P(B=b|C)\) | total probability (consequence of marginal vs joint and product rule) |
statement | interpretation |
---|---|
\(P(A,B) = P(A)P(B)\) | (marginal) independence of \(A\) and \(B\) |
some events may not be independent in general, but they may be independent given some other event \(C\).
statement:
\[P(A,B|C) = P(A|C)P(B|C)\]
\[P(A|B,C) = P(A|C)\]
statement | interpretation |
---|---|
\(P(A,B) = P(A)P(B)\) | (marginal) independence of \(A\) and \(B\) |
\(P(A,B|C) = P(A|C)P(B|C)\) | conditional independence of \(A\) and \(B\) given \(C\) |
\(P(A|B,C) = P(A|C)\) | conditional independence of \(A\) and \(B\) given \(C\) |
We are asked to prove:
\[P(A \mid C) = \sum_b P(A \mid B = b, C) \, P(B = b \mid C)\]
\[\begin{align} P(A \mid C) &= \sum_b P(A, B = b \mid C) && \text{(sum rule)} \\ &= \sum_b P(A \mid B = b, C) \, P(B = b \mid C) && \text{(product rule)} \end{align}\]
We will prove that the conditional independence statement
\[ P(A \mid B, C) = P(A \mid C) \]
is equivalent to
\[ P(A, B \mid C) = P(A \mid C) \cdot P(B \mid C) \]
using basic rules of probability.
Assume
\[ P(A, B \mid C) = P(A \mid C) \cdot P(B \mid C) \]
By the product rule,
\[ P(A, B \mid C) = P(A \mid B, C) \cdot P(B \mid C) \]
Comparing both expressions:
\[ P(A \mid B, C) \cdot P(B \mid C) = P(A \mid C) \cdot P(B \mid C) \]
Divide both sides by \(P(B \mid C) > 0\), we get:
\[ P(A \mid B, C) = P(A \mid C) \]
Assume
\[ P(A \mid B, C) = P(A \mid C) \]
Again by the product rule:
\[ P(A, B \mid C) = P(A \mid B, C) \cdot P(B \mid C) \]
Substitute \(P(A \mid C)\) for \(P(A \mid B, C)\), we get:
\[ P(A, B \mid C) = P(A \mid C) \cdot P(B \mid C) \]
\[ P(A \mid B, C) = P(A \mid C) \quad \Longleftrightarrow \quad P(A, B \mid C) = P(A \mid C) \cdot P(B \mid C) \]
as required.