What you will learn
- calculate conditional probabilities using ,
- interpret the language of conditional probability: “given”, “if”, “of”, “knowing that”,
- determine whether two events are independent,
- apply conditional probability to sampling without replacement,
- read and construct two-way tables to find conditional probabilities,
- draw tree diagrams with conditional branches for dependent events.
A screening test for a rare disease has the following properties: the disease affects of the population. If a person has the disease, the test is positive of the time. If a person does not have the disease, the test is positive of the time (false positive). What is the probability that a person who tests positive actually has the disease?
- Let = has disease, = tests positive.
- , , .
- .
- .
Key idea: even with an accurate test, only about of positive results are true positives when the disease is rare. This is why confirmatory testing exists.
1. Conditional probability
The conditional probability of event given event has occurred is:
Conditional probability
Read as “the probability of given .”
The vertical bar "" means “given that” or “knowing that.” Watch for these phrases in word problems:
- “If a student plays sport, what is the probability they also play music?” means .
- “Of those who passed, what fraction studied more than hours?” means .
In a group of students, study maths, study science, and study both. Find .
- .
- .
- .
Interpretation: of the maths students, also study science — that is .
2. Independent and dependent events
Two events are independent if knowing one occurred does not change the probability of the other:
Equivalently, for independent events: .
If , the events are dependent.
From the example above: and .
Since , the events “studies science” and “studies maths” are dependent. Knowing a student studies maths slightly decreases the probability they study science.
3. Two-way tables for conditional probability
A two-way table provides all the information needed to calculate conditional probabilities directly from counts.
A survey of adults records exercise habits and health ratings:
| Good health | Poor health | Total | |
|---|---|---|---|
| Exercises regularly | 90 | 30 | 120 |
| Does not exercise | 40 | 40 | 80 |
| Total | 130 | 70 | 200 |
(a) .
(b) .
(c) .
Note: in general. Part (a) and part (c) have different values.
4. Tree diagrams with conditional branches
When events are dependent, the branches of a tree diagram show conditional probabilities. This is especially useful for sampling without replacement.
The second-draw probabilities are conditional on the first draw’s result. After drawing a red card first, only red and black remain out of total.
From the tree diagram above, find:
(a) .
(b) .
This can be read directly from the tree: on the lower branch (1st card black), the “R” branch has probability .
Practice
Tier 1: basic skills
- In a class of students, play basketball. What is ?
- Of the basketball players, are also in the swim team. What is ?
- A bag has red and blue marbles. One marble is drawn and not replaced. If the first marble was red, what is ?
- Events and satisfy , , . Find .
- Are and in Q4 independent? Justify.
- A two-way table shows: males own a pet, males do not, females own a pet, females do not. Find .
- Using the same table, find .
- A coin is tossed and a die is rolled. Are the events “heads” and “rolling a 6” independent? Explain.
- Two cards are drawn without replacement from a deck of . Find .
- State the formula for .
Tier 2: mixed practice
-
A box contains green and yellow balls. Two balls are drawn without replacement. Draw a tree diagram with conditional probabilities on each branch, and find .
-
In a school of students, study French, study German, and study both. Find: (a) , (b) .
-
A survey finds:
Supports policy Opposes policy Total Under 30 45 30 75 30 and over 35 40 75 Total 80 70 150 (a) Find and . (b) Is there an association between age group and opinion? Justify.
-
Events and are such that , , and . Find and determine whether and are independent.
-
Three machines produce items. Machine X makes of items with a defect rate. Machine Y makes with a defect rate. Machine Z makes with a defect rate. An item is selected at random. Find the probability it is defective.
Tier 3: explain and apply
- Using the machine data from Tier 2 Q5, an item is found to be defective. Find the probability it came from Machine Z.
- Explain, with a numerical example, why in general. Why is confusing these two a common and dangerous error in medical or legal contexts?
- A jar contains red and blue marbles. Three marbles are drawn without replacement. Find using a chain of conditional probabilities.
- Two events satisfy and . If , find .
Challenge
Harder reasoning
- A game show has three doors. Behind one door is a prize; behind the other two, nothing. You pick a door. The host, who knows what is behind each door, opens a different door to reveal no prize. You are offered the chance to switch. Using conditional probability, show that switching gives you a chance of winning.
- In a population, use a certain drug. A drug test has a true positive rate and a false positive rate. Find the probability that a person who tests positive actually uses the drug. Comment on the usefulness of the test.
- Prove that if and are independent, then and (the complement of ) are also independent.
- Five cards are dealt from a standard deck of . Find the probability that all five are spades, using a chain of conditional probabilities.
Answer key
Attempt the practice first. When you're ready to check, expand the answers below.
Show the full answer key
Tier 1
- .
- .
- After removing a red marble: red and blue remain out of . .
- .
- Yes, and are independent because . Knowing occurred does not change the probability of .
- .
- .
- Yes, they are independent. The outcome of the coin does not affect the die, and vice versa. .
- After removing one heart, hearts remain out of cards. .
- , where .
Tier 2
- First draw: , . If 1st is green: , . If 1st is yellow: , . .
- (a) . (b) .
- (a) . . (b) Yes, there is an association. The conditional probabilities differ: younger respondents are more likely to support the policy ( vs ). If there were no association, both groups would have the same support rate of .
- . For independence: . Since , and are not independent.
- . The probability that a randomly selected item is defective is .
Tier 3
- . . There is about a chance the defective item came from Machine Z.
- Example: . A test might detect of sick people (), but if the disease is rare, could be much lower (e.g. ). Confusing the two — called the “prosecutor’s fallacy” in legal contexts — leads to wildly wrong conclusions. In medicine, it means overestimating how likely a patient is to have the disease after a positive test.
- . . . .
- . Also , so , giving .
Challenge
- Label the doors . Suppose the prize is behind door (by symmetry, the argument is the same for any door). You pick door : host opens door or ; switching loses. You pick door : host must open door ; switching wins. You pick door : host must open door ; switching wins. So switching wins out of times. Formally: let = prize behind your door. . Given the host reveals a losing door, .
- . . . . . Only about of positive results are true positives. The test produces many false positives because the user base is so small. A confirmatory (more specific) test is essential.
- (using independence) . Since , events and are independent.
- .
Prefer paper? Print the answer key as a separate booklet: open print view ->