Conditional Probability: Real-World Scenarios (6)

hard 2 min read

Question

In a city, 1% of people have a rare disease. A test detects the disease correctly in 99% of sick people, but also gives a false positive in 5% of healthy people. If a randomly chosen person tests positive, what is the probability they actually have the disease?

This is the classic Bayes’ theorem problem — counter-intuitive and a JEE Advanced favourite.

Solution — Step by Step

Let DD = has disease, DcD^c = no disease, ++ = tests positive.

P(D)=0.01P(D) = 0.01, P(Dc)=0.99P(D^c) = 0.99.

P(+D)=0.99P(+|D) = 0.99, P(+Dc)=0.05P(+|D^c) = 0.05.

P(D+)=P(+D)P(D)P(+D)P(D)+P(+Dc)P(Dc)P(D|+) = \frac{P(+|D)P(D)}{P(+|D)P(D) + P(+|D^c)P(D^c)}

Numerator: 0.99×0.01=0.00990.99 \times 0.01 = 0.0099.

Denominator: 0.0099+0.05×0.99=0.0099+0.0495=0.05940.0099 + 0.05 \times 0.99 = 0.0099 + 0.0495 = 0.0594.

P(D+)=0.0099/0.05940.167P(D|+) = 0.0099/0.0594 \approx 0.167, or about 16.7%16.7\%.

Final answer: 16.7%\approx 16.7\%.

Why This Works

The result feels wrong because the test is “99% accurate” — but that statistic refers to true-positive rate, not the predictive value of a positive result. Most healthy people test positive too, simply because there are vastly more healthy people than sick people.

Bayes’ theorem balances the prior probability (P(D)=1%P(D) = 1\%, very small) against the likelihood ratio (0.99/0.05200.99/0.05 \approx 20, fairly informative). The product gives the posterior — moving from 1%1\% to 17%\sim 17\% is a 17-fold update, not a slam-dunk diagnosis.

Alternative Method

Use a population of 1000010000 people. Expected: 100100 sick, 99009900 healthy. True positives: 9999 (99%99\% of 100100). False positives: 495495 (5%5\% of 99009900). Total positives: 594594. Probability of disease given positive = 99/594=1/616.7%99/594 = 1/6 \approx 16.7\%.

This “natural frequencies” approach is what physicians and statisticians prefer — no formulas to remember.

JEE Advanced 2018 had a near-identical question with different numbers. Always present the population-based reasoning as your alternative — it earns step marks even if you mess up the algebra.

Common Mistake

Reading “99% accurate” and concluding P(D+)=0.99P(D|+) = 0.99. The conditional flips: P(+D)=0.99P(+|D) = 0.99 does NOT imply P(D+)=0.99P(D|+) = 0.99. The base rate P(D)P(D) matters enormously.

Want to master this topic?

Read the complete guide with more examples and exam tips.

Go to full topic guide →

Try These Next