Suppose that you are worried that you might have a
rare disease. You decide to get tested, and
suppose that the testing methods for this disease
are accurate 99 percent of the time
(regardless of whether the results come back positive
or negative). Suppose this disease is actually quite
rare, occurring randomly in the general population
in only one of every 10,000 people.
If your test results come back positive,
what are your chances
that you actually have the disease?
Do you think it is approximately: (a) .99, (b) .90,
(c) .10, or (d) .01?
Surprisingly, the answer is (d), less than 1 percent
chance that you have the disease!
Presentation Suggestions:
After discussing the reasons why the test results are
not so reliable, see how changing the parameters
affects the outcome. Would the result be so surprising
if the disease were more common? How would things change
if you allow the percentage of false positives and
false negatives to be different?
The Math Behind the Fact:
This fact may be deduced using something called
Bayes' theorem, a computational device used to find
the probability of event A given event B, written P(A|B),
in terms of the probability of B given A, written P(B|A),
and the probabilities of A and B:
P(A|B)=P(A)P(B|A) / P(B)
In this case, event A is the event you have this disease,
and event B is the event that you test positive. Here,
P(B|A)=.99, P(A)=.0001, and P(B) may be derived by
conditioning on whether event A does or does not occur:
P(B)=P(B|A)P(A)+P(B|not A)P(not A)
or .99*.0001+.01*.9999, which is less than 1 percent.
Alternatively, we can see this by thinking about
what we can expect in 1 million cases.
In those million, about 100 will have the disease,
and on average 99 of those cases will be correctly
diagnosed as having it. Otherwise about 999,900 of the
million will not have the disease, but of those cases
9999 of those will be false positives
(test results that are positive because of errors).
So, if you test positive, then the likelihood that
you actually have the disease is 99/(99+9999), which
gives the same fraction as above, approximately .0098
or less than 1 percent!
You may wish to discuss why these calculations
wouldn't hold if the disease were not
independently and identically distributed
throughout the population.
For instance, if the disease were a cancer
like breast cancer that runs in families or
mesothelioma related to asbestos exposure in the workplace,
then the estimate of P(A)=.0001 would not be correct.
How to Cite this Page:
Su, Francis E., et al. "Medical Tests and Bayes' Theorem."
Mudd Math Fun Facts.
<http://www.math.hmc.edu/funfacts>.
Bookmark this page on:
|
Digg this!
|
Del.icio.us
|
Technorati
|
Reddit
|
Fark
|
Squidoo
|
Furl
|
Blinklist
|
Yahoo MyWeb
|
Google
|
Stumbleupon
|
|
|