Stat 110 Ch 2, Ex 14: Tversky and Kahneman Burglar Alarm Problem

November 22, 2020

Warning: Spoilers Follow

Ex 2.14: Consider the following scenario, from Tversky and Kahneman:
Let A be the event that before the end of next year, Peter will have installed a burglar alarm system in his home. Let B denote the event that Peter's home will be burglarized before the end of next year.

(a) Intuitively, which do you think is bigger, P(AB)P(A|B) or P(ABc)P(A|B^{c})? Explain your intuition.

P(AB)>P(ABc)P(A|B) > P(A|B^{c})

We're not given any information about Peter's situation at all. Information such as whether or not Peter has considered installing a burglar alarm previously, whether or not his home has been burglarized previously, how much money Peter has to invest in an alarm, or what he stands to lose in case of a burglary; if he has little in the way of valuable possessions, an alarm wouldn't seem worth the financial investment. And, there are many other questions you could ask about Peter's situation. Keeping things simple, though, we could say intuitively that the average person wouldn't be motivated to install a burglar alarm without having suffered a burglary first. Of course, there are people who have a strong desire to protect their homes and families, and who might think proactively and install an alarm before ever being burglarized, but it would be reasonable to assume that if Peter were such a person, he would have already installed such an alarm.

(b) Intuitively, which do you think is bigger, P(BA)P(B|A) or P(BAc)P(B|A^{c})? Explain your intuition.

P(BA)>P(Bc)P(B|A) > P(B|^{c})

The event A can happen anytime before the end of next year, which means there are more than 365 days on which a burglary could happen. The same is true for the event B. There's no information about the chronology of the events. It helps to read out the probability expressions:

P(AB)P(A|B) - The probability that A will occur in a time period that stretches between now and the end of next year given that B will occur in the same time period.
P(BA)P(B|A) - The probability that B will occur in a time period that stretches between now and the end of next year given that A will occur in the same time period.

Consider the extreme case in which the following is true:
Peter will install an alarm immediately if and only if his home is burglarized.

Fast forward to the end of next year. We're told that Peter's home was burglarized, which means we know that he has installed an alarm system. Or, say we fast forward to the end of next year, and we're told Peter has installed an alarm system. This means we know his home has been burglarized. If we think it's more likely to be the case that he will install an alarm system if his home has been burglarized, we should conclude that if his home will have been burglarized by the end of next year, that his home will have had an alarm installed.

(c) Show that for any events A and B (with probabilities not equal to 0 or 1), the inequality P(AB)>P(ABc)P(A|B) > P(A|B^{c}) is equivalent to P(BA)>P(BAc)P(B|A) > P(B|A^{c}).

The sample space can be thought of as the collection of the following 4 pieces:

a=P(AcB)a = P(A^{c} \cap B) b=P(AB)b = P(A \cap B) c=P(ABc)c = P(A \cap B^{c}) d=P(AcBc)d = P(A^{c} \cap B^{c})

We start with the inequality P(AB)>P(ABc)P(A|B) > P(A|B^{c})

(1) P(AB)P(B)>P(ABc)P(Bc)\frac {P(A \cap B)}{P(B)} >\frac {P(A \cap B^{c})}{P(B^{c})}

P(B)=P(BA)+P(BAc)=b+aP(B) = P(B \cap A) + P(B \cap A^{c}) = b + a
P(Bc)=P(BcA)+P(BcAc)=c+dP(B^{c}) = P(B^{c} \cap A) + P(B^{c} \cap A^{c}) = c + d

Cross multiplying and using a, b, c, and d above gives

(2) b(c+d)>c(a+b)b(c + d) > c(a + b)

(3) bc+bd>ac+bcbc + bd > ac + bc

(4) bd>acbd > ac

(5) bc>ad\frac{b}{c} > \frac{a}{d}

(6) P(BA)P(ABc)>P(BAc)P(AcBc)\frac{P(B \cap A)}{P(A \cap B^{c})} > \frac{P(B \cap A^{c})}{P(A^{c} \cap B^{c})}

To get us where we need to be, the left and right denominators need to be, respectively,

P(A)P(A) and P(Ac)P(A^{c})

P(A)=P(AB)+P(ABc)=b+cP(A) = P(A \cap B) + P(A \cap B^{c}) = b + c
P(Ac)=P(AcB)+P(AcBc)=a+dP(A^{c}) = P(A^{c} \cap B) + P(A^{c} \cap B^{c}) = a + d

To achieve this, we can add abab to both sides of (4) above

ab+bd>ab+acab + bd > ab + ac

b(a+d)>a(b+c)b(a + d) > a(b + c)

bb+c>aa+d\frac{b}{b + c} > \frac{a}{a + d}

P(BA)P(A)>P(BAc)P(Ac)\frac{P(B \cap A)}{P(A)} > \frac{P(B \cap A^{c})}{P(A^{c})}

P(BA)>P(BAc)P(B|A) > P(B|A^{c})
QED

(d) Tversky and Kahneman report that 131 out of 162 people whom they posed (a) and (b) to said that P(AB)>P(ABc)P(A|B) > P(A|B^{c}) and P(BA)<P(BAc)P(B|A) < P(B|A^{c}). What is a plausible explanation for why this was such a popular opinion despite (c) showing that it is impossible for both inequalities to hold?

P(BAc)P(B|A^{c}) given the definitions of A and B in the original problem statement is the probability that Peter will suffer a burglary before the end of the year given that he will have a burglar alarm installed by the end of the year. The chronology of the events is not specified, so it is possible that he will suffer a burglary before he has the alarm installed. So, if you think he will install a burglar alarm after having suffered a burglary, it makes sense to say P(BA)>P(BAc)P(B|A) > P(B|A^{c}). But, it seems many respondents took P(BAc)P(B|A^{c}) to be the probability of suffering a burglary given that he didn't have an alarm installed prior to the burglary and P(BA)P(B|A) to be the probability that he had a burglary after having an alarm installed. In that case, there's a good chance they would say P(BA)<P(BAc)P(B|A) < P(B|A^{c}) meaning that it's more likely Peter will suffer a burglary without an alarm than with one. This seems like a problem of not having parsed the language of the original problem statement thoroughly.