User:IssaRice/Two envelopes problem

From Machinelearning

This page looks at the two envelopes problem, as stated on Wikipedia. The "Wikipedia" column contains the steps given in the fallacious argument, and the "Formal interpretation" column gives the equivalent step translated into the language of probability theory.

Step Wikipedia Formal interpretation
1 I denote by A the amount in my selected envelope. First, let be the amount of money in the lesser envelope, so that is the amount in the greater envelope. We need to define a sample space. Let be a set. The intended interpretation here is that is the world in which envelope contains the larger amount () and we chose envelope . Since all choices are made randomly, we can assume a uniform distribution, i.e. for all . We need to decide what sort of object is. Since later on, we start talking about hypotheticals (e.g. "If A is the smaller amount") it makes sense to let be a random variable. We define as

For instance, if the larger amount was in envelope 1 and we chose envelope 2, then we are in the world so as expected.
2 The probability that A is the smaller amount is 1/2, and that it is the larger amount is also 1/2. We can interpret this as saying that and . Both of these are true, e.g. .
3 The other envelope may contain either 2A or A/2. Here it helps to introduce a random variable tracking the amount in the unchosen envelope. Call it . We have

Now the claim from Wikipedia is saying that either or . Since this is an equality between random variables, things are starting to get a little tricky. Formally, is an event; in fact, it is the set (for both of those pairs, and so indeed). Similarly, is the set . Now, "either or " means that and (interpreted as events) include all the possible worlds, i.e. that . This is indeed the case.
4 If A is the smaller amount, then the other envelope contains 2A. This is saying that "if then ". What could that possibly mean? Like the previous step, we have to interpret this one in terms of events. This time, the "if ... then ..." means we are talking about set inclusion. In other words, the claim is that . Well, is the event and is also the event . So in fact .
5 If A is the larger amount, then the other envelope contains A/2. Similar to the previous step. This is saying that . This is true, and in fact equality holds.
6 Thus the other envelope contains 2A with probability 1/2 and A/2 with probability 1/2. This is saying that and . Both statements are true.
7 So the expected value of the money in the other envelope is: This is where things start to go downhill. On the surface, the statement is claiming that . But this is nonsense, since the left-hand side is a real number and the right-hand side is a random variable! In other words, not even our types match. Is there some way to rescue this statement by interpreting it a different way? The left-hand side looks like . In fact, a valid way to calculate

is to partition into subsets on which takes a constant value (this is a very special case of the law of total expectation). So we have where is the value that takes on the set (i.e. the value ) and is the value that takes on the set (i.e. the value ). So we have . But now, the Wikipedia column tries to replace and by the equivalent values for , i.e. by and . This is fine, since (hence ) and (hence ). So finally, what the Wikipedia column wants to do is:

Now the problem is apparent! We have . So we are not allowed to combine these into a single "".

8 This is greater than A, so I gain on average by swapping.
9 After the switch, I can denote that content by B and reason in exactly the same manner as above.
10 I will conclude that the most rational thing to do is to swap back again.
11 To be rational, I will thus end up swapping envelopes indefinitely.
12 As it seems more rational to open just any envelope than to swap indefinitely, we have a contradiction.