User:IssaRice/Computability and logic/Eliezer Yudkowsky's Löb's theorem puzzle: Difference between revisions

From Machinelearning
 
(31 intermediate revisions by the same user not shown)
Line 2: Line 2:


current LW link: https://www.lesswrong.com/posts/ALCnqX6Xx8bpFMZq3/the-cartoon-guide-to-loeb-s-theorem
current LW link: https://www.lesswrong.com/posts/ALCnqX6Xx8bpFMZq3/the-cartoon-guide-to-loeb-s-theorem
This page gives spoilers for the puzzle!
This pages states the puzzle and gives the solution.
==Notation==
Precedence:
* <math>\Box</math> has high precedence, so e.g. <math>\Box C \to C</math> means <math>(\Box C) \to C</math>, and <math>\Box \Box L \to \Box C</math> means <math>(\Box \Box L) \to (\Box C)</math>.
* <math>\vdash</math> is a metalogical notion and has low precedence, so <math>\mathsf{PA} \vdash \Box C \to C</math> means <math>\mathsf{PA} \vdash (\Box C \to C)</math>.


==Translating the puzzle using logic notation==
==Translating the puzzle using logic notation==
Line 12: Line 23:


When translating to logic notation, it becomes obvious that the application of the deduction theorem is illegitimate, because we don't actually have <math>\mathsf{PA} \cup \{\Box C \to C\} \vdash C</math>. This is the initial answer that Larry D'Anna gives in comments.
When translating to logic notation, it becomes obvious that the application of the deduction theorem is illegitimate, because we don't actually have <math>\mathsf{PA} \cup \{\Box C \to C\} \vdash C</math>. This is the initial answer that Larry D'Anna gives in comments.
But now, suppose we define <math>\mathsf{PA}' := \mathsf{PA} \cup \{\Box C \to C\}</math>, and walk through the proof of Löb's theorem for this new theory <math>\mathsf{PA}'</math>. Then we would obtain the following implication: if <math>\mathsf{PA}' \vdash \Box C \to C</math>, then <math>\mathsf{PA}' \vdash C</math>. But clearly, <math>\mathsf{PA}' \vdash \Box C \to C</math> since <math>\Box C \to C</math> is one of the axioms of <math>\mathsf{PA}'</math>. Therefore by modus ponens, we have <math>\mathsf{PA}' \vdash C</math>, i.e. <math>\mathsf{PA}\cup\{\Box C \to C\} \vdash C</math>. Now we can apply the deduction theorem to obtain <math>\mathsf{PA} \vdash (\Box C \to C) \to C</math>. This means that our "Löb's theorem" for <math>\mathsf{PA}'</math> must be incorrect (note: the proof ''is'' correct for <math>\mathsf{PA}</math>, which is why Löb's theorem is a theorem; it's just incorrect for <math>\mathsf{PA}'</math>), and somewhere in the ten-step proof is an error.


==Translating the Löb's theorem back to logic==
==Translating the Löb's theorem back to logic==
Line 19: Line 32:
Since the solution to the puzzle refers back to the proof of Löb's theorem, we first translate the proof from the cartoon version back to logic:
Since the solution to the puzzle refers back to the proof of Löb's theorem, we first translate the proof from the cartoon version back to logic:


# <math>\mathsf{PA} \vdash \Box L \leftrightarrow \Box(\Box L \to C)</math>
# <math>\mathsf{PA} \vdash \Box L \leftrightarrow \Box(\Box L \to C)</math> by construction of <math>L</math>
# <math>\mathsf{PA} \vdash \Box C \to C</math>
# <math>\mathsf{PA} \vdash \Box C \to C</math> by hypothesis
# <math>\mathsf{PA} \vdash \Box(\Box L \to C) \to (\Box \Box L \to \Box C)</math>
# <math>\mathsf{PA} \vdash \Box(\Box L \to C) \to (\Box \Box L \to \Box C)</math>
# <math>\mathsf{PA} \vdash \Box L \to (\Box \Box L \to \Box C)</math>
# <math>\mathsf{PA} \vdash \Box L \to (\Box \Box L \to \Box C)</math>
Line 29: Line 42:
# <math>\mathsf{PA} \vdash \Box L</math>
# <math>\mathsf{PA} \vdash \Box L</math>
# <math>\mathsf{PA} \vdash C</math>
# <math>\mathsf{PA} \vdash C</math>
==Repeating the proof of Löb's theorem for modified theory==
We now repeat the proof of Löb's theorem for <math>\mathsf{PA}' := \mathsf{PA} \cup \{\Box C \to C\}</math> to see where the error is.
# <math>\mathsf{PA}' \vdash \Box L \leftrightarrow \Box(\Box L \to C)</math> by construction of <math>L</math>
# <math>\mathsf{PA}' \vdash \Box C \to C</math> because <math>\Box C \to C</math> is one of the axioms of <math>\mathsf{PA}'</math>
# <math>\mathsf{PA}' \vdash \Box(\Box L \to C) \to (\Box \Box L \to \Box C)</math> instance of A3
# <math>\mathsf{PA}' \vdash \Box L \to (\Box \Box L \to \Box C)</math>
# <math>\mathsf{PA}' \vdash \Box L \to \Box \Box L</math>
# <math>\mathsf{PA}' \vdash \Box L \to \Box C</math>
# <math>\mathsf{PA}' \vdash \Box L \to C</math>
So far, everything is fine. But can we assert <math>\mathsf{PA}' \vdash \Box(\Box L \to C)</math>? For <math>\mathsf{PA}</math>, we had the following:
:A1: For each sentence <math>X</math>, if <math>\mathsf{PA}\vdash X</math> then <math>\mathsf{PA} \vdash \Box X</math>
We need the following:
:A1&prime;: For each sentence <math>X</math>, if <math>\mathsf{PA}'\vdash X</math> then <math>\mathsf{PA}' \vdash \Box X</math>
Since <math>\mathsf{PA}'</math> has more axioms than <math>\mathsf{PA}</math>, we know that <math>\mathsf{PA}'</math> can prove everything that <math>\mathsf{PA}</math> can. Thus, compared to A1, both the antecedent and consequent of A1&prime; are weaker, so A1&prime; does not necessarily follow from A1.
Suppose we take <math>X</math> in A1&prime; to be <math>\Box C \to C</math>. Then we obtain
:If <math>\mathsf{PA}'\vdash \Box C \to C</math> then <math>\mathsf{PA}' \vdash \Box (\Box C \to C)</math>
Other ways we might try to get step 8 to work:
* <math>\mathsf{PA}' \vdash X \to \Box X</math>
* if <math>X</math>, then <math>\mathsf{PA} \vdash X</math> -- i think this is the one about which Larry says "ARGH WHAT ARE YOU DOING". Or maybe he means "if <math>\Delta \vdash X</math>, then <math>\Delta \vdash \Box X</math>" for arbitrary <math>\Delta</math>.
** Now one might wonder, why is <math>\Delta=\mathsf{PA}</math> so special? I think this happens because <math>\Box</math> is the provability predicate of <math>\mathsf{PA}</math> and not some other theory. In particular, if <math>\Delta \supseteq \mathsf{PA}</math>, then if <math>\Delta \vdash X</math> we only know the weaker statement that <math>\Delta</math> proves <math>X</math>, not that <math>\mathsf{PA}</math> proves it. So we shouldn't be able to prove that <math>\Delta</math> can prove <math>\Box X</math>. What if <math>\Delta \subseteq \mathsf{PA}</math>? Then I think we are in a better position. e.g. if <math>\Delta = \mathsf{Q}</math>, then I think <math>\mathsf Q</math> can still represent the provability predicate of <math>\mathsf{PA}</math> (?) so then we have <math>\mathsf Q \vdash \Box X</math>. (this part might be wrong)
But can we ''prove'' that we can never get step 8 for <math>\mathsf{PA}'</math>? Well, this is exactly what the puzzle statement does -- it assumes the proof goes through, and derives an absurdity! (? actually, i'm not sure this is actually a contradiction. If we assume PA is consistent, then it seems to be a contradiction. I think this is the point nate soares and alex flint are trying to make in the comments. but we can re-word the puzzle to say one should find the ''first'' error.)
==Other questions to answer==
* does <math>\Box</math> in <math>\mathsf{PA}'</math> still mean provability about <math>\mathsf{PA}</math>, or about itself?
* are we removing a layer or adding one?
* can't <math>\mathsf{PA}'</math> prove everything <math>\mathsf{PA}</math> can? yes, but this doesn't mean we can substitute implications automatically because the consequent also gets stronger

Latest revision as of 06:33, 10 February 2019

original link: https://web.archive.org/web/20160319050228/http://lesswrong.com/lw/t6/the_cartoon_guide_to_l%C3%B6bs_theorem/

current LW link: https://www.lesswrong.com/posts/ALCnqX6Xx8bpFMZq3/the-cartoon-guide-to-loeb-s-theorem

This page gives spoilers for the puzzle!

This pages states the puzzle and gives the solution.

Notation

Precedence:

  • has high precedence, so e.g. CC means (C)C, and LC means (L)(C).
  • is a metalogical notion and has low precedence, so PACC means PA(CC).

Translating the puzzle using logic notation

Löb's theorem shows that if PACC, then PAC.

The deduction theorem says that if PA{H}F, then PAHF.

Applying the deduction theorem to Löb's theorem gives us PA(CC)C.

When translating to logic notation, it becomes obvious that the application of the deduction theorem is illegitimate, because we don't actually have PA{CC}C. This is the initial answer that Larry D'Anna gives in comments.

But now, suppose we define PA':=PA{CC}, and walk through the proof of Löb's theorem for this new theory PA'. Then we would obtain the following implication: if PA'CC, then PA'C. But clearly, PA'CC since CC is one of the axioms of PA'. Therefore by modus ponens, we have PA'C, i.e. PA{CC}C. Now we can apply the deduction theorem to obtain PA(CC)C. This means that our "Löb's theorem" for PA' must be incorrect (note: the proof is correct for PA, which is why Löb's theorem is a theorem; it's just incorrect for PA'), and somewhere in the ten-step proof is an error.

Translating the Löb's theorem back to logic

http://yudkowsky.net/assets/44/LobsTheorem.pdf

Since the solution to the puzzle refers back to the proof of Löb's theorem, we first translate the proof from the cartoon version back to logic:

  1. PAL(LC) by construction of L
  2. PACC by hypothesis
  3. PA(LC)(LC)
  4. PAL(LC)
  5. PALL
  6. PALC
  7. PALC
  8. PA(LC)
  9. PAL
  10. PAC

Repeating the proof of Löb's theorem for modified theory

We now repeat the proof of Löb's theorem for PA':=PA{CC} to see where the error is.

  1. PA'L(LC) by construction of L
  2. PA'CC because CC is one of the axioms of PA'
  3. PA'(LC)(LC) instance of A3
  4. PA'L(LC)
  5. PA'LL
  6. PA'LC
  7. PA'LC

So far, everything is fine. But can we assert PA'(LC)? For PA, we had the following:

A1: For each sentence X, if PAX then PAX

We need the following:

A1′: For each sentence X, if PA'X then PA'X

Since PA' has more axioms than PA, we know that PA' can prove everything that PA can. Thus, compared to A1, both the antecedent and consequent of A1′ are weaker, so A1′ does not necessarily follow from A1.

Suppose we take X in A1′ to be CC. Then we obtain

If PA'CC then PA'(CC)

Other ways we might try to get step 8 to work:

  • PA'XX
  • if X, then PAX -- i think this is the one about which Larry says "ARGH WHAT ARE YOU DOING". Or maybe he means "if ΔX, then ΔX" for arbitrary Δ.
    • Now one might wonder, why is Δ=PA so special? I think this happens because is the provability predicate of PA and not some other theory. In particular, if ΔPA, then if ΔX we only know the weaker statement that Δ proves X, not that PA proves it. So we shouldn't be able to prove that Δ can prove X. What if ΔPA? Then I think we are in a better position. e.g. if Δ=Q, then I think Q can still represent the provability predicate of PA (?) so then we have QX. (this part might be wrong)

But can we prove that we can never get step 8 for PA'? Well, this is exactly what the puzzle statement does -- it assumes the proof goes through, and derives an absurdity! (? actually, i'm not sure this is actually a contradiction. If we assume PA is consistent, then it seems to be a contradiction. I think this is the point nate soares and alex flint are trying to make in the comments. but we can re-word the puzzle to say one should find the first error.)

Other questions to answer

  • does in PA' still mean provability about PA, or about itself?
  • are we removing a layer or adding one?
  • can't PA' prove everything PA can? yes, but this doesn't mean we can substitute implications automatically because the consequent also gets stronger