User:IssaRice/Computability and logic/Eliezer Yudkowsky's Löb's theorem puzzle: Difference between revisions

From Machinelearning
No edit summary
Line 53: Line 53:
So far, everything is fine. But can we assert <math>\mathsf{PA}' \vdash \Box(\Box L \to C)</math>? For <math>\mathsf{PA}</math>, we had the following:
So far, everything is fine. But can we assert <math>\mathsf{PA}' \vdash \Box(\Box L \to C)</math>? For <math>\mathsf{PA}</math>, we had the following:


:A1: For each <math>X</math>, if <math>\mathsf{PA}\vdash X</math> then <math>\mathsf{PA} \vdash \Box X</math>
:A1: For each sentence <math>X</math>, if <math>\mathsf{PA}\vdash X</math> then <math>\mathsf{PA} \vdash \Box X</math>


We need the following:
We need the following:


:A1&prime;: For each <math>X</math>, if <math>\mathsf{PA}'\vdash X</math> then <math>\mathsf{PA}' \vdash \Box X</math>
:A1&prime;: For each sentence <math>X</math>, if <math>\mathsf{PA}'\vdash X</math> then <math>\mathsf{PA}' \vdash \Box X</math>


Since <math>\mathsf{PA}'</math> has more axioms than <math>\mathsf{PA}</math>, we know that <math>\mathsf{PA}'</math> can prove everything that <math>\mathsf{PA}</math> can. Thus, compared to A1, both the antecedent and consequent of A1&prime; are stronger, so A1&prime; does not necessarily follow from A1.
Since <math>\mathsf{PA}'</math> has more axioms than <math>\mathsf{PA}</math>, we know that <math>\mathsf{PA}'</math> can prove everything that <math>\mathsf{PA}</math> can. Thus, compared to A1, both the antecedent and consequent of A1&prime; are stronger, so A1&prime; does not necessarily follow from A1.

Revision as of 04:28, 10 February 2019

original link: https://web.archive.org/web/20160319050228/http://lesswrong.com/lw/t6/the_cartoon_guide_to_l%C3%B6bs_theorem/

current LW link: https://www.lesswrong.com/posts/ALCnqX6Xx8bpFMZq3/the-cartoon-guide-to-loeb-s-theorem

Notation

Precedence:

  • has high precedence, so e.g. CC means (C)C, and LC means (L)(C).
  • is a metalogical notion and has low precedence, so PACC means PA(CC).

Translating the puzzle using logic notation

Löb's theorem shows that if PACC, then PAC.

The deduction theorem says that if PA{H}F, then PAHF.

Applying the deduction theorem to Löb's theorem gives us PA(CC)C.

When translating to logic notation, it becomes obvious that the application of the deduction theorem is illegitimate, because we don't actually have PA{CC}C. This is the initial answer that Larry D'Anna gives in comments.

But now, suppose we define PA:=PA{CC}, and walk through the proof of Löb's theorem for this new theory PA. Then we would obtain the following implication: if PACC, then PAC. But clearly, PACC since CC is one of the axioms of PA. Therefore by modus ponens, we have PAC, i.e. PA{CC}C. Now we can apply the deduction theorem to obtain PA(CC)C. This means that our "Löb's theorem" for PA must be incorrect (note: the proof is correct for PA, which is why Löb's theorem is a theorem; it's just incorrect for PA), and somewhere in the ten-step proof is an error.

Translating the Löb's theorem back to logic

http://yudkowsky.net/assets/44/LobsTheorem.pdf

Since the solution to the puzzle refers back to the proof of Löb's theorem, we first translate the proof from the cartoon version back to logic:

  1. PAL(LC)
  2. PACC
  3. PA(LC)(LC)
  4. PAL(LC)
  5. PALL
  6. PALC
  7. PALC
  8. PA(LC)
  9. PAL
  10. PAC

Repeating the proof of Löb's theorem for modified theory

We now repeat the proof of Löb's theorem for PA:=PA{CC} to see where the error is.

  1. PAL(LC) by definition of L
  2. PACC because CC is one of the axioms of PA
  3. PA(LC)(LC)
  4. PAL(LC)
  5. PALL
  6. PALC
  7. PALC

So far, everything is fine. But can we assert PA(LC)? For PA, we had the following:

A1: For each sentence X, if PAX then PAX

We need the following:

A1′: For each sentence X, if PAX then PAX

Since PA has more axioms than PA, we know that PA can prove everything that PA can. Thus, compared to A1, both the antecedent and consequent of A1′ are stronger, so A1′ does not necessarily follow from A1.

Suppose we take X in A1′ to be CC. Then we obtain

If PACC then PA(CC)

Other ways we might try to get step 8 to work:

  • PAXX
  • if X, then PAX

Other questions to answer

  • does in PA still mean provability about PA, or about itself?
  • are we removing a layer or adding one?
  • can't PA prove everything PA can? yes, but this doesn't mean we can substitute implications automatically because the consequent also gets stronger