User:IssaRice/Strength of a mathematical statement

From Machinelearning

In mathematics, one talks about statements being "stronger" than others, "more general" than others, a method being "more powerful" than others, etc. This page tries to point out some of the subtleties of this way of speaking.

Negation

Negating a strong statement produces a weak statement, and negating a weak statement produces a strong statement. If a statement has strong and weak components, then the flip occurs at each stage. For example, in with a weak statement, negating it produces , where the strong has become the weak , and the weak has become a strong . See Gowers's posts for more discussion on this.

Strong vs subset

A puzzle: why do we say P is stronger than Q if P is a subset of Q, but we also say that a theorem is stronger if it is more general (so bigger)?

  • One reply/intuition uses something like possible world semantics, e.g. see Wei Dai's post on Aumann's agreement theorem.[1] There is just one possible world (a single ), but our information state is the set of all possible worlds that we cannot distinguish, so the less we know, the more possible worlds we think we could be in.
  • One visualization is to use a Venn diagram. The stronger the statement, the more our movement is restricted, as we are forced to be in more and more sets.
  • When we say a strong statement like , we are saying . When we say a weak statement like , we are saying . It seems like in both cases we are accumulating more and more things.
  • But if we're working in a proof system, means we have all of separately, whereas with we only have one long statement .
  • In causal inference, I think is stronger than , even though both seem to use a single "or"-type operation. But if and are disjoint, then I think the former is true while the latter may be false. I think this is similar to how is usually stronger than , unless .
  • Maybe another way to state the puzzle is this: "P is stronger than Q" ↔ "P implies Q" ↔ "Q is at least as true as P" ↔ "Q ≥ P" (as truth values T=1 and F=0) ↔ "Q is 'at least as powerful as' P"! Obviously, the last link is the problem.
  • Let's say we have . Then we can deduce . So we can say . Let's visualize this by drawing each of as points. Then if we know , the set of statements we know is . The set of statements we are trying to prove is . But now notice something strange: is stronger than , but we have . A question might be: how do we visualize in this scheme? My first thought was "Maybe we need three copies of the diagram, so that we have , , and ". But maybe a better way to think of this is that each set such as above is a microcosm. Once you're in , it's not as small as you thought! You're actually in the set . And once you're in this microcosm/"kingdom", you can navigate to wherever you please. This explains why a strong statement is bigger (in this visualization): when we start out with more statements, our "kingdom" is bigger (we get more combinations of OR statements for free). So we can navigate to many other smaller sets of sentences (villages, towns, whatever).
  • The above vs joint distribution. Symbolically, the contrast between (the joint distribution specifies an elementary event, which is small, whereas a marginal distribution specifies a "lumped together" event, which is large) and (the more statements we know, the larger the set of statements we know).
  • What do I mean by "navigate to"? Basically I mean from mathematical logic. If and are sets of sentences, then "if we have , we can navigate to " means .

References

  1. Wei Dai. "Probability Space & Aumann Agreement". December 10, 2009.

External links