User:IssaRice/Extreme value theorem: Difference between revisions
No edit summary |
No edit summary |
||
| Line 9: | Line 9: | ||
So now suppose <math>f(a) < M</math>. Then <math>a \in X</math>. We already know that <math>X</math> is bounded above, for instance by the number <math>b</math>. We can thus take the least upper bound of <math>X</math>, say <math>c = \sup X</math>. We already know <math>f(c) \leq M</math>, so if we can just eliminate the possibility that <math>f(c) < M</math>, we will be done. | So now suppose <math>f(a) < M</math>. Then <math>a \in X</math>. We already know that <math>X</math> is bounded above, for instance by the number <math>b</math>. We can thus take the least upper bound of <math>X</math>, say <math>c = \sup X</math>. We already know <math>f(c) \leq M</math>, so if we can just eliminate the possibility that <math>f(c) < M</math>, we will be done. | ||
So suppose for sake of contradiction that <math>f(c) < M</math>. Choose <math>\epsilon > 0</math> with <math>\epsilon < M - f(c)</math>.<ref group="note">It is important here that <math>\epsilon</math> does not equal <math>M - f(c)</math>; choosing this <math>\epsilon</math> would be too weak and we would not be able to conclude <math>\sup V_c < M</math>, rather only that <math>\sup V_c \leq M</math>.</ref> Continuity of <math>f</math> at <math>c</math> implies that there exists <math>\delta > 0</math> such that <math>x \in (c-\delta, c+\delta)\cap [a,b]</math> implies <math>|f(x)-f(c)|<\epsilon</math>. Now, <math>c-\delta</math> cannot be an upper bound of <math>X</math>, so there exists some <math>x_0 \in X</math> such that <math>c-\delta < x_0 \leq c</math>. Thus for <math>x \in [a,x_0]</math> we have <math>f(x) \leq \sup V_{x_0} < M</math>. Also, for <math>x \in [x_0, c] \subseteq (c-\delta, c+\delta)</math> we have <math>f(x) < f(c) + \epsilon < M</math>. So <math>f</math> is bounded above by <math>\max\{\sup V_{x_0}, f(c) + \epsilon\} < M</math> on <math>[a,c]</math>, which means <math>\sup V_c < M</math>.<ref group="note">This part of the proof uses quite a bit of "low-level" argumentation, so it can be easy to miss the broader point. The reason we split the interval <math>[a,c]</math> into two parts is that we know two facts about <math>f</math>: (1) near <math>c</math>, continuity shows that <math>f</math> must be close to the value of <math>f(c)</math>; since we assumed <math>f(c) < M</math>, this means we can find a neighborhood around <math>c</math> where <math>f</math> is bounded away from <math>M</math>. (2) up to <math>c</math>, our choice of <math>c</math> means the value of <math>f</math> is bounded away from <math>M</math>. Then we pick <math>x_0</math> as a "handing off point" to pass from one side to the other.</ref> If <math>c < b</math>, then there are points to the right of <math>c</math> where <math>f</math> continues to stay below <math>f(c) + \epsilon < M</math>, which contradicts the fact that <math>c</math> is an upper bound of <math>X</math>. So <math>c=b</math>, which means <math>M = \sup V_b = \sup V_c < M</math>, a contradiction. | So suppose for sake of contradiction that <math>f(c) < M</math>. Choose <math>\epsilon > 0</math> with <math>\epsilon < M - f(c)</math>.<ref group="note">It is important here that <math>\epsilon</math> does not equal <math>M - f(c)</math>; choosing this <math>\epsilon</math> would be too weak and we would not be able to conclude <math>\sup V_c < M</math>, rather only that <math>\sup V_c \leq M</math>.</ref> Continuity of <math>f</math> at <math>c</math> implies that there exists <math>\delta > 0</math> such that <math>x \in (c-\delta, c+\delta)\cap [a,b]</math> implies <math>|f(x)-f(c)|<\epsilon</math>. Now, <math>c-\delta</math> cannot be an upper bound of <math>X</math>, so there exists some <math>x_0 \in X</math> such that <math>c-\delta < x_0 \leq c</math>. Thus for <math>x \in [a,x_0]</math> we have <math>f(x) \leq \sup V_{x_0} < M</math>. Also, for <math>x \in [x_0, c] \subseteq (c-\delta, c+\delta)</math> we have <math>f(x) < f(c) + \epsilon < M</math>. So <math>f</math> is bounded above by <math>\max\{\sup V_{x_0}, f(c) + \epsilon\} < M</math> on <math>[a,c]</math>, which means <math>\sup V_c < M</math>.<ref group="note">This part of the proof uses quite a bit of "low-level" argumentation, so it can be easy to miss the broader point. The reason we split the interval <math>[a,c]</math> into two parts is that we know two facts about <math>f</math>: (1) near <math>c</math>, continuity shows that <math>f</math> must be close to the value of <math>f(c)</math>; since we assumed <math>f(c) < M</math>, this means we can find a neighborhood around <math>c</math> where <math>f</math> is bounded away from <math>M</math>. (2) up to <math>c</math>, our choice of <math>c</math> means the value of <math>f</math> is bounded away from <math>M</math>. Then we pick <math>x_0</math> as a "handing off point" to pass from one side to the other.</ref> If <math>c < b</math>, then there are points to the right of <math>c</math> where <math>f</math> continues to stay below <math>f(c) + \epsilon < M</math>, which contradicts the fact that <math>c</math> is an upper bound of <math>X</math>. So <math>c=b</math>, which means <math>M = \sup V_b = \sup V_c < M</math>, a contradiction. The assumption that <math>f(c) < M</math> was false, and we conclude <math>f(c) = M</math>. | ||
==Takeaways== | ==Takeaways== | ||
Revision as of 05:09, 29 July 2019
Working through the proof in Pugh's book by filling in the parts he doesn't talk about.
For , define to be the values that takes on as the input ranges from to (inclusive).
Let (this number exists by the boundedness theorem) and .[note 1]
Our goal now is to find some such that . The idea now is to locate the leftmost point where attains by taking the supremum of . But we have a small problem, which is that might be empty (it is however always bounded, so we don't need to worry about that part). This can happen if , in which case . But if that's the case, we have already found a point where equals , so we're actually done!
So now suppose . Then . We already know that is bounded above, for instance by the number . We can thus take the least upper bound of , say . We already know , so if we can just eliminate the possibility that , we will be done.
So suppose for sake of contradiction that . Choose with .[note 2] Continuity of at implies that there exists such that implies . Now, cannot be an upper bound of , so there exists some such that . Thus for we have . Also, for we have . So is bounded above by on , which means .[note 3] If , then there are points to the right of where continues to stay below , which contradicts the fact that is an upper bound of . So , which means , a contradiction. The assumption that was false, and we conclude .
Takeaways
- "less than" vs "bounded away from"
Notes
- ↑ If we had used "" in the definition of , then when we take the supremum we would just end up with , regardless of where achieves the maximum.
- ↑ It is important here that does not equal ; choosing this would be too weak and we would not be able to conclude , rather only that .
- ↑ This part of the proof uses quite a bit of "low-level" argumentation, so it can be easy to miss the broader point. The reason we split the interval into two parts is that we know two facts about : (1) near , continuity shows that must be close to the value of ; since we assumed , this means we can find a neighborhood around where is bounded away from . (2) up to , our choice of means the value of is bounded away from . Then we pick as a "handing off point" to pass from one side to the other.