User:IssaRice/Extreme value theorem

From Machinelearning

Working through the proof in Pugh's book by filling in the parts he doesn't talk about.

For x[a,b], define Vx=f([a,x])={f(t):atx} to be the image of f up to and including x.

Let M=sup{f(x):axb}=supVb and X={x[a,b]:supVx<M}.

Our goal now is to find some x such that f(x)=M. If f(a)=M this is easy.

So now suppose f(a)<M. Then aX. We already know that X is bounded above, for instance by the number b. We can thus take the least upper bound of X, say c=supX. We already know f(c)M, so if we can just eliminate the possibility that f(c)<M, we will be done.

So suppose f(c)<M. We want to find M<M such that f(t)<M for all t[a,c]. That would mean that supVcM<M. To do this, we split the interval into two parts. Choose ϵ>0 with ϵ<Mf(c).[note 1] By continuity at c, there exists a δ>0 such that |tc|<δ implies |f(t)f(c)|<ϵ. So now pick a point like cδ/2, and split the interval into [a,cδ/2] and [cδ/2,c].

  • Consider t=cδ/2. Then since t<c, there exists xX such that t<x. So supVtsupVc<M so f(t)supVt<M. This also means that for all x[a,cδ/2] we have f(x)supVt<M.
  • But now if cδ/2tc, then This means f(t)<f(c)+ϵ<M.

Now we can choose M=max{supVcδ/2,f(c)+ϵ}.

If c<b then


If tcδ/2 then there exists some xX such that t<x. This means supVtsupVx<M so tX. Otherwise if cδ/2tc then |tc|<δ so f(t)<f(c)+ϵ<M. So now what can we say about supVc? We want to say supVc<M. We can do this by showing that there exists a number M<M such that f(t)<M for all t[a,c]. That way, supVcM<M. But M=supVc works.


Therefore, c=b, which implies that . So the assumption that f(c)<M was false, and we conclude f(c)=M.

If c=b then M=supVb=supVc<M, a contradiction.

Notes

  1. It is important here that ϵ does not equal Mf(c); choosing this ϵ would be too weak and we would not be able to conclude supVc<M, rather only that supVcM.