User:IssaRice/Chain rule proofs: Difference between revisions
(→Proof) |
|||
| Line 17: | Line 17: | ||
(there is no magic: the terms just cancel out) | (there is no magic: the terms just cancel out) | ||
If we define <math>E_g( | If we define <math>E_g(y,y_0) := g(y) - (g(y_0) + g'(y_0)(y-y_0))</math> we can write | ||
<math display="block">g(y) = g(y_0) + g'(f(x_0))(y - y_0) + E_g( | <math display="block">g(y) = g(y_0) + g'(f(x_0))(y - y_0) + E_g(y,y_0)</math> | ||
Newton's approximation says that <math>|E_g( | Newton's approximation says that <math>|E_g(y,y_0)| \leq \epsilon|y-y_0|</math> as long as <math>|y-y_0|\leq \delta</math>. | ||
Since <math>f</math> is differentiable at <math>x_0</math>, we know that it must be continuous at <math>x_0</math>. This means we can keep <math>|f(x)-y_0|\leq \delta</math> as long as we keep <math>|x-x_0|\leq \delta'</math>. | Since <math>f</math> is differentiable at <math>x_0</math>, we know that it must be continuous at <math>x_0</math>. This means we can keep <math>|f(x)-y_0|\leq \delta</math> as long as we keep <math>|x-x_0|\leq \delta'</math>. | ||
| Line 27: | Line 27: | ||
Since <math>f(x) \in Y</math> and <math>|f(x)-y_0|\leq \delta</math>, this means we can substitute <math>y = f(x)</math> and get | Since <math>f(x) \in Y</math> and <math>|f(x)-y_0|\leq \delta</math>, this means we can substitute <math>y = f(x)</math> and get | ||
<math display="block">g(f(x)) = g(y_0) + g'(f(x_0))(f(x) - y_0) + E_g( | <math display="block">g(f(x)) = g(y_0) + g'(f(x_0))(f(x) - y_0) + E_g(f(x),y_0)</math> | ||
Now we use the differentiability of <math>f</math>. We can write | Now we use the differentiability of <math>f</math>. We can write | ||
| Line 33: | Line 33: | ||
<math display="block">f(x) = f(x_0) + f'(x_0)(x - x_0) + [f(x) - (f(x_0) + f'(x_0)(x-x_0))]</math> | <math display="block">f(x) = f(x_0) + f'(x_0)(x - x_0) + [f(x) - (f(x_0) + f'(x_0)(x-x_0))]</math> | ||
Again, we can define <math>E_f( | Again, we can define <math>E_f(x,x_0) := f(x) - (f(x_0) + f'(x_0)(x-x_0))</math> and write this as | ||
<math display="block">f(x) = f(x_0) + f'(x_0)(x - x_0) + E_f( | <math display="block">f(x) = f(x_0) + f'(x_0)(x - x_0) + E_f(x,x_0)</math> | ||
Now we can substitute this into the expression for <math>g(f(x))</math> to get | Now we can substitute this into the expression for <math>g(f(x))</math> to get | ||
<math display="block">g(f(x)) = g(y_0) + g'(f(x_0))(f'(x_0)(x - x_0) + E_f( | <math display="block">g(f(x)) = g(y_0) + g'(f(x_0))(f'(x_0)(x - x_0) + E_f(x,x_0)) + E_g(f(x),f(x_0))</math> | ||
where we have canceled out two terms using <math>f(x_0) = y_0</math>. | where we have canceled out two terms using <math>f(x_0) = y_0</math>. | ||
| Line 45: | Line 45: | ||
Thus we have | Thus we have | ||
<math display="block">g(f(x)) = g(y_0) + g'(f(x_0))f'(x_0)(x - x_0) + [g'(f(x_0))E_f( | <math display="block">g(f(x)) = g(y_0) + g'(f(x_0))f'(x_0)(x - x_0) + [g'(f(x_0))E_f(x,x_0) + E_g(f(x), f(x_0))]</math> | ||
We can write this as | We can write this as | ||
<math display="block">(g\circ f)(x) - ((g\circ f)(x_0) + L(x - x_0)) = [g'(f(x_0))E_f( | <math display="block">(g\circ f)(x) - ((g\circ f)(x_0) + L(x - x_0)) = [g'(f(x_0))E_f(x,x_0) + E_g(f(x), f(x_0))]</math> | ||
where <math>L := g'(f(x_0))f'(x_0)</math>. Now the left hand side looks like the expression in Newton's approximation. This means to show <math>g\circ f</math> is differentiable at <math>x_0</math>, we just need to show that <math>|g'(f(x_0))E_f( | where <math>L := g'(f(x_0))f'(x_0)</math>. Now the left hand side looks like the expression in Newton's approximation. This means to show <math>g\circ f</math> is differentiable at <math>x_0</math>, we just need to show that <math>|g'(f(x_0))E_f(x,x_0) + E_g(f(x), f(x_0))| \leq \epsilon|x - x_0|</math>. | ||
The stuff in square brackets is our "error term" for <math>g\circ f</math>. Now we just need to make sure it is small, even after dividing by <math>|x-x_0|</math>. | The stuff in square brackets is our "error term" for <math>g\circ f</math>. Now we just need to make sure it is small, even after dividing by <math>|x-x_0|</math>. | ||
| Line 57: | Line 57: | ||
But f is differentiable at <math>x_0</math>, so by Newton's approximation, | But f is differentiable at <math>x_0</math>, so by Newton's approximation, | ||
<math display="block">|g'(f(x_0))E_f( | <math display="block">|g'(f(x_0))E_f(x,x_0)| \leq |g'(f(x_0))| \epsilon_1 |x-x_0|</math> | ||
we also have | we also have | ||
<math display="block">|E_g( | <math display="block">|E_g(f(x), f(x_0))| \leq \epsilon_2 |f(x)-f(x_0)| = \epsilon_2 |f'(x_0)(x-x_0) + E_f(x,x_0)|</math> | ||
We can bound this from above using the triangle inequality: | We can bound this from above using the triangle inequality: | ||
<math display="block">\begin{align}|E_g( | <math display="block">\begin{align}|E_g(f(x), f(x_0))| &\leq \epsilon_2 |f'(x_0)(x-x_0)| + \epsilon_2 |E_f(x,x_0)| \\ &\leq \epsilon_2 |f'(x_0)| |x-x_0| + \epsilon_2 \epsilon_1 |x-x_0|\end{align}</math> | ||
Now we can just choose <math>\epsilon_1, \epsilon_2</math> small enough. | Now we can just choose <math>\epsilon_1, \epsilon_2</math> small enough. | ||
==Limits of sequences== | ==Limits of sequences== | ||
Revision as of 02:21, 28 November 2018
Using Newton's approximation
Main idea
The main idea of using Newton's approximation to prove the chain rule is that since f is differentiable at we have the approximation when is near . Similarly since g is differentiable at we have the approximation when is near . Since f is differentiable at , it is continuous there also, so we know that is near whenever is near . This allows us to substitute into whenever is near . So we get
Thus we get , which is what the chain rule says.
Proof
Since is differentiable at , we know is a real number, and we can write
(there is no magic: the terms just cancel out)
If we define we can write
Newton's approximation says that as long as .
Since is differentiable at , we know that it must be continuous at . This means we can keep as long as we keep .
Since and , this means we can substitute and get
Now we use the differentiability of . We can write
Again, we can define and write this as
Now we can substitute this into the expression for to get
where we have canceled out two terms using .
Thus we have
We can write this as
where . Now the left hand side looks like the expression in Newton's approximation. This means to show is differentiable at , we just need to show that .
The stuff in square brackets is our "error term" for . Now we just need to make sure it is small, even after dividing by .
But f is differentiable at , so by Newton's approximation,
we also have
We can bound this from above using the triangle inequality:
Now we can just choose small enough.