3.6 Section 3.2 Newton root finding

  3.6.1 Definition of convex function
  3.6.2 Newton method to solve set of equations

Understanding Newton method

Start from the line equation. You remember we normally write it as

y=c+mx where c is the y-axis intercept and m is the slope. But this equation always implicitly assumed that the slope is taken at x0=0, and the intercept is also at x0=0, hence the above can be written as (1)y=f(x)=f(x0)+f(x0)(xx0)

The above equation is the same as y=c+mx when x0=0

So from (1)

f(x)=f(x0)+f(x0)(xx0)f(x)f(x0)f(x0)=(xx0)

Now instead of writing x and x0 we write xn+1 and xn, so the above becomes

f(xn+1)f(xn)f(xn)=(xn+1xn)xn+1=f(xn+1)f(xn)f(xn)+xn

That is it. Now for xn+1, f(xn+1)=0, and that is the whole idea. Replace f(xn+1) by zero in the above we obtain

xn+1=xnf(xn)f(xn)

Which is Newton method.

Theorem: Let fbe C2 and let r be a simple zero. There is a neighborhood of r and a constant C s.t. if Newton method is started in that neighborhood it will converge to r according to |xn+1r|C|xn1|2

Proof of quadratic convergence order

pict
Figure 3.12:quadratic convergence

From diagram we see that

(1)en+1=rxn+1

But from definition of Newton root finding

(2)xn+1=xnf(xn)f(xn)

substituting (2) into (1) gives

en+1=r(xnf(xn)f(xn))=rxnen+f(xn)f(xn)=en+f(xn)f(xn)(3)=enf(xn)+f(xn)f(xn)

Now evaluate f(x) at r by expanding it using Taylor around xn

f(r)=f(xn)+hf(xn)+h2f(ξ)2!

But h is the distance between r and xn, which is en, hence the above becomes

f(r)=f(xn)+enf(xn)+en2f(ξ)2!

But f(r)=0 since this is a root finding, and that is our goal. Hence the above becomes

0=f(xn)+enf(xn)+en2f(ξ)2!(4)f(xn)+enf(xn)=f(ξ)2!en2

Substituting (4) into numerator of (3) gives

en+1=f(ξ)2!en2f(xn)=f(ξ)2f(xn)en2

but f(ξ)f(xn)f(r)f(r)=k (some constant), hence above becomes

en+1=k1en2

where k1 is some constant as well.

Hence we can find a constant C such that en+1Cen2 where C is any constant smaller than k1

The above proofes that Newton method converges quadratically.

3.6.1 Definition of convex function

A function f(x) is convex if f(x) 0 for all x.

Mathworld has this definition

”A convex function is a continuous function whose value at the midpoint of every interval in its domain does not exceed the arithmetic mean of its values at the ends of the interval.” diagram below from Mathworld

pict
Figure 3.13:Mathworld

How to use Newton method to find R ?

Let x=R, hence x2R=0 is f(x) to use with Newton method. This leads to xn+1=xnxn2R2xn=12(xn+Rxn)

3.6.2 Newton method to solve set of equations

Writing

[xn+1yn+1]=[xnyn]F(xn)J1(xn)=[xnyn][F1(xn,yn)F2(xn,yn)][F1(xn,yn)xnF1(xn,yn)ynF2(xn,yn)xnF2(xn,yn)xn]