Kantorovich theorem

From testwiki
Revision as of 18:57, 17 December 2023 by 192.16.204.250 (talk) (Assumptions: Fixed confusing text)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description The Kantorovich theorem, or Newton–Kantorovich theorem, is a mathematical statement on the semi-local convergence of Newton's method. It was first stated by Leonid Kantorovich in 1948.[1][2] It is similar to the form of the Banach fixed-point theorem, although it states existence and uniqueness of a zero rather than a fixed point.[3]

Newton's method constructs a sequence of points that under certain conditions will converge to a solution x of an equation f(x)=0 or a vector solution of a system of equation F(x)=0. The Kantorovich theorem gives conditions on the initial point of this sequence. If those conditions are satisfied then a solution exists close to the initial point and the sequence converges to that point.[1][2]

Assumptions

Let Xn be an open subset and F:Xnn a differentiable function with a Jacobian F(𝐱) that is locally Lipschitz continuous (for instance if F is twice differentiable). That is, it is assumed that for any xX there is an open subset UX such that xU and there exists a constant L>0 such that for any 𝐱,𝐲U

F(𝐱)F(𝐲)L𝐱𝐲

holds. The norm on the left is the operator norm. In other words, for any vector 𝐯n the inequality

F(𝐱)(𝐯)F(𝐲)(𝐯)L𝐱𝐲𝐯

must hold.

Now choose any initial point 𝐱0X. Assume that F(𝐱0) is invertible and construct the Newton step 𝐡0=F(𝐱0)1F(𝐱0).

The next assumption is that not only the next point 𝐱1=𝐱0+𝐡0 but the entire ball B(𝐱1,𝐡0) is contained inside the set X. Let M be the Lipschitz constant for the Jacobian over this ball (assuming it exists).

As a last preparation, construct recursively, as long as it is possible, the sequences (𝐱k)k, (𝐡k)k, (αk)k according to

𝐡k=F(𝐱k)1F(𝐱k)αk=MF(𝐱k)1𝐡k𝐱k+1=𝐱k+𝐡k.

Statement

Now if α012 then

  1. a solution 𝐱* of F(𝐱*)=0 exists inside the closed ball B¯(𝐱1,𝐡0) and
  2. the Newton iteration starting in 𝐱0 converges to 𝐱* with at least linear order of convergence.

A statement that is more precise but slightly more difficult to prove uses the roots tt** of the quadratic polynomial

p(t)=(12LF(𝐱0)11)t2t+𝐡0,
t/**=2𝐡01±12α0

and their ratio

θ=t*t**=112α01+12α0.

Then

  1. a solution 𝐱* exists inside the closed ball B¯(𝐱1,θ𝐡0)B¯(𝐱0,t*)
  2. it is unique inside the bigger ball B(𝐱0,t*)
  3. and the convergence to the solution of F is dominated by the convergence of the Newton iteration of the quadratic polynomial p(t) towards its smallest root t,[4] if t0=0,tk+1=tkp(tk)p(tk), then
    𝐱k+p𝐱ktk+ptk.
  4. The quadratic convergence is obtained from the error estimate[5]
    𝐱n+1𝐱*θ2n𝐱n+1𝐱nθ2n2n𝐡0.

Corollary

In 1986, Yamamoto proved that the error evaluations of the Newton method such as Doring (1969), Ostrowski (1971, 1973),[6][7] Gragg-Tapia (1974), Potra-Ptak (1980),[8] Miel (1981),[9] Potra (1984),[10] can be derived from the Kantorovich theorem.[11]

Generalizations

There is a q-analog for the Kantorovich theorem.[12][13] For other generalizations/variations, see Ortega & Rheinboldt (1970).[14]

Applications

Oishi and Tanabe claimed that the Kantorovich theorem can be applied to obtain reliable solutions of linear programming.[15]

References

Template:Reflist

Further reading