Quasilinearization

From testwiki
Revision as of 22:56, 19 October 2024 by imported>GhostInTheMachine (Changing short description from "technique in mathematics" to "Technique in mathematics")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description Template:More citations needed

Two curves on a gridded graph, each marked with crosses at the interpolation nodes. The top shallow curve u_1 has a minimum about 0.71 and the bottom deeper curve u_2 has a minimum about -4. Both curves touch y=1 at x=1 and at x=-1.
Two numerical solutions of the nonlinear example boundary value problem y=y2, y(1)=y(1)=1. Solved by a spectral Chebyshev method and quasilinearization. The top curve u1 used 21 interpolation nodes, and the bottom curve u2 used 34. Both used 3 iterations.

In mathematics, quasilinearization is a technique which replaces a nonlinear differential equation or operator equation (or system of such equations) with a sequence of linear problems, which are presumed to be easier, and whose solutions approximate the solution of the original nonlinear problem with increasing accuracy. It is a generalization of Newton's method; the word "quasilinearization" is commonly used when the differential equation is a boundary value problem.[1][2]

Abstract formulation

Quasilinearization replaces a given nonlinear operator Template:Math with a certain linear operator which, being simpler, can be used in an iterative fashion to approximately solve equations containing the original nonlinear operator. This is typically performed when trying to solve an equation such as Template:Math together with certain boundary conditions Template:Math for which the equation has a solution Template:Math. This solution is sometimes called the "reference solution". For quasilinearization to work, the reference solution needs to exist uniquely (at least locally). The process starts with an initial approximation Template:Math that satisfies the boundary conditions and is "sufficiently close" to the reference solution Template:Math in a sense to be defined more precisely later. The first step is to take the Fréchet derivative of the nonlinear operator Template:Math at that initial approximation, in order to find the linear operator Template:Math which best approximates Template:Math locally. The nonlinear equation may then be approximated as Template:Math, taking Template:Math. Setting this equation to zero and imposing zero boundary conditions and ignoring higher-order terms gives the linear equation Template:Math. The solution of this linear equation (with zero boundary conditions) might be called Template:Math. Computation of Template:Math for Template:Math... by solving these linear equations in sequence is analogous to Newton's iteration for a single equation, and requires recomputation of the Fréchet derivative at each Template:Math. The process can converge quadratically to the reference solution, under the right conditions. Just as with Newton's method for nonlinear algebraic equations, however, difficulties may arise: for instance, the original nonlinear equation may have no solution, or more than one solution, or a multiple solution, in which cases the iteration may converge only very slowly, may not converge at all, or may converge instead to the wrong solution.

The practical test of the meaning of the phrase "sufficiently close" earlier is precisely that the iteration converges to the correct solution. Just as in the case of Newton iteration, there are theorems stating conditions under which one can know ahead of time when the initial approximation is "sufficiently close".

Contrast with discretizing first

One could instead discretize the original nonlinear operator and generate a (typically large) set of nonlinear algebraic equations for the unknowns, and then use Newton's method proper on this system of equations. Generally speaking, the convergence behavior is similar: a similarly good initial approximation will produce similarly good approximate discrete solutions. However, the quasilinearization approach (linearizing the operator equation instead of the discretized equations) seems to be simpler to think about, and has allowed such techniques as adaptive spatial meshes to be used as the iteration proceeds.[3]

Example

As an example to illustrate the process of quasilinearization, we can approximately solve the two-point boundary value problem for the nonlinear node d2dx2y(x)=y2(x), where the boundary conditions are y(1)=1 and y(1)=1. The exact solution of the differential equation can be expressed using the Weierstrass elliptic function ℘, like so: y(x)=6(xα|0,β) where the vertical bar notation means that the invariants are g2=0 and g3=β. Finding the values of α and β so that the boundary conditions are satisfied requires solving two simultaneous nonlinear equations for the two unknowns α and β, namely 6(1α|0,β)=1 and 6(1α|0,β)=1. This can be done, in an environment where ℘ and its derivatives are available, for instance by Newton's method.Template:Efn

Applying the technique of quasilinearization instead, one finds by taking the Fréchet derivative at an unknown approximation yk(x) that the linear operator is L(ε)=d2dx2ε(x)2yk(x)ε(x). If the initial approximation is y0(x)=1 identically on the interval 1x1, then the first iteration (at least) can be solved exactly, but is already somewhat complicated. A numerical solution instead, for instance by a Chebyshev spectral method using n=21 Chebyshev—Lobatto points xk=cos(π(n1k)/(n1)) for k=0,1,,n1 gives a solution with residual less than 5109 after three iterations; that is, y3(x) is the exact solution to d2dx2y(x)y2(x)=5109v(x), where the maximum value of |v(x)| is less than 1 on the interval 1x1. This approximate solution (call it u1) agrees with the exact solution 6(xα|0,β) with {α3.524459420,β0.006691372637}.

Other values of α and β give other continuous solutions to this nonlinear two-point boundary-value problem for ODE, such as {α2.55347391110,β1.24923895273}. The solution corresponding to these values plotted in the figure is called u2. Yet other values of the parameters can give discontinuous solutions because ℘ has a double pole at zero and so y(x) has a double pole at x=α. Finding other continuous solutions by quasilinearization requires different initial approximations to the ones used here. The initial approximation y0=5x24 approximates the exact solution u2 and can be used to generate a sequence of approximations converging to u2. Both approximations are plotted in the accompanying figure.

Notes

Template:Notelist

See also

References

Template:Reflist

Further reading

Template:Undercategorized