Broyden's method: Difference between revisions

Latest revision as of 05:10, 11 November 2024

Template:Short description In numerical analysis, Broyden's method is a quasi-Newton method for finding roots in Template:Math variables. It was originally described by C. G. Broyden in 1965.^[1]

Newton's method for solving Template:Math uses the Jacobian matrix, Template:Math, at every iteration. However, computing this Jacobian can be a difficult and expensive operation; for large problems such as those involving solving the Kohn–Sham equations in quantum mechanics the number of variables can be in the hundreds of thousands. The idea behind Broyden's method is to compute the whole Jacobian at most only at the first iteration, and to do rank-one updates at other iterations.

In 1979 Gay proved that when Broyden's method is applied to a linear system of size Template:Math, it terminates in Template:Math steps,^[2] although like all quasi-Newton methods, it may not converge for nonlinear systems.

Description of the method

Solving single-variable nonlinear equation

In the secant method, we replace the first derivative Template:Math at Template:Math with the finite-difference approximation:

f^{'} (x_{n}) ≃ \frac{f (x_{n}) - f (x_{n - 1})}{x_{n} - x_{n - 1}},

and proceed similar to Newton's method:

x_{n + 1} = x_{n} - \frac{f (x_{n})}{f^{'} (x_{n})}

where Template:Math is the iteration index.

Solving a system of nonlinear equations

Consider a system of Template:Math nonlinear equations in $k$ unknowns

𝐟 (𝐱) = 𝟎,

where Template:Math is a vector-valued function of vector Template:Math

𝐱 = (x_{1}, x_{2}, x_{3}, \dots, x_{k}),

𝐟 (𝐱) = (f_{1} (x_{1}, x_{2}, \dots, x_{k}), f_{2} (x_{1}, x_{2}, \dots, x_{k}), \dots, f_{k} (x_{1}, x_{2}, \dots, x_{k})) .

For such problems, Broyden gives a variation of the one-dimensional Newton's method, replacing the derivative with an approximate Jacobian Template:Math. The approximate Jacobian matrix is determined iteratively based on the secant equation, a finite-difference approximation:

𝐉_{n} (𝐱_{n} - 𝐱_{n - 1}) ≃ 𝐟 (𝐱_{n}) - 𝐟 (𝐱_{n - 1}),

where Template:Math is the iteration index. For clarity, define

𝐟_{n} = 𝐟 (𝐱_{n}),

Δ 𝐱_{n} = 𝐱_{n} - 𝐱_{n - 1},

Δ 𝐟_{n} = 𝐟_{n} - 𝐟_{n - 1},

so the above may be rewritten as

𝐉_{n} Δ 𝐱_{n} ≃ Δ 𝐟_{n} .

The above equation is underdetermined when Template:Math is greater than one. Broyden suggested using the most recent estimate of the Jacobian matrix, Template:Math, and then improving upon it by requiring that the new form is a solution to the most recent secant equation, and that there is minimal modification to Template:Math:

𝐉_{n} = 𝐉_{n - 1} + \frac{Δ 𝐟_{n} - 𝐉_{n - 1} Δ 𝐱_{n}}{‖ Δ 𝐱_{n} ‖^{2}} Δ 𝐱_{n}^{T} .

This minimizes the Frobenius norm

‖ 𝐉_{n} - 𝐉_{n - 1} ‖_{F} .

One then updates the variables using the approximate Jacobian, what is called a quasi-Newton approach.

𝐱_{n + 1} = 𝐱_{n} - α 𝐉_{n}^{- 1} 𝐟 (𝐱_{n}) .

If $α = 1$ this is the full Newton step; commonly a line search or trust region method is used to control $α$ . The initial Jacobian can be taken as a diagonal, unit matrix, although more common is to scale it based upon the first step.^[3] Broyden also suggested using the Sherman–Morrison formula^[4] to directly update the inverse of the approximate Jacobian matrix:

𝐉_{n}^{- 1} = 𝐉_{n - 1}^{- 1} + \frac{Δ 𝐱_{n} - 𝐉_{n - 1}^{- 1} Δ 𝐟_{n}}{Δ 𝐱_{n}^{T} 𝐉_{n - 1}^{- 1} Δ 𝐟_{n}} Δ 𝐱_{n}^{T} 𝐉_{n - 1}^{- 1} .

This first method is commonly known as the "good Broyden's method."

A similar technique can be derived by using a slightly different modification to Template:Math. This yields a second method, the so-called "bad Broyden's method":

𝐉_{n}^{- 1} = 𝐉_{n - 1}^{- 1} + \frac{Δ 𝐱_{n} - 𝐉_{n - 1}^{- 1} Δ 𝐟_{n}}{‖ Δ 𝐟_{n} ‖^{2}} Δ 𝐟_{n}^{T} .

This minimizes a different Frobenius norm

‖ 𝐉_{n}^{- 1} - 𝐉_{n - 1}^{- 1} ‖_{F} .

In his original paper Broyden could not get the bad method to work, but there are cases where it does^[5] for which several explanations have been proposed.^[6]^[7] Many other quasi-Newton schemes have been suggested in optimization such as the BFGS, where one seeks a maximum or minimum by finding zeros of the first derivatives (zeros of the gradient in multiple dimensions). The Jacobian of the gradient is called the Hessian and is symmetric, adding further constraints to its approximation.

The Broyden Class of Methods

In addition to the two methods described above, Broyden defined a wider class of related methods.^[1]Template:Rp In general, methods in the Broyden class are given in the form^[8]Template:Rp $𝐉_{k + 1} = 𝐉_{k} - \frac{𝐉_{k} s_{k} s_{k}^{T} 𝐉_{k}}{s_{k}^{T} 𝐉_{k} s_{k}} + \frac{y_{k} y_{k}^{T}}{y_{k}^{T} s_{k}} + ϕ_{k} (s_{k}^{T} 𝐉_{k} s_{k}) v_{k} v_{k}^{T},$ where $y_{k} : = 𝐟 (𝐱_{k + 1}) - 𝐟 (𝐱_{k}),$ $s_{k} : = 𝐱_{k + 1} - 𝐱_{k},$ and $v_{k} = [\frac{y_{k}}{y_{k}^{T} s_{k}} - \frac{𝐉_{k} s_{k}}{s_{k}^{T} 𝐉_{k} s_{k}}],$ and $ϕ_{k} \in ℝ$ for each $k = 1, 2, . . .$ . The choice of $ϕ_{k}$ determines the method.

Other methods in the Broyden class have been introduced by other authors.

The Davidon–Fletcher–Powell (DFP) method, which is the only member of this class being published before the two methods defined by Broyden.^[1]Template:Rp For the DFP method, $ϕ_{k} = 1$ .^[8]Template:Rp
Anderson's iterative method, which uses a least squares approach to the Jacobian.^[9]
Schubert's or sparse Broyden algorithm – a modification for sparse Jacobian matrices.^[10]
The Pulay approach, often used in density functional theory.^[11]^[12]
A limited memory method by Srivastava for the root finding problem which only uses a few recent iterations.^[13]
Klement (2014) – uses fewer iterations to solve some systems.^[14]^[15]
Multisecant methods for density functional theory problems^[7]^[16]

References

Template:Reflist

External links

Simple basic explanation: The story of the blind archer

Template:Root-finding algorithms

[Broyden_1965-1] 1.0 ^1.1 ^1.2 Template:Cite journal

[2] Template:Cite journal

[3] Template:Cite journal

[4] Template:Cite journal

[5] Template:Cite journal

[6] Template:Cite journal

[:0-7] 7.0 ^7.1 Template:Cite journal

[Nocedal_2006-8] 8.0 ^8.1 Template:Cite book

[9] Template:Cite journal

[10] Template:Cite journal

[11] Template:Cite journal

[12] Template:Cite journal

[13] Template:Cite journal

[14] Template:Cite journal

[15] Template:Cite web

[16] Template:Cite journal

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

Broyden's method: Difference between revisions

Latest revision as of 05:10, 11 November 2024

Contents

Description of the method

Solving single-variable nonlinear equation

Solving a system of nonlinear equations

The Broyden Class of Methods

See also

References

Further reading

External links

Navigation menu

Broyden's method: Difference between revisions

Latest revision as of 05:10, 11 November 2024

Description of the method

Solving single-variable nonlinear equation

Solving a system of nonlinear equations

The Broyden Class of Methods

See also

References

Further reading

External links

Navigation menu

Search