Proximal operator

From testwiki
Jump to navigation Jump to search

Template:Short description In mathematical optimization, the proximal operator is an operator associated with a proper,[note 1] lower semi-continuous convex function f from a Hilbert space 𝒳 to [βˆ’βˆž,+∞], and is defined by: [1]

proxf(v)=argminxβˆˆπ’³(f(x)+12β€–xβˆ’v‖𝒳2).

For any function in this class, the minimizer of the right-hand side above is unique, hence making the proximal operator well-defined. The proximal operator is used in proximal gradient methods, which is frequently used in optimization algorithms associated with non-differentiable optimization problems such as total variation denoising.

Properties

The prox of a proper, lower semi-continuous convex function f enjoys several useful properties for optimization.

  • Fixed points of proxf are minimizers of f: {xβˆˆπ’³ | proxfx=x}=argminf.
  • Global convergence to a minimizer is defined as follows: If argminfβ‰ βˆ…, then for any initial point x0βˆˆπ’³, the recursion (βˆ€nβˆˆβ„•)xn+1=proxfxn yields convergence xnβ†’x∈argminf as nβ†’+∞. This convergence may be weak if 𝒳 is infinite dimensional.[2]
  • The proximal operator can be seen as a generalization of the projection operator. Indeed, in the specific case where f is the 0-∞ characteristic function ΞΉC of a nonempty, closed, convex set C we have that
proxΞΉC(x)=argminy{12β€–xβˆ’yβ€–22if y∈C+∞if yβˆ‰C=argminy∈C12β€–xβˆ’yβ€–22
showing that the proximity operator is indeed a generalisation of the projection operator.
  • A function is firmly non-expansive if (βˆ€(x,y)βˆˆπ’³2)β€–proxfxβˆ’proxfyβ€–2β‰€βŸ¨xβˆ’y ,proxfxβˆ’proxfy⟩.
  • The proximal operator of a function is related to the gradient of the Moreau envelope MΞ»f of a function Ξ»f by the following identity: βˆ‡MΞ»f(x)=1Ξ»(xβˆ’proxΞ»f(x)).
  • The proximity operator of f is characterized by inclusion p=proxf(x)⇔xβˆ’pβˆˆβˆ‚f(p), where βˆ‚f is the subdifferential of f, given by
βˆ‚f(x)={uβˆˆβ„Nβˆ£βˆ€yβˆˆβ„N,(yβˆ’x)Tu+f(x)≀f(y)} In particular, If f is differentiable then the above equation reduces to p=proxf(x)⇔xβˆ’p=βˆ‡f(p).

Notes

Template:Reflist

References

Template:Reflist


See also


Cite error: <ref> tags exist for a group named "note", but no corresponding <references group="note"/> tag was found