Proximal operator

Template:Short description In mathematical optimization, the proximal operator is an operator associated with a proper,^{[note 1]} lower semi-continuous convex function $f$ from a Hilbert space $𝒳$ to $[- \infty, + \infty]$ , and is defined by: ^[1]

{prox}_{f} (v) = \arg \min_{x \in 𝒳} (f (x) + \frac{1}{2} ‖ x - v ‖_{𝒳}^{2}) .

For any function in this class, the minimizer of the right-hand side above is unique, hence making the proximal operator well-defined. The proximal operator is used in proximal gradient methods, which is frequently used in optimization algorithms associated with non-differentiable optimization problems such as total variation denoising.

Properties

The $prox$ of a proper, lower semi-continuous convex function $f$ enjoys several useful properties for optimization.

Fixed points of ${prox}_{f}$ are minimizers of $f$ : ${x \in 𝒳 | {prox}_{f} x = x} = \arg \min f$ .
Global convergence to a minimizer is defined as follows: If $\arg \min f \neq \emptyset$ , then for any initial point $x_{0} \in 𝒳$ , the recursion $(\forall n \in ℕ) x_{n + 1} = {prox}_{f} x_{n}$ yields convergence $x_{n} \to x \in \arg \min f$ as $n \to + \infty$ . This convergence may be weak if $𝒳$ is infinite dimensional.^[2]
The proximal operator can be seen as a generalization of the projection operator. Indeed, in the specific case where $f$ is the 0- $\infty$ characteristic function $ι_{C}$ of a nonempty, closed, convex set $C$ we have that

\begin{matrix} {prox}_{ι_{C}} (x) & = \underset{y}{argmin} {\begin{matrix} \frac{1}{2} {‖ x - y ‖}_{2}^{2} & if y \in C \\ + \infty & if y \notin C \end{matrix} \\ = \underset{y \in C}{argmin} \frac{1}{2} {‖ x - y ‖}_{2}^{2} \end{matrix}

showing that the proximity operator is indeed a generalisation of the projection operator.

A function is firmly non-expansive if $(\forall (x, y) \in 𝒳^{2}) ‖ {prox}_{f} x - {prox}_{f} y ‖^{2} \leq ⟨ x - y, {prox}_{f} x - {prox}_{f} y ⟩$ .
The proximal operator of a function is related to the gradient of the Moreau envelope $M_{λ f}$ of a function $λ f$ by the following identity: $\nabla M_{λ f} (x) = \frac{1}{λ} (x - {p r o x}_{λ f} (x))$ .

The proximity operator of $f$ is characterized by inclusion $p = {prox}_{f} (x) \Leftrightarrow x - p \in \partial f (p)$ , where $\partial f$ is the subdifferential of $f$ , given by

\partial f (x) = {u \in ℝ^{N} ∣ \forall y \in ℝ^{N}, (y - x)^{T} u + f (x) \leq f (y)}

In particular, If

f

is differentiable then the above equation reduces to

p = {prox}_{f} (x) \Leftrightarrow x - p = \nabla f (p)

.

Notes

Template:Reflist

References

Template:Reflist

External links

The Proximity Operator repository: a collection of proximity operators implemented in Matlab and Python.
ProximalOperators.jl: a Julia package implementing proximal operators.
ODL: a Python library for inverse problems that utilizes proximal operators.

Cite error: <ref> tags exist for a group named "note", but no corresponding <references group="note"/> tag was found

[2] Template:Cite journal

[3] Template:Cite book

[note 1]

[1]

[2]

Proximal operator

Contents

Properties

Notes

References

See also

External links

Navigation menu

Proximal operator

Properties

Notes

References

See also

External links

Navigation menu

Search