Stein's unbiased risk estimate


In statistics, Stein's unbiased risk estimate (SURE) is an unbiased estimator of the mean-squared error of "a nearly arbitrary, nonlinear biased estimator."[1] In other words, it provides an indication of the accuracy of a given estimator. This is important since the true mean-squared error of an estimator is a function of the unknown parameter to be estimated, and thus cannot be determined exactly.

The technique is named after its discoverer, Charles Stein.[2]

Formal statement

Let $\mu \in \mathbb{R}^d$ be an unknown parameter and let $x \in \mathbb{R}^d$ be a measurement vector whose components are independent and normally distributed with mean $\mu_i$, $i=1,\dots,d$, and variance $\sigma^2$. Suppose $h(x)$ is an estimator of $\mu$ from $x$ that can be written $h(x) = x + g(x)$, where $g$ is weakly differentiable. Then Stein's unbiased risk estimate is given by[3]

$$\operatorname{SURE}(h) = d\sigma^2 + \|g(x)\|^2 + 2\sigma^2 \sum_{i=1}^d \frac{\partial}{\partial x_i} g_i(x) = -d\sigma^2 + \|g(x)\|^2 + 2\sigma^2 \sum_{i=1}^d \frac{\partial}{\partial x_i} h_i(x),$$

where $g_i(x)$ is the $i$th component of the function $g(x)$, and $\|\cdot\|$ is the Euclidean norm.
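As a concrete illustration (the estimator family and function name below are chosen for this sketch, not taken from the source), consider the linear shrinkage estimator $h(x) = ax$. Then $g(x) = (a-1)x$, every partial derivative $\partial g_i/\partial x_i$ equals $a-1$, and SURE has a simple closed form:

```python
import numpy as np

def sure_linear_shrinkage(x, a, sigma2):
    """SURE for the linear shrinkage estimator h(x) = a*x.

    Here g(x) = (a - 1)*x, so each partial derivative dg_i/dx_i is a - 1
    and the general formula
        d*sigma2 + ||g(x)||^2 + 2*sigma2 * sum_i dg_i/dx_i
    reduces to the expression below.
    """
    d = x.size
    return d * sigma2 + (a - 1) ** 2 * np.sum(x ** 2) + 2 * sigma2 * d * (a - 1)
```

With $a = 1$ (the identity estimator) the last two terms vanish and SURE is exactly $d\sigma^2$, the risk of using $x$ itself.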

The importance of SURE is that it is an unbiased estimate of the mean-squared error (or squared error risk) of h(x), i.e.

$$E_\mu\{\operatorname{SURE}(h)\} = \operatorname{MSE}(h),$$

with

$$\operatorname{MSE}(h) = E_\mu \|h(x) - \mu\|^2.$$

Thus, minimizing SURE can act as a surrogate for minimizing the MSE. Note that there is no dependence on the unknown parameter μ in the expression for SURE above. Thus, it can be manipulated (e.g., to determine optimal estimation settings) without knowledge of μ.
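For instance, within the shrinkage family $h(x) = ax$ the SURE expression is a quadratic in $a$ and can be minimized in closed form without knowing $\mu$. A minimal sketch (the setup and function name are illustrative assumptions):

```python
import numpy as np

def sure_optimal_shrinkage(x, sigma2):
    """Shrinkage factor a minimizing SURE over the family h(x) = a*x.

    SURE(a) = d*sigma2 + (a - 1)^2 * ||x||^2 + 2*sigma2*d*(a - 1) is
    quadratic in a; setting its derivative to zero gives
        a = 1 - d*sigma2 / ||x||^2.
    """
    d = x.size
    return 1.0 - d * sigma2 / np.sum(x ** 2)
```

Note the resemblance to James–Stein-type shrinkage, which uses $(d-2)\sigma^2/\|x\|^2$ in place of $d\sigma^2/\|x\|^2$.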

Proof

We wish to show that

$$E_\mu \|h(x) - \mu\|^2 = E_\mu\{\operatorname{SURE}(h)\}.$$

We start by expanding the MSE as

$$\begin{aligned}
E_\mu \|h(x) - \mu\|^2 &= E_\mu \|g(x) + x - \mu\|^2 \\
&= E_\mu \|g(x)\|^2 + E_\mu \|x - \mu\|^2 + 2 E_\mu\big[g(x)^T (x - \mu)\big] \\
&= E_\mu \|g(x)\|^2 + d\sigma^2 + 2 E_\mu\big[g(x)^T (x - \mu)\big].
\end{aligned}$$

Now we use integration by parts to rewrite the last term:

$$\begin{aligned}
E_\mu\big[g(x)^T (x - \mu)\big] &= \int_{\mathbb{R}^d} \frac{1}{(2\pi\sigma^2)^{d/2}} \exp\!\left(-\frac{\|x - \mu\|^2}{2\sigma^2}\right) \sum_{i=1}^d g_i(x)(x_i - \mu_i)\, d^d x \\
&= \sigma^2 \sum_{i=1}^d \int_{\mathbb{R}^d} \frac{1}{(2\pi\sigma^2)^{d/2}} \exp\!\left(-\frac{\|x - \mu\|^2}{2\sigma^2}\right) \frac{\partial g_i}{\partial x_i}\, d^d x \\
&= \sigma^2 \sum_{i=1}^d E_\mu\!\left[\frac{\partial g_i}{\partial x_i}\right].
\end{aligned}$$

Substituting this into the expression for the MSE, we arrive at

$$E_\mu \|h(x) - \mu\|^2 = E_\mu\!\left(d\sigma^2 + \|g(x)\|^2 + 2\sigma^2 \sum_{i=1}^d \frac{\partial g_i}{\partial x_i}\right).$$
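The unbiasedness identity can also be checked numerically. The following sketch (all parameter values are illustrative) estimates both sides by Monte Carlo for a linear shrinkage estimator $h(x) = ax$; note that the SURE average never uses $\mu$:

```python
import numpy as np

# Monte Carlo check that E[SURE(h)] equals MSE(h) for the linear
# shrinkage estimator h(x) = a*x, i.e. g(x) = (a - 1)*x with
# dg_i/dx_i = a - 1 for every component.
rng = np.random.default_rng(0)
mu = np.array([3.0, -1.0, 0.5, 2.0])  # illustrative "unknown" mean
sigma2 = 1.0
a = 0.8
d = mu.size

n_trials = 200_000
x = mu + np.sqrt(sigma2) * rng.standard_normal((n_trials, d))

# Left side: average squared error, which requires knowing mu.
mse = np.mean(np.sum((a * x - mu) ** 2, axis=1))
# Right side: average SURE, computed from the data alone.
sure = np.mean(d * sigma2
               + (a - 1) ** 2 * np.sum(x ** 2, axis=1)
               + 2 * sigma2 * d * (a - 1))
# The two averages agree up to Monte Carlo error.
```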

Applications

A standard application of SURE is to choose a parametric form for an estimator, and then optimize the values of the parameters to minimize the risk estimate. This technique has been applied in several settings. For example, a variant of the James–Stein estimator can be derived by finding the optimal shrinkage estimator.[2] The technique has also been used by Donoho and Johnstone to determine the optimal shrinkage factor in a wavelet denoising setting.[1]
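A minimal sketch of this idea for soft thresholding (assuming the noise variance $\sigma^2$ is known; the function names are illustrative): for the soft-threshold estimator $h_i(x) = \operatorname{sign}(x_i)\max(|x_i| - t, 0)$, the general SURE formula reduces to a closed form, and the SURE-minimizing threshold can be found by searching over the observed magnitudes $|x_i|$.

```python
import numpy as np

def sure_soft_threshold(x, t, sigma2):
    """SURE for soft thresholding at threshold t.

    Here g_i(x) = -x_i when |x_i| <= t (derivative -1) and
    g_i(x) = -t*sign(x_i) otherwise (derivative 0), so the general
    formula reduces to
        d*sigma2 + sum_i min(x_i^2, t^2) - 2*sigma2 * #{i : |x_i| <= t}.
    """
    d = x.size
    return (d * sigma2
            + np.sum(np.minimum(x ** 2, t ** 2))
            - 2 * sigma2 * np.count_nonzero(np.abs(x) <= t))

def sure_threshold(x, sigma2):
    """Threshold minimizing SURE; the minimum occurs at 0 or some |x_i|."""
    candidates = np.concatenate(([0.0], np.abs(x)))
    return min(candidates, key=lambda t: sure_soft_threshold(x, t, sigma2))
```

In a wavelet denoising pipeline, `sure_threshold` would be applied to the noisy detail coefficients at each scale and the resulting thresholds used to soft-threshold them.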

References

Template:Reflist