Scaled inverse chi-squared distribution

Template:Short description Template:Probability distribution The scaled inverse chi-squared distribution $ψ inv- χ^{2} (ν)$ , where $ψ$ is the scale parameter, equals the univariate inverse Wishart distribution $𝒲^{- 1} (ψ, ν)$ with degrees of freedom $ν$ .

This family of scaled inverse chi-squared distributions is linked to the inverse-chi-squared distribution and to the chi-squared distribution:

If $X \sim ψ inv- χ^{2} (ν)$ then $X / ψ \sim inv- χ^{2} (ν)$ as well as $ψ / X \sim χ^{2} (ν)$ and $1 / X \sim ψ^{- 1} χ^{2} (ν)$ .

Instead of $ψ$ , the scaled inverse chi-squared distribution is however most frequently parametrized by the scale parameter $τ^{2} = ψ / ν$ and the distribution $ν τ^{2} inv- χ^{2} (ν)$ is denoted by $Scale-inv- χ^{2} (ν, τ^{2})$ .

In terms of $τ^{2}$ the above relations can be written as follows:

If $X \sim Scale-inv- χ^{2} (ν, τ^{2})$ then $\frac{X}{ν τ^{2}} \sim inv- χ^{2} (ν)$ as well as $\frac{ν τ^{2}}{X} \sim χ^{2} (ν)$ and $1 / X \sim \frac{1}{ν τ^{2}} χ^{2} (ν)$ .

This family of scaled inverse chi-squared distributions is a reparametrization of the inverse-gamma distribution.

Specifically, if

X \sim ψ inv- χ^{2} (ν) = Scale-inv- χ^{2} (ν, τ^{2})

then

X \sim Inv-Gamma (\frac{ν}{2}, \frac{ψ}{2}) = Inv-Gamma (\frac{ν}{2}, \frac{ν τ^{2}}{2})

Either form may be used to represent the maximum entropy distribution for a fixed first inverse moment $(E (1 / X))$ and first logarithmic moment $(E (\ln (X))$ .

The scaled inverse chi-squared distribution also has a particular use in Bayesian statistics. Specifically, the scaled inverse chi-squared distribution can be used as a conjugate prior for the variance parameter of a normal distribution. The same prior in alternative parametrization is given by the inverse-gamma distribution.

Characterization

The probability density function of the scaled inverse chi-squared distribution extends over the domain $x > 0$ and is

f (x; ν, τ^{2}) = \frac{(τ^{2} ν / 2)^{ν / 2}}{Γ (ν / 2)} \frac{\exp [\frac{- ν τ^{2}}{2 x}]}{x^{1 + ν / 2}}

where $ν$ is the degrees of freedom parameter and $τ^{2}$ is the scale parameter. The cumulative distribution function is

F (x; ν, τ^{2}) = Γ (\frac{ν}{2}, \frac{τ^{2} ν}{2 x}) / Γ (\frac{ν}{2})

= Q (\frac{ν}{2}, \frac{τ^{2} ν}{2 x})

where $Γ (a, x)$ is the incomplete gamma function, $Γ (x)$ is the gamma function and $Q (a, x)$ is a regularized gamma function. The characteristic function is

φ (t; ν, τ^{2}) =

\frac{2}{Γ (\frac{ν}{2})} {(\frac{- i τ^{2} ν t}{2})}^{\frac{ν}{4}} K_{\frac{ν}{2}} (\sqrt{- 2 i τ^{2} ν t}),

where $K_{\frac{ν}{2}} (z)$ is the modified Bessel function of the second kind.

Parameter estimation

The maximum likelihood estimate of $τ^{2}$ is

τ^{2} = n / \sum_{i = 1}^{n} \frac{1}{x_{i}} .

The maximum likelihood estimate of $\frac{ν}{2}$ can be found using Newton's method on:

\ln (\frac{ν}{2}) - ψ (\frac{ν}{2}) = \frac{1}{n} \sum_{i = 1}^{n} \ln (x_{i}) - \ln (τ^{2}),

where $ψ (x)$ is the digamma function. An initial estimate can be found by taking the formula for mean and solving it for $ν .$ Let $\bar{x} = \frac{1}{n} \sum_{i = 1}^{n} x_{i}$ be the sample mean. Then an initial estimate for $ν$ is given by:

\frac{ν}{2} = \frac{\bar{x}}{\bar{x} - τ^{2}} .

Bayesian estimation of the variance of a normal distribution

The scaled inverse chi-squared distribution has a second important application, in the Bayesian estimation of the variance of a Normal distribution.

According to Bayes' theorem, the posterior probability distribution for quantities of interest is proportional to the product of a prior distribution for the quantities and a likelihood function:

p (σ^{2} | D, I) \propto p (σ^{2} | I) p (D | σ^{2})

where D represents the data and I represents any initial information about σ² that we may already have.

The simplest scenario arises if the mean μ is already known; or, alternatively, if it is the conditional distribution of σ² that is sought, for a particular assumed value of μ.

Then the likelihood term L(σ²|D) = p(D|σ²) has the familiar form

ℒ (σ^{2} | D, μ) = \frac{1}{{(\sqrt{2 π} σ)}^{n}} \exp [- \frac{\sum_{i}^{n} (x_{i} - μ)^{2}}{2 σ^{2}}]

Combining this with the rescaling-invariant prior p(σ²|I) = 1/σ², which can be argued (e.g. following Jeffreys) to be the least informative possible prior for σ² in this problem, gives a combined posterior probability

p (σ^{2} | D, I, μ) \propto \frac{1}{σ^{n + 2}} \exp [- \frac{\sum_{i}^{n} (x_{i} - μ)^{2}}{2 σ^{2}}]

This form can be recognised as that of a scaled inverse chi-squared distribution, with parameters ν = n and τ² = s² = (1/n) Σ (x_i-μ)²

Gelman and co-authors remark that the re-appearance of this distribution, previously seen in a sampling context, may seem remarkable; but given the choice of prior "this result is not surprising."^[1]

In particular, the choice of a rescaling-invariant prior for σ² has the result that the probability for the ratio of σ² / s² has the same form (independent of the conditioning variable) when conditioned on s² as when conditioned on σ²:

p (\frac{σ^{2}}{s^{2}} | s^{2}) = p (\frac{σ^{2}}{s^{2}} | σ^{2})

In the sampling-theory case, conditioned on σ², the probability distribution for (1/s²) is a scaled inverse chi-squared distribution; and so the probability distribution for σ² conditioned on s², given a scale-agnostic prior, is also a scaled inverse chi-squared distribution.

Use as an informative prior

If more is known about the possible values of σ², a distribution from the scaled inverse chi-squared family, such as Scale-inv-χ²(n₀, s₀²) can be a convenient form to represent a more informative prior for σ², as if from the result of n₀ previous observations (though n₀ need not necessarily be a whole number):

p (σ^{2} | I^{'}, μ) \propto \frac{1}{σ^{n_{0} + 2}} \exp [- \frac{n_{0} s_{0}^{2}}{2 σ^{2}}]

Such a prior would lead to the posterior distribution

p (σ^{2} | D, I^{'}, μ) \propto \frac{1}{σ^{n + n_{0} + 2}} \exp [- \frac{n s^{2} + n_{0} s_{0}^{2}}{2 σ^{2}}]

which is itself a scaled inverse chi-squared distribution. The scaled inverse chi-squared distributions are thus a convenient conjugate prior family for σ² estimation.

Estimation of variance when mean is unknown

If the mean is not known, the most uninformative prior that can be taken for it is arguably the translation-invariant prior p(μ|I) ∝ const., which gives the following joint posterior distribution for μ and σ²,

\begin{matrix} p (μ, σ^{2} ∣ D, I) & \propto \frac{1}{σ^{n + 2}} \exp [- \frac{\sum_{i}^{n} (x_{i} - μ)^{2}}{2 σ^{2}}] \\ = \frac{1}{σ^{n + 2}} \exp [- \frac{\sum_{i}^{n} (x_{i} - \bar{x})^{2}}{2 σ^{2}}] \exp [- \frac{n (μ - \bar{x})^{2}}{2 σ^{2}}] \end{matrix}

The marginal posterior distribution for σ² is obtained from the joint posterior distribution by integrating out over μ,

\begin{matrix} p (σ^{2} | D, I) \propto & \frac{1}{σ^{n + 2}} \exp [- \frac{\sum_{i}^{n} (x_{i} - \bar{x})^{2}}{2 σ^{2}}] \int_{- \infty}^{\infty} \exp [- \frac{n (μ - \bar{x})^{2}}{2 σ^{2}}] d μ \\ = & \frac{1}{σ^{n + 2}} \exp [- \frac{\sum_{i}^{n} (x_{i} - \bar{x})^{2}}{2 σ^{2}}] \sqrt{2 π σ^{2} / n} \\ \propto & (σ^{2})^{- (n + 1) / 2} \exp [- \frac{(n - 1) s^{2}}{2 σ^{2}}] \end{matrix}

This is again a scaled inverse chi-squared distribution, with parameters $n - 1$ and $s^{2} = \sum (x_{i} - \bar{x})^{2} / (n - 1)$ .

Related distributions

If $X \sim Scale-inv- χ^{2} (ν, τ^{2})$ then $k X \sim Scale-inv- χ^{2} (ν, k τ^{2})$
If $X \sim inv- χ^{2} (ν)$ (Inverse-chi-squared distribution) then $X \sim Scale-inv- χ^{2} (ν, 1 / ν)$
If $X \sim Scale-inv- χ^{2} (ν, τ^{2})$ then $\frac{X}{τ^{2} ν} \sim inv- χ^{2} (ν)$ (Inverse-chi-squared distribution)
If $X \sim Scale-inv- χ^{2} (ν, τ^{2})$ then $X \sim Inv-Gamma (\frac{ν}{2}, \frac{ν τ^{2}}{2})$ (Inverse-gamma distribution)
Scaled inverse chi square distribution is a special case of type 5 Pearson distribution

References

Template:Cite book

Template:Reflist

Template:ProbDistributions

↑ Template:Cite book

[1] Template:Cite book

[1]

Scaled inverse chi-squared distribution

Contents

Characterization

Parameter estimation

Bayesian estimation of the variance of a normal distribution

Use as an informative prior

Estimation of variance when mean is unknown

Related distributions

References

Navigation menu

Scaled inverse chi-squared distribution

Characterization

Parameter estimation

Bayesian estimation of the variance of a normal distribution

Use as an informative prior

Estimation of variance when mean is unknown

Related distributions

References

Navigation menu

Search