Normal-inverse-Wishart distribution

In probability theory and statistics, the normal-inverse-Wishart distribution (or Gaussian-inverse-Wishart distribution) is a multivariate four-parameter family of continuous probability distributions. It is the conjugate prior of a multivariate normal distribution with unknown mean and covariance matrix (the inverse of the precision matrix).^[1]

Definition

Suppose

$𝝁 | 𝝁_{0}, λ, 𝜮 \sim 𝒩 (𝝁 | 𝝁_{0}, \frac{1}{λ} 𝜮)$

has a multivariate normal distribution with mean $𝝁_{0}$ and covariance matrix $\frac{1}{λ} 𝜮$, where

$𝜮 | 𝜳, ν \sim 𝒲^{- 1} (𝜮 | 𝜳, ν)$

has an inverse Wishart distribution. Then $(𝝁, 𝜮)$ has a normal-inverse-Wishart distribution, denoted as

$(𝝁, 𝜮) \sim \mathrm{NIW} (𝝁_{0}, λ, 𝜳, ν).$

Characterization

Probability density function

$f (𝝁, 𝜮 | 𝝁_{0}, λ, 𝜳, ν) = 𝒩 (𝝁 | 𝝁_{0}, \frac{1}{λ} 𝜮) \, 𝒲^{- 1} (𝜮 | 𝜳, ν)$

The full version of the PDF is as follows:^[2]

$f (𝝁, 𝜮 | 𝝁_{0}, λ, 𝜳, ν) = \frac{λ^{D / 2} | 𝜳 |^{ν / 2} | 𝜮 |^{- \frac{ν + D + 2}{2}}}{(2 π)^{D / 2} 2^{\frac{ν D}{2}} Γ_{D} (\frac{ν}{2})} \exp \left\{ - \frac{1}{2} \operatorname{Tr} (𝜳 𝜮^{- 1}) - \frac{λ}{2} (𝝁 - 𝝁_{0})^{T} 𝜮^{- 1} (𝝁 - 𝝁_{0}) \right\}$

Here $Γ_{D} (\cdot)$ is the multivariate gamma function, $\operatorname{Tr} (\cdot)$ is the trace of the given matrix, and $D$ is the dimension of $𝝁$.
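As an illustration, the closed-form density can be checked numerically against the factored form. The following is a minimal sketch, assuming NumPy and SciPy are available; the function names `niw_logpdf` and `niw_logpdf_factored` are hypothetical and chosen only for this example. It works on the log scale to avoid overflow in the determinant and gamma terms.

```python
import numpy as np
from scipy.special import multigammaln
from scipy.stats import invwishart, multivariate_normal


def niw_logpdf(mu, Sigma, mu0, lam, Psi, nu):
    """Log-density of NIW(mu0, lam, Psi, nu) at (mu, Sigma), from the closed form."""
    mu, mu0 = np.asarray(mu, float), np.asarray(mu0, float)
    Sigma, Psi = np.asarray(Sigma, float), np.asarray(Psi, float)
    D = mu.shape[0]
    _, logdet_Sigma = np.linalg.slogdet(Sigma)
    _, logdet_Psi = np.linalg.slogdet(Psi)
    diff = mu - mu0
    # Solve linear systems instead of forming Sigma^{-1} explicitly.
    trace_term = np.trace(np.linalg.solve(Sigma, Psi))  # Tr(Sigma^{-1} Psi) = Tr(Psi Sigma^{-1})
    quad_term = diff @ np.linalg.solve(Sigma, diff)
    # Logarithm of the normalizing constant in front of the exponential.
    log_norm = (0.5 * D * np.log(lam) + 0.5 * nu * logdet_Psi
                - 0.5 * D * np.log(2 * np.pi) - 0.5 * nu * D * np.log(2)
                - multigammaln(0.5 * nu, D))
    return (log_norm - 0.5 * (nu + D + 2) * logdet_Sigma
            - 0.5 * trace_term - 0.5 * lam * quad_term)


def niw_logpdf_factored(mu, Sigma, mu0, lam, Psi, nu):
    """Same quantity as the sum of the normal and inverse-Wishart log-densities."""
    Sigma = np.asarray(Sigma, float)
    return (multivariate_normal.logpdf(mu, mean=mu0, cov=Sigma / lam)
            + invwishart.logpdf(Sigma, df=nu, scale=Psi))
```

For a positive-definite $𝜮$ and $ν > D - 1$, the two functions should agree up to floating-point error, which gives a quick sanity check on either form.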

Properties

Scaling

Marginal distributions

By construction, the marginal distribution over $𝜮$ is an inverse Wishart distribution, and the conditional distribution over $𝝁$ given $𝜮$ is a multivariate normal distribution. The marginal distribution over $𝝁$ is a multivariate t-distribution.
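The source does not state the parameters of this t-distribution. A commonly quoted form, given here as an assumption rather than taken from the text above, has $ν - D + 1$ degrees of freedom, location $𝝁_{0}$ and scale matrix $𝜳 / (λ (ν - D + 1))$. A minimal sketch using SciPy's `multivariate_t` (the helper name `niw_marginal_mu` is hypothetical):

```python
import numpy as np
from scipy.stats import multivariate_t


def niw_marginal_mu(mu0, lam, Psi, nu):
    """Marginal of mu under NIW(mu0, lam, Psi, nu) as a frozen multivariate t.

    Assumes the parameterization described in the lead-in:
    df = nu - D + 1 and scale matrix Psi / (lam * df).
    """
    mu0 = np.asarray(mu0, float)
    Psi = np.asarray(Psi, float)
    D = mu0.shape[0]
    df = nu - D + 1              # degrees of freedom; must be positive
    shape = Psi / (lam * df)     # scale (shape) matrix, not the covariance
    return multivariate_t(loc=mu0, shape=shape, df=df)
```

In one dimension this reduces to a Student t-distribution with $ν$ degrees of freedom, which can be compared against direct numerical integration over $𝜮$.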

Posterior distribution of the parameters

Suppose the sampling density is a multivariate normal distribution

$𝒚_{i} | 𝝁, 𝜮 \sim 𝒩_{p} (𝝁, 𝜮)$

where $𝒚$ is an $n \times p$ matrix and $𝒚_{i}$ (of length $p$) is row $i$ of the matrix. Here $p$ plays the role of the dimension written $D$ in the density above.

When the mean and covariance matrix of the sampling distribution are unknown, we can place a normal-inverse-Wishart prior on them jointly:

$(𝝁, 𝜮) \sim \mathrm{NIW} (𝝁_{0}, λ, 𝜳, ν).$

The resulting posterior distribution for the mean and covariance matrix is also normal-inverse-Wishart:

$(𝝁, 𝜮) | 𝒚 \sim \mathrm{NIW} (𝝁_{n}, λ_{n}, 𝜳_{n}, ν_{n}),$

where

$𝝁_{n} = \frac{λ 𝝁_{0} + n \bar{𝒚}}{λ + n}$

$λ_{n} = λ + n$

$ν_{n} = ν + n$

$𝜳_{n} = 𝜳 + 𝑺 + \frac{λ n}{λ + n} (\bar{𝒚} - 𝝁_{0}) (\bar{𝒚} - 𝝁_{0})^{T} \quad \text{with} \quad 𝑺 = \sum_{i = 1}^{n} (𝒚_{i} - \bar{𝒚}) (𝒚_{i} - \bar{𝒚})^{T},$

and $\bar{𝒚} = \frac{1}{n} \sum_{i = 1}^{n} 𝒚_{i}$ is the sample mean.
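The update rules translate directly into a few lines of array code. The following is a minimal sketch, assuming NumPy and a data matrix with one observation per row; the function name `niw_posterior` is hypothetical.

```python
import numpy as np


def niw_posterior(y, mu0, lam, Psi, nu):
    """Return (mu_n, lam_n, Psi_n, nu_n) for an n-by-p data matrix y."""
    y = np.asarray(y, float)
    mu0 = np.asarray(mu0, float)
    n = y.shape[0]
    ybar = y.mean(axis=0)                  # sample mean
    centered = y - ybar
    S = centered.T @ centered              # scatter matrix about the sample mean
    diff = ybar - mu0
    lam_n = lam + n
    nu_n = nu + n
    mu_n = (lam * mu0 + n * ybar) / lam_n  # precision-weighted average of prior mean and sample mean
    Psi_n = np.asarray(Psi, float) + S + (lam * n / lam_n) * np.outer(diff, diff)
    return mu_n, lam_n, Psi_n, nu_n
```

For example, with three observations `[[1, 2], [3, 4], [5, 9]]`, prior mean zero, $λ = 1$, $𝜳 = I$ and $ν = 4$, the sample mean is $(3, 5)$, so $𝝁_{n} = (2.25, 3.75)$, $λ_{n} = 4$ and $ν_{n} = 7$.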

To sample from the joint posterior of $(𝝁, 𝜮)$, first draw $𝜮 | 𝒚 \sim 𝒲^{- 1} (𝜳_{n}, ν_{n})$, then draw $𝝁 | 𝜮, 𝒚 \sim 𝒩_{p} (𝝁_{n}, 𝜮 / λ_{n})$. To draw from the posterior predictive distribution of a new observation, draw $\tilde{𝒚} | 𝝁, 𝜮, 𝒚 \sim 𝒩_{p} (𝝁, 𝜮)$ using the values of $𝝁$ and $𝜮$ already drawn.^[3]

Generating normal-inverse-Wishart random variates

Generation of random variates is straightforward:

Sample $𝜮$ from an inverse Wishart distribution with parameters $𝜳$ and $ν$
Sample $𝝁$ from a multivariate normal distribution with mean $𝝁_{0}$ and covariance matrix $\frac{1}{λ} 𝜮$
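The two steps can be sketched as follows, assuming NumPy and SciPy's `invwishart`; the function name `sample_niw` and its `rng` argument are hypothetical conveniences for this example. Passing posterior parameters $(𝝁_{n}, λ_{n}, 𝜳_{n}, ν_{n})$ instead of prior ones gives a draw from the joint posterior in the same way.

```python
import numpy as np
from scipy.stats import invwishart


def sample_niw(mu0, lam, Psi, nu, rng=None):
    """Draw one (mu, Sigma) pair from NIW(mu0, lam, Psi, nu)."""
    rng = np.random.default_rng(rng)  # accepts None, an integer seed, or a Generator
    # Step 1: covariance matrix from the inverse Wishart distribution.
    Sigma = np.atleast_2d(invwishart.rvs(df=nu, scale=Psi, random_state=rng))
    # Step 2: mean from a normal distribution whose covariance is Sigma / lam.
    mu = rng.multivariate_normal(np.asarray(mu0, float), Sigma / lam)
    return mu, Sigma


# Usage: one draw in three dimensions with a fixed seed for reproducibility.
mu, Sigma = sample_niw(np.zeros(3), lam=2.0, Psi=np.eye(3), nu=5, rng=0)
```

The order matters: $𝝁$ depends on the sampled $𝜮$, so the covariance must be drawn first.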

Related distributions

The normal-Wishart distribution is essentially the same distribution parameterized by precision rather than variance. If $(𝝁, 𝜮) \sim \mathrm{NIW} (𝝁_{0}, λ, 𝜳, ν)$ then $(𝝁, 𝜮^{- 1}) \sim \mathrm{NW} (𝝁_{0}, λ, 𝜳^{- 1}, ν)$.
The normal-inverse-gamma distribution is the one-dimensional equivalent.
The multivariate normal distribution and inverse Wishart distribution are the component distributions out of which this distribution is made.
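The relation to the normal-Wishart distribution rests on the fact that $𝜮^{- 1}$ is Wishart with scale $𝜳^{- 1}$ when $𝜮$ is inverse Wishart with scale $𝜳$. The following minimal sketch checks this at the level of densities, assuming SciPy's `wishart` and `invwishart`; the helper name is hypothetical, and the Jacobian factor $| 𝜮 |^{- (D + 1)}$ for matrix inversion on symmetric matrices is a standard result that is not stated in the text above.

```python
import numpy as np
from scipy.stats import invwishart, wishart


def invwishart_logpdf_via_wishart(Sigma, Psi, nu):
    """Inverse-Wishart log-density of Sigma, computed from the Wishart density of Sigma^{-1}."""
    Sigma = np.asarray(Sigma, float)
    Psi = np.asarray(Psi, float)
    D = Sigma.shape[0]
    Lambda = np.linalg.inv(Sigma)          # precision matrix
    Lambda = 0.5 * (Lambda + Lambda.T)     # remove round-off asymmetry
    Psi_inv = np.linalg.inv(Psi)
    Psi_inv = 0.5 * (Psi_inv + Psi_inv.T)
    _, logdet_Sigma = np.linalg.slogdet(Sigma)
    # Change of variables Sigma -> Sigma^{-1} contributes |Sigma|^{-(D + 1)}.
    return wishart.logpdf(Lambda, df=nu, scale=Psi_inv) - (D + 1) * logdet_Sigma
```

The result should match `invwishart.logpdf(Sigma, df=nu, scale=Psi)` up to floating-point error; the normal factor for $𝝁$ is unchanged by the reparameterization.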

Notes

[1] Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution."
[2] Prince, Simon J.D. (June 2012). Computer Vision: Models, Learning, and Inference. Cambridge University Press. 3.8: "Normal inverse Wishart distribution".
[3] Gelman, Andrew, et al. Bayesian data analysis. Vol. 2, p. 73. Boca Raton, FL, USA: Chapman & Hall/CRC, 2014.

References

Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media.
Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution."
