Inverse-gamma distribution

From testwiki
Revision as of 16:49, 11 October 2024 by imported>Bender235 (Probability density function)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description Template:Refimprove Template:Probability distribution

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the reciprocal of a variable distributed according to the gamma distribution.

Perhaps the chief use of the inverse gamma distribution is in Bayesian statistics, where the distribution arises as the marginal posterior distribution for the unknown variance of a normal distribution, if an uninformative prior is used, and as an analytically tractable conjugate prior, if an informative prior is required.[1] It is common among some Bayesians to consider an alternative parametrization of the normal distribution in terms of the precision, defined as the reciprocal of the variance, which allows the gamma distribution to be used directly as a conjugate prior. Other Bayesians prefer to parametrize the inverse gamma distribution differently, as a scaled inverse chi-squared distribution.

Characterization

Probability density function

The inverse gamma distribution's probability density function is defined over the support x>0

f(x;α,β)=βαΓ(α)(1/x)α+1exp(β/x)

with shape parameter α and scale parameter β.[2] Here Γ() denotes the gamma function.

Unlike the gamma distribution, which contains a somewhat similar exponential term, β is a scale parameter as the density function satisfies:

f(x;α,β)=f(x/β;α,1)β

Cumulative distribution function

The cumulative distribution function is the regularized gamma function

F(x;α,β)=Γ(α,βx)Γ(α)=Q(α,βx)

where the numerator is the upper incomplete gamma function and the denominator is the gamma function. Many math packages allow direct computation of Q, the regularized gamma function.

Moments

Provided that α>n, the n-th moment of the inverse gamma distribution is given by[3]

E[Xn]=βnΓ(αn)Γ(α)=βn(α1)(αn).

Characteristic function

The inverse gamma distribution has characteristic function 2(iβt)α2Γ(α)Kα(4iβt) where Kα is the modified Bessel function of the 2nd kind.

Properties

For α>0 and β>0,

𝔼[ln(X)]=ln(β)ψ(α)

and

𝔼[X1]=αβ,

The information entropy is

H(X)=E[ln(p(X))]=E[αln(β)+ln(Γ(α))+(α+1)ln(X)+βX]=αln(β)+ln(Γ(α))+(α+1)ln(β)(α+1)ψ(α)+α=α+ln(βΓ(α))(α+1)ψ(α).

where ψ(α) is the digamma function.

The Kullback-Leibler divergence of Inverse-Gamma(αp, βp) from Inverse-Gamma(αq, βq) is the same as the KL-divergence of Gamma(αp, βp) from Gamma(αq, βq):

DKL(αp,βp;αq,βq)=𝔼[logρ(X)π(X)]=𝔼[logρ(1/Y)π(1/Y)]=𝔼[logρG(Y)πG(Y)],

where ρ,π are the pdfs of the Inverse-Gamma distributions and ρG,πG are the pdfs of the Gamma distributions, Y is Gamma(αp, βp) distributed.

DKL(αp,βp;αq,βq)=(αpαq)ψ(αp)logΓ(αp)+logΓ(αq)+αq(logβplogβq)+αpβqβpβp.

Derivation from Gamma distribution

Let XGamma(α,β), and recall that the pdf of the gamma distribution is

fX(x)=βαΓ(α)xα1eβx, x>0.

Note that β is the rate parameter from the perspective of the gamma distribution.

Define the transformation Y=g(X)=1X. Then, the pdf of Y is

fY(y)=fX(g1(y))|ddyg1(y)|=βαΓ(α)(1y)α1exp(βy)1y2=βαΓ(α)(1y)α+1exp(βy)=βαΓ(α)(y)α1exp(βy)

Note that β is the scale parameter from the perspective of the inverse gamma distribution. This can be straightforwardly demonstrated by seeing that β satisfies the conditions for being a scale parameter.

f(y/β;α,1)β=1β1Γ(α)(yβ)α1exp(1yβ)=βαΓ(α)(y)α1exp(βy)=f(y;α,β)

Occurrence

See also

References

Template:Reflist

Template:ProbDistributions