Natural exponential family

From testwiki
Jump to navigation Jump to search

In probability and statistics, a natural exponential family (NEF) is a class of probability distributions that is a special case of an exponential family (EF).

Definition

Univariate case

The natural exponential families (NEF) are a subset of the exponential families. A NEF is an exponential family in which the natural parameter η and the natural statistic T(x) are both the identity. A distribution in an exponential family with parameter θ can be written with probability density function (PDF) fX(xθ)=h(x) exp( η(θ)T(x)A(θ) ), where h(x) and A(θ) are known functions. A distribution in a natural exponential family with parameter θ can thus be written with PDF fX(xθ)=h(x) exp( θxA(θ) ). [Note that slightly different notation is used by the originator of the NEF, Carl Morris.[1] Morris uses ω instead of η and ψ instead of A.]

General multivariate case

Suppose that 𝐱𝒳p, then a natural exponential family of order p has density or mass function of the form: fX(𝐱θ)=h(𝐱) exp(θT𝐱A(θ) ), where in this case the parameter θp.

Moment and cumulant generating functions

A member of a natural exponential family has moment generating function (MGF) of the form MX(𝐭)=exp( A(θ+𝐭)A(θ) ).

The cumulant generating function is by definition the logarithm of the MGF, so it is KX(𝐭)=A(θ+𝐭)A(θ).

Kullback-Leibler divergence

The Kullback–Leibler divergence of two natural exponential families with parameters θ and λ is

DKL(θ||λ)=(θλ)A(θ)(A(θ)A(λ))=θλ(λt)A(t)dt

Examples

The five most important univariate cases are:

These five examples – Poisson, binomial, negative binomial, normal, and gamma – are a special subset of NEF, called NEF with quadratic variance function (NEF-QVF) because the variance can be written as a quadratic function of the mean. NEF-QVF are discussed below.

Distributions such as the exponential, Bernoulli, and geometric distributions are special cases of the above five distributions. For example, the Bernoulli distribution is a binomial distribution with n = 1 trial, the exponential distribution is a gamma distribution with shape parameter α = 1 (or k = 1 ), and the geometric distribution is a special case of the negative binomial distribution.

Some exponential family distributions are not NEF. The lognormal and Beta distribution are in the exponential family, but not the natural exponential family. The gamma distribution with two parameters is an exponential family but not a NEF and the chi-squared distribution is a special case of the gamma distribution with fixed scale parameter, and thus is also an exponential family but not a NEF (note that only a gamma distribution with fixed shape parameter is a NEF).

The inverse Gaussian distribution is a NEF with a cubic variance function.

The parameterization of most of the above distributions has been written differently from the parameterization commonly used in textbooks and the above linked pages. For example, the above parameterization differs from the parameterization in the linked article in the Poisson case. The two parameterizations are related by θ=log(λ), where λ is the mean parameter, and so that the density may be written as f(k;θ)=1k!exp( θ kexp(θ) ) , for θ, so h(k)=1k!, and A(θ)=exp(θ) .

This alternative parameterization can greatly simplify calculations in mathematical statistics. For example, in Bayesian inference, a posterior probability distribution is calculated as the product of two distributions. Normally this calculation requires writing out the probability distribution functions (PDF) and integrating; with the above parameterization, however, that calculation can be avoided. Instead, relationships between distributions can be abstracted due to the properties of the NEF described below.

An example of the multivariate case is the multinomial distribution with known number of trials.

Properties

The properties of the natural exponential family can be used to simplify calculations involving these distributions.

Univariate case

Template:Ordered list

Multivariate case

In the multivariate case, the mean vector and covariance matrix areTemplate:Citation needed E[X]=A(θ) and Cov[X]=TA(θ), where is the gradient and T is the Hessian matrix.

Natural exponential families with quadratic variance functions (NEF-QVF)

A special case of the natural exponential families are those with quadratic variance functions. Six NEFs have quadratic variance functions (QVF) in which the variance of the distribution can be written as a quadratic function of the mean. These are called NEF-QVF. The properties of these distributions were first described by Carl Morris. [2]

Var(X)=V(μ)=ν0+ν1μ+ν2μ2.

The six NEF-QVFs

The six NEF-QVF are written here in increasing complexity of the relationship between variance and mean.

  1. The normal distribution with fixed variance XN(μ,σ2) is NEF-QVF because the variance is constant. The variance can be written Var(X)=V(μ)=σ2, so variance is a degree 0 function of the mean.
  2. The Poisson distribution XPoisson(μ) is NEF-QVF because all Poisson distributions have variance equal to the mean Var(X)=V(μ)=μ, so variance is a linear function of the mean.
  3. The Gamma distribution XGamma(r,λ) is NEF-QVF because the mean of the Gamma distribution is μ=rλ and the variance of the Gamma distribution is Var(X)=V(μ)=μ2/r, so the variance is a quadratic function of the mean.
  4. The binomial distribution XBinomial(n,p) is NEF-QVF because the mean is μ=np and the variance is Var(X)=np(1p) which can be written in terms of the mean as
    V(X)=np2+np=μ2/n+μ.
  5. The negative binomial distribution XNegBin(n,p) is NEF-QVF because the mean is μ=np/(1p) and the variance is V(μ)=μ2/n+μ.
  6. The (not very famous) distribution generated by the generalizedTemplate:Clarify hyperbolic secant distribution (NEF-GHS) hasTemplate:Citation needed V(μ)=μ2/n+n and μ>0.

Properties of NEF-QVF

The properties of NEF-QVF can simplify calculations that use these distributions. Template:Ordered list Template:One source Template:Refimprove

See also

References

Template:Reflist

  • Morris C. (1982) Natural exponential families with quadratic variance functions: statistical theory. Dept of mathematics, Institute of Statistics, University of Texas, Austin.

Template:ProbDistributions

  1. Morris C. (2006) "Natural exponential families", Encyclopedia of Statistical Sciences.
  2. Template:Cite journal