Negative multinomial distribution

From testwiki
Revision as of 08:57, 5 February 2023 by imported>Hellacioussatyr
(diff) ← Older revision | Latest revision (diff) | Newer revision β†’ (diff)
Jump to navigation Jump to search

Template:Probability distribution

In probability theory and statistics, the negative multinomial distribution is a generalization of the negative binomial distribution (NB(x0, p)) to more than two outcomes.[1]

As with the univariate negative binomial distribution, if the parameter x0 is a positive integer, the negative multinomial distribution has an urn model interpretation. Suppose we have an experiment that generates m+1β‰₯2 possible outcomes, {X0,...,Xm}, each occurring with non-negative probabilities {p0,...,pm} respectively. If sampling proceeded until n observations were made, then {X0,...,Xm} would have been multinomially distributed. However, if the experiment is stopped once X0 reaches the predetermined value x0 (assuming x0 is a positive integer), then the distribution of the m-tuple {X1,...,Xm} is negative multinomial. These variables are not multinomially distributed because their sum X1+...+Xm is not fixed, being a draw from a negative binomial distribution.

Properties

Marginal distributions

If m-dimensional x is partitioned as follows 𝐗=[𝐗(1)𝐗(2)] with sizes [n×1(mn)×1] and accordingly 𝒑 𝒑=[𝒑(1)𝒑(2)] with sizes [n×1(mn)×1] and let q=1ipi(2)=p0+ipi(1)

The marginal distribution of 𝑿(1) is NM(x0,p0/q,𝒑(1)/q). That is the marginal distribution is also negative multinomial with the 𝒑(2) removed and the remaining p's properly scaled so as to add to one.

The univariate marginal m=1 is said to have a negative binomial distribution.

Conditional distributions

The conditional distribution of 𝐗(1) given 𝐗(2)=𝐱(2) is NM(x0+xi(2),𝐩(1)). That is, Pr(𝐱(1)𝐱(2),x0,𝐩)=Γ(i=0mxi)(1i=1npi(1))x0+i=1mnxi(2)Γ(x0+i=1mnxi(2))i=1n(pi(1))xi(xi(1))!.

Independent sums

If 𝐗1NM(r1,𝐩) and If 𝐗2NM(r2,𝐩) are independent, then 𝐗1+𝐗2NM(r1+r2,𝐩). Similarly and conversely, it is easy to see from the characteristic function that the negative multinomial is infinitely divisible.

Aggregation

If 𝐗=(X1,,Xm)NM(x0,(p1,,pm)) then, if the random variables with subscripts i and j are dropped from the vector and replaced by their sum, 𝐗=(X1,,Xi+Xj,,Xm)NM(x0,(p1,,pi+pj,,pm)).

This aggregation property may be used to derive the marginal distribution of Xi mentioned above.

Correlation matrix

The entries of the correlation matrix are ρ(Xi,Xi)=1. ρ(Xi,Xj)=cov(Xi,Xj)var(Xi)var(Xj)=pipj(p0+pi)(p0+pj).

Parameter estimation

Method of Moments

If we let the mean vector of the negative multinomial be μ=x0p0𝐩 and covariance matrix Σ=x0p02𝐩𝐩+x0p0diag(𝐩), then it is easy to show through properties of determinants that |Σ|=1p0i=1mμi. From this, it can be shown that x0=μiμi|Σ|μi and 𝐩=|Σ|μi|Σ|μiμ.

Substituting sample moments yields the method of moments estimates x^0=(i=1mxiΒ―)i=1mxiΒ―|𝐒|i=1mxiΒ― and 𝐩^=(|𝑺|i=1mxΒ―i|𝑺|i=1mxΒ―i)𝒙¯

References

  1. ↑ Le Gall, F. The modes of a negative multinomial distribution, Statistics & Probability Letters, Volume 76, Issue 6, 15 March 2006, Pages 619-624, ISSN 0167-7152, 10.1016/j.spl.2005.09.009.

Waller LA and Zelterman D. (1997). Log-linear modeling with the negative multi- nomial distribution. Biometrics 53: 971–82.

Further reading

Template:Cite book

Template:ProbDistributions Template:Use dmy dates