Negative multinomial distribution

From testwiki
Jump to navigation Jump to search

Template:Probability distribution

In probability theory and statistics, the negative multinomial distribution is a generalization of the negative binomial distribution (NB(x0, p)) to more than two outcomes.[1]

As with the univariate negative binomial distribution, if the parameter x0 is a positive integer, the negative multinomial distribution has an urn model interpretation. Suppose we have an experiment that generates m+1โ‰ฅ2 possible outcomes, {X0,...,Xm}, each occurring with non-negative probabilities {p0,...,pm} respectively. If sampling proceeded until n observations were made, then {X0,...,Xm} would have been multinomially distributed. However, if the experiment is stopped once X0 reaches the predetermined value x0 (assuming x0 is a positive integer), then the distribution of the m-tuple {X1,...,Xm} is negative multinomial. These variables are not multinomially distributed because their sum X1+...+Xm is not fixed, being a draw from a negative binomial distribution.

Properties

Marginal distributions

If m-dimensional x is partitioned as follows ๐—=[๐—(1)๐—(2)] with sizes [nร—1(mโˆ’n)ร—1] and accordingly ๐’‘ ๐’‘=[๐’‘(1)๐’‘(2)] with sizes [nร—1(mโˆ’n)ร—1] and let q=1โˆ’โˆ‘ipi(2)=p0+โˆ‘ipi(1)

The marginal distribution of ๐‘ฟ(1) is NM(x0,p0/q,๐’‘(1)/q). That is the marginal distribution is also negative multinomial with the ๐’‘(2) removed and the remaining p's properly scaled so as to add to one.

The univariate marginal m=1 is said to have a negative binomial distribution.

Conditional distributions

The conditional distribution of ๐—(1) given ๐—(2)=๐ฑ(2) is NM(x0+โˆ‘xi(2),๐ฉ(1)). That is, Pr(๐ฑ(1)โˆฃ๐ฑ(2),x0,๐ฉ)=ฮ“(โˆ‘i=0mxi)(1โˆ’โˆ‘i=1npi(1))x0+โˆ‘i=1mโˆ’nxi(2)ฮ“(x0+โˆ‘i=1mโˆ’nxi(2))โˆi=1n(pi(1))xi(xi(1))!.

Independent sums

If ๐—1โˆผNM(r1,๐ฉ) and If ๐—2โˆผNM(r2,๐ฉ) are independent, then ๐—1+๐—2โˆผNM(r1+r2,๐ฉ). Similarly and conversely, it is easy to see from the characteristic function that the negative multinomial is infinitely divisible.

Aggregation

If ๐—=(X1,โ€ฆ,Xm)โˆผNM(x0,(p1,โ€ฆ,pm)) then, if the random variables with subscripts i and j are dropped from the vector and replaced by their sum, ๐—=(X1,โ€ฆ,Xi+Xj,โ€ฆ,Xm)โˆผNM(x0,(p1,โ€ฆ,pi+pj,โ€ฆ,pm)).

This aggregation property may be used to derive the marginal distribution of Xi mentioned above.

Correlation matrix

The entries of the correlation matrix are ฯ(Xi,Xi)=1. ฯ(Xi,Xj)=cov(Xi,Xj)var(Xi)var(Xj)=pipj(p0+pi)(p0+pj).

Parameter estimation

Method of Moments

If we let the mean vector of the negative multinomial be ๐=x0p0๐ฉ and covariance matrix ๐œฎ=x0p02๐ฉ๐ฉ+x0p0diag(๐ฉ), then it is easy to show through properties of determinants that |๐œฎ|=1p0โˆi=1mฮผi. From this, it can be shown that x0=โˆ‘ฮผiโˆฮผi|๐œฎ|โˆ’โˆฮผi and ๐ฉ=|๐œฎ|โˆ’โˆฮผi|๐œฎ|โˆ‘ฮผi๐.

Substituting sample moments yields the method of moments estimates x^0=(โˆ‘i=1mxiยฏ)โˆi=1mxiยฏ|๐’|โˆ’โˆi=1mxiยฏ and ๐ฉ^=(|๐‘บ|โˆ’โˆi=1mxยฏi|๐‘บ|โˆ‘i=1mxยฏi)๐’™ยฏ

References

  1. โ†‘ Le Gall, F. The modes of a negative multinomial distribution, Statistics & Probability Letters, Volume 76, Issue 6, 15 March 2006, Pages 619-624, ISSN 0167-7152, 10.1016/j.spl.2005.09.009.

Waller LA and Zelterman D. (1997). Log-linear modeling with the negative multi- nomial distribution. Biometrics 53: 971โ€“82.

Further reading

Template:Cite book

Template:ProbDistributions Template:Use dmy dates