Negative multinomial distribution

In probability theory and statistics, the negative multinomial distribution is a generalization of the negative binomial distribution (NB(x₀, p)) to more than two outcomes.^[1]

As with the univariate negative binomial distribution, if the parameter $x_{0}$ is a positive integer, the negative multinomial distribution has an urn model interpretation. Suppose we have an experiment that generates m+1≥2 possible outcomes, {X₀,...,X_m}, each occurring with non-negative probabilities {p₀,...,p_m} respectively. If sampling proceeded until n observations were made, then {X₀,...,X_m} would have been multinomially distributed. However, if the experiment is stopped once X₀ reaches the predetermined value x₀ (assuming x₀ is a positive integer), then the distribution of the m-tuple {X₁,...,X_m} is negative multinomial. These variables are not multinomially distributed because their sum X₁+...+X_m is not fixed, being a draw from a negative binomial distribution.
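The stopping rule described above can be sketched as a small simulation. This is a minimal illustration, not a reference implementation: the function name `sample_negative_multinomial` and the parameter values are assumptions made for the example, and the comparison uses the mean $x_{0} p_{i} / p_{0}$ given under Parameter estimation.

```python
import random

def sample_negative_multinomial(x0, probs, rng=random):
    """Draw one (X_1, ..., X_m) by running trials until outcome 0 has occurred x0 times.

    probs = [p_0, p_1, ..., p_m] should sum to 1 with p_0 > 0; x0 is a positive integer.
    """
    counts = [0] * len(probs)
    outcomes = list(range(len(probs)))
    while counts[0] < x0:
        k = rng.choices(outcomes, weights=probs)[0]
        counts[k] += 1
    return counts[1:]  # X_0 is fixed at x0 by the stopping rule, so it is dropped

rng = random.Random(0)  # fixed seed so the run is repeatable
draws = [sample_negative_multinomial(3, [0.5, 0.3, 0.2], rng) for _ in range(20000)]
mean_x1 = sum(d[0] for d in draws) / len(draws)
mean_x2 = sum(d[1] for d in draws) / len(draws)
print(mean_x1, mean_x2)  # should be near x0 * p_i / p_0, i.e. about 1.8 and 1.2
```

Because the number of trials is random, the total X₁+...+X_m varies from draw to draw, which is what distinguishes this from multinomial sampling.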

Properties

Marginal distributions

Suppose the m-dimensional vector $𝐗$ is partitioned as $𝐗 = \begin{bmatrix} 𝐗^{(1)} \\ 𝐗^{(2)} \end{bmatrix}$ with sizes $\begin{bmatrix} n \times 1 \\ (m - n) \times 1 \end{bmatrix}$, and $𝒑$ is partitioned accordingly as $𝒑 = \begin{bmatrix} 𝒑^{(1)} \\ 𝒑^{(2)} \end{bmatrix}$ with sizes $\begin{bmatrix} n \times 1 \\ (m - n) \times 1 \end{bmatrix}$. Let $q = 1 - \sum_{i} p_{i}^{(2)} = p_{0} + \sum_{i} p_{i}^{(1)}$.

The marginal distribution of $𝑿^{(1)}$ is $NM (x_{0}, p_{0} / q, 𝒑^{(1)} / q)$. That is, the marginal distribution is also negative multinomial, with the components $𝒑^{(2)}$ removed and the remaining probabilities rescaled so that they sum to one.

A univariate marginal (the case $n = 1$ in the partition above) is a negative binomial distribution.
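The rescaling in the marginal rule can be illustrated with a few lines of code. This is a sketch only: the helper name `marginal_params` and the zero-based `keep` index list are assumptions introduced for the example.

```python
def marginal_params(x0, p0, p, keep):
    """Parameters of the marginal NM for the components listed in `keep`.

    p = [p_1, ..., p_m]; keep holds zero-based indices into p for the block X^(1).
    Returns (x0, p0 / q, [p_i / q for kept i]) with q = p0 + sum of the kept p_i.
    """
    q = p0 + sum(p[i] for i in keep)
    return x0, p0 / q, [p[i] / q for i in keep]

# Keep only X_1 from NM(3, p) with p_0 = 0.5, p = (0.3, 0.2): here q = 0.8.
x0_marg, p0_marg, p_marg = marginal_params(3, 0.5, [0.3, 0.2], keep=[0])
print(x0_marg, p0_marg, p_marg)
```

The returned probabilities, together with the rescaled $p_{0}$, sum to one, and $x_{0}$ is unchanged.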

Conditional distributions

The conditional distribution of $𝐗^{(1)}$ given $𝐗^{(2)} = 𝐱^{(2)}$ is $NM (x_{0} + \sum x_{i}^{(2)}, 𝐩^{(1)})$. That is, $\Pr (𝐱^{(1)} ∣ 𝐱^{(2)}, x_{0}, 𝐩) = Γ (\sum_{i = 0}^{m} x_{i}) \frac{(1 - \sum_{i = 1}^{n} p_{i}^{(1)})^{x_{0} + \sum_{i = 1}^{m - n} x_{i}^{(2)}}}{Γ (x_{0} + \sum_{i = 1}^{m - n} x_{i}^{(2)})} \prod_{i = 1}^{n} \frac{(p_{i}^{(1)})^{x_{i}^{(1)}}}{(x_{i}^{(1)})!} ,$ where $\sum_{i = 0}^{m} x_{i} = x_{0} + \sum_{i = 1}^{n} x_{i}^{(1)} + \sum_{i = 1}^{m - n} x_{i}^{(2)}$.
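The conditional rule can be checked numerically against the ratio of joint to marginal probabilities. The sketch below assumes the standard negative multinomial probability mass function, $\frac{Γ (x_{0} + \sum x_{i})}{Γ (x_{0}) \prod x_{i} !} p_{0}^{x_{0}} \prod p_{i}^{x_{i}}$, which this article does not state explicitly; the function name `nm_logpmf` and the test values are illustrative.

```python
from math import lgamma, log, exp

def nm_logpmf(x, x0, p):
    """Log-pmf of NM(x0, p) at x, where p = [p_1..p_m] and p_0 = 1 - sum(p) is implied."""
    p0 = 1.0 - sum(p)
    out = lgamma(x0 + sum(x)) - lgamma(x0) + x0 * log(p0)
    for xi, pi in zip(x, p):
        out += xi * log(pi) - lgamma(xi + 1)
    return out

x0, p = 2.5, [0.2, 0.1, 0.3]    # implied p_0 = 0.4
x1, x2 = [1, 2], [3]            # X^(1) = (X_1, X_2), X^(2) = (X_3)
p1, p2 = p[:2], p[2:]

# Conditional pmf according to the rule above: NM(x0 + sum(x2), p1).
cond = exp(nm_logpmf(x1, x0 + sum(x2), p1))

# Joint pmf divided by the marginal pmf of X^(2), which is NM(x0, p2 / q)
# with q = p_0 + sum(p2) = 1 - sum(p1).
q = 1.0 - sum(p1)
joint = exp(nm_logpmf(x1 + x2, x0, p))
marg2 = exp(nm_logpmf(x2, x0, [pi / q for pi in p2]))
print(cond, joint / marg2)  # the two values should agree up to rounding
```

Working on the log scale with `lgamma` keeps the computation stable for non-integer $x_{0}$ and for large counts.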

Independent sums

If $𝐗_{1} \sim NM (r_{1}, 𝐩)$ and $𝐗_{2} \sim NM (r_{2}, 𝐩)$ are independent, then $𝐗_{1} + 𝐗_{2} \sim NM (r_{1} + r_{2}, 𝐩)$. Conversely, it follows from the characteristic function that the negative multinomial is infinitely divisible.
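Both statements can be sketched as follows, assuming the standard form of the negative multinomial characteristic function (not stated in this article), $φ_{𝐗} (𝐭) = (\frac{p_{0}}{1 - \sum_{j = 1}^{m} p_{j} e^{\mathrm{i} t_{j}}})^{x_{0}}$. For independent $𝐗_{1}$ and $𝐗_{2}$ the characteristic functions multiply:

$φ_{𝐗_{1} + 𝐗_{2}} (𝐭) = φ_{𝐗_{1}} (𝐭) \, φ_{𝐗_{2}} (𝐭) = (\frac{p_{0}}{1 - \sum_{j = 1}^{m} p_{j} e^{\mathrm{i} t_{j}}})^{r_{1} + r_{2}} ,$

which is the characteristic function of $NM (r_{1} + r_{2}, 𝐩)$. In the other direction, for any positive integer $k$,

$φ_{𝐗} (𝐭) = [(\frac{p_{0}}{1 - \sum_{j = 1}^{m} p_{j} e^{\mathrm{i} t_{j}}})^{x_{0} / k}]^{k} ,$

so $𝐗 \sim NM (x_{0}, 𝐩)$ has the same distribution as a sum of $k$ independent $NM (x_{0} / k, 𝐩)$ vectors.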

Aggregation

If $𝐗 = (X_{1}, \dots, X_{m}) \sim NM (x_{0}, (p_{1}, \dots, p_{m}))$ then, if the random variables with subscripts i and j are dropped from the vector and replaced by their sum, $𝐗^{'} = (X_{1}, \dots, X_{i} + X_{j}, \dots, X_{m}) \sim NM (x_{0}, (p_{1}, \dots, p_{i} + p_{j}, \dots, p_{m})) .$

This aggregation property may be used to derive the marginal distribution of $X_{i}$ mentioned above.
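As a small illustration of aggregation on the parameter vector, the following sketch merges two categories and then collapses all of them. The helper name `aggregate` and its zero-based indexing are assumptions made for the example.

```python
def aggregate(p, i, j):
    """Merge categories i and j of p = [p_1..p_m] (zero-based indices, i < j).

    The merged probability p_i + p_j is stored at position i and entry j is dropped,
    mirroring the vector X' above; x0 is unchanged by aggregation.
    """
    merged = list(p)
    merged[i] = p[i] + p[j]
    del merged[j]
    return merged

p = [0.3, 0.2, 0.1]           # implied p_0 = 0.4
p_merged = aggregate(p, 0, 2)  # X_1 + X_3 replaces X_1, and X_3 is removed
print(p_merged)

# Repeating the merge collapses everything into the total X_1 + ... + X_m,
# whose single probability is 1 - p_0.
p_total = p
while len(p_total) > 1:
    p_total = aggregate(p_total, 0, 1)
print(p_total)
```

The fully collapsed case corresponds to the statement in the introduction that the sum X₁+...+X_m is a draw from a negative binomial distribution.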

Correlation matrix

The diagonal entries of the correlation matrix are $ρ (X_{i}, X_{i}) = 1$, and for $i \neq j$ the off-diagonal entries are $ρ (X_{i}, X_{j}) = \frac{cov (X_{i}, X_{j})}{\sqrt{var (X_{i}) var (X_{j})}} = \sqrt{\frac{p_{i} p_{j}}{(p_{0} + p_{i}) (p_{0} + p_{j})}} .$
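The closed form can be cross-checked against the covariance matrix given under Parameter estimation. This is a minimal sketch; the function names `nm_correlation` and `nm_covariance` and the parameter values are assumptions for the example.

```python
from math import sqrt

def nm_correlation(p0, p):
    """Correlation matrix from the closed form; x0 does not appear in it."""
    m = len(p)
    return [[1.0 if i == j else
             sqrt(p[i] * p[j] / ((p0 + p[i]) * (p0 + p[j])))
             for j in range(m)] for i in range(m)]

def nm_covariance(x0, p0, p):
    """Covariance matrix (x0 / p0^2) p p' + (x0 / p0) diag(p)."""
    m = len(p)
    return [[x0 * p[i] * p[j] / p0**2 + (x0 * p[i] / p0 if i == j else 0.0)
             for j in range(m)] for i in range(m)]

p0, p = 0.5, [0.3, 0.2]
rho = nm_correlation(p0, p)
cov = nm_covariance(4.0, p0, p)  # any x0 > 0 gives the same correlation
rho_from_cov = cov[0][1] / sqrt(cov[0][0] * cov[1][1])
print(rho[0][1], rho_from_cov)   # the two values should agree up to rounding
```

Since every off-diagonal entry is a square root of a positive quantity, the components are always positively correlated when all $p_{i} > 0$.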

Parameter estimation

Method of Moments

If we let the mean vector of the negative multinomial be $𝝁 = \frac{x_{0}}{p_{0}} 𝐩$ and covariance matrix $𝜮 = \frac{x_{0}}{p_{0}^{2}} 𝐩 𝐩^{'} + \frac{x_{0}}{p_{0}} diag (𝐩),$ then it is easy to show through properties of determinants that $| 𝜮 | = \frac{1}{p_{0}} \prod_{i = 1}^{m} μ_{i}$ . From this, it can be shown that $x_{0} = \frac{\sum μ_{i} \prod μ_{i}}{| 𝜮 | - \prod μ_{i}}$ and $𝐩 = \frac{| 𝜮 | - \prod μ_{i}}{| 𝜮 | \sum μ_{i}} 𝝁 .$
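One way to obtain the determinant, sketched here with the matrix determinant lemma $\det (𝐃 + 𝐮 𝐯^{'}) = \det (𝐃) (1 + 𝐯^{'} 𝐃^{- 1} 𝐮)$ (one possible route, not necessarily the intended one), is to write the covariance matrix in terms of the mean vector:

$𝜮 = diag (𝝁) + \frac{1}{x_{0}} 𝝁 𝝁^{'} .$

Taking $𝐃 = diag (𝝁)$, $𝐮 = 𝝁 / x_{0}$ and $𝐯 = 𝝁$ gives

$| 𝜮 | = (\prod_{i = 1}^{m} μ_{i}) (1 + \frac{1}{x_{0}} \sum_{i = 1}^{m} μ_{i}) = (\prod_{i = 1}^{m} μ_{i}) (1 + \frac{1 - p_{0}}{p_{0}}) = \frac{1}{p_{0}} \prod_{i = 1}^{m} μ_{i} ,$

using $\sum_{i = 1}^{m} μ_{i} = x_{0} (1 - p_{0}) / p_{0}$. Consequently $| 𝜮 | - \prod μ_{i} = \frac{1 - p_{0}}{p_{0}} \prod μ_{i}$, and dividing $\sum μ_{i} \prod μ_{i}$ by this quantity returns $x_{0}$.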

Substituting the sample mean vector $\bar{𝒙}$ and the sample covariance matrix $𝑺$ yields the method of moments estimates ${\hat{x}}_{0} = \frac{(\sum_{i = 1}^{m} {\bar{x}}_{i}) \prod_{i = 1}^{m} {\bar{x}}_{i}}{| 𝑺 | - \prod_{i = 1}^{m} {\bar{x}}_{i}}$ and $\hat{𝐩} = (\frac{| 𝑺 | - \prod_{i = 1}^{m} {\bar{x}}_{i}}{| 𝑺 | \sum_{i = 1}^{m} {\bar{x}}_{i}}) \bar{𝒙} .$
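The estimator can be sketched in plain Python as below. The names `det` and `mom_estimates` are assumptions for the example, and the check feeds in exact population moments rather than sample data, so it only confirms that the formulas invert the moment equations.

```python
def det(a):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [row[:] for row in a]  # work on a copy
    n, d = len(a), 1.0
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(a[r][c]))
        if a[piv][c] == 0:
            return 0.0
        if piv != c:
            a[c], a[piv] = a[piv], a[c]
            d = -d  # a row swap flips the sign
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d

def mom_estimates(mean, cov):
    """Return (x0_hat, p_hat) from a mean vector and a covariance matrix."""
    prod_mu = 1.0
    for mu in mean:
        prod_mu *= mu
    sum_mu = sum(mean)
    d = det(cov)
    x0_hat = sum_mu * prod_mu / (d - prod_mu)
    scale = (d - prod_mu) / (d * sum_mu)
    return x0_hat, [scale * mu for mu in mean]

# Exact moments for x0 = 3, p_0 = 0.5, p = (0.3, 0.2).
x0, p0, p = 3.0, 0.5, [0.3, 0.2]
mean = [x0 * pi / p0 for pi in p]
cov = [[x0 * p[i] * p[j] / p0**2 + (x0 * p[i] / p0 if i == j else 0.0)
        for j in range(2)] for i in range(2)]
x0_hat, p_hat = mom_estimates(mean, cov)
print(x0_hat, p_hat)  # should recover x0 and p up to rounding
```

With real data, `mean` and `cov` would be replaced by $\bar{𝒙}$ and $𝑺$. The estimate of $x_{0}$ is positive only when $| 𝑺 |$ exceeds $\prod {\bar{x}}_{i}$, so the estimator can fail on samples that do not show that overdispersion.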

Related distributions

Negative binomial distribution
Multinomial distribution
Inverted Dirichlet distribution, a conjugate prior for the negative multinomial
Dirichlet negative multinomial distribution

References

[1] Le Gall, F. The modes of a negative multinomial distribution. Statistics & Probability Letters, Volume 76, Issue 6, 15 March 2006, Pages 619-624. ISSN 0167-7152. doi:10.1016/j.spl.2005.09.009.

Waller LA and Zelterman D. (1997). Log-linear modeling with the negative multinomial distribution. Biometrics 53: 971–82.
