Compound Poisson distribution

From testwiki
Jump to navigation Jump to search

Template:Short description In probability theory, a compound Poisson distribution is the probability distribution of the sum of a number of independent identically-distributed random variables, where the number of terms to be added is itself a Poisson-distributed variable. The result can be either a continuous or a discrete distribution.

Definition

Suppose that

NPoisson(λ),

i.e., N is a random variable whose distribution is a Poisson distribution with expected value λ, and that

X1,X2,X3,

are identically distributed random variables that are mutually independent and also independent of N. Then the probability distribution of the sum of N i.i.d. random variables

Y=n=1NXn

is a compound Poisson distribution.

In the case N = 0, then this is a sum of 0 terms, so the value of Y is 0. Hence the conditional distribution of Y given that N = 0 is a degenerate distribution.

The compound Poisson distribution is obtained by marginalising the joint distribution of (Y,N) over N, and this joint distribution can be obtained by combining the conditional distribution Y | N with the marginal distribution of N.

Properties

The expected value and the variance of the compound distribution can be derived in a simple way from law of total expectation and the law of total variance. Thus

E(Y)=E[E(YN)]=E[NE(X)]=E(N)E(X),
Var(Y)=E[Var(YN)]+Var[E(YN)]=E[NVar(X)]+Var[NE(X)],=E(N)Var(X)+(E(X))2Var(N).

Then, since E(N) = Var(N) if N is Poisson-distributed, these formulae can be reduced to

E(Y)=E(N)E(X)=λE(X),
Var(Y)=E(N)(Var(X)+(E(X))2)=E(N)E(X2)=λE(X2).

The probability distribution of Y can be determined in terms of characteristic functions:

φY(t)=E(eitY)=E((E(eitXN))N)=E((φX(t))N),

and hence, using the probability-generating function of the Poisson distribution, we have

φY(t)=eλ(φX(t)1).

An alternative approach is via cumulant generating functions:

KY(t)=lnE[etY]=lnE[E[etYN]]=lnE[eNKX(t)]=KN(KX(t)).

Via the law of total cumulance it can be shown that, if the mean of the Poisson distribution λ = 1, the cumulants of Y are the same as the moments of X1.Template:Citation needed

Every infinitely divisible probability distribution is a limit of compound Poisson distributions.[1] And compound Poisson distributions is infinitely divisible by the definition.

Discrete compound Poisson distribution

When X1,X2,X3, are positive integer-valued i.i.d random variables with P(X1=k)=αk, (k=1,2,), then this compound Poisson distribution is named discrete compound Poisson distribution[2][3][4] (or stuttering-Poisson distribution[5]) . We say that the discrete random variable Y satisfying probability generating function characterization

PY(z)=i=0P(Y=i)zi=exp(k=1αkλ(zk1)),(|z|1)

has a discrete compound Poisson(DCP) distribution with parameters (α1λ,α2λ,) (where i=1αi=1, with αi0,λ>0), which is denoted by

XDCP(λα1,λα2,)

Moreover, if XDCP(λα1,,λαr), we say X has a discrete compound Poisson distribution of order r . When r=1,2, DCP becomes Poisson distribution and Hermite distribution, respectively. When r=3,4, DCP becomes triple stuttering-Poisson distribution and quadruple stuttering-Poisson distribution, respectively.[6] Other special cases include: shift geometric distribution, negative binomial distribution, Geometric Poisson distribution, Neyman type A distribution, Luria–Delbrück distribution in Luria–Delbrück experiment. For more special case of DCP, see the reviews paper[7] and references therein.

Feller's characterization of the compound Poisson distribution states that a non-negative integer valued r.v. X is infinitely divisible if and only if its distribution is a discrete compound Poisson distribution.[8] The negative binomial distribution is discrete infinitely divisible, i.e., if X has a negative binomial distribution, then for any positive integer n, there exist discrete i.i.d. random variables X1, ..., Xn whose sum has the same distribution that X has. The shift geometric distribution is discrete compound Poisson distribution since it is a trivial case of negative binomial distribution.

This distribution can model batch arrivals (such as in a bulk queue[5][9]). The discrete compound Poisson distribution is also widely used in actuarial science for modelling the distribution of the total claim amount.[3]

When some αk are negative, it is the discrete pseudo compound Poisson distribution.[3] We define that any discrete random variable Y satisfying probability generating function characterization

GY(z)=i=0P(Y=i)zi=exp(k=1αkλ(zk1)),(|z|1)

has a discrete pseudo compound Poisson distribution with parameters (λ1,λ2,)=:(α1λ,α2λ,) where i=1αi=1 and i=1|αi|<, with αi,λ>0.

Compound Poisson Gamma distribution

If X has a gamma distribution, of which the exponential distribution is a special case, then the conditional distribution of Y | N is again a gamma distribution. The marginal distribution of Y is a Tweedie distribution with variance power 1 < p < 2 (proof via comparison of characteristic function).[10] To be more explicit, if

NPoisson(λ),

and

XiΓ(α,β)

i.i.d., then the distribution of

Y=i=1NXi

is a reproductive exponential dispersion model ED(μ,σ2) with

E[Y]=λαβ=:μ,Var[Y]=λα(1+α)β2=:σ2μp.

The mapping of parameters Tweedie parameter μ,σ2,p to the Poisson and Gamma parameters λ,α,β is the following:

λ=μ2p(2p)σ2,α=2pp1,β=μ1p(p1)σ2.

Compound Poisson processes

Template:Main

A compound Poisson process with rate λ>0 and jump size distribution G is a continuous-time stochastic process {Y(t):t0} given by

Y(t)=i=1N(t)Di,

where the sum is by convention equal to zero as long as N(t) = 0. Here, {N(t):t0} is a Poisson process with rate λ, and {Di:i1} are independent and identically distributed random variables, with distribution function G, which are also independent of {N(t):t0}.[11]

For the discrete version of compound Poisson process, it can be used in survival analysis for the frailty models.[12]

Applications

A compound Poisson distribution, in which the summands have an exponential distribution, was used by Revfeim to model the distribution of the total rainfall in a day, where each day contains a Poisson-distributed number of events each of which provides an amount of rainfall which has an exponential distribution.[13] Thompson applied the same model to monthly total rainfalls.[14]

There have been applications to insurance claims[15][16] and x-ray computed tomography.[17][18][19]

See also

Template:Div col

Template:Div col end

References

Template:Reflist

Template:ProbDistributions

  1. Template:Cite book
  2. Johnson, N.L., Kemp, A.W., and Kotz, S. (2005) Univariate Discrete Distributions, 3rd Edition, Wiley, Template:ISBN.
  3. 3.0 3.1 3.2 Template:Cite journal
  4. Template:Cite journal
  5. 5.0 5.1 Template:Cite journal
  6. Patel, Y. C. (1976). Estimation of the parameters of the triple and quadruple stuttering-Poisson distributions. Technometrics, 18(1), 67-73.
  7. Wimmer, G., Altmann, G. (1996). The multiple Poisson distribution, its characteristics and a variety of forms. Biometrical journal, 38(8), 995-1011.
  8. Template:Cite book
  9. Template:Cite journal
  10. Template:Cite book
  11. Template:Cite book
  12. Template:Cite journal
  13. Template:Cite journal
  14. Template:Cite journal
  15. Template:Cite journal
  16. Template:Cite journal
  17. Template:Cite journal
  18. Template:Cite journal
  19. Template:Cite journal