Disintegration theorem

From testwiki
Jump to navigation Jump to search

Template:Short description Template:Use dmy dates In mathematics, the disintegration theorem is a result in measure theory and probability theory. It rigorously defines the idea of a non-trivial "restriction" of a measure to a measure zero subset of the measure space in question. It is related to the existence of conditional probability measures. In a sense, "disintegration" is the opposite process to the construction of a product measure.

Motivation

Consider the unit square S=[0,1]×[0,1] in the Euclidean plane 2. Consider the probability measure μ defined on S by the restriction of two-dimensional Lebesgue measure λ2 to S. That is, the probability of an event ES is simply the area of E. We assume E is a measurable subset of S.

Consider a one-dimensional subset of S such as the line segment Lx={x}×[0,1]. Lx has μ-measure zero; every subset of Lx is a μ-null set; since the Lebesgue measure space is a complete measure space, ELxμ(E)=0.

While true, this is somewhat unsatisfying. It would be nice to say that μ "restricted to" Lx is the one-dimensional Lebesgue measure λ1, rather than the zero measure. The probability of a "two-dimensional" event E could then be obtained as an integral of the one-dimensional probabilities of the vertical "slices" ELx: more formally, if μx denotes one-dimensional Lebesgue measure on Lx, then μ(E)=[0,1]μx(ELx)dx for any "nice" ES. The disintegration theorem makes this argument rigorous in the context of measures on metric spaces.

Statement of the theorem

(Hereafter, 𝒫(X) will denote the collection of Borel probability measures on a topological space (X,T).) The assumptions of the theorem are as follows:

  • Let Y and X be two Radon spaces (i.e. a topological space such that every Borel probability measure on it is inner regular, e.g. separably metrizable spaces; in particular, every probability measure on it is outright a Radon measure).
  • Let μ𝒫(Y).
  • Let π:YX be a Borel-measurable function. Here one should think of π as a function to "disintegrate" Y, in the sense of partitioning Y into {π1(x) | xX}. For example, for the motivating example above, one can define π((a,b))=a, (a,b)[0,1]×[0,1], which gives that π1(a)=a×[0,1], a slice we want to capture.
  • Let ν𝒫(X) be the pushforward measure ν=π*(μ)=μπ1. This measure provides the distribution of x (which corresponds to the events π1(x)).

The conclusion of the theorem: There exists a ν-almost everywhere uniquely determined family of probability measures {μx}xX𝒫(Y), which provides a "disintegration" of μ into Template:Nowrap such that:

  • the function xμx is Borel measurable, in the sense that xμx(B) is a Borel-measurable function for each Borel-measurable set BY;
  • μx "lives on" the fiber π1(x): for ν-almost all xX, μx(Yπ1(x))=0, and so μx(E)=μx(Eπ1(x));
  • for every Borel-measurable function f:Y[0,], Yf(y)dμ(y)=Xπ1(x)f(y)dμx(y)dν(x). In particular, for any event EY, taking f to be the indicator function of E,[1] μ(E)=Xμx(E)dν(x).

Applications

Product spaces

Template:More citations needed section The original example was a special case of the problem of product spaces, to which the disintegration theorem applies.

When Y is written as a Cartesian product Y=X1×X2 and πi:YXi is the natural projection, then each fibre π11(x1) can be canonically identified with X2 and there exists a Borel family of probability measures {μx1}x1X1 in 𝒫(X2) (which is (π1)*(μ)-almost everywhere uniquely determined) such that μ=X1μx1μ(π11(dx1))=X1μx1d(π1)*(μ)(x1), which is in particularTemplate:Clarify X1×X2f(x1,x2)μ(dx1,dx2)=X1(X2f(x1,x2)μ(dx2x1))μ(π11(dx1)) and μ(A×B)=Aμ(Bx1)μ(π11(dx1)).

The relation to conditional expectation is given by the identities E(fπ1)(x1)=X2f(x1,x2)μ(dx2x1), μ(A×Bπ1)(x1)=1A(x1)μ(Bx1).

Vector calculus

The disintegration theorem can also be seen as justifying the use of a "restricted" measure in vector calculus. For instance, in Stokes' theorem as applied to a vector field flowing through a compact surface Template:Nowrap, it is implicit that the "correct" measure on Σ is the disintegration of three-dimensional Lebesgue measure λ3 on Σ, and that the disintegration of this measure on ∂Σ is the same as the disintegration of λ3 on Σ.[2]

Conditional distributions

The disintegration theorem can be applied to give a rigorous treatment of conditional probability distributions in statistics, while avoiding purely abstract formulations of conditional probability.[3] The theorem is related to the Borel–Kolmogorov paradox, for example.

See also

References

Template:Reflist

Template:Measure theory