Total variation

From testwiki
Revision as of 04:35, 10 January 2025 by imported>Citation bot (Altered doi. Removed parameters. | Use this bot. Report bugs. | #UCB_CommandLine)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Template:Short description Template:Distinguish Template:Primary sources

In mathematics, the total variation identifies several slightly different concepts, related to the (local or global) structure of the codomain of a function or a measure. For a real-valued continuous function f, defined on an interval [a, b] ⊂ R, its total variation on the interval of definition is a measure of the one-dimensional arclength of the curve with parametric equation xf(x), for x ∈ [a, b]. Functions whose total variation is finite are called functions of bounded variation.

Historical note

The concept of total variation for functions of one real variable was first introduced by Camille Jordan in the paper Template:Harv.[1] He used the new concept in order to prove a convergence theorem for Fourier series of discontinuous periodic functions whose variation is bounded. The extension of the concept to functions of more than one variable however is not simple for various reasons.

Definitions

Total variation for functions of one real variable

Template:EquationRef The total variation of a real-valued (or more generally complex-valued) function f, defined on an interval [a,b] is the quantity

Vab(f)=sup𝒫i=0nP1|f(xi+1)f(xi)|,

where the supremum runs over the set of all partitions 𝒫={P={x0,,xnP}P is a partition of [a,b]} of the given interval. Which means that a=x0<x1<...<xnP=b.

Total variation for functions of n > 1 real variables

Template:Citation needed section Template:EquationRef[2] Let Ω be an open subset of Rn. Given a function f belonging to L1(Ω), the total variation of f in Ω is defined as

V(f,Ω):=sup{Ωf(x)divϕ(x)dx:ϕCc1(Ω,n), ϕL(Ω)1},

where

This definition does not require that the domain Ωn of the given function be a bounded set.

Total variation in measure theory

Classical total variation definition

Following Template:Harvtxt, consider a signed measure μ on a measurable space (X,Σ): then it is possible to define two set functions W(μ,) and W_(μ,), respectively called upper variation and lower variation, as follows

W(μ,E)=sup{μ(A)AΣ and AE}EΣ
W_(μ,E)=inf{μ(A)AΣ and AE}EΣ

clearly

W(μ,E)0W_(μ,E)EΣ

Template:EquationRef The variation (also called absolute variation) of the signed measure μ is the set function

|μ|(E)=W(μ,E)+|W_(μ,E)|EΣ

and its total variation is defined as the value of this measure on the whole space of definition, i.e.

μ=|μ|(X)

Modern definition of total variation norm

Template:Harvtxt uses upper and lower variations to prove the Hahn–Jordan decomposition: according to his version of this theorem, the upper and lower variation are respectively a non-negative and a non-positive measure. Using a more modern notation, define

μ+()=W(μ,),
μ()=W_(μ,),

Then μ+ and μ are two non-negative measures such that

μ=μ+μ
|μ|=μ++μ

The last measure is sometimes called, by abuse of notation, total variation measure.

Total variation norm of complex measures

If the measure μ is complex-valued i.e. is a complex measure, its upper and lower variation cannot be defined and the Hahn–Jordan decomposition theorem can only be applied to its real and imaginary parts. However, it is possible to follow Template:Harvtxt and define the total variation of the complex-valued measure μ as follows

Template:EquationRef The variation of the complex-valued measure μ is the set function

|μ|(E)=supπAπ|μ(A)|EΣ

where the supremum is taken over all partitions π of a measurable set E into a countable number of disjoint measurable subsets.

This definition coincides with the above definition |μ|=μ++μ for the case of real-valued signed measures.

Total variation norm of vector-valued measures

The variation so defined is a positive measure (see Template:Harvtxt) and coincides with the one defined by Template:EquationNote when μ is a signed measure: its total variation is defined as above. This definition works also if μ is a vector measure: the variation is then defined by the following formula

|μ|(E)=supπAπμ(A)EΣ

where the supremum is as above. This definition is slightly more general than the one given by Template:Harvtxt since it requires only to consider finite partitions of the space X: this implies that it can be used also to define the total variation on finite-additive measures.

Total variation of probability measures

Template:Unreferenced section Template:Main The total variation of any probability measure is exactly one, therefore it is not interesting as a means of investigating the properties of such measures. However, when μ and ν are probability measures, the total variation distance of probability measures can be defined as μν where the norm is the total variation norm of signed measures. Using the property that (μν)(X)=0, we eventually arrive at the equivalent definition

μν=|μν|(X)=2sup{|μ(A)ν(A)|:AΣ}

and its values are non-trivial. The factor 2 above is usually dropped (as is the convention in the article total variation distance of probability measures). Informally, this is the largest possible difference between the probabilities that the two probability distributions can assign to the same event. For a categorical distribution it is possible to write the total variation distance as follows

δ(μ,ν)=x|μ(x)ν(x)|.

It may also be normalized to values in [0,1] by halving the previous definition as follows

δ(μ,ν)=12x|μ(x)ν(x)|[3]

Basic properties

Total variation of differentiable functions

The total variation of a C1(Ω) function f can be expressed as an integral involving the given function instead of as the supremum of the functionals of definitions Template:EquationNote and Template:EquationNote.

The form of the total variation of a differentiable function of one variable

Template:EquationRef The total variation of a differentiable function f, defined on an interval [a,b], has the following expression if f is Riemann integrable

Vab(f)=ab|f(x)|dx

If f is differentiable and monotonic, then the above simplifies to

Vab(f)=|f(a)f(b)|

For any differentiable function f, we can decompose the domain interval [a,b], into subintervals [a,a1],[a1,a2],,[aN,b] (with a<a1<a2<<aN<b) in which f is locally monotonic, then the total variation of f over [a,b] can be written as the sum of local variations on those subintervals:

Vab(f)=Vaa1(f)+Va1a2(f)++VaNb(f)=|f(a)f(a1)|+|f(a1)f(a2)|++|f(aN)f(b)|

The form of the total variation of a differentiable function of several variables

Template:EquationRef Given a C1(Ω) function f defined on a bounded open set Ωn, with Ω of class C1, the total variation of f has the following expression

V(f,Ω)=Ω|f(x)|dx .
Proof

The first step in the proof is to first prove an equality which follows from the Gauss–Ostrogradsky theorem.

Lemma

Under the conditions of the theorem, the following equality holds:

Ωfdivφ=Ωfφ
Proof of the lemma

From the Gauss–Ostrogradsky theorem:

Ωdiv𝐑=Ω𝐑𝐧

by substituting 𝐑:=fφ, we have:

Ωdiv(fφ)=Ω(fφ)𝐧

where φ is zero on the border of Ω by definition:

Ωdiv(fφ)=0
Ωxi(fφi)=0
Ωφixif+fxiφi=0
Ωfxiφi=Ωφixif
Ωfdivφ=Ωφf
Proof of the equality

Under the conditions of the theorem, from the lemma we have:

Ωfdivφ=Ωφf|Ωφf|Ω|φ||f|Ω|f|

in the last part φ could be omitted, because by definition its essential supremum is at most one.

On the other hand, we consider θN:=𝕀[N,N]𝕀{f0}f|f| and θN* which is the up to ε approximation of θN in Cc1 with the same integral. We can do this since Cc1 is dense in L1. Now again substituting into the lemma:

limNΩfdivθN*=limN{f0}𝕀[N,N]ff|f|=limN[N,N]{f0}ff|f|=Ω|f|

This means we have a convergent sequence of Ωfdivφ that tends to Ω|f| as well as we know that ΩfdivφΩ|f|. Q.E.D.

It can be seen from the proof that the supremum is attained when

φf|f|.

The function f is said to be of bounded variation precisely if its total variation is finite.

Total variation of a measure

The total variation is a norm defined on the space of measures of bounded variation. The space of measures on a σ-algebra of sets is a Banach space, called the ca space, relative to this norm. It is contained in the larger Banach space, called the ba space, consisting of finitely additive (as opposed to countably additive) measures, also with the same norm. The distance function associated to the norm gives rise to the total variation distance between two measures μ and ν.

For finite measures on R, the link between the total variation of a measure μ and the total variation of a function, as described above, goes as follows. Given μ, define a function φ: by

φ(t)=μ((,t]).

Then, the total variation of the signed measure μ is equal to the total variation, in the above sense, of the function φ. In general, the total variation of a signed measure can be defined using Jordan's decomposition theorem by

μTV=μ+(X)+μ(X),

for any signed measure μ on a measurable space (X,Σ).

Applications

Total variation can be seen as a non-negative real-valued functional defined on the space of real-valued functions (for the case of functions of one variable) or on the space of integrable functions (for the case of functions of several variables). As a functional, total variation finds applications in several branches of mathematics and engineering, like optimal control, numerical analysis, and calculus of variations, where the solution to a certain problem has to minimize its value. As an example, use of the total variation functional is common in the following two kind of problems

See also

Notes

Template:More footnotes Template:Reflist

Historical references

Template:Sfn whitelist

References

One variable

One and more variables

Measure theory

Applications