Itô's lemma

From testwiki
Jump to navigation Jump to search

Template:Short description Template:About In mathematics, Itô's lemma or Itô's formula is an identity used in Itô calculus to find the differential of a time-dependent function of a stochastic process. It serves as the stochastic calculus counterpart of the chain rule. It can be heuristically derived by forming the Taylor series expansion of the function up to its second derivatives and retaining terms up to first order in the time increment and second order in the Wiener process increment. The lemma is widely employed in mathematical finance, and its best known application is in the derivation of the Black–Scholes equation for option values.

This result was discovered by Japanese mathematician Kiyoshi Itô in 1951.[1]

Motivation

Suppose we are given the stochastic differential equation dXt=μt dt+σt dBt, where Template:Math is a Wiener process and the functions μt,σt are deterministic (not stochastic) functions of time. In general, it's not possible to write a solution Xt directly in terms of Bt. However, we can formally write an integral solution Xt=0tμs ds+0tσs dBs.

This expression lets us easily read off the mean and variance of Xt (which has no higher moments). First, notice that every dBt individually has mean 0, so the expected value of Xt is simply the integral of the drift function: E[Xt]=0tμs ds.

Similarly, because the dB terms have variance 1 and no correlation with one another, the variance of Xt is simply the integral of the variance of each infinitesimal step in the random walk: Var[Xt]=0tσs2 ds.

However, sometimes we are faced with a stochastic differential equation for a more complex process Yt, in which the process appears on both sides of the differential equation. That is, say dYt=a1(Yt,t) dt+a2(Yt,t) dBt, for some functions a1 and a2. In this case, we cannot immediately write a formal solution as we did for the simpler case above. Instead, we hope to write the process Yt as a function of a simpler process Xt taking the form above. That is, we want to identify three functions f(t,x),μt, and σt, such that Yt=f(t,Xt) and dXt=μt dt+σt dBt. In practice, Ito's lemma is used in order to find this transformation. Finally, once we have transformed the problem into the simpler type of problem, we can determine the mean and higher moments of the process.

Derivation

We derive Itô's lemma by expanding a Taylor series and applying the rules of stochastic calculus.

Suppose Xt is an Itô drift-diffusion process that satisfies the stochastic differential equation

dXt=μtdt+σtdBt,

where Template:Math is a Wiener process.

If Template:Math is a twice-differentiable scalar function, its expansion in a Taylor series is

Δf(t)dtdt=f(t+dt,x)f(t,x)=ftdt+122ft2(dt)2++
Δf(x)dxdx=f(t,x+dx)f(t,x)=fxdx+122fx2(dx)2+

Then use the total derivative and the definition of the partial derivative fy=limdy0Δf(y)dy:

df=ftdt+fxdx=limdx0,dt0ftdt+122ft2(dt)2++fxdx+122fx2(dx)2+.

Substituting x=Xt and therefore dx=dXt=μtdt+σtdBt, we get

df=limdBt0,dt0ftdt+122ft2(dt)2++fx(μtdt+σtdBt)+122fx2(μt2(dt)2+2μtσtdtdBt+σt2(dBt)2)+.

In the limit dt0, the terms (dt)2 and dtdBt tend to zero faster than dt. (dBt)2 is O(dt) (due to the quadratic variation of a Wiener process which says Bt2=O(t)), so setting (dt)2,dtdBt and (dx)3 terms to zero and substituting dt for (dBt)2, and then collecting the dt terms, we obtain

df=limdt0(ft+μtfx+σt222fx2)dt+σtfxdBt

as required.

Alternatively,

df=limdt0(ft+σt222fx2)dt+fxdXt

Geometric intuition

When Xt+dt is a Gaussian random variable, f(Xt+dt) is also approximately Gaussian random variable, but its mean E[f(Xt+dt)] differs from f(E[Xt+dt]) by a factor proportional to f(E[Xt+dt]) and the variance of Xt+dt.

Suppose we know that Xt,Xt+dt are two jointly-Gaussian distributed random variables, and f is nonlinear but has continuous second derivative, then in general, neither of f(Xt),f(Xt+dt) is Gaussian, and their joint distribution is also not Gaussian. However, since Xt+dtXt is Gaussian, we might still find f(Xt+dt)f(Xt) is Gaussian. This is not true when dt is finite, but when dt becomes infinitesimal, this becomes true.

The key idea is that Xt+dt=Xt+μtdt+dWt has a deterministic part and a noisy part. When f is nonlinear, the noisy part has a deterministic contribution. If f is convex, then the deterministic contribution is positive (by Jensen's inequality).

To find out how large the contribution is, we write Xt+dt=Xt+μtdt+σtdtz, where z is a standard Gaussian, then perform Taylor expansion. f(Xt+dt)=f(Xt)+f(Xt)μtdt+f(Xt)σtdtz+12f(Xt)(σt2z2dt+2μtσtzdt3/2+μt2dt2)+o(dt)=(f(Xt)+f(Xt)μtdt+12f(Xt)σt2dt+o(dt))+(f(Xt)σtdtz+12f(Xt)σt2(z21)dt+o(dt))We have split it into two parts, a deterministic part, and a random part with mean zero. The random part is non-Gaussian, but the non-Gaussian parts decay faster than the Gaussian part, and at the dt0 limit, only the Gaussian part remains. The deterministic part has the expected f(Xt)+f(Xt)μtdt, but also a part contributed by the convexity: 12f(Xt)σt2dt.

To understand why there should be a contribution due to convexity, consider the simplest case of geometric Brownian walk (of the stock market): St+dt=St(1+dBt). In other words, d(lnSt)=dBt. Let Xt=lnSt, then St=eXt, and Xt is a Brownian walk. However, although the expectation of Xt remains constant, the expectation of St grows. Intuitively it is because the downside is limited at zero, but the upside is unlimited. That is, while Xt is normally distributed, St is log-normally distributed.

Mathematical formulation of Itô's lemma

In the following subsections we discuss versions of Itô's lemma for different types of stochastic processes.

Itô drift-diffusion processes (due to: Kunita–Watanabe)

In its simplest form, Itô's lemma states the following: for an Itô drift-diffusion process

dXt=μtdt+σtdBt

and any twice differentiable scalar function Template:Math of two real variables Template:Mvar and Template:Mvar, one has

df(t,Xt)=(ft+μtfx+σt222fx2)dt+σtfxdBt.

This immediately implies that Template:Math is itself an Itô drift-diffusion process.

In higher dimensions, if 𝐗t=(Xt1,Xt2,,Xtn)T is a vector of Itô processes such that

d𝐗t=μtdt+𝐆td𝐁t

for a vector μt and matrix 𝐆t, Itô's lemma then states that

df(t,𝐗t)=ftdt+(𝐗f)Td𝐗t+12(d𝐗t)T(H𝐗f)d𝐗t,={ft+(𝐗f)Tμt+12Tr[𝐆tT(H𝐗f)𝐆t]}dt+(𝐗f)T𝐆td𝐁t

where 𝐗f is the gradient of Template:Math w.r.t. Template:Math, Template:Math is the Hessian matrix of Template:Math w.r.t. Template:Math, and Template:Math is the trace operator.

Poisson jump processes

We may also define functions on discontinuous stochastic processes.

Let Template:Mvar be the jump intensity. The Poisson process model for jumps is that the probability of one jump in the interval Template:Math is Template:Math plus higher order terms. Template:Mvar could be a constant, a deterministic function of time, or a stochastic process. The survival probability Template:Math is the probability that no jump has occurred in the interval Template:Math. The change in the survival probability is

dps(t)=ps(t)h(t)dt.

So

ps(t)=exp(0th(u)du).

Let Template:Math be a discontinuous stochastic process. Write S(t) for the value of S as we approach t from the left. Write djS(t) for the non-infinitesimal change in Template:Math as a result of a jump. Then

djS(t)=limΔt0(S(t+Δt)S(t))

Let z be the magnitude of the jump and let η(S(t),z) be the distribution of z. The expected magnitude of the jump is

E[djS(t)]=h(S(t))dtzzη(S(t),z)dz.

Define dJS(t), a compensated process and martingale, as

dJS(t)=djS(t)E[djS(t)]=S(t)S(t)(h(S(t))zzη(S(t),z)dz)dt.

Then

djS(t)=E[djS(t)]+dJS(t)=h(S(t))(zzη(S(t),z)dz)dt+dJS(t).

Consider a function g(S(t),t) of the jump process Template:Math. If Template:Math jumps by Template:Math then Template:Math jumps by Template:Math. Template:Math is drawn from distribution ηg() which may depend on g(t), dg and S(t). The jump part of g is

g(t)g(t)=h(t)dtΔgΔgηg()dΔg+dJg(t).

If S contains drift, diffusion and jump parts, then Itô's Lemma for g(S(t),t) is

dg(t)=(gt+μgS+σ222gS2+h(t)Δg(Δgηg()dΔg))dt+gSσdW(t)+dJg(t).

Itô's lemma for a process which is the sum of a drift-diffusion process and a jump process is just the sum of the Itô's lemma for the individual parts.

Non-continuous semimartingales

Itô's lemma can also be applied to general Template:Mvar-dimensional semimartingales, which need not be continuous. In general, a semimartingale is a càdlàg process, and an additional term needs to be added to the formula to ensure that the jumps of the process are correctly given by Itô's lemma. For any cadlag process Template:Math, the left limit in Template:Mvar is denoted by Template:Math, which is a left-continuous process. The jumps are written as Template:Math. Then, Itô's lemma states that if Template:Math is a Template:Mvar-dimensional semimartingale and f is a twice continuously differentiable real valued function on Template:Math then f(X) is a semimartingale, and

f(Xt)=f(X0)+i=1d0tfi(Xs)dXsi+12i,j=1d0tfi,j(Xs)d[Xi,Xj]s+st(Δf(Xs)i=1dfi(Xs)ΔXsi12i,j=1dfi,j(Xs)ΔXsiΔXsj).

This differs from the formula for continuous semi-martingales by the additional term summing over the jumps of X, which ensures that the jump of the right hand side at time Template:Mvar is Δf(Xt).

Multiple non-continuous jump processes

Template:Citation neededThere is also a version of this for a twice-continuously differentiable in space once in time function f evaluated at (potentially different) non-continuous semi-martingales which may be written as follows:

f(t,Xt1,,Xtd)=f(0,X01,,X0d)+0tf˙(s,Xs1,,Xsd)ds+i=1d0tfi(s,Xs1,,Xsd)dXs(c,i)+12i1,,id=1d0tfi1,,id(s,Xs1,,Xsd)dXs(c,i1)Xs(c,id)+0<st[f(s,Xs1,,Xsd)f(s,Xs1,,Xsd)]

where Xc,i denotes the continuous part of the ith semi-martingale.

Examples

Geometric Brownian motion

A process S is said to follow a geometric Brownian motion with constant volatility σ and constant drift μ if it satisfies the stochastic differential equation dSt=σStdBt+μStdt, for a Brownian motion B. Applying Itô's lemma with f(St)=log(St) gives

df=f(St)dSt+12f(St)(dSt)2=1StdSt+12(St2)(St2σ2dt)=1St(σStdBt+μStdt)12σ2dt=σdBt+(μσ22)dt.

It follows that

log(St)=log(S0)+σBt+(μσ22)t,

exponentiating gives the expression for S,

St=S0exp(σBt+(μσ22)t).

The correction term of Template:Math corresponds to the difference between the median and mean of the log-normal distribution, or equivalently for this distribution, the geometric mean and arithmetic mean, with the median (geometric mean) being lower. This is due to the AM–GM inequality, and corresponds to the logarithm being concave (or convex upwards), so the correction term can accordingly be interpreted as a convexity correction. This is an infinitesimal version of the fact that the annualized return is less than the average return, with the difference proportional to the variance. See geometric moments of the log-normal distributionTemplate:Broken anchor for further discussion.

The same factor of Template:Math appears in the d1 and d2 auxiliary variables of the Black–Scholes formula, and can be interpreted as a consequence of Itô's lemma.

Doléans-Dade exponential

The Doléans-Dade exponential (or stochastic exponential) of a continuous semimartingale X can be defined as the solution to the SDE Template:Math with initial condition Template:Math. It is sometimes denoted by Template:Math. Applying Itô's lemma with f(Y) = log(Y) gives

dlog(Y)=1YdY12Y2d[Y]=dX12d[X].

Exponentiating gives the solution

Yt=exp(XtX012[X]t).

Black–Scholes formula

Template:More Itô's lemma can be used to derive the Black–Scholes equation for an option.[2] Suppose a stock price follows a geometric Brownian motion given by the stochastic differential equation Template:Math. Then, if the value of an option at time Template:Mvar is f(t, St), Itô's lemma gives

df(t,St)=(ft+12(Stσ)22fS2)dt+fSdSt.

The term Template:Math represents the change in value in time dt of the trading strategy consisting of holding an amount Template:Math of the stock. If this trading strategy is followed, and any cash held is assumed to grow at the risk free rate r, then the total value V of this portfolio satisfies the SDE

dVt=r(VtfSSt)dt+fSdSt.

This strategy replicates the option if V = f(t,S). Combining these equations gives the celebrated Black–Scholes equation

ft+σ2S222fS2+rSfSrf=0.

Product rule for Itô processes

Let 𝐗t be a two-dimensional Ito process with SDE:

d𝐗t=d(Xt1Xt2)=(μt1μt2)dt+(σt1σt2)dBt

Then we can use the multi-dimensional form of Ito's lemma to find an expression for d(Xt1Xt2).

We have μt=(μt1μt2) and 𝐆=(σt1σt2).

We set f(t,𝐗t)=Xt1Xt2 and observe that ft=0, (𝐗f)T=(Xt2  Xt1) and H𝐗f=(0110)

Substituting these values in the multi-dimensional version of the lemma gives us:

d(Xt1Xt2)=df(t,𝐗t)=0dt+(Xt2  Xt1)d𝐗t+12(dXt1  dXt2)(0110)(dXt1dXt2)=Xt2dXt1+Xt1dXt2+dXt1dXt2

This is a generalisation of Leibniz's product rule to Ito processes, which are non-differentiable.

Further, using the second form of the multidimensional version above gives us

d(Xt1Xt2)={0+(Xt2  Xt1)(μt1μt2)+12Tr[(σt1  σt2)(0110)(σt1σt2)]}dt+(Xt2σt1+Xt1σt2)dBt=(Xt2μt1+Xt1μt2+σt1σt2)dt+(Xt2σt1+Xt1σt2)dBt

so we see that the product Xt1Xt2 is itself an Itô drift-diffusion process.

Itô's formula for functions with finite quadratic variation

Hans Föllmer provided a non-probabilistic proof of the Itô formula and showed that it holds for all functions with finite quadratic variation.[3]

Let fC2 be a real-valued function and x:[0,] a right-continuous function with left limits and finite quadratic variation [x]. Then

f(xt)=f(x0)+0tf(xs)dxs+12]0,t]f(xs)d[x]s+0st(f(xs)f(xs)f(xs)Δxs12f(xs)(Δxs)2)).

where the quadratic variation of $x$ is defined as a limit along a sequence of partitions Dn of [0,t] with step decreasing to zero:

[x](t)=limntknDn(xtk+1nxtkn)2.

Higher-order Itô formula

Rama Cont and Nicholas Perkowski extended the Ito formula to functions with finite p-th variation:.[4] For a continuous function with finite p-th variation

[x]p(t)=limntknDn(xtk+1nxtkn)p

the change of variable formula is:

f(xt)=f(x0)+0tp1f(xs)dxs+1p!]0,t]fp(xs)d[x]sp

where the first integral is defined as a limit of compensated left Riemann sums along a sequence of partitions Dn:

0tp1f(xs)dxs:=tknDnk=1p1fk(xtkn)k!(xtk+1nxtkn)k.

Infinite-dimensional formulas

There exist a couple of extensions to infinite-dimensional spaces (e.g. Pardoux,[5] Gyöngy-Krylov,[6] Brzezniak-van Neerven-Veraar-Weis[7]).

See also

Notes

Template:Reflist

References

  • Kiyosi Itô (1944). Stochastic Integral. Proc. Imperial Acad. Tokyo 20, 519–524. This is the paper with the Ito Formula; Online
  • Kiyosi Itô (1951). On stochastic differential equations. Memoirs, American Mathematical Society 4, 1–51. Online
  • Bernt Øksendal (2000). Stochastic Differential Equations. An Introduction with Applications, 5th edition, corrected 2nd printing. Springer. Template:ISBN. Sections 4.1 and 4.2.
  • Philip E Protter (2005). Stochastic Integration and Differential Equations, 2nd edition. Springer. Template:ISBN. Section 2.7.