Ostrogradsky instability

Revision as of 07:05, 21 October 2024 by imported>Fgnievinski (top)

In applied mathematics, the Ostrogradsky instability is a feature of some solutions of theories whose equations of motion contain more than two time derivatives (higher-derivative theories). It is suggested by a theorem of Mikhail Ostrogradsky in classical mechanics, according to which a non-degenerate Lagrangian that depends on time derivatives higher than the first corresponds to a Hamiltonian unbounded from below. As usual, the Hamiltonian is associated with the Lagrangian via a Legendre transform. The Ostrogradsky instability has been proposed as an explanation of why no differential equations of order higher than two appear to describe physical phenomena.[1] However, Ostrogradsky's theorem does not imply that all solutions of higher-derivative theories are unstable, as many counterexamples are known.[2][3][4][5][6][7][8][9][10]

Outline of proof [11]

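The main points of the proof can be made clearer by considering a one-dimensional system with a Lagrangian $L(q, \dot{q}, \ddot{q})$. The Euler–Lagrange equation is

$$\frac{\partial L}{\partial q} - \frac{d}{dt}\frac{\partial L}{\partial \dot{q}} + \frac{d^2}{dt^2}\frac{\partial L}{\partial \ddot{q}} = 0.$$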

Non-degeneracy of $L$ means that the canonical coordinates can be expressed in terms of the derivatives of $q$ and vice versa. Thus, $\partial L/\partial \ddot{q}$ is a function of $\ddot{q}$ (if it were not, the Jacobian $\det[\partial^2 L/(\partial \ddot{q}_i\, \partial \ddot{q}_j)]$ would vanish, which would mean that $L$ is degenerate), meaning that we can write $q^{(4)} = F(q, \dot{q}, \ddot{q}, q^{(3)})$ or, inverting, $q = G(t, q_0, \dot{q}_0, \ddot{q}_0, q^{(3)}_0)$. Since the evolution of $q$ depends upon four initial parameters, this means that there are four canonical coordinates. We can write those as

$$Q_1 := q$$
$$Q_2 := \dot{q}$$

and by using the definition of the conjugate momentum,

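$$P_1 := \frac{\partial L}{\partial \dot{q}} - \frac{d}{dt}\frac{\partial L}{\partial \ddot{q}}$$
$$P_2 := \frac{\partial L}{\partial \ddot{q}}$$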

The above results can be obtained as follows. First, we rewrite the Lagrangian in "ordinary" (first-order) form by introducing a Lagrange multiplier $\lambda$ as a new dynamical variable:

$$L(q, \dot{q}, \ddot{q}) \to \tilde{L} = L(Q_1, \dot{Q}_1, \dot{Q}_2) - \lambda\,(Q_2 - \dot{Q}_1),$$

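from which the Euler–Lagrange equations for $Q_1$, $Q_2$, $\lambda$ read

$$Q_1:\quad \frac{d}{dt}\frac{\partial L}{\partial \dot{Q}_1} + \dot{\lambda} - \frac{\partial L}{\partial Q_1} = 0,$$
$$Q_2:\quad \frac{d}{dt}\frac{\partial L}{\partial \dot{Q}_2} + \lambda = 0,$$
$$\lambda:\quad Q_2 - \dot{Q}_1 = 0.$$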

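Now, the canonical momenta $P_1$, $P_2$ with respect to $\tilde{L}$ are readily shown to be

$$P_1 = \frac{\partial \tilde{L}}{\partial \dot{Q}_1} = \frac{\partial L}{\partial \dot{Q}_1} + \lambda = \frac{\partial L}{\partial \dot{Q}_1} - \frac{d}{dt}\frac{\partial L}{\partial \dot{Q}_2}$$
$$P_2 = \frac{\partial \tilde{L}}{\partial \dot{Q}_2} = \frac{\partial L}{\partial \dot{Q}_2}$$

while

$$P_\lambda = 0$$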

These are precisely the definitions of Ostrogradsky given above. One may proceed further to evaluate the Hamiltonian

$$\tilde{H} = P_1 \dot{Q}_1 + P_2 \dot{Q}_2 + P_\lambda \dot{\lambda} - \tilde{L} = P_1 Q_2 + P_2 \dot{Q}_2 - L,$$

where the second equality uses the constraint $Q_2 = \dot{Q}_1$ from the Euler–Lagrange equations above, together with $P_\lambda = 0$. We note that, due to non-degeneracy, we can write $\ddot{q} = \dot{Q}_2$ as $a(Q_1, Q_2, P_2)$. Here, only three arguments are needed since the Lagrangian itself only has three free parameters. Therefore, the last expression depends only on $P_1, P_2, Q_1, Q_2$, and it effectively serves as the Hamiltonian of the original theory, namely,

$$H = P_1 Q_2 + P_2\, a(Q_1, Q_2, P_2) - L(Q_1, Q_2, P_2).$$
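
As a hedged illustration of the construction, consider the simplest non-degenerate case, $L = \tfrac{1}{2}\ddot{q}^2$. This particular Lagrangian is an assumption chosen for brevity and does not come from the source. Its Euler–Lagrange equation reduces to $q^{(4)} = 0$, and the canonical variables become

$$Q_1 = q, \qquad Q_2 = \dot{q}, \qquad P_1 = -q^{(3)}, \qquad P_2 = \ddot{q}.$$

Inverting the last relation gives $a(Q_1, Q_2, P_2) = P_2$, so that

$$H = P_1 Q_2 + P_2 \cdot P_2 - \tfrac{1}{2} P_2^2 = P_1 Q_2 + \tfrac{1}{2} P_2^2 .$$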

We now notice that the Hamiltonian is linear in $P_1$, so it is not bounded from below: for fixed $Q_2 \neq 0$ it can be made arbitrarily negative by varying $P_1$. This is a source of the Ostrogradsky instability, and it stems from the fact that the Lagrangian depends on fewer coordinates than there are canonical coordinates (which correspond to the initial parameters needed to specify the problem). The extension to higher-dimensional systems is analogous, and the extension to higher derivatives simply means that the phase space is of even higher dimension than the configuration space.
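
The linearity in $P_1$ can be checked numerically with a minimal sketch. It assumes the illustrative Lagrangian $L = \tfrac{1}{2}\ddot{q}^2$ (not taken from the source), for which the construction above gives $H = P_1 Q_2 + \tfrac{1}{2}P_2^2$ and the equation of motion $q^{(4)} = 0$ is solved by cubic polynomials in $t$. The function names `ostrogradsky_state` and `hamiltonian` are hypothetical helpers introduced only for this example.

```python
def ostrogradsky_state(t, c0, c1, c2, c3):
    """Canonical variables along q(t) = c0 + c1*t + c2*t^2/2 + c3*t^3/6,
    the general solution of q'''' = 0 for the assumed L = q_ddot^2 / 2."""
    q = c0 + c1 * t + c2 * t**2 / 2 + c3 * t**3 / 6
    q_dot = c1 + c2 * t + c3 * t**2 / 2
    q_ddot = c2 + c3 * t
    q_dddot = c3
    Q1 = q
    Q2 = q_dot
    P1 = -q_dddot  # dL/dq_dot - d/dt(dL/dq_ddot) = 0 - q_dddot
    P2 = q_ddot    # dL/dq_ddot
    return Q1, Q2, P1, P2


def hamiltonian(Q1, Q2, P1, P2):
    """Ostrogradsky Hamiltonian for the assumed L = q_ddot^2 / 2."""
    return P1 * Q2 + 0.5 * P2**2


if __name__ == "__main__":
    # H is conserved along a solution (analytically c2^2/2 - c1*c3) ...
    for t in (0.0, 1.0, 2.0):
        print(t, hamiltonian(*ostrogradsky_state(t, 1.0, 2.0, 3.0, 4.0)))
    # ... yet it decreases without bound as P1 -> -infinity at fixed Q2 > 0.
    for P1 in (-1.0, -1e3, -1e6):
        print(P1, hamiltonian(0.0, 1.0, P1, 0.0))
```

Conservation of $H$ along each trajectory does not contradict the instability: the point is that the energy surface is unbounded in the $P_1$ direction, so no lower bound on $H$ confines the motion.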
