Zermelo's navigation problem

From testwiki
Jump to navigation Jump to search

Template:Short description In mathematical optimization, Zermelo's navigation problem, proposed in 1931 by Ernst Zermelo, is a classic optimal control problem that deals with a boat navigating on a body of water, originating from a point A to a destination point B. The boat is capable of a certain maximum speed, and the goal is to derive the best possible control to reach B in the least possible time.

Zermelo Navigation with velocity 𝐯under constant wind 𝐰

Without considering external forces such as current and wind, the optimal control is for the boat to always head towards B. Its path then is a line segment from A to B, which is trivially optimal. With consideration of current and wind, if the combined force applied to the boat is non-zero the control for no current and wind does not yield the optimal path.

History

In his 1931 article,[1] Ernst Zermelo formulates the following problem: Template:Quote

Ernst Zermelo formulated and solved the general problem

This is an extension of the classical optimisation problem for geodesics – minimising the length of a curve I[c]=ab1+y'2dx connecting points A and B , with the added complexity of considering some wind velocity. Although it is usually impossible to find an exact solution in most cases, the general case was solved by Zermelo himself in the form of a partial differential equation, known as Zermelo's equation, which can be numerically solved.

The problem of navigating an airship which is surrounded by air, was presented first in 1929 at a conference by Ernst Zermelo. Other mathematicians have answered the challenge over the following years. The dominant technique for solving the equations is the calculus of variations.[2]

Constant-wind case

The case of constant wind is easy to solve exactly.[3] Let 𝐝=ABβ†’, and suppose that to minimise the travel time the ship travels at a constant maximum speed V. Thus the position of the ship at time t is 𝐱=t(𝐯+𝐰). Let T be the time of arrival at B, so that 𝐝=T(𝐯+𝐰). Taking the dot product of this with 𝐰 and 𝐝 respectively results in dβ†’wβ†’=T(𝐯wβ†’+𝐰2) and d2=T2(v2+2v→𝐰+𝐰2). Eliminating vβ†’wβ†’ and writing this system as a quadratic in T results in (vβ†’2wβ†’2)T2+2(𝐝𝐰)T𝐝2=0. Upon solving this, taking the positive square-root since T is positive, we obtain

T[𝐝]=2(𝐝𝐰)±4(𝐝𝐰)2+4𝐝2(𝐯2𝐰2)2(𝐯2𝐰2)=𝐝2𝐯2wβ†’2+(𝐝𝐰)2(vβ†’2wβ†’2)2𝐝𝐰𝐯2𝐰2

Claim: This defines a metric on ℝ2, provided |𝐯|>|𝐰|.

Proof

By our assumption, clearly T[𝐝]0 with equality if and only if 𝐝=0. Trivially if 𝐝~=BAβ†’, we have T[𝐝]=T[𝐝~]. It remains to show T satisfies a triangle inequality T[𝐝1+𝐝2]T[𝐝1]+T[𝐝2].

Indeed, letting c2:=𝐯2𝐰2, we note that this is true if and only if

(𝐝1+𝐝2)2c2+((dβ†’1+dβ†’2)wβ†’)2c4(𝐝1+𝐝2)𝐰c2𝐝12c2+(𝐝1𝐰)2c4𝐝2𝐰c2+𝐝22c2+(𝐝2𝐰)2c4𝐝2𝐰c2

if and only if

𝐝1𝐝2c2+(𝐝1𝐰)(𝐝2𝐰)c4[dβ†’12c2+(𝐝1𝐰)2c4]1/2[dβ†’22c2+(𝐝2𝐰)2c4]1/2,

which is true if and only if

(𝐝1𝐝2)2c4+2(𝐝1𝐝2)(𝐝1𝐰)(𝐝2𝐰)c6𝐝12𝐝22c4+𝐝12(𝐝2𝐰)2+𝐝22(𝐝1𝐰)2c6

Using the Cauchy–Schwarz inequality, we obtain (𝐝1𝐝2)2𝐝12𝐝22 with equality if and only if 𝐝1 and 𝐝2 are linearly dependent, and so the inequality is indeed true.

Note: Since this is a strict inequality if 𝐝1 and 𝐝2 are not linearly dependent, it immediately follows that a straight line from A to B is always a faster path than any other path made up of straight line segments. We use a limiting argument to prove this is true for any curve.

General solution

Consider the general example of a ship moving against a variable wind wβ†’(x,y). Writing this component-wise, we have the drift in the x-axis as u(x,y) and the drift in the y-axis as v(x,y). Then for a ship moving at maximum velocity V at variable heading θ, we have

xΛ™=Vcosθ+u(x,y)yΛ™=Vsinθ+v(x,y)

The Hamiltonian of the system is thus

H=λx(Vcosθ+u)+λy(Vsinθ+v)+1

Using the Euler–Lagrange equation, we obtain

λΛ™x=Hx=λxuxλyvxλΛ™y=Hy=λxuyλyvy0=Hθ=V(λxsinθ+λycosθ)

The last equation implies that tanθ=λy/λx. We note that the system is autonomous; the Hamiltonian does not depend on time t, thus H = constant, but since we are minimising time, the constant is equal to 0. Thus we can solve the simultaneous equations above to get[4]

λx=cosθV+ucosθ+vsinθλy=sinθV+ucosθ+vsinθ

Substituting these values into our EL-equations results in the differential equation

dθdt=sin2θvx+sinθcosθ(uxvy)cos2θuy

This result is known as Zermelo's equation. Solving this with our system allows us to find the general optimum path.

Constant-wind revisited example

If we go back to the constant wind problem 𝐰 for all time, we have

vy=vx=ux=uy=0

so our general solution implies dθdt=0, thus θ is constant, i.e. the optimum path is a straight line, as we had obtained before with an algebraic argument.

References

Template:Reflist