Jacobian matrix and determinant

From testwiki
Jump to navigation Jump to search

Template:Short description Template:Redirect {{#invoke:sidebar|collapsible | class = plainlist | titlestyle = padding-bottom:0.25em; | pretitle = Part of a series of articles about | title = Calculus | image = abf(t)dt=f(b)f(a) | listtitlestyle = text-align:center; | liststyle = border-top:1px solid #aaa;padding-top:0.15em;border-bottom:1px solid #aaa; | expanded = multivariable | abovestyle = padding:0.15em 0.25em 0.3em;font-weight:normal; | above =

Template:Startflatlist

Template:EndflatlistTemplate:Startflatlist

Template:Endflatlist

| list2name = differential | list2titlestyle = display:block;margin-top:0.65em; | list2title = Template:Bigger | list2 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | heading1 = Definitions
 | content1 =
 | heading2 = Concepts
 | content2 =
 | heading3 = Rules and identities
 | content3 =
}}

| list3name = integral | list3title = Template:Bigger | list3 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =

| heading2 = Definitions

 | content2 =
 | heading3 = Integration by
 | content3 =
}}

| list4name = series | list4title = Template:Bigger | list4 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
 | heading2 = Convergence tests
 | content2 =
}}

| list5name = vector | list5title = Template:Bigger | list5 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
 | heading2 = Theorems
 | content2 =
}}

| list6name = multivariable | list6title = Template:Bigger | list6 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | heading1 = Formalisms
 | content1 =
 | heading2 = Definitions
 | content2 =
}}

| list7name = advanced | list7title = Template:Bigger | list7 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
}}

| list8name = specialized | list8title = Template:Bigger | list8 =

| list9name = miscellanea | list9title = Template:Bigger | list9 =

}}

In vector calculus, the Jacobian matrix (Template:IPAc-en,[1][2][3] Template:IPAc-en) of a vector-valued function of several variables is the matrix of all its first-order partial derivatives. When this matrix is square, that is, when the function takes the same number of variables as input as the number of vector components of its output, its determinant is referred to as the Jacobian determinant. Both the matrix and (if applicable) the determinant are often referred to simply as the Jacobian in literature.[4] They are named after Carl Gustav Jacob Jacobi.

Motivation

The Jacobian can be understood by considering a unit area in the new coordinate space; and examining how that unit area transforms when mapped into xy coordinate space in which the integral is visually understood.[5][6][7] The process involves taking partial derivatives with respect to the new coordinates, then applying the determinant and hence obtaining the Jacobian.

Definition

Suppose Template:Math is a function such that each of its first-order partial derivatives exists on Template:Math. This function takes a point Template:Math as input and produces the vector Template:Math as output. Then the Jacobian matrix of Template:Math, denoted Template:Math, is defined such that its Template:Math entry is fixj, or explicitly 𝐉𝐟=[𝐟x1𝐟xn]=[𝖳f1𝖳fm]=[f1x1f1xnfmx1fmxn] where 𝖳fi is the transpose (row vector) of the gradient of the i-th component.

The Jacobian matrix, whose entries are functions of Template:Math, is denoted in various ways; other common notations include Template:Math, 𝐟, and (f1,,fm)(x1,,xn).[8][9] Some authors define the Jacobian as the transpose of the form given above.

The Jacobian matrix represents the differential of Template:Math at every point where Template:Math is differentiable. In detail, if Template:Math is a displacement vector represented by a column matrix, the matrix product Template:Math is another displacement vector, that is the best linear approximation of the change of Template:Math in a neighborhood of Template:Math, if Template:Math is differentiable at Template:Math.Template:Efn This means that the function that maps Template:Math to Template:Math is the best linear approximation of Template:Math for all points Template:Math close to Template:Math. The linear map Template:Math is known as the derivative or the differential of Template:Math at Template:Math.

When Template:Math, the Jacobian matrix is square, so its determinant is a well-defined function of Template:Math, known as the Jacobian determinant of Template:Math. It carries important information about the local behavior of Template:Math. In particular, the function Template:Math has a differentiable inverse function in a neighborhood of a point Template:Math if and only if the Jacobian determinant is nonzero at Template:Math (see inverse function theorem for an explanation of this and Jacobian conjecture for a related problem of global invertibility). The Jacobian determinant also appears when changing the variables in multiple integrals (see substitution rule for multiple variables).

When Template:Math, that is when Template:Math is a scalar-valued function, the Jacobian matrix reduces to the row vector 𝖳f; this row vector of all first-order partial derivatives of Template:Math is the transpose of the gradient of Template:Math, i.e. 𝐉f=𝖳f. Specializing further, when Template:Math, that is when Template:Math is a scalar-valued function of a single variable, the Jacobian matrix has a single entry; this entry is the derivative of the function Template:Math.

These concepts are named after the mathematician Carl Gustav Jacob Jacobi (1804–1851).

Jacobian matrix

The Jacobian of a vector-valued function in several variables generalizes the gradient of a scalar-valued function in several variables, which in turn generalizes the derivative of a scalar-valued function of a single variable. In other words, the Jacobian matrix of a scalar-valued function in several variables is (the transpose of) its gradient and the gradient of a scalar-valued function of a single variable is its derivative.

At each point where a function is differentiable, its Jacobian matrix can also be thought of as describing the amount of "stretching", "rotating" or "transforming" that the function imposes locally near that point. For example, if Template:Math is used to smoothly transform an image, the Jacobian matrix Template:Math, describes how the image in the neighborhood of Template:Math is transformed.

If a function is differentiable at a point, its differential is given in coordinates by the Jacobian matrix. However, a function does not need to be differentiable for its Jacobian matrix to be defined, since only its first-order partial derivatives are required to exist.

If Template:Math is differentiable at a point Template:Math in Template:Math, then its differential is represented by Template:Math. In this case, the linear transformation represented by Template:Math is the best linear approximation of Template:Math near the point Template:Math, in the sense that

𝐟(𝐱)𝐟(𝐩)=𝐉𝐟(𝐩)(𝐱𝐩)+o(𝐱𝐩)(as 𝐱𝐩),

where Template:Math is a quantity that approaches zero much faster than the distance between Template:Math and Template:Math does as Template:Math approaches Template:Math. This approximation specializes to the approximation of a scalar function of a single variable by its Taylor polynomial of degree one, namely

f(x)f(p)=f(p)(xp)+o(xp)(as xp).

In this sense, the Jacobian may be regarded as a kind of "first-order derivative" of a vector-valued function of several variables. In particular, this means that the gradient of a scalar-valued function of several variables may too be regarded as its "first-order derivative".

Composable differentiable functions Template:Math and Template:Math satisfy the chain rule, namely 𝐉𝐠𝐟(𝐱)=𝐉𝐠(𝐟(𝐱))𝐉𝐟(𝐱) for Template:Math in Template:Math.

The Jacobian of the gradient of a scalar function of several variables has a special name: the Hessian matrix, which in a sense is the "second derivative" of the function in question.

Jacobian determinant

A nonlinear map f:22 sends a small square (left, in red) to a distorted parallelogram (right, in red). The Jacobian at a point gives the best linear approximation of the distorted parallelogram near that point (right, in translucent white), and the Jacobian determinant gives the ratio of the area of the approximating parallelogram to that of the original square.

If Template:Math, then Template:Math is a function from Template:Math to itself and the Jacobian matrix is a square matrix. We can then form its determinant, known as the Jacobian determinant. The Jacobian determinant is sometimes simply referred to as "the Jacobian".

The Jacobian determinant at a given point gives important information about the behavior of Template:Math near that point. For instance, the continuously differentiable function Template:Math is invertible near a point Template:Math if the Jacobian determinant at Template:Math is non-zero. This is the inverse function theorem. Furthermore, if the Jacobian determinant at Template:Math is positive, then Template:Math preserves orientation near Template:Math; if it is negative, Template:Math reverses orientation. The absolute value of the Jacobian determinant at Template:Math gives us the factor by which the function Template:Math expands or shrinks volumes near Template:Math; this is why it occurs in the general substitution rule.

The Jacobian determinant is used when making a change of variables when evaluating a multiple integral of a function over a region within its domain. To accommodate for the change of coordinates the magnitude of the Jacobian determinant arises as a multiplicative factor within the integral. This is because the Template:Math-dimensional Template:Math element is in general a parallelepiped in the new coordinate system, and the Template:Math-volume of a parallelepiped is the determinant of its edge vectors.

The Jacobian can also be used to determine the stability of equilibria for systems of differential equations by approximating behavior near an equilibrium point.

Inverse

According to the inverse function theorem, the matrix inverse of the Jacobian matrix of an invertible function Template:Math is the Jacobian matrix of the inverse function. That is, the Jacobian matrix of the inverse function at a point Template:Math is

𝐉𝐟1(𝐩)=𝐉𝐟1(𝐟1(𝐩)),

and the Jacobian determinant is

det(𝐉𝐟1(𝐩))=1det(𝐉𝐟(𝐟1(𝐩))).

If the Jacobian is continuous and nonsingular at the point Template:Math in Template:Math, then Template:Math is invertible when restricted to some neighbourhood of Template:Math. In other words, if the Jacobian determinant is not zero at a point, then the function is locally invertible near this point.

The (unproved) Jacobian conjecture is related to global invertibility in the case of a polynomial function, that is a function defined by n polynomials in n variables. It asserts that, if the Jacobian determinant is a non-zero constant (or, equivalently, that it does not have any complex zero), then the function is invertible and its inverse is a polynomial function.

Critical points

Template:Main

If Template:Math is a differentiable function, a critical point of Template:Math is a point where the rank of the Jacobian matrix is not maximal. This means that the rank at the critical point is lower than the rank at some neighbour point. In other words, let Template:Math be the maximal dimension of the open balls contained in the image of Template:Math; then a point is critical if all minors of rank Template:Math of Template:Math are zero.

In the case where Template:Math, a point is critical if the Jacobian determinant is zero.

Examples

Example 1

Consider a function Template:Math with Template:Math given by

𝐟([xy])=[f1(x,y)f2(x,y)]=[x2y5x+siny].

Then we have

f1(x,y)=x2y

and

f2(x,y)=5x+siny.

The Jacobian matrix of Template:Math is

𝐉𝐟(x,y)=[f1xf1yf2xf2y]=[2xyx25cosy]

and the Jacobian determinant is

det(𝐉𝐟(x,y))=2xycosy5x2.

Example 2: polar-Cartesian transformation

The transformation from polar coordinates Template:Math to Cartesian coordinates (x, y), is given by the function Template:Math with components

x=rcosφ;y=rsinφ.

𝐉𝐅(r,φ)=[xrxφyryφ]=[cosφrsinφsinφrcosφ]

The Jacobian determinant is equal to Template:Math. This can be used to transform integrals between the two coordinate systems:

𝐅(A)f(x,y)dxdy=Af(rcosφ,rsinφ)rdrdφ.

Example 3: spherical-Cartesian transformation

The transformation from spherical coordinates Template:Math[10] to Cartesian coordinates (x, y, z), is given by the function Template:Math with components

x=ρsinφcosθ;y=ρsinφsinθ;z=ρcosφ.

The Jacobian matrix for this coordinate change is

𝐉𝐅(ρ,φ,θ)=[xρxφxθyρyφyθzρzφzθ]=[sinφcosθρcosφcosθρsinφsinθsinφsinθρcosφsinθρsinφcosθcosφρsinφ0].

The determinant is Template:Math. Since Template:Math is the volume for a rectangular differential volume element (because the volume of a rectangular prism is the product of its sides), we can interpret Template:Math as the volume of the spherical differential volume element. Unlike rectangular differential volume element's volume, this differential volume element's volume is not a constant, and varies with coordinates (Template:Math and Template:Math). It can be used to transform integrals between the two coordinate systems:

𝐅(U)f(x,y,z)dxdydz=Uf(ρsinφcosθ,ρsinφsinθ,ρcosφ)ρ2sinφdρdφdθ.

Example 4

The Jacobian matrix of the function Template:Math with components

y1=x1y2=5x3y3=4x222x3y4=x3sinx1

is

𝐉𝐅(x1,x2,x3)=[y1x1y1x2y1x3y2x1y2x2y2x3y3x1y3x2y3x3y4x1y4x2y4x3]=[10000508x22x3cosx10sinx1].

This example shows that the Jacobian matrix need not be a square matrix.

Example 5

The Jacobian determinant of the function Template:Math with components

y1=5x2y2=4x122sin(x2x3)y3=x2x3

is

|0508x12x3cos(x2x3)2x2cos(x2x3)0x3x2|=8x1|50x3x2|=40x1x2.

From this we see that Template:Math reverses orientation near those points where Template:Math and Template:Math have the same sign; the function is locally invertible everywhere except near points where Template:Math or Template:Math. Intuitively, if one starts with a tiny object around the point Template:Math and apply Template:Math to that object, one will get a resulting object with approximately Template:Math times the volume of the original one, with orientation reversed.

Other uses

Dynamical systems

Consider a dynamical system of the form 𝐱˙=F(𝐱), where 𝐱˙ is the (component-wise) derivative of 𝐱 with respect to the evolution parameter t (time), and F:nn is differentiable. If F(𝐱0)=0, then 𝐱0 is a stationary point (also called a steady state). By the Hartman–Grobman theorem, the behavior of the system near a stationary point is related to the eigenvalues of 𝐉F(𝐱0), the Jacobian of F at the stationary point.[11] Specifically, if the eigenvalues all have real parts that are negative, then the system is stable near the stationary point. If any eigenvalue has a real part that is positive, then the point is unstable. If the largest real part of the eigenvalues is zero, the Jacobian matrix does not allow for an evaluation of the stability.[12]

Newton's method

A square system of coupled nonlinear equations can be solved iteratively by Newton's method. This method uses the Jacobian matrix of the system of equations.

Regression and least squares fitting

The Jacobian serves as a linearized design matrix in statistical regression and curve fitting; see non-linear least squares. The Jacobian is also used in random matrices, moments, local sensitivity and statistical diagnostics.[13][14]

See also

Notes

Template:Notelist

References

Template:Reflist

Further reading

Template:Matrix classes