Matrix exponential
Template:Short description Template:Use American English In mathematics, the matrix exponential is a matrix function on square matrices analogous to the ordinary exponential function. It is used to solve systems of linear differential equations. In the theory of Lie groups, the matrix exponential gives the exponential map between a matrix Lie algebra and the corresponding Lie group.
Let Template:Mvar be an Template:Math real or complex matrix. The exponential of Template:Mvar, denoted by Template:Math or Template:Math, is the Template:Math matrix given by the power series
where is defined to be the identity matrix with the same dimensions as , and Template:Tmath.[1] The series always converges, so the exponential of Template:Mvar is well-defined.
Equivalently,
for integer-valued Template:Mvar, where Template:Mvar is the Template:Math identity matrix.
Equivalently, given by the solution to the differential equation
When Template:Mvar is an Template:Math diagonal matrix then Template:Math will be an Template:Math diagonal matrix with each diagonal element equal to the ordinary exponential applied to the corresponding diagonal element of Template:Mvar.
Properties
Elementary properties
Let Template:Math and Template:Math be Template:Math complex matrices and let Template:Math and Template:Math be arbitrary complex numbers. We denote the Template:Math identity matrix by Template:Math and the zero matrix by 0. The matrix exponential satisfies the following properties.[2]
We begin with the properties that are immediate consequences of the definition as a power series:
- Template:Math
- Template:Math, where Template:Math denotes the transpose of Template:Math.
- Template:Math, where Template:Math denotes the conjugate transpose of Template:Math.
- If Template:Math is invertible then Template:Math
The next key result is this one:
- If then .
The proof of this identity is the same as the standard power-series argument for the corresponding identity for the exponential of real numbers. That is to say, as long as and commute, it makes no difference to the argument whether and are numbers or matrices. It is important to note that this identity typically does not hold if and do not commute (see Golden-Thompson inequality below).
Consequences of the preceding identity are the following:
Using the above results, we can easily verify the following claims. If Template:Math is symmetric then Template:Math is also symmetric, and if Template:Math is skew-symmetric then Template:Math is orthogonal. If Template:Math is Hermitian then Template:Math is also Hermitian, and if Template:Math is skew-Hermitian then Template:Math is unitary.
Finally, a Laplace transform of matrix exponentials amounts to the resolvent, for all sufficiently large positive values of Template:Mvar.
Linear differential equation systems
One of the reasons for the importance of the matrix exponential is that it can be used to solve systems of linear ordinary differential equations. The solution of where Template:Mvar is a constant matrix and y is a column vector, is given by
The matrix exponential can also be used to solve the inhomogeneous equation See the section on applications below for examples.
There is no closed-form solution for differential equations of the form where Template:Mvar is not constant, but the Magnus series gives the solution as an infinite sum.
The determinant of the matrix exponential
By Jacobi's formula, for any complex square matrix the following trace identity holds:[3]
In addition to providing a computational tool, this formula demonstrates that a matrix exponential is always an invertible matrix. This follows from the fact that the right hand side of the above equation is always non-zero, and so Template:Math, which implies that Template:Math must be invertible.
In the real-valued case, the formula also exhibits the map to not be surjective, in contrast to the complex case mentioned earlier. This follows from the fact that, for real-valued matrices, the right-hand side of the formula is always positive, while there exist invertible matrices with a negative determinant.
Real symmetric matrices
The matrix exponential of a real symmetric matrix is positive definite. Let be an Template:Math real symmetric matrix and a column vector. Using the elementary properties of the matrix exponential and of symmetric matrices, we have:
Since is invertible, the equality only holds for , and we have for all non-zero . Hence is positive definite.
The exponential of sums
For any real numbers (scalars) Template:Mvar and Template:Mvar we know that the exponential function satisfies Template:Math. The same is true for commuting matrices. If matrices Template:Mvar and Template:Mvar commute (meaning that Template:Math), then,
However, for matrices that do not commute the above equality does not necessarily hold.
The Lie product formula
Even if Template:Mvar and Template:Mvar do not commute, the exponential Template:Math can be computed by the Lie product formula[4]
Using a large finite Template:Mvar to approximate the above is basis of the Suzuki-Trotter expansion, often used in numerical time evolution.
The Baker–Campbell–Hausdorff formula
In the other direction, if Template:Mvar and Template:Mvar are sufficiently small (but not necessarily commuting) matrices, we have where Template:Mvar may be computed as a series in commutators of Template:Mvar and Template:Mvar by means of the Baker–Campbell–Hausdorff formula:[5] where the remaining terms are all iterated commutators involving Template:Mvar and Template:Mvar. If Template:Mvar and Template:Mvar commute, then all the commutators are zero and we have simply Template:Math.
Inequalities for exponentials of Hermitian matrices
Template:Main For Hermitian matrices there is a notable theorem related to the trace of matrix exponentials.
If Template:Mvar and Template:Mvar are Hermitian matrices, then[6]
There is no requirement of commutativity. There are counterexamples to show that the Golden–Thompson inequality cannot be extended to three matrices – and, in any event, Template:Math is not guaranteed to be real for Hermitian Template:Math, Template:Math, Template:Math. However, Lieb proved[7][8] that it can be generalized to three matrices if we modify the expression as follows
The exponential map
The exponential of a matrix is always an invertible matrix. The inverse matrix of Template:Math is given by Template:Math. This is analogous to the fact that the exponential of a complex number is always nonzero. The matrix exponential then gives us a map from the space of all n × n matrices to the general linear group of degree Template:Mvar, i.e. the group of all n × n invertible matrices. In fact, this map is surjective which means that every invertible matrix can be written as the exponential of some other matrix[9] (for this, it is essential to consider the field C of complex numbers and not R).
For any two matrices Template:Mvar and Template:Mvar,
where Template:Math denotes an arbitrary matrix norm. It follows that the exponential map is continuous and Lipschitz continuous on compact subsets of Template:Math.
The map defines a smooth curve in the general linear group which passes through the identity element at Template:Math.
In fact, this gives a one-parameter subgroup of the general linear group since
The derivative of this curve (or tangent vector) at a point t is given by Template:NumBlk The derivative at Template:Math is just the matrix X, which is to say that X generates this one-parameter subgroup.
More generally,[10] for a generic Template:Mvar-dependent exponent, Template:Math,
Taking the above expression Template:Math outside the integral sign and expanding the integrand with the help of the Hadamard lemma one can obtain the following useful expression for the derivative of the matrix exponent,[11]
The coefficients in the expression above are different from what appears in the exponential. For a closed form, see derivative of the exponential map.
Directional derivatives when restricted to Hermitian matrices
Let be a Hermitian matrix with distinct eigenvalues. Let be its eigen-decomposition where is a unitary matrix whose columns are the eigenvectors of , is its conjugate transpose, and the vector of corresponding eigenvalues. Then, for any Hermitian matrix , the directional derivative of at in the direction is [12] [13] where , the operator denotes the Hadamard product, and, for all , the matrix is defined as In addition, for any Hermitian matrix , the second directional derivative in directions and is[13] where the matrix-valued function is defined, for all , as with
Computing the matrix exponential
Finding reliable and accurate methods to compute the matrix exponential is difficult, and this is still a topic of considerable current research in mathematics and numerical analysis. Matlab, GNU Octave, R, and SciPy all use the Padé approximant.[14][15][16][17] In this section, we discuss methods that are applicable in principle to any matrix, and which can be carried out explicitly for small matrices.[18] Subsequent sections describe methods suitable for numerical evaluation on large matrices.
Diagonalizable case
If a matrix is diagonal: then its exponential can be obtained by exponentiating each entry on the main diagonal:
This result also allows one to exponentiate diagonalizable matrices. If Template:Block indent then Template:Block indent which is especially easy to compute when Template:Math is diagonal.
Application of Sylvester's formula yields the same result. (To see this, note that addition and multiplication, hence also exponentiation, of diagonal matrices is equivalent to element-wise addition and multiplication, and hence exponentiation; in particular, the "one-dimensional" exponentiation is felt element-wise for the diagonal case.)
Example : Diagonalizable
For example, the matrix can be diagonalized as
Thus,
Nilpotent case
A matrix Template:Mvar is nilpotent if Template:Math for some integer q. In this case, the matrix exponential Template:Math can be computed directly from the series expansion, as the series terminates after a finite number of terms:
Since the series has a finite number of steps, it is a matrix polynomial, which can be computed efficiently.
General case
Using the Jordan–Chevalley decomposition
By the Jordan–Chevalley decomposition, any matrix X with complex entries can be expressed as where
- A is diagonalizable
- N is nilpotent
- A commutes with N
This means that we can compute the exponential of X by reducing to the previous two cases:
Note that we need the commutativity of A and N for the last step to work.
Using the Jordan canonical form
A closely related method is, if the field is algebraically closed, to work with the Jordan form of Template:Mvar. Suppose that Template:Math where Template:Mvar is the Jordan form of Template:Mvar. Then
Also, since
Therefore, we need only know how to compute the matrix exponential of a Jordan block. But each Jordan block is of the form
where Template:Mvar is a special nilpotent matrix. The matrix exponential of Template:Mvar is then given by
Projection case
If Template:Mvar is a projection matrix (i.e. is idempotent: Template:Math), its matrix exponential is: Template:Block indent
Deriving this by expansion of the exponential function, each power of Template:Mvar reduces to Template:Mvar which becomes a common factor of the sum:
Rotation case
For a simple rotation in which the perpendicular unit vectors Template:Math and Template:Math specify a plane,[19] the rotation matrix Template:Mvar can be expressed in terms of a similar exponential function involving a generator Template:Mvar and angle Template:Mvar.[20][21]
The formula for the exponential results from reducing the powers of Template:Mvar in the series expansion and identifying the respective series coefficients of Template:Math and Template:Mvar with Template:Math and Template:Math respectively. The second expression here for Template:Math is the same as the expression for Template:Math in the article containing the derivation of the generator, Template:Math.
In two dimensions, if and , then , , and reduces to the standard matrix for a plane rotation.
The matrix Template:Math projects a vector onto the Template:Math-plane and the rotation only affects this part of the vector. An example illustrating this is a rotation of Template:Math in the plane spanned by Template:Math and Template:Math,
Let Template:Math, so Template:Math and its products with Template:Math and Template:Math are zero. This will allow us to evaluate powers of Template:Math.
Evaluation by Laurent series
By virtue of the Cayley–Hamilton theorem the matrix exponential is expressible as a polynomial of order Template:Mvar−1.
If Template:Mvar and Template:Math are nonzero polynomials in one variable, such that Template:Math, and if the meromorphic function is entire, then To prove this, multiply the first of the two above equalities by Template:Math and replace Template:Mvar by Template:Mvar.
Such a polynomial Template:Math can be found as follows−see Sylvester's formula. Letting Template:Mvar be a root of Template:Mvar, Template:Math is solved from the product of Template:Mvar by the principal part of the Laurent series of Template:Mvar at Template:Mvar: It is proportional to the relevant Frobenius covariant. Then the sum St of the Qa,t, where Template:Mvar runs over all the roots of Template:Mvar, can be taken as a particular Template:Math. All the other Qt will be obtained by adding a multiple of Template:Mvar to Template:Math. In particular, Template:Math, the Lagrange-Sylvester polynomial, is the only Template:Math whose degree is less than that of Template:Mvar.
Example: Consider the case of an arbitrary Template:Math matrix,
The exponential matrix Template:Math, by virtue of the Cayley–Hamilton theorem, must be of the form
(For any complex number Template:Mvar and any C-algebra Template:Mvar, we denote again by Template:Mvar the product of Template:Mvar by the unit of Template:Mvar.)
Let Template:Mvar and Template:Mvar be the roots of the characteristic polynomial of Template:Mvar,
Then we have hence
if Template:Math; while, if Template:Math,
so that
Defining
we have
where Template:Math is 0 if Template:Math, and Template:Mvar if Template:Math.
Thus,
Thus, as indicated above, the matrix Template:Mvar having decomposed into the sum of two mutually commuting pieces, the traceful piece and the traceless piece,
the matrix exponential reduces to a plain product of the exponentials of the two respective pieces. This is a formula often used in physics, as it amounts to the analog of Euler's formula for Pauli spin matrices, that is rotations of the doublet representation of the group SU(2).
The polynomial Template:Math can also be given the following "interpolation" characterization. Define Template:Math, and Template:Math. Then Template:Math is the unique degree Template:Math polynomial which satisfies Template:Math whenever Template:Mvar is less than the multiplicity of Template:Mvar as a root of Template:Mvar. We assume, as we obviously can, that Template:Mvar is the minimal polynomial of Template:Mvar. We further assume that Template:Mvar is a diagonalizable matrix. In particular, the roots of Template:Mvar are simple, and the "interpolation" characterization indicates that Template:Math is given by the Lagrange interpolation formula, so it is the Lagrange−Sylvester polynomial.
At the other extreme, if Template:Math, then
The simplest case not covered by the above observations is when with Template:Math, which yields
Evaluation by implementation of Sylvester's formula
A practical, expedited computation of the above reduces to the following rapid steps. Recall from above that an Template:Math matrix Template:Math amounts to a linear combination of the first Template:Mvar−1 powers of Template:Mvar by the Cayley–Hamilton theorem. For diagonalizable matrices, as illustrated above, e.g. in the Template:Math case, Sylvester's formula yields Template:Math, where the Template:Mvars are the Frobenius covariants of Template:Mvar.
It is easiest, however, to simply solve for these Template:Mvars directly, by evaluating this expression and its first derivative at Template:Math, in terms of Template:Mvar and Template:Mvar, to find the same answer as above.
But this simple procedure also works for defective matrices, in a generalization due to Buchheim.[22] This is illustrated here for a Template:Math example of a matrix which is not diagonalizable, and the Template:Mvars are not projection matrices.
Consider with eigenvalues Template:Math and Template:Math, each with a multiplicity of two.
Consider the exponential of each eigenvalue multiplied by Template:Mvar, Template:Math. Multiply each exponentiated eigenvalue by the corresponding undetermined coefficient matrix Template:Math. If the eigenvalues have an algebraic multiplicity greater than 1, then repeat the process, but now multiplying by an extra factor of Template:Mvar for each repetition, to ensure linear independence.
(If one eigenvalue had a multiplicity of three, then there would be the three terms: . By contrast, when all eigenvalues are distinct, the Template:Mvars are just the Frobenius covariants, and solving for them as below just amounts to the inversion of the Vandermonde matrix of these 4 eigenvalues.)
Sum all such terms, here four such,
To solve for all of the unknown matrices Template:Mvar in terms of the first three powers of Template:Mvar and the identity, one needs four equations, the above one providing one such at Template:Mvar = 0. Further, differentiate it with respect to Template:Mvar,
and again,
and once more,
(In the general case, Template:Mvar−1 derivatives need be taken.)
Setting Template:Mvar = 0 in these four equations, the four coefficient matrices Template:Mvars may now be solved for,
to yield
Substituting with the value for Template:Mvar yields the coefficient matrices
so the final answer is
The procedure is much shorter than Putzer's algorithm sometimes utilized in such cases.
Illustrations
Suppose that we want to compute the exponential of
Its Jordan form is where the matrix P is given by
Let us first calculate exp(J). We have
The exponential of a Template:Math matrix is just the exponential of the one entry of the matrix, so Template:Math. The exponential of J2(16) can be calculated by the formula Template:Math mentioned above; this yields[23]
Therefore, the exponential of the original matrix Template:Mvar is
Applications
Linear differential equations
The matrix exponential has applications to systems of linear differential equations. (See also matrix differential equation.) Recall from earlier in this article that a homogeneous differential equation of the form has solution Template:Math.
If we consider the vector we can express a system of inhomogeneous coupled linear differential equations as Making an ansatz to use an integrating factor of Template:Math and multiplying throughout, yields
The second step is possible due to the fact that, if Template:Math, then Template:Math. So, calculating Template:Math leads to the solution to the system, by simply integrating the third step with respect to Template:Mvar.
A solution to this can be obtained by integrating and multiplying by to eliminate the exponent in the LHS. Notice that while is a matrix, given that it is a matrix exponential, we can say that . In other words, .
Example (homogeneous)
Consider the system
The associated defective matrix is
The matrix exponential is
so that the general solution of the homogeneous system is
amounting to
Example (inhomogeneous)
Consider now the inhomogeneous system
We again have
and
From before, we already have the general solution to the homogeneous equation. Since the sum of the homogeneous and particular solutions give the general solution to the inhomogeneous problem, we now only need find the particular solution.
We have, by above, which could be further simplified to get the requisite particular solution determined through variation of parameters. Note c = yp(0). For more rigor, see the following generalization.
Inhomogeneous case generalization: variation of parameters
For the inhomogeneous case, we can use integrating factors (a method akin to variation of parameters). We seek a particular solution of the form Template:Math,
For Template:Math to be a solution,
Thus, where Template:Math is determined by the initial conditions of the problem.
More precisely, consider the equation
with the initial condition Template:Math, where
- Template:Mvar is an Template:Mvar by Template:Mvar complex matrix,
- Template:Mvar is a continuous function from some open interval Template:Mvar to Template:Math,
- is a point of Template:Mvar, and
- is a vector of Template:Math.
Left-multiplying the above displayed equality by Template:Math yields
We claim that the solution to the equation
with the initial conditions for Template:Math is
where the notation is as follows:
- is a monic polynomial of degree Template:Math,
- Template:Mvar is a continuous complex valued function defined on some open interval Template:Mvar,
- is a point of Template:Mvar,
- is a complex number, and
Template:Math is the coefficient of in the polynomial denoted by in Subsection Evaluation by Laurent series above.
To justify this claim, we transform our order Template:Mvar scalar equation into an order one vector equation by the usual reduction to a first order system. Our vector equation takes the form where Template:Mvar is the transpose companion matrix of Template:Mvar. We solve this equation as explained above, computing the matrix exponentials by the observation made in Subsection Evaluation by implementation of Sylvester's formula above.
In the case Template:Mvar = 2 we get the following statement. The solution to
is
where the functions Template:Math and Template:Math are as in Subsection Evaluation by Laurent series above.
Matrix-matrix exponentials
The matrix exponential of another matrix (matrix-matrix exponential),[24] is defined as for any normal and non-singular Template:Math matrix Template:Mvar, and any complex Template:Math matrix Template:Mvar.
For matrix-matrix exponentials, there is a distinction between the left exponential Template:Mvar and the right exponential Template:Mvar, because the multiplication operator for matrix-to-matrix is not commutative. Moreover,
- If Template:Mvar is normal and non-singular, then Template:Mvar and Template:Mvar have the same set of eigenvalues.
- If Template:Mvar is normal and non-singular, Template:Mvar is normal, and Template:Math, then Template:Math.
- If Template:Mvar is normal and non-singular, and Template:Mvar, Template:Mvar, Template:Mvar commute with each other, then Template:Math and Template:Math.
See also
- Matrix function
- Matrix logarithm
- C0-semigroup
- Exponential function
- Exponential map (Lie theory)
- Magnus expansion
- Derivative of the exponential map
- Vector flow
- Golden–Thompson inequality
- Phase-type distribution
- Lie product formula
- Baker–Campbell–Hausdorff formula
- Frobenius covariant
- Sylvester's formula
- Trigonometric functions of matrices
References
- Template:Citation
- Template:Cite book.
- Template:Cite journal.
- Template:Cite journal
- Template:Cite journal
- Template:Cite book
- Template:Cite journal
External links
- ↑ Template:Harvnb Equation 2.1
- ↑ Template:Harvnb Proposition 2.3
- ↑ Template:Harvnb Theorem 2.12
- ↑ Template:Harvnb Theorem 2.11
- ↑ Template:Harvnb Chapter 5
- ↑ Template:Cite book
- ↑ Template:Cite journal
- ↑ Template:Cite journal
- ↑ Template:Harvnb Exercises 2.9 and 2.10
- ↑ Template:Cite journal
- ↑ Template:Harvnb Theorem 5.4
- ↑ Template:Cite journal See Theorem 3.3.
- ↑ 13.0 13.1 Template:Cite journal See Propositions 1 and 2.
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ Template:Cite web
- ↑ See Template:Harvnb Section 2.2
- ↑ in a Euclidean space
- ↑ Template:Cite book
- ↑ Template:Cite book
- ↑ Rinehart, R. F. (1955). "The equivalence of definitions of a matric function". The American Mathematical Monthly, 62 (6), 395-414.
- ↑ This can be generalized; in general, the exponential of Template:Math is an upper triangular matrix with Template:Math on the main diagonal, Template:Math on the one above, Template:Math on the next one, and so on.
- ↑ Template:Cite web