Sparse identification of non-linear dynamics


Sparse identification of nonlinear dynamics (SINDy) is a data-driven algorithm for identifying the governing equations of dynamical systems from data.^[1] Given a series of snapshots of a dynamical system and the corresponding time derivatives, SINDy performs a sparsity-promoting regression (such as LASSO or sparse Bayesian inference^[2]) of the derivatives onto a library of nonlinear candidate functions of the snapshots to find the governing equations. This procedure relies on the assumption that, given an appropriately selected coordinate system and good-quality training data, the dynamics of most physical systems are dictated by only a few dominant terms.^[3] It has been applied to identify the dynamics of fluids, based on proper orthogonal decomposition, as well as other complex dynamical systems, such as biological networks.^[4]

Mathematical Overview

First, consider a dynamical system of the form

$\dot{𝐱} = \frac{d}{d t} 𝐱 (t) = 𝐟 (𝐱 (t)),$

where $𝐱 (t) \in ℝ^{n}$ is a state vector (snapshot) of the system at time $t$ and the function $𝐟 (𝐱 (t))$ defines the equations of motion and constraints of the system. The time derivative may be either prescribed or numerically approximated from the snapshots.
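When the derivatives are not measured directly, they can be estimated from the snapshots. The following is a minimal sketch, assuming equally spaced, noise-free samples and using central finite differences; the function name `finite_difference` and the test signal are illustrative choices, not part of SINDy itself. Noisy data would usually call for a smoothing or regularized differentiation scheme instead.

```python
import math


def finite_difference(snapshots, dt):
    """Approximate dx/dt for equally spaced snapshots.

    snapshots: list of m state vectors (each a list of n floats), m >= 2.
    dt: constant time step between consecutive snapshots.
    Returns an m x n nested list of derivative estimates.
    """
    m = len(snapshots)
    n = len(snapshots[0])
    derivs = []
    for i in range(m):
        if i == 0:
            # One-sided (forward) difference at the first sample.
            row = [(snapshots[1][j] - snapshots[0][j]) / dt for j in range(n)]
        elif i == m - 1:
            # One-sided (backward) difference at the last sample.
            row = [(snapshots[-1][j] - snapshots[-2][j]) / dt for j in range(n)]
        else:
            # Second-order central difference in the interior.
            row = [(snapshots[i + 1][j] - snapshots[i - 1][j]) / (2 * dt)
                   for j in range(n)]
        derivs.append(row)
    return derivs


# Illustrative signal: x(t) = [sin t, cos t], whose derivative is [cos t, -sin t].
dt = 0.01
X = [[math.sin(k * dt), math.cos(k * dt)] for k in range(200)]
X_dot = finite_difference(X, dt)
```

Each row of `X` and `X_dot` corresponds to one sampling time, matching the row-per-snapshot layout used for the data matrices.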

With $𝐱$ and $\dot{𝐱}$ sampled at $m$ equidistant points in time ( $t_{1}, t_{2}, \dots, t_{m}$ ), these can be arranged into matrices of the form

$𝐗 = \begin{bmatrix} 𝐱^{T}(t_{1}) \\ 𝐱^{T}(t_{2}) \\ \vdots \\ 𝐱^{T}(t_{m}) \end{bmatrix} = \begin{bmatrix} x_{1}(t_{1}) & x_{2}(t_{1}) & \dots & x_{n}(t_{1}) \\ x_{1}(t_{2}) & x_{2}(t_{2}) & \dots & x_{n}(t_{2}) \\ \vdots & \vdots & \ddots & \vdots \\ x_{1}(t_{m}) & x_{2}(t_{m}) & \dots & x_{n}(t_{m}) \end{bmatrix},$

and similarly for $\dot{𝐗}$.

Next, a library $𝜣 (𝐗)$ of nonlinear candidate functions of the columns of $𝐗$ is constructed. The candidates may include constant, polynomial, and more exotic functions, such as trigonometric or rational terms:

$𝜣 (𝐗) = \begin{bmatrix} | & | & | & | & & | & | & \\ 𝟏 & 𝐗 & 𝐗^{2} & 𝐗^{3} & \dots & \sin (𝐗) & \cos (𝐗) & \dots \\ | & | & | & | & & | & | & \end{bmatrix}$
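The construction of such a library can be sketched as follows. This is an illustrative example only: the function name `build_library`, the default polynomial degree, and the reading of the polynomial blocks as all monomials of the state variables up to that degree (including cross terms) are assumptions made here, not prescriptions from the cited sources.

```python
import math
from itertools import combinations_with_replacement


def build_library(X, degree=2, use_trig=True):
    """Evaluate candidate functions on every snapshot.

    X: m x n nested list of snapshots (one row per sampling time).
    Returns (Theta, names), where Theta is an m x p nested list and
    names holds one label per library column.
    """
    n = len(X[0])
    # Each monomial is a tuple of variable indices: () is the constant 1,
    # (0,) is x0, (0, 1) is x0*x1, and so on.
    monomials = [combo
                 for d in range(degree + 1)
                 for combo in combinations_with_replacement(range(n), d)]
    names = ["1" if not mono else "*".join(f"x{j}" for j in mono)
             for mono in monomials]
    if use_trig:
        names += [f"sin(x{j})" for j in range(n)]
        names += [f"cos(x{j})" for j in range(n)]

    Theta = []
    for x in X:
        # math.prod of an empty sequence is 1, which gives the constant column.
        row = [math.prod(x[j] for j in mono) for mono in monomials]
        if use_trig:
            row += [math.sin(v) for v in x]
            row += [math.cos(v) for v in x]
        Theta.append(row)
    return Theta, names


# Two snapshots of a two-dimensional state.
X = [[1.0, 2.0], [0.5, -1.0]]
Theta, names = build_library(X)
```

For a two-dimensional state and degree 2 this produces six polynomial columns and four trigonometric ones. The number of columns grows quickly with the state dimension and the polynomial degree.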

The number of possible model structures that can be built from this library is combinatorially large. The dynamics $𝐟 (𝐱 (t))$ are then expressed as the product of $𝜣 (𝐗)$ with a matrix of coefficients $𝜩 = [𝝃_{1} \; 𝝃_{2} \; \dots \; 𝝃_{n}]$, in which each column $𝝃_{k}$ determines the active terms in the $k$-th component of $𝐟 (𝐱 (t))$:

$\dot{𝐗} = 𝜣 (𝐗) 𝜩$
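As a hypothetical illustration (the system and library below are chosen for this example and are not taken from the cited sources), consider a two-state system with a polynomial library up to second order:

$\dot{x}_{1} = -x_{1} + 2 x_{2}, \qquad \dot{x}_{2} = -3 x_{1} x_{2}, \qquad 𝜣 (𝐱) = \begin{bmatrix} 1 & x_{1} & x_{2} & x_{1}^{2} & x_{1} x_{2} & x_{2}^{2} \end{bmatrix}.$

The coefficient matrix then has one column per state variable and one row per library term, and only three of its twelve entries are nonzero:

$𝜩 = \begin{bmatrix} 𝝃_{1} & 𝝃_{2} \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ -1 & 0 \\ 2 & 0 \\ 0 & 0 \\ 0 & -3 \\ 0 & 0 \end{bmatrix}.$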

Because only a few terms are expected to be active in each component of the dynamics, $𝐟 (𝐱 (t))$ is assumed to admit a sparse representation in $𝜣 (𝐗)$. The task then becomes an optimization problem: finding a sparse $𝜩$ that best fits $\dot{𝐗}$. In other words, a parsimonious model is obtained by performing least-squares regression on the system $\dot{𝐗} = 𝜣 (𝐗) 𝜩$ with sparsity-promoting ( $L_{1}$ ) regularization

$𝝃_{k} = \underset{𝝃'_{k}}{\arg \min} \left\| \dot{𝐗}_{k} - 𝜣 (𝐗) 𝝃'_{k} \right\|_{2} + λ \left\| 𝝃'_{k} \right\|_{1},$

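where $\dot{𝐗}_{k}$ is the $k$-th column of $\dot{𝐗}$ and $λ$ is a regularization parameter that sets the trade-off between fitting accuracy and sparsity. Finally, the sparse set of $𝝃_{k}$ can be used to reconstruct the dynamical system: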

$\dot{x}_{k} = 𝜣 (𝐱) 𝝃_{k}$
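The regression and reconstruction steps can be sketched end to end on synthetic data. The example below is a minimal illustration under stated assumptions, not a reference implementation. It solves the common LASSO variant with a squared, sample-averaged residual, $\frac{1}{2m} \| \dot{𝐗}_{k} - 𝜣 (𝐗) 𝝃 \|_{2}^{2} + λ \| 𝝃 \|_{1}$, by cyclic coordinate descent. The test system, the four-term library, the sampling grid, and all function names are hypothetical. Derivatives are computed exactly from the known system rather than estimated.

```python
NAMES = ["1", "x1", "x2", "x1*x2"]


def library_row(x):
    """Theta(x) for a single state x = [x1, x2] (hypothetical small library)."""
    x1, x2 = x
    return [1.0, x1, x2, x1 * x2]


def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; values within [-lam, lam] become 0."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def lasso_cd(Theta, y, lam, n_iter=100):
    """Cyclic coordinate descent for (1/(2m))*||y - Theta xi||^2 + lam*||xi||_1."""
    m, p = len(Theta), len(Theta[0])
    xi = [0.0] * p
    residual = list(y)  # equals y - Theta @ xi for the initial xi = 0
    col_sq = [sum(Theta[i][j] ** 2 for i in range(m)) / m for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # an all-zero column cannot explain anything
            # Correlation of column j with the residual that excludes term j.
            rho = sum(Theta[i][j] * (residual[i] + Theta[i][j] * xi[j])
                      for i in range(m)) / m
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - xi[j]
            if delta != 0.0:
                for i in range(m):
                    residual[i] -= Theta[i][j] * delta
                xi[j] = new
    return xi


# Synthetic samples of a hypothetical system:
#   x1_dot = -x1 + 2*x2,   x2_dot = -3*x1*x2
grid = [-1.0, -0.5, 0.0, 0.5, 1.0]
X = [[a, b] for a in grid for b in grid]
X_dot = [[-a + 2.0 * b, -3.0 * a * b] for a, b in X]

Theta = [library_row(x) for x in X]
# One sparse regression per state variable; Xi[k] plays the role of xi_k.
Xi = [lasso_cd(Theta, [row[k] for row in X_dot], lam=0.01) for k in range(2)]


def identified_rhs(x):
    """Reconstructed dynamics: x_dot_k = Theta(x) . xi_k for each k."""
    row = library_row(x)
    return [sum(t * c for t, c in zip(row, xi_k)) for xi_k in Xi]


for k, xi_k in enumerate(Xi):
    terms = [f"{c:+.2f}*{name}" for c, name in zip(xi_k, NAMES) if c != 0.0]
    print(f"x{k + 1}_dot =", " ".join(terms))
```

On this data the inactive terms are set exactly to zero, while the active coefficients come back slightly shrunk toward zero (approximately -0.98, 1.98 and -2.96 instead of -1, 2 and -3). This bias is a known property of $L_{1}$ regularization, and a common remedy is to refit the selected terms by ordinary least squares.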

References

[1] Template:Cite book

[2] Template:Cite journal

[3] Template:Cite journal

[4] Template:Cite arXiv
