Multilinear multiplication: Difference between revisions

Latest revision as of 22:02, 5 May 2020

In multilinear algebra, applying a map that is the tensor product of linear maps to a tensor is called a multilinear multiplication.

Abstract definition

Let $F$ be a field of characteristic zero, such as $ℝ$ or $ℂ$ . Let $V_{k}$ be a finite-dimensional vector space over $F$ , and let $𝒜 \in V_{1} \otimes V_{2} \otimes \dots \otimes V_{d}$ be an order-d simple tensor, i.e., there exist some vectors $𝐯_{k} \in V_{k}$ such that $𝒜 = 𝐯_{1} \otimes 𝐯_{2} \otimes \dots \otimes 𝐯_{d}$ . If we are given a collection of linear maps $A_{k} : V_{k} \to W_{k}$ , then the multilinear multiplication of $𝒜$ with $(A_{1}, A_{2}, \dots, A_{d})$ is defined^[1] as the action on $𝒜$ of the tensor product of these linear maps,^[2] namely

$\begin{matrix} A_{1} \otimes A_{2} \otimes \dots \otimes A_{d} : V_{1} \otimes V_{2} \otimes \dots \otimes V_{d} & \to W_{1} \otimes W_{2} \otimes \dots \otimes W_{d}, \\ 𝐯_{1} \otimes 𝐯_{2} \otimes \dots \otimes 𝐯_{d} & \mapsto A_{1} (𝐯_{1}) \otimes A_{2} (𝐯_{2}) \otimes \dots \otimes A_{d} (𝐯_{d}) \end{matrix}$

Since the tensor product of linear maps is itself a linear map,^[2] and because every tensor admits a tensor rank decomposition,^[1] the above expression extends linearly to all tensors. That is, for a general tensor $𝒜 \in V_{1} \otimes V_{2} \otimes \dots \otimes V_{d}$ , the multilinear multiplication is

$\begin{matrix} ℬ : = (A_{1} \otimes A_{2} \otimes \dots \otimes A_{d}) (𝒜) \\ = & (A_{1} \otimes A_{2} \otimes \dots \otimes A_{d}) (\sum_{i = 1}^{r} 𝐚_{i}^{1} \otimes 𝐚_{i}^{2} \otimes \dots \otimes 𝐚_{i}^{d}) \\ = & \sum_{i = 1}^{r} A_{1} (𝐚_{i}^{1}) \otimes A_{2} (𝐚_{i}^{2}) \otimes \dots \otimes A_{d} (𝐚_{i}^{d}) \end{matrix}$

where $𝒜 = \sum_{i = 1}^{r} 𝐚_{i}^{1} \otimes 𝐚_{i}^{2} \otimes \dots \otimes 𝐚_{i}^{d}$ with $𝐚_{i}^{k} \in V_{k}$ is one of $𝒜$ 's tensor rank decompositions. The validity of the above expression is not limited to a tensor rank decomposition; in fact, it is valid for any expression of $𝒜$ as a linear combination of pure tensors, which follows from the universal property of the tensor product.

It is standard to use the following shorthand notations in the literature for multilinear multiplications: $(A_{1}, A_{2}, \dots, A_{d}) \cdot 𝒜 : = (A_{1} \otimes A_{2} \otimes \dots \otimes A_{d}) (𝒜)$ and $A_{k} \cdot_{k} 𝒜 : = ({Id}_{V_{1}}, \dots, {Id}_{V_{k - 1}}, A_{k}, {Id}_{V_{k + 1}}, \dots, {Id}_{V_{d}}) \cdot 𝒜,$ where ${Id}_{V_{k}} : V_{k} \to V_{k}$ is the identity operator.

Definition in coordinates

In computational multilinear algebra it is conventional to work in coordinates. Assume that an inner product is fixed on $V_{k}$ and let $V_{k}^{*}$ denote the dual vector space of $V_{k}$ . Let ${e_{1}^{k}, \dots, e_{n_{k}}^{k}}$ be a basis for $V_{k}$ , let ${(e_{1}^{k})^{*}, \dots, (e_{n_{k}}^{k})^{*}}$ be the dual basis, and let ${f_{1}^{k}, \dots, f_{m_{k}}^{k}}$ be a basis for $W_{k}$ . The linear map $M_{k} = \sum_{i = 1}^{m_{k}} \sum_{j = 1}^{n_{k}} m_{i, j}^{(k)} f_{i}^{k} \otimes (e_{j}^{k})^{*}$ is then represented by the matrix ${\hat{M}}_{k} = [m_{i, j}^{(k)}] \in F^{m_{k} \times n_{k}}$ . Likewise, with respect to the standard tensor product basis ${e_{j_{1}}^{1} \otimes e_{j_{2}}^{2} \otimes \dots \otimes e_{j_{d}}^{d}}_{j_{1}, j_{2}, \dots, j_{d}}$ , the abstract tensor $𝒜 = \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} e_{j_{1}}^{1} \otimes e_{j_{2}}^{2} \otimes \dots \otimes e_{j_{d}}^{d}$ is represented by the multidimensional array $\hat{𝒜} = [a_{j_{1}, j_{2}, \dots, j_{d}}] \in F^{n_{1} \times n_{2} \times \dots \times n_{d}}$ . Observe that $\hat{𝒜} = \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d},$

where $𝐞_{j}^{k} \in F^{n_{k}}$ is the jth standard basis vector of $F^{n_{k}}$ and the tensor product of vectors is the affine Segre map $\otimes : (𝐯^{(1)}, 𝐯^{(2)}, \dots, 𝐯^{(d)}) \mapsto [v_{i_{1}}^{(1)} v_{i_{2}}^{(2)} \dots v_{i_{d}}^{(d)}]_{i_{1}, i_{2}, \dots, i_{d}}$ . It follows from the above choices of bases that the multilinear multiplication $ℬ = (M_{1}, M_{2}, \dots, M_{d}) \cdot 𝒜$ becomes

$\begin{matrix} \hat{ℬ} & = ({\hat{M}}_{1}, {\hat{M}}_{2}, \dots, {\hat{M}}_{d}) \cdot \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d} \\ = \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} ({\hat{M}}_{1}, {\hat{M}}_{2}, \dots, {\hat{M}}_{d}) \cdot (𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d}) \\ = \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} ({\hat{M}}_{1} 𝐞_{j_{1}}^{1}) \otimes ({\hat{M}}_{2} 𝐞_{j_{2}}^{2}) \otimes \dots \otimes ({\hat{M}}_{d} 𝐞_{j_{d}}^{d}) . \end{matrix}$

The resulting tensor $\hat{ℬ}$ lives in $F^{m_{1} \times m_{2} \times \dots \times m_{d}}$ .

Element-wise definition

From the above expression, an element-wise definition of the multilinear multiplication is obtained. Indeed, since $\hat{ℬ}$ is a multidimensional array, it may be expressed as $\hat{ℬ} = \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} b_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d},$ where $b_{j_{1}, j_{2}, \dots, j_{d}} \in F$ are the coefficients. Then it follows from the above formulae that

$\begin{matrix} ((𝐞_{i_{1}}^{1})^{T}, (𝐞_{i_{2}}^{2})^{T}, \dots, (𝐞_{i_{d}}^{d})^{T}) \cdot \hat{ℬ} \\ = & \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} b_{j_{1}, j_{2}, \dots, j_{d}} ((𝐞_{i_{1}}^{1})^{T} 𝐞_{j_{1}}^{1}) \otimes ((𝐞_{i_{2}}^{2})^{T} 𝐞_{j_{2}}^{2}) \otimes \dots \otimes ((𝐞_{i_{d}}^{d})^{T} 𝐞_{j_{d}}^{d}) \\ = & \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} b_{j_{1}, j_{2}, \dots, j_{d}} δ_{i_{1}, j_{1}} \cdot δ_{i_{2}, j_{2}} \dots δ_{i_{d}, j_{d}} \\ = & b_{i_{1}, i_{2}, \dots, i_{d}}, \end{matrix}$

where $δ_{i, j}$ is the Kronecker delta. Hence, if $ℬ = (M_{1}, M_{2}, \dots, M_{d}) \cdot 𝒜$ , then

$\begin{matrix} b_{i_{1}, i_{2}, \dots, i_{d}} = ((𝐞_{i_{1}}^{1})^{T}, (𝐞_{i_{2}}^{2})^{T}, \dots, (𝐞_{i_{d}}^{d})^{T}) \cdot \hat{ℬ} \\ = & ((𝐞_{i_{1}}^{1})^{T}, (𝐞_{i_{2}}^{2})^{T}, \dots, (𝐞_{i_{d}}^{d})^{T}) \cdot ({\hat{M}}_{1}, {\hat{M}}_{2}, \dots, {\hat{M}}_{d}) \cdot \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d} \\ = & \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} ((𝐞_{i_{1}}^{1})^{T} {\hat{M}}_{1} 𝐞_{j_{1}}^{1}) \otimes ((𝐞_{i_{2}}^{2})^{T} {\hat{M}}_{2} 𝐞_{j_{2}}^{2}) \otimes \dots \otimes ((𝐞_{i_{d}}^{d})^{T} {\hat{M}}_{d} 𝐞_{j_{d}}^{d}) \\ = & \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} m_{i_{1}, j_{1}}^{(1)} \cdot m_{i_{2}, j_{2}}^{(2)} \dots m_{i_{d}, j_{d}}^{(d)}, \end{matrix}$

where the $m_{i, j}^{(k)}$ are the elements of ${\hat{M}}_{k}$ as defined above.

Properties

Let $𝒜 \in V_{1} \otimes V_{2} \otimes \dots \otimes V_{d}$ be an order-d tensor over the tensor product of $F$ -vector spaces.

Since a multilinear multiplication is the tensor product of linear maps, we have the following multilinearity property (in the construction of the map):^[1]^[2]

$A_{1} \otimes \dots \otimes A_{k - 1} \otimes (α A_{k} + β B) \otimes A_{k + 1} \otimes \dots \otimes A_{d} = α A_{1} \otimes \dots \otimes A_{d} + β A_{1} \otimes \dots \otimes A_{k - 1} \otimes B \otimes A_{k + 1} \otimes \dots \otimes A_{d}$

Multilinear multiplication is a linear map:^[1]^[2] $(M_{1}, M_{2}, \dots, M_{d}) \cdot (α 𝒜 + β ℬ) = α (M_{1}, M_{2}, \dots, M_{d}) \cdot 𝒜 + β (M_{1}, M_{2}, \dots, M_{d}) \cdot ℬ$

It follows from the definition that the composition of two multilinear multiplications is also a multilinear multiplication:^[1]^[2]

$(M_{1}, M_{2}, \dots, M_{d}) \cdot ((K_{1}, K_{2}, \dots, K_{d}) \cdot 𝒜) = (M_{1} \circ K_{1}, M_{2} \circ K_{2}, \dots, M_{d} \circ K_{d}) \cdot 𝒜,$

where $M_{k} : U_{k} \to W_{k}$ and $K_{k} : V_{k} \to U_{k}$ are linear maps.

Observe specifically that multilinear multiplications in different factors commute,

$M_{k} \cdot_{k} (M_{ℓ} \cdot_{ℓ} 𝒜) = M_{ℓ} \cdot_{ℓ} (M_{k} \cdot_{k} 𝒜) = M_{k} \cdot_{k} M_{ℓ} \cdot_{ℓ} 𝒜,$

if $k \neq ℓ .$

Computation

The factor-k multilinear multiplication $M_{k} \cdot_{k} 𝒜$ can be computed in coordinates as follows. Observe first that

$\begin{matrix} M_{k} \cdot_{k} 𝒜 & = M_{k} \cdot_{k} \sum_{j_{1} = 1}^{n_{1}} \sum_{j_{2} = 1}^{n_{2}} \dots \sum_{j_{d} = 1}^{n_{d}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{1}}^{1} \otimes 𝐞_{j_{2}}^{2} \otimes \dots \otimes 𝐞_{j_{d}}^{d} \\ = \sum_{j_{1} = 1}^{n_{1}} \dots \sum_{j_{k - 1} = 1}^{n_{k - 1}} \sum_{j_{k + 1} = 1}^{n_{k + 1}} \dots \sum_{j_{d} = 1}^{n_{d}} 𝐞_{j_{1}}^{1} \otimes \dots \otimes 𝐞_{j_{k - 1}}^{k - 1} \otimes M_{k} (\sum_{j_{k} = 1}^{n_{k}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{k}}^{k}) \otimes 𝐞_{j_{k + 1}}^{k + 1} \otimes \dots \otimes 𝐞_{j_{d}}^{d} . \end{matrix}$

Next, since

$F^{n_{1}} \otimes F^{n_{2}} \otimes \dots \otimes F^{n_{d}} ≃ F^{n_{k}} \otimes (F^{n_{1}} \otimes \dots \otimes F^{n_{k - 1}} \otimes F^{n_{k + 1}} \otimes \dots \otimes F^{n_{d}}) ≃ F^{n_{k}} \otimes F^{n_{1} \dots n_{k - 1} n_{k + 1} \dots n_{d}},$

there is a bijective map, called the factor-k standard flattening,^[1] denoted by $(\cdot)_{(k)}$ , that identifies $M_{k} \cdot_{k} 𝒜$ with an element from the latter space, namely

${(M_{k} \cdot_{k} 𝒜)}_{(k)} : = \sum_{j_{1} = 1}^{n_{1}} \dots \sum_{j_{k - 1} = 1}^{n_{k - 1}} \sum_{j_{k + 1} = 1}^{n_{k + 1}} \dots \sum_{j_{d} = 1}^{n_{d}} M_{k} (\sum_{j_{k} = 1}^{n_{k}} a_{j_{1}, j_{2}, \dots, j_{d}} 𝐞_{j_{k}}^{k}) \otimes 𝐞_{μ_{k} (j_{1}, \dots, j_{k - 1}, j_{k + 1}, \dots, j_{d})} : = M_{k} 𝒜_{(k)},$

where $𝐞_{j}$ is the jth standard basis vector of $F^{N_{k}}$ , $N_{k} = n_{1} \dots n_{k - 1} n_{k + 1} \dots n_{d}$ , and $𝒜_{(k)} \in F^{n_{k}} \otimes F^{N_{k}} ≃ F^{n_{k} \times N_{k}}$ is the factor-k flattening matrix of $𝒜$ whose columns are the factor-k vectors $[a_{j_{1}, \dots, j_{k - 1}, i, j_{k + 1}, \dots, j_{d}}]_{i = 1}^{n_{k}}$ in some order, determined by the particular choice of the bijective map

$μ_{k} : [1, n_{1}] \times \dots \times [1, n_{k - 1}] \times [1, n_{k + 1}] \times \dots \times [1, n_{d}] \to [1, N_{k}] .$

In other words, the multilinear multiplication $(M_{1}, M_{2}, \dots, M_{d}) \cdot 𝒜$ can be computed as a sequence of d factor-k multilinear multiplications, which themselves can be implemented efficiently as classic matrix multiplications.

Applications

The higher-order singular value decomposition (HOSVD) factorizes a tensor given in coordinates $𝒜 \in F^{n_{1} \times n_{2} \times \dots \times n_{d}}$ as the multilinear multiplication $𝒜 = (U_{1}, U_{2}, \dots, U_{d}) \cdot 𝒮$ , where $U_{k} \in F^{n_{k} \times n_{k}}$ are orthogonal matrices and $𝒮 \in F^{n_{1} \times n_{2} \times \dots \times n_{d}}$ .

Multilinear multiplication: Difference between revisions

Latest revision as of 22:02, 5 May 2020

Contents

Abstract definition

Definition in coordinates

Element-wise definition

Properties

Computation

Applications

Further reading

Navigation menu

Multilinear multiplication: Difference between revisions

Latest revision as of 22:02, 5 May 2020

Abstract definition

Definition in coordinates

Element-wise definition

Properties

Computation

Applications

Further reading

Navigation menu

Search