Tensor reshaping

Template:More citations needed In multilinear algebra, a reshaping of tensors is any bijection between the set of indices of an order- $M$ tensor and the set of indices of an order- $L$ tensor, where $L < M$ . The use of indices presupposes tensors in coordinate representation with respect to a basis. The coordinate representation of a tensor can be regarded as a multi-dimensional array, and a bijection from one set of indices to another therefore amounts to a rearrangement of the array elements into an array of a different shape. Such a rearrangement constitutes a particular kind of linear map between the vector space of order- $M$ tensors and the vector space of order- $L$ tensors.

Definition

Given a positive integer $M$ , the notation $[M]$ refers to the set ${1, \dots, M}$ of the first Template:Mvar positive integers.

For each integer $m$ where $1 \leq m \leq M$ for a positive integer $M$ , let $V_{m}$ denote an $I_{m}$ -dimensional vector space over a field $F$ . Then there are vector space isomorphisms (linear maps)

$\begin{matrix} V_{1} \otimes \dots \otimes V_{M} & ≃ F^{I_{1}} \otimes \dots \otimes F^{I_{M}} \\ ≃ F^{I_{π_{1}}} \otimes \dots \otimes F^{I_{π_{M}}} \\ ≃ F^{I_{π_{1}} I_{π_{2}}} \otimes F^{I_{π_{3}}} \otimes \dots \otimes F^{I_{π_{M}}} \\ ≃ F^{I_{π_{1}} I_{π_{3}}} \otimes F^{I_{π_{2}}} \otimes F^{I_{π_{4}}} \otimes \dots \otimes F^{I_{π_{M}}} \\ ⋮ \\ ≃ F^{I_{1} I_{2} \dots I_{M}}, \end{matrix}$

where $π \in 𝔖_{M}$ is any permutation and $𝔖_{M}$ is the symmetric group on $M$ elements. Via these (and other) vector space isomorphisms, a tensor can be interpreted in several ways as an order- $L$ tensor where $L \leq M$ .

Coordinate representation

The first vector space isomorphism on the list above, $V_{1} \otimes \dots \otimes V_{M} ≃ F^{I_{1}} \otimes \dots \otimes F^{I_{M}}$ , gives the coordinate representation of an abstract tensor. Assume that each of the $M$ vector spaces $V_{m}$ has a basis ${v_{1}^{m}, v_{2}^{m}, \dots, v_{I_{m}}^{m}}$ . The expression of a tensor with respect to this basis has the form $𝒜 = \sum_{i_{1} = 1}^{I_{1}} \dots \sum_{i_{M} = 1}^{I_{M}} a_{i_{1}, i_{2}, \dots, i_{M}} v_{i_{1}}^{1} \otimes v_{i_{2}}^{2} \otimes \dots \otimes v_{i_{M}}^{M},$ where the coefficients $a_{i_{1}, i_{2}, \dots, i_{M}}$ are elements of $F$ . The coordinate representation of $𝒜$ is $\sum_{i_{1} = 1}^{I_{1}} \dots \sum_{i_{M} = 1}^{I_{M}} a_{i_{1}, i_{2}, \dots, i_{M}} 𝐞_{i_{1}}^{1} \otimes 𝐞_{i_{2}}^{2} \otimes \dots \otimes 𝐞_{i_{M}}^{M},$ where $𝐞_{i}^{m}$ is the $i^{th}$ standard basis vector of $F^{I_{m}}$ . This can be regarded as a M-way array whose elements are the coefficients $a_{i_{1}, i_{2}, \dots, i_{M}}$ .

General flattenings

For any permutation $π \in 𝔖_{M}$ there is a canonical isomorphism between the two tensor products of vector spaces $V_{1} \otimes V_{2} \otimes \dots \otimes V_{M}$ and $V_{π (1)} \otimes V_{π (2)} \otimes \dots \otimes V_{π (M)}$ . Parentheses are usually omitted from such products due to the natural isomorphism between $V_{i} \otimes (V_{j} \otimes V_{k})$ and $(V_{i} \otimes V_{j}) \otimes V_{k}$ , but may, of course, be reintroduced to emphasize a particular grouping of factors. In the grouping, $(V_{π (1)} \otimes \dots \otimes V_{π (r_{1})}) \otimes (V_{π (r_{1} + 1)} \otimes \dots \otimes V_{π (r_{2})}) \otimes \dots \otimes (V_{π (r_{L - 1} + 1)} \otimes \dots \otimes V_{π (r_{L})}),$ there are $L$ groups with $r_{l} - r_{l - 1}$ factors in the $l^{th}$ group (where $r_{0} = 0$ and $r_{L} = M$ ).

Letting $S_{l} = (π (r_{l - 1} + 1), π (r_{l - 1} + 2), \dots, π (r_{l}))$ for each $l$ satisfying $1 \leq l \leq L$ , an $(S_{1}, S_{2}, \dots, S_{L})$ -flattening of a tensor $𝒜$ , denoted $𝒜_{(S_{1}, S_{2}, \dots, S_{L})}$ , is obtained by applying the two processes above within each of the $L$ groups of factors. That is, the coordinate representation of the $l^{th}$ group of factors is obtained using the isomorphism $(V_{π (r_{l - 1} + 1)} \otimes V_{π (r_{l - 1} + 2)} \otimes \dots \otimes V_{π (r_{l})}) ≃ (F^{I_{π (r_{l - 1} + 1)}} \otimes F^{I_{π (r_{l - 1} + 2)}} \otimes \dots \otimes F^{I_{π (r_{l})}})$ , which requires specifying bases for all of the vector spaces $V_{k}$ . The result is then vectorized using a bijection $μ_{l} : [I_{π (r_{l - 1} + 1)}] \times [I_{π (r_{l - 1} + 2)}] \times \dots \times [I_{π (r_{l})}] \to [I_{S_{l}}]$ to obtain an element of $F^{I_{S_{l}}}$ , where $I_{S_{l}} := \prod_{i = r_{l - 1} + 1}^{r_{l}} I_{π (i)}$ , the product of the dimensions of the vector spaces in the $l^{th}$ group of factors. The result of applying these isomorphisms within each group of factors is an element of $F^{I_{S_{1}}} \otimes \dots \otimes F^{I_{S_{L}}}$ , which is a tensor of order $L$ .

Vectorization

By means of a bijective map $μ : [I_{1}] \times \dots \times [I_{M}] \to [I_{1} \dots I_{M}]$ , a vector space isomorphism between $F^{I_{1}} \otimes \dots \otimes F^{I_{M}}$ and $F^{I_{1} \dots I_{M}}$ is constructed via the mapping $𝐞_{i_{1}}^{1} \otimes \dots 𝐞_{i_{m}}^{m} \otimes \dots \otimes 𝐞_{i_{M}}^{M} \mapsto 𝐞_{μ (i_{1}, i_{2}, \dots, i_{M})},$ where for every natural number $i$ such that $1 \leq i \leq I_{1} \dots I_{M}$ , the vector $𝐞_{i}$ denotes the ith standard basis vector of $F^{i_{1} \dots i_{M}}$ . In such a reshaping, the tensor is simply interpreted as a vector in $F^{I_{1} \dots I_{M}}$ . This is known as vectorization, and is analogous to vectorization of matrices. A standard choice of bijection $μ$ is such that

$vec (𝒜) = {[\begin{matrix} a_{1, 1, \dots, 1} & a_{2, 1, \dots, 1} & \dots & a_{n_{1}, 1, \dots, 1} & a_{1, 2, 1, \dots, 1} & \dots & a_{I_{1}, I_{2}, \dots, I_{M}} \end{matrix}]}^{T},$

which is consistent with the way in which the colon operator in Matlab and GNU Octave reshapes a higher-order tensor into a vector. In general, the vectorization of $𝒜$ is the vector $[a_{μ^{- 1} (i)}]_{i = 1}^{I_{1} \dots I_{M}}$ .

The vectorization of $𝒜$ denoted with $v e c (𝒜)$ or $𝒜_{[:]}$ is an $[S_{1}, S_{2}]$ -reshaping where $S_{1} = (1, 2, \dots, M)$ and $S_{2} = \emptyset$ .

Mode-m Flattening / Mode-m Matrixization

Let $𝒜 \in F^{I_{1}} \otimes F^{I_{2}} \otimes \dots \otimes F^{I_{M}}$ be the coordinate representation of an abstract tensor with respect to a basis. Mode-m matrixizing (a.k.a. flattening) of $𝒜$ is an $[S_{1}, S_{2}]$ -reshaping in which $S_{1} = (m)$ and $S_{2} = (1, 2, \dots, m - 1, m + 1, \dots, M)$ . Usually, a standard matrixizing is denoted by

$𝐀_{[m]} = 𝒜_{[S_{1}, S_{2}]}$

This reshaping is sometimes called matrixizing, matricizing, flattening or unfolding in the literature. A standard choice for the bijections $μ_{1}, μ_{2}$ is the one that is consistent with the reshape function in Matlab and GNU Octave, namely

$𝐀_{[m]} := [\begin{matrix} a_{1, 1, \dots, 1, 1, 1, \dots, 1} & a_{2, 1, \dots, 1, 1, 1, \dots, 1} & \dots & a_{I_{1}, I_{2}, \dots, I_{m - 1}, 1, I_{m + 1}, \dots, I_{M}} \\ a_{1, 1, \dots, 1, 2, 1, \dots, 1} & a_{2, 1, \dots, 1, 2, 1, \dots, 1} & \dots & a_{I_{1}, I_{2}, \dots, I_{m - 1}, 2, I_{m + 1}, \dots, I_{M}} \\ ⋮ & ⋮ & ⋮ \\ a_{1, 1, \dots, 1, I_{m}, 1, \dots, 1} & a_{2, 1, \dots, 1, I_{m}, 1, \dots, 1} & \dots & a_{I_{1}, I_{2}, \dots, I_{m - 1}, I_{m}, I_{m + 1}, \dots, I_{M}} \end{matrix}]$

Definition Mode-m Matrixizing:^[1] $[𝐀_{[m]}]_{j k} = a_{i_{1} \dots i_{m} \dots i_{M}}, where j = i_{m} and k = 1 + \sum_{\binom{n = 0}{n \neq m}}^{M} (i_{n} - 1) \prod_{\binom{l = 0}{l \neq m}}^{n - 1} I_{l} .$ The mode-m matrixizing of a tensor $𝒜 \in F^{I_{1} \times ... I_{M}},$ is defined as the matrix $𝐀_{[m]} \in F^{I_{m} \times (I_{1} \dots I_{m - 1} I_{m + 1} \dots I_{M})}$ . As the parenthetical ordering indicates, the mode-m column vectors are arranged by sweeping all the other mode indices through their ranges, with smaller mode indexes varying more rapidly than larger ones; thus

References

Template:Reflist

↑ Template:Citation

[Vasilescu2009-1] Template:Citation

[1]

Tensor reshaping

Contents

Definition

Coordinate representation

General flattenings

Vectorization

Mode-m Flattening / Mode-m Matrixization

References

Navigation menu

Tensor reshaping

Definition

Coordinate representation

General flattenings

Vectorization

Mode-m Flattening / Mode-m Matrixization

References

Navigation menu

Search