Higher-order singular value decomposition

From testwiki
Revision as of 09:55, 28 November 2024 by imported>Citation bot (Added bibcode. | Use this bot. Report bugs. | Suggested by Dominic3203 | Category:Tensors | #UCB_Category 92/96)
(diff) โ† Older revision | Latest revision (diff) | Newer revision โ†’ (diff)
Jump to navigation Jump to search

Template:Short description In multilinear algebra, the higher-order singular value decomposition (HOSVD) of a tensor is a specific orthogonal Tucker decomposition. It may be regarded as one type of generalization of the matrix singular value decomposition. It has applications in computer vision, computer graphics, machine learning, scientific computing, and signal processing. Some aspects can be traced as far back as F. L. Hitchcock in 1928,[1] but it was L. R. Tucker who developed for third-order tensors the general Tucker decomposition in the 1960s,[2][3][4] further advocated by L. De Lathauwer et al.[5] in their Multilinear SVD work that employs the power method, or advocated by Vasilescu and Terzopoulos that developed M-mode SVD a parallel algorithm that employs the matrix SVD.

The term higher order singular value decomposition (HOSVD) was coined be DeLathauwer, but the algorithm referred to commonly in the literature as the HOSVD and attributed to either Tucker or DeLathauwer was developed by Vasilescu and Terzopoulos.[6][7][8] Robust and L1-norm-based variants of HOSVD have also been proposed.[9][10][11][12]

Definition

For the purpose of this article, the abstract tensor ๐’œ is assumed to be given in coordinates with respect to some basis as a M-way array, also denoted by ๐’œโ„‚I1×I2×Im×IM, where M is the number of modes and the order of the tensor. โ„‚ is the complex numbers and it includes both the real numbers โ„ and the pure imaginary numbers.

Let ๐’œ[m]โ„‚Im×(I1I2Im1Im+1IM) denote the standard mode-m flattening of ๐’œ, so that the left index of ๐’œ[m] corresponds to the m'th index ๐’œ and the right index of ๐’œ[m] corresponds to all other indices of ๐’œ combined. Let ๐”mโ„‚Im×Imbe a unitary matrix containing a basis of the left singular vectors of the ๐’œ[m] such that the jth column ๐ฎj of ๐”m corresponds to the jth largest singular value of ๐’œ[m]. Observe that the mode/factor matrix ๐”m does not depend on the particular on the specific definition of the mode m flattening. By the properties of the multilinear multiplication, we have๐’œ=๐’œ×(๐ˆ,๐ˆ,,๐ˆ)=๐’œ×(๐”1๐”1H,๐”2๐”2H,,๐”M๐”MH)=(๐’œ×(๐”1H,๐”2H,,๐”MH))×(๐”1,๐”2,,๐”M),where H denotes the conjugate transpose. The second equality is because the ๐”m's are unitary matrices. Define now the core tensor๐’ฎ:=๐’œ×(๐”1H,๐”2H,,๐”MH).Then, the HOSVD[5] of ๐’œ is the decomposition๐’œ=๐’ฎ×(๐”1,๐”2,,๐”M). The above construction shows that every tensor has a HOSVD.

Compact HOSVD

As in the case of the compact singular value decomposition of a matrix, where the rows and columns corresponding to vanishing singular values are dropped, it is also possible to consider a compact HOSVD, which is very useful in applications.

Assume that ๐”mโ„‚Im×Rm is a matrix with unitary columns containing a basis of the left singular vectors corresponding to the nonzero singular values of the standard factor-m flattening ๐’œ[m] of ๐’œ. Let the columns of ๐”m be sorted such that the rm th column ๐ฎrm of ๐”m corresponds to the rmth largest nonzero singular value of ๐’œ[m]. Since the columns of ๐”m form a basis for the image of ๐’œ[m], we have๐’œ[m]=๐”m๐”mH๐’œ[m]=(๐’œ×m(๐”m๐”mH))[m],where the first equality is due to the properties of orthogonal projections (in the Hermitian inner product) and the last equality is due to the properties of multilinear multiplication. As flattenings are bijective maps and the above formula is valid for all m=1,2,,m,,M, we find as before that๐’œ=๐’œ×(๐”1๐”1H,๐”2๐”2H,,๐”M๐”MH)=(๐’œ×(๐”1H,๐”2H,,๐”MH))×(๐”1,๐”2,,๐”M)=๐’ฎ×(๐”1,๐”2,,๐”M),where the core tensor ๐’ฎ is now of size R1×R2××RM.

Multilinear rank

The multilinear rank[1] of ๐’œ is denoted with rank-(R1,R2,,RM). The multilinear rank is a tuple in โ„•M where Rm:=rank(๐’œ[m]). Not all tuples in โ„•M are multilinear ranks.[13] The multilinear ranks are bounded by 1RmIm and it satisfy the constraint RmimRi must hold.[13]

The compact HOSVD is a rank-revealing decomposition in the sense that the dimensions of its core tensor correspond with the components of the multilinear rank of the tensor.

Interpretation

The following geometric interpretation is valid for both the full and compact HOSVD. Let (R1,R2,,RM) be the multilinear rank of the tensor ๐’œ. Since ๐’ฎโ„‚R1×R2××RM is a multidimensional array, we can expand it as follows๐’ฎ=r1=1R1r2=1R2rM=1RMsr1,r2,,rM๐žr1๐žr2๐žrM,where ๐žrm is the rmth standard basis vector of โ„‚Im. By definition of the multilinear multiplication, it holds that๐’œ=r1=1R1r2=1R2rM=1RMsr1,r2,,rM๐ฎr1๐ฎr2๐ฎrM,where the ๐ฎrm are the columns of ๐”mโ„‚Im×Rm. It is easy to verify that B={๐ฎr1๐ฎr2๐ฎrM}r1,r2,,rM is an orthonormal set of tensors. This means that the HOSVD can be interpreted as a way to express the tensor ๐’œ with respect to a specifically chosen orthonormal basis B with the coefficients given as the multidimensional array ๐’ฎ.

Computation

Let ๐’œโ„‚I1×I2××IM be a tensor with a rank-(R1,R2,,RM), where โ„‚ contains the reals โ„ as a subset.

Classic computation

The strategy for computing the Multilinear SVD and the M-mode SVD was introduced in the 1960s by L. R. Tucker,[3] further advocated by L. De Lathauwer et al.,[5] and by Vasilescu and Terzopulous.[8][6] The term HOSVD was coined by Lieven De Lathauwer, but the algorithm typically referred to in the literature as HOSVD was introduced by Vasilescu and Terzopoulos[6][8] with the name M-mode SVD. It is a parallel computation that employs the matrix SVD to compute the orthonormal mode matrices.

M-mode SVD

Sources:[6][8]

  • For m=1,,M, do the following:
  1. Construct the mode-m flattening ๐’œ[m];
  2. Compute the (compact) singular value decomposition ๐’œ[m]=๐”mΣm๐•mT, and store the left singular vectors ๐”โ„‚Im×Rm;
  • Compute the core tensor ๐’ฎ via the multilinear multiplication ๐’ฎ=๐’œ×1๐”1H×2๐”2H×m๐”mH×M๐”MH

Interlacing computation

A strategy that is significantly faster when some or all RmIm consists of interlacing the computation of the core tensor and the factor matrices, as follows:[14][15][16]

  • Set ๐’œ0=๐’œ;
  • For m=1,2,M perform the following:
    1. Construct the standard mode-m flattening ๐’œ[m]m1;
    2. Compute the (compact) singular value decomposition ๐’œ[m]m1=UmΣmVmT, and store the left singular vectors UmFIm×Rm;
    3. Set ๐’œm=UmH×m๐’œm1, or, equivalently, ๐’œ[m]m=ΣmVmT.

In-place computation

The HOSVD can be computed in-place via the Fused In-place Sequentially Truncated Higher Order Singular Value Decomposition (FIST-HOSVD) [16] algorithm by overwriting the original tensor by the HOSVD core tensor, significantly reducing the memory consumption of computing HOSVD.

Approximation

In applications, such as those mentioned below, a common problem consists of approximating a given tensor ๐’œโ„‚I1×I2××Im×IM by one with a reduced multilinear rank. Formally, if the multilinear rank of ๐’œ is denoted by rank(R1,R2,,Rm,,RM), then computing the optimal ๐’œยฏ that approximates ๐’œ for a given reduced rank(Rยฏ1,Rยฏ2,,Rยฏm,,RยฏM) is a nonlinear non-convex 2-optimization problem min๐’œยฏโ„‚I1×I2××IM12๐’œ๐’œยฏF2s.t.rank(Rยฏ1,Rยฏ2,,RยฏM),where (Rยฏ1,Rยฏ2,,RยฏM)โ„•M is the reduced multilinear rank with 1Rยฏm<RmIm, and the norm .F is the Frobenius norm.

A simple idea for trying to solve this optimization problem is to truncate the (compact) SVD in step 2 of either the classic or the interlaced computation. A classically truncated HOSVD is obtained by replacing step 2 in the classic computation by

  • Compute a rank-Rยฏm truncated SVD ๐’œ[m]UmΣmVmT, and store the top Rยฏm left singular vectors UmFIm×Rยฏm;

while a sequentially truncated HOSVD (or successively truncated HOSVD) is obtained by replacing step 2 in the interlaced computation by

  • Compute a rank-Rยฏm truncated SVD ๐’œ[m]m1UmΣmVmT, and store the top Rยฏm left singular vectors UmFIm×Rยฏm. Unfortunately, truncation does not result in an optimal solution for the best low multilinear rank optimization problem,.[5][6][14][16] However, both the classically and interleaved truncated HOSVD result in a quasi-optimal solution:[14][16][15][17] if ๐’œยฏt denotes the classically or sequentially truncated HOSVD and ๐’œยฏ* denotes the optimal solution to the best low multilinear rank approximation problem, then๐’œ๐’œยฏtFM๐’œ๐’œยฏ*F;in practice this means that if there exists an optimal solution with a small error, then a truncated HOSVD will for many intended purposes also yield a sufficiently good solution.

Applications

The HOSVD is most commonly applied to the extraction of relevant information from multi-way arrays.

Starting in the early 2000s, Vasilescu addressed causal questions by reframing the data analysis, recognition and synthesis problems as multilinear tensor problems. The power of the tensor framework was showcased by decomposing and representing an image in terms of its causal factors of data formation, in the context of Human Motion Signatures for gait recognition,[18] face recognitionโ€”TensorFaces[19][20] and computer graphicsโ€”TensorTextures.[21]

The HOSVD has been successfully applied to signal processing and big data, e.g., in genomic signal processing.[22][23][24] These applications also inspired a higher-order GSVD (HO GSVD)[25] and a tensor GSVD.[26]

A combination of HOSVD and SVD also has been applied for real-time event detection from complex data streams (multivariate data with space and time dimensions) in disease surveillance.[27]

It is also used in tensor product model transformation-based controller design.[28][29]

The concept of HOSVD was carried over to functions by Baranyi and Yam via the TP model transformation.[28][29] This extension led to the definition of the HOSVD-based canonical form of tensor product functions and Linear Parameter Varying system models[30] and to convex hull manipulation based control optimization theory, see TP model transformation in control theories.

HOSVD was proposed to be applied to multi-view data analysis[31] and was successfully applied to in silico drug discovery from gene expression.[32]

Robust L1-norm variant

L1-Tucker is the L1-norm-based, robust variant of Tucker decomposition.[10][11] L1-HOSVD is the analogous of HOSVD for the solution to L1-Tucker.[10][12]

References

  1. โ†‘ 1.0 1.1 Template:Cite journal
  2. โ†‘ Template:Cite journal
  3. โ†‘ 3.0 3.1 Template:Cite journal
  4. โ†‘ Template:Cite journal
  5. โ†‘ 5.0 5.1 5.2 5.3 Template:Cite journal
  6. โ†‘ 6.0 6.1 6.2 6.3 6.4 M. A. O. Vasilescu, D. Terzopoulos (2002) with the name M-mode SVD. The M-mode SVD is suitable for parallel computation and employs the matrix SVD "Multilinear Analysis of Image Ensembles: TensorFaces" Template:Webarchive, Proc. 7th European Conference on Computer Vision (ECCV'02), Copenhagen, Denmark, May, 2002
  7. โ†‘ M. A. O. Vasilescu, D. Terzopoulos (2003) "Multilinear Subspace Analysis of Image Ensembles", "Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPRโ€™03), Madison, WI, June, 2003"
  8. โ†‘ 8.0 8.1 8.2 8.3 M. A. O. Vasilescu, D. Terzopoulos (2005) "Multilinear Independent Component Analysis", "Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPRโ€™05), San Diego, CA, June 2005, vol.1, 547โ€“553."
  9. โ†‘ Template:Cite journal
  10. โ†‘ 10.0 10.1 10.2 Template:Cite journal
  11. โ†‘ 11.0 11.1 Template:Cite journal
  12. โ†‘ 12.0 12.1 Template:Cite book
  13. โ†‘ 13.0 13.1 Template:Cite journal
  14. โ†‘ 14.0 14.1 14.2 Template:Cite journal
  15. โ†‘ 15.0 15.1 Template:Cite book
  16. โ†‘ 16.0 16.1 16.2 16.3 Template:Cite conference
  17. โ†‘ Template:Cite journal
  18. โ†‘ M. A. O. Vasilescu (2002) "Human Motion Signatures: Analysis, Synthesis, Recognition," Proceedings of International Conference on Pattern Recognition (ICPR 2002), Vol. 3, Quebec City, Canada, Aug, 2002, 456โ€“460.
  19. โ†‘ M.A.O. Vasilescu, D. Terzopoulos (2003) "Multilinear Subspace Analysis for Image Ensembles, M. A. O. Vasilescu, D. Terzopoulos, Proc. Computer Vision and Pattern Recognition Conf. (CVPR '03), Vol.2, Madison, WI, June, 2003, 93โ€“99.
  20. โ†‘ M.A.O. Vasilescu, D. Terzopoulos (2002) "Multilinear Analysis of Image Ensembles: TensorFaces," Proc. 7th European Conference on Computer Vision (ECCV'02), Copenhagen, Denmark, May, 2002, in Computer Vision -- ECCV 2002, Lecture Notes in Computer Science, Vol. 2350, A. Heyden et al. (Eds.), Springer-Verlag, Berlin, 2002, 447โ€“460.
  21. โ†‘ M.A.O. Vasilescu, D. Terzopoulos (2004) "TensorTextures: Multilinear Image-Based Rendering", M. A. O. Vasilescu and D. Terzopoulos, Proc. ACM SIGGRAPH 2004 Conference Los Angeles, CA, August, 2004, in Computer Graphics Proceedings, Annual Conference Series, 2004, 336โ€“342.
  22. โ†‘ Template:Cite journal
  23. โ†‘ Template:Cite journal
  24. โ†‘ Template:Cite journal
  25. โ†‘ Template:Cite journal
  26. โ†‘ Template:Cite journal
  27. โ†‘ Template:Cite journal
  28. โ†‘ 28.0 28.1 Template:Cite journal
  29. โ†‘ 29.0 29.1 Template:Cite journal
  30. โ†‘ Template:Cite conference
  31. โ†‘ Template:Cite journal
  32. โ†‘ Template:Cite journal