Generalized singular value decomposition

From testwiki
Jump to navigation Jump to search

Template:Short description Template:Use dmy dates In linear algebra, the generalized singular value decomposition (GSVD) is the name of two different techniques based on the singular value decomposition (SVD). The two versions differ because one version decomposes two matrices (somewhat like the higher-order or tensor SVD) and the other version uses a set of constraints imposed on the left and right singular vectors of a single-matrix SVD.

First version: two-matrix decomposition

The generalized singular value decomposition (GSVD) is a matrix decomposition on a pair of matrices which generalizes the singular value decomposition. It was introduced by Van Loan [1] in 1976 and later developed by Paige and Saunders,[2] which is the version described here. In contrast to the SVD, the GSVD decomposes simultaneously a pair of matrices with the same number of columns. The SVD and the GSVD, as well as some other possible generalizations of the SVD,[3][4][5] are extensively used in the study of the conditioning and regularization of linear systems with respect to quadratic semi-norms. In the following, let 𝔽=ℝ, or 𝔽=β„‚.

Definition

The generalized singular value decomposition of matrices A1βˆˆπ”½m1Γ—n and A2βˆˆπ”½m2Γ—n isA1=U1Ξ£1[Wβˆ—D,0D]Qβˆ—,A2=U2Ξ£2[Wβˆ—D,0D]Qβˆ—,where

  • U1βˆˆπ”½m1Γ—m1 is unitary,
  • U2βˆˆπ”½m2Γ—m2 is unitary,
  • Qβˆˆπ”½nΓ—n is unitary,
  • Wβˆˆπ”½kΓ—kis unitary,
  • Dβˆˆβ„kΓ—k is real diagonal with positive diagonal, and contains the non-zero singular values of C=[A1A2] in decreasing order,
  • 0D=0βˆˆβ„kΓ—(nβˆ’k),
  • Ξ£1=⌈IA,S1,0AβŒ‹βˆˆβ„m1Γ—k is real non-negative block-diagonal, where S1=⌈αr+1,,Ξ±r+sβŒ‹ with 1>Ξ±r+1β‰₯β‹―β‰₯Ξ±r+s>0, IA=Ir, and 0A=0βˆˆβ„(m1βˆ’rβˆ’s)Γ—(kβˆ’rβˆ’s),
  • Ξ£2=⌈0B,S2,IBβŒ‹βˆˆβ„m2Γ—k is real non-negative block-diagonal, where S2=⌈βr+1,,Ξ²r+sβŒ‹ with 0<Ξ²r+1≀⋯≀βr+s<1, IB=Ikβˆ’rβˆ’s, and 0B=0βˆˆβ„(m2βˆ’k+r)Γ—r,
  • Ξ£1βˆ—Ξ£1=⌈α12,,Ξ±k2βŒ‹,
  • Ξ£2βˆ—Ξ£2=⌈β12,,Ξ²k2βŒ‹,
  • Ξ£1βˆ—Ξ£1+Ξ£2βˆ—Ξ£2=Ik,
  • k=rank(C).

We denote Ξ±1=β‹―=Ξ±r=1, Ξ±r+s+1=β‹―=Ξ±k=0, Ξ²1=β‹―=Ξ²r=0, and Ξ²r+s+1=β‹―=Ξ²k=1. While Ξ£1 is diagonal, Ξ£2 is not always diagonal, because of the leading rectangular zero matrix; instead Ξ£2 is "bottom-right-diagonal".

Variations

There are many variations of the GSVD. These variations are related to the fact that it is always possible to multiply Qβˆ— from the left by EEβˆ—=I where Eβˆˆπ”½nΓ—n is an arbitrary unitary matrix. We denote

  • X=([Wβˆ—D,0D]Qβˆ—)βˆ—
  • Xβˆ—=[0,R]Q^βˆ—, where Rβˆˆπ”½kΓ—k is upper-triangular and invertible, and Q^βˆˆπ”½nΓ—n is unitary. Such matrices exist by RQ-decomposition.
  • Y=Wβˆ—D. Then Y is invertible.

Here are some variations of the GSVD:

  • MATLAB (gsvd):A1=U1Ξ£1Xβˆ—,A2=U2Ξ£2Xβˆ—.
  • LAPACK (LA_GGSVD):A1=U1Ξ£1[0,R]Q^βˆ—,A2=U2Ξ£2[0,R]Q^βˆ—.
  • Simplified:A1=U1Ξ£1[Y,0D]Qβˆ—,A2=U2Ξ£2[Y,0D]Qβˆ—.

Generalized singular values

A generalized singular value of A1 and A2 is a pair (a,b)βˆˆβ„2 such that

limΞ΄β†’0det(b2A1βˆ—A1βˆ’a2A2βˆ—A2+Ξ΄In)/det(Ξ΄Inβˆ’k)=0,a2+b2=1,a,bβ‰₯0.We have

  • AiAjβˆ—=UiΞ£iYYβˆ—Ξ£jβˆ—Ujβˆ—
  • Aiβˆ—Aj=Q[Yβˆ—Ξ£iβˆ—Ξ£jY000]Qβˆ—=Q1Yβˆ—Ξ£iβˆ—Ξ£jYQ1βˆ—


By these properties we can show that the generalized singular values are exactly the pairs (Ξ±i,Ξ²i). We havedet(b2A1βˆ—A1βˆ’a2A2βˆ—A2+Ξ΄In)=det(b2A1βˆ—A1βˆ’a2A2βˆ—A2+Ξ΄QQβˆ—)=det(Q[Yβˆ—(b2Ξ£1βˆ—Ξ£1βˆ’a2Ξ£2βˆ—Ξ£2)Y+Ξ΄Ik00Ξ΄Inβˆ’k]Qβˆ—)=det(Ξ΄Inβˆ’k)det(Yβˆ—(b2Ξ£1βˆ—Ξ£1βˆ’a2Ξ£2βˆ—Ξ£2)Y+Ξ΄Ik).Therefore

limΞ΄β†’0det(b2A1βˆ—A1βˆ’a2A2βˆ—A2+Ξ΄In)/det(Ξ΄Inβˆ’k)=limΞ΄β†’0det(Yβˆ—(b2Ξ£1βˆ—Ξ£1βˆ’a2Ξ£2βˆ—Ξ£2)Y+Ξ΄Ik)=det(Yβˆ—(b2Ξ£1βˆ—Ξ£1βˆ’a2Ξ£2βˆ—Ξ£2)Y)=|det(Y)|2∏i=1k(b2Ξ±i2βˆ’a2Ξ²i2).

This expression is zero exactly when a=Ξ±i and b=Ξ²i for some i.

In,[2] the generalized singular values are claimed to be those which solve det(b2A1βˆ—A1βˆ’a2A2βˆ—A2)=0. However, this claim only holds when k=n, since otherwise the determinant is zero for every pair (a,b)βˆˆβ„2; this can be seen by substituting Ξ΄=0 above.

Generalized inverse

Define E+=Eβˆ’1 for any invertible matrix Eβˆˆπ”½nΓ—n , 0+=0βˆ— for any zero matrix 0βˆˆπ”½mΓ—n, and ⌈E1,E2βŒ‹+=⌈E1+,E2+βŒ‹ for any block-diagonal matrix. Then defineAi+=Q[Yβˆ’10]Ξ£i+Uiβˆ—It can be shown that Ai+ as defined here is a generalized inverse of Ai; in particular a {1,2,3}-inverse of Ai. Since it does not in general satisfy (Ai+Ai)βˆ—=Ai+Ai, this is not the Moore–Penrose inverse; otherwise we could derive (AB)+=B+A+ for any choice of matrices, which only holds for certain class of matrices.

Suppose Q=[Q1Q2], where Q1βˆˆπ”½nΓ—k and Q2βˆˆπ”½nΓ—(nβˆ’k). This generalized inverse has the following properties:

  • Ξ£1+=⌈IA,S1βˆ’1,0ATβŒ‹
  • Ξ£2+=⌈0BT,S2βˆ’1,IBβŒ‹
  • Ξ£1Ξ£1+=⌈I,I,0βŒ‹
  • Ξ£2Ξ£2+=⌈0,I,IβŒ‹
  • Ξ£1Ξ£2+=⌈0,S1S2βˆ’1,0βŒ‹
  • Ξ£1+Ξ£2=⌈0,S1βˆ’1S2,0βŒ‹
  • AiAj+=UiΞ£iΞ£j+Ujβˆ—
  • Ai+Aj=Q[Yβˆ’1Ξ£i+Ξ£jY000]Qβˆ—=Q1Yβˆ’1Ξ£i+Ξ£jYQ1βˆ—

Quotient SVD

A generalized singular ratio of A1 and A2 is Οƒi=Ξ±iΞ²i+. By the above properties, A1A2+=U1Ξ£1Ξ£2+U2βˆ—. Note that Ξ£1Ξ£2+=⌈0,S1S2βˆ’1,0βŒ‹ is diagonal, and that, ignoring the leading zeros, contains the singular ratios in decreasing order. If A2 is invertible, then Ξ£1Ξ£2+ has no leading zeros, and the generalized singular ratios are the singular values, and U1 and U2 are the matrices of singular vectors, of the matrix A1A2+=A1A2βˆ’1. In fact, computing the SVD of A1A2βˆ’1 is one of the motivations for the GSVD, as "forming ABβˆ’1 and finding its SVD can lead to unnecessary and large numerical errors when B is ill-conditioned for solution of equations".[2] Hence the sometimes used name "quotient SVD", although this is not the only reason for using GSVD. If A2 is not invertible, then U1Ξ£1Ξ£2+U2βˆ—is still the SVD of A1A2+ if we relax the requirement of having the singular values in decreasing order. Alternatively, a decreasing order SVD can be found by moving the leading zeros to the back: U1Ξ£1Ξ£2+U2βˆ—=(U1P1)P1βˆ—Ξ£1Ξ£2+P2(P2βˆ—U2βˆ—), where P1 and P2 are appropriate permutation matrices. Since rank equals the number of non-zero singular values, rank(A1A2+)=s.

Construction

Let

  • C=P⌈D,0βŒ‹Qβˆ— be the SVD of C=[A1A2], where Pβˆˆπ”½(m1+m2)Γ—(m1Γ—m2) is unitary, and Q and D are as described,
  • P=[P1,P2], where P1βˆˆπ”½(m1+m2)Γ—k and P2βˆˆπ”½(m1+m2)Γ—(nβˆ’k),
  • P1=[P11P21], where P11βˆˆπ”½m1Γ—k and P21βˆˆπ”½m2Γ—k,
  • P11=U1Ξ£1Wβˆ— by the SVD of P11, where U1, Ξ£1 and W are as described,
  • P21W=U2Ξ£2 by a decomposition similar to a QR-decomposition, where U2 and Ξ£2 are as described.

ThenC=P⌈D,0βŒ‹Qβˆ—=[P1D,0]Qβˆ—=[U1Ξ£1Wβˆ—D0U2Ξ£2Wβˆ—D0]Qβˆ—=[U1Ξ£1[Wβˆ—D,0]Qβˆ—U2Ξ£2[Wβˆ—D,0]Qβˆ—].We also have[U1βˆ—00U2βˆ—]P1W=[Ξ£1Ξ£2].ThereforeΞ£1βˆ—Ξ£1+Ξ£2βˆ—Ξ£2=[Ξ£1Ξ£2]βˆ—[Ξ£1Ξ£2]=Wβˆ—P1βˆ—[U100U2][U1βˆ—00U2βˆ—]P1W=I.Since P1 has orthonormal columns, ||P1||2≀1. Therefore||Ξ£1||2=||U1βˆ—P1W||2=||P1||2≀1.We also have for each xβˆˆβ„k such that ||x||2=1 that||P21x||22≀||P11x||22+||P21x||22=||P1x||22≀1.Therefore ||P21||2≀1, and||Ξ£2||2=||U2βˆ—P21W||2=||P21||2≀1.

Applications

The tensor GSVD is one of the comparative spectral decompositions, multi-tensor generalizations of the SVD, invented to simultaneously identify the similar and dissimilar among, and create a single coherent model from any data types, of any number and dimensions.

The GSVD, formulated as a comparative spectral decomposition,[6] has been successfully applied to signal processing and data science, e.g., in genomic signal processing.[7][8][9]

These applications inspired several additional comparative spectral decompositions, i.e., the higher-order GSVD (HO GSVD)[10] and the tensor GSVD.[11] [12]

It has equally found applications to estimate the spectral decompositions of linear operators when the eigenfunctions are parameterized with a linear model, i.e. a reproducing kernel Hilbert space.[13]

Second version: weighted single-matrix decomposition

The weighted version of the generalized singular value decomposition (GSVD) is a constrained matrix decomposition with constraints imposed on the left and right singular vectors of the singular value decomposition.[14][15][16] This form of the GSVD is an extension of the SVD as such. Given the SVD of an mΓ—n real or complex matrix M

M=UΞ£Vβˆ—

where

Uβˆ—WuU=Vβˆ—WvV=I.

Where I is the identity matrix and where U and V are orthonormal given their constraints (Wu and Wv). Additionally, Wu and Wv are positive definite matrices (often diagonal matrices of weights). This form of the GSVD is the core of certain techniques, such as generalized principal component analysis and Correspondence analysis.

The weighted form of the GSVD is called as such because, with the correct selection of weights, it generalizes many techniques (such as multidimensional scaling and linear discriminant analysis).[17]

References

Template:Reflist

Further reading

Template:Refbegin

Template:Refend

  1. ↑ Cite error: Invalid <ref> tag; no text was provided for refs named VanLoan
  2. ↑ 2.0 2.1 2.2 Cite error: Invalid <ref> tag; no text was provided for refs named Paige
  3. ↑ Template:Cite book
  4. ↑ Template:Cite web
  5. ↑ Template:Cite journal
  6. ↑ Template:Cite journal
  7. ↑ Template:Cite journal
  8. ↑ Template:Cite journal
  9. ↑ Template:Cite journal
  10. ↑ Cite error: Invalid <ref> tag; no text was provided for refs named Ponnapalli2011
  11. ↑ Cite error: Invalid <ref> tag; no text was provided for refs named Sankaranarayanan2015
  12. ↑ Cite error: Invalid <ref> tag; no text was provided for refs named Bradley2019
  13. ↑ Template:Cite arXiv
  14. ↑ Template:Cite book
  15. ↑ Template:Cite book
  16. ↑ Template:Cite journal
  17. ↑ Template:Cite book